Sample records for multi-variant analysis maps

  1. Airborne gamma-ray spectrometer and magnetometer survey, Durango B, Colorado. Final report Volume II C. Detail area

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    1983-01-01

    This volume contains eight appendices: flight line maps, geology maps, explanation of geologic legend, flight line/geology maps, radiometric contour maps, magnetic contour maps, multi-variant analysis maps, and geochemical factor analysis maps. These appendices pertain to the Durango B detail area.

  2. Airborne gamma-ray spectrometer and magnetometer survey, Durango D, Colorado. Final report Volume II B. Detail area

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    1983-01-01

    This volume comprises eight appendices containing the following information for the Durango D detail area: flight line maps, geology maps, explanation of geologic legend, flight line/geology maps, radiometric contour maps, magnetic contour maps, multi-variant analysis maps, and geochemical factor analysis maps.

  3. Airborne gamma-ray spectrometer and magnetometer survey, Durango C, Colorado. Final report Volume II B. Detail area

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    1983-01-01

    This volume comprises eight appendices containing the following information for the Durango C detail area: flight line maps, geology maps, explanation of geologic legend, flight line/geology maps, radiometric contour maps, magnetic contour maps, multi-variant analysis maps, and geochemical factor analysis maps.

  4. Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce.

    PubMed

    Decap, Dries; Reumers, Joke; Herzeel, Charlotte; Costanza, Pascal; Fostier, Jan

    2017-01-01

    Given the current cost-effectiveness of next-generation sequencing, the amount of DNA-seq and RNA-seq data generated is ever increasing. One of the primary objectives of NGS experiments is calling genetic variants. While highly accurate, most variant calling pipelines are not optimized to run efficiently on large data sets. However, as variant calling in genomic data has become common practice, several methods have been proposed to reduce runtime for DNA-seq analysis through the use of parallel computing. Determining the effectively expressed variants from transcriptomics (RNA-seq) data has only recently become possible, and as such does not yet benefit from efficiently parallelized workflows. We introduce Halvade-RNA, a parallel, multi-node RNA-seq variant calling pipeline based on the GATK Best Practices recommendations. Halvade-RNA makes use of the MapReduce programming model to create and manage parallel data streams on which multiple instances of existing tools such as STAR and GATK operate concurrently. Whereas the single-threaded processing of a typical RNA-seq sample requires ∼28h, Halvade-RNA reduces this runtime to ∼2h using a small cluster with two 20-core machines. Even on a single, multi-core workstation, Halvade-RNA can significantly reduce runtime compared to using multi-threading, thus providing for a more cost-effective processing of RNA-seq data. Halvade-RNA is written in Java and uses the Hadoop MapReduce 2.0 API. It supports a wide range of distributions of Hadoop, including Cloudera and Amazon EMR.

  5. Multi-locus mixed model analysis of stem rust resistance in a worldwide collection of winter wheat

    USDA-ARS?s Scientific Manuscript database

    Genome-wide association mapping is a powerful tool for dissecting the relationship between phenotypes and genetic variants in diverse populations. With improved cost efficiency of high-throughput genotyping platforms, association mapping is a desirable method to mine populations for favorable allele...

  6. Airborne gamma-ray spectrometer and magnetometer survey, Durango A, B, C, and D, Colorado. Volume I. Detail area. Final report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    1983-01-01

    An airborne combined radiometric and magnetic survey was performed for the Department of Energy (DOE) over the Durango A, Durango B, Durango C, and Durango D Detail Areas of southwestern Colorado. The Durango A Detail Area is within the coverage of the Needle Mountains and Silverton 15' map sheets, and the Pole Creek Mountain, Rio Grande Pyramid, Emerald Lake, Granite Peak, Vallecito Reservoir, and Lemon Reservoir 7.5' map sheets of the National Topographic Map Series (NTMS). The Durango B Detail Area is within the coverage of the Silverton 15' map sheet and the Wetterhorn Peak, Uncompahgre Peak, Lake City, Redcloudmore » Peak, Lake San Cristobal, Pole Creek Mountain, and Finger Mesa 7.5' map sheets of the NTMS. The Durango C Detail Area is within the coverage of the Platoro and Wolf Creek Pass 15' map sheets of the NTMS. The Durango D Detail Area is within the coverage of the Granite Lake, Cimarrona Peak, Bear Mountain, and Oakbrush Ridge 7.5' map sheets of the NTMS. Radiometric data were corrected for live time, aircraft and equipment background, cosmic background, atmospheric radon, Compton scatter, and altitude dependence. The corrected data were statistically evaluated, gridded, and contoured to produce maps of the radiometric variables, uranium, potassium, and thorium; their ratios; and the residual magnetic field. These maps have been analyzed in order to produce a multi-variant analysis contour map based on the radiometric response of the individual geological units. A geochemical analysis has been performed, using the radiometric and magnetic contour maps, the multi-variant analysis map, and factor analysis techniques, to produce a geochemical analysis map for the area.« less

  7. Efficient search, mapping, and optimization of multi-protein genetic systems in diverse bacteria

    PubMed Central

    Farasat, Iman; Kushwaha, Manish; Collens, Jason; Easterbrook, Michael; Guido, Matthew; Salis, Howard M

    2014-01-01

    Developing predictive models of multi-protein genetic systems to understand and optimize their behavior remains a combinatorial challenge, particularly when measurement throughput is limited. We developed a computational approach to build predictive models and identify optimal sequences and expression levels, while circumventing combinatorial explosion. Maximally informative genetic system variants were first designed by the RBS Library Calculator, an algorithm to design sequences for efficiently searching a multi-protein expression space across a > 10,000-fold range with tailored search parameters and well-predicted translation rates. We validated the algorithm's predictions by characterizing 646 genetic system variants, encoded in plasmids and genomes, expressed in six gram-positive and gram-negative bacterial hosts. We then combined the search algorithm with system-level kinetic modeling, requiring the construction and characterization of 73 variants to build a sequence-expression-activity map (SEAMAP) for a biosynthesis pathway. Using model predictions, we designed and characterized 47 additional pathway variants to navigate its activity space, find optimal expression regions with desired activity response curves, and relieve rate-limiting steps in metabolism. Creating sequence-expression-activity maps accelerates the optimization of many protein systems and allows previous measurements to quantitatively inform future designs. PMID:24952589

  8. Colour segmentation of multi variants tuberculosis sputum images using self organizing map

    NASA Astrophysics Data System (ADS)

    Rulaningtyas, Riries; Suksmono, Andriyan B.; Mengko, Tati L. R.; Saptawati, Putri

    2017-05-01

    Lung tuberculosis detection is still identified from Ziehl-Neelsen sputum smear images in low and middle countries. The clinicians decide the grade of this disease by counting manually the amount of tuberculosis bacilli. It is very tedious for clinicians with a lot number of patient and without standardization for sputum staining. The tuberculosis sputum images have multi variant characterizations in colour, because of no standardization in staining. The sputum has more variants colour and they are difficult to be identified. For helping the clinicians, this research examined the Self Organizing Map method for colouring image segmentation in sputum images based on colour clustering. This method has better performance than k-means clustering which also tried in this research. The Self Organizing Map could segment the sputum images with y good result and cluster the colours adaptively.

  9. Multi-ethnic fine-mapping of 14 central adiposity loci.

    PubMed

    Liu, Ching-Ti; Buchkovich, Martin L; Winkler, Thomas W; Heid, Iris M; Borecki, Ingrid B; Fox, Caroline S; Mohlke, Karen L; North, Kari E; Adrienne Cupples, L

    2014-09-01

    The Genetic Investigation of Anthropometric Traits (GIANT) consortium identified 14 loci in European Ancestry (EA) individuals associated with waist-to-hip ratio (WHR) adjusted for body mass index. These loci are wide and narrowing the signals remains necessary. Twelve of 14 loci identified in GIANT EA samples retained strong associations with WHR in our joint EA/individuals of African Ancestry (AA) analysis (log-Bayes factor >6.1). Trans-ethnic analyses at five loci (TBX15-WARS2, LYPLAL1, ADAMTS9, LY86 and ITPR2-SSPN) substantially narrowed the signals to smaller sets of variants, some of which are in regions that have evidence of regulatory activity. By leveraging varying linkage disequilibrium structures across different populations, single-nucleotide polymorphisms (SNPs) with strong signals and narrower credible sets from trans-ethnic meta-analysis of central obesity provide more precise localizations of potential functional variants and suggest a possible regulatory role. Meta-analysis results for WHR were obtained from 77 167 EA participants from GIANT and 23 564 AA participants from the African Ancestry Anthropometry Genetics Consortium. For fine mapping we interrogated SNPs within ± 250 kb flanking regions of 14 previously reported index SNPs from loci discovered in EA populations by performing trans-ethnic meta-analysis of results from the EA and AA meta-analyses. We applied a Bayesian approach that leverages allelic heterogeneity across populations to combine meta-analysis results and aids in fine-mapping shared variants at these locations. We annotated variants using information from the ENCODE Consortium and Roadmap Epigenomics Project to prioritize variants for possible functionality. Published by Oxford University Press 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  10. Multi-template analysis of human perirhinal cortex in brain MRI: Explicitly accounting for anatomical variability

    PubMed Central

    Xie, Long; Pluta, John B.; Das, Sandhitsu R.; Wisse, Laura E.M.; Wang, Hongzhi; Mancuso, Lauren; Kliot, Dasha; Avants, Brian B.; Ding, Song-Lin; Manjón, José V.; Wolk, David A.; Yushkevich, Paul A.

    2016-01-01

    Rational The human perirhinal cortex (PRC) plays critical roles in episodic and semantic memory and visual perception. The PRC consists of Brodmann areas 35 and 36 (BA35, BA36). In Alzheimer's disease (AD), BA35 is the first cortical site affected by neurofibrillary tangle pathology, which is closely linked to neural injury in AD. Large anatomical variability, manifested in the form of different cortical folding and branching patterns, makes it difficult to segment the PRC in MRI scans. Pathology studies have found that in ~97% of specimens, the PRC falls into one of three discrete anatomical variants. However, current methods for PRC segmentation and morphometry in MRI are based on single-template approaches, which may not be able to accurately model these discrete variants Methods A multi-template analysis pipeline that explicitly accounts for anatomical variability is used to automatically label the PRC and measure its thickness in T2-weighted MRI scans. The pipeline uses multi-atlas segmentation to automatically label medial temporal lobe cortices including entorhinal cortex, PRC and the parahippocampal cortex. Pairwise registration between label maps and clustering based on residual dissimilarity after registration are used to construct separate templates for the anatomical variants of the PRC. An optimal path of deformations linking these templates is used to establish correspondences between all the subjects. Experimental evaluation focuses on the ability of single-template and multi-template analyses to detect differences in the thickness of medial temporal lobe cortices between patients with amnestic mild cognitive impairment (aMCI, n=41) and age-matched controls (n=44). Results The proposed technique is able to generate templates that recover the three dominant discrete variants of PRC and establish more meaningful correspondences between subjects than a single-template approach. The largest reduction in thickness associated with aMCI, in absolute terms, was found in left BA35 using both regional and summary thickness measures. Further, statistical maps of regional thickness difference between aMCI and controls revealed different patterns for the three anatomical variants. PMID:27702610

  11. Comparison of Tasseled Cap-based Landsat data structures for use in forest disturbance detection.

    Treesearch

    Sean P. Healey; Warren B. Cohen; Yang Zhiqiang; Olga N. Krankina

    2005-01-01

    Landsat satellite data has become ubiquitous in regional-scale forest disturbance detection. The Tasseled Cap (TC) transformation for Landsat data has been used in several disturbance-mapping projects because of its ability to highlight relevant vegetation changes. We used an automated composite analysis procedure to test four multi-date variants of the TC...

  12. Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.

    PubMed

    Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao

    2016-11-30

    Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed steps of analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses were focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. Single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The following gene-gene interaction analysis, which was focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of genetic risk score evidenced that 4 multi-variant associations had significant additive effects of risk SNPs; and haplotype association test presented that all multi-variant associations contained one or several combinations of particular alleles or haplotypes, associated with increased obesity risk. Our study evidenced that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. [Fine mapping of complex disease susceptibility loci].

    PubMed

    Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao

    2014-01-01

    Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc.This review summarizes the strategies and problems for fine mapping.

  14. Identification of Nitrogen Consumption Genetic Variants in Yeast Through QTL Mapping and Bulk Segregant RNA-Seq Analyses.

    PubMed

    Cubillos, Francisco A; Brice, Claire; Molinet, Jennifer; Tisné, Sebastién; Abarca, Valentina; Tapia, Sebastián M; Oporto, Christian; García, Verónica; Liti, Gianni; Martínez, Claudio

    2017-06-07

    Saccharomyces cerevisiae is responsible for wine must fermentation. In this process, nitrogen represents a limiting nutrient and its scarcity results in important economic losses for the wine industry. Yeast isolates use different strategies to grow in poor nitrogen environments and their genomic plasticity enables adaptation to multiple habitats through improvements in nitrogen consumption. Here, we used a highly recombinant S. cerevisiae multi-parent population (SGRP-4X) derived from the intercross of four parental strains of different origins to identify new genetic variants responsible for nitrogen consumption differences during wine fermentation. Analysis of 165 fully sequenced F12 segregants allowed us to map 26 QTL in narrow intervals for 14 amino acid sources and ammonium, the majority of which represent genomic regions previously unmapped for these traits. To complement this strategy, we performed Bulk segregant RNA-seq (BSR-seq) analysis in segregants exhibiting extremely high and low ammonium consumption levels. This identified several QTL overlapping differentially expressed genes and refined the gene candidate search. Based on these approaches, we were able to validate ARO1 , PDC1 , CPS1 , ASI2 , LYP1 , and ALP1 allelic variants underlying nitrogen consumption differences between strains, providing evidence of many genes with small phenotypic effects. Altogether, these variants significantly shape yeast nitrogen consumption with important implications for evolution, ecological, and quantitative genomics. Copyright © 2017 Cubillos et al.

  15. Identification of Nitrogen Consumption Genetic Variants in Yeast Through QTL Mapping and Bulk Segregant RNA-Seq Analyses

    PubMed Central

    Cubillos, Francisco A.; Brice, Claire; Molinet, Jennifer; Tisné, Sebastién; Abarca, Valentina; Tapia, Sebastián M.; Oporto, Christian; García, Verónica; Liti, Gianni; Martínez, Claudio

    2017-01-01

    Saccharomyces cerevisiae is responsible for wine must fermentation. In this process, nitrogen represents a limiting nutrient and its scarcity results in important economic losses for the wine industry. Yeast isolates use different strategies to grow in poor nitrogen environments and their genomic plasticity enables adaptation to multiple habitats through improvements in nitrogen consumption. Here, we used a highly recombinant S. cerevisiae multi-parent population (SGRP-4X) derived from the intercross of four parental strains of different origins to identify new genetic variants responsible for nitrogen consumption differences during wine fermentation. Analysis of 165 fully sequenced F12 segregants allowed us to map 26 QTL in narrow intervals for 14 amino acid sources and ammonium, the majority of which represent genomic regions previously unmapped for these traits. To complement this strategy, we performed Bulk segregant RNA-seq (BSR-seq) analysis in segregants exhibiting extremely high and low ammonium consumption levels. This identified several QTL overlapping differentially expressed genes and refined the gene candidate search. Based on these approaches, we were able to validate ARO1, PDC1, CPS1, ASI2, LYP1, and ALP1 allelic variants underlying nitrogen consumption differences between strains, providing evidence of many genes with small phenotypic effects. Altogether, these variants significantly shape yeast nitrogen consumption with important implications for evolution, ecological, and quantitative genomics. PMID:28592651

  16. Stereotypic and complex phrase types provide structural evidence for a multi-message display in humpback whales (Megaptera novaeangliae).

    PubMed

    Murray, Anita; Dunlop, Rebecca A; Noad, Michael J; Goldizen, Anne W

    2018-02-01

    Male humpback whales produce a mating display called "song." Behavioral studies indicate song has inter- and/or intra-sexual functionality, suggesting song may be a multi-message display. Multi-message displays often include stereotypic components that convey group membership for mate attraction and/or male-male interactions, and complex components that convey individual quality for courtship. Humpback whale song contains sounds ("units") arranged into sequences ("phrases"). Repetitions of a specific phrase create a "theme." Within a theme, imperfect phrase repetitions ("phrase variants") create variability among phrases of the same type ("phrase type"). The hypothesis that song contains stereotypic and complex phrase types, structural characteristics consistent with a multi-message display, is investigated using recordings of 17 east Australian males (8:2004, 9:2011). Phrase types are categorized as stereotypic or complex using number of unit types, number of phrase variants, and the proportion of phrases that is unique to an individual versus shared amongst males. Unit types are determined using self-organizing maps. Phrase variants are determined by Levenshtein distances between phrases. Stereotypic phrase types have smaller numbers of unit types and shared phrase variants. Complex phrase types have larger numbers of unit types and unique phrase variants. This study supports the hypothesis that song could be a multi-message display.

  17. Improved methods for multi-trait fine mapping of pleiotropic risk loci.

    PubMed

    Kichaev, Gleb; Roytman, Megan; Johnson, Ruth; Eskin, Eleazar; Lindström, Sara; Kraft, Peter; Pasaniuc, Bogdan

    2017-01-15

    Genome-wide association studies (GWAS) have identified thousands of regions in the genome that contain genetic variants that increase risk for complex traits and diseases. However, the variants uncovered in GWAS are typically not biologically causal, but rather, correlated to the true causal variant through linkage disequilibrium (LD). To discern the true causal variant(s), a variety of statistical fine-mapping methods have been proposed to prioritize variants for functional validation. In this work we introduce a new approach, fastPAINTOR, that leverages evidence across correlated traits, as well as functional annotation data, to improve fine-mapping accuracy at pleiotropic risk loci. To improve computational efficiency, we describe an new importance sampling scheme to perform model inference. First, we demonstrate in simulations that by leveraging functional annotation data, fastPAINTOR increases fine-mapping resolution relative to existing methods. Next, we show that jointly modeling pleiotropic risk regions improves fine-mapping resolution compared to standard single trait and pleiotropic fine mapping strategies. We report a reduction in the number of SNPs required for follow-up in order to capture 90% of the causal variants from 23 SNPs per locus using a single trait to 12 SNPs when fine-mapping two traits simultaneously. Finally, we analyze summary association data from a large-scale GWAS of lipids and show that these improvements are largely sustained in real data. The fastPAINTOR framework is implemented in the PAINTOR v3.0 package which is publicly available to the research community http://bogdan.bioinformatics.ucla.edu/software/paintor CONTACT: gkichaev@ucla.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. First High-Density Linkage Map and Single Nucleotide Polymorphisms Significantly Associated With Traits of Economic Importance in Yellowtail Kingfish Seriola lalandi.

    PubMed

    Nguyen, Nguyen H; Rastas, Pasi M A; Premachandra, H K A; Knibb, Wayne

    2018-01-01

    The genetic resources available for the commercially important fish species Yellowtail kingfish (YTK) ( Seriola lalandi) are relative sparse. To overcome this, we aimed (1) to develop a linkage map for this species, and (2) to identify markers/variants associated with economically important traits in kingfish (with an emphasis on body weight). Genetic and genomic analyses were conducted using 13,898 single nucleotide polymorphisms (SNPs) generated from a new high-throughput genotyping by sequencing platform, Diversity Arrays Technology (DArTseq TM ) in a pedigreed population comprising 752 animals. The linkage analysis enabled to map about 4,000 markers to 24 linkage groups (LGs), with an average density of 3.4 SNPs per cM. The linkage map was integrated into a genome-wide association study (GWAS) and identified six variants/SNPs associated with body weight ( P < 5e -8 ) when a multi-locus mixed model was used. Two out of the six significant markers were mapped to LGs 17 and 23, and collectively they explained 5.8% of the total genetic variance. It is concluded that the newly developed linkage map and the significantly associated markers with body weight provide fundamental information to characterize genetic architecture of growth-related traits in this population of YTK S. lalandi .

  19. Predicting primary progressive aphasias with support vector machine approaches in structural MRI data.

    PubMed

    Bisenius, Sandrine; Mueller, Karsten; Diehl-Schmid, Janine; Fassbender, Klaus; Grimmer, Timo; Jessen, Frank; Kassubek, Jan; Kornhuber, Johannes; Landwehrmeyer, Bernhard; Ludolph, Albert; Schneider, Anja; Anderl-Straub, Sarah; Stuke, Katharina; Danek, Adrian; Otto, Markus; Schroeter, Matthias L

    2017-01-01

    Primary progressive aphasia (PPA) encompasses the three subtypes nonfluent/agrammatic variant PPA, semantic variant PPA, and the logopenic variant PPA, which are characterized by distinct patterns of language difficulties and regional brain atrophy. To validate the potential of structural magnetic resonance imaging data for early individual diagnosis, we used support vector machine classification on grey matter density maps obtained by voxel-based morphometry analysis to discriminate PPA subtypes (44 patients: 16 nonfluent/agrammatic variant PPA, 17 semantic variant PPA, 11 logopenic variant PPA) from 20 healthy controls (matched for sample size, age, and gender) in the cohort of the multi-center study of the German consortium for frontotemporal lobar degeneration. Here, we compared a whole-brain with a meta-analysis-based disease-specific regions-of-interest approach for support vector machine classification. We also used support vector machine classification to discriminate the three PPA subtypes from each other. Whole brain support vector machine classification enabled a very high accuracy between 91 and 97% for identifying specific PPA subtypes vs. healthy controls, and 78/95% for the discrimination between semantic variant vs. nonfluent/agrammatic or logopenic PPA variants. Only for the discrimination between nonfluent/agrammatic and logopenic PPA variants accuracy was low with 55%. Interestingly, the regions that contributed the most to the support vector machine classification of patients corresponded largely to the regions that were atrophic in these patients as revealed by group comparisons. Although the whole brain approach took also into account regions that were not covered in the regions-of-interest approach, both approaches showed similar accuracies due to the disease-specificity of the selected networks. Conclusion, support vector machine classification of multi-center structural magnetic resonance imaging data enables prediction of PPA subtypes with a very high accuracy paving the road for its application in clinical settings.

  20. Efficient Integrative Multi-SNP Association Analysis via Deterministic Approximation of Posteriors.

    PubMed

    Wen, Xiaoquan; Lee, Yeji; Luca, Francesca; Pique-Regi, Roger

    2016-06-02

    With the increasing availability of functional genomic data, incorporating genomic annotations into genetic association analysis has become a standard procedure. However, the existing methods often lack rigor and/or computational efficiency and consequently do not maximize the utility of functional annotations. In this paper, we propose a rigorous inference procedure to perform integrative association analysis incorporating genomic annotations for both traditional GWASs and emerging molecular QTL mapping studies. In particular, we propose an algorithm, named deterministic approximation of posteriors (DAP), which enables highly efficient and accurate joint enrichment analysis and identification of multiple causal variants. We use a series of simulation studies to highlight the power and computational efficiency of our proposed approach and further demonstrate it by analyzing the cross-population eQTL data from the GEUVADIS project and the multi-tissue eQTL data from the GTEx project. In particular, we find that genetic variants predicted to disrupt transcription factor binding sites are enriched in cis-eQTLs across all tissues. Moreover, the enrichment estimates obtained across the tissues are correlated with the cell types for which the annotations are derived. Copyright © 2016 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  1. Fine mapping in TERT-CLPTM1L region identified three independent lung cancer susceptibility signals: a large-scale multi-ethnic population study.

    PubMed

    Li, Zhihua; Fan, Jingyi; Li, Ni; Zhu, Meng; Zhang, Jiahui; Wang, Yuzhuo; Geng, Liguo; Cheng, Yang; Ma, Hongxia; Jin, Guangfu; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing

    2018-05-29

    Genome-wide association studies (GWAS) and fine mapping studies have identified multiple lung cancer susceptibility variants in TERT-CLPTM1L region. However, it is still unclear about the relationship between these risk variants and the independent lung cancer risk signals in this region. Therefore, we evaluated the independent susceptibility signals for lung cancer and explored the potential functional variants in this region. Sequential conditional analysis was used to detect the independent susceptibility loci based on four lung cancer GWAS datasets with 12,843 lung cases and 12,639 controls. Comprehensively functional annotations were performed for each independent signal. Three independent susceptibility signals were identified in multi-ethnic population. For the first signal, rs2736100 showed the most significant association with lung cancer risk (C > A, OR = 0.82, 95%CI: 0.79-0.85, P = 1.98 × 10 -25 ). Rs36019446 was the top-ranked site (A > G, OR = 0.88, 95%CI: 0.84-0.92, P = 1.74 × 10 -9 ) in the second signal. For the third signal, rs326048 was the leading SNP (A > G, OR = 0.91, 95%CI: 0.87-0.95, P = 1.38 × 10 -5 ). The following subgroup analysis found the same three loci among Asian population. Further, we compared the difference between various subgroup populations. Functional annotations revealed that rs2736100, rs27996 (r 2  = 0.85 with rs36019446) and rs326049 (r 2  = 0.73 with rs326048) could be potential functional variants in these three risk signals, respectively. In conclusion, although multiple variants have been found associated with lung cancer risk in TERT-CLPTM1L region, our findings indicated that there are three independent lung cancer susceptibility signals in this region. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  2. De novo and inherited private variants in MAP1B in periventricular nodular heterotopia.

    PubMed

    Heinzen, Erin L; O'Neill, Adam C; Zhu, Xiaolin; Allen, Andrew S; Bahlo, Melanie; Chelly, Jamel; Chen, Ming Hui; Dobyns, William B; Freytag, Saskia; Guerrini, Renzo; Leventer, Richard J; Poduri, Annapurna; Robertson, Stephen P; Walsh, Christopher A; Zhang, Mengqi

    2018-05-01

    Periventricular nodular heterotopia (PVNH) is a malformation of cortical development commonly associated with epilepsy. We exome sequenced 202 individuals with sporadic PVNH to identify novel genetic risk loci. We first performed a trio-based analysis and identified 219 de novo variants. Although no novel genes were implicated in this initial analysis, PVNH cases were found overall to have a significant excess of nonsynonymous de novo variants in intolerant genes (p = 3.27x10-7), suggesting a role for rare new alleles in genes yet to be associated with the condition. Using a gene-level collapsing analysis comparing cases and controls, we identified a genome-wide significant signal driven by four ultra-rare loss-of-function heterozygous variants in MAP1B, including one de novo variant. In at least one instance, the MAP1B variant was inherited from a parent with previously undiagnosed PVNH. The PVNH was frontally predominant and associated with perisylvian polymicrogyria. These results implicate MAP1B in PVNH. More broadly, our findings suggest that detrimental mutations likely arising in immediately preceding generations with incomplete penetrance may also be responsible for some apparently sporadic diseases.

  3. G2S: a web-service for annotating genomic variants on 3D protein structures.

    PubMed

    Wang, Juexin; Sheridan, Robert; Sumer, S Onur; Schultz, Nikolaus; Xu, Dong; Gao, Jianjiong

    2018-06-01

    Accurately mapping and annotating genomic locations on 3D protein structures is a key step in structure-based analysis of genomic variants detected by recent large-scale sequencing efforts. There are several mapping resources currently available, but none of them provides a web API (Application Programming Interface) that supports programmatic access. We present G2S, a real-time web API that provides automated mapping of genomic variants on 3D protein structures. G2S can align genomic locations of variants, protein locations, or protein sequences to protein structures and retrieve the mapped residues from structures. G2S API uses REST-inspired design and it can be used by various clients such as web browsers, command terminals, programming languages and other bioinformatics tools for bringing 3D structures into genomic variant analysis. The webserver and source codes are freely available at https://g2s.genomenexus.org. g2s@genomenexus.org. Supplementary data are available at Bioinformatics online.

  4. Discovery and characterization of antibody variants using mass spectrometry-based comparative analysis for biosimilar candidates of monoclonal antibody drugs.

    PubMed

    Li, Wenhua; Yang, Bin; Zhou, Dongmei; Xu, Jun; Ke, Zhi; Suen, Wen-Chen

    2016-07-01

    Liquid chromatography mass spectrometry (LC-MS) is the most commonly used technique for the characterization of antibody variants. MAb-X and mAb-Y are two approved IgG1 subtype monoclonal antibody drugs recombinantly produced in Chinese hamster ovary (CHO) cells. We report here that two unexpected and rare antibody variants have been discovered during cell culture process development of biosimilars for these two approved drugs through intact mass analysis. We then used comprehensive mass spectrometry-based comparative analysis including reduced light, heavy chains, and domain-specific mass as well as peptide mapping analysis to fully characterize the observed antibody variants. The "middle-up" mass comparative analysis demonstrated that the antibody variant from mAb-X biosimilar candidate was caused by mass variation of antibody crystalline fragment (Fc), whereas a different variant with mass variation in antibody antigen-binding fragment (Fab) from mAb-Y biosimilar candidate was identified. Endoproteinase Lys-C digested peptide mapping and tandem mass spectrometry analysis further revealed that a leucine to glutamine change in N-terminal 402 site of heavy chain was responsible for the generation of mAb-X antibody variant. Lys-C and trypsin coupled non-reduced and reduced peptide mapping comparative analysis showed that the formation of the light-heavy interchain trisulfide bond resulted in the mAb-Y antibody variant. These two cases confirmed that mass spectrometry-based comparative analysis plays a critical role for the characterization of monoclonal antibody variants, and biosimilar developers should start with a comprehensive structural assessment and comparative analysis to decrease the risk of the process development for biosimilars. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. A novel multi-variant epitope ensemble vaccine against avian leukosis virus subgroup J.

    PubMed

    Wang, Xiaoyu; Zhou, Defang; Wang, Guihua; Huang, Libo; Zheng, Qiankun; Li, Chengui; Cheng, Ziqiang

    2017-12-04

    The hypervariable antigenicity and immunosuppressive features of avian leukosis virus subgroup J (ALV-J) has led to great challenges to develop effective vaccines. Epitope vaccine will be a perspective trend. Previously, we identified a variant antigenic neutralizing epitope in hypervariable region 1 (hr1) of ALV-J, N-LRDFIA/E/TKWKS/GDDL/HLIRPYVNQS-C. BLAST analysis showed that the mutation of A, E, T and H in this epitope cover 79% of all ALV-J strains. Base on this data, we designed a multi-variant epitope ensemble vaccine comprising the four mutation variants linked with glycine and serine. The recombinant multi-variant epitope gene was expressed in Escherichia coli BL21. The expressed protein of the variant multi-variant epitope gene can react with positive sera and monoclonal antibodies of ALV-J, while cannot react with ALV-J negative sera. The multi-variant epitope vaccine that conjugated Freund's adjuvant complete/incomplete showed high immunogenicity that reached the titer of 1:64,000 at 42 days post immunization and maintained the immune period for at least 126 days in SPF chickens. Further, we demonstrated that the antibody induced by the variant multi-variant ensemble epitope vaccine recognized and neutralized different ALV-J strains (NX0101, TA1, WS1, BZ1224 and BZ4). Protection experiment that was evaluated by clinical symptom, viral shedding, weight gain, gross and histopathology showed 100% chickens that inoculated the multi-epitope vaccine were well protected against ALV-J challenge. The result shows a promising multi-variant epitope ensemble vaccine against hypervariable viruses in animals. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. TESTING TREE-CLASSIFIER VARIANTS AND ALTERNATE MODELING METHODOLOGIES IN THE EAST GREAT BASIN MAPPING UNIT OF THE SOUTHWEST REGIONAL GAP ANALYSIS PROJECT (SW REGAP)

    EPA Science Inventory

    We tested two methods for dataset generation and model construction, and three tree-classifier variants to identify the most parsimonious and thematically accurate mapping methodology for the SW ReGAP project. Competing methodologies were tested in the East Great Basin mapping un...

  7. Fine mapping on chromosome 13q32-34 and brain expression analysis implicates MYO16 in schizophrenia.

    PubMed

    Rodriguez-Murillo, Laura; Xu, Bin; Roos, J Louw; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria

    2014-03-01

    We previously reported linkage of schizophrenia and schizoaffective disorder to 13q32-34 in the European descent Afrikaner population from South Africa. The nature of genetic variation underlying linkage peaks in psychiatric disorders remains largely unknown and both rare and common variants may be contributing. Here, we examine the contribution of common variants located under the 13q32-34 linkage region. We used densely spaced SNPs to fine map the linkage peak region using both a discovery sample of 415 families and a meta-analysis incorporating two additional replication family samples. In a second phase of the study, we use one family-based data set with 237 families and independent case-control data sets for fine mapping of the common variant association signal using HapMap SNPs. We report a significant association with a genetic variant (rs9583277) within the gene encoding for the myosin heavy-chain Myr 8 (MYO16), which has been implicated in neuronal phosphoinositide 3-kinase signaling. Follow-up analysis of HapMap variation within MYO16 in a second set of Afrikaner families and additional case-control data sets of European descent highlighted a region across introns 2-6 as the most likely region to harbor common MYO16 risk variants. Expression analysis revealed a significant increase in the level of MYO16 expression in the brains of schizophrenia patients. Our results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia.

  8. OVAS: an open-source variant analysis suite with inheritance modelling.

    PubMed

    Mozere, Monika; Tekman, Mehmet; Kari, Jameela; Bockenhauer, Detlef; Kleta, Robert; Stanescu, Horia

    2018-02-08

    The advent of modern high-throughput genetics continually broadens the gap between the rising volume of sequencing data, and the tools required to process them. The need to pinpoint a small subset of functionally important variants has now shifted towards identifying the critical differences between normal variants and disease-causing ones. The ever-increasing reliance on cloud-based services for sequence analysis and the non-transparent methods they utilize has prompted the need for more in-situ services that can provide a safer and more accessible environment to process patient data, especially in circumstances where continuous internet usage is limited. To address these issues, we herein propose our standalone Open-source Variant Analysis Sequencing (OVAS) pipeline; consisting of three key stages of processing that pertain to the separate modes of annotation, filtering, and interpretation. Core annotation performs variant-mapping to gene-isoforms at the exon/intron level, append functional data pertaining the type of variant mutation, and determine hetero/homozygosity. An extensive inheritance-modelling module in conjunction with 11 other filtering components can be used in sequence ranging from single quality control to multi-file penetrance model specifics such as X-linked recessive or mosaicism. Depending on the type of interpretation required, additional annotation is performed to identify organ specificity through gene expression and protein domains. In the course of this paper we analysed an autosomal recessive case study. OVAS made effective use of the filtering modules to recapitulate the results of the study by identifying the prescribed compound-heterozygous disease pattern from exome-capture sequence input samples. OVAS is an offline open-source modular-driven analysis environment designed to annotate and extract useful variants from Variant Call Format (VCF) files, and process them under an inheritance context through a top-down filtering schema of swappable modules, run entirely off a live bootable medium and accessed locally through a web-browser.

  9. Multi-site genetic analysis of diffusion images and voxelwise heritability analysis: A pilot project of the ENIGMA–DTI working group

    PubMed Central

    Jahanshad, Neda; Kochunov, Peter; Sprooten, Emma; Mandl, René C.; Nichols, Thomas E.; Almassy, Laura; Blangero, John; Brouwer, Rachel M.; Curran, Joanne E.; de Zubicaray, Greig I.; Duggirala, Ravi; Fox, Peter T.; Hong, L. Elliot; Landman, Bennett A.; Martin, Nicholas G.; McMahon, Katie L.; Medland, Sarah E.; Mitchell, Braxton D.; Olvera, Rene L.; Peterson, Charles P.; Starr, John M.; Sussmann, Jessika E.; Toga, Arthur W.; Wardlaw, Joanna M.; Wright, Margaret J.; Hulshoff Pol, Hilleke E.; Bastin, Mark E.; McIntosh, Andrew M.; Deary, Ian J.; Thompson, Paul M.; Glahn, David C.

    2013-01-01

    The ENIGMA (Enhancing NeuroImaging Genetics through Meta-Analysis) Consortium was set up to analyze brain measures and genotypes from multiple sites across the world to improve the power to detect genetic variants that influence the brain. Diffusion tensor imaging (DTI) yields quantitative measures sensitive to brain development and degeneration, and some common genetic variants may be associated with white matter integrity or connectivity. DTI measures, such as the fractional anisotropy (FA) of water diffusion, may be useful for identifying genetic variants that influence brain microstructure. However, genome-wide association studies (GWAS) require large populations to obtain sufficient power to detect and replicate significant effects, motivating a multi-site consortium effort. As part of an ENIGMA–DTI working group, we analyzed high-resolution FA images from multiple imaging sites across North America, Australia, and Europe, to address the challenge of harmonizing imaging data collected at multiple sites. Four hundred images of healthy adults aged 18–85 from four sites were used to create a template and corresponding skeletonized FA image as a common reference space. Using twin and pedigree samples of different ethnicities, we used our common template to evaluate the heritability of tract-derived FA measures. We show that our template is reliable for integrating multiple datasets by combining results through meta-analysis and unifying the data through exploratory mega-analyses. Our results may help prioritize regions of the FA map that are consistently influenced by additive genetic factors for future genetic discovery studies. Protocols and templates are publicly available at (http://enigma.loni.ucla.edu/ongoing/dti-working-group/). PMID:23629049

  10. Enhancing genomic prediction with genome-wide association studies in multiparental maize populations

    USDA-ARS?s Scientific Manuscript database

    Genome-wide association mapping using dense marker sets has identified some nucleotide variants affecting complex traits which have been validated with fine-mapping and functional analysis. Many sequence variants associated with complex traits in maize have small effects and low repeatability, howev...

  11. Fine Mapping on Chromosome 13q32–34 and Brain Expression Analysis Implicates MYO16 in Schizophrenia

    PubMed Central

    Rodriguez-Murillo, Laura; Xu, Bin; Roos, J Louw; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria

    2014-01-01

    We previously reported linkage of schizophrenia and schizoaffective disorder to 13q32–34 in the European descent Afrikaner population from South Africa. The nature of genetic variation underlying linkage peaks in psychiatric disorders remains largely unknown and both rare and common variants may be contributing. Here, we examine the contribution of common variants located under the 13q32–34 linkage region. We used densely spaced SNPs to fine map the linkage peak region using both a discovery sample of 415 families and a meta-analysis incorporating two additional replication family samples. In a second phase of the study, we use one family-based data set with 237 families and independent case–control data sets for fine mapping of the common variant association signal using HapMap SNPs. We report a significant association with a genetic variant (rs9583277) within the gene encoding for the myosin heavy-chain Myr 8 (MYO16), which has been implicated in neuronal phosphoinositide 3-kinase signaling. Follow-up analysis of HapMap variation within MYO16 in a second set of Afrikaner families and additional case–control data sets of European descent highlighted a region across introns 2–6 as the most likely region to harbor common MYO16 risk variants. Expression analysis revealed a significant increase in the level of MYO16 expression in the brains of schizophrenia patients. Our results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia. PMID:24141571

  12. Variant Discovery and Fine Mapping of Genetic Loci Associated with Blood Pressure Traits in Hispanics and African Americans.

    PubMed

    Franceschini, Nora; Carty, Cara L; Lu, Yingchang; Tao, Ran; Sung, Yun Ju; Manichaikul, Ani; Haessler, Jeff; Fornage, Myriam; Schwander, Karen; Zubair, Niha; Bien, Stephanie; Hindorff, Lucia A; Guo, Xiuqing; Bielinski, Suzette J; Ehret, Georg; Kaufman, Joel D; Rich, Stephen S; Carlson, Christopher S; Bottinger, Erwin P; North, Kari E; Rao, D C; Chakravarti, Aravinda; Barrett, Paula Q; Loos, Ruth J F; Buyske, Steven; Kooperberg, Charles

    2016-01-01

    Despite the substantial burden of hypertension in US minority populations, few genetic studies of blood pressure have been conducted in Hispanics and African Americans, and it is unclear whether many of the established loci identified in European-descent populations contribute to blood pressure variation in non-European descent populations. Using the Metabochip array, we sought to characterize the genetic architecture of previously identified blood pressure loci, and identify novel cardiometabolic variants related to systolic and diastolic blood pressure in a multi-ethnic US population including Hispanics (n = 19,706) and African Americans (n = 18,744). Several known blood pressure loci replicated in African Americans and Hispanics. Fourteen variants in three loci (KCNK3, FGF5, ATXN2-SH2B3) were significantly associated with blood pressure in Hispanics. The most significant diastolic blood pressure variant identified in our analysis, rs2586886/KCNK3 (P = 5.2 x 10-9), also replicated in independent Hispanic and European-descent samples. African American and trans-ethnic meta-analysis data identified novel variants in the FGF5, ULK4 and HOXA-EVX1 loci, which have not been previously associated with blood pressure traits. Our identification and independent replication of variants in KCNK3, a gene implicated in primary hyperaldosteronism, as well as a variant in HOTTIP (HOXA-EVX1) suggest that further work to clarify the roles of these genes may be warranted. Overall, our findings suggest that loci identified in European descent populations also contribute to blood pressure variation in diverse populations including Hispanics and African Americans-populations that are understudied for hypertension genetic risk factors.

  13. Lupus Risk Variant Increases pSTAT1 Binding and Decreases ETS1 Expression

    PubMed Central

    Lu, Xiaoming; Zoller, Erin E.; Weirauch, Matthew T.; Wu, Zhiguo; Namjou, Bahram; Williams, Adrienne H.; Ziegler, Julie T.; Comeau, Mary E.; Marion, Miranda C.; Glenn, Stuart B.; Adler, Adam; Shen, Nan; Nath, Swapan K.; Stevens, Anne M.; Freedman, Barry I.; Tsao, Betty P.; Jacob, Chaim O.; Kamen, Diane L.; Brown, Elizabeth E.; Gilkeson, Gary S.; Alarcón, Graciela S.; Reveille, John D.; Anaya, Juan-Manuel; James, Judith A.; Sivils, Kathy L.; Criswell, Lindsey A.; Vilá, Luis M.; Alarcón-Riquelme, Marta E.; Petri, Michelle; Scofield, R. Hal; Kimberly, Robert P.; Ramsey-Goldman, Rosalind; Joo, Young Bin; Choi, Jeongim; Bae, Sang-Cheol; Boackle, Susan A.; Graham, Deborah Cunninghame; Vyse, Timothy J.; Guthridge, Joel M.; Gaffney, Patrick M.; Langefeld, Carl D.; Kelly, Jennifer A.; Greis, Kenneth D.; Kaufman, Kenneth M.; Harley, John B.; Kottyan, Leah C.

    2015-01-01

    Genetic variants at chromosomal region 11q23.3, near the gene ETS1, have been associated with systemic lupus erythematosus (SLE), or lupus, in independent cohorts of Asian ancestry. Several recent studies have implicated ETS1 as a critical driver of immune cell function and differentiation, and mice deficient in ETS1 develop an SLE-like autoimmunity. We performed a fine-mapping study of 14,551 subjects from multi-ancestral cohorts by starting with genotyped variants and imputing to all common variants spanning ETS1. By constructing genetic models via frequentist and Bayesian association methods, we identified 16 variants that are statistically likely to be causal. We functionally assessed each of these variants on the basis of their likelihood of affecting transcription factor binding, miRNA binding, or chromatin state. Of the four variants that we experimentally examined, only rs6590330 differentially binds lysate from B cells. Using mass spectrometry, we found more binding of the transcription factor signal transducer and activator of transcription 1 (STAT1) to DNA near the risk allele of rs6590330 than near the non-risk allele. Immunoblot analysis and chromatin immunoprecipitation of pSTAT1 in B cells heterozygous for rs6590330 confirmed that the risk allele increased binding to the active form of STAT1. Analysis with expression quantitative trait loci indicated that the risk allele of rs6590330 is associated with decreased ETS1 expression in Han Chinese, but not other ancestral cohorts. We propose a model in which the risk allele of rs6590330 is associated with decreased ETS1 expression and increases SLE risk by enhancing the binding of pSTAT1. PMID:25865496

  14. Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants.

    PubMed

    Dadaev, Tokhir; Saunders, Edward J; Newcombe, Paul J; Anokian, Ezequiel; Leongamornlert, Daniel A; Brook, Mark N; Cieza-Borrella, Clara; Mijuskovic, Martina; Wakerell, Sarah; Olama, Ali Amin Al; Schumacher, Fredrick R; Berndt, Sonja I; Benlloch, Sara; Ahmed, Mahbubl; Goh, Chee; Sheng, Xin; Zhang, Zhuo; Muir, Kenneth; Govindasami, Koveela; Lophatananon, Artitaya; Stevens, Victoria L; Gapstur, Susan M; Carter, Brian D; Tangen, Catherine M; Goodman, Phyllis; Thompson, Ian M; Batra, Jyotsna; Chambers, Suzanne; Moya, Leire; Clements, Judith; Horvath, Lisa; Tilley, Wayne; Risbridger, Gail; Gronberg, Henrik; Aly, Markus; Nordström, Tobias; Pharoah, Paul; Pashayan, Nora; Schleutker, Johanna; Tammela, Teuvo L J; Sipeky, Csilla; Auvinen, Anssi; Albanes, Demetrius; Weinstein, Stephanie; Wolk, Alicja; Hakansson, Niclas; West, Catharine; Dunning, Alison M; Burnet, Neil; Mucci, Lorelei; Giovannucci, Edward; Andriole, Gerald; Cussenot, Olivier; Cancel-Tassin, Géraldine; Koutros, Stella; Freeman, Laura E Beane; Sorensen, Karina Dalsgaard; Orntoft, Torben Falck; Borre, Michael; Maehle, Lovise; Grindedal, Eli Marie; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Martin, Richard M; Travis, Ruth C; Key, Tim J; Hamilton, Robert J; Fleshner, Neil E; Finelli, Antonio; Ingles, Sue Ann; Stern, Mariana C; Rosenstein, Barry; Kerns, Sarah; Ostrer, Harry; Lu, Yong-Jie; Zhang, Hong-Wei; Feng, Ninghan; Mao, Xueying; Guo, Xin; Wang, Guomin; Sun, Zan; Giles, Graham G; Southey, Melissa C; MacInnis, Robert J; FitzGerald, Liesel M; Kibel, Adam S; Drake, Bettina F; Vega, Ana; Gómez-Caamaño, Antonio; Fachal, Laura; Szulkin, Robert; Eklund, Martin; Kogevinas, Manolis; Llorca, Javier; Castaño-Vinyals, Gemma; Penney, Kathryn L; Stampfer, Meir; Park, Jong Y; Sellers, Thomas A; Lin, Hui-Yi; Stanford, Janet L; Cybulski, Cezary; Wokolorczyk, Dominika; Lubinski, Jan; Ostrander, Elaine A; Geybels, Milan S; Nordestgaard, Børge G; Nielsen, Sune F; Weisher, Maren; Bisbjerg, Rasmus; Røder, Martin Andreas; Iversen, Peter; Brenner, Hermann; Cuk, Katarina; Holleczek, Bernd; Maier, Christiane; Luedeke, Manuel; Schnoeller, Thomas; Kim, Jeri; Logothetis, Christopher J; John, Esther M; Teixeira, Manuel R; Paulo, Paula; Cardoso, Marta; Neuhausen, Susan L; Steele, Linda; Ding, Yuan Chun; De Ruyck, Kim; De Meerleer, Gert; Ost, Piet; Razack, Azad; Lim, Jasmine; Teo, Soo-Hwang; Lin, Daniel W; Newcomb, Lisa F; Lessel, Davor; Gamulin, Marija; Kulis, Tomislav; Kaneva, Radka; Usmani, Nawaid; Slavov, Chavdar; Mitev, Vanio; Parliament, Matthew; Singhal, Sandeep; Claessens, Frank; Joniau, Steven; Van den Broeck, Thomas; Larkin, Samantha; Townsend, Paul A; Aukim-Hastie, Claire; Gago-Dominguez, Manuela; Castelao, Jose Esteban; Martinez, Maria Elena; Roobol, Monique J; Jenster, Guido; van Schaik, Ron H N; Menegaux, Florence; Truong, Thérèse; Koudou, Yves Akoli; Xu, Jianfeng; Khaw, Kay-Tee; Cannon-Albright, Lisa; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Thibodeau, Stephen N; McDonnell, Shannon K; Schaid, Daniel J; Lindstrom, Sara; Turman, Constance; Ma, Jing; Hunter, David J; Riboli, Elio; Siddiq, Afshan; Canzian, Federico; Kolonel, Laurence N; Le Marchand, Loic; Hoover, Robert N; Machiela, Mitchell J; Kraft, Peter; Freedman, Matthew; Wiklund, Fredrik; Chanock, Stephen; Henderson, Brian E; Easton, Douglas F; Haiman, Christopher A; Eeles, Rosalind A; Conti, David V; Kote-Jarai, Zsofia

    2018-06-11

    Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.

  15. A global comparability approach for biosimilar monoclonal antibodies using LC-tandem MS based proteomics.

    PubMed

    Chen, Shun-Li; Wu, Shiaw-Lin; Huang, Li-Juan; Huang, Jia-Bao; Chen, Shu-Hui

    2013-06-01

    Liquid chromatography-tandem mass spectrometry-based proteomics for peptide mapping and sequencing was used to characterize the marketed monoclonal antibody trastuzumab and compare it with two biosimilar products, mAb A containing D359E and L361M variations at the Fc site and mAb B without variants. Complete sequence coverage (100%) including disulfide linkages, glycosylations and other commonly occurring modifications (i.e., deamidation, oxidation, dehydration and K-clipping) were identified using maps generated from multi-enzyme digestions. In addition to the targeted comparison for the relative populations of targeted modification forms, a non-targeted approach was used to globally compare ion intensities in tryptic maps. The non-targeted comparison provided an extra-dimensional view to examine any possible differences related to variants or modifications. A peptide containing the two variants in mAb A, D359E and L361M, was revealed using the non-targeted comparison of the tryptic maps. In contrast, no significant differences were observed when trastuzumab was self-compared or compared with mAb B. These results were consistent with the data derived from peptide sequencing via collision induced dissociation/electron transfer dissociation. Thus, combined targeted and non-targeted approaches using powerful mass spectrometry-based proteomic tools hold great promise for the structural characterization of biosimilar products. Copyright © 2013 Elsevier B.V. All rights reserved.

  16. Knowledge-driven binning approach for rare variant association analysis: application to neuroimaging biomarkers in Alzheimer's disease.

    PubMed

    Kim, Dokyoon; Basile, Anna O; Bang, Lisa; Horgusluoglu, Emrin; Lee, Seunggeun; Ritchie, Marylyn D; Saykin, Andrew J; Nho, Kwangsik

    2017-05-18

    Rapid advancement of next generation sequencing technologies such as whole genome sequencing (WGS) has facilitated the search for genetic factors that influence disease risk in the field of human genetics. To identify rare variants associated with human diseases or traits, an efficient genome-wide binning approach is needed. In this study we developed a novel biological knowledge-based binning approach for rare-variant association analysis and then applied the approach to structural neuroimaging endophenotypes related to late-onset Alzheimer's disease (LOAD). For rare-variant analysis, we used the knowledge-driven binning approach implemented in Bin-KAT, an automated tool, that provides 1) binning/collapsing methods for multi-level variant aggregation with a flexible, biologically informed binning strategy and 2) an option of performing unified collapsing and statistical rare variant analyses in one tool. A total of 750 non-Hispanic Caucasian participants from the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort who had both WGS data and magnetic resonance imaging (MRI) scans were used in this study. Mean bilateral cortical thickness of the entorhinal cortex extracted from MRI scans was used as an AD-related neuroimaging endophenotype. SKAT was used for a genome-wide gene- and region-based association analysis of rare variants (MAF (minor allele frequency) < 0.05) and potential confounding factors (age, gender, years of education, intracranial volume (ICV) and MRI field strength) for entorhinal cortex thickness were used as covariates. Significant associations were determined using FDR adjustment for multiple comparisons. Our knowledge-driven binning approach identified 16 functional exonic rare variants in FANCC significantly associated with entorhinal cortex thickness (FDR-corrected p-value < 0.05). In addition, the approach identified 7 evolutionary conserved regions, which were mapped to FAF1, RFX7, LYPLAL1 and GOLGA3, significantly associated with entorhinal cortex thickness (FDR-corrected p-value < 0.05). In further analysis, the functional exonic rare variants in FANCC were also significantly associated with hippocampal volume and cerebrospinal fluid (CSF) Aβ 1-42 (p-value < 0.05). Our novel binning approach identified rare variants in FANCC as well as 7 evolutionary conserved regions significantly associated with a LOAD-related neuroimaging endophenotype. FANCC (fanconi anemia complementation group C) has been shown to modulate TLR and p38 MAPK-dependent expression of IL-1β in macrophages. Our results warrant further investigation in a larger independent cohort and demonstrate that the biological knowledge-driven binning approach is a powerful strategy to identify rare variants associated with AD and other complex disease.

  17. Trans-Ethnic Meta-Analysis Identifies Common and Rare Variants Associated with Hepatocyte Growth Factor Levels in the Multi-Ethnic Study of Atherosclerosis (MESA)

    PubMed Central

    Larson, Nicholas B.; Berardi, Cecilia; Decker, Paul A.; Wassel, Christina L.; Kirsch, Phillip S.; Pankow, James S.; Sale, Michele M.; de Andrade, Mariza; Sicotte, Hugues; Tang, Weihong; Hanson, Naomi Q.; Tsai, Michael Y.; Taylor, Kent D.; Bielinski, Suzette J.

    2015-01-01

    Summary Hepatocyte growth factor (HGF) is a mesenchyme-derived pleiotropic factor that regulates cell growth, motility, mitogenesis, and morphogenesis in a variety of cells, and increased serum levels of HGF have been linked to a number of clinical and subclinical cardiovascular disease phenotypes. However, little is currently known regarding what genetic factors influence HGF levels, despite evidence of substantial genetic contributions to HGF variation. Based upon ethnicity-stratified single-variant association analysis and trans-ethnic meta-analysis of 6201 participants of the Multi-Ethnic Study of Atherosclerosis (MESA), we discovered five statistically significant common and low-frequency variants: HGF missense polymorphism rs5745687 (p.E299K) as well as four variants (rs16844364, rs4690098, rs114303452, rs3748034) within or in proximity to HGFAC. We also identified two significant ethnicity-specific gene-level associations (A1BG in African Americans; FASN in Chinese Americans) based upon low-frequency/rare variants, while meta-analysis of gene-level results identified a significant association for HGFAC. However, identified single-variant associations explained modest proportions of the total trait variation and were not significantly associated with coronary artery calcium or coronary heart disease. Our findings indicate genetic factors influencing circulating HGF levels may be complex and ethnically diverse. PMID:25998175

  18. Comparison of gene-based rare variant association mapping methods for quantitative traits in a bovine population with complex familial relationships.

    PubMed

    Zhang, Qianqian; Guldbrandtsen, Bernt; Calus, Mario P L; Lund, Mogens Sandø; Sahana, Goutam

    2016-08-17

    There is growing interest in the role of rare variants in the variation of complex traits due to increasing evidence that rare variants are associated with quantitative traits. However, association methods that are commonly used for mapping common variants are not effective to map rare variants. Besides, livestock populations have large half-sib families and the occurrence of rare variants may be confounded with family structure, which makes it difficult to disentangle their effects from family mean effects. We compared the power of methods that are commonly applied in human genetics to map rare variants in cattle using whole-genome sequence data and simulated phenotypes. We also studied the power of mapping rare variants using linear mixed models (LMM), which are the method of choice to account for both family relationships and population structure in cattle. We observed that the power of the LMM approach was low for mapping a rare variant (defined as those that have frequencies lower than 0.01) with a moderate effect (5 to 8 % of phenotypic variance explained by multiple rare variants that vary from 5 to 21 in number) contributing to a QTL with a sample size of 1000. In contrast, across the scenarios studied, statistical methods that are specialized for mapping rare variants increased power regardless of whether multiple rare variants or a single rare variant underlie a QTL. Different methods for combining rare variants in the test single nucleotide polymorphism set resulted in similar power irrespective of the proportion of total genetic variance explained by the QTL. However, when the QTL variance is very small (only 0.1 % of the total genetic variance), these specialized methods for mapping rare variants and LMM generally had no power to map the variants within a gene with sample sizes of 1000 or 5000. We observed that the methods that combine multiple rare variants within a gene into a meta-variant generally had greater power to map rare variants compared to LMM. Therefore, it is recommended to use rare variant association mapping methods to map rare genetic variants that affect quantitative traits in livestock, such as bovine populations.

  19. GARLIC: a bioinformatic toolkit for aetiologically connecting diseases and cell type-specific regulatory maps

    PubMed Central

    Nikolić, Miloš; Papantonis, Argyris

    2017-01-01

    Abstract Genome-wide association studies (GWAS) have emerged as a powerful tool to uncover the genetic basis of human common diseases, which often show a complex, polygenic and multi-factorial aetiology. These studies have revealed that 70–90% of all single nucleotide polymorphisms (SNPs) associated with common complex diseases do not occur within genes (i.e. they are non-coding), making the discovery of disease-causative genetic variants and the elucidation of the underlying pathological mechanisms far from straightforward. Based on emerging evidences suggesting that disease-associated SNPs are frequently found within cell type-specific regulatory sequences, here we present GARLIC (GWAS-based Prediction Toolkit for Connecting Diseases and Cell Types), a user-friendly, multi-purpose software with an associated database and online viewer that, using global maps of cis-regulatory elements, can aetiologically connect human diseases with relevant cell types. Additionally, GARLIC can be used to retrieve potential disease-causative genetic variants overlapping regulatory sequences of interest. Overall, GARLIC can satisfy several important needs within the field of medical genetics, thus potentially assisting in the ultimate goal of uncovering the elusive and complex genetic basis of common human disorders. PMID:28007912

  20. Fine-Scale Mapping of the 5q11.2 Breast Cancer Locus Reveals at Least Three Independent Risk Variants Regulating MAP3K1

    PubMed Central

    Glubb, Dylan M.; Maranian, Mel J.; Michailidou, Kyriaki; Pooley, Karen A.; Meyer, Kerstin B.; Kar, Siddhartha; Carlebur, Saskia; O’Reilly, Martin; Betts, Joshua A.; Hillman, Kristine M.; Kaufmann, Susanne; Beesley, Jonathan; Canisius, Sander; Hopper, John L.; Southey, Melissa C.; Tsimiklis, Helen; Apicella, Carmel; Schmidt, Marjanka K.; Broeks, Annegien; Hogervorst, Frans B.; van der Schoot, C. Ellen; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A.; Ruebner, Matthias; Ekici, Arif B.; Beckmann, Matthias W.; Peto, Julian; dos-Santos-Silva, Isabel; Fletcher, Olivia; Johnson, Nichola; Pharoah, Paul D.P.; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Burwinkel, Barbara; Marme, Frederik; Yang, Rongxi; Surowy, Harald; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; González-Neira, Anna; Benitez, Javier; Zamora, M. Pilar; Arias Perez, Jose Ignacio; Anton-Culver, Hoda; Neuhausen, Susan L.; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Schmutzler, Rita K.; Brauch, Hiltrud; Ko, Yon-Dschun; Brüning, Thomas; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tanaka, Hideo; Dörk, Thilo; Bogdanova, Natalia V.; Helbig, Sonja; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Lambrechts, Diether; Zhao, Hui; Weltens, Caroline; van Limbergen, Erik; Chang-Claude, Jenny; Flesch-Janys, Dieter; Rudolph, Anja; Seibold, Petra; Radice, Paolo; Peterlongo, Paolo; Barile, Monica; Capra, Fabio; Couch, Fergus J.; Olson, Janet E.; Hallberg, Emily; Vachon, Celine; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Teo, Soo Hwang; Yip, Cheng Har; See, Mee-Hoong; Cornes, Belinda; Cheng, Ching-Yu; Ikram, M. Kamran; Kristensen, Vessela; Zheng, Wei; Halverson, Sandra L.; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Van Asperen, Christi J.; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Czene, Kamila; Klevebring, Daniel; Darabi, Hatef; Eriksson, Mikael; Hooning, Maartje J.; Hollestelle, Antoinette; Martens, John W.M.; Collée, J. Margriet; Hall, Per; Li, Jingmei; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Cross, Simon S.; Reed, Malcolm W.R.; Blot, William; Signorello, Lisa B.; Cai, Qiuyin; Shah, Mitul; Ghoussaini, Maya; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Hartman, Mikael; Miao, Hui; Lim, Wei Yen; Tang, Anthony; Hamann, Ute; Torres, Diana; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Olswold, Curtis; Slager, Susan; Toland, Amanda E.; Yannoukakos, Drakoulis; Shen, Chen-Yang; Wu, Pei-Ei; Yu, Jyh-Cherng; Hou, Ming-Feng; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Jones, Michael; Pita, Guillermo; Alonso, M. Rosario; Álvarez, Nuria; Herrero, Daniel; Tessier, Daniel C.; Vincent, Daniel; Bacot, Francois; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Healey, Catherine S.; Brown, Melissa A.; Ponder, Bruce A.J.; Chenevix-Trench, Georgia; Thompson, Deborah J.; Edwards, Stacey L.; Easton, Douglas F.; Dunning, Alison M.; French, Juliet D.

    2015-01-01

    Genome-wide association studies (GWASs) have revealed SNP rs889312 on 5q11.2 to be associated with breast cancer risk in women of European ancestry. In an attempt to identify the biologically relevant variants, we analyzed 909 genetic variants across 5q11.2 in 103,991 breast cancer individuals and control individuals from 52 studies in the Breast Cancer Association Consortium. Multiple logistic regression analyses identified three independent risk signals: the strongest associations were with 15 correlated variants (iCHAV1), where the minor allele of the best candidate, rs62355902, associated with significantly increased risks of both estrogen-receptor-positive (ER+: odds ratio [OR] = 1.24, 95% confidence interval [CI] = 1.21–1.27, ptrend = 5.7 × 10−44) and estrogen-receptor-negative (ER−: OR = 1.10, 95% CI = 1.05–1.15, ptrend = 3.0 × 10−4) tumors. After adjustment for rs62355902, we found evidence of association of a further 173 variants (iCHAV2) containing three subsets with a range of effects (the strongest was rs113317823 [pcond = 1.61 × 10−5]) and five variants composing iCHAV3 (lead rs11949391; ER+: OR = 0.90, 95% CI = 0.87–0.93, pcond = 1.4 × 10−4). Twenty-six percent of the prioritized candidate variants coincided with four putative regulatory elements that interact with the MAP3K1 promoter through chromatin looping and affect MAP3K1 promoter activity. Functional analysis indicated that the cancer risk alleles of four candidates (rs74345699 and rs62355900 [iCHAV1], rs16886397 [iCHAV2a], and rs17432750 [iCHAV3]) increased MAP3K1 transcriptional activity. Chromatin immunoprecipitation analysis revealed diminished GATA3 binding to the minor (cancer-protective) allele of rs17432750, indicating a mechanism for its action. We propose that the cancer risk alleles act to increase MAP3K1 expression in vivo and might promote breast cancer cell survival. PMID:25529635

  1. Fine-scale mapping of the 5q11.2 breast cancer locus reveals at least three independent risk variants regulating MAP3K1.

    PubMed

    Glubb, Dylan M; Maranian, Mel J; Michailidou, Kyriaki; Pooley, Karen A; Meyer, Kerstin B; Kar, Siddhartha; Carlebur, Saskia; O'Reilly, Martin; Betts, Joshua A; Hillman, Kristine M; Kaufmann, Susanne; Beesley, Jonathan; Canisius, Sander; Hopper, John L; Southey, Melissa C; Tsimiklis, Helen; Apicella, Carmel; Schmidt, Marjanka K; Broeks, Annegien; Hogervorst, Frans B; van der Schoot, C Ellen; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A; Ruebner, Matthias; Ekici, Arif B; Beckmann, Matthias W; Peto, Julian; dos-Santos-Silva, Isabel; Fletcher, Olivia; Johnson, Nichola; Pharoah, Paul D P; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Burwinkel, Barbara; Marme, Frederik; Yang, Rongxi; Surowy, Harald; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; González-Neira, Anna; Benitez, Javier; Zamora, M Pilar; Arias Perez, Jose Ignacio; Anton-Culver, Hoda; Neuhausen, Susan L; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Schmutzler, Rita K; Brauch, Hiltrud; Ko, Yon-Dschun; Brüning, Thomas; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tanaka, Hideo; Dörk, Thilo; Bogdanova, Natalia V; Helbig, Sonja; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Wu, Anna H; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O; Lambrechts, Diether; Zhao, Hui; Weltens, Caroline; van Limbergen, Erik; Chang-Claude, Jenny; Flesch-Janys, Dieter; Rudolph, Anja; Seibold, Petra; Radice, Paolo; Peterlongo, Paolo; Barile, Monica; Capra, Fabio; Couch, Fergus J; Olson, Janet E; Hallberg, Emily; Vachon, Celine; Giles, Graham G; Milne, Roger L; McLean, Catriona; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Teo, Soo Hwang; Yip, Cheng Har; See, Mee-Hoong; Cornes, Belinda; Cheng, Ching-Yu; Ikram, M Kamran; Kristensen, Vessela; Zheng, Wei; Halverson, Sandra L; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Van Asperen, Christi J; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Czene, Kamila; Klevebring, Daniel; Darabi, Hatef; Eriksson, Mikael; Hooning, Maartje J; Hollestelle, Antoinette; Martens, John W M; Collée, J Margriet; Hall, Per; Li, Jingmei; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Blot, William; Signorello, Lisa B; Cai, Qiuyin; Shah, Mitul; Ghoussaini, Maya; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Hartman, Mikael; Miao, Hui; Lim, Wei Yen; Tang, Anthony; Hamann, Ute; Torres, Diana; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Olswold, Curtis; Slager, Susan; Toland, Amanda E; Yannoukakos, Drakoulis; Shen, Chen-Yang; Wu, Pei-Ei; Yu, Jyh-Cherng; Hou, Ming-Feng; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Jones, Michael; Pita, Guillermo; Alonso, M Rosario; Álvarez, Nuria; Herrero, Daniel; Tessier, Daniel C; Vincent, Daniel; Bacot, Francois; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Healey, Catherine S; Brown, Melissa A; Ponder, Bruce A J; Chenevix-Trench, Georgia; Thompson, Deborah J; Edwards, Stacey L; Easton, Douglas F; Dunning, Alison M; French, Juliet D

    2015-01-08

    Genome-wide association studies (GWASs) have revealed SNP rs889312 on 5q11.2 to be associated with breast cancer risk in women of European ancestry. In an attempt to identify the biologically relevant variants, we analyzed 909 genetic variants across 5q11.2 in 103,991 breast cancer individuals and control individuals from 52 studies in the Breast Cancer Association Consortium. Multiple logistic regression analyses identified three independent risk signals: the strongest associations were with 15 correlated variants (iCHAV1), where the minor allele of the best candidate, rs62355902, associated with significantly increased risks of both estrogen-receptor-positive (ER(+): odds ratio [OR] = 1.24, 95% confidence interval [CI] = 1.21-1.27, ptrend = 5.7 × 10(-44)) and estrogen-receptor-negative (ER(-): OR = 1.10, 95% CI = 1.05-1.15, ptrend = 3.0 × 10(-4)) tumors. After adjustment for rs62355902, we found evidence of association of a further 173 variants (iCHAV2) containing three subsets with a range of effects (the strongest was rs113317823 [pcond = 1.61 × 10(-5)]) and five variants composing iCHAV3 (lead rs11949391; ER(+): OR = 0.90, 95% CI = 0.87-0.93, pcond = 1.4 × 10(-4)). Twenty-six percent of the prioritized candidate variants coincided with four putative regulatory elements that interact with the MAP3K1 promoter through chromatin looping and affect MAP3K1 promoter activity. Functional analysis indicated that the cancer risk alleles of four candidates (rs74345699 and rs62355900 [iCHAV1], rs16886397 [iCHAV2a], and rs17432750 [iCHAV3]) increased MAP3K1 transcriptional activity. Chromatin immunoprecipitation analysis revealed diminished GATA3 binding to the minor (cancer-protective) allele of rs17432750, indicating a mechanism for its action. We propose that the cancer risk alleles act to increase MAP3K1 expression in vivo and might promote breast cancer cell survival. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  2. A note on the efficiencies of sampling strategies in two-stage Bayesian regional fine mapping of a quantitative trait.

    PubMed

    Chen, Zhijian; Craiu, Radu V; Bull, Shelley B

    2014-11-01

    In focused studies designed to follow up associations detected in a genome-wide association study (GWAS), investigators can proceed to fine-map a genomic region by targeted sequencing or dense genotyping of all variants in the region, aiming to identify a functional sequence variant. For the analysis of a quantitative trait, we consider a Bayesian approach to fine-mapping study design that incorporates stratification according to a promising GWAS tag SNP in the same region. Improved cost-efficiency can be achieved when the fine-mapping phase incorporates a two-stage design, with identification of a smaller set of more promising variants in a subsample taken in stage 1, followed by their evaluation in an independent stage 2 subsample. To avoid the potential negative impact of genetic model misspecification on inference we incorporate genetic model selection based on posterior probabilities for each competing model. Our simulation study shows that, compared to simple random sampling that ignores genetic information from GWAS, tag-SNP-based stratified sample allocation methods reduce the number of variants continuing to stage 2 and are more likely to promote the functional sequence variant into confirmation studies. © 2014 WILEY PERIODICALS, INC.

  3. Rare variant testing across methods and thresholds using the multi-kernel sequence kernel association test (MK-SKAT).

    PubMed

    Urrutia, Eugene; Lee, Seunggeun; Maity, Arnab; Zhao, Ni; Shen, Judong; Li, Yun; Wu, Michael C

    Analysis of rare genetic variants has focused on region-based analysis wherein a subset of the variants within a genomic region is tested for association with a complex trait. Two important practical challenges have emerged. First, it is difficult to choose which test to use. Second, it is unclear which group of variants within a region should be tested. Both depend on the unknown true state of nature. Therefore, we develop the Multi-Kernel SKAT (MK-SKAT) which tests across a range of rare variant tests and groupings. Specifically, we demonstrate that several popular rare variant tests are special cases of the sequence kernel association test which compares pair-wise similarity in trait value to similarity in the rare variant genotypes between subjects as measured through a kernel function. Choosing a particular test is equivalent to choosing a kernel. Similarly, choosing which group of variants to test also reduces to choosing a kernel. Thus, MK-SKAT uses perturbation to test across a range of kernels. Simulations and real data analyses show that our framework controls type I error while maintaining high power across settings: MK-SKAT loses power when compared to the kernel for a particular scenario but has much greater power than poor choices.

  4. Mapping of Epitopes Occurring in Bovine α(s1)-Casein Variants by Peptide Microarray Immunoassay.

    PubMed

    Lisson, Maria; Erhardt, Georg

    2016-01-01

    Immunoglobulin E epitope mapping of milk proteins reveals important information about their immunologic properties. Genetic variants of αS1-casein, one of the major allergens in bovine milk, are until now not considered when discussing the allergenic potential. Here we describe the complete procedure to assess the allergenicity of αS1-casein variants B and C, which are frequent in most breeds, starting from milk with identification and purification of casein variants by isoelectric focusing (IEF) and anion-exchange chromatography, followed by in vitro gastrointestinal digestion of the casein variants, identification of the resulting peptides by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), in silico analysis of the variant-specific peptides as allergenic epitopes, and determination of their IgE-binding properties by microarray immunoassay with cow's milk allergic human sera.

  5. Mendelian randomization with fine-mapped genetic data: Choosing from large numbers of correlated instrumental variables.

    PubMed

    Burgess, Stephen; Zuber, Verena; Valdes-Marquez, Elsa; Sun, Benjamin B; Hopewell, Jemma C

    2017-12-01

    Mendelian randomization uses genetic variants to make causal inferences about the effect of a risk factor on an outcome. With fine-mapped genetic data, there may be hundreds of genetic variants in a single gene region any of which could be used to assess this causal relationship. However, using too many genetic variants in the analysis can lead to spurious estimates and inflated Type 1 error rates. But if only a few genetic variants are used, then the majority of the data is ignored and estimates are highly sensitive to the particular choice of variants. We propose an approach based on summarized data only (genetic association and correlation estimates) that uses principal components analysis to form instruments. This approach has desirable theoretical properties: it takes the totality of data into account and does not suffer from numerical instabilities. It also has good properties in simulation studies: it is not particularly sensitive to varying the genetic variants included in the analysis or the genetic correlation matrix, and it does not have greatly inflated Type 1 error rates. Overall, the method gives estimates that are less precise than those from variable selection approaches (such as using a conditional analysis or pruning approach to select variants), but are more robust to seemingly arbitrary choices in the variable selection step. Methods are illustrated by an example using genetic associations with testosterone for 320 genetic variants to assess the effect of sex hormone related pathways on coronary artery disease risk, in which variable selection approaches give inconsistent inferences. © 2017 The Authors Genetic Epidemiology Published by Wiley Periodicals, Inc.

  6. Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis

    PubMed Central

    Beecham, Ashley H; Patsopoulos, Nikolaos A; Xifara, Dionysia K; Davis, Mary F; Kemppinen, Anu; Cotsapas, Chris; Shahi, Tejas S; Spencer, Chris; Booth, David; Goris, An; Oturai, Annette; Saarela, Janna; Fontaine, Bertrand; Hemmer, Bernhard; Martin, Claes; Zipp, Frauke; D’alfonso, Sandra; Martinelli-Boneschi, Filippo; Taylor, Bruce; Harbo, Hanne F; Kockum, Ingrid; Hillert, Jan; Olsson, Tomas; Ban, Maria; Oksenberg, Jorge R; Hintzen, Rogier; Barcellos, Lisa F; Agliardi, Cristina; Alfredsson, Lars; Alizadeh, Mehdi; Anderson, Carl; Andrews, Robert; Søndergaard, Helle Bach; Baker, Amie; Band, Gavin; Baranzini, Sergio E; Barizzone, Nadia; Barrett, Jeffrey; Bellenguez, Céline; Bergamaschi, Laura; Bernardinelli, Luisa; Berthele, Achim; Biberacher, Viola; Binder, Thomas M C; Blackburn, Hannah; Bomfim, Izaura L; Brambilla, Paola; Broadley, Simon; Brochet, Bruno; Brundin, Lou; Buck, Dorothea; Butzkueven, Helmut; Caillier, Stacy J; Camu, William; Carpentier, Wassila; Cavalla, Paola; Celius, Elisabeth G; Coman, Irène; Comi, Giancarlo; Corrado, Lucia; Cosemans, Leentje; Cournu-Rebeix, Isabelle; Cree, Bruce A C; Cusi, Daniele; Damotte, Vincent; Defer, Gilles; Delgado, Silvia R; Deloukas, Panos; di Sapio, Alessia; Dilthey, Alexander T; Donnelly, Peter; Dubois, Bénédicte; Duddy, Martin; Edkins, Sarah; Elovaara, Irina; Esposito, Federica; Evangelou, Nikos; Fiddes, Barnaby; Field, Judith; Franke, Andre; Freeman, Colin; Frohlich, Irene Y; Galimberti, Daniela; Gieger, Christian; Gourraud, Pierre-Antoine; Graetz, Christiane; Graham, Andrew; Grummel, Verena; Guaschino, Clara; Hadjixenofontos, Athena; Hakonarson, Hakon; Halfpenny, Christopher; Hall, Gillian; Hall, Per; Hamsten, Anders; Harley, James; Harrower, Timothy; Hawkins, Clive; Hellenthal, Garrett; Hillier, Charles; Hobart, Jeremy; Hoshi, Muni; Hunt, Sarah E; Jagodic, Maja; Jelčić, Ilijas; Jochim, Angela; Kendall, Brian; Kermode, Allan; Kilpatrick, Trevor; Koivisto, Keijo; Konidari, Ioanna; Korn, Thomas; Kronsbein, Helena; Langford, Cordelia; Larsson, Malin; Lathrop, Mark; Lebrun-Frenay, Christine; Lechner-Scott, Jeannette; Lee, Michelle H; Leone, Maurizio A; Leppä, Virpi; Liberatore, Giuseppe; Lie, Benedicte A; Lill, Christina M; Lindén, Magdalena; Link, Jenny; Luessi, Felix; Lycke, Jan; Macciardi, Fabio; Männistö, Satu; Manrique, Clara P; Martin, Roland; Martinelli, Vittorio; Mason, Deborah; Mazibrada, Gordon; McCabe, Cristin; Mero, Inger-Lise; Mescheriakova, Julia; Moutsianas, Loukas; Myhr, Kjell-Morten; Nagels, Guy; Nicholas, Richard; Nilsson, Petra; Piehl, Fredrik; Pirinen, Matti; Price, Siân E; Quach, Hong; Reunanen, Mauri; Robberecht, Wim; Robertson, Neil P; Rodegher, Mariaemma; Rog, David; Salvetti, Marco; Schnetz-Boutaud, Nathalie C; Sellebjerg, Finn; Selter, Rebecca C; Schaefer, Catherine; Shaunak, Sandip; Shen, Ling; Shields, Simon; Siffrin, Volker; Slee, Mark; Sorensen, Per Soelberg; Sorosina, Melissa; Sospedra, Mireia; Spurkland, Anne; Strange, Amy; Sundqvist, Emilie; Thijs, Vincent; Thorpe, John; Ticca, Anna; Tienari, Pentti; van Duijn, Cornelia; Visser, Elizabeth M; Vucic, Steve; Westerlind, Helga; Wiley, James S; Wilkins, Alastair; Wilson, James F; Winkelmann, Juliane; Zajicek, John; Zindler, Eva; Haines, Jonathan L; Pericak-Vance, Margaret A; Ivinson, Adrian J; Stewart, Graeme; Hafler, David; Hauser, Stephen L; Compston, Alastair; McVean, Gil; De Jager, Philip; Sawcer, Stephen; McCauley, Jacob L

    2013-01-01

    Using the ImmunoChip custom genotyping array, we analysed 14,498 multiple sclerosis subjects and 24,091 healthy controls for 161,311 autosomal variants and identified 135 potentially associated regions (p-value < 1.0 × 10-4). In a replication phase, we combined these data with previous genome-wide association study (GWAS) data from an independent 14,802 multiple sclerosis subjects and 26,703 healthy controls. In these 80,094 individuals of European ancestry we identified 48 new susceptibility variants (p-value < 5.0 × 10-8); three found after conditioning on previously identified variants. Thus, there are now 110 established multiple sclerosis risk variants in 103 discrete loci outside of the Major Histocompatibility Complex. With high resolution Bayesian fine-mapping, we identified five regions where one variant accounted for more than 50% of the posterior probability of association. This study enhances the catalogue of multiple sclerosis risk variants and illustrates the value of fine-mapping in the resolution of GWAS signals. PMID:24076602

  7. Validation of a next-generation sequencing assay for clinical molecular oncology.

    PubMed

    Cottrell, Catherine E; Al-Kateb, Hussam; Bredemeyer, Andrew J; Duncavage, Eric J; Spencer, David H; Abel, Haley J; Lockwood, Christina M; Hagemann, Ian S; O'Guin, Stephanie M; Burcea, Lauren C; Sawyer, Christopher S; Oschwald, Dayna M; Stratman, Jennifer L; Sher, Dorie A; Johnson, Mark R; Brown, Justin T; Cliften, Paul F; George, Bijoy; McIntosh, Leslie D; Shrivastava, Savita; Nguyen, Tudung T; Payton, Jacqueline E; Watson, Mark A; Crosby, Seth D; Head, Richard D; Mitra, Robi D; Nagarajan, Rakesh; Kulkarni, Shashikant; Seibert, Karen; Virgin, Herbert W; Milbrandt, Jeffrey; Pfeifer, John D

    2014-01-01

    Currently, oncology testing includes molecular studies and cytogenetic analysis to detect genetic aberrations of clinical significance. Next-generation sequencing (NGS) allows rapid analysis of multiple genes for clinically actionable somatic variants. The WUCaMP assay uses targeted capture for NGS analysis of 25 cancer-associated genes to detect mutations at actionable loci. We present clinical validation of the assay and a detailed framework for design and validation of similar clinical assays. Deep sequencing of 78 tumor specimens (≥ 1000× average unique coverage across the capture region) achieved high sensitivity for detecting somatic variants at low allele fraction (AF). Validation revealed sensitivities and specificities of 100% for detection of single-nucleotide variants (SNVs) within coding regions, compared with SNP array sequence data (95% CI = 83.4-100.0 for sensitivity and 94.2-100.0 for specificity) or whole-genome sequencing (95% CI = 89.1-100.0 for sensitivity and 99.9-100.0 for specificity) of HapMap samples. Sensitivity for detecting variants at an observed 10% AF was 100% (95% CI = 93.2-100.0) in HapMap mixes. Analysis of 15 masked specimens harboring clinically reported variants yielded concordant calls for 13/13 variants at AF of ≥ 15%. The WUCaMP assay is a robust and sensitive method to detect somatic variants of clinical significance in molecular oncology laboratories, with reduced time and cost of genetic analysis allowing for strategic patient management. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  8. Fine-Mapping of Common Genetic Variants Associated with Colorectal Tumor Risk Identified Potential Functional Variants

    PubMed Central

    Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Carlson, Christopher S.; Casey, Graham; Chang-Claude, Jenny; Conti, David V.; Curtis, Keith R.; Duggan, David; Gallinger, Steven; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Jenkins, Mark A.; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M.; Newcomb, Polly A.; Nickerson, Deborah A.; Potter, John D.; Schoen, Robert E.; Schumacher, Fredrick R.; Seminara, Daniela; Slattery, Martha L.; Hsu, Li; Chan, Andrew T.; White, Emily; Berndt, Sonja I.; Peters, Ulrike

    2016-01-01

    Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672

  9. Novel features and enhancements in BioBin, a tool for the biologically inspired binning and association analysis of rare variants

    PubMed Central

    Byrska-Bishop, Marta; Wallace, John; Frase, Alexander T; Ritchie, Marylyn D

    2018-01-01

    Abstract Motivation BioBin is an automated bioinformatics tool for the multi-level biological binning of sequence variants. Herein, we present a significant update to BioBin which expands the software to facilitate a comprehensive rare variant analysis and incorporates novel features and analysis enhancements. Results In BioBin 2.3, we extend our software tool by implementing statistical association testing, updating the binning algorithm, as well as incorporating novel analysis features providing for a robust, highly customizable, and unified rare variant analysis tool. Availability and implementation The BioBin software package is open source and freely available to users at http://www.ritchielab.com/software/biobin-download Contact mdritchie@geisinger.edu Supplementary information Supplementary data are available at Bioinformatics online. PMID:28968757

  10. Multi-ethnic analysis reveals soluble L-selectin may be post-transcriptionally regulated by 3′UTR polymorphism: the Multi-Ethnic Study of Atherosclerosis (MESA)

    PubMed Central

    Berardi, Cecilia; Larson, Nicholas B.; Decker, Paul A.; Wassel, Christina L.; Kirsch, Phillip S.; Pankow, James S.; Sale, Michele M.; de Andrade, Mariza; Sicotte, Hugues; Tang, Weihong; Hanson, Naomi Q.; Tsai, Michael Y.; da Chen, Yii-Der I; Bielinski, Suzette J.

    2015-01-01

    L-selectin is constitutively expressed on leukocytes and mediates their interaction with endothelial cells during inflammation. Previous studies on the association of soluble L-selectin (sL-selectin) with cardiovascular disease (CVD) are inconsistent. Genetic variants associated with sL-selectin levels may be a better surrogate of levels over a lifetime. We explored the association of genetic variants and sL-selectin levels in a race/ethnicity stratified random sample of 2,403 participants in the Multi-Ethnic Study of Atherosclerosis (MESA). Through a genome-wide analysis with additive linear regression models, we found that rs12938 on the SELL gene accounted for a significant portion of the protein level variance across all four races/ethnicities. To evaluate potential additional associations, elastic net models were used for variants located in the SELL/SELP/SELE genetic region and an additional two SNPs, rs3917768 and rs4987361, were associated with sL-selectin levels in African Americans. These variants accounted for a portion of protein variance that ranged from 4% in Hispanic to 14% in African Americans. To investigate the relationship of these variants with CVD, 6,317 subjects were used. No significant association was found between any of the identified SNPs and carotid intima-media thickness or presence of carotid plaque using linear and logistic regression, respectively. Similarly no significant results were found for coronary artery calcium or coronary heart disease events. In conclusion, we found that variants within the SELL gene are associated with sL-selectin levels. Despite accounting for a significant portion of the protein level variance, none of the variants was associated with clinical or subclinical CVD. PMID:25576479

  11. Quantitative transmission electron microscopy analysis of multi-variant grains in present L1{sub 0}-FePt based heat assisted magnetic recording media

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ho, Hoan, E-mail: hoan.ho@wdc.com; Department of Materials Science and Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; Zhu, Jingxi, E-mail: jingxiz@andrew.cmu.edu

    2014-11-21

    We present a study on atomic ordering within individual grains in granular L1{sub 0}-FePt thin films using transmission electron microscopy techniques. The film, used as a medium for heat assisted magnetic recording, consists of a single layer of FePt grains separated by non-magnetic grain boundaries and is grown on an MgO underlayer. Using convergent-beam techniques, diffraction patterns of individual grains are obtained for a large number of crystallites. The study found that although the majority of grains are ordered in the perpendicular direction, more than 15% of them are multi-variant, or of in-plane c-axis orientation, or disordered fcc. It wasmore » also found that these multi-variant and in-plane grains have always grown across MgO grain boundaries separating two or more MgO grains of the underlayer. The in-plane ordered portion within a multi-variant L1{sub 0}-FePt grain always lacks atomic coherence with the MgO directly underneath it, whereas, the perpendicularly ordered portion is always coherent with the underlying MgO grain. Since the existence of multi-variant and in-plane ordered grains are severely detrimental to high density data storage capability, the understanding of their formation mechanism obtained here should make a significant impact on the future development of hard disk drive technology.« less

  12. MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping.

    PubMed

    Lee, Wan-Ping; Stromberg, Michael P; Ward, Alistair; Stewart, Chip; Garrison, Erik P; Marth, Gabor T

    2014-01-01

    MOSAIK is a stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome. Uniquely among current mapping tools, MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT. Indeed, MOSAIK was the only aligner to provide consistent mappings for all the generated data (sequencing technologies, low-coverage and exome) in the 1000 Genomes Project. To provide highly accurate alignments, MOSAIK employs a hash clustering strategy coupled with the Smith-Waterman algorithm. This method is well-suited to capture mismatches as well as short insertions and deletions. To support the growing interest in larger structural variant (SV) discovery, MOSAIK provides explicit support for handling known-sequence SVs, e.g. mobile element insertions (MEIs) as well as generating outputs tailored to aid in SV discovery. All variant discovery benefits from an accurate description of the read placement confidence. To this end, MOSAIK uses a neural-network based training scheme to provide well-calibrated mapping quality scores, demonstrated by a correlation coefficient between MOSAIK assigned and actual mapping qualities greater than 0.98. In order to ensure that studies of any genome are supported, a training pipeline is provided to ensure optimal mapping quality scores for the genome under investigation. MOSAIK is multi-threaded, open source, and incorporated into our command and pipeline launcher system GKNO (http://gkno.me).

  13. MOSAIK: A Hash-Based Algorithm for Accurate Next-Generation Sequencing Short-Read Mapping

    PubMed Central

    Lee, Wan-Ping; Stromberg, Michael P.; Ward, Alistair; Stewart, Chip; Garrison, Erik P.; Marth, Gabor T.

    2014-01-01

    MOSAIK is a stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome. Uniquely among current mapping tools, MOSAIK can align reads generated by all the major sequencing technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, Ion Torrent and Pacific BioSciences SMRT. Indeed, MOSAIK was the only aligner to provide consistent mappings for all the generated data (sequencing technologies, low-coverage and exome) in the 1000 Genomes Project. To provide highly accurate alignments, MOSAIK employs a hash clustering strategy coupled with the Smith-Waterman algorithm. This method is well-suited to capture mismatches as well as short insertions and deletions. To support the growing interest in larger structural variant (SV) discovery, MOSAIK provides explicit support for handling known-sequence SVs, e.g. mobile element insertions (MEIs) as well as generating outputs tailored to aid in SV discovery. All variant discovery benefits from an accurate description of the read placement confidence. To this end, MOSAIK uses a neural-network based training scheme to provide well-calibrated mapping quality scores, demonstrated by a correlation coefficient between MOSAIK assigned and actual mapping qualities greater than 0.98. In order to ensure that studies of any genome are supported, a training pipeline is provided to ensure optimal mapping quality scores for the genome under investigation. MOSAIK is multi-threaded, open source, and incorporated into our command and pipeline launcher system GKNO (http://gkno.me). PMID:24599324

  14. A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data.

    PubMed

    Buchanan, Carrie C; Torstenson, Eric S; Bush, William S; Ritchie, Marylyn D

    2012-01-01

    Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects.

  15. Structural analysis of an HLA-B27 functional variant, B27d detected in American blacks

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rojo, S.; Aparicio, P.; Hansen, J.A.

    1987-11-15

    The structure of a new functional variant B27d has been established by comparative peptide mapping and radiochemical sequencing. This analysis complete the structural characterization of the six know histocompatibility leukocyte antigen (HLA)-B27 subtypes. The only detected amino acid change between the main HLA-B27.1 subtype and B27d is that of Try/sub 59/ to His/sub 59/. Position 59 has not been previously found to vary among class I HLA or H-2 antigens. Such substitution accounts for the reported isoelectric focusing pattern of this variant. HLA-B27d is the only B27 variant found to differ from other subtypes by a single amino acid replacement.more » The nature of the change is compatible with its origin by a point mutation from HLB-B27.1. Because B27d was found only American blacks and in no other ethnic groups, it is suggested that this variant originated as a result of a mutation of the B27.1 gene that occurred within the black population. Structural analysis of B27d was done by comparative mapping. Radiochemical sequencing was carried out with /sup 14/C-labeled and /sup 3/H-labeled amino acids.« less

  16. Variants for HDL-C, LDL-C and Triglycerides Identified from Admixture Mapping and Fine-Mapping Analysis in African-American Families

    PubMed Central

    Shetty, Priya B.; Tang, Hua; Feng, Tao; Tayo, Bamidele; Morrison, Alanna C.; Kardia, Sharon L.R.; Hanis, Craig L.; Arnett, Donna K.; Hunt, Steven C.; Boerwinkle, Eric; Rao, D.C.; Cooper, R.S.; Risch, Neil; Zhu, Xiaofeng

    2015-01-01

    Background Admixture mapping of lipids was followed-up by family-based association analysis to identify variants for cardiovascular disease in African-Americans. Methods and Results The present study conducted admixture mapping analysis for total cholesterol, high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C) and triglycerides. The analysis was performed in 1,905 unrelated African-American subjects from the National Heart, Lung and Blood Institute’s Family Blood Pressure Program. Regions showing admixture evidence were followed-up with family-based association analysis in 3,556 African-American subjects from the FBPP. The admixture mapping and family-based association analyses were adjusted for age, age2, sex, body-mass-index, and genome-wide mean ancestry to minimize the confounding due to population stratification. Regions that were suggestive of local ancestry association evidence were found on chromosomes 7 (LDL-C), 8 (HDL-C), 14 (triglycerides) and 19 (total cholesterol and triglycerides). In the fine-mapping analysis, 52,939 SNPs were tested and 11 SNPs (8 independent SNPs) showed nominal significant association with HDL-C (2 SNPs), LDL-C (4 SNPs) and triglycerides (5 SNPs). The family data was used in the fine-mapping to identify SNPs that showed novel associations with lipids and regions including genes with known associations for cardiovascular disease. Conclusions This study identified regions on chromosomes 7, 8, 14 and 19 and 11 SNPs from the fine-mapping analysis that were associated with HDL-C, LDL-C and triglycerides for further studies of cardiovascular disease in African-Americans. PMID:25552592

  17. A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data

    PubMed Central

    Buchanan, Carrie C; Torstenson, Eric S; Bush, William S

    2012-01-01

    Background Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. Methods To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Results Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Conclusions Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects. PMID:22319179

  18. Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits.

    PubMed

    Wu, Yang; Zeng, Jian; Zhang, Futao; Zhu, Zhihong; Qi, Ting; Zheng, Zhili; Lloyd-Jones, Luke R; Marioni, Riccardo E; Martin, Nicholas G; Montgomery, Grant W; Deary, Ian J; Wray, Naomi R; Visscher, Peter M; McRae, Allan F; Yang, Jian

    2018-03-02

    The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.

  19. G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.

    PubMed

    Solomon, Oz; Kunik, Vered; Simon, Amos; Kol, Nitzan; Barel, Ortal; Lev, Atar; Amariglio, Ninette; Somech, Raz; Rechavi, Gidi; Eyal, Eran

    2016-08-26

    Evaluation of the possible implications of genomic variants is an increasingly important task in the current high throughput sequencing era. Structural information however is still not routinely exploited during this evaluation process. The main reasons can be attributed to the partial structural coverage of the human proteome and the lack of tools which conveniently convert genomic positions, which are the frequent output of genomic pipelines, to proteins and structure coordinates. We present G23D, a tool for conversion of human genomic coordinates to protein coordinates and protein structures. G23D allows mapping of genomic positions/variants on evolutionary related (and not only identical) protein three dimensional (3D) structures as well as on theoretical models. By doing so it significantly extends the space of variants for which structural insight is feasible. To facilitate interpretation of the variant consequence, pathogenic variants, functional sites and polymorphism sites are displayed on protein sequence and structure diagrams alongside the input variants. G23D also provides modeling of the mutant structure, analysis of intra-protein contacts and instant access to functional predictions and predictions of thermo-stability changes. G23D is available at http://www.sheba-cancer.org.il/G23D . G23D extends the fraction of variants for which structural analysis is applicable and provides better and faster accessibility for structural data to biologists and geneticists who routinely work with genomic information.

  20. One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies

    PubMed Central

    Yuan, Shuai; Johnston, H. Richard; Zhang, Guosheng; Li, Yun; Hu, Yi-Juan; Qin, Zhaohui S.

    2015-01-01

    With rapid decline of the sequencing cost, researchers today rush to embrace whole genome sequencing (WGS), or whole exome sequencing (WES) approach as the next powerful tool for relating genetic variants to human diseases and phenotypes. A fundamental step in analyzing WGS and WES data is mapping short sequencing reads back to the reference genome. This is an important issue because incorrectly mapped reads affect the downstream variant discovery, genotype calling and association analysis. Although many read mapping algorithms have been developed, the majority of them uses the universal reference genome and do not take sequence variants into consideration. Given that genetic variants are ubiquitous, it is highly desirable if they can be factored into the read mapping procedure. In this work, we developed a novel strategy that utilizes genotypes obtained a priori to customize the universal haploid reference genome into a personalized diploid reference genome. The new strategy is implemented in a program named RefEditor. When applying RefEditor to real data, we achieved encouraging improvements in read mapping, variant discovery and genotype calling. Compared to standard approaches, RefEditor can significantly increase genotype calling consistency (from 43% to 61% at 4X coverage; from 82% to 92% at 20X coverage) and reduce Mendelian inconsistency across various sequencing depths. Because many WGS and WES studies are conducted on cohorts that have been genotyped using array-based genotyping platforms previously or concurrently, we believe the proposed strategy will be of high value in practice, which can also be applied to the scenario where multiple NGS experiments are conducted on the same cohort. The RefEditor sources are available at https://github.com/superyuan/refeditor. PMID:26267278

  1. Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

    PubMed Central

    Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.

    2015-01-01

    ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133

  2. Genetic epidemiology of pharmacogenetic variants in South East Asian Malays using whole-genome sequences.

    PubMed

    Sivadas, A; Salleh, M Z; Teh, L K; Scaria, V

    2017-10-01

    Expanding the scope of pharmacogenomic research by including multiple global populations is integral to building robust evidence for its clinical translation. Deep whole-genome sequencing of diverse ethnic populations provides a unique opportunity to study rare and common pharmacogenomic markers that often vary in frequency across populations. In this study, we aim to build a diverse map of pharmacogenetic variants in South East Asian (SEA) Malay population using deep whole-genome sequences of 100 healthy SEA Malay individuals. We investigated the allelic diversity of potentially deleterious pharmacogenomic variants in SEA Malay population. Our analysis revealed 227 common and 466 rare potentially functional single nucleotide variants (SNVs) in 437 pharmacogenomic genes involved in drug metabolism, transport and target genes, including 74 novel variants. This study has created one of the most comprehensive maps of pharmacogenetic markers in any population from whole genomes and will hugely benefit pharmacogenomic investigations and drug dosage recommendations in SEA Malays.

  3. Filtering genetic variants and placing informative priors based on putative biological function.

    PubMed

    Friedrichs, Stefanie; Malzahn, Dörthe; Pugh, Elizabeth W; Almeida, Marcio; Liu, Xiao Qing; Bailey, Julia N

    2016-02-03

    High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure.

  4. Variants for HDL-C, LDL-C, and triglycerides identified from admixture mapping and fine-mapping analysis in African American families.

    PubMed

    Shetty, Priya B; Tang, Hua; Feng, Tao; Tayo, Bamidele; Morrison, Alanna C; Kardia, Sharon L R; Hanis, Craig L; Arnett, Donna K; Hunt, Steven C; Boerwinkle, Eric; Rao, Dabeeru C; Cooper, Richard S; Risch, Neil; Zhu, Xiaofeng

    2015-02-01

    Admixture mapping of lipids was followed-up by family-based association analysis to identify variants for cardiovascular disease in African Americans. The present study conducted admixture mapping analysis for total cholesterol, high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and triglycerides. The analysis was performed in 1905 unrelated African American subjects from the National Heart, Lung and Blood Institute's Family Blood Pressure Program (FBPP). Regions showing admixture evidence were followed-up with family-based association analysis in 3556 African American subjects from the FBPP. The admixture mapping and family-based association analyses were adjusted for age, age(2), sex, body mass index, and genome-wide mean ancestry to minimize the confounding caused by population stratification. Regions that were suggestive of local ancestry association evidence were found on chromosomes 7 (low-density lipoprotein cholesterol), 8 (high-density lipoprotein cholesterol), 14 (triglycerides), and 19 (total cholesterol and triglycerides). In the fine-mapping analysis, 52 939 single-nucleotide polymorphisms (SNPs) were tested and 11 SNPs (8 independent SNPs) showed nominal significant association with high-density lipoprotein cholesterol (2 SNPs), low-density lipoprotein cholesterol (4 SNPs), and triglycerides (5 SNPs). The family data were used in the fine-mapping to identify SNPs that showed novel associations with lipids and regions, including genes with known associations for cardiovascular disease. This study identified regions on chromosomes 7, 8, 14, and 19 and 11 SNPs from the fine-mapping analysis that were associated with high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and triglycerides for further studies of cardiovascular disease in African Americans. © 2014 American Heart Association, Inc.

  5. MultiGeMS: detection of SNVs from multiple samples using model selection on high-throughput sequencing data.

    PubMed

    Murillo, Gabriel H; You, Na; Su, Xiaoquan; Cui, Wei; Reilly, Muredach P; Li, Mingyao; Ning, Kang; Cui, Xinping

    2016-05-15

    Single nucleotide variant (SNV) detection procedures are being utilized as never before to analyze the recent abundance of high-throughput DNA sequencing data, both on single and multiple sample datasets. Building on previously published work with the single sample SNV caller genotype model selection (GeMS), a multiple sample version of GeMS (MultiGeMS) is introduced. Unlike other popular multiple sample SNV callers, the MultiGeMS statistical model accounts for enzymatic substitution sequencing errors. It also addresses the multiple testing problem endemic to multiple sample SNV calling and utilizes high performance computing (HPC) techniques. A simulation study demonstrates that MultiGeMS ranks highest in precision among a selection of popular multiple sample SNV callers, while showing exceptional recall in calling common SNVs. Further, both simulation studies and real data analyses indicate that MultiGeMS is robust to low-quality data. We also demonstrate that accounting for enzymatic substitution sequencing errors not only improves SNV call precision at low mapping quality regions, but also improves recall at reference allele-dominated sites with high mapping quality. The MultiGeMS package can be downloaded from https://github.com/cui-lab/multigems xinping.cui@ucr.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. MAP3K1-related gonadal dysgenesis: Six new cases and review of the literature.

    PubMed

    Granados, Andrea; Alaniz, Veronica I; Mohnach, Lauren; Barseghyan, Hayk; Vilain, Eric; Ostrer, Harry; Quint, Elisabeth H; Chen, Ming; Keegan, Catherine E

    2017-06-01

    Investigation of disorders of sex development (DSD) has resulted in the discovery of multiple sex-determining genes. MAP3K1 encodes a signal transduction regulator in the sex determination pathway and is emerging as one of the more common genes responsible for 46,XY DSD presenting as complete or partial gonadal dysgenesis. Clinical assessment, endocrine evaluation, and genetic analysis were performed in six individuals from four unrelated families with 46,XY DSD. All six individuals were found to have likely pathogenic MAP3K1 variants. Three of these individuals presented with complete gonadal dysgenesis, characterized by bilateral streak gonads with typical internal and external female genitalia, while the other three presented with partial gonadal dysgenesis, characterized by incomplete testicular development, resulting in clitoral hypertrophy with otherwise typical female external genitalia. Testing for MAP3K1 variants should be considered in patients with 46,XY complete or partial gonadal dysgenesis, particularly in families with multiple members affected with 46,XY DSD. Identification of a MAP3K1 variant should prompt an evaluation for DSD in female siblings of the proband. © 2017 Wiley Periodicals, Inc.

  7. Multi-gene panel testing in Korean patients with common genetic generalized epilepsy syndromes.

    PubMed

    Lee, Cha Gon; Lee, Jeehun; Lee, Munhyang

    2018-01-01

    Genetic heterogeneity of common genetic generalized epilepsy syndromes is frequently considered. The present study conducted a focused analysis of potential candidate or susceptibility genes for common genetic generalized epilepsy syndromes using multi-gene panel testing with next-generation sequencing. This study included patients with juvenile myoclonic epilepsy, juvenile absence epilepsy, and epilepsy with generalized tonic-clonic seizures alone. We identified pathogenic variants according to the American College of Medical Genetics and Genomics guidelines and identified susceptibility variants using case-control association analyses and family analyses for familial cases. A total of 57 patients were enrolled, including 51 sporadic cases and 6 familial cases. Twenty-two pathogenic and likely pathogenic variants of 16 different genes were identified. CACNA1H was the most frequently observed single gene. Variants of voltage-gated Ca2+ channel genes, including CACNA1A, CACNA1G, and CACNA1H were observed in 32% of variants (n = 7/22). Analyses to identify susceptibility variants using case-control association analysis indicated that KCNMA1 c.400G>C was associated with common genetic generalized epilepsy syndromes. Only 1 family (family A) exhibited a candidate pathogenic variant p.(Arg788His) on CACNA1H, as determined via family analyses. This study identified candidate genetic variants in about a quarter of patients (n = 16/57) and an average of 2.8 variants was identified in each patient. The results reinforced the polygenic disorder with very high locus and allelic heterogeneity of common GGE syndromes. Further, voltage-gated Ca2+ channels are suggested as important contributors to common genetic generalized epilepsy syndromes. This study extends our comprehensive understanding of common genetic generalized epilepsy syndromes.

  8. Integrated analysis of germline and somatic variants in ovarian cancer.

    PubMed

    Kanchi, Krishna L; Johnson, Kimberly J; Lu, Charles; McLellan, Michael D; Leiserson, Mark D M; Wendl, Michael C; Zhang, Qunyuan; Koboldt, Daniel C; Xie, Mingchao; Kandoth, Cyriac; McMichael, Joshua F; Wyczalkowski, Matthew A; Larson, David E; Schmidt, Heather K; Miller, Christopher A; Fulton, Robert S; Spellman, Paul T; Mardis, Elaine R; Druley, Todd E; Graubert, Timothy A; Goodfellow, Paul J; Raphael, Benjamin J; Wilson, Richard K; Ding, Li

    2014-01-01

    We report the first large-scale exome-wide analysis of the combined germline-somatic landscape in ovarian cancer. Here we analyse germline and somatic alterations in 429 ovarian carcinoma cases and 557 controls. We identify 3,635 high confidence, rare truncation and 22,953 missense variants with predicted functional impact. We find germline truncation variants and large deletions across Fanconi pathway genes in 20% of cases. Enrichment of rare truncations is shown in BRCA1, BRCA2 and PALB2. In addition, we observe germline truncation variants in genes not previously associated with ovarian cancer susceptibility (NF1, MAP3K4, CDKN2B and MLL3). Evidence for loss of heterozygosity was found in 100 and 76% of cases with germline BRCA1 and BRCA2 truncations, respectively. Germline-somatic interaction analysis combined with extensive bioinformatics annotation identifies 222 candidate functional germline truncation and missense variants, including two pathogenic BRCA1 and 1 TP53 deleterious variants. Finally, integrated analyses of germline and somatic variants identify significantly altered pathways, including the Fanconi, MAPK and MLL pathways.

  9. Simulating and mapping spatial complexity using multi-scale techniques

    USGS Publications Warehouse

    De Cola, L.

    1994-01-01

    A central problem in spatial analysis is the mapping of data for complex spatial fields using relatively simple data structures, such as those of a conventional GIS. This complexity can be measured using such indices as multi-scale variance, which reflects spatial autocorrelation, and multi-fractal dimension, which characterizes the values of fields. These indices are computed for three spatial processes: Gaussian noise, a simple mathematical function, and data for a random walk. Fractal analysis is then used to produce a vegetation map of the central region of California based on a satellite image. This analysis suggests that real world data lie on a continuum between the simple and the random, and that a major GIS challenge is the scientific representation and understanding of rapidly changing multi-scale fields. -Author

  10. A general framework of TOPSIS method for integration of airborne geophysics, satellite imagery, geochemical and geological data

    NASA Astrophysics Data System (ADS)

    Abedi, Maysam; Norouzi, Gholam-Hossain

    2016-04-01

    This work presents the promising application of three variants of TOPSIS method (namely the conventional, adjusted and modified versions) as a straightforward knowledge-driven technique in multi criteria decision making processes for data fusion of a broad exploratory geo-dataset in mineral potential/prospectivity mapping. The method is implemented to airborne geophysical data (e.g. potassium radiometry, aeromagnetic and frequency domain electromagnetic data), surface geological layers (fault and host rock zones), extracted alteration layers from remote sensing satellite imagery data, and five evidential attributes from stream sediment geochemical data. The central Iranian volcanic-sedimentary belt in Kerman province at the SE of Iran that is embedded in the Urumieh-Dokhtar Magmatic Assemblage arc (UDMA) is chosen to integrate broad evidential layers in the region of prospect. The studied area has high potential of ore mineral occurrences especially porphyry copper/molybdenum and the generated mineral potential maps aim to outline new prospect zones for further investigation in future. Two evidential layers of the downward continued aeromagnetic data and its analytic signal filter are prepared to be incorporated in fusion process as geophysical plausible footprints of the porphyry type mineralization. The low values of the apparent resistivity layer calculated from the airborne frequency domain electromagnetic data are also used as an electrical criterion in this investigation. Four remote sensing evidential layers of argillic, phyllic, propylitic and hydroxyl alterations were extracted from ASTER images in order to map the altered areas associated with porphyry type deposits, whilst the ETM+ satellite imagery data were used as well to map iron oxide layer. Since potassium alteration is generally the mainstay of porphyry ore mineralization, the airborne potassium radiometry data was used. The geochemical layers of Cu/B/Pb/Zn elements and the first component of PCA analysis were considered as powerful traces to prepare final maps. The conventional, adjusted and modified variants of the TOPSIS method produced three mineral potential maps, in which the outputs indicate adequately matching of high potential zones with previous working and active mines in the region.

  11. Comprehensive multi-stage linkage analyses identify a locus for adult height on chromosome 3p in a healthy Caucasian population.

    PubMed

    Ellis, Justine A; Scurrah, Katrina J; Duncan, Anna E; Lamantia, Angela; Byrnes, Graham B; Harrap, Stephen B

    2007-04-01

    There have been a number of genome-wide linkage studies for adult height in recent years. These studies have yielded few well-replicated loci, and none have been further confirmed by the identification of associated gene variants. The inconsistent results may be attributable to the fact that few studies have combined accurate phenotype measures with informative statistical modelling in healthy populations. We have performed a multi-stage genome-wide linkage analysis for height in 275 adult sibling pairs drawn randomly from the Victorian Family Heart Study (VFHS), a healthy population-based Caucasian cohort. Height was carefully measured in a standardised fashion on regularly calibrated equipment. Following genome-wide identification of a peak Z-score of 3.14 on chromosome 3 at 69 cM, we performed a fine-mapping analysis of this region in an extended sample of 392 two-generation families. We used a number of variance components models that incorporated assortative mating and shared environment effects, and we observed a peak LOD score of approximately 3.5 at 78 cM in four of the five models tested. We also demonstrated that the most prevalent model in the literature gave the worst fit, and the lowest LOD score (2.9) demonstrating the importance of appropriate modelling. The region identified in this study replicates the results of other genome-wide scans of height and bone-related phenotypes, strongly suggesting the presence of a gene important in bone growth on chromosome 3p. Association analyses of relevant candidate genes should identify the genetic variants responsible for the chromosome 3p linkage signal in our population.

  12. Contribution of Common PCSK1 Genetic Variants to Obesity in 8,359 Subjects from Multi-Ethnic American Population

    PubMed Central

    Choquet, Hélène; Kasberger, Jay; Hamidovic, Ajna; Jorgenson, Eric

    2013-01-01

    Common PCSK1 variants (notably rs6232 and rs6235) have been shown to be associated with obesity in European, Asian and Mexican populations. To determine whether common PCSK1 variants contribute to obesity in American population, we conducted association analyses in 8,359 subjects using two multi-ethnic American studies: the Coronary Artery Risk Development in Young Adults (CARDIA) study and the Multi-Ethnic Study of Atherosclerosis (MESA). By evaluating the contribution of rs6232 and rs6235 in each ethnic group, we found that in European-American subjects from CARDIA, only rs6232 was associated with BMI (P = 0.006) and obesity (P = 0.018) but also increased the obesity incidence during the 20 years of follow-up (HR = 1.53 [1.07–2.19], P = 0.019). Alternatively, in African-American subjects from CARDIA, rs6235 was associated with BMI (P = 0.028) and obesity (P = 0.018). Further, by combining the two case-control ethnic groups from the CARDIA study in a meta-analysis, association between rs6235 and obesity risk remained significant (OR = 1.23 [1.05–1.45], P = 9.5×10−3). However, neither rs6232 nor rs6235 was associated with BMI or obesity in the MESA study. Interestingly, rs6232 was associated with BMI (P = 4.2×10−3) and obesity (P = 3.4×10−3) in the younger European-American group combining samples from the both studies [less than median age (53 years)], but not among the older age group (P = 0.756 and P = 0.935 for BMI and obesity, respectively). By combining all the case-control ethnic groups from CARDIA and MESA in a meta-analysis, we found no significant association for the both variants and obesity risk. Finally, by exploring the full PCSK1 locus, we observed that no variant remained significant after correction for multiple testing. These results indicate that common PCSK1 variants (notably rs6232 and rs6235) contribute modestly to obesity in multi-ethnic American population. Further, these results suggest that the association of rs6232 with obesity may be age-dependent in European-Americans. However, multiple replication studies in multi-ethnic American population are needed to confirm our findings. PMID:23451278

  13. Contribution of common PCSK1 genetic variants to obesity in 8,359 subjects from multi-ethnic American population.

    PubMed

    Choquet, Hélène; Kasberger, Jay; Hamidovic, Ajna; Jorgenson, Eric

    2013-01-01

    Common PCSK1 variants (notably rs6232 and rs6235) have been shown to be associated with obesity in European, Asian and Mexican populations. To determine whether common PCSK1 variants contribute to obesity in American population, we conducted association analyses in 8,359 subjects using two multi-ethnic American studies: the Coronary Artery Risk Development in Young Adults (CARDIA) study and the Multi-Ethnic Study of Atherosclerosis (MESA). By evaluating the contribution of rs6232 and rs6235 in each ethnic group, we found that in European-American subjects from CARDIA, only rs6232 was associated with BMI (P = 0.006) and obesity (P = 0.018) but also increased the obesity incidence during the 20 years of follow-up (HR = 1.53 [1.07-2.19], P = 0.019). Alternatively, in African-American subjects from CARDIA, rs6235 was associated with BMI (P = 0.028) and obesity (P = 0.018). Further, by combining the two case-control ethnic groups from the CARDIA study in a meta-analysis, association between rs6235 and obesity risk remained significant (OR = 1.23 [1.05-1.45], P = 9.5×10(-3)). However, neither rs6232 nor rs6235 was associated with BMI or obesity in the MESA study. Interestingly, rs6232 was associated with BMI (P = 4.2×10(-3)) and obesity (P = 3.4×10(-3)) in the younger European-American group combining samples from the both studies [less than median age (53 years)], but not among the older age group (P = 0.756 and P = 0.935 for BMI and obesity, respectively). By combining all the case-control ethnic groups from CARDIA and MESA in a meta-analysis, we found no significant association for the both variants and obesity risk. Finally, by exploring the full PCSK1 locus, we observed that no variant remained significant after correction for multiple testing. These results indicate that common PCSK1 variants (notably rs6232 and rs6235) contribute modestly to obesity in multi-ethnic American population. Further, these results suggest that the association of rs6232 with obesity may be age-dependent in European-Americans. However, multiple replication studies in multi-ethnic American population are needed to confirm our findings.

  14. Functional characterization of a multi-cancer risk locus on chr5p15.33 reveals regulation of TERT by ZNF148.

    PubMed

    Fang, Jun; Jia, Jinping; Makowski, Matthew; Xu, Mai; Wang, Zhaoming; Zhang, Tongwu; Hoskins, Jason W; Choi, Jiyeon; Han, Younghun; Zhang, Mingfeng; Thomas, Janelle; Kovacs, Michael; Collins, Irene; Dzyadyk, Marta; Thompson, Abbey; O'Neill, Maura; Das, Sudipto; Lan, Qi; Koster, Roelof; Stolzenberg-Solomon, Rachael S; Kraft, Peter; Wolpin, Brian M; Jansen, Pascal W T C; Olson, Sara; McGlynn, Katherine A; Kanetsky, Peter A; Chatterjee, Nilanjan; Barrett, Jennifer H; Dunning, Alison M; Taylor, John C; Newton-Bishop, Julia A; Bishop, D Timothy; Andresson, Thorkell; Petersen, Gloria M; Amos, Christopher I; Iles, Mark M; Nathanson, Katherine L; Landi, Maria Teresa; Vermeulen, Michiel; Brown, Kevin M; Amundadottir, Laufey T

    2017-05-02

    Genome wide association studies (GWAS) have mapped multiple independent cancer susceptibility loci to chr5p15.33. Here, we show that fine-mapping of pancreatic and testicular cancer GWAS within one of these loci (Region 2 in CLPTM1L) focuses the signal to nine highly correlated SNPs. Of these, rs36115365-C associated with increased pancreatic and testicular but decreased lung cancer and melanoma risk, and exhibited preferred protein-binding and enhanced regulatory activity. Transcriptional gene silencing of this regulatory element repressed TERT expression in an allele-specific manner. Proteomic analysis identifies allele-preferred binding of Zinc finger protein 148 (ZNF148) to rs36115365-C, further supported by binding of purified recombinant ZNF148. Knockdown of ZNF148 results in reduced TERT expression, telomerase activity and telomere length. Our results indicate that the association with chr5p15.33-Region 2 may be explained by rs36115365, a variant influencing TERT expression via ZNF148 in a manner consistent with elevated TERT in carriers of the C allele.

  15. Discovery and Fine-Mapping of Glycaemic and Obesity-Related Trait Loci Using High-Density Imputation.

    PubMed

    Horikoshi, Momoko; Mӓgi, Reedik; van de Bunt, Martijn; Surakka, Ida; Sarin, Antti-Pekka; Mahajan, Anubha; Marullo, Letizia; Thorleifsson, Gudmar; Hӓgg, Sara; Hottenga, Jouke-Jan; Ladenvall, Claes; Ried, Janina S; Winkler, Thomas W; Willems, Sara M; Pervjakova, Natalia; Esko, Tõnu; Beekman, Marian; Nelson, Christopher P; Willenborg, Christina; Wiltshire, Steven; Ferreira, Teresa; Fernandez, Juan; Gaulton, Kyle J; Steinthorsdottir, Valgerdur; Hamsten, Anders; Magnusson, Patrik K E; Willemsen, Gonneke; Milaneschi, Yuri; Robertson, Neil R; Groves, Christopher J; Bennett, Amanda J; Lehtimӓki, Terho; Viikari, Jorma S; Rung, Johan; Lyssenko, Valeriya; Perola, Markus; Heid, Iris M; Herder, Christian; Grallert, Harald; Müller-Nurasyid, Martina; Roden, Michael; Hypponen, Elina; Isaacs, Aaron; van Leeuwen, Elisabeth M; Karssen, Lennart C; Mihailov, Evelin; Houwing-Duistermaat, Jeanine J; de Craen, Anton J M; Deelen, Joris; Havulinna, Aki S; Blades, Matthew; Hengstenberg, Christian; Erdmann, Jeanette; Schunkert, Heribert; Kaprio, Jaakko; Tobin, Martin D; Samani, Nilesh J; Lind, Lars; Salomaa, Veikko; Lindgren, Cecilia M; Slagboom, P Eline; Metspalu, Andres; van Duijn, Cornelia M; Eriksson, Johan G; Peters, Annette; Gieger, Christian; Jula, Antti; Groop, Leif; Raitakari, Olli T; Power, Chris; Penninx, Brenda W J H; de Geus, Eco; Smit, Johannes H; Boomsma, Dorret I; Pedersen, Nancy L; Ingelsson, Erik; Thorsteinsdottir, Unnur; Stefansson, Kari; Ripatti, Samuli; Prokopenko, Inga; McCarthy, Mark I; Morris, Andrew P

    2015-07-01

    Reference panels from the 1000 Genomes (1000G) Project Consortium provide near complete coverage of common and low-frequency genetic variation with minor allele frequency ≥0.5% across European ancestry populations. Within the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium, we have undertaken the first large-scale meta-analysis of genome-wide association studies (GWAS), supplemented by 1000G imputation, for four quantitative glycaemic and obesity-related traits, in up to 87,048 individuals of European ancestry. We identified two loci for body mass index (BMI) at genome-wide significance, and two for fasting glucose (FG), none of which has been previously reported in larger meta-analysis efforts to combine GWAS of European ancestry. Through conditional analysis, we also detected multiple distinct signals of association mapping to established loci for waist-hip ratio adjusted for BMI (RSPO3) and FG (GCK and G6PC2). The index variant for one association signal at the G6PC2 locus is a low-frequency coding allele, H177Y, which has recently been demonstrated to have a functional role in glucose regulation. Fine-mapping analyses revealed that the non-coding variants most likely to drive association signals at established and novel loci were enriched for overlap with enhancer elements, which for FG mapped to promoter and transcription factor binding sites in pancreatic islets, in particular. Our study demonstrates that 1000G imputation and genetic fine-mapping of common and low-frequency variant association signals at GWAS loci, integrated with genomic annotation in relevant tissues, can provide insight into the functional and regulatory mechanisms through which their effects on glycaemic and obesity-related traits are mediated.

  16. Discovery and Fine-Mapping of Glycaemic and Obesity-Related Trait Loci Using High-Density Imputation

    PubMed Central

    van de Bunt, Martijn; Surakka, Ida; Sarin, Antti-Pekka; Mahajan, Anubha; Marullo, Letizia; Thorleifsson, Gudmar; Hӓgg, Sara; Hottenga, Jouke-Jan; Ladenvall, Claes; Ried, Janina S.; Winkler, Thomas W.; Willems, Sara M.; Pervjakova, Natalia; Esko, Tõnu; Beekman, Marian; Nelson, Christopher P.; Willenborg, Christina; Ferreira, Teresa; Fernandez, Juan; Gaulton, Kyle J.; Steinthorsdottir, Valgerdur; Hamsten, Anders; Magnusson, Patrik K. E.; Willemsen, Gonneke; Milaneschi, Yuri; Robertson, Neil R.; Groves, Christopher J.; Bennett, Amanda J.; Lehtimӓki, Terho; Viikari, Jorma S.; Rung, Johan; Lyssenko, Valeriya; Perola, Markus; Heid, Iris M.; Herder, Christian; Grallert, Harald; Müller-Nurasyid, Martina; Roden, Michael; Hypponen, Elina; Isaacs, Aaron; van Leeuwen, Elisabeth M.; Karssen, Lennart C.; Mihailov, Evelin; Houwing-Duistermaat, Jeanine J.; de Craen, Anton J. M.; Deelen, Joris; Havulinna, Aki S.; Blades, Matthew; Hengstenberg, Christian; Erdmann, Jeanette; Schunkert, Heribert; Kaprio, Jaakko; Tobin, Martin D.; Samani, Nilesh J.; Lind, Lars; Salomaa, Veikko; Lindgren, Cecilia M.; Slagboom, P. Eline; Metspalu, Andres; van Duijn, Cornelia M.; Eriksson, Johan G.; Peters, Annette; Gieger, Christian; Jula, Antti; Groop, Leif; Raitakari, Olli T.; Power, Chris; Penninx, Brenda W. J. H.; de Geus, Eco; Smit, Johannes H.; Boomsma, Dorret I.; Pedersen, Nancy L.; Ingelsson, Erik; Thorsteinsdottir, Unnur; Stefansson, Kari; Ripatti, Samuli; Prokopenko, Inga; McCarthy, Mark I.; Morris, Andrew P.

    2015-01-01

    Reference panels from the 1000 Genomes (1000G) Project Consortium provide near complete coverage of common and low-frequency genetic variation with minor allele frequency ≥0.5% across European ancestry populations. Within the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium, we have undertaken the first large-scale meta-analysis of genome-wide association studies (GWAS), supplemented by 1000G imputation, for four quantitative glycaemic and obesity-related traits, in up to 87,048 individuals of European ancestry. We identified two loci for body mass index (BMI) at genome-wide significance, and two for fasting glucose (FG), none of which has been previously reported in larger meta-analysis efforts to combine GWAS of European ancestry. Through conditional analysis, we also detected multiple distinct signals of association mapping to established loci for waist-hip ratio adjusted for BMI (RSPO3) and FG (GCK and G6PC2). The index variant for one association signal at the G6PC2 locus is a low-frequency coding allele, H177Y, which has recently been demonstrated to have a functional role in glucose regulation. Fine-mapping analyses revealed that the non-coding variants most likely to drive association signals at established and novel loci were enriched for overlap with enhancer elements, which for FG mapped to promoter and transcription factor binding sites in pancreatic islets, in particular. Our study demonstrates that 1000G imputation and genetic fine-mapping of common and low-frequency variant association signals at GWAS loci, integrated with genomic annotation in relevant tissues, can provide insight into the functional and regulatory mechanisms through which their effects on glycaemic and obesity-related traits are mediated. PMID:26132169

  17. Joint explorative analysis of neuroreceptor subsystems in the human brain: application to receptor-transporter correlation using PET data.

    PubMed

    Cselényi, Zsolt; Lundberg, Johan; Halldin, Christer; Farde, Lars; Gulyás, Balázs

    2004-10-01

    Positron emission tomography (PET) has proved to be a highly successful technique in the qualitative and quantitative exploration of the human brain's neurotransmitter-receptor systems. In recent years, the number of PET radioligands, targeted to different neuroreceptor systems of the human brain, has increased considerably. This development paves the way for a simultaneous analysis of different receptor systems and subsystems in the same individual. The detailed exploration of the versatility of neuroreceptor systems requires novel technical approaches, capable of operating on huge parametric image datasets. An initial step of such explorative data processing and analysis should be the development of novel exploratory data-mining tools to gain insight into the "structure" of complex multi-individual, multi-receptor data sets. For practical reasons, a possible and feasible starting point of multi-receptor research can be the analysis of the pre- and post-synaptic binding sites of the same neurotransmitter. In the present study, we propose an unsupervised, unbiased data-mining tool for this task and demonstrate its usefulness by using quantitative receptor maps, obtained with positron emission tomography, from five healthy subjects on (pre-synaptic) serotonin transporters (5-HTT or SERT) and (post-synaptic) 5-HT(1A) receptors. Major components of the proposed technique include the projection of the input receptor maps to a feature space, the quasi-clustering and classification of projected data (neighbourhood formation), trans-individual analysis of neighbourhood properties (trajectory analysis), and the back-projection of the results of trajectory analysis to normal space (creation of multi-receptor maps). The resulting multi-receptor maps suggest that complex relationships and tendencies in the relationship between pre- and post-synaptic transporter-receptor systems can be revealed and classified by using this method. As an example, we demonstrate the regional correlation of the serotonin transporter-receptor systems. These parameter-specific multi-receptor maps can usefully guide the researchers in their endeavour to formulate models of multi-receptor interactions and changes in the human brain.

  18. An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer.

    PubMed

    Ruggles, Kelly V; Tang, Zuojian; Wang, Xuya; Grover, Himanshu; Askenazi, Manor; Teubl, Jennifer; Cao, Song; McLellan, Michael D; Clauser, Karl R; Tabb, David L; Mertins, Philipp; Slebos, Robbert; Erdmann-Gilmore, Petra; Li, Shunqiang; Gunawardena, Harsha P; Xie, Ling; Liu, Tao; Zhou, Jian-Ying; Sun, Shisheng; Hoadley, Katherine A; Perou, Charles M; Chen, Xian; Davies, Sherri R; Maher, Christopher A; Kinsinger, Christopher R; Rodland, Karen D; Zhang, Hui; Zhang, Zhen; Ding, Li; Townsend, R Reid; Rodriguez, Henry; Chan, Daniel; Smith, Richard D; Liebler, Daniel C; Carr, Steven A; Payne, Samuel; Ellis, Matthew J; Fenyő, David

    2016-03-01

    Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  19. Integral-geometry characterization of photobiomodulation effects on retinal vessel morphology

    PubMed Central

    Barbosa, Marconi; Natoli, Riccardo; Valter, Kriztina; Provis, Jan; Maddess, Ted

    2014-01-01

    The morphological characterization of quasi-planar structures represented by gray-scale images is challenging when object identification is sub-optimal due to registration artifacts. We propose two alternative procedures that enhances object identification in the integral-geometry morphological image analysis (MIA) framework. The first variant streamlines the framework by introducing an active contours segmentation process whose time step is recycled as a multi-scale parameter. In the second variant, we used the refined object identification produced in the first variant to perform the standard MIA with exact dilation radius as multi-scale parameter. Using this enhanced MIA we quantify the extent of vaso-obliteration in oxygen-induced retinopathic vascular growth, the preventative effect (by photobiomodulation) of exposure during tissue development to near-infrared light (NIR, 670 nm), and the lack of adverse effects due to exposure to NIR light. PMID:25071966

  20. Genomic Rearrangements in Arabidopsis Considered as Quantitative Traits.

    PubMed

    Imprialou, Martha; Kahles, André; Steffen, Joshua G; Osborne, Edward J; Gan, Xiangchao; Lempe, Janne; Bhomra, Amarjit; Belfield, Eric; Visscher, Anne; Greenhalgh, Robert; Harberd, Nicholas P; Goram, Richard; Hein, Jotun; Robert-Seilaniantz, Alexandre; Jones, Jonathan; Stegle, Oliver; Kover, Paula; Tsiantis, Miltos; Nordborg, Magnus; Rätsch, Gunnar; Clark, Richard M; Mott, Richard

    2017-04-01

    To understand the population genetics of structural variants and their effects on phenotypes, we developed an approach to mapping structural variants that segregate in a population sequenced at low coverage. We avoid calling structural variants directly. Instead, the evidence for a potential structural variant at a locus is indicated by variation in the counts of short-reads that map anomalously to that locus. These structural variant traits are treated as quantitative traits and mapped genetically, analogously to a gene expression study. Association between a structural variant trait at one locus, and genotypes at a distant locus indicate the origin and target of a transposition. Using ultra-low-coverage (0.3×) population sequence data from 488 recombinant inbred Arabidopsis thaliana genomes, we identified 6502 segregating structural variants. Remarkably, 25% of these were transpositions. While many structural variants cannot be delineated precisely, we validated 83% of 44 predicted transposition breakpoints by polymerase chain reaction. We show that specific structural variants may be causative for quantitative trait loci for germination and resistance to infection by the fungus Albugo laibachii , isolate Nc14. Further we show that the phenotypic heritability attributable to read-mapping anomalies differs from, and, in the case of time to germination and bolting, exceeds that due to standard genetic variation. Genes within structural variants are also more likely to be silenced or dysregulated. This approach complements the prevalent strategy of structural variant discovery in fewer individuals sequenced at high coverage. It is generally applicable to large populations sequenced at low-coverage, and is particularly suited to mapping transpositions. Copyright © 2017 by the Genetics Society of America.

  1. Genetic variation in the TNF/TRAF2/ASK1/p38 kinase signaling pathway as markers for postoperative pulmonary complications in lung cancer patients.

    PubMed

    Hildebrandt, Michelle A T; Roth, Jack A; Vaporciyan, Ara A; Pu, Xia; Ye, Yuanqing; Correa, Arlene M; Kim, Jae Y; Swisher, Stephen G; Wu, Xifeng

    2015-07-13

    Post-operative pulmonary complications are the most common morbidity associated with lung resection in non-small cell lung cancer (NSCLC) patients. The TNF/TRAF2/ASK1/p38 kinase pathway is activated by stress stimuli and inflammatory signals. We hypothesized that genetic polymorphisms within this pathway may contribute to risk of complications. In this case-only study, we genotyped 173 germline genetic variants in a discovery population of 264 NSCLC patients who underwent a lobectomy followed by genotyping of the top variants in a replication population of 264 patients. Complications data was obtained from a prospective database at MD Anderson. MAP2K4:rs12452497 was significantly associated with a decreased risk in both phases, resulting in a 40% reduction in the pooled population (95% CI:0.43-0.83, P = 0.0018). In total, seven variants were significant for risk in the pooled analysis. Gene-based analysis supported the involvement of TRAF2, MAP2K4, and MAP3K5 as mediating complications risk and a highly significant trend was identified between the number of risk genotypes and complications risk (P = 1.63 × 10(-8)). An inverse relationship was observed between association with clinical outcomes and complications for two variants. These results implicate the TNF/TRAF2/ASK1/p38 kinase pathway in modulating risk of pulmonary complications following lobectomy and may be useful biomarkers to identify patients at high risk.

  2. Mapping cis- and trans-regulatory effects across multiple tissues in twins

    PubMed Central

    Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.

    2013-01-01

    Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192

  3. Identification of lung cancer histology-specific variants applying Bayesian framework variant prioritization approaches within the TRICL and ILCCO consortia

    PubMed Central

    Brenner, Darren R.; Amos, Christopher I.; Brhane, Yonathan; Timofeeva, Maria N.; Caporaso, Neil; Wang, Yufei; Christiani, David C.; Bickeböller, Heike; Yang, Ping; Albanes, Demetrius; Stevens, Victoria L.; Gapstur, Susan; McKay, James; Boffetta, Paolo; Zaridze, David; Szeszenia-Dabrowska, Neonilia; Lissowska, Jolanta; Rudnai, Peter; Fabianova, Eleonora; Mates, Dana; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Krokan, Hans E.; Skorpen, Frank; Gabrielsen, Maiken E.; Vatten, Lars; Njølstad, Inger; Chen, Chu; Goodman, Gary; Lathrop, Mark; Vooder, Tõnu; Välk, Kristjan; Nelis, Mari; Metspalu, Andres; Broderick, Peter; Eisen, Timothy; Wu, Xifeng; Zhang, Di; Chen, Wei; Spitz, Margaret R.; Wei, Yongyue; Su, Li; Xie, Dong; She, Jun; Matsuo, Keitaro; Matsuda, Fumihiko; Ito, Hidemi; Risch, Angela; Heinrich, Joachim; Rosenberger, Albert; Muley, Thomas; Dienemann, Hendrik; Field, John K.; Raji, Olaide; Chen, Ying; Gosney, John; Liloglou, Triantafillos; Davies, Michael P.A.; Marcus, Michael; McLaughlin, John; Orlow, Irene; Han, Younghun; Li, Yafang; Zong, Xuchen; Johansson, Mattias; Liu, Geoffrey; Tworoger, Shelley S.; Le Marchand, Loic; Henderson, Brian E.; Wilkens, Lynne R.; Dai, Juncheng; Shen, Hongbing; Houlston, Richard S.; Landi, Maria T.; Brennan, Paul; Hung, Rayjean J.

    2015-01-01

    Large-scale genome-wide association studies (GWAS) have likely uncovered all common variants at the GWAS significance level. Additional variants within the suggestive range (0.0001> P > 5×10−8) are, however, still of interest for identifying causal associations. This analysis aimed to apply novel variant prioritization approaches to identify additional lung cancer variants that may not reach the GWAS level. Effects were combined across studies with a total of 33456 controls and 6756 adenocarcinoma (AC; 13 studies), 5061 squamous cell carcinoma (SCC; 12 studies) and 2216 small cell lung cancer cases (9 studies). Based on prior information such as variant physical properties and functional significance, we applied stratified false discovery rates, hierarchical modeling and Bayesian false discovery probabilities for variant prioritization. We conducted a fine mapping analysis as validation of our methods by examining top-ranking novel variants in six independent populations with a total of 3128 cases and 2966 controls. Three novel loci in the suggestive range were identified based on our Bayesian framework analyses: KCNIP4 at 4p15.2 (rs6448050, P = 4.6×10−7) and MTMR2 at 11q21 (rs10501831, P = 3.1×10−6) with SCC, as well as GAREM at 18q12.1 (rs11662168, P = 3.4×10−7) with AC. Use of our prioritization methods validated two of the top three loci associated with SCC (P = 1.05×10−4 for KCNIP4, represented by rs9799795) and AC (P = 2.16×10−4 for GAREM, represented by rs3786309) in the independent fine mapping populations. This study highlights the utility of using prior functional data for sequence variants in prioritization analyses to search for robust signals in the suggestive range. PMID:26363033

  4. Identification of lung cancer histology-specific variants applying Bayesian framework variant prioritization approaches within the TRICL and ILCCO consortia.

    PubMed

    Brenner, Darren R; Amos, Christopher I; Brhane, Yonathan; Timofeeva, Maria N; Caporaso, Neil; Wang, Yufei; Christiani, David C; Bickeböller, Heike; Yang, Ping; Albanes, Demetrius; Stevens, Victoria L; Gapstur, Susan; McKay, James; Boffetta, Paolo; Zaridze, David; Szeszenia-Dabrowska, Neonilia; Lissowska, Jolanta; Rudnai, Peter; Fabianova, Eleonora; Mates, Dana; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Krokan, Hans E; Skorpen, Frank; Gabrielsen, Maiken E; Vatten, Lars; Njølstad, Inger; Chen, Chu; Goodman, Gary; Lathrop, Mark; Vooder, Tõnu; Välk, Kristjan; Nelis, Mari; Metspalu, Andres; Broderick, Peter; Eisen, Timothy; Wu, Xifeng; Zhang, Di; Chen, Wei; Spitz, Margaret R; Wei, Yongyue; Su, Li; Xie, Dong; She, Jun; Matsuo, Keitaro; Matsuda, Fumihiko; Ito, Hidemi; Risch, Angela; Heinrich, Joachim; Rosenberger, Albert; Muley, Thomas; Dienemann, Hendrik; Field, John K; Raji, Olaide; Chen, Ying; Gosney, John; Liloglou, Triantafillos; Davies, Michael P A; Marcus, Michael; McLaughlin, John; Orlow, Irene; Han, Younghun; Li, Yafang; Zong, Xuchen; Johansson, Mattias; Liu, Geoffrey; Tworoger, Shelley S; Le Marchand, Loic; Henderson, Brian E; Wilkens, Lynne R; Dai, Juncheng; Shen, Hongbing; Houlston, Richard S; Landi, Maria T; Brennan, Paul; Hung, Rayjean J

    2015-11-01

    Large-scale genome-wide association studies (GWAS) have likely uncovered all common variants at the GWAS significance level. Additional variants within the suggestive range (0.0001> P > 5×10(-8)) are, however, still of interest for identifying causal associations. This analysis aimed to apply novel variant prioritization approaches to identify additional lung cancer variants that may not reach the GWAS level. Effects were combined across studies with a total of 33456 controls and 6756 adenocarcinoma (AC; 13 studies), 5061 squamous cell carcinoma (SCC; 12 studies) and 2216 small cell lung cancer cases (9 studies). Based on prior information such as variant physical properties and functional significance, we applied stratified false discovery rates, hierarchical modeling and Bayesian false discovery probabilities for variant prioritization. We conducted a fine mapping analysis as validation of our methods by examining top-ranking novel variants in six independent populations with a total of 3128 cases and 2966 controls. Three novel loci in the suggestive range were identified based on our Bayesian framework analyses: KCNIP4 at 4p15.2 (rs6448050, P = 4.6×10(-7)) and MTMR2 at 11q21 (rs10501831, P = 3.1×10(-6)) with SCC, as well as GAREM at 18q12.1 (rs11662168, P = 3.4×10(-7)) with AC. Use of our prioritization methods validated two of the top three loci associated with SCC (P = 1.05×10(-4) for KCNIP4, represented by rs9799795) and AC (P = 2.16×10(-4) for GAREM, represented by rs3786309) in the independent fine mapping populations. This study highlights the utility of using prior functional data for sequence variants in prioritization analyses to search for robust signals in the suggestive range. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. Generalization and fine mapping of European ancestry-based central adiposity variants in African ancestry populations

    PubMed Central

    Yoneyama, Sachiko; Yao, Jie; Guo, Xiuqing; Fernandez-Rhodes, Lindsay; Lim, Unhee; Boston, Jonathan; Buzková, Petra; Carlson, Christopher S.; Cheng, Iona; Cochran, Barbara; Cooper, Richard; Ehret, Georg; Fornage, Myriam; Gong, Jian; Gross, Myron; Gu, C. Charles; Haessler, Jeff; Haiman, Christopher A.; Henderson, Brian; Hindorff, Lucia A.; Houston, Denise; Irvin, Marguerite R.; Jackson, Rebecca; Kuller, Lew; Leppert, Mark; Lewis, Cora E.; Li, Rongling; Le Marchand, Loic; Matise, Tara C.; Nguyen, Khanh-Dung H.; Chakravarti, Aravinda; Pankow, James S.; Pankratz, Nathan; Pooler, Loreall; Ritchie, Marylyn D.; Bien, Stephanie A.; Wassel, Christina L.; Chen, Yii-Der I.; Taylor, Kent D.; Allison, Matthew; Rotter, Jerome I.; Schreiner, Pamela J.; Schumacher, Fredrick; Wilkens, Lynne; Boerwinkle, Eric; Kooperberg, Charles; Peters, Ulrike; Buyske, Steven; Graff, Mariaelisa; North, Kari E.

    2016-01-01

    Background/Objectives Central adiposity measures such as waist circumference (WC) and waist-to-hip ratio (WHR) are associated with cardiometabolic disorders independently of BMI and are gaining clinically utility. Several studies report genetic variants associated with central adiposity, but most utilize only European ancestry populations. Understanding whether the genetic associations discovered among mainly European descendants are shared with African ancestry populations will help elucidate the biological underpinnings of abdominal fat deposition. Subjects/Methods To identify the underlying functional genetic determinants of body fat distribution, we conducted an array-wide association meta-analysis among persons of African ancestry across seven studies/consortia participating in the Population Architecture using Genomics and Epidemiology (PAGE) consortium. We used the Metabochip array, designed for fine mapping cardiovascular associated loci, to explore novel array-wide associations with WC and WHR among 15 945 African descendants using all and sex-stratified groups. We further interrogated 17 known WHR regions for African ancestry-specific variants. Results Of the 17 WHR loci, eight SNPs located in four loci were replicated in the sex-combined or sex-stratified meta-analyses. Two of these eight independently associated with WHR after conditioning on the known variant in European descendants (rs12096179 in TBX15-WARS2 and rs2059092 in ADAMTS9). In the fine mapping assessment, the putative functional region was reduced across all four loci but to varying degrees (average 40% drop in number of putative SNPs and 20% drop in genomic region). Similar to previous studies, the significant SNPs in the female stratified analysis were stronger than the significant SNPs from the sex-combined analysis. No novel associations were detected in the array-wide analyses. Conclusions Of 17 previously identified loci, four loci replicated in the African ancestry populations of this study. Utilizing different linkage disequilibrium patterns observed between European and African ancestries, we narrowed the suggestive region containing causative variants for all four loci. PMID:27867202

  6. Comparison of Ion Personal Genome Machine Platforms for the Detection of Variants in BRCA1 and BRCA2.

    PubMed

    Hwang, Sang Mee; Lee, Ki Chan; Lee, Min Seob; Park, Kyoung Un

    2018-01-01

    Transition to next generation sequencing (NGS) for BRCA1 / BRCA2 analysis in clinical laboratories is ongoing but different platforms and/or data analysis pipelines give different results resulting in difficulties in implementation. We have evaluated the Ion Personal Genome Machine (PGM) Platforms (Ion PGM, Ion PGM Dx, Thermo Fisher Scientific) for the analysis of BRCA1 /2. The results of Ion PGM with OTG-snpcaller, a pipeline based on Torrent mapping alignment program and Genome Analysis Toolkit, from 75 clinical samples and 14 reference DNA samples were compared with Sanger sequencing for BRCA1 / BRCA2 . Ten clinical samples and 14 reference DNA samples were additionally sequenced by Ion PGM Dx with Torrent Suite. Fifty types of variants including 18 pathogenic or variants of unknown significance were identified from 75 clinical samples and known variants of the reference samples were confirmed by Sanger sequencing and/or NGS. One false-negative results were present for Ion PGM/OTG-snpcaller for an indel variant misidentified as a single nucleotide variant. However, eight discordant results were present for Ion PGM Dx/Torrent Suite with both false-positive and -negative results. A 40-bp deletion, a 4-bp deletion and a 1-bp deletion variant was not called and a false-positive deletion was identified. Four other variants were misidentified as another variant. Ion PGM/OTG-snpcaller showed acceptable performance with good concordance with Sanger sequencing. However, Ion PGM Dx/Torrent Suite showed many discrepant results not suitable for use in a clinical laboratory, requiring further optimization of the data analysis for calling variants.

  7. Trans-ethnic meta-analysis of white blood cell phenotypes

    PubMed Central

    Keller, Margaux F.; Reiner, Alexander P.; Okada, Yukinori; van Rooij, Frank J.A.; Johnson, Andrew D.; Chen, Ming-Huei; Smith, Albert V.; Morris, Andrew P.; Tanaka, Toshiko; Ferrucci, Luigi; Zonderman, Alan B.; Lettre, Guillaume; Harris, Tamara; Garcia, Melissa; Bandinelli, Stefania; Qayyum, Rehan; Yanek, Lisa R.; Becker, Diane M.; Becker, Lewis C.; Kooperberg, Charles; Keating, Brendan; Reis, Jared; Tang, Hua; Boerwinkle, Eric; Kamatani, Yoichiro; Matsuda, Koichi; Kamatani, Naoyuki; Nakamura, Yusuke; Kubo, Michiaki; Liu, Simin; Dehghan, Abbas; Felix, Janine F.; Hofman, Albert; Uitterlinden, André G.; van Duijn, Cornelia M.; Franco, Oscar H.; Longo, Dan L.; Singleton, Andrew B.; Psaty, Bruce M.; Evans, Michelle K.; Cupples, L. Adrienne; Rotter, Jerome I.; O'Donnell, Christopher J.; Takahashi, Atsushi; Wilson, James G.; Ganesh, Santhi K.; Nalls, Mike A.

    2014-01-01

    White blood cell (WBC) count is a common clinical measure used as a predictor of certain aspects of human health, including immunity and infection status. WBC count is also a complex trait that varies among individuals and ancestry groups. Differences in linkage disequilibrium structure and heterogeneity in allelic effects are expected to play a role in the associations observed between populations. Prior genome-wide association study (GWAS) meta-analyses have identified genomic loci associated with WBC and its subtypes, but much of the heritability of these phenotypes remains unexplained. Using GWAS summary statistics for over 50 000 individuals from three diverse populations (Japanese, African-American and European ancestry), a Bayesian model methodology was employed to account for heterogeneity between ancestry groups. This approach was used to perform a trans-ethnic meta-analysis of total WBC, neutrophil and monocyte counts. Ten previously known associations were replicated and six new loci were identified, including several regions harboring genes related to inflammation and immune cell function. Ninety-five percent credible interval regions were calculated to narrow the association signals and fine-map the putatively causal variants within loci. Finally, a conditional analysis was performed on the most significant SNPs identified by the trans-ethnic meta-analysis (MA), and nine secondary signals within loci previously associated with WBC or its subtypes were identified. This work illustrates the potential of trans-ethnic analysis and ascribes a critical role to multi-ethnic cohorts and consortia in exploring complex phenotypes with respect to variants that lie outside the European-biased GWAS pool. PMID:25096241

  8. Exploiting induced variation to dissect quantitative traits in barley.

    PubMed

    Druka, Arnis; Franckowiak, Jerome; Lundqvist, Udda; Bonar, Nicola; Alexander, Jill; Guzy-Wrobelska, Justyna; Ramsay, Luke; Druka, Ilze; Grant, Iain; Macaulay, Malcolm; Vendramin, Vera; Shahinnia, Fahimeh; Radovic, Slobodanka; Houston, Kelly; Harrap, David; Cardle, Linda; Marshall, David; Morgante, Michele; Stein, Nils; Waugh, Robbie

    2010-04-01

    The identification of genes underlying complex quantitative traits such as grain yield by means of conventional genetic analysis (positional cloning) requires the development of several large mapping populations. However, it is possible that phenotypically related, but more extreme, allelic variants generated by mutational studies could provide a means for more efficient cloning of QTLs (quantitative trait loci). In barley (Hordeum vulgare), with the development of high-throughput genome analysis tools, efficient genome-wide identification of genetic loci harbouring mutant alleles has recently become possible. Genotypic data from NILs (near-isogenic lines) that carry induced or natural variants of genes that control aspects of plant development can be compared with the location of QTLs to potentially identify candidate genes for development--related traits such as grain yield. As yield itself can be divided into a number of allometric component traits such as tillers per plant, kernels per spike and kernel size, mutant alleles that both affect these traits and are located within the confidence intervals for major yield QTLs may represent extreme variants of the underlying genes. In addition, the development of detailed comparative genomic models based on the alignment of a high-density barley gene map with the rice and sorghum physical maps, has enabled an informed prioritization of 'known function' genes as candidates for both QTLs and induced mutant genes.

  9. A comprehensive quality control workflow for paired tumor-normal NGS experiments.

    PubMed

    Schroeder, Christopher M; Hilke, Franz J; Löffler, Markus W; Bitzer, Michael; Lenz, Florian; Sturm, Marc

    2017-06-01

    Quality control (QC) is an important part of all NGS data analysis stages. Many available tools calculate QC metrics from different analysis steps of single sample experiments (raw reads, mapped reads and variant lists). Multi-sample experiments, as sequencing of tumor-normal pairs, require additional QC metrics to ensure validity of results. These multi-sample QC metrics still lack standardization. We therefore suggest a new workflow for QC of DNA sequencing of tumor-normal pairs. With this workflow well-known single-sample QC metrics and additional metrics specific for tumor-normal pairs can be calculated. The segmentation into different tools offers a high flexibility and allows reuse for other purposes. All tools produce qcML, a generic XML format for QC of -omics experiments. qcML uses quality metrics defined in an ontology, which was adapted for NGS. All QC tools are implemented in C ++ and run both under Linux and Windows. Plotting requires python 2.7 and matplotlib. The software is available under the 'GNU General Public License version 2' as part of the ngs-bits project: https://github.com/imgag/ngs-bits. christopher.schroeder@med.uni-tuebingen.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  10. Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Na, Seungjin; Payne, Samuel H.; Bandeira, Nuno

    The spectral networks approach enables the detection of pairs of spectra from related peptides and thus allows for the propagation of annotations from identified peptides to unidentified spectra. Beyond allowing for unbiased discovery of unexpected post-translational modifications, spectral networks are also applicable to multi-species comparative proteomics or metaproteomics to identify numerous orthologous versions of a protein. We present algorithmic and statistical advances in spectral networks that have made it possible to rigorously assess the statistical significance of spectral pairs and accurately estimate the error rate of identifications via propagation. In the analysis of three related Cyanothece species, a model organismmore » for biohydrogen production, spectral networks identified peptides with highly divergent sequences with up to dozens of variants per peptide, including many novel peptides in species that lack a sequenced genome. Furthermore, spectral networks strongly suggested the presence of novel peptides even in genomically characterized species (i.e. missing from databases) in that a significant portion of unidentified multi-species networks included at least two polymorphic peptide variants.« less

  11. Identifying Likely Transmission Pathways within a 10-Year Community Outbreak of Tuberculosis by High-Depth Whole Genome Sequencing

    PubMed Central

    Sadsad, Rosemarie; Martinez, Elena; Jelfs, Peter; Hill-Cawthorne, Grant A.; Gilbert, Gwendolyn L.; Marais, Ben J.; Sintchenko, Vitali

    2016-01-01

    Background Improved tuberculosis control and the need to contain the spread of drug-resistant strains provide a strong rationale for exploring tuberculosis transmission dynamics at the population level. Whole-genome sequencing provides optimal strain resolution, facilitating detailed mapping of potential transmission pathways. Methods We sequenced 22 isolates from a Mycobacterium tuberculosis cluster in New South Wales, Australia, identified during routine 24-locus mycobacterial interspersed repetitive unit typing. Following high-depth paired-end sequencing using the Illumina HiSeq 2000 platform, two independent pipelines were employed for analysis, both employing read mapping onto reference genomes as well as de novo assembly, to control biases in variant detection. In addition to single-nucleotide polymorphisms, the analyses also sought to identify insertions, deletions and structural variants. Results Isolates were highly similar, with a distance of 13 variants between the most distant members of the cluster. The most sensitive analysis classified the 22 isolates into 18 groups. Four of the isolates did not appear to share a recent common ancestor with the largest clade; another four isolates had an uncertain ancestral relationship with the largest clade. Conclusion Whole genome sequencing, with analysis of single-nucleotide polymorphisms, insertions, deletions, structural variants and subpopulations, enabled the highest possible level of discrimination between cluster members, clarifying likely transmission pathways and exposing the complexity of strain origin. The analysis provides a basis for targeted public health intervention and enhanced classification of future isolates linked to the cluster. PMID:26938641

  12. Arrowland v1.0

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    BIRKEL, GARRETT; GARCIA MARTIN, HECTOR; MORRELL, WILLIAM

    "Arrowland" is a web-based software application primarily for mapping, integrating and visualizing a variety of metabolism data of living organisms, including but not limited to metabolomics, proteomics, transcriptomics and fluxomics. This software application makes multi-omics data analysis intuitive and interactive. It improves data sharing and communication by enabling users to visualize their omics data using a web browser (on a PC or mobile device). It increases user's productivity by simplifying multi-omics data analysis using well developed maps as a guide. Users using this tool can gain insights into their data sets that would be difficult or even impossible to teasemore » out by looking at raw number, or using their currently existing toolchains to generate static single-use maps. Arrowland helps users save time by visualizing relative changes in different conditions or over time, and helps users to produce more significant insights faster. Preexisting maps decrease the learning curve for beginners in the omics field. Sets of multi-omics data are presented in the browser, as a two-dimensional flowchart resembling a map, with varying levels of detail information, based on the scaling of the map. Users can pan and zoom to explore different maps, compare maps, upload their own research data sets onto desired maps, alter map appearance in ways that facilitate interpretation, visualization and analysis of the given data, and export data, reports and actionable items to help the user initiative.« less

  13. Complex nature of SNP genotype effects on gene expression in primary human leucocytes.

    PubMed

    Heap, Graham A; Trynka, Gosia; Jansen, Ritsert C; Bruinenberg, Marcel; Swertz, Morris A; Dinesen, Lotte C; Hunt, Karen A; Wijmenga, Cisca; Vanheel, David A; Franke, Lude

    2009-01-07

    Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown. We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease - a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects. In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (< 250 kb from SNP, at FDR = 0.05, cis expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected. In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.

  14. Incorporating Functional Annotations for Fine-Mapping Causal Variants in a Bayesian Framework Using Summary Statistics.

    PubMed

    Chen, Wenan; McDonnell, Shannon K; Thibodeau, Stephen N; Tillmans, Lori S; Schaid, Daniel J

    2016-11-01

    Functional annotations have been shown to improve both the discovery power and fine-mapping accuracy in genome-wide association studies. However, the optimal strategy to incorporate the large number of existing annotations is still not clear. In this study, we propose a Bayesian framework to incorporate functional annotations in a systematic manner. We compute the maximum a posteriori solution and use cross validation to find the optimal penalty parameters. By extending our previous fine-mapping method CAVIARBF into this framework, we require only summary statistics as input. We also derived an exact calculation of Bayes factors using summary statistics for quantitative traits, which is necessary when a large proportion of trait variance is explained by the variants of interest, such as in fine mapping expression quantitative trait loci (eQTL). We compared the proposed method with PAINTOR using different strategies to combine annotations. Simulation results show that the proposed method achieves the best accuracy in identifying causal variants among the different strategies and methods compared. We also find that for annotations with moderate effects from a large annotation pool, screening annotations individually and then combining the top annotations can produce overly optimistic results. We applied these methods on two real data sets: a meta-analysis result of lipid traits and a cis-eQTL study of normal prostate tissues. For the eQTL data, incorporating annotations significantly increased the number of potential causal variants with high probabilities. Copyright © 2016 by the Genetics Society of America.

  15. Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes.

    PubMed

    Mahajan, Anubha; Wessel, Jennifer; Willems, Sara M; Zhao, Wei; Robertson, Neil R; Chu, Audrey Y; Gan, Wei; Kitajima, Hidetoshi; Taliun, Daniel; Rayner, N William; Guo, Xiuqing; Lu, Yingchang; Li, Man; Jensen, Richard A; Hu, Yao; Huo, Shaofeng; Lohman, Kurt K; Zhang, Weihua; Cook, James P; Prins, Bram Peter; Flannick, Jason; Grarup, Niels; Trubetskoy, Vassily Vladimirovich; Kravic, Jasmina; Kim, Young Jin; Rybin, Denis V; Yaghootkar, Hanieh; Müller-Nurasyid, Martina; Meidtner, Karina; Li-Gao, Ruifang; Varga, Tibor V; Marten, Jonathan; Li, Jin; Smith, Albert Vernon; An, Ping; Ligthart, Symen; Gustafsson, Stefan; Malerba, Giovanni; Demirkan, Ayse; Tajes, Juan Fernandez; Steinthorsdottir, Valgerdur; Wuttke, Matthias; Lecoeur, Cécile; Preuss, Michael; Bielak, Lawrence F; Graff, Marielisa; Highland, Heather M; Justice, Anne E; Liu, Dajiang J; Marouli, Eirini; Peloso, Gina Marie; Warren, Helen R; Afaq, Saima; Afzal, Shoaib; Ahlqvist, Emma; Almgren, Peter; Amin, Najaf; Bang, Lia B; Bertoni, Alain G; Bombieri, Cristina; Bork-Jensen, Jette; Brandslund, Ivan; Brody, Jennifer A; Burtt, Noël P; Canouil, Mickaël; Chen, Yii-Der Ida; Cho, Yoon Shin; Christensen, Cramer; Eastwood, Sophie V; Eckardt, Kai-Uwe; Fischer, Krista; Gambaro, Giovanni; Giedraitis, Vilmantas; Grove, Megan L; de Haan, Hugoline G; Hackinger, Sophie; Hai, Yang; Han, Sohee; Tybjærg-Hansen, Anne; Hivert, Marie-France; Isomaa, Bo; Jäger, Susanne; Jørgensen, Marit E; Jørgensen, Torben; Käräjämäki, Annemari; Kim, Bong-Jo; Kim, Sung Soo; Koistinen, Heikki A; Kovacs, Peter; Kriebel, Jennifer; Kronenberg, Florian; Läll, Kristi; Lange, Leslie A; Lee, Jung-Jin; Lehne, Benjamin; Li, Huaixing; Lin, Keng-Hung; Linneberg, Allan; Liu, Ching-Ti; Liu, Jun; Loh, Marie; Mägi, Reedik; Mamakou, Vasiliki; McKean-Cowdin, Roberta; Nadkarni, Girish; Neville, Matt; Nielsen, Sune F; Ntalla, Ioanna; Peyser, Patricia A; Rathmann, Wolfgang; Rice, Kenneth; Rich, Stephen S; Rode, Line; Rolandsson, Olov; Schönherr, Sebastian; Selvin, Elizabeth; Small, Kerrin S; Stančáková, Alena; Surendran, Praveen; Taylor, Kent D; Teslovich, Tanya M; Thorand, Barbara; Thorleifsson, Gudmar; Tin, Adrienne; Tönjes, Anke; Varbo, Anette; Witte, Daniel R; Wood, Andrew R; Yajnik, Pranav; Yao, Jie; Yengo, Loïc; Young, Robin; Amouyel, Philippe; Boeing, Heiner; Boerwinkle, Eric; Bottinger, Erwin P; Chowdhury, Rajiv; Collins, Francis S; Dedoussis, George; Dehghan, Abbas; Deloukas, Panos; Ferrario, Marco M; Ferrières, Jean; Florez, Jose C; Frossard, Philippe; Gudnason, Vilmundur; Harris, Tamara B; Heckbert, Susan R; Howson, Joanna M M; Ingelsson, Martin; Kathiresan, Sekar; Kee, Frank; Kuusisto, Johanna; Langenberg, Claudia; Launer, Lenore J; Lindgren, Cecilia M; Männistö, Satu; Meitinger, Thomas; Melander, Olle; Mohlke, Karen L; Moitry, Marie; Morris, Andrew D; Murray, Alison D; de Mutsert, Renée; Orho-Melander, Marju; Owen, Katharine R; Perola, Markus; Peters, Annette; Province, Michael A; Rasheed, Asif; Ridker, Paul M; Rivadineira, Fernando; Rosendaal, Frits R; Rosengren, Anders H; Salomaa, Veikko; Sheu, Wayne H-H; Sladek, Rob; Smith, Blair H; Strauch, Konstantin; Uitterlinden, André G; Varma, Rohit; Willer, Cristen J; Blüher, Matthias; Butterworth, Adam S; Chambers, John Campbell; Chasman, Daniel I; Danesh, John; van Duijn, Cornelia; Dupuis, Josée; Franco, Oscar H; Franks, Paul W; Froguel, Philippe; Grallert, Harald; Groop, Leif; Han, Bok-Ghee; Hansen, Torben; Hattersley, Andrew T; Hayward, Caroline; Ingelsson, Erik; Kardia, Sharon L R; Karpe, Fredrik; Kooner, Jaspal Singh; Köttgen, Anna; Kuulasmaa, Kari; Laakso, Markku; Lin, Xu; Lind, Lars; Liu, Yongmei; Loos, Ruth J F; Marchini, Jonathan; Metspalu, Andres; Mook-Kanamori, Dennis; Nordestgaard, Børge G; Palmer, Colin N A; Pankow, James S; Pedersen, Oluf; Psaty, Bruce M; Rauramaa, Rainer; Sattar, Naveed; Schulze, Matthias B; Soranzo, Nicole; Spector, Timothy D; Stefansson, Kari; Stumvoll, Michael; Thorsteinsdottir, Unnur; Tuomi, Tiinamaija; Tuomilehto, Jaakko; Wareham, Nicholas J; Wilson, James G; Zeggini, Eleftheria; Scott, Robert A; Barroso, Inês; Frayling, Timothy M; Goodarzi, Mark O; Meigs, James B; Boehnke, Michael; Saleheen, Danish; Morris, Andrew P; Rotter, Jerome I; McCarthy, Mark I

    2018-04-01

    We aggregated coding variant data for 81,412 type 2 diabetes cases and 370,832 controls of diverse ancestry, identifying 40 coding variant association signals (P < 2.2 × 10 -7 ); of these, 16 map outside known risk-associated loci. We make two important observations. First, only five of these signals are driven by low-frequency variants: even for these, effect sizes are modest (odds ratio ≤1.29). Second, when we used large-scale genome-wide association data to fine-map the associated variants in their regional context, accounting for the global enrichment of complex trait associations in coding sequence, compelling evidence for coding variant causality was obtained for only 16 signals. At 13 others, the associated coding variants clearly represent 'false leads' with potential to generate erroneous mechanistic inference. Coding variant associations offer a direct route to biological insight for complex diseases and identification of validated therapeutic targets; however, appropriate mechanistic inference requires careful specification of their causal contribution to disease predisposition.

  16. An integrated map of genetic variation from 1,092 human genomes

    PubMed Central

    2012-01-01

    Summary Through characterising the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help understand the genetic contribution to disease. We describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methodologies to integrate information across multiple algorithms and diverse data sources we provide a validated haplotype map of 38 million SNPs, 1.4 million indels and over 14 thousand larger deletions. We show that individuals from different populations carry different profiles of rare and common variants and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways and that each individual harbours hundreds of rare non-coding variants at conserved sites, such as transcription-factor-motif disrupting changes. This resource, which captures up to 98% of accessible SNPs at a frequency of 1% in populations of medical genetics focus, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations. PMID:23128226

  17. CLIP-seq analysis of multi-mapped reads discovers novel functional RNA regulatory sites in the human transcriptome.

    PubMed

    Zhang, Zijun; Xing, Yi

    2017-09-19

    Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP-seq or RIP-seq) allows transcriptome-wide discovery of RNA regulatory sites. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. We present CLAM (CLIP-seq Analysis of Multi-mapped reads). CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. To demonstrate the utility of CLAM, we applied it to a wide range of public CLIP-seq/RIP-seq datasets involving numerous splicing factors, microRNAs and m6A RNA methylation. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads. The functional significance of these sites was demonstrated by consensus motif patterns and association with alternative splicing (splicing factors), transcript abundance (AGO2) and mRNA half-life (m6A). CLAM provides a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data, and reveals the significant contribution of repetitive elements to the RNA regulatory landscape of the human transcriptome. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Mapping Informative Clusters in a Hierarchial Framework of fMRI Multivariate Analysis

    PubMed Central

    Xu, Rui; Zhen, Zonglei; Liu, Jia

    2010-01-01

    Pattern recognition methods have become increasingly popular in fMRI data analysis, which are powerful in discriminating between multi-voxel patterns of brain activities associated with different mental states. However, when they are used in functional brain mapping, the location of discriminative voxels varies significantly, raising difficulties in interpreting the locus of the effect. Here we proposed a hierarchical framework of multivariate approach that maps informative clusters rather than voxels to achieve reliable functional brain mapping without compromising the discriminative power. In particular, we first searched for local homogeneous clusters that consisted of voxels with similar response profiles. Then, a multi-voxel classifier was built for each cluster to extract discriminative information from the multi-voxel patterns. Finally, through multivariate ranking, outputs from the classifiers were served as a multi-cluster pattern to identify informative clusters by examining interactions among clusters. Results from both simulated and real fMRI data demonstrated that this hierarchical approach showed better performance in the robustness of functional brain mapping than traditional voxel-based multivariate methods. In addition, the mapped clusters were highly overlapped for two perceptually equivalent object categories, further confirming the validity of our approach. In short, the hierarchical framework of multivariate approach is suitable for both pattern classification and brain mapping in fMRI studies. PMID:21152081

  19. Fine mapping of the celiac disease-associated LPP locus reveals a potential functional variant.

    PubMed

    Almeida, Rodrigo; Ricaño-Ponce, Isis; Kumar, Vinod; Deelen, Patrick; Szperl, Agata; Trynka, Gosia; Gutierrez-Achury, Javier; Kanterakis, Alexandros; Westra, Harm-Jan; Franke, Lude; Swertz, Morris A; Platteel, Mathieu; Bilbao, Jose Ramon; Barisani, Donatella; Greco, Luigi; Mearin, Luisa; Wolters, Victorien M; Mulder, Chris; Mazzilli, Maria Cristina; Sood, Ajit; Cukrowska, Bozena; Núñez, Concepción; Pratesi, Riccardo; Withoff, Sebo; Wijmenga, Cisca

    2014-05-01

    Using the Immunochip for genotyping, we identified 39 non-human leukocyte antigen (non-HLA) loci associated to celiac disease (CeD), an immune-mediated disease with a worldwide frequency of ∼1%. The most significant non-HLA signal mapped to the intronic region of 70 kb in the LPP gene. Our aim was to fine map and identify possible functional variants in the LPP locus. We performed a meta-analysis in a cohort of 25 169 individuals from six different populations previously genotyped using Immunochip. Imputation using data from the Genome of the Netherlands and 1000 Genomes projects, followed by meta-analysis, confirmed the strong association signal on the LPP locus (rs2030519, P = 1.79 × 10(-49)), without any novel associations. The conditional analysis on this top SNP-indicated association to a single common haplotype. By performing haplotype analyses in each population separately, as well as in a combined group of the four populations that reach the significant threshold after correction (P < 0.008), we narrowed down the CeD-associated region from 70 to 2.8 kb (P = 1.35 × 10(-44)). By intersecting regulatory data from the ENCODE project, we found a functional SNP, rs4686484 (P = 3.12 × 10(-49)), that maps to several B-cell enhancer elements and a highly conserved region. This SNP was also predicted to change the binding motif of the transcription factors IRF4, IRF11, Nkx2.7 and Nkx2.9, suggesting its role in transcriptional regulation. We later found significantly low levels of LPP mRNA in CeD biopsies compared with controls, thus our results suggest that rs4686484 is the functional variant in this locus, while LPP expression is decreased in CeD.

  20. A meta-analysis of 87,040 individuals identifies 23 new susceptibility loci for prostate cancer.

    PubMed

    Al Olama, Ali Amin; Kote-Jarai, Zsofia; Berndt, Sonja I; Conti, David V; Schumacher, Fredrick; Han, Ying; Benlloch, Sara; Hazelett, Dennis J; Wang, Zhaoming; Saunders, Ed; Leongamornlert, Daniel; Lindstrom, Sara; Jugurnauth-Little, Sara; Dadaev, Tokhir; Tymrakiewicz, Malgorzata; Stram, Daniel O; Rand, Kristin; Wan, Peggy; Stram, Alex; Sheng, Xin; Pooler, Loreall C; Park, Karen; Xia, Lucy; Tyrer, Jonathan; Kolonel, Laurence N; Le Marchand, Loic; Hoover, Robert N; Machiela, Mitchell J; Yeager, Merideth; Burdette, Laurie; Chung, Charles C; Hutchinson, Amy; Yu, Kai; Goh, Chee; Ahmed, Mahbubl; Govindasami, Koveela; Guy, Michelle; Tammela, Teuvo L J; Auvinen, Anssi; Wahlfors, Tiina; Schleutker, Johanna; Visakorpi, Tapio; Leinonen, Katri A; Xu, Jianfeng; Aly, Markus; Donovan, Jenny; Travis, Ruth C; Key, Tim J; Siddiq, Afshan; Canzian, Federico; Khaw, Kay-Tee; Takahashi, Atsushi; Kubo, Michiaki; Pharoah, Paul; Pashayan, Nora; Weischer, Maren; Nordestgaard, Borge G; Nielsen, Sune F; Klarskov, Peter; Røder, Martin Andreas; Iversen, Peter; Thibodeau, Stephen N; McDonnell, Shannon K; Schaid, Daniel J; Stanford, Janet L; Kolb, Suzanne; Holt, Sarah; Knudsen, Beatrice; Coll, Antonio Hurtado; Gapstur, Susan M; Diver, W Ryan; Stevens, Victoria L; Maier, Christiane; Luedeke, Manuel; Herkommer, Kathleen; Rinckleb, Antje E; Strom, Sara S; Pettaway, Curtis; Yeboah, Edward D; Tettey, Yao; Biritwum, Richard B; Adjei, Andrew A; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P; Cannon-Albright, Lisa; Cybulski, Cezary; Wokołorczyk, Dominika; Kluźniak, Wojciech; Park, Jong; Sellers, Thomas; Lin, Hui-Yi; Isaacs, William B; Partin, Alan W; Brenner, Hermann; Dieffenbach, Aida Karina; Stegmaier, Christa; Chen, Constance; Giovannucci, Edward L; Ma, Jing; Stampfer, Meir; Penney, Kathryn L; Mucci, Lorelei; John, Esther M; Ingles, Sue A; Kittles, Rick A; Murphy, Adam B; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej M; Blot, William; Signorello, Lisa B; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, Cristina; Wu, Suh-Yuh; Hennis, Anselm; Kibel, Adam S; Rybicki, Benjamin A; Neslund-Dudas, Christine; Hsing, Ann W; Chu, Lisa; Goodman, Phyllis J; Klein, Eric A; Zheng, S Lilly; Batra, Jyotsna; Clements, Judith; Spurdle, Amanda; Teixeira, Manuel R; Paulo, Paula; Maia, Sofia; Slavov, Chavdar; Kaneva, Radka; Mitev, Vanio; Witte, John S; Casey, Graham; Gillanders, Elizabeth M; Seminara, Daniella; Riboli, Elio; Hamdy, Freddie C; Coetzee, Gerhard A; Li, Qiyuan; Freedman, Matthew L; Hunter, David J; Muir, Kenneth; Gronberg, Henrik; Neal, David E; Southey, Melissa; Giles, Graham G; Severi, Gianluca; Cook, Michael B; Nakagawa, Hidewaki; Wiklund, Fredrik; Kraft, Peter; Chanock, Stephen J; Henderson, Brian E; Easton, Douglas F; Eeles, Rosalind A; Haiman, Christopher A

    2014-10-01

    Genome-wide association studies (GWAS) have identified 76 variants associated with prostate cancer risk predominantly in populations of European ancestry. To identify additional susceptibility loci for this common cancer, we conducted a meta-analysis of > 10 million SNPs in 43,303 prostate cancer cases and 43,737 controls from studies in populations of European, African, Japanese and Latino ancestry. Twenty-three new susceptibility loci were identified at association P < 5 × 10(-8); 15 variants were identified among men of European ancestry, 7 were identified in multi-ancestry analyses and 1 was associated with early-onset prostate cancer. These 23 variants, in combination with known prostate cancer risk variants, explain 33% of the familial risk for this disease in European-ancestry populations. These findings provide new regions for investigation into the pathogenesis of prostate cancer and demonstrate the usefulness of combining ancestrally diverse populations to discover risk loci for disease.

  1. Multi-Disciplinary, Multi-Fidelity Discrete Data Transfer Using Degenerate Geometry Forms

    NASA Technical Reports Server (NTRS)

    Olson, Erik D.

    2016-01-01

    In a typical multi-fidelity design process, different levels of geometric abstraction are used for different analysis methods, and transitioning from one phase of design to the next often requires a complete re-creation of the geometry. To maintain consistency between lower-order and higher-order analysis results, Vehicle Sketch Pad (OpenVSP) recently introduced the ability to generate and export several degenerate forms of the geometry, representing the type of abstraction required to perform low- to medium-order analysis for a range of aeronautical disciplines. In this research, the functionality of these degenerate models was extended, so that in addition to serving as repositories for the geometric information that is required as input to an analysis, the degenerate models can also store the results of that analysis mapped back onto the geometric nodes. At the same time, the results are also mapped indirectly onto the nodes of lower-order degenerate models using a process called aggregation, and onto higher-order models using a process called disaggregation. The mapped analysis results are available for use by any subsequent analysis in an integrated design and analysis process. A simple multi-fidelity analysis process for a single-aisle subsonic transport aircraft is used as an example case to demonstrate the value of the approach.

  2. FAVR (Filtering and Annotation of Variants that are Rare): methods to facilitate the analysis of rare germline genetic variants from massively parallel sequencing datasets

    PubMed Central

    2013-01-01

    Background Characterising genetic diversity through the analysis of massively parallel sequencing (MPS) data offers enormous potential to significantly improve our understanding of the genetic basis for observed phenotypes, including predisposition to and progression of complex human disease. Great challenges remain in resolving genetic variants that are genuine from the millions of artefactual signals. Results FAVR is a suite of new methods designed to work with commonly used MPS analysis pipelines to assist in the resolution of some of the issues related to the analysis of the vast amount of resulting data, with a focus on relatively rare genetic variants. To the best of our knowledge, no equivalent method has previously been described. The most important and novel aspect of FAVR is the use of signatures in comparator sequence alignment files during variant filtering, and annotation of variants potentially shared between individuals. The FAVR methods use these signatures to facilitate filtering of (i) platform and/or mapping-specific artefacts, (ii) common genetic variants, and, where relevant, (iii) artefacts derived from imbalanced paired-end sequencing, as well as annotation of genetic variants based on evidence of co-occurrence in individuals. We applied conventional variant calling applied to whole-exome sequencing datasets, produced using both SOLiD and TruSeq chemistries, with or without downstream processing by FAVR methods. We demonstrate a 3-fold smaller rare single nucleotide variant shortlist with no detected reduction in sensitivity. This analysis included Sanger sequencing of rare variant signals not evident in dbSNP131, assessment of known variant signal preservation, and comparison of observed and expected rare variant numbers across a range of first cousin pairs. The principles described herein were applied in our recent publication identifying XRCC2 as a new breast cancer risk gene and have been made publically available as a suite of software tools. Conclusions FAVR is a platform-agnostic suite of methods that significantly enhances the analysis of large volumes of sequencing data for the study of rare genetic variants and their influence on phenotypes. PMID:23441864

  3. [Clinical value evaluation of Chinese herbal formula in context of multi-omics network].

    PubMed

    Li, Bing; Han, Fei; Wang, Zhong; Wang, Yong-Yan

    2017-03-01

    Clinical value evaluation is the key issue to solve the problems such as high repetition rate, fuzzy clinical positioning, broad indications and unclear clinical values in Chinese herbal formula(Chinese patent medicine). By analyzing the challenges and opportunities of Chinese herbal formula in clinical value evaluation, this paper introduced a strategy of multi-omic network analysis. Through comparative analysis of three stroke treatment formulas, we suggested their different characteristic advantages for variant symptoms or phenotypes of stroke, which may provide reference for rational clinical choice. Such multi-omic network analysis strategy may open a unique angle of view for clinical evaluation and comparison of Chinese herbal formula. Copyright© by the Chinese Pharmaceutical Association.

  4. Racial disparity in pathophysiologic pathways of preterm birth based on genetic variants

    PubMed Central

    Menon, Ramkumar; Pearce, Brad; Velez, Digna R; Merialdi, Mario; Williams, Scott M; Fortunato, Stephen J; Thorsen, Poul

    2009-01-01

    Objective To study pathophysiologic pathways in spontaneous preterm birth and possibly the racial disparity associating with maternal and fetal genetic variations, using bioinformatics tools. Methods A large scale candidate gene association study was performed on 1442 SNPs in 130 genes in a case (preterm birth < 36 weeks) control study (term birth > 37 weeks). Both maternal and fetal DNA from Caucasians (172 cases and 198 controls) and 279 African-Americans (82 cases and 197 controls) were used. A single locus association (genotypic) analysis followed by hierarchical clustering was performed, where clustering was based on p values for significant associations within each race. Using Ingenuity Pathway Analysis (IPA) software, known pathophysiologic pathways in both races were determined. Results From all SNPs entered into the analysis, the IPA mapped genes to specific disease functions. Gene variants in Caucasians were implicated in disease functions shared with other known disorders; specifically, dermatopathy, inflammation, and hematological disorders. This may reflect abnormal cervical ripening and decidual hemorrhage. In African-Americans inflammatory pathways were the most prevalent. In Caucasians, maternal gene variants showed the most prominent role in disease functions, whereas in African Americans it was fetal variants. The IPA software was used to generate molecular interaction maps that differed between races and also between maternal and fetal genetic variants. Conclusion Differences at the genetic level revealed distinct disease functions and operational pathways in African Americans and Caucasians in spontaneous preterm birth. Differences in maternal and fetal contributions in pregnancy outcome are also different between African Americans and Caucasians. These results present a set of explicit testable hypotheses regarding genetic associations with preterm birth in African Americans and Caucasians PMID:19527514

  5. Higher criticism approach to detect rare variants using whole genome sequencing data

    PubMed Central

    2014-01-01

    Because of low statistical power of single-variant tests for whole genome sequencing (WGS) data, the association test for variant groups is a key approach for genetic mapping. To address the features of sparse and weak genetic effects to be detected, the higher criticism (HC) approach has been proposed and theoretically has proven optimal for detecting sparse and weak genetic effects. Here we develop a strategy to apply the HC approach to WGS data that contains rare variants as the majority. By using Genetic Analysis Workshop 18 "dose" genetic data with simulated phenotypes, we assess the performance of HC under a variety of strategies for grouping variants and collapsing rare variants. The HC approach is compared with the minimal p-value method and the sequence kernel association test. The results show that the HC approach is preferred for detecting weak genetic effects. PMID:25519367

  6. Fine mapping of a distal chromosome 4 QTL affecting growth and muscle mass in a chicken advanced intercross line.

    PubMed

    Lyu, S; Arends, D; Nassar, M K; Brockmann, G A

    2017-06-01

    In our previous research, QTL analysis in an F 2 cross between the inbred New Hampshire (NHI) and White Leghorn (WL77) lines revealed a growth QTL in the distal part of chromosome 4. To physically reduce the chromosomal interval and the number of potential candidate genes, we performed fine mapping using individuals of generations F 10 , F 11 and F 12 in an advanced intercross line that had been established from the initial F 2 mapping population. Using nine single nucleotide polymorphism (SNP) markers within the QTL region for an association analysis with several growth traits from hatch to 20 weeks and body composition traits at 20 weeks, we could reduce the confidence interval from 26.9 to 3.4 Mb. Within the fine mapped region, markers rs14490774, rs314961352 and rs318175270 were in full linkage disequilibrium (D' = 1.0) and showed the strongest effect on growth and muscle mass (LOD ≥ 4.00). This reduced region contains 30 genes, compared to 292 genes in the original region. Chicken 60 K and 600 K SNP chips combined with DNA sequencing of the parental lines were used to call mutations in the reduced region. In the narrowed-down region, 489 sequence variants were detected between NHI and WL77. The most deleterious variants are a missense variant in ADGRA3 (SIFT = 0.02) and a frameshift deletion in the functional unknown gene ENSGALG00000014401 in NHI chicken. In addition, five synonymous variants were discovered in genes PPARGC1A, ADGRA3, PACRGL, SLIT2 and FAM184B. In our study, the confidence interval and the number of potential genes could be reduced 8- and 10- fold respectively. Further research will focus on functional effects of mutant genes. © 2017 Stichting International Foundation for Animal Genetics.

  7. Improved Topographic Mapping Through Multi-Baseline SAR Interferometry with MAP Estimation

    NASA Astrophysics Data System (ADS)

    Dong, Yuting; Jiang, Houjun; Zhang, Lu; Liao, Mingsheng; Shi, Xuguo

    2015-05-01

    There is an inherent contradiction between the sensitivity of height measurement and the accuracy of phase unwrapping for SAR interferometry (InSAR) over rough terrain. This contradiction can be resolved by multi-baseline InSAR analysis, which exploits multiple phase observations with different normal baselines to improve phase unwrapping accuracy, or even avoid phase unwrapping. In this paper we propose a maximum a posteriori (MAP) estimation method assisted by SRTM DEM data for multi-baseline InSAR topographic mapping. Based on our method, a data processing flow is established and applied in processing multi-baseline ALOS/PALSAR dataset. The accuracy of resultant DEMs is evaluated by using a standard Chinese national DEM of scale 1:10,000 as reference. The results show that multi-baseline InSAR can improve DEM accuracy compared with single-baseline case. It is noteworthy that phase unwrapping is avoided and the quality of multi-baseline InSAR DEM can meet the DTED-2 standard.

  8. Mapping Creativity: Creativity Measurements Network Analysis

    ERIC Educational Resources Information Center

    Pinheiro, Igor Reszka; Cruz, Roberto Moraes

    2014-01-01

    This article borrowed network analysis tools to discover how the construct formed by the set of all measures of creativity configures itself. To this end, using a variant of the meta-analytical method, a database was compiled simulating 42,381 responses to 974 variables centered on 64 creativity measures. Results, although preliminary, indicate…

  9. VariantSpark: population scale clustering of genotype information.

    PubMed

    O'Brien, Aidan R; Saunders, Neil F W; Guo, Yi; Buske, Fabian A; Scott, Rodney J; Bauer, Denis C

    2015-12-10

    Genomic information is increasingly used in medical practice giving rise to the need for efficient analysis methodology able to cope with thousands of individuals and millions of variants. The widely used Hadoop MapReduce architecture and associated machine learning library, Mahout, provide the means for tackling computationally challenging tasks. However, many genomic analyses do not fit the Map-Reduce paradigm. We therefore utilise the recently developed SPARK engine, along with its associated machine learning library, MLlib, which offers more flexibility in the parallelisation of population-scale bioinformatics tasks. The resulting tool, VARIANTSPARK provides an interface from MLlib to the standard variant format (VCF), offers seamless genome-wide sampling of variants and provides a pipeline for visualising results. To demonstrate the capabilities of VARIANTSPARK, we clustered more than 3,000 individuals with 80 Million variants each to determine the population structure in the dataset. VARIANTSPARK is 80 % faster than the SPARK-based genome clustering approach, ADAM, the comparable implementation using Hadoop/Mahout, as well as ADMIXTURE, a commonly used tool for determining individual ancestries. It is over 90 % faster than traditional implementations using R and Python. The benefits of speed, resource consumption and scalability enables VARIANTSPARK to open up the usage of advanced, efficient machine learning algorithms to genomic data.

  10. A GIS-based extended fuzzy multi-criteria evaluation for landslide susceptibility mapping

    NASA Astrophysics Data System (ADS)

    Feizizadeh, Bakhtiar; Shadman Roodposhti, Majid; Jankowski, Piotr; Blaschke, Thomas

    2014-12-01

    Landslide susceptibility mapping (LSM) is making increasing use of GIS-based spatial analysis in combination with multi-criteria evaluation (MCE) methods. We have developed a new multi-criteria decision analysis (MCDA) method for LSM and applied it to the Izeh River basin in south-western Iran. Our method is based on fuzzy membership functions (FMFs) derived from GIS analysis. It makes use of nine causal landslide factors identified by local landslide experts. Fuzzy set theory was first integrated with an analytical hierarchy process (AHP) in order to use pairwise comparisons to compare LSM criteria for ranking purposes. FMFs were then applied in order to determine the criteria weights to be used in the development of a landslide susceptibility map. Finally, a landslide inventory database was used to validate the LSM map by comparing it with known landslides within the study area. Results indicated that the integration of fuzzy set theory with AHP produced significantly improved accuracies and a high level of reliability in the resulting landslide susceptibility map. Approximately 53% of known landslides within our study area fell within zones classified as having "very high susceptibility", with the further 31% falling into zones classified as having "high susceptibility".

  11. A GIS-based extended fuzzy multi-criteria evaluation for landslide susceptibility mapping

    PubMed Central

    Feizizadeh, Bakhtiar; Shadman Roodposhti, Majid; Jankowski, Piotr; Blaschke, Thomas

    2014-01-01

    Landslide susceptibility mapping (LSM) is making increasing use of GIS-based spatial analysis in combination with multi-criteria evaluation (MCE) methods. We have developed a new multi-criteria decision analysis (MCDA) method for LSM and applied it to the Izeh River basin in south-western Iran. Our method is based on fuzzy membership functions (FMFs) derived from GIS analysis. It makes use of nine causal landslide factors identified by local landslide experts. Fuzzy set theory was first integrated with an analytical hierarchy process (AHP) in order to use pairwise comparisons to compare LSM criteria for ranking purposes. FMFs were then applied in order to determine the criteria weights to be used in the development of a landslide susceptibility map. Finally, a landslide inventory database was used to validate the LSM map by comparing it with known landslides within the study area. Results indicated that the integration of fuzzy set theory with AHP produced significantly improved accuracies and a high level of reliability in the resulting landslide susceptibility map. Approximately 53% of known landslides within our study area fell within zones classified as having “very high susceptibility”, with the further 31% falling into zones classified as having “high susceptibility”. PMID:26089577

  12. A GIS-based extended fuzzy multi-criteria evaluation for landslide susceptibility mapping.

    PubMed

    Feizizadeh, Bakhtiar; Shadman Roodposhti, Majid; Jankowski, Piotr; Blaschke, Thomas

    2014-12-01

    Landslide susceptibility mapping (LSM) is making increasing use of GIS-based spatial analysis in combination with multi-criteria evaluation (MCE) methods. We have developed a new multi-criteria decision analysis (MCDA) method for LSM and applied it to the Izeh River basin in south-western Iran. Our method is based on fuzzy membership functions (FMFs) derived from GIS analysis. It makes use of nine causal landslide factors identified by local landslide experts. Fuzzy set theory was first integrated with an analytical hierarchy process (AHP) in order to use pairwise comparisons to compare LSM criteria for ranking purposes. FMFs were then applied in order to determine the criteria weights to be used in the development of a landslide susceptibility map. Finally, a landslide inventory database was used to validate the LSM map by comparing it with known landslides within the study area. Results indicated that the integration of fuzzy set theory with AHP produced significantly improved accuracies and a high level of reliability in the resulting landslide susceptibility map. Approximately 53% of known landslides within our study area fell within zones classified as having "very high susceptibility", with the further 31% falling into zones classified as having "high susceptibility".

  13. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution

    PubMed Central

    Mägi, Reedik; Horikoshi, Momoko; Sofer, Tamar; Mahajan, Anubha; Kitajima, Hidetoshi; Franceschini, Nora; McCarthy, Mark I.; Morris, Andrew P.

    2017-01-01

    Abstract Trans-ethnic meta-analysis of genome-wide association studies (GWAS) across diverse populations can increase power to detect complex trait loci when the underlying causal variants are shared between ancestry groups. However, heterogeneity in allelic effects between GWAS at these loci can occur that is correlated with ancestry. Here, a novel approach is presented to detect SNP association and quantify the extent of heterogeneity in allelic effects that is correlated with ancestry. We employ trans-ethnic meta-regression to model allelic effects as a function of axes of genetic variation, derived from a matrix of mean pairwise allele frequency differences between GWAS, and implemented in the MR-MEGA software. Through detailed simulations, we demonstrate increased power to detect association for MR-MEGA over fixed- and random-effects meta-analysis across a range of scenarios of heterogeneity in allelic effects between ethnic groups. We also demonstrate improved fine-mapping resolution, in loci containing a single causal variant, compared to these meta-analysis approaches and PAINTOR, and equivalent performance to MANTRA at reduced computational cost. Application of MR-MEGA to trans-ethnic GWAS of kidney function in 71,461 individuals indicates stronger signals of association than fixed-effects meta-analysis when heterogeneity in allelic effects is correlated with ancestry. Application of MR-MEGA to fine-mapping four type 2 diabetes susceptibility loci in 22,086 cases and 42,539 controls highlights: (i) strong evidence for heterogeneity in allelic effects that is correlated with ancestry only at the index SNP for the association signal at the CDKAL1 locus; and (ii) 99% credible sets with six or fewer variants for five distinct association signals. PMID:28911207

  14. A Visual Analytics Approach for Station-Based Air Quality Data

    PubMed Central

    Du, Yi; Ma, Cuixia; Wu, Chao; Xu, Xiaowei; Guo, Yike; Zhou, Yuanchun; Li, Jianhui

    2016-01-01

    With the deployment of multi-modality and large-scale sensor networks for monitoring air quality, we are now able to collect large and multi-dimensional spatio-temporal datasets. For these sensed data, we present a comprehensive visual analysis approach for air quality analysis. This approach integrates several visual methods, such as map-based views, calendar views, and trends views, to assist the analysis. Among those visual methods, map-based visual methods are used to display the locations of interest, and the calendar and the trends views are used to discover the linear and periodical patterns. The system also provides various interaction tools to combine the map-based visualization, trends view, calendar view and multi-dimensional view. In addition, we propose a self-adaptive calendar-based controller that can flexibly adapt the changes of data size and granularity in trends view. Such a visual analytics system would facilitate big-data analysis in real applications, especially for decision making support. PMID:28029117

  15. A Visual Analytics Approach for Station-Based Air Quality Data.

    PubMed

    Du, Yi; Ma, Cuixia; Wu, Chao; Xu, Xiaowei; Guo, Yike; Zhou, Yuanchun; Li, Jianhui

    2016-12-24

    With the deployment of multi-modality and large-scale sensor networks for monitoring air quality, we are now able to collect large and multi-dimensional spatio-temporal datasets. For these sensed data, we present a comprehensive visual analysis approach for air quality analysis. This approach integrates several visual methods, such as map-based views, calendar views, and trends views, to assist the analysis. Among those visual methods, map-based visual methods are used to display the locations of interest, and the calendar and the trends views are used to discover the linear and periodical patterns. The system also provides various interaction tools to combine the map-based visualization, trends view, calendar view and multi-dimensional view. In addition, we propose a self-adaptive calendar-based controller that can flexibly adapt the changes of data size and granularity in trends view. Such a visual analytics system would facilitate big-data analysis in real applications, especially for decision making support.

  16. Robust nonlinear canonical correlation analysis: application to seasonal climate forecasting

    NASA Astrophysics Data System (ADS)

    Cannon, A. J.; Hsieh, W. W.

    2008-02-01

    Robust variants of nonlinear canonical correlation analysis (NLCCA) are introduced to improve performance on datasets with low signal-to-noise ratios, for example those encountered when making seasonal climate forecasts. The neural network model architecture of standard NLCCA is kept intact, but the cost functions used to set the model parameters are replaced with more robust variants. The Pearson product-moment correlation in the double-barreled network is replaced by the biweight midcorrelation, and the mean squared error (mse) in the inverse mapping networks can be replaced by the mean absolute error (mae). Robust variants of NLCCA are demonstrated on a synthetic dataset and are used to forecast sea surface temperatures in the tropical Pacific Ocean based on the sea level pressure field. Results suggest that adoption of the biweight midcorrelation can lead to improved performance, especially when a strong, common event exists in both predictor/predictand datasets. Replacing the mse by the mae leads to improved performance on the synthetic dataset, but not on the climate dataset except at the longest lead time, which suggests that the appropriate cost function for the inverse mapping networks is more problem dependent.

  17. A Protein Domain and Family Based Approach to Rare Variant Association Analysis.

    PubMed

    Richardson, Tom G; Shihab, Hashem A; Rivas, Manuel A; McCarthy, Mark I; Campbell, Colin; Timpson, Nicholas J; Gaunt, Tom R

    2016-01-01

    It has become common practice to analyse large scale sequencing data with statistical approaches based around the aggregation of rare variants within the same gene. We applied a novel approach to rare variant analysis by collapsing variants together using protein domain and family coordinates, regarded to be a more discrete definition of a biologically functional unit. Using Pfam definitions, we collapsed rare variants (Minor Allele Frequency ≤ 1%) together in three different ways 1) variants within single genomic regions which map to individual protein domains 2) variants within two individual protein domain regions which are predicted to be responsible for a protein-protein interaction 3) all variants within combined regions from multiple genes responsible for coding the same protein domain (i.e. protein families). A conventional collapsing analysis using gene coordinates was also undertaken for comparison. We used UK10K sequence data and investigated associations between regions of variants and lipid traits using the sequence kernel association test (SKAT). We observed no strong evidence of association between regions of variants based on Pfam domain definitions and lipid traits. Quantile-Quantile plots illustrated that the overall distributions of p-values from the protein domain analyses were comparable to that of a conventional gene-based approach. Deviations from this distribution suggested that collapsing by either protein domain or gene definitions may be favourable depending on the trait analysed. We have collapsed rare variants together using protein domain and family coordinates to present an alternative approach over collapsing across conventionally used gene-based regions. Although no strong evidence of association was detected in these analyses, future studies may still find value in adopting these approaches to detect previously unidentified association signals.

  18. Fine-mapping inflammatory bowel disease loci to single variant resolution

    PubMed Central

    Huang, Hailiang; Fang, Ming; Jostins, Luke; Mirkov, Maša Umićević; Boucher, Gabrielle; Anderson, Carl A; Andersen, Vibeke; Cleynen, Isabelle; Cortes, Adrian; Crins, François; D'Amato, Mauro; Deffontaine, Valérie; Dimitrieva, Julia; Docampo, Elisa; Elansary, Mahmoud; Farh, Kyle Kai-How; Franke, Andre; Gori, Ann-Stephan; Goyette, Philippe; Halfvarson, Jonas; Haritunians, Talin; Knight, Jo; Lawrance, Ian C; Lees, Charlie W; Louis, Edouard; Mariman, Rob; Meuwissen, Theo; Mni, Myriam; Momozawa, Yukihide; Parkes, Miles; Spain, Sarah L; Théâtre, Emilie; Trynka, Gosia; Satsangi, Jack; van Sommeren, Suzanne; Vermeire, Severine; Xavier, Ramnik J; Weersma, Rinse K; Duerr, Richard H; Mathew, Christopher G; Rioux, John D; McGovern, Dermot PB; Cho, Judy H; Georges, Michel; Daly, Mark J; Barrett, Jeffrey C

    2017-01-01

    Summary The inflammatory bowel diseases (IBD) are chronic gastrointestinal inflammatory disorders that affect millions worldwide. Genome-wide association studies have identified 200 IBD-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 IBD loci using high-density genotyping in 67,852 individuals. We pinpointed 18 associations to a single causal variant with >95% certainty, and an additional 27 associations to a single variant with >50% certainty. These 45 variants are significantly enriched for protein-coding changes (n=13), direct disruption of transcription factor binding sites (n=3) and tissue specific epigenetic marks (n=10), with the latter category showing enrichment in specific immune cells among associations stronger in CD and in gut mucosa among associations stronger in UC. The results of this study suggest that high-resolution fine-mapping in large samples can convert many GWAS discoveries into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms. PMID:28658209

  19. Quantitative trait nucleotide analysis using Bayesian model selection.

    PubMed

    Blangero, John; Goring, Harald H H; Kent, Jack W; Williams, Jeff T; Peterson, Charles P; Almasy, Laura; Dyer, Thomas D

    2005-10-01

    Although much attention has been given to statistical genetic methods for the initial localization and fine mapping of quantitative trait loci (QTLs), little methodological work has been done to date on the problem of statistically identifying the most likely functional polymorphisms using sequence data. In this paper we provide a general statistical genetic framework, called Bayesian quantitative trait nucleotide (BQTN) analysis, for assessing the likely functional status of genetic variants. The approach requires the initial enumeration of all genetic variants in a set of resequenced individuals. These polymorphisms are then typed in a large number of individuals (potentially in families), and marker variation is related to quantitative phenotypic variation using Bayesian model selection and averaging. For each sequence variant a posterior probability of effect is obtained and can be used to prioritize additional molecular functional experiments. An example of this quantitative nucleotide analysis is provided using the GAW12 simulated data. The results show that the BQTN method may be useful for choosing the most likely functional variants within a gene (or set of genes). We also include instructions on how to use our computer program, SOLAR, for association analysis and BQTN analysis.

  20. Detecting epistasis with the marginal epistasis test in genetic mapping studies of quantitative traits

    PubMed Central

    Zeng, Ping; Mukherjee, Sayan; Zhou, Xiang

    2017-01-01

    Epistasis, commonly defined as the interaction between multiple genes, is an important genetic component underlying phenotypic variation. Many statistical methods have been developed to model and identify epistatic interactions between genetic variants. However, because of the large combinatorial search space of interactions, most epistasis mapping methods face enormous computational challenges and often suffer from low statistical power due to multiple test correction. Here, we present a novel, alternative strategy for mapping epistasis: instead of directly identifying individual pairwise or higher-order interactions, we focus on mapping variants that have non-zero marginal epistatic effects—the combined pairwise interaction effects between a given variant and all other variants. By testing marginal epistatic effects, we can identify candidate variants that are involved in epistasis without the need to identify the exact partners with which the variants interact, thus potentially alleviating much of the statistical and computational burden associated with standard epistatic mapping procedures. Our method is based on a variance component model, and relies on a recently developed variance component estimation method for efficient parameter inference and p-value computation. We refer to our method as the “MArginal ePIstasis Test”, or MAPIT. With simulations, we show how MAPIT can be used to estimate and test marginal epistatic effects, produce calibrated test statistics under the null, and facilitate the detection of pairwise epistatic interactions. We further illustrate the benefits of MAPIT in a QTL mapping study by analyzing the gene expression data of over 400 individuals from the GEUVADIS consortium. PMID:28746338

  1. Admixture Mapping in Lupus Identifies Multiple Functional Variants within IFIH1 Associated with Apoptosis, Inflammation, and Autoantibody Production

    PubMed Central

    Looger, Loren L.; Han, Shizhong; Kim-Howard, Xana; Glenn, Stuart; Adler, Adam; Kelly, Jennifer A.; Niewold, Timothy B.; Gilkeson, Gary S.; Brown, Elizabeth E.; Alarcón, Graciela S.; Edberg, Jeffrey C.; Petri, Michelle; Ramsey-Goldman, Rosalind; Reveille, John D.; Vilá, Luis M.; Freedman, Barry I.; Tsao, Betty P.; Criswell, Lindsey A.; Jacob, Chaim O.; Moore, Jason H.; Vyse, Timothy J.; Langefeld, Carl L.; Guthridge, Joel M.; Gaffney, Patrick M.; Moser, Kathy L.; Scofield, R. Hal; Alarcón-Riquelme, Marta E.; Williams, Scott M.; Merrill, Joan T.; James, Judith A.; Kaufman, Kenneth M.; Kimberly, Robert P.; Harley, John B.; Nath, Swapan K.

    2013-01-01

    Systemic lupus erythematosus (SLE) is an inflammatory autoimmune disease with a strong genetic component. African-Americans (AA) are at increased risk of SLE, but the genetic basis of this risk is largely unknown. To identify causal variants in SLE loci in AA, we performed admixture mapping followed by fine mapping in AA and European-Americans (EA). Through genome-wide admixture mapping in AA, we identified a strong SLE susceptibility locus at 2q22–24 (LOD = 6.28), and the admixture signal is associated with the European ancestry (ancestry risk ratio ∼1.5). Large-scale genotypic analysis on 19,726 individuals of African and European ancestry revealed three independently associated variants in the IFIH1 gene: an intronic variant, rs13023380 [Pmeta = 5.20×10−14; odds ratio, 95% confidence interval = 0.82 (0.78–0.87)], and two missense variants, rs1990760 (Ala946Thr) [Pmeta = 3.08×10−7; 0.88 (0.84–0.93)] and rs10930046 (Arg460His) [Pdom = 1.16×10−8; 0.70 (0.62–0.79)]. Both missense variants produced dramatic phenotypic changes in apoptosis and inflammation-related gene expression. We experimentally validated function of the intronic SNP by DNA electrophoresis, protein identification, and in vitro protein binding assays. DNA carrying the intronic risk allele rs13023380 showed reduced binding efficiency to a cellular protein complex including nucleolin and lupus autoantigen Ku70/80, and showed reduced transcriptional activity in vivo. Thus, in SLE patients, genetic susceptibility could create a biochemical imbalance that dysregulates nucleolin, Ku70/80, or other nucleic acid regulatory proteins. This could promote antibody hypermutation and auto-antibody generation, further destabilizing the cellular network. Together with molecular modeling, our results establish a distinct role for IFIH1 in apoptosis, inflammation, and autoantibody production, and explain the molecular basis of these three risk alleles for SLE pathogenesis. PMID:23441136

  2. Characterization of European-ancestry NAFLD-Associated Variants in Individuals of African and Hispanic Descent

    PubMed Central

    Palmer, Nicholette D; Musani, Solomon K; Yerges-Armstrong, Laura M; Feitosa, Mary F; Bielak, Lawrence F; Hernaez, Ruben; Kahali, Bratati; Carr, J Jeffrey; Harris, Tamara B; Jhun, Min A; Kardia, Sharon LR; Langefeld, Carl D; Mosley, Thomas H; Norris, Jill M; Smith, Albert V; Taylor, Herman A; Wagenknecht, Lynne E; Liu, Jiankang; Borecki, Ingrid B; Peyser, Patricia A; Speliotes, Elizabeth K

    2013-01-01

    Nonalcoholic Fatty Liver Disease (NAFLD) is an obesity-related condition affecting over 50% of individuals in some populations and is expected to become the number one cause of liver disease worldwide by 2020. Common, robustly associated genetic variants in/near five genes were identified for hepatic steatosis, a quantifiable component of NAFLD, in European-ancestry individuals. Here we tested whether these variants were associated with hepatic steatosis in African and/or Hispanic Americans and fine-mapped the observed association signals. We measured hepatic steatosis using computed tomography in five African-American (n=3124) and one Hispanic-American (n=849) cohorts. All analyses controlled for variation in age, age2, gender, alcoholic drinks, and population substructure. Heritability of hepatic steatosis was estimated in three cohorts. Variants in/near PNPLA3, NCAN, LYPLAL1, GCKR, and PPP1R3B were tested for association with hepatic steatosis using a regression framework in each cohort and meta-analyzed. Fine-mapping across African-American cohorts was conducted using meta-analysis. African- and Hispanic-American cohorts were 33.9/37.5% male, with average age of 58.6/42.6 years and body mass index of 31.8/28.9kg/m2, respectively. Hepatic steatosis was 0.20–0.34 heritable in African-and Hispanic-American families (p<0.02 in each cohort). Variants in or near PNPLA3, NCAN, GCKR, PPP1R3B in African Americans and PNPLA3 and PPP1R3B in Hispanic Americans were significantly associated with hepatic steatosis; however, allele frequency and effect size varied across ancestries. Fine-mapping in African Americans highlighted missense variants at PNPLA3 and GCKR and redefined the association region at LYPLAL1. Conclusions We show for the first time that multiple genetic variants are associated with hepatic steatosis across ancestries and explain a substantial proportion of the genetic predisposition in African and Hispanic Americans. Missense variants in PNPLA3 and GCKR are likely functional across multiple ancestries. PMID:23564467

  3. Characterization of European ancestry nonalcoholic fatty liver disease-associated variants in individuals of African and Hispanic descent.

    PubMed

    Palmer, Nicholette D; Musani, Solomon K; Yerges-Armstrong, Laura M; Feitosa, Mary F; Bielak, Lawrence F; Hernaez, Ruben; Kahali, Bratati; Carr, J Jeffrey; Harris, Tamara B; Jhun, Min A; Kardia, Sharon L R; Langefeld, Carl D; Mosley, Thomas H; Norris, Jill M; Smith, Albert V; Taylor, Herman A; Wagenknecht, Lynne E; Liu, Jiankang; Borecki, Ingrid B; Peyser, Patricia A; Speliotes, Elizabeth K

    2013-09-01

    Nonalcoholic fatty liver disease (NAFLD) is an obesity-related condition affecting over 50% of individuals in some populations and is expected to become the number one cause of liver disease worldwide by 2020. Common, robustly associated genetic variants in/near five genes were identified for hepatic steatosis, a quantifiable component of NAFLD, in European ancestry individuals. Here we tested whether these variants were associated with hepatic steatosis in African- and/or Hispanic-Americans and fine-mapped the observed association signals. We measured hepatic steatosis using computed tomography in five African American (n = 3,124) and one Hispanic American (n = 849) cohorts. All analyses controlled for variation in age, age(2) , gender, alcoholic drinks, and population substructure. Heritability of hepatic steatosis was estimated in three cohorts. Variants in/near PNPLA3, NCAN, LYPLAL1, GCKR, and PPP1R3B were tested for association with hepatic steatosis using a regression framework in each cohort and meta-analyzed. Fine-mapping across African American cohorts was conducted using meta-analysis. African- and Hispanic-American cohorts were 33.9/37.5% male, with average age of 58.6/42.6 years and body mass index of 31.8/28.9 kg/m(2) , respectively. Hepatic steatosis was 0.20-0.34 heritable in African- and Hispanic-American families (P < 0.02 in each cohort). Variants in or near PNPLA3, NCAN, GCKR, PPP1R3B in African Americans and PNPLA3 and PPP1R3B in Hispanic Americans were significantly associated with hepatic steatosis; however, allele frequency and effect size varied across ancestries. Fine-mapping in African Americans highlighted missense variants at PNPLA3 and GCKR and redefined the association region at LYPLAL1. Multiple genetic variants are associated with hepatic steatosis across ancestries. This explains a substantial proportion of the genetic predisposition in African- and Hispanic-Americans. Missense variants in PNPLA3 and GCKR are likely functional across multiple ancestries. © 2013 by the American Association for the Study of Liver Diseases.

  4. Mapping rare and common causal alleles for complex human diseases

    PubMed Central

    Raychaudhuri, Soumya

    2011-01-01

    Advances in genotyping and sequencing technologies have revolutionized the genetics of complex disease by locating rare and common variants that influence an individual’s risk for diseases, such as diabetes, cancers, and psychiatric disorders. However, to capitalize on this data for prevention and therapies requires the identification of causal alleles and a mechanistic understanding for how these variants contribute to the disease. After discussing the strategies currently used to map variants for complex diseases, this Primer explores how variants may be prioritized for follow-up functional studies and the challenges and approaches for assessing the contributions of rare and common variants to disease phenotypes. PMID:21962507

  5. HAPRAP: a haplotype-based iterative method for statistical fine mapping using GWAS summary statistics.

    PubMed

    Zheng, Jie; Rodriguez, Santiago; Laurin, Charles; Baird, Denis; Trela-Larsen, Lea; Erzurumluoglu, Mesut A; Zheng, Yi; White, Jon; Giambartolomei, Claudia; Zabaneh, Delilah; Morris, Richard; Kumari, Meena; Casas, Juan P; Hingorani, Aroon D; Evans, David M; Gaunt, Tom R; Day, Ian N M

    2017-01-01

    Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients ([Formula: see text]) of the variants. However, haplotypes rather than pairwise [Formula: see text], are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this article, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel. Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits height data, HAPRAP performs well with a small training sample size (N < 2000) while other methods become suboptimal. Moreover, HAPRAP's performance is not affected substantially by single nucleotide polymorphisms (SNPs) with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization). The HAPRAP package and documentation are available at http://apps.biocompute.org.uk/haprap/ CONTACT: : jie.zheng@bristol.ac.uk or tom.gaunt@bristol.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  6. An investigation of causes of false positive single nucleotide polymorphisms using simulated reads from a small eukaryote genome.

    PubMed

    Ribeiro, Antonio; Golicz, Agnieszka; Hackett, Christine Anne; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J; Bayer, Micha

    2015-11-11

    Single Nucleotide Polymorphisms (SNPs) are widely used molecular markers, and their use has increased massively since the inception of Next Generation Sequencing (NGS) technologies, which allow detection of large numbers of SNPs at low cost. However, both NGS data and their analysis are error-prone, which can lead to the generation of false positive (FP) SNPs. We explored the relationship between FP SNPs and seven factors involved in mapping-based variant calling - quality of the reference sequence, read length, choice of mapper and variant caller, mapping stringency and filtering of SNPs by read mapping quality and read depth. This resulted in 576 possible factor level combinations. We used error- and variant-free simulated reads to ensure that every SNP found was indeed a false positive. The variation in the number of FP SNPs generated ranged from 0 to 36,621 for the 120 million base pairs (Mbp) genome. All of the experimental factors tested had statistically significant effects on the number of FP SNPs generated and there was a considerable amount of interaction between the different factors. Using a fragmented reference sequence led to a dramatic increase in the number of FP SNPs generated, as did relaxed read mapping and a lack of SNP filtering. The choice of reference assembler, mapper and variant caller also significantly affected the outcome. The effect of read length was more complex and suggests a possible interaction between mapping specificity and the potential for contributing more false positives as read length increases. The choice of tools and parameters involved in variant calling can have a dramatic effect on the number of FP SNPs produced, with particularly poor combinations of software and/or parameter settings yielding tens of thousands in this experiment. Between-factor interactions make simple recommendations difficult for a SNP discovery pipeline but the quality of the reference sequence is clearly of paramount importance. Our findings are also a stark reminder that it can be unwise to use the relaxed mismatch settings provided as defaults by some read mappers when reads are being mapped to a relatively unfinished reference sequence from e.g. a non-model organism in its early stages of genomic exploration.

  7. A map of human microRNA variation uncovers unexpectedly high levels of variability

    PubMed Central

    2012-01-01

    Background MicroRNAs (miRNAs) are key components of the gene regulatory network in many species. During the past few years, these regulatory elements have been shown to be involved in an increasing number and range of diseases. Consequently, the compilation of a comprehensive map of natural variability in a healthy population seems an obvious requirement for future research on miRNA-related pathologies. Methods Data on 14 populations from the 1000 Genomes Project were analyzed, along with new data extracted from 60 exomes of healthy individuals from a population from southern Spain, sequenced in the context of the Medical Genome Project, to derive an accurate map of miRNA variability. Results Despite the common belief that miRNAs are highly conserved elements, analysis of the sequences of the 1,152 individuals indicated that the observed level of variability is double what was expected. A total of 527 variants were found. Among these, 45 variants affected the recognition region of the corresponding miRNA and were found in 43 different miRNAs, 26 of which are known to be involved in 57 diseases. Different parts of the mature structure of the miRNA were affected to different degrees by variants, which suggests the existence of a selective pressure related to the relative functional impact of the change. Moreover, 41 variants showed a significant deviation from the Hardy-Weinberg equilibrium, which supports the existence of a selective process against some alleles. The average number of variants per individual in miRNAs was 28. Conclusions Despite an expectation that miRNAs would be highly conserved genomic elements, our study reports a level of variability comparable to that observed for coding genes. PMID:22906193

  8. Discovering transcription factor binding sites in highly repetitive regions of genomes with multi-read analysis of ChIP-Seq data.

    PubMed

    Chung, Dongjun; Kuan, Pei Fen; Li, Bo; Sanalkumar, Rajendran; Liang, Kun; Bresnick, Emery H; Dewey, Colin; Keleş, Sündüz

    2011-07-01

    Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is rapidly replacing chromatin immunoprecipitation combined with genome-wide tiling array analysis (ChIP-chip) as the preferred approach for mapping transcription-factor binding sites and chromatin modifications. The state of the art for analyzing ChIP-seq data relies on using only reads that map uniquely to a relevant reference genome (uni-reads). This can lead to the omission of up to 30% of alignable reads. We describe a general approach for utilizing reads that map to multiple locations on the reference genome (multi-reads). Our approach is based on allocating multi-reads as fractional counts using a weighted alignment scheme. Using human STAT1 and mouse GATA1 ChIP-seq datasets, we illustrate that incorporation of multi-reads significantly increases sequencing depths, leads to detection of novel peaks that are not otherwise identifiable with uni-reads, and improves detection of peaks in mappable regions. We investigate various genome-wide characteristics of peaks detected only by utilization of multi-reads via computational experiments. Overall, peaks from multi-read analysis have similar characteristics to peaks that are identified by uni-reads except that the majority of them reside in segmental duplications. We further validate a number of GATA1 multi-read only peaks by independent quantitative real-time ChIP analysis and identify novel target genes of GATA1. These computational and experimental results establish that multi-reads can be of critical importance for studying transcription factor binding in highly repetitive regions of genomes with ChIP-seq experiments.

  9. Hepatitis C Virus Antigenic Convergence

    PubMed Central

    Campo, David S.; Dimitrova, Zoya; Yokosawa, Jonny; Hoang, Duc; Perez, Nestor O.; Ramachandran, Sumathi; Khudyakov, Yury

    2012-01-01

    Vaccine development against hepatitis C virus (HCV) is hindered by poor understanding of factors defining cross-immunoreactivity among heterogeneous epitopes. Using synthetic peptides and mouse immunization as a model, we conducted a quantitative analysis of cross-immunoreactivity among variants of the HCV hypervariable region 1 (HVR1). Analysis of 26,883 immunological reactions among pairs of peptides showed that the distribution of cross-immunoreactivity among HVR1 variants was skewed, with antibodies against a few variants reacting with all tested peptides. The HVR1 cross-immunoreactivity was accurately modeled based on amino acid sequence alone. The tested peptides were mapped in the HVR1 sequence space, which was visualized as a network of 11,319 sequences. The HVR1 variants with a greater network centrality showed a broader cross-immunoreactivity. The entire sequence space is explored by each HCV genotype and subtype. These findings indicate that HVR1 antigenic diversity is extensively convergent and effectively limited, suggesting significant implications for vaccine development. PMID:22355779

  10. Variant-aware saturating mutagenesis using multiple Cas9 nucleases identifies regulatory elements at trait-associated loci.

    PubMed

    Canver, Matthew C; Lessard, Samuel; Pinello, Luca; Wu, Yuxuan; Ilboudo, Yann; Stern, Emily N; Needleman, Austen J; Galactéros, Frédéric; Brugnara, Carlo; Kutlar, Abdullah; McKenzie, Colin; Reid, Marvin; Chen, Diane D; Das, Partha Pratim; A Cole, Mitchel; Zeng, Jing; Kurita, Ryo; Nakamura, Yukio; Yuan, Guo-Cheng; Lettre, Guillaume; Bauer, Daniel E; Orkin, Stuart H

    2017-04-01

    Cas9-mediated, high-throughput, saturating in situ mutagenesis permits fine-mapping of function across genomic segments. Disease- and trait-associated variants identified in genome-wide association studies largely cluster at regulatory loci. Here we demonstrate the use of multiple designer nucleases and variant-aware library design to interrogate trait-associated regulatory DNA at high resolution. We developed a computational tool for the creation of saturating-mutagenesis libraries with single or multiple nucleases with incorporation of variants. We applied this methodology to the HBS1L-MYB intergenic region, which is associated with red-blood-cell traits, including fetal hemoglobin levels. This approach identified putative regulatory elements that control MYB expression. Analysis of genomic copy number highlighted potential false-positive regions, thus emphasizing the importance of off-target analysis in the design of saturating-mutagenesis experiments. Together, these data establish a widely applicable high-throughput and high-resolution methodology to identify minimal functional sequences within large disease- and trait-associated regions.

  11. SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing.

    PubMed

    Sato, Yukuto; Kojima, Kaname; Nariai, Naoki; Yamaguchi-Kabata, Yumi; Kawai, Yosuke; Takahashi, Mamoru; Mimori, Takahiro; Nagasaki, Masao

    2014-08-08

    Next-generation sequencers (NGSs) have become one of the main tools for current biology. To obtain useful insights from the NGS data, it is essential to control low-quality portions of the data affected by technical errors such as air bubbles in sequencing fluidics. We develop a software SUGAR (subtile-based GUI-assisted refiner) which can handle ultra-high-throughput data with user-friendly graphical user interface (GUI) and interactive analysis capability. The SUGAR generates high-resolution quality heatmaps of the flowcell, enabling users to find possible signals of technical errors during the sequencing. The sequencing data generated from the error-affected regions of a flowcell can be selectively removed by automated analysis or GUI-assisted operations implemented in the SUGAR. The automated data-cleaning function based on sequence read quality (Phred) scores was applied to a public whole human genome sequencing data and we proved the overall mapping quality was improved. The detailed data evaluation and cleaning enabled by SUGAR would reduce technical problems in sequence read mapping, improving subsequent variant analysis that require high-quality sequence data and mapping results. Therefore, the software will be especially useful to control the quality of variant calls to the low population cells, e.g., cancers, in a sample with technical errors of sequencing procedures.

  12. Variation resources at UC Santa Cruz.

    PubMed

    Thomas, Daryl J; Trumbower, Heather; Kern, Andrew D; Rhead, Brooke L; Kuhn, Robert M; Haussler, David; Kent, W James

    2007-01-01

    The variation resources within the University of California Santa Cruz Genome Browser include polymorphism data drawn from public collections and analyses of these data, along with their display in the context of other genomic annotations. Primary data from dbSNP is included for many organisms, with added information including genomic alleles and orthologous alleles for closely related organisms. Display filtering and coloring is available by variant type, functional class or other annotations. Annotation of potential errors is highlighted and a genomic alignment of the variant's flanking sequence is displayed. HapMap allele frequencies and linkage disequilibrium (LD) are available for each HapMap population, along with non-human primate alleles. The browsing and analysis tools, downloadable data files and links to documentation and other information can be found at http://genome.ucsc.edu/.

  13. GenPlay Multi-Genome, a tool to compare and analyze multiple human genomes in a graphical interface.

    PubMed

    Lajugie, Julien; Fourel, Nicolas; Bouhassira, Eric E

    2015-01-01

    Parallel visualization of multiple individual human genomes is a complex endeavor that is rapidly gaining importance with the increasing number of personal, phased and cancer genomes that are being generated. It requires the display of variants such as SNPs, indels and structural variants that are unique to specific genomes and the introduction of multiple overlapping gaps in the reference sequence. Here, we describe GenPlay Multi-Genome, an application specifically written to visualize and analyze multiple human genomes in parallel. GenPlay Multi-Genome is ideally suited for the comparison of allele-specific expression and functional genomic data obtained from multiple phased genomes in a graphical interface with access to multiple-track operation. It also allows the analysis of data that have been aligned to custom genomes rather than to a standard reference and can be used as a variant calling format file browser and as a tool to compare different genome assembly, such as hg19 and hg38. GenPlay is available under the GNU public license (GPL-3) from http://genplay.einstein.yu.edu. The source code is available at https://github.com/JulienLajugie/GenPlay. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. Unique Variants in OPN1LW Cause Both Syndromic and Nonsyndromic X-Linked High Myopia Mapped to MYP1.

    PubMed

    Li, Jiali; Gao, Bei; Guan, Liping; Xiao, Xueshan; Zhang, Jianguo; Li, Shiqiang; Jiang, Hui; Jia, Xiaoyun; Yang, Jianhua; Guo, Xiangming; Yin, Ye; Wang, Jun; Zhang, Qingjiong

    2015-06-01

    MYP1 is a locus for X-linked syndromic and nonsyndromic high myopia. Recently, unique haplotypes in OPN1LW were found to be responsible for X-linked syndromic high myopia mapped to MYP1. The current study is to test if such variants in OPN1LW are also responsible for X-linked nonsyndromic high myopia mapped to MYP1. The proband of the family previously mapped to MYP1 was initially analyzed using whole-exome sequencing and whole-genome sequencing. Additional probands with early-onset high myopia were analyzed using whole-exome sequencing. Variants in OPN1LW were selected and confirmed by Sanger sequencing. Long-range and second PCR were used to determine the haplotype and the first gene of the red-green gene array. Candidate variants were further validated in family members and controls. The unique LVAVA haplotype in OPN1LW was detected in the family with X-linked nonsyndromic high myopia mapped to MYP1. In addition, this haplotype and a novel frameshift mutation (c.617_620dup, p.Phe208Argfs*51) in OPN1LW were detected in two other families with X-linked high myopia. The unique haplotype cosegregated with high myopia in the two families, with a maximum LOD score of 3.34 and 2.31 at θ = 0. OPN1LW with the variants in these families was the first gene in the red-green gene array and was not present in 247 male controls. Reevaluation of the clinical data in both families with the unique haplotype suggested nonsyndromic high myopia. Our study confirms the findings that unique variants in OPN1LW are responsible for both syndromic and nonsyndromic X-linked high myopia mapped to MYP1.

  15. Variations on a theme of Lander and Waterman

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Speed, T.

    1997-12-01

    The original Lander and Waterman mathematical analysis was for fingerprinting random clones. Since that time, a number of variants of their theory have appeared, including ones which apply to mapping by anchoring random clones, and to non-random or directed clone mapping. The same theory is now widely used to devise random sequencing strategies. In this talk I will review these developments, and go on the discuss the theory required for directed sequencing strategies.

  16. A Rb1 promoter variant with reduced activity contributes to osteosarcoma susceptibility in irradiated mice

    PubMed Central

    2014-01-01

    Background Syndromic forms of osteosarcoma (OS) account for less than 10% of all recorded cases of this malignancy. An individual OS predisposition is also possible by the inheritance of low penetrance alleles of tumor susceptibility genes, usually without evidence of a syndromic condition. Genetic variants involved in such a non-syndromic form of tumor predisposition are difficult to identify, given the low incidence of osteosarcoma cases and the genetic heterogeneity of patients. We recently mapped a major OS susceptibility QTL to mouse chromosome 14 by comparing alpha-radiation induced osteosarcoma in mouse strains which differ in their tumor susceptibility. Methods Tumor-specific allelic losses in murine osteosacoma were mapped along chromosome 14 using microsatellite markers and SNP allelotyping. Candidate gene search in the mapped interval was refined using PosMed data mining and mRNA expression analysis in normal osteoblasts. A strain-specific promoter variant in Rb1 was tested for its influence on mRNA expression using reporter assay. Results A common Rb1 allele derived from the BALB/cHeNhg strain was identified as the major determinant of radiation-induced OS risk at this locus. Increased OS-risk is linked with a hexanucleotide deletion in the promoter region which is predicted to change WT1 and SP1 transcription factor-binding sites. Both in-vitro reporter and in-vivo expression assays confirmed an approx. 1.5 fold reduced gene expression by this promoter variant. Concordantly, the 50% reduction in Rb1 expression in mice bearing a conditional hemizygous Rb1 deletion causes a significant rise of OS incidence following alpha-irradiation. Conclusion This is the first experimental demonstration of a functional and genetic link between reduced Rb1 expression from a common promoter variant and increased tumor risk after radiation exposure. We propose that a reduced Rb1 expression by common variants in regulatory regions can modify the risk for a malignant transformation of bone cells after radiation exposure. PMID:25092376

  17. An xQTL map integrates the genetic architecture of the human brain's transcriptome and epigenome.

    PubMed

    Ng, Bernard; White, Charles C; Klein, Hans-Ulrich; Sieberts, Solveig K; McCabe, Cristin; Patrick, Ellis; Xu, Jishu; Yu, Lei; Gaiteri, Chris; Bennett, David A; Mostafavi, Sara; De Jager, Philip L

    2017-10-01

    We report a multi-omic resource generated by applying quantitative trait locus (xQTL) analyses to RNA sequence, DNA methylation and histone acetylation data from the dorsolateral prefrontal cortex of 411 older adults who have all three data types. We identify SNPs significantly associated with gene expression, DNA methylation and histone modification levels. Many of these SNPs influence multiple molecular features, and we demonstrate that SNP effects on RNA expression are fully mediated by epigenetic features in 9% of these loci. Further, we illustrate the utility of our new resource, xQTL Serve, by using it to prioritize the cell type(s) most affected by an xQTL. We also reanalyze published genome wide association studies using an xQTL-weighted analysis approach and identify 18 new schizophrenia and 2 new bipolar susceptibility variants, which is more than double the number of loci that can be discovered with a larger blood-based expression eQTL resource.

  18. Fine-mapping inflammatory bowel disease loci to single-variant resolution.

    PubMed

    Huang, Hailiang; Fang, Ming; Jostins, Luke; Umićević Mirkov, Maša; Boucher, Gabrielle; Anderson, Carl A; Andersen, Vibeke; Cleynen, Isabelle; Cortes, Adrian; Crins, François; D'Amato, Mauro; Deffontaine, Valérie; Dmitrieva, Julia; Docampo, Elisa; Elansary, Mahmoud; Farh, Kyle Kai-How; Franke, Andre; Gori, Ann-Stephan; Goyette, Philippe; Halfvarson, Jonas; Haritunians, Talin; Knight, Jo; Lawrance, Ian C; Lees, Charlie W; Louis, Edouard; Mariman, Rob; Meuwissen, Theo; Mni, Myriam; Momozawa, Yukihide; Parkes, Miles; Spain, Sarah L; Théâtre, Emilie; Trynka, Gosia; Satsangi, Jack; van Sommeren, Suzanne; Vermeire, Severine; Xavier, Ramnik J; Weersma, Rinse K; Duerr, Richard H; Mathew, Christopher G; Rioux, John D; McGovern, Dermot P B; Cho, Judy H; Georges, Michel; Daly, Mark J; Barrett, Jeffrey C

    2017-07-13

    Inflammatory bowel diseases are chronic gastrointestinal inflammatory disorders that affect millions of people worldwide. Genome-wide association studies have identified 200 inflammatory bowel disease-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 inflammatory bowel disease loci using high-density genotyping in 67,852 individuals. We pinpoint 18 associations to a single causal variant with greater than 95% certainty, and an additional 27 associations to a single variant with greater than 50% certainty. These 45 variants are significantly enriched for protein-coding changes (n = 13), direct disruption of transcription-factor binding sites (n = 3), and tissue-specific epigenetic marks (n = 10), with the last category showing enrichment in specific immune cells among associations stronger in Crohn's disease and in gut mucosa among associations stronger in ulcerative colitis. The results of this study suggest that high-resolution fine-mapping in large samples can convert many discoveries from genome-wide association studies into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms.

  19. MAI-free performance of PMU-OFDM transceiver in time-variant environment

    NASA Astrophysics Data System (ADS)

    Tadjpour, Layla; Tsai, Shang-Ho; Kuo, C.-C. J.

    2005-06-01

    An approximately multi-user OFDM transceiver was introduced to reduce the multi-access interference (MAI ) due to the carrier frequency offset (CFO) to a negligible amount via precoding by Tsai, Lin and Kuo. In this work, we investigate the performance of this precoded multi-user (PMU) OFDM system in a time-variant channel environment. We analyze and compare the MAI effect caused by time-variant channels in the PMU-OFDM and the OFDMA systems. Generally speaking, the MAI effect consists of two parts. The first part is due to the loss of orthogonality among subchannels for all users while the second part is due to the CFO effect caused by the Doppler shift. Simulation results show that, although OFDMA outperforms the PMU-OFDM transceiver in a fast time-variant environment without CFO, PMU-OFDM outperforms OFDMA in a slow time-variant channel via the use of M/2 symmetric or anti-symmetric codewords of M Hadamard-Walsh codes.

  20. Characterization of Centromeric Histone H3 (CENH3) Variants in Cultivated and Wild Carrots (Daucus sp.)

    PubMed Central

    Dunemann, Frank; Schrader, Otto; Budahn, Holger; Houben, Andreas

    2014-01-01

    In eukaryotes, centromeres are the assembly sites for the kinetochore, a multi-protein complex to which spindle microtubules are attached at mitosis and meiosis, thereby ensuring segregation of chromosomes during cell division. They are specified by incorporation of CENH3, a centromere specific histone H3 variant which replaces canonical histone H3 in the nucleosomes of functional centromeres. To lay a first foundation of a putative alternative haploidization strategy based on centromere-mediated genome elimination in cultivated carrots, in the presented research we aimed at the identification and cloning of functional CENH3 genes in Daucus carota and three distantly related wild species of genus Daucus varying in basic chromosome numbers. Based on mining the carrot transcriptome followed by a subsequent PCR-based cloning, homologous coding sequences for CENH3s of the four Daucus species were identified. The ORFs of the CENH3 variants were very similar, and an amino acid sequence length of 146 aa was found in three out of the four species. Comparison of Daucus CENH3 amino acid sequences with those of other plant CENH3s as well as their phylogenetic arrangement among other dicot CENH3s suggest that the identified genes are authentic CENH3 homologs. To verify the location of the CENH3 protein in the kinetochore regions of the Daucus chromosomes, a polyclonal antibody based on a peptide corresponding to the N-terminus of DcCENH3 was developed and used for anti-CENH3 immunostaining of mitotic root cells. The chromosomal location of CENH3 proteins in the centromere regions of the chromosomes could be confirmed. For genetic localization of the CENH3 gene in the carrot genome, a previously constructed linkage map for carrot was used for mapping a CENH3-specific simple sequence repeat (SSR) marker, and the CENH3 locus was mapped on the carrot chromosome 9. PMID:24887084

  1. VLBI-resolution radio-map algorithms: Performance analysis of different levels of data-sharing on multi-socket, multi-core architectures

    NASA Astrophysics Data System (ADS)

    Tabik, S.; Romero, L. F.; Mimica, P.; Plata, O.; Zapata, E. L.

    2012-09-01

    A broad area in astronomy focuses on simulating extragalactic objects based on Very Long Baseline Interferometry (VLBI) radio-maps. Several algorithms in this scope simulate what would be the observed radio-maps if emitted from a predefined extragalactic object. This work analyzes the performance and scaling of this kind of algorithms on multi-socket, multi-core architectures. In particular, we evaluate a sharing approach, a privatizing approach and a hybrid approach on systems with complex memory hierarchy that includes shared Last Level Cache (LLC). In addition, we investigate which manual processes can be systematized and then automated in future works. The experiments show that the data-privatizing model scales efficiently on medium scale multi-socket, multi-core systems (up to 48 cores) while regardless of algorithmic and scheduling optimizations, the sharing approach is unable to reach acceptable scalability on more than one socket. However, the hybrid model with a specific level of data-sharing provides the best scalability over all used multi-socket, multi-core systems.

  2. Population sequencing reveals breed and sub-species specific CNVs in cattle

    USDA-ARS?s Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...

  3. Mortar and artillery variants classification by exploiting characteristics of the acoustic signature

    NASA Astrophysics Data System (ADS)

    Hohil, Myron E.; Grasing, David; Desai, Sachi; Morcos, Amir

    2007-10-01

    Feature extraction methods based on the discrete wavelet transform and multiresolution analysis facilitate the development of a robust classification algorithm that reliably discriminates mortar and artillery variants via acoustic signals produced during the launch/impact events. Utilizing acoustic sensors to exploit the sound waveform generated from the blast for the identification of mortar and artillery variants. Distinct characteristics arise within the different mortar variants because varying HE mortar payloads and related charges emphasize concussive and shrapnel effects upon impact employing varying magnitude explosions. The different mortar variants are characterized by variations in the resulting waveform of the event. The waveform holds various harmonic properties distinct to a given mortar/artillery variant that through advanced signal processing techniques can employed to classify a given set. The DWT and other readily available signal processing techniques will be used to extract the predominant components of these characteristics from the acoustic signatures at ranges exceeding 2km. Exploiting these techniques will help develop a feature set highly independent of range, providing discrimination based on acoustic elements of the blast wave. Highly reliable discrimination will be achieved with a feed-forward neural network classifier trained on a feature space derived from the distribution of wavelet coefficients, frequency spectrum, and higher frequency details found within different levels of the multiresolution decomposition. The process that will be described herein extends current technologies, which emphasis multi modal sensor fusion suites to provide such situational awareness. A two fold problem of energy consumption and line of sight arise with the multi modal sensor suites. The process described within will exploit the acoustic properties of the event to provide variant classification as added situational awareness to the solider.

  4. Automated Geo/Co-Registration of Multi-Temporal Very-High-Resolution Imagery.

    PubMed

    Han, Youkyung; Oh, Jaehong

    2018-05-17

    For time-series analysis using very-high-resolution (VHR) multi-temporal satellite images, both accurate georegistration to the map coordinates and subpixel-level co-registration among the images should be conducted. However, applying well-known matching methods, such as scale-invariant feature transform and speeded up robust features for VHR multi-temporal images, has limitations. First, they cannot be used for matching an optical image to heterogeneous non-optical data for georegistration. Second, they produce a local misalignment induced by differences in acquisition conditions, such as acquisition platform stability, the sensor's off-nadir angle, and relief displacement of the considered scene. Therefore, this study addresses the problem by proposing an automated geo/co-registration framework for full-scene multi-temporal images acquired from a VHR optical satellite sensor. The proposed method comprises two primary steps: (1) a global georegistration process, followed by (2) a fine co-registration process. During the first step, two-dimensional multi-temporal satellite images are matched to three-dimensional topographic maps to assign the map coordinates. During the second step, a local analysis of registration noise pixels extracted between the multi-temporal images that have been mapped to the map coordinates is conducted to extract a large number of well-distributed corresponding points (CPs). The CPs are finally used to construct a non-rigid transformation function that enables minimization of the local misalignment existing among the images. Experiments conducted on five Kompsat-3 full scenes confirmed the effectiveness of the proposed framework, showing that the georegistration performance resulted in an approximately pixel-level accuracy for most of the scenes, and the co-registration performance further improved the results among all combinations of the georegistered Kompsat-3 image pairs by increasing the calculated cross-correlation values.

  5. Fine Structure in Multi-Phase Zr8Ni21-Zr7Ni10-Zr2Ni7 Alloy Revealed by Transmission Electron Microscope

    PubMed Central

    Shen, Haoting; Bendersky, Leonid A.; Young, Kwo; Nei, Jean

    2015-01-01

    The microstructure of an annealed alloy with a Zr8Ni21 composition was studied by both scanning electron microscopy (SEM) and transmission electron microscopy (TEM). The presence of three phases, Zr8Ni21, Zr2Ni7, and Zr7Ni10, was confirmed by SEM/X-ray energy dispersive spectroscopy compositional mapping and TEM electron diffraction. Distribution of the phases and their morphology can be linked to a multi-phase structure formed by a sequence of reactions: (1) L → Zr2Ni7 + L’; (2) peritectic Zr2Ni7 + L’ → Zr2Ni7 + Zr8Ni21 + L”; (3) eutectic L” → Zr8Ni21 + Zr7Ni10. The effect of annealing at 960 °C, which was intended to convert a cast structure into a single-phase Zr8Ni21 structure, was only moderate and the resulting alloy was still multi-phased. TEM and crystallographic analysis of the Zr2Ni7 phase show a high density of planar (001) defects that were explained as low-energy boundaries between rotational variants and stacking faults. The crystallographic features arise from the pseudo-hexagonal structure of Zr2Ni7. This highly defective Zr2Ni7 phase was identified as the source of the broad X-ray diffraction peaks at around 38.4° and 44.6° when a Cu-K was used as the radiation source. PMID:28793460

  6. Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework.

    PubMed

    Glusman, Gustavo; Rose, Peter W; Prlić, Andreas; Dougherty, Jennifer; Duarte, José M; Hoffman, Andrew S; Barton, Geoffrey J; Bendixen, Emøke; Bergquist, Timothy; Bock, Christian; Brunk, Elizabeth; Buljan, Marija; Burley, Stephen K; Cai, Binghuang; Carter, Hannah; Gao, JianJiong; Godzik, Adam; Heuer, Michael; Hicks, Michael; Hrabe, Thomas; Karchin, Rachel; Leman, Julia Koehler; Lane, Lydie; Masica, David L; Mooney, Sean D; Moult, John; Omenn, Gilbert S; Pearl, Frances; Pejaver, Vikas; Reynolds, Sheila M; Rokem, Ariel; Schwede, Torsten; Song, Sicheng; Tilgner, Hagen; Valasatava, Yana; Zhang, Yang; Deutsch, Eric W

    2017-12-18

    The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.

  7. The Topic Analysis of Hospice Care Research Using Co-word Analysis and GHSOM

    NASA Astrophysics Data System (ADS)

    Yang, Yu-Hsiang; Bhikshu, Huimin; Tsaih, Rua-Huan

    The purpose of this study was to propose a multi-layer topic map analysis of palliative care research using co-word analysis of informetrics with Growing Hierarchical Self-Organizing Map (GHSOM). The topic map illustrated the delicate intertwining of subject areas and provided a more explicit illustration of the concepts within each subject area. We applied GHSOM, a text-mining Neural Networks tool, to obtain a hierarchical topic map. The result of the topic map may indicate that the subject area of health care science and service played an importance role in multidiscipline within the research related to palliative care.

  8. Mapping DNA Methylation with High Throughput Nanopore Sequencing

    PubMed Central

    Rand, Arthur C.; Jain, Miten; Eizenga, Jordan M.; Musselman-Brown, Audrey; Olsen, Hugh E.; Akeson, Mark

    2017-01-01

    Chemical modifications to DNA regulate its biological function. We present a framework for mapping methylation to cytosine and adenosine with the Oxford Nanopore Technologies MinION using its ionic current signal. We map three cytosine variants and two adenine variants. The results show that our model is sensitive enough to detect changes in genomic DNA methylation levels as a function of growth phase in E. coli. PMID:28218897

  9. Revisions to the 1995 map of ecological subregions that affect users of the southern variant of the Forest Vegetation Simulator

    Treesearch

    W. Henry McNab; Chad E. Keyser

    2011-01-01

    The Southern Variant of the Forest Vegetation Simulator utilizes ecological units mapped in 1995 by the Forest Service, U.S. Department of Agriculture, to refine tree growth models for the Southern United States. The 2007 revision of the 1995 map resulted in changes of identification and boundary delineation for some ecoregion units. In this report, we summarize the...

  10. Comparing GWAS Results of Complex Traits Using Full Genetic Model and Additive Models for Revealing Genetic Architecture

    PubMed Central

    Monir, Md. Mamun; Zhu, Jun

    2017-01-01

    Most of the genome-wide association studies (GWASs) for human complex diseases have ignored dominance, epistasis and ethnic interactions. We conducted comparative GWASs for total cholesterol using full model and additive models, which illustrate the impacts of the ignoring genetic variants on analysis results and demonstrate how genetic effects of multiple loci could differ across different ethnic groups. There were 15 quantitative trait loci with 13 individual loci and 3 pairs of epistasis loci identified by full model, whereas only 14 loci (9 common loci and 5 different loci) identified by multi-loci additive model. Again, 4 full model detected loci were not detected using multi-loci additive model. PLINK-analysis identified two loci and GCTA-analysis detected only one locus with genome-wide significance. Full model identified three previously reported genes as well as several new genes. Bioinformatics analysis showed some new genes are related with cholesterol related chemicals and/or diseases. Analyses of cholesterol data and simulation studies revealed that the full model performs were better than the additive-model performs in terms of detecting power and unbiased estimations of genetic variants of complex traits. PMID:28079101

  11. Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification

    PubMed Central

    Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei

    2013-01-01

    Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724

  12. Extended Linkage Disequilibrium Surrounding the Hemoglobin E Variant Due to Malarial Selection

    PubMed Central

    Ohashi, Jun ; Naka, Izumi ; Patarapotikul, Jintana ; Hananantachai, Hathairad ; Brittenham, Gary ; Looareesuwan, Sornchai ; Clark, Andrew G. ; Tokunaga, Katsushi 

    2004-01-01

    The hemoglobin E variant (HbE; β26Glu→Lys) is concentrated in parts of Southeast Asia where malaria is endemic, and HbE carrier status has been shown to confer some protection against Plasmodium falciparum malaria. To examine the effect of natural selection on the pattern of linkage disequilibrium (LD) and to infer the evolutionary history of the HbE variant, we analyzed biallelic markers surrounding the HbE variant in a Thai population. Pairwise LD analysis of HbE and 43 surrounding biallelic markers revealed LD of HbE extending beyond 100 kb, whereas no LD was observed between non-HbE variants and the same markers. The inferred haplotype network suggests a single origin of the HbE variant in the Thai population. Forward-in-time computer simulations under a variety of selection models indicate that the HbE variant arose 1,240–4,440 years ago. These results support the conjecture that the HbE mutation occurred recently, and the allele frequency has increased rapidly. Our study provides another clear demonstration that a high-resolution LD map across the human genome can detect recent variants that have been subjected to positive selection. PMID:15114532

  13. Extended linkage disequilibrium surrounding the hemoglobin E variant due to malarial selection.

    PubMed

    Ohashi, Jun; Naka, Izumi; Patarapotikul, Jintana; Hananantachai, Hathairad; Brittenham, Gary; Looareesuwan, Sornchai; Clark, Andrew G; Tokunaga, Katsushi

    2004-06-01

    The hemoglobin E variant (HbE; ( beta )26Glu-->Lys) is concentrated in parts of Southeast Asia where malaria is endemic, and HbE carrier status has been shown to confer some protection against Plasmodium falciparum malaria. To examine the effect of natural selection on the pattern of linkage disequilibrium (LD) and to infer the evolutionary history of the HbE variant, we analyzed biallelic markers surrounding the HbE variant in a Thai population. Pairwise LD analysis of HbE and 43 surrounding biallelic markers revealed LD of HbE extending beyond 100 kb, whereas no LD was observed between non-HbE variants and the same markers. The inferred haplotype network suggests a single origin of the HbE variant in the Thai population. Forward-in-time computer simulations under a variety of selection models indicate that the HbE variant arose 1,240-4,440 years ago. These results support the conjecture that the HbE mutation occurred recently, and the allele frequency has increased rapidly. Our study provides another clear demonstration that a high-resolution LD map across the human genome can detect recent variants that have been subjected to positive selection.

  14. Fast single-pass alignment and variant calling using sequencing data

    USDA-ARS?s Scientific Manuscript database

    Sequencing research requires efficient computation. Few programs use already known information about DNA variants when aligning sequence data to the reference map. New program findmap.f90 reads the previous variant list before aligning sequence, calling variant alleles, and summing the allele counts...

  15. Population sequencing reveals breed and sub-species specific CNVs in cattle

    USDA-ARS?s Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect the rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an incre...

  16. Decoding the Effect of Isobaric Substitutions on Identifying Missing Proteins and Variant Peptides in Human Proteome.

    PubMed

    Choong, Wai-Kok; Lih, Tung-Shing Mamie; Chen, Yu-Ju; Sung, Ting-Yi

    2017-12-01

    To confirm the existence of missing proteins, we need to identify at least two unique peptides with length of 9-40 amino acids of a missing protein in bottom-up mass-spectrometry-based proteomic experiments. However, an identified unique peptide of the missing protein, even identified with high level of confidence, could possibly coincide with a peptide of a commonly observed protein due to isobaric substitutions, mass modifications, alternative splice isoforms, or single amino acid variants (SAAVs). Besides unique peptides of missing proteins, identified variant peptides (SAAV-containing peptides) could also alternatively map to peptides of other proteins due to the aforementioned issues. Therefore, we conducted a thorough comparative analysis on data sets in PeptideAtlas Tiered Human Integrated Search Proteome (THISP, 2017-03 release), including neXtProt (2017-01 release), to systematically investigate the possibility of unique peptides in missing proteins (PE2-4), unique peptides in dubious proteins, and variant peptides affected by isobaric substitutions, causing doubtful identification results. In this study, we considered 11 isobaric substitutions. From our analysis, we found <5% of the unique peptides of missing proteins and >6% of variant peptides became shared with peptides of PE1 proteins after isobaric substitutions.

  17. Genome-Wide Analysis of Alternative Splicing Landscapes Modulated during Plant-Virus Interactions in Brachypodium distachyon

    PubMed Central

    Scholthof, Karen-Beth G.

    2015-01-01

    In eukaryotes, alternative splicing (AS) promotes transcriptome and proteome diversity. The extent of genome-wide AS changes occurring during a plant-microbe interaction is largely unknown. Here, using high-throughput, paired-end RNA sequencing, we generated an isoform-level spliceome map of Brachypodium distachyon infected with Panicum mosaic virus and its satellite virus. Overall, we detected ∼44,443 transcripts in B. distachyon, ∼30% more than those annotated in the reference genome. Expression of ∼28,900 transcripts was ≥2 fragments per kilobase of transcript per million mapped fragments, and ∼42% of multi-exonic genes were alternatively spliced. Comparative analysis of AS patterns in B. distachyon, rice (Oryza sativa), maize (Zea mays), sorghum (Sorghum bicolor), Arabidopsis thaliana, potato (Solanum tuberosum), Medicago truncatula, and poplar (Populus trichocarpa) revealed conserved ratios of the AS types between monocots and dicots. Virus infection quantitatively altered AS events in Brachypodium with little effect on the AS ratios. We discovered AS events for >100 immune-related genes encoding receptor-like kinases, NB-LRR resistance proteins, transcription factors, RNA silencing, and splicing-associated proteins. Cloning and molecular characterization of SCL33, a serine/arginine-rich splicing factor, identified multiple novel intron-retaining splice variants that are developmentally regulated and modulated during virus infection. B. distachyon SCL33 splicing patterns are also strikingly conserved compared with a distant Arabidopsis SCL33 ortholog. This analysis provides new insights into AS landscapes conserved among monocots and dicots and uncovered AS events in plant defense-related genes. PMID:25634987

  18. FINGERPRINT ANALYSIS OF CONTAMINANT DATA: A FORENSIC TOOL FOR EVALUATING ENVIRONMENTAL CONTAMINATION

    EPA Science Inventory

    Several studies have been conducted on behalf of the U .S. Environmental Protection Agency (EPA) to identify detection monitoring parameters for specific industries.1,2,3,4,5 One outcome of these studies was the evolution of an empirical multi-variant contaminant fingerprinting p...

  19. High-resolution analysis of selection sweeps identified between fine-wool Merino and coarse-wool Churra sheep breeds.

    PubMed

    Gutiérrez-Gil, Beatriz; Esteban-Blanco, Cristina; Wiener, Pamela; Chitneedi, Praveen Krishna; Suarez-Vega, Aroa; Arranz, Juan-Jose

    2017-11-07

    With the aim of identifying selection signals in three Merino sheep lines that are highly specialized for fine wool production (Australian Industry Merino, Australian Merino and Australian Poll Merino) and considering that these lines have been subjected to selection not only for wool traits but also for growth and carcass traits and parasite resistance, we contrasted the OvineSNP50 BeadChip (50 K-chip) pooled genotypes of these Merino lines with the genotypes of a coarse-wool breed, phylogenetically related breed, Spanish Churra dairy sheep. Genome re-sequencing datasets of the two breeds were analyzed to further explore the genetic variation of the regions initially identified as putative selection signals. Based on the 50 K-chip genotypes, we used the overlapping selection signals (SS) identified by four selection sweep mapping analyses (that detect genetic differentiation, reduced heterozygosity and patterns of haplotype diversity) to define 18 convergence candidate regions (CCR), five associated with positive selection in Australian Merino and the remainder indicating positive selection in Churra. Subsequent analysis of whole-genome sequences from 15 Churra and 13 Merino samples identified 142,400 genetic variants (139,745 bi-allelic SNPs and 2655 indels) within the 18 defined CCR. Annotation of 1291 variants that were significantly associated with breed identity between Churra and Merino samples identified 257 intragenic variants that caused 296 functional annotation variants, 275 of which were located across 31 coding genes. Among these, four synonymous and four missense variants (NPR2_His847Arg, NCAPG_Ser585Phe, LCORL_Asp1214Glu and LCORL_Ile1441Leu) were included. Here, we report the mapping and genetic variation of 18 selection signatures that were identified between Australian Merino and Spanish Churra sheep breeds, which were validated by an additional contrast between Spanish Merino and Churra genotypes. Analysis of whole-genome sequencing datasets allowed us to identify divergent variants that may be viewed as candidates involved in the phenotypic differences for wool, growth and meat production/quality traits between the breeds analyzed. The four missense variants located in the NPR2, NCAPG and LCORL genes may be related to selection sweep regions previously identified and various QTL reported in sheep in relation to growth traits and carcass composition.

  20. Loci Contributing to Boric Acid Toxicity in Two Reference Populations of Drosophila melanogaster

    PubMed Central

    Najarro, Michael A.; Hackett, Jennifer L.; Macdonald, Stuart J.

    2017-01-01

    Populations maintain considerable segregating variation in the response to toxic, xenobiotic compounds. To identify variants associated with resistance to boric acid, a commonly-used household insecticide with a poorly understood mechanism of action, we assayed thousands of individuals from hundreds of strains. Using the Drosophila Synthetic Population Resource (DSPR), a multi-parental population (MPP) of inbred genotypes, we mapped six QTL to short genomic regions containing few protein-coding genes (3–188), allowing us to identify plausible candidate genes underlying resistance to boric acid toxicity. One interval contains multiple genes from the cytochrome P450 family, and we show that ubiquitous RNAi of one of these genes, Cyp9b2, markedly reduces resistance to the toxin. Resistance to boric acid is positively correlated with caffeine resistance. The two phenotypes additionally share a pair of QTL, potentially suggesting a degree of pleiotropy in the genetic control of resistance to these two distinct xenobiotics. Finally, we screened the Drosophila Genetic Reference Panel (DGRP) in an attempt to identify sequence variants within mapped QTL that are associated with boric acid resistance. The approach was largely unsuccessful, with only one QTL showing any associations at QTL-specific 20% False Discovery Rate (FDR) thresholds. Nonetheless, these associations point to a potential candidate gene that can be targeted in future validation efforts. Although the mapping data resulting from the two reference populations do not clearly overlap, our work provides a starting point for further genetic dissection of the processes underlying boric acid toxicity in insects. PMID:28592646

  1. Loci Contributing to Boric Acid Toxicity in Two Reference Populations of Drosophila melanogaster.

    PubMed

    Najarro, Michael A; Hackett, Jennifer L; Macdonald, Stuart J

    2017-06-07

    Populations maintain considerable segregating variation in the response to toxic, xenobiotic compounds. To identify variants associated with resistance to boric acid, a commonly-used household insecticide with a poorly understood mechanism of action, we assayed thousands of individuals from hundreds of strains. Using the Drosophila Synthetic Population Resource (DSPR), a multi-parental population (MPP) of inbred genotypes, we mapped six QTL to short genomic regions containing few protein-coding genes (3-188), allowing us to identify plausible candidate genes underlying resistance to boric acid toxicity. One interval contains multiple genes from the cytochrome P450 family, and we show that ubiquitous RNAi of one of these genes, Cyp9b2 , markedly reduces resistance to the toxin. Resistance to boric acid is positively correlated with caffeine resistance. The two phenotypes additionally share a pair of QTL, potentially suggesting a degree of pleiotropy in the genetic control of resistance to these two distinct xenobiotics. Finally, we screened the Drosophila Genetic Reference Panel (DGRP) in an attempt to identify sequence variants within mapped QTL that are associated with boric acid resistance. The approach was largely unsuccessful, with only one QTL showing any associations at QTL-specific 20% False Discovery Rate (FDR) thresholds. Nonetheless, these associations point to a potential candidate gene that can be targeted in future validation efforts. Although the mapping data resulting from the two reference populations do not clearly overlap, our work provides a starting point for further genetic dissection of the processes underlying boric acid toxicity in insects. Copyright © 2017 Najarro et al.

  2. The use of population-scale sequencing to identify CNVs impacting productive traits in different cattle breeds

    USDA-ARS?s Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...

  3. Checking of individuality by DNA profiling.

    PubMed

    Brdicka, R; Nürnberg, P

    1993-08-25

    A review of methods of DNA analysis used in forensic medicine for identification, paternity testing, etc. is provided. Among other techniques, DNA fingerprinting using different probes and polymerase chain reaction-based techniques such as amplified sequence polymorphisms and minisatellite variant repeat mapping are thoroughly described and both theoretical and practical aspects are discussed.

  4. Genome characterization of the selected long- and short-sleep mouse lines.

    PubMed

    Dowell, Robin; Odell, Aaron; Richmond, Phillip; Malmer, Daniel; Halper-Stromberg, Eitan; Bennett, Beth; Larson, Colin; Leach, Sonia; Radcliffe, Richard A

    2016-12-01

    The Inbred Long- and Short-Sleep (ILS, ISS) mouse lines were selected for differences in acute ethanol sensitivity using the loss of righting response (LORR) as the selection trait. The lines show an over tenfold difference in LORR and, along with a recombinant inbred panel derived from them (the LXS), have been widely used to dissect the genetic underpinnings of acute ethanol sensitivity. Here we have sequenced the genomes of the ILS and ISS to investigate the DNA variants that contribute to their sensitivity difference. We identified ~2.7 million high-confidence SNPs and small indels and ~7000 structural variants between the lines; variants were found to occur in 6382 annotated genes. Using a hidden Markov model, we were able to reconstruct the genome-wide ancestry patterns of the eight inbred progenitor strains from which the ILS and ISS were derived, and found that quantitative trait loci that have been mapped for LORR were slightly enriched for DNA variants. Finally, by mapping and quantifying RNA-seq reads from the ILS and ISS to their strain-specific genomes rather than to the reference genome, we found a substantial improvement in a differential expression analysis between the lines. This work will help in identifying and characterizing the DNA sequence variants that contribute to the difference in ethanol sensitivity between the ILS and ISS and will also aid in accurate quantification of RNA-seq data generated from the LXS RIs.

  5. Admixture mapping in two Mexican samples identifies significant associations of locus ancestry with triglyceride levels in the BUD13/ZNF259/APOA5 region and fine mapping points to rs964184 as the main driver of the association signal.

    PubMed

    Parra, Esteban J; Mazurek, Andrew; Gignoux, Christopher R; Sockell, Alexandra; Agostino, Michael; Morris, Andrew P; Petty, Lauren E; Hanis, Craig L; Cox, Nancy J; Valladares-Salgado, Adan; Below, Jennifer E; Cruz, Miguel

    2017-01-01

    We carried out an admixture mapping study of lipid traits in two samples from Mexico City. Native American locus ancestry was significantly associated with triglyceride levels in a broad region of chromosome 11 overlapping the BUD13, ZNF259 and APOA5 genes. In our fine-mapping analysis of this region using dense genome-wide data, rs964184 is the only marker included in the 99% credible set of SNPs, providing strong support for rs964184 as the causal variant within this region. The frequency of the allele associated with increased triglyceride concentrations (rs964184-G) is between 30-40% higher in Native American populations from Mexico than in European populations. The evidence currently available for this variant indicates that it may be exerting its effect through three potential mechanisms: 1) modification of enhancer activity, 2) regulation of the expression of several genes in cis and/or trans, or 3) modification of the methylation patterns of the promoter of the APOA5 gene.

  6. Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs

    PubMed Central

    Adoue, Veronique; Schiavi, Alicia; Light, Nicholas; Almlöf, Jonas Carlsson; Lundmark, Per; Ge, Bing; Kwan, Tony; Caron, Maxime; Rönnblom, Lars; Wang, Chuan; Chen, Shu-Huang; Goodall, Alison H; Cambien, Francois; Deloukas, Panos; Ouwehand, Willem H; Syvänen, Ann-Christine; Pastinen, Tomi

    2014-01-01

    Most complex disease-associated genetic variants are located in non-coding regions and are therefore thought to be regulatory in nature. Association mapping of differential allelic expression (AE) is a powerful method to identify SNPs with direct cis-regulatory impact (cis-rSNPs). We used AE mapping to identify cis-rSNPs regulating gene expression in 55 and 63 HapMap lymphoblastoid cell lines from a Caucasian and an African population, respectively, 70 fibroblast cell lines, and 188 purified monocyte samples and found 40–60% of these cis-rSNPs to be shared across cell types. We uncover a new class of cis-rSNPs, which disrupt footprint-derived de novo motifs that are predominantly bound by repressive factors and are implicated in disease susceptibility through overlaps with GWAS SNPs. Finally, we provide the proof-of-principle for a new approach for genome-wide functional validation of transcription factor–SNP interactions. By perturbing NFκB action in lymphoblasts, we identified 489 cis-regulated transcripts with altered AE after NFκB perturbation. Altogether, we perform a comprehensive analysis of cis-variation in four cell populations and provide new tools for the identification of functional variants associated to complex diseases. PMID:25326100

  7. Admixture mapping in two Mexican samples identifies significant associations of locus ancestry with triglyceride levels in the BUD13/ZNF259/APOA5 region and fine mapping points to rs964184 as the main driver of the association signal

    PubMed Central

    Mazurek, Andrew; Sockell, Alexandra; Morris, Andrew P.; Petty, Lauren E.; Hanis, Craig L.; Cox, Nancy J.; Cruz, Miguel

    2017-01-01

    We carried out an admixture mapping study of lipid traits in two samples from Mexico City. Native American locus ancestry was significantly associated with triglyceride levels in a broad region of chromosome 11 overlapping the BUD13, ZNF259 and APOA5 genes. In our fine-mapping analysis of this region using dense genome-wide data, rs964184 is the only marker included in the 99% credible set of SNPs, providing strong support for rs964184 as the causal variant within this region. The frequency of the allele associated with increased triglyceride concentrations (rs964184-G) is between 30–40% higher in Native American populations from Mexico than in European populations. The evidence currently available for this variant indicates that it may be exerting its effect through three potential mechanisms: 1) modification of enhancer activity, 2) regulation of the expression of several genes in cis and/or trans, or 3) modification of the methylation patterns of the promoter of the APOA5 gene. PMID:28245265

  8. MultiElec: A MATLAB Based Application for MEA Data Analysis.

    PubMed

    Georgiadis, Vassilis; Stephanou, Anastasis; Townsend, Paul A; Jackson, Thomas R

    2015-01-01

    We present MultiElec, an open source MATLAB based application for data analysis of microelectrode array (MEA) recordings. MultiElec displays an extremely user-friendly graphic user interface (GUI) that allows the simultaneous display and analysis of voltage traces for 60 electrodes and includes functions for activation-time determination, the production of activation-time heat maps with activation time and isoline display. Furthermore, local conduction velocities are semi-automatically calculated along with their corresponding vector plots. MultiElec allows ad hoc signal suppression, enabling the user to easily and efficiently handle signal artefacts and for incomplete data sets to be analysed. Voltage traces and heat maps can be simply exported for figure production and presentation. In addition, our platform is able to produce 3D videos of signal progression over all 60 electrodes. Functions are controlled entirely by a single GUI with no need for command line input or any understanding of MATLAB code. MultiElec is open source under the terms of the GNU General Public License as published by the Free Software Foundation, version 3. Both the program and source code are available to download from http://www.cancer.manchester.ac.uk/MultiElec/.

  9. GWAS and fine-mapping of 35 production, reproduction and conformation traits with imputed sequences of 27K Holstein bulls

    USDA-ARS?s Scientific Manuscript database

    Fine-mapping of causal variants is becoming feasible for complex traits in livestock GWAS, as an increasing number of animals are sequenced. Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on small reference populations of sequenced animals. ...

  10. GWAS and fine-mapping of 35 production, reproduction, and conformation traits with imputed sequences of 27K Holstein bulls

    USDA-ARS?s Scientific Manuscript database

    Imputation has been routinely applied to ascertain sequence variants in large genotyped populations based on reference populations of sequenced animals. With the implementation of the 1000 Bull Genomes Project and increasing numbers of animals sequenced, fine-mapping of causal variants is becoming f...

  11. Collective odor source estimation and search in time-variant airflow environments using mobile robots.

    PubMed

    Meng, Qing-Hao; Yang, Wei-Xing; Wang, Yang; Zeng, Ming

    2011-01-01

    This paper addresses the collective odor source localization (OSL) problem in a time-varying airflow environment using mobile robots. A novel OSL methodology which combines odor-source probability estimation and multiple robots' search is proposed. The estimation phase consists of two steps: firstly, the separate probability-distribution map of odor source is estimated via Bayesian rules and fuzzy inference based on a single robot's detection events; secondly, the separate maps estimated by different robots at different times are fused into a combined map by way of distance based superposition. The multi-robot search behaviors are coordinated via a particle swarm optimization algorithm, where the estimated odor-source probability distribution is used to express the fitness functions. In the process of OSL, the estimation phase provides the prior knowledge for the searching while the searching verifies the estimation results, and both phases are implemented iteratively. The results of simulations for large-scale advection-diffusion plume environments and experiments using real robots in an indoor airflow environment validate the feasibility and robustness of the proposed OSL method.

  12. Collective Odor Source Estimation and Search in Time-Variant Airflow Environments Using Mobile Robots

    PubMed Central

    Meng, Qing-Hao; Yang, Wei-Xing; Wang, Yang; Zeng, Ming

    2011-01-01

    This paper addresses the collective odor source localization (OSL) problem in a time-varying airflow environment using mobile robots. A novel OSL methodology which combines odor-source probability estimation and multiple robots’ search is proposed. The estimation phase consists of two steps: firstly, the separate probability-distribution map of odor source is estimated via Bayesian rules and fuzzy inference based on a single robot’s detection events; secondly, the separate maps estimated by different robots at different times are fused into a combined map by way of distance based superposition. The multi-robot search behaviors are coordinated via a particle swarm optimization algorithm, where the estimated odor-source probability distribution is used to express the fitness functions. In the process of OSL, the estimation phase provides the prior knowledge for the searching while the searching verifies the estimation results, and both phases are implemented iteratively. The results of simulations for large-scale advection–diffusion plume environments and experiments using real robots in an indoor airflow environment validate the feasibility and robustness of the proposed OSL method. PMID:22346650

  13. Functional Variants at the 11q13 Risk Locus for Breast Cancer Regulate Cyclin D1 Expression through Long-Range Enhancers

    PubMed Central

    French, Juliet D.; Ghoussaini, Maya; Edwards, Stacey L.; Meyer, Kerstin B.; Michailidou, Kyriaki; Ahmed, Shahana; Khan, Sofia; Maranian, Mel J.; O’Reilly, Martin; Hillman, Kristine M.; Betts, Joshua A.; Carroll, Thomas; Bailey, Peter J.; Dicks, Ed; Beesley, Jonathan; Tyrer, Jonathan; Maia, Ana-Teresa; Beck, Andrew; Knoblauch, Nicholas W.; Chen, Constance; Kraft, Peter; Barnes, Daniel; González-Neira, Anna; Alonso, M. Rosario; Herrero, Daniel; Tessier, Daniel C.; Vincent, Daniel; Bacot, Francois; Luccarini, Craig; Baynes, Caroline; Conroy, Don; Dennis, Joe; Bolla, Manjeet K.; Wang, Qin; Hopper, John L.; Southey, Melissa C.; Schmidt, Marjanka K.; Broeks, Annegien; Verhoef, Senno; Cornelissen, Sten; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A.; Loehberg, Christian R.; Ekici, Arif B.; Beckmann, Matthias W.; Peto, Julian; dos Santos Silva, Isabel; Johnson, Nichola; Aitken, Zoe; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Milne, Roger L.; Zamora, M. Pilar; Arias Perez, Jose Ignacio; Benitez, Javier; Anton-Culver, Hoda; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Lichtner, Peter; Schmutzler, Rita K.; Engel, Christoph; Brauch, Hiltrud; Hamann, Ute; Justenhoven, Christina; Aaltonen, Kirsimari; Heikkilä, Päivi; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Sueta, Aiko; Bogdanova, Natalia V.; Antonenkova, Natalia N.; Dörk, Thilo; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Lambrechts, Diether; Peeters, Stephanie; Smeets, Ann; Floris, Giuseppe; Chang-Claude, Jenny; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Sardella, Domenico; Couch, Fergus J.; Wang, Xianshu; Pankratz, Vernon S.; Lee, Adam; Giles, Graham G.; Severi, Gianluca; Baglietto, Laura; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Teo, Soo Hwang; Yip, Cheng Har; Ng, Char-Hong; Vithana, Eranga Nishanthie; Kristensen, Vessela; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Devilee, Peter; Seynaeve, Caroline; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Czene, Kamila; Klevebring, Daniel; Schoof, Nils; Hooning, Maartje J.; Martens, John W.M.; Collée, J. Margriet; Tilanus-Linthorst, Madeleine; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Balasubramanian, Sabapathy P.; Blot, William; Signorello, Lisa B.; Cai, Qiuyin; Pharoah, Paul D.P.; Healey, Catherine S.; Shah, Mitul; Pooley, Karen A.; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Hartman, Mikael; Miao, Hui; Sng, Jen-Hwei; Sim, Xueling; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; McKay, James; Toland, Amanda E.; Ambrosone, Christine B.; Yannoukakos, Drakoulis; Godwin, Andrew K.; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Chen, Shou-Tung; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J.; Ponder, Bruce A.J.; Nevanlinna, Heli; Brown, Melissa A.; Chenevix-Trench, Georgia; Easton, Douglas F.; Dunning, Alison M.

    2013-01-01

    Analysis of 4,405 variants in 89,050 European subjects from 41 case-control studies identified three independent association signals for estrogen-receptor-positive tumors at 11q13. The strongest signal maps to a transcriptional enhancer element in which the G allele of the best candidate causative variant rs554219 increases risk of breast cancer, reduces both binding of ELK4 transcription factor and luciferase activity in reporter assays, and may be associated with low cyclin D1 protein levels in tumors. Another candidate variant, rs78540526, lies in the same enhancer element. Risk association signal 2, rs75915166, creates a GATA3 binding site within a silencer element. Chromatin conformation studies demonstrate that these enhancer and silencer elements interact with each other and with their likely target gene, CCND1. PMID:23540573

  14. A pooling-based approach to mapping genetic variants associated with DNA methylation

    PubMed Central

    Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.; McEwen, Lisa M.; Kobor, Michael S.; Fraser, Hunter B.

    2015-01-01

    DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a truly genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. We found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data. PMID:25910490

  15. A pooling-based approach to mapping genetic variants associated with DNA methylation

    DOE PAGES

    Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.; ...

    2015-04-24

    DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a trulymore » genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. Here we found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.« less

  16. Multi-Sensor Characterization of the Boreal Forest: Initial Findings

    NASA Technical Reports Server (NTRS)

    Reith, Ernest; Roberts, Dar A.; Prentiss, Dylan

    2001-01-01

    Results are presented in an initial apriori knowledge approach toward using complementary multi-sensor multi-temporal imagery in characterizing vegetated landscapes over a site in the Boreal Ecosystem-Atmosphere Study (BOREAS). Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) and Airborne Synthetic Aperture Radar (AIRSAR) data were segmented using multiple endmember spectral mixture analysis and binary decision tree approaches. Individual date/sensor land cover maps had overall accuracies between 55.0% - 69.8%. The best eight land cover layers from all dates and sensors correctly characterized 79.3% of the cover types. An overlay approach was used to create a final land cover map. An overall accuracy of 71.3% was achieved in this multi-sensor approach, a 1.5% improvement over our most accurate single scene technique, but 8% less than the original input. Black spruce was evaluated to be particularly undermapped in the final map possibly because it was also contained within jack pine and muskeg land coverages.

  17. Four Families of Multi-Variant Issues in Graduate-Level Asynchronous Online Courses

    ERIC Educational Resources Information Center

    Gisburne, Jaclyn M.; Fairchild, Patricia J.

    2004-01-01

    This is the first of several papers developed from a faculty and student perspective describing a new distance learning (DL) model. Integral to the model are four interrelated families of multi-variant issues, referred to here as (a) the academic divide, (b) student misalignment, (c) administrative influences, and (d) the use of student…

  18. Mid-Atlantic multichannel seismic-reflection profiles 14, 15, 16, and 17

    USGS Publications Warehouse

    Schlee, John Stevens

    1980-01-01

    The U. S. Geological Survey (USGS) is making available four multi­channel profiles collected by Teledyne Exploration in 1977 by means of a 48-channel streamer (3600 m long) and four airguns (2160 in). Profiles 15 and 16 were processed by.Teledyne Exploration and profiles 14 and 17 were processed on the Phoenix "I"* computer·by the USGS. The processing included standard demultiplexing, deconvolution before and aftfer stack, Common Depth Point (CDP) gathers, velocity analyses every 3 km, move-out correction, stacking, time-variant, filtering, and time-variant scaling.The released lines are over the outer edge of the Continental Shelf in the northern part of the Baltimore Canyon trough (Line 14: 140 km long and Line 15: 157 km long), over the Long Island platform. (Line 16: 313 km long), and over the Carolina platform (Line 17: 186 km long). These profiles were collected as a part of a regional grid over offshore Atlantic sedimentary basins in a continuing program to assess the resource potential by means of nonproprietary data.These profiles, plus the velocity scans and shotpoint maps, may be viewed at U. S. Geological Survey, Quissett Campus, Woods Hole, MA. 02543, and U. S. Geological Survey, Bldg. 25, Denver Federal Center, Denver, CO. Copies of maps, scans, and profiles can be purchased only from the National Geophysical Solar-Terrestrial Data Center, Environmental Data Service (NOM), Code D 621, Boulder, CO 80302.

  19. Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis

    PubMed Central

    Aslam, Luqman; Beal, Kathryn; Ann Blomberg, Le; Bouffard, Pascal; Burt, David W.; Crasta, Oswald; Crooijmans, Richard P. M. A.; Cooper, Kristal; Coulombe, Roger A.; De, Supriyo; Delany, Mary E.; Dodgson, Jerry B.; Dong, Jennifer J.; Evans, Clive; Frederickson, Karin M.; Flicek, Paul; Florea, Liliana; Folkerts, Otto; Groenen, Martien A. M.; Harkins, Tim T.; Herrero, Javier; Hoffmann, Steve; Megens, Hendrik-Jan; Jiang, Andrew; de Jong, Pieter; Kaiser, Pete; Kim, Heebal; Kim, Kyu-Won; Kim, Sungwon; Langenberger, David; Lee, Mi-Kyung; Lee, Taeheon; Mane, Shrinivasrao; Marcais, Guillaume; Marz, Manja; McElroy, Audrey P.; Modise, Thero; Nefedov, Mikhail; Notredame, Cédric; Paton, Ian R.; Payne, William S.; Pertea, Geo; Prickett, Dennis; Puiu, Daniela; Qioa, Dan; Raineri, Emanuele; Ruffier, Magali; Salzberg, Steven L.; Schatz, Michael C.; Scheuring, Chantel; Schmidt, Carl J.; Schroeder, Steven; Searle, Stephen M. J.; Smith, Edward J.; Smith, Jacqueline; Sonstegard, Tad S.; Stadler, Peter F.; Tafer, Hakim; Tu, Zhijian (Jake); Van Tassell, Curtis P.; Vilella, Albert J.; Williams, Kelly P.; Yorke, James A.; Zhang, Liqing; Zhang, Hong-Bin; Zhang, Xiaojun; Zhang, Yang; Reed, Kent M.

    2010-01-01

    A synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest. PMID:20838655

  20. Imputation of variants from the 1000 Genomes Project modestly improves known associations and can identify low-frequency variant-phenotype associations undetected by HapMap based imputation.

    PubMed

    Wood, Andrew R; Perry, John R B; Tanaka, Toshiko; Hernandez, Dena G; Zheng, Hou-Feng; Melzer, David; Gibbs, J Raphael; Nalls, Michael A; Weedon, Michael N; Spector, Tim D; Richards, J Brent; Bandinelli, Stefania; Ferrucci, Luigi; Singleton, Andrew B; Frayling, Timothy M

    2013-01-01

    Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF <5%) and rare variants (<1%)) can enhance previously identified associations and identify novel loci, we selected 93 quantitative circulating factors where data was available from the InCHIANTI population study. These phenotypes included cytokines, binding proteins, hormones, vitamins and ions. We selected these phenotypes because many have known strong genetic associations and are potentially important to help understand disease processes. We performed a genome-wide scan for these 93 phenotypes in InCHIANTI. We identified 21 signals and 33 signals that reached P<5×10(-8) based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P<5×10(-11) respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P<5×10(-8) in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10(-12)). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations.

  1. Imputation of Variants from the 1000 Genomes Project Modestly Improves Known Associations and Can Identify Low-frequency Variant - Phenotype Associations Undetected by HapMap Based Imputation

    PubMed Central

    Wood, Andrew R.; Perry, John R. B.; Tanaka, Toshiko; Hernandez, Dena G.; Zheng, Hou-Feng; Melzer, David; Gibbs, J. Raphael; Nalls, Michael A.; Weedon, Michael N.; Spector, Tim D.; Richards, J. Brent; Bandinelli, Stefania; Ferrucci, Luigi; Singleton, Andrew B.; Frayling, Timothy M.

    2013-01-01

    Genome-wide association (GWA) studies have been limited by the reliance on common variants present on microarrays or imputable from the HapMap Project data. More recently, the completion of the 1000 Genomes Project has provided variant and haplotype information for several million variants derived from sequencing over 1,000 individuals. To help understand the extent to which more variants (including low frequency (1% ≤ MAF <5%) and rare variants (<1%)) can enhance previously identified associations and identify novel loci, we selected 93 quantitative circulating factors where data was available from the InCHIANTI population study. These phenotypes included cytokines, binding proteins, hormones, vitamins and ions. We selected these phenotypes because many have known strong genetic associations and are potentially important to help understand disease processes. We performed a genome-wide scan for these 93 phenotypes in InCHIANTI. We identified 21 signals and 33 signals that reached P<5×10−8 based on HapMap and 1000 Genomes imputation, respectively, and 9 and 11 that reached a stricter, likely conservative, threshold of P<5×10−11 respectively. Imputation of 1000 Genomes genotype data modestly improved the strength of known associations. Of 20 associations detected at P<5×10−8 in both analyses (17 of which represent well replicated signals in the NHGRI catalogue), six were captured by the same index SNP, five were nominally more strongly associated in 1000 Genomes imputed data and one was nominally more strongly associated in HapMap imputed data. We also detected an association between a low frequency variant and phenotype that was previously missed by HapMap based imputation approaches. An association between rs112635299 and alpha-1 globulin near the SERPINA gene represented the known association between rs28929474 (MAF = 0.007) and alpha1-antitrypsin that predisposes to emphysema (P = 2.5×10−12). Our data provide important proof of principle that 1000 Genomes imputation will detect novel, low frequency-large effect associations. PMID:23696881

  2. ABCA4 midigenes reveal the full splice spectrum of all reported noncanonical splice site variants in Stargardt disease.

    PubMed

    Sangermano, Riccardo; Khan, Mubeen; Cornelis, Stéphanie S; Richelle, Valerie; Albert, Silvia; Garanto, Alejandro; Elmelik, Duaa; Qamar, Raheel; Lugtenberg, Dorien; van den Born, L Ingeborgh; Collin, Rob W J; Cremers, Frans P M

    2018-01-01

    Stargardt disease is caused by variants in the ABCA4 gene, a significant part of which are noncanonical splice site (NCSS) variants. In case a gene of interest is not expressed in available somatic cells, small genomic fragments carrying potential disease-associated variants are tested for splice abnormalities using in vitro splice assays. We recently discovered that when using small minigenes lacking the proper genomic context, in vitro results do not correlate with splice defects observed in patient cells. We therefore devised a novel strategy in which a bacterial artificial chromosome was employed to generate midigenes, splice vectors of varying lengths (up to 11.7 kb) covering almost the entire ABCA4 gene. These midigenes were used to analyze the effect of all 44 reported and three novel NCSS variants on ABCA4 pre-mRNA splicing. Intriguingly, multi-exon skipping events were observed, as well as exon elongation and intron retention. The analysis of all reported NCSS variants in ABCA4 allowed us to reveal the nature of aberrant splicing events and to classify the severity of these mutations based on the residual fraction of wild-type mRNA. Our strategy to generate large overlapping splice vectors carrying multiple exons, creating a toolbox for robust and high-throughput analysis of splice variants, can be applied to all human genes. © 2018 Sangermano et al.; Published by Cold Spring Harbor Laboratory Press.

  3. A Modified P1 Moiety Enhances in vitro Antiviral Activity against Various Multi-Drug-Resistant HIV-1 Variants and in vitro CNS Penetration Properties of a Novel Nonpeptidic Protease Inhibitor, GRL-10413

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Amano, Masayuki; Salcedo-Gómez, Pedro Miguel; Zhao, Rui

    We here report that GRL-10413, a novel non-peptidic HIV-1 protease inhibitor (PI) containing a modified P1 moiety and a sulfonamide isostere, is highly active against laboratory HIV-1 strains and primary clinical isolates (EC 50: 0.00035 - 0.0018 μM) with minimal cytotoxicity (CC 50: 35.7 μM). GRL-10413 blocked the infectivity and replication of HIV-1 NL4-3variants selected by up to 5 μM concentrations of atazanavir, lopinavir, or amprenavir (EC 50: 0.0021 - 0.0023 μM). GRL-10413 also maintained its strong antiviral activity against multi-drug-resistant clinical HIV-1 variants isolated from patients, who no longer responded to various antiviral regimens after long-term antiretroviral therapy. Themore » development of resistance against GRL-10413 was significantly delayed compared to that of APV. In addition, GRL-10413 showed a favorable central nervous system (CNS) penetration property as assessed with anin vitroblood brain barrier (BBB) reconstruction system. Analysis of the crystal structure of HIV-1 protease in complex with GRL-10413 demonstrated that the modified P1 moiety of GRL-10413 has a greater hydrophobic surface area and makes greater van der Waals contacts with active-site amino acids of protease than in the case of darunavir. Moreover, the chlorine substituent in the P1 moiety interacts with protease in two distinct configurations. The present data demonstrate that GRL-10413 has desirable features for treating patients infected with wild-type and/or multi-drug-resistant HIV-1 variants with favorable CNS-penetration capability and that the newly modified P1-moiety may confer desirable features in designing novel anti-HIV-1 PIs.« less

  4. BlackOPs: increasing confidence in variant detection through mappability filtering.

    PubMed

    Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil

    2013-10-01

    Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.

  5. Trans-Ethnic Fine-Mapping of Lipid Loci Identifies Population-Specific Signals and Allelic Heterogeneity That Increases the Trait Variance Explained

    PubMed Central

    Wu, Ying; Waite, Lindsay L.; Jackson, Anne U.; Sheu, Wayne H-H.; Buyske, Steven; Absher, Devin; Arnett, Donna K.; Boerwinkle, Eric; Bonnycastle, Lori L.; Carty, Cara L.; Cheng, Iona; Cochran, Barbara; Croteau-Chonka, Damien C.; Dumitrescu, Logan; Eaton, Charles B.; Franceschini, Nora; Guo, Xiuqing; Henderson, Brian E.; Hindorff, Lucia A.; Kim, Eric; Kinnunen, Leena; Komulainen, Pirjo; Lee, Wen-Jane; Le Marchand, Loic; Lin, Yi; Lindström, Jaana; Lingaas-Holmen, Oddgeir; Mitchell, Sabrina L.; Narisu, Narisu; Robinson, Jennifer G.; Schumacher, Fred; Stančáková, Alena; Sundvall, Jouko; Sung, Yun-Ju; Swift, Amy J.; Wang, Wen-Chang; Wilkens, Lynne; Wilsgaard, Tom; Young, Alicia M.; Adair, Linda S.; Ballantyne, Christie M.; Bůžková, Petra; Chakravarti, Aravinda; Collins, Francis S.; Duggan, David; Feranil, Alan B.; Ho, Low-Tone; Hung, Yi-Jen; Hunt, Steven C.; Hveem, Kristian; Juang, Jyh-Ming J.; Kesäniemi, Antero Y.; Kuusisto, Johanna; Laakso, Markku; Lakka, Timo A.; Lee, I-Te; Leppert, Mark F.; Matise, Tara C.; Moilanen, Leena; Njølstad, Inger; Peters, Ulrike; Quertermous, Thomas; Rauramaa, Rainer; Rotter, Jerome I.; Saramies, Jouko; Tuomilehto, Jaakko; Uusitupa, Matti; Wang, Tzung-Dau; Mohlke, Karen L.

    2013-01-01

    Genome-wide association studies (GWAS) have identified ∼100 loci associated with blood lipid levels, but much of the trait heritability remains unexplained, and at most loci the identities of the trait-influencing variants remain unknown. We conducted a trans-ethnic fine-mapping study at 18, 22, and 18 GWAS loci on the Metabochip for their association with triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), respectively, in individuals of African American (n = 6,832), East Asian (n = 9,449), and European (n = 10,829) ancestry. We aimed to identify the variants with strongest association at each locus, identify additional and population-specific signals, refine association signals, and assess the relative significance of previously described functional variants. Among the 58 loci, 33 exhibited evidence of association at P<1×10−4 in at least one ancestry group. Sequential conditional analyses revealed that ten, nine, and four loci in African Americans, Europeans, and East Asians, respectively, exhibited two or more signals. At these loci, accounting for all signals led to a 1.3- to 1.8-fold increase in the explained phenotypic variance compared to the strongest signals. Distinct signals across ancestry groups were identified at PCSK9 and APOA5. Trans-ethnic analyses narrowed the signals to smaller sets of variants at GCKR, PPP1R3B, ABO, LCAT, and ABCA1. Of 27 variants reported previously to have functional effects, 74% exhibited the strongest association at the respective signal. In conclusion, trans-ethnic high-density genotyping and analysis confirm the presence of allelic heterogeneity, allow the identification of population-specific variants, and limit the number of candidate SNPs for functional studies. PMID:23555291

  6. Accurate and fast multiple-testing correction in eQTL studies.

    PubMed

    Sul, Jae Hoon; Raj, Towfique; de Jong, Simone; de Bakker, Paul I W; Raychaudhuri, Soumya; Ophoff, Roel A; Stranger, Barbara E; Eskin, Eleazar; Han, Buhm

    2015-06-04

    In studies of expression quantitative trait loci (eQTLs), it is of increasing interest to identify eGenes, the genes whose expression levels are associated with variation at a particular genetic variant. Detecting eGenes is important for follow-up analyses and prioritization because genes are the main entities in biological processes. To detect eGenes, one typically focuses on the genetic variant with the minimum p value among all variants in cis with a gene and corrects for multiple testing to obtain a gene-level p value. For performing multiple-testing correction, a permutation test is widely used. Because of growing sample sizes of eQTL studies, however, the permutation test has become a computational bottleneck in eQTL studies. In this paper, we propose an efficient approach for correcting for multiple testing and assess eGene p values by utilizing a multivariate normal distribution. Our approach properly takes into account the linkage-disequilibrium structure among variants, and its time complexity is independent of sample size. By applying our small-sample correction techniques, our method achieves high accuracy in both small and large studies. We have shown that our method consistently produces extremely accurate p values (accuracy > 98%) for three human eQTL datasets with different sample sizes and SNP densities: the Genotype-Tissue Expression pilot dataset, the multi-region brain dataset, and the HapMap 3 dataset. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  7. Applications of next-generation sequencing analysis for the detection of hepatocellular carcinoma-associated hepatitis B virus mutations.

    PubMed

    Wu, I-Chin; Liu, Wen-Chun; Chang, Ting-Tsung

    2018-06-02

    Next-generation sequencing (NGS) is a powerful and high-throughput method for the detection of viral mutations. This article provides a brief overview about optimization of NGS analysis for hepatocellular carcinoma (HCC)-associated hepatitis B virus (HBV) mutations, and hepatocarcinogenesis of relevant mutations. For the application of NGS analysis in the genome of HBV, four noteworthy steps were discovered in testing. First, a sample-specific reference sequence was the most effective mapping reference for NGS. Second, elongating the end of reference sequence improved mapping performance at the end of the genome. Third, resetting the origin of mapping reference sequence could probed deletion mutations and variants at a certain location with common mutations. Fourth, using a platform-specific cut-off value to distinguish authentic minority variants from technical artifacts was found to be highly effective. One hundred and sixty-seven HBV single nucleotide variants (SNVs) were found to be studied previously through a systematic literature review, and 12 SNVs were determined to be associated with HCC by meta-analysis. From comprehensive research using a HBV genome-wide NGS analysis, 60 NGS-defined HCC-associated SNVs with their pathogenic frequencies were identified, with 19 reported previously. All the 12 HCC-associated SNVs proved by meta-analysis were confirmed by NGS analysis, except for C1766T and T1768A which were mainly expressed in genotypes A and D, but including the subgroup analysis of A1762T. In the 41 novel NGS-defined HCC-associated SNVs, 31.7% (13/41) had cut-off values of SNV frequency lower than 20%. This showed that NGS could be used to detect HCC-associated SNVs with low SNV frequency. Most SNV II (the minor strains in the majority of non-HCC patients) had either low (< 20%) or high (> 80%) SNV frequencies in HCC patients, a characteristic U-shaped distribution pattern. The cut-off values of SNV frequency for HCC-associated SNVs represent their pathogenic frequencies. The pathogenic frequencies of HCC-associated SNV II also showed a U-shaped distribution. Hepatocarcinogenesis induced by HBV mutated proteins through cellular pathways was reviewed. NGS analysis is useful to discover novel HCC-associated HBV SNVs, especially those with low SNV frequency. The hepatocarcinogenetic mechanisms of novel HCC-associated HBV SNVs defined by NGS analysis deserve further investigation.

  8. Cross-Population Joint Analysis of eQTLs: Fine Mapping and Functional Annotation

    PubMed Central

    Wen, Xiaoquan; Luca, Francesca; Pique-Regi, Roger

    2015-01-01

    Mapping expression quantitative trait loci (eQTLs) has been shown as a powerful tool to uncover the genetic underpinnings of many complex traits at molecular level. In this paper, we present an integrative analysis approach that leverages eQTL data collected from multiple population groups. In particular, our approach effectively identifies multiple independent cis-eQTL signals that are consistent across populations, accounting for population heterogeneity in allele frequencies and linkage disequilibrium patterns. Furthermore, by integrating genomic annotations, our analysis framework enables high-resolution functional analysis of eQTLs. We applied our statistical approach to analyze the GEUVADIS data consisting of samples from five population groups. From this analysis, we concluded that i) jointly analysis across population groups greatly improves the power of eQTL discovery and the resolution of fine mapping of causal eQTL ii) many genes harbor multiple independent eQTLs in their cis regions iii) genetic variants that disrupt transcription factor binding are significantly enriched in eQTLs (p-value = 4.93 × 10-22). PMID:25906321

  9. The evaluation of multi-structure, multi-atlas pelvic anatomy features in a prostate MR lymphography CAD system

    NASA Astrophysics Data System (ADS)

    Meijs, M.; Debats, O.; Huisman, H.

    2015-03-01

    In prostate cancer, the detection of metastatic lymph nodes indicates progression from localized disease to metastasized cancer. The detection of positive lymph nodes is, however, a complex and time consuming task for experienced radiologists. Assistance of a two-stage Computer-Aided Detection (CAD) system in MR Lymphography (MRL) is not yet feasible due to the large number of false positives in the first stage of the system. By introducing a multi-structure, multi-atlas segmentation, using an affine transformation followed by a B-spline transformation for registration, the organ location is given by a mean density probability map. The atlas segmentation is semi-automatically drawn with ITK-SNAP, using Active Contour Segmentation. Each anatomic structure is identified by a label number. Registration is performed using Elastix, using Mutual Information and an Adaptive Stochastic Gradient optimization. The dataset consists of the MRL scans of ten patients, with lymph nodes manually annotated in consensus by two expert readers. The feature map of the CAD system consists of the Multi-Atlas and various other features (e.g. Normalized Intensity and multi-scale Blobness). The voxel-based Gentleboost classifier is evaluated using ROC analysis with cross validation. We show in a set of 10 studies that adding multi-structure, multi-atlas anatomical structure likelihood features improves the quality of the lymph node voxel likelihood map. Multiple structure anatomy maps may thus make MRL CAD more feasible.

  10. Reanalysis of BRCA1/2 negative high risk ovarian cancer patients reveals novel germline risk loci and insights into missing heritability

    PubMed Central

    Dyson, Gregory; Levin, Nancy K.; Chaudhry, Sophia; Rosati, Rita; Kalpage, Hasini; Simon, Michael S.; Tainsky, Michael A.

    2017-01-01

    While up to 25% of ovarian cancer (OVCA) cases are thought to be due to inherited factors, the majority of genetic risk remains unexplained. To address this gap, we sought to identify previously undescribed OVCA risk variants through the whole exome sequencing (WES) and candidate gene analysis of 48 women with ovarian cancer and selected for high risk of genetic inheritance, yet negative for any known pathogenic variants in either BRCA1 or BRCA2. In silico SNP analysis was employed to identify suspect variants followed by validation using Sanger DNA sequencing. We identified five pathogenic variants in our sample, four of which are in two genes featured on current multi-gene panels; (RAD51D, ATM). In addition, we found a pathogenic FANCM variant (R1931*) which has been recently implicated in familial breast cancer risk. Numerous rare and predicted to be damaging variants of unknown significance were detected in genes on current commercial testing panels, most prominently in ATM (n = 6) and PALB2 (n = 5). The BRCA2 variant p.K3326*, resulting in a 93 amino acid truncation, was overrepresented in our sample (odds ratio = 4.95, p = 0.01) and coexisted in the germline of these women with other deleterious variants, suggesting a possible role as a modifier of genetic penetrance. Furthermore, we detected loss of function variants in non-panel genes involved in OVCA relevant pathways; DNA repair and cell cycle control, including CHEK1, TP53I3, REC8, HMMR, RAD52, RAD1, POLK, POLQ, and MCM4. In summary, our study implicates novel risk loci as well as highlights the clinical utility for retesting BRCA1/2 negative OVCA patients by genomic sequencing and analysis of genes in relevant pathways. PMID:28591191

  11. GWAS4D: multidimensional analysis of context-specific regulatory variant for human complex diseases and traits.

    PubMed

    Huang, Dandan; Yi, Xianfu; Zhang, Shijie; Zheng, Zhanye; Wang, Panwen; Xuan, Chenghao; Sham, Pak Chung; Wang, Junwen; Li, Mulin Jun

    2018-05-16

    Genome-wide association studies have generated over thousands of susceptibility loci for many human complex traits, and yet for most of these associations the true causal variants remain unknown. Tissue/cell type-specific prediction and prioritization of non-coding regulatory variants will facilitate the identification of causal variants and underlying pathogenic mechanisms for particular complex diseases and traits. By leveraging recent large-scale functional genomics/epigenomics data, we develop an intuitive web server, GWAS4D (http://mulinlab.tmu.edu.cn/gwas4d or http://mulinlab.org/gwas4d), that systematically evaluates GWAS signals and identifies context-specific regulatory variants. The updated web server includes six major features: (i) updates the regulatory variant prioritization method with our new algorithm; (ii) incorporates 127 tissue/cell type-specific epigenomes data; (iii) integrates motifs of 1480 transcriptional regulators from 13 public resources; (iv) uniformly processes Hi-C data and generates significant interactions at 5 kb resolution across 60 tissues/cell types; (v) adds comprehensive non-coding variant functional annotations; (vi) equips a highly interactive visualization function for SNP-target interaction. Using a GWAS fine-mapped set for 161 coronary artery disease risk loci, we demonstrate that GWAS4D is able to efficiently prioritize disease-causal regulatory variants.

  12. Analysis of Plasminogen Genetic Variants in Multiple Sclerosis Patients

    PubMed Central

    Sadovnick, A. Dessa; Traboulsee, Anthony L.; Bernales, Cecily Q.; Ross, Jay P.; Forwell, Amanda L.; Yee, Irene M.; Guillot-Noel, Lena; Fontaine, Bertrand; Cournu-Rebeix, Isabelle; Alcina, Antonio; Fedetz, Maria; Izquierdo, Guillermo; Matesanz, Fuencisla; Hilven, Kelly; Dubois, Bénédicte; Goris, An; Astobiza, Ianire; Alloza, Iraide; Antigüedad, Alfredo; Vandenbroeck, Koen; Akkad, Denis A.; Aktas, Orhan; Blaschke, Paul; Buttmann, Mathias; Chan, Andrew; Epplen, Joerg T.; Gerdes, Lisa-Ann; Kroner, Antje; Kubisch, Christian; Kümpfel, Tania; Lohse, Peter; Rieckmann, Peter; Zettl, Uwe K.; Zipp, Frauke; Bertram, Lars; Lill, Christina M; Fernandez, Oscar; Urbaneja, Patricia; Leyva, Laura; Alvarez-Cermeño, Jose Carlos; Arroyo, Rafael; Garagorri, Aroa M.; García-Martínez, Angel; Villar, Luisa M.; Urcelay, Elena; Malhotra, Sunny; Montalban, Xavier; Comabella, Manuel; Berger, Thomas; Fazekas, Franz; Reindl, Markus; Schmied, Mascha C.; Zimprich, Alexander; Vilariño-Güell, Carles

    2016-01-01

    Multiple sclerosis (MS) is a prevalent neurological disease of complex etiology. Here, we describe the characterization of a multi-incident MS family that nominated a rare missense variant (p.G420D) in plasminogen (PLG) as a putative genetic risk factor for MS. Genotyping of PLG p.G420D (rs139071351) in 2160 MS patients, and 886 controls from Canada, identified 10 additional probands, two sporadic patients and one control with the variant. Segregation in families harboring the rs139071351 variant, identified p.G420D in 26 out of 30 family members diagnosed with MS, 14 unaffected parents, and 12 out of 30 family members not diagnosed with disease. Despite considerably reduced penetrance, linkage analysis supports cosegregation of PLG p.G420D and disease. Genotyping of PLG p.G420D in 14446 patients, and 8797 controls from Canada, France, Spain, Germany, Belgium, and Austria failed to identify significant association with disease (P = 0.117), despite an overall higher prevalence in patients (OR = 1.32; 95% CI = 0.93–1.87). To assess whether additional rare variants have an effect on MS risk, we sequenced PLG in 293 probands, and genotyped all rare variants in cases and controls. This analysis identified nine rare missense variants, and although three of them were exclusively observed in MS patients, segregation does not support pathogenicity. PLG is a plausible biological candidate for MS owing to its involvement in immune system response, blood-brain barrier permeability, and myelin degradation. Moreover, components of its activation cascade have been shown to present increased activity or expression in MS patients compared to controls; further studies are needed to clarify whether PLG is involved in MS susceptibility. PMID:27194806

  13. Dual-pathway multi-echo sequence for simultaneous frequency and T2 mapping

    NASA Astrophysics Data System (ADS)

    Cheng, Cheng-Chieh; Mei, Chang-Sheng; Duryea, Jeffrey; Chung, Hsiao-Wen; Chao, Tzu-Cheng; Panych, Lawrence P.; Madore, Bruno

    2016-04-01

    Purpose: To present a dual-pathway multi-echo steady state sequence and reconstruction algorithm to capture T2, T2∗ and field map information. Methods: Typically, pulse sequences based on spin echoes are needed for T2 mapping while gradient echoes are needed for field mapping, making it difficult to jointly acquire both types of information. A dual-pathway multi-echo pulse sequence is employed here to generate T2 and field maps from the same acquired data. The approach might be used, for example, to obtain both thermometry and tissue damage information during thermal therapies, or susceptibility and T2 information from a same head scan, or to generate bonus T2 maps during a knee scan. Results: Quantitative T2, T2∗ and field maps were generated in gel phantoms, ex vivo bovine muscle, and twelve volunteers. T2 results were validated against a spin-echo reference standard: A linear regression based on ROI analysis in phantoms provided close agreement (slope/R2 = 0.99/0.998). A pixel-wise in vivo Bland-Altman analysis of R2 = 1/T2 showed a bias of 0.034 Hz (about 0.3%), as averaged over four volunteers. Ex vivo results, with and without motion, suggested that tissue damage detection based on T2 rather than temperature-dose measurements might prove more robust to motion. Conclusion: T2, T2∗ and field maps were obtained simultaneously, from the same datasets, in thermometry, susceptibility-weighted imaging and knee-imaging contexts.

  14. Association mapping to discover significant marker-trait associations for resistance against fusarium wilt variant 2 in pigeonpea [Cajanus cajan (L.) Millspaugh] using SSR markers.

    PubMed

    Patil, Prakash G; Dubey, Jyotirmay; Bohra, Abhishek; Mishra, R K; Saabale, P R; Das, Alok; Rathore, Meenal; Singh, N P

    2017-08-01

    Pigeonpea production is severely constrained by wilt disease caused by Fusarium udum. In the current study, we discover the putative genomic regions that control resistance response to variant 2 of fusarium wilt using association mapping approach. The association panel comprised of 89 diverse pigeonpea genotypes including seven varieties, three landraces and 79 germplasm lines. The panel was screened rigorously for 3 consecutive years (2013-14, 2014-15 and 2015-2016) against variant 2 in a wilt-sick field. A total of 65 pigeonpea specific hypervariable SSR markers (HASSRs) were screened representing seven linkage groups and 29 scaffolds of the pigeonpea genome. A total of 181 alleles were detected, with average values of gene diversity and polymorphism information content (PIC) of 0.55 and 0.47, respectively. Further analysis using model based (STRUCTURE) and distance based (clustering) approaches separated the entire pigeonpea collection into two distinct subgroups (K = 2). The marker trait associations (MTAs) were established based on three-year wilt incidence data and SSR dataset using a unified mixed linear model. Consequently, six SSR markers were identified, which were significantly associated with wilt resistance and explained up to 6% phenotypic variance (PV) across the years. Among these SSRs, HASSR18 was found to be the most stable and significant, accounting for 5-6% PV across the years. To the best of our knowledge, this is the first report of identification of favourable alleles for resistance to variant 2 of Fusarium udum in pigeonpea using association mapping. The SSR markers identified here will greatly facilitate marker assisted resistance breeding against fusarium wilt in pigeonpea.

  15. Fine-scale mapping of the 4q24 locus identifies two independent loci associated with breast cancer risk.

    PubMed

    Guo, Xingyi; Long, Jirong; Zeng, Chenjie; Michailidou, Kyriaki; Ghoussaini, Maya; Bolla, Manjeet K; Wang, Qin; Milne, Roger L; Shu, Xiao-Ou; Cai, Qiuyin; Beesley, Jonathan; Kar, Siddhartha P; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Beeghly-Fadiel, Alicia; Benitez, Javier; Blot, William; Bogdanova, Natalia; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Hui; Canisius, Sander; Chang-Claude, Jenny; Choi, Ji-Yeob; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Darabi, Hatef; Devilee, Peter; Droit, Arnaud; Dörk, Thilo; Fasching, Peter A; Fletcher, Olivia; Flyger, Henrik; Fostira, Florentia; Gaborieau, Valerie; García-Closas, Montserrat; Giles, Graham G; Grip, Mervi; Guénel, Pascal; Haiman, Christopher A; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Hsiung, Chia-Ni; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kabisch, Maria; Kang, Daehee; Khan, Sofia; Knight, Julia A; Kosma, Veli-Matti; Lambrechts, Diether; Le Marchand, Loic; Li, Jingmei; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McLean, Catriona A; Meindl, Alfons; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Nord, Silje; Olson, Janet E; Orr, Nick; Peterlongo, Paolo; Putti, Thomas Choudary; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Shen, Chen-Yang; Shi, Jiajun; Shrubsole, Martha J; Southey, Melissa C; Swerdlow, Anthony; Teo, Soo Hwang; Thienpont, Bernard; Toland, Amanda Ewart; Tollenaar, Robert A E M; Tomlinson, Ian P M; Truong, Thérèse; Tseng, Chiu-Chen; van den Ouweland, Ans; Wen, Wanqing; Winqvist, Robert; Wu, Anna; Yip, Cheng Har; Zamora, M Pilar; Zheng, Ying; Hall, Per; Pharoah, Paul D P; Simard, Jacques; Chenevix-Trench, Georgia; Dunning, Alison M; Easton, Douglas F; Zheng, Wei

    2015-11-01

    A recent association study identified a common variant (rs9790517) at 4q24 to be associated with breast cancer risk. Independent association signals and potential functional variants in this locus have not been explored. We conducted a fine-mapping analysis in 55,540 breast cancer cases and 51,168 controls from the Breast Cancer Association Consortium. Conditional analyses identified two independent association signals among women of European ancestry, represented by rs9790517 [conditional P = 2.51 × 10(-4); OR, 1.04; 95% confidence interval (CI), 1.02-1.07] and rs77928427 (P = 1.86 × 10(-4); OR, 1.04; 95% CI, 1.02-1.07). Functional annotation using data from the Encyclopedia of DNA Elements (ENCODE) project revealed two putative functional variants, rs62331150 and rs73838678 in linkage disequilibrium (LD) with rs9790517 (r(2) ≥ 0.90) residing in the active promoter or enhancer, respectively, of the nearest gene, TET2. Both variants are located in DNase I hypersensitivity and transcription factor-binding sites. Using data from both The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), we showed that rs62331150 was associated with level of expression of TET2 in breast normal and tumor tissue. Our study identified two independent association signals at 4q24 in relation to breast cancer risk and suggested that observed association in this locus may be mediated through the regulation of TET2. Fine-mapping study with large sample size warranted for identification of independent loci for breast cancer risk. ©2015 American Association for Cancer Research.

  16. Fine-scale mapping of the 4q24 locus identifies two independent loci associated with breast cancer risk

    PubMed Central

    Guo, Xingyi; Long, Jirong; Zeng, Chenjie; Michailidou, Kyriaki; Ghoussaini, Maya; Bolla, Manjeet K.; Wang, Qin; Milne, Roger L.; Shu, Xiao-Ou; Cai, Qiuyin; Beesley, Jonathan; Kar, Siddhartha P.; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Beeghly-Fadiel, Alicia; Benitez, Javier; Blot, William; Bogdanova, Natalia; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Hui; Canisius, Sander; Chang-Claude, Jenny; Choi, Ji-Yeob; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Darabi, Hatef; Devilee, Peter; Droit, Arnaud; Dörk, Thilo; Fasching, Peter A.; Fletcher, Olivia; Flyger, Henrik; Fostira, Florentia; Gaborieau, Valerie; García-Closas, Montserrat; Giles, Graham G.; Grip, Mervi; Guénel, Pascal; Haiman, Christopher A.; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L.; Hsiung, Chia-Ni; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kabisch, Maria; Kang, Daehee; Khan, Sofia; Knight, Julia A.; Kosma, Veli-Matti; Lambrechts, Diether; Marchand, Loic Le; Li, Jingmei; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McLean, Catriona A.; Meindl, Alfons; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Nord, Silje; Olson, Janet E.; Orr, Nick; Peterlongo, Paolo; Putti, Thomas Choudary; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Shen, Chen-Yang; Shi, Jiajun; Shrubsole, Martha J; Southey, Melissa C.; Swerdlow, Anthony; Teo, Soo Hwang; Thienpont, Bernard; Toland, Amanda Ewart; Tollenaar, Robert A.E.M.; Tomlinson, Ian P.M.; Truong, Thérèse; Tseng, Chiu-chen; van den Ouweland, Ans; Wen, Wanqing; Winqvist, Robert; Wu, Anna; Yip, Cheng Har; Zamora, M. Pilar; Zheng, Ying; Hall, Per; Pharoah, Paul D.P.; Simard, Jacques; Chenevix-Trench, Georgia; Dunning, Alison M.; Easton, Douglas F.; Zheng, Wei

    2015-01-01

    Background A recent association study identified a common variant (rs9790517) at 4q24 to be associated with breast cancer risk. Independent association signals and potential functional variants in this locus have not been explored. Methods We conducted a fine-mapping analysis in 55,540 breast cancer cases and 51,168 controls from the Breast Cancer Association Consortium. Results Conditional analyses identified two independent association signals among women of European ancestry, represented by rs9790517 (conditional p = 2.51 × 10−4; OR = 1.04; 95% CI 1.02–1.07) and rs77928427 (p = 1.86 × 10−4; OR = 1.04; 95% CI 1.02–1.07). Functional annotation using data from the Encyclopedia of DNA Elements (ENCODE) project revealed two putative functional variants, rs62331150 and rs73838678 in linkage disequilibrium (LD) with rs9790517 (r2 ≥ 0.90) residing in the active promoter or enhancer, respectively, of the nearest gene, TET2. Both variants are located in DNase I hypersensitivity and transcription factor binding sites. Using data from both The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), we showed that rs62331150 was associated with level of expression of TET2 in breast normal and tumor tissue. Conclusion Our study identified two independent association signals at 4q24 in relation to breast cancer risk and suggested that observed association in this locus may be mediated through the regulation of TET2. Impact Fine-mapping study with large sample size warranted for identification of independent loci for breast cancer risk. PMID:26354892

  17. Exploring DNA variant segregation types in pooled genome sequencing enables effective mapping of weeping trait in Malus

    USDA-ARS?s Scientific Manuscript database

    In recent years, next generation sequencing (NGS) based bulked segregant analysis (BSA) has become a powerful approach for allele discovery in non-model plant species. However, challenges remain, particular for out-crossing species with complex genomes. Here, the genetic control of a weeping bran...

  18. The Contribution of Mosaic Variants to Autism Spectrum Disorder.

    PubMed

    Freed, Donald; Pevsner, Jonathan

    2016-09-01

    De novo mutation is highly implicated in autism spectrum disorder (ASD). However, the contribution of post-zygotic mutation to ASD is poorly characterized. We performed both exome sequencing of paired samples and analysis of de novo variants from whole-exome sequencing of 2,388 families. While we find little evidence for tissue-specific mosaic mutation, multi-tissue post-zygotic mutation (i.e. mosaicism) is frequent, with detectable mosaic variation comprising 5.4% of all de novo mutations. We identify three mosaic missense and likely-gene disrupting mutations in genes previously implicated in ASD (KMT2C, NCKAP1, and MYH10) in probands but none in siblings. We find a strong ascertainment bias for mosaic mutations in probands relative to their unaffected siblings (p = 0.003). We build a model of de novo variation incorporating mosaic variants and errors in classification of mosaic status and from this model we estimate that 33% of mosaic mutations in probands contribute to 5.1% of simplex ASD diagnoses (95% credible interval 1.3% to 8.9%). Our results indicate a contributory role for multi-tissue mosaic mutation in some individuals with an ASD diagnosis.

  19. Transferability and Fine-Mapping of Genome-Wide Associated Loci for Adult Height across Human Populations

    PubMed Central

    Shriner, Daniel; Adeyemo, Adebowale; Gerry, Norman P.; Herbert, Alan; Chen, Guanjie; Doumatey, Ayo; Huang, Hanxia; Zhou, Jie; Christman, Michael F.; Rotimi, Charles N.

    2009-01-01

    Human height is the prototypical polygenic quantitative trait. Recently, several genetic variants influencing adult height were identified, primarily in individuals of East Asian (Chinese Han or Korean) or European ancestry. Here, we examined 152 genetic variants representing 107 independent loci previously associated with adult height for transferability in a well-powered sample of 1,016 unrelated African Americans. When we tested just the reported variants originally identified as associated with adult height in individuals of East Asian or European ancestry, only 8.3% of these loci transferred (p-values≤0.05 under an additive genetic model with directionally consistent effects) to our African American sample. However, when we comprehensively evaluated all HapMap variants in linkage disequilibrium (r 2≥0.3) with the reported variants, the transferability rate increased to 54.1%. The transferability rate was 70.8% for associations originally reported as genome-wide significant and 38.0% for associations originally reported as suggestive. An additional 23 loci were significantly associated but failed to transfer because of directionally inconsistent effects. Six loci were associated with adult height in all three groups. Using differences in linkage disequilibrium patterns between HapMap CEU or CHB reference data and our African American sample, we fine-mapped these six loci, improving both the localization and the annotation of these transferable associations. PMID:20027299

  20. Enrichment of colorectal cancer associations in functional regions: Insight for using epigenomics data in the analysis of whole genome sequence-imputed GWAS data.

    PubMed

    Bien, Stephanie A; Auer, Paul L; Harrison, Tabitha A; Qu, Conghui; Connolly, Charles M; Greenside, Peyton G; Chen, Sai; Berndt, Sonja I; Bézieau, Stéphane; Kang, Hyun M; Huyghe, Jeroen; Brenner, Hermann; Casey, Graham; Chan, Andrew T; Hopper, John L; Banbury, Barbara L; Chang-Claude, Jenny; Chanock, Stephen J; Haile, Robert W; Hoffmeister, Michael; Fuchsberger, Christian; Jenkins, Mark A; Leal, Suzanne M; Lemire, Mathieu; Newcomb, Polly A; Gallinger, Steven; Potter, John D; Schoen, Robert E; Slattery, Martha L; Smith, Joshua D; Le Marchand, Loic; White, Emily; Zanke, Brent W; Abeçasis, Goncalo R; Carlson, Christopher S; Peters, Ulrike; Nickerson, Deborah A; Kundaje, Anshul; Hsu, Li

    2017-01-01

    The evaluation of less frequent genetic variants and their effect on complex disease pose new challenges for genomic research. To investigate whether epigenetic data can be used to inform aggregate rare-variant association methods (RVAM), we assessed whether variants more significantly associated with colorectal cancer (CRC) were preferentially located in non-coding regulatory regions, and whether enrichment was specific to colorectal tissues. Active regulatory elements (ARE) were mapped using data from 127 tissues and cell-types from NIH Roadmap Epigenomics and Encyclopedia of DNA Elements (ENCODE) projects. We investigated whether CRC association p-values were more significant for common variants inside versus outside AREs, or 2) inside colorectal (CR) AREs versus AREs of other tissues and cell-types. We employed an integrative epigenomic RVAM for variants with allele frequency <1%. Gene sets were defined as ARE variants within 200 kilobases of a transcription start site (TSS) using either CR ARE or ARE from non-digestive tissues. CRC-set association p-values were used to evaluate enrichment of less frequent variant associations in CR ARE versus non-digestive ARE. ARE from 126/127 tissues and cell-types were significantly enriched for stronger CRC-variant associations. Strongest enrichment was observed for digestive tissues and immune cell types. CR-specific ARE were also enriched for stronger CRC-variant associations compared to ARE combined across non-digestive tissues (p-value = 9.6 × 10-4). Additionally, we found enrichment of stronger CRC association p-values for rare variant sets of CR ARE compared to non-digestive ARE (p-value = 0.029). Integrative epigenomic RVAM may enable discovery of less frequent variants associated with CRC, and ARE of digestive and immune tissues are most informative. Although distance-based aggregation of less frequent variants in CR ARE surrounding TSS showed modest enrichment, future association studies would likely benefit from joint analysis of transcriptomes and epigenomes to better link regulatory variation with target genes.

  1. Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

    PubMed

    Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V

    2012-02-17

    The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.

  2. Exceptions to the rule: case studies in the prediction of pathogenicity for genetic variants in hereditary cancer genes.

    PubMed

    Rosenthal, E T; Bowles, K R; Pruss, D; van Kan, A; Vail, P J; McElroy, H; Wenstrup, R J

    2015-12-01

    Based on current consensus guidelines and standard practice, many genetic variants detected in clinical testing are classified as disease causing based on their predicted impact on the normal expression or function of the gene in the absence of additional data. However, our laboratory has identified a subset of such variants in hereditary cancer genes for which compelling contradictory evidence emerged after the initial evaluation following the first observation of the variant. Three representative examples of variants in BRCA1, BRCA2 and MSH2 that are predicted to disrupt splicing, prematurely truncate the protein, or remove the start codon were evaluated for pathogenicity by analyzing clinical data with multiple classification algorithms. Available clinical data for all three variants contradicts the expected pathogenic classification. These variants illustrate potential pitfalls associated with standard approaches to variant classification as well as the challenges associated with monitoring data, updating classifications, and reporting potentially contradictory interpretations to the clinicians responsible for translating test outcomes to appropriate clinical action. It is important to address these challenges now as the model for clinical testing moves toward the use of large multi-gene panels and whole exome/genome analysis, which will dramatically increase the number of genetic variants identified. © 2015 The Authors. Clinical Genetics published by John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  3. A biomimetic, energy-harvesting, obstacle-avoiding, path-planning algorithm for UAVs

    NASA Astrophysics Data System (ADS)

    Gudmundsson, Snorri

    This dissertation presents two new approaches to energy harvesting for Unmanned Aerial Vehicles (UAV). One method is based on the Potential Flow Method (PFM); the other method seeds a wind-field map based on updraft peak analysis and then applies a variant of the Bellman-Ford algorithm to find the minimum-cost path. Both methods are enhanced by taking into account the performance characteristics of the aircraft using advanced performance theory. The combined approach yields five possible trajectories from which the one with the minimum energy cost is selected. The dissertation concludes by using the developed theory and modeling tools to simulate the flight paths of two small Unmanned Aerial Vehicles (sUAV) in the 500 kg and 250 kg class. The results show that, in mountainous regions, substantial energy can be recovered, depending on topography and wind characteristics. For the examples presented, as much as 50% of the energy was recovered for a complex, multi-heading, multi-altitude, 170 km mission in an average wind speed of 9 m/s. The algorithms constitute a Generic Intelligent Control Algorithm (GICA) for autonomous unmanned aerial vehicles that enables an extraction of atmospheric energy while completing a mission trajectory. At the same time, the algorithm. automatically adjusts the flight path in order to avoid obstacles, in a fashion not unlike what one would expect from living organisms, such as birds and insects. This multi-disciplinary approach renders the approach biomimetic, i.e. it constitutes a synthetic system that “mimics the formation and function of biological mechanisms and processes.”.

  4. Quantitative characterization of all single amino acid variants of a viral capsid-based drug delivery vehicle.

    PubMed

    Hartman, Emily C; Jakobson, Christopher M; Favor, Andrew H; Lobba, Marco J; Álvarez-Benedicto, Ester; Francis, Matthew B; Tullman-Ercek, Danielle

    2018-04-11

    Self-assembling proteins are critical to biological systems and industrial technologies, but predicting how mutations affect self-assembly remains a significant challenge. Here, we report a technique, termed SyMAPS (Systematic Mutation and Assembled Particle Selection), that can be used to characterize the assembly competency of all single amino acid variants of a self-assembling viral structural protein. SyMAPS studies on the MS2 bacteriophage coat protein revealed a high-resolution fitness landscape that challenges some conventional assumptions of protein engineering. An additional round of selection identified a previously unknown variant (CP[T71H]) that is stable at neutral pH but less tolerant to acidic conditions than the wild-type coat protein. The capsids formed by this variant could be more amenable to disassembly in late endosomes or early lysosomes-a feature that is advantageous for delivery applications. In addition to providing a mutability blueprint for virus-like particles, SyMAPS can be readily applied to other self-assembling proteins.

  5. Shared activity patterns arising at genetic susceptibility loci reveal underlying genomic and cellular architecture of human disease.

    PubMed

    Baillie, J Kenneth; Bretherick, Andrew; Haley, Christopher S; Clohisey, Sara; Gray, Alan; Neyton, Lucile P A; Barrett, Jeffrey; Stahl, Eli A; Tenesa, Albert; Andersson, Robin; Brown, J Ben; Faulkner, Geoffrey J; Lizio, Marina; Schaefer, Ulf; Daub, Carsten; Itoh, Masayoshi; Kondo, Naoto; Lassmann, Timo; Kawai, Jun; Mole, Damian; Bajic, Vladimir B; Heutink, Peter; Rehli, Michael; Kawaji, Hideya; Sandelin, Albin; Suzuki, Harukazu; Satsangi, Jack; Wells, Christine A; Hacohen, Nir; Freeman, Thomas C; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Hume, David A

    2018-03-01

    Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn's disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits.

  6. Shared activity patterns arising at genetic susceptibility loci reveal underlying genomic and cellular architecture of human disease

    PubMed Central

    Gray, Alan; Neyton, Lucile P. A.; Barrett, Jeffrey; Stahl, Eli A.; Tenesa, Albert; Andersson, Robin; Brown, J. Ben; Faulkner, Geoffrey J.; Lizio, Marina; Schaefer, Ulf; Daub, Carsten; Kondo, Naoto; Lassmann, Timo; Kawai, Jun; Kawaji, Hideya; Suzuki, Harukazu; Satsangi, Jack; Wells, Christine A.; Hacohen, Nir; Freeman, Thomas C.; Hayashizaki, Yoshihide; Forrest, Alistair R. R.; Hume, David A.

    2018-01-01

    Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn’s disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits. PMID:29494619

  7. Designing Antibacterial Peptides with Enhanced Killing Kinetics

    PubMed Central

    Waghu, Faiza H.; Joseph, Shaini; Ghawali, Sanket; Martis, Elvis A.; Madan, Taruna; Venkatesh, Kareenhalli V.; Idicula-Thomas, Susan

    2018-01-01

    Antimicrobial peptides (AMPs) are gaining attention as substitutes for antibiotics in order to combat the risk posed by multi-drug resistant pathogens. Several research groups are engaged in design of potent anti-infective agents using natural AMPs as templates. In this study, a library of peptides with high sequence similarity to Myeloid Antimicrobial Peptide (MAP) family were screened using popular online prediction algorithms. These peptide variants were designed in a manner to retain the conserved residues within the MAP family. The prediction algorithms were found to effectively classify peptides based on their antimicrobial nature. In order to improve the activity of the identified peptides, molecular dynamics (MD) simulations, using bilayer and micellar systems could be used to design and predict effect of residue substitution on membranes of microbial and mammalian cells. The inference from MD simulation studies well corroborated with the wet-lab observations indicating that MD-guided rational design could lead to discovery of potent AMPs. The effect of the residue substitution on membrane activity was studied in greater detail using killing kinetic analysis. Killing kinetics studies on Gram-positive, negative and human erythrocytes indicated that a single residue change has a drastic effect on the potency of AMPs. An interesting outcome was a switch from monophasic to biphasic death rate constant of Staphylococcus aureus due to a single residue mutation in the peptide. PMID:29527201

  8. Novel association of neurofibromatosis type 1-causing mutations in families with neurofibromatosis-Noonan syndrome.

    PubMed

    Ekvall, Sara; Sjörs, Kerstin; Jonzon, Anders; Vihinen, Mauno; Annerén, Göran; Bondeson, Marie-Louise

    2014-03-01

    Neurofibromatosis-Noonan syndrome (NFNS) is a rare condition with clinical features of both neurofibromatosis type 1 (NF1) and Noonan syndrome (NS). All three syndromes belong to the RASopathies, which are caused by dysregulation of the RAS-MAPK pathway. The major gene involved in NFNS is NF1, but co-occurring NF1 and PTPN11 mutations in NFNS have been reported. Knowledge about possible involvement of additional RASopathy-associated genes in NFNS is, however, very limited. We present a comprehensive clinical and molecular analysis of eight affected individuals from three unrelated families displaying features of NF1 and NFNS. The genetic etiology of the clinical phenotypes was investigated by mutation analysis, including NF1, PTPN11, SOS1, KRAS, NRAS, BRAF, RAF1, SHOC2, SPRED1, MAP2K1, MAP2K2, and CBL. All three families harbored a heterozygous NF1 variant, where the first family had a missense variant, c.5425C>T;p.R1809C, the second family a recurrent 4bp-deletion, c.6789_6792delTTAC;p.Y2264Tfs*6, and the third family a splice-site variant, c.2991-1G>A, resulting in skipping of exon 18 and an in-frame deletion of 41 amino acids. These NF1 variants have all previously been reported in NF1 patients. Surprisingly, both c.6789_6792delTTAC and c.2991-1G>A are frequently associated with NF1, but association to NFNS has, to our knowledge, not previously been reported. Our results support the notion that NFNS represents a variant of NF1, genetically distinct from NS, and is caused by mutations in NF1, some of which also cause classical NF1. Due to phenotypic overlap between NFNS and NS, we propose screening for NF1 mutations in NS patients, preferentially when café-au-lait spots are present. © 2013 Wiley Periodicals, Inc.

  9. Expression Variants of the Lipogenic AGPAT6 Gene Affect Diverse Milk Composition Phenotypes in Bos taurus

    PubMed Central

    Littlejohn, Mathew D.; Tiplady, Kathryn; Lopdell, Thomas; Law, Tania A.; Scott, Andrew; Harland, Chad; Sherlock, Ric; Henty, Kristen; Obolonkin, Vlad; Lehnert, Klaus; MacGibbon, Alistair; Spelman, Richard J.; Davis, Stephen R.; Snell, Russell G.

    2014-01-01

    Milk is composed of a complex mixture of lipids, proteins, carbohydrates and various vitamins and minerals as a source of nutrition for young mammals. The composition of milk varies between individuals, with lipid composition in particular being highly heritable. Recent reports have highlighted a region of bovine chromosome 27 harbouring variants affecting milk fat percentage and fatty acid content. We aimed to further investigate this locus in two independent cattle populations, consisting of a Holstein-Friesian x Jersey crossbreed pedigree of 711 F2 cows, and a collection of 32,530 mixed ancestry Bos taurus cows. Bayesian genome-wide association mapping using markers imputed from the Illumina BovineHD chip revealed a large quantitative trait locus (QTL) for milk fat percentage on chromosome 27, present in both populations. We also investigated a range of other milk composition phenotypes, and report additional associations at this locus for fat yield, protein percentage and yield, lactose percentage and yield, milk volume, and the proportions of numerous milk fatty acids. We then used mammary RNA sequence data from 212 lactating cows to assess the transcript abundance of genes located in the milk fat percentage QTL interval. This analysis revealed a strong eQTL for AGPAT6, demonstrating that high milk fat percentage genotype is also additively associated with increased expression of the AGPAT6 gene. Finally, we used whole genome sequence data from six F1 sires to target a panel of novel AGPAT6 locus variants for genotyping in the F2 crossbreed population. Association analysis of 58 of these variants revealed highly significant association for polymorphisms mapping to the 5′UTR exons and intron 1 of AGPAT6. Taken together, these data suggest that variants affecting the expression of AGPAT6 are causally involved in differential milk fat synthesis, with pleiotropic consequences for a diverse range of other milk components. PMID:24465687

  10. Mapping quorum sensing onto neural networks to understand collective decision making in heterogeneous microbial communities

    NASA Astrophysics Data System (ADS)

    Yusufaly, Tahir I.; Boedicker, James Q.

    2017-08-01

    Microbial communities frequently communicate via quorum sensing (QS), where cells produce, secrete, and respond to a threshold level of an autoinducer (AI) molecule, thereby modulating gene expression. However, the biology of QS remains incompletely understood in heterogeneous communities, where variant bacterial strains possess distinct QS systems that produce chemically unique AIs. AI molecules bind to ‘cognate’ receptors, but also to ‘non-cognate’ receptors found in other strains, resulting in inter-strain crosstalk. Understanding these interactions is a prerequisite for deciphering the consequences of crosstalk in real ecosystems, where multiple AIs are regularly present in the same environment. As a step towards this goal, we map crosstalk in a heterogeneous community of variant QS strains onto an artificial neural network model. This formulation allows us to systematically analyze how crosstalk regulates the community’s capacity for flexible decision making, as quantified by the Boltzmann entropy of all QS gene expression states of the system. In a mean-field limit of complete cross-inhibition between variant strains, the model is exactly solvable, allowing for an analytical formula for the number of variants that maximize capacity as a function of signal kinetics and activation parameters. An analysis of previous experimental results on the Staphylococcus aureus two-component Agr system indicates that the observed combination of variant numbers, gene expression rates and threshold concentrations lies near this critical regime of parameter space where capacity peaks. The results are suggestive of a potential evolutionary driving force for diversification in certain QS systems.

  11. Joint Identification of Genetic Variants for Physical Activity in Korean Population

    PubMed Central

    Kim, Jayoun; Kim, Jaehee; Min, Haesook; Oh, Sohee; Kim, Yeonjung; Lee, Andy H.; Park, Taesung

    2014-01-01

    There has been limited research on genome-wide association with physical activity (PA). This study ascertained genetic associations between PA and 344,893 single nucleotide polymorphism (SNP) markers in 8842 Korean samples. PA data were obtained from a validated questionnaire that included information on PA intensity and duration. Metabolic equivalent of tasks were calculated to estimate the total daily PA level for each individual. In addition to single- and multiple-SNP association tests, a pathway enrichment analysis was performed to identify the biological significance of SNP markers. Although no significant SNP was found at genome-wide significance level via single-SNP association tests, 59 genetic variants mapped to 76 genes were identified via a multiple SNP approach using a bootstrap selection stability measure. Pathway analysis for these 59 variants showed that maturity onset diabetes of the young (MODY) was enriched. Joint identification of SNPs could enable the identification of multiple SNPs with good predictive power for PA and a pathway enriched for PA. PMID:25026172

  12. Genetic fine-mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci

    PubMed Central

    Mahajan, Anubha; Locke, Adam; Rayner, N William; Robertson, Neil; Scott, Robert A; Prokopenko, Inga; Scott, Laura J; Green, Todd; Sparso, Thomas; Thuillier, Dorothee; Yengo, Loic; Grallert, Harald; Wahl, Simone; Frånberg, Mattias; Strawbridge, Rona J; Kestler, Hans; Chheda, Himanshu; Eisele, Lewin; Gustafsson, Stefan; Steinthorsdottir, Valgerdur; Thorleifsson, Gudmar; Qi, Lu; Karssen, Lennart C; van Leeuwen, Elisabeth M; Willems, Sara M; Li, Man; Chen, Han; Fuchsberger, Christian; Kwan, Phoenix; Ma, Clement; Linderman, Michael; Lu, Yingchang; Thomsen, Soren K; Rundle, Jana K; Beer, Nicola L; van de Bunt, Martijn; Chalisey, Anil; Kang, Hyun Min; Voight, Benjamin F; Abecasis, Goncalo R; Almgren, Peter; Baldassarre, Damiano; Balkau, Beverley; Benediktsson, Rafn; Blüher, Matthias; Boeing, Heiner; Bonnycastle, Lori L; Borringer, Erwin P; Burtt, Noël P; Carey, Jason; Charpentier, Guillaume; Chines, Peter S; Cornelis, Marilyn C; Couper, David J; Crenshaw, Andrew T; van Dam, Rob M; Doney, Alex SF; Dorkhan, Mozhgan; Edkins, Sarah; Eriksson, Johan G; Esko, Tonu; Eury, Elodie; Fadista, João; Flannick, Jason; Fontanillas, Pierre; Fox, Caroline; Franks, Paul W; Gertow, Karl; Gieger, Christian; Gigante, Bruna; Gottesman, Omri; Grant, George B; Grarup, Niels; Groves, Christopher J; Hassinen, Maija; Have, Christian T; Herder, Christian; Holmen, Oddgeir L; Hreidarsson, Astradur B; Humphries, Steve E; Hunter, David J; Jackson, Anne U; Jonsson, Anna; Jørgensen, Marit E; Jørgensen, Torben; Kerrison, Nicola D; Kinnunen, Leena; Klopp, Norman; Kong, Augustine; Kovacs, Peter; Kraft, Peter; Kravic, Jasmina; Langford, Cordelia; Leander, Karin; Liang, Liming; Lichtner, Peter; Lindgren, Cecilia M; Lindholm, Eero; Linneberg, Allan; Liu, Ching-Ti; Lobbens, Stéphane; Luan, Jian’an; Lyssenko, Valeriya; Männistö, Satu; McLeod, Olga; Meyer, Julia; Mihailov, Evelin; Mirza, Ghazala; Mühleisen, Thomas W; Müller-Nurasyid, Martina; Navarro, Carmen; Nöthen, Markus M; Oskolkov, Nikolay N; Owen, Katharine R; Palli, Domenico; Pechlivanis, Sonali; Perry, John RB; Platou, Carl GP; Roden, Michael; Ruderfer, Douglas; Rybin, Denis; van der Schouw, Yvonne T; Sennblad, Bengt; Sigurðsson, Gunnar; Stančáková, Alena; Steinbach, Gerald; Storm, Petter; Strauch, Konstantin; Stringham, Heather M; Sun, Qi; Thorand, Barbara; Tikkanen, Emmi; Tonjes, Anke; Trakalo, Joseph; Tremoli, Elena; Tuomi, Tiinamaija; Wennauer, Roman; Wood, Andrew R; Zeggini, Eleftheria; Dunham, Ian; Birney, Ewan; Pasquali, Lorenzo; Ferrer, Jorge; Loos, Ruth JF; Dupuis, Josée; Florez, Jose C; Boerwinkle, Eric; Pankow, James S; van Duijn, Cornelia; Sijbrands, Eric; Meigs, James B; Hu, Frank B; Thorsteinsdottir, Unnur; Stefansson, Kari; Lakka, Timo A; Rauramaa, Rainer; Stumvoll, Michael; Pedersen, Nancy L; Lind, Lars; Keinanen-Kiukaanniemi, Sirkka M; Korpi-Hyövälti, Eeva; Saaristo, Timo E; Saltevo, Juha; Kuusisto, Johanna; Laakso, Markku; Metspalu, Andres; Erbel, Raimund; Jöckel, Karl-Heinz; Moebus, Susanne; Ripatti, Samuli; Salomaa, Veikko; Ingelsson, Erik; Boehm, Bernhard O; Bergman, Richard N; Collins, Francis S; Mohlke, Karen L; Koistinen, Heikki; Tuomilehto, Jaakko; Hveem, Kristian; Njølstad, Inger; Deloukas, Panagiotis; Donnelly, Peter J; Frayling, Timothy M; Hattersley, Andrew T; de Faire, Ulf; Hamsten, Anders; Illig, Thomas; Peters, Annette; Cauchi, Stephane; Sladek, Rob; Froguel, Philippe; Hansen, Torben; Pedersen, Oluf; Morris, Andrew D; Palmer, Collin NA; Kathiresan, Sekar; Melander, Olle; Nilsson, Peter M; Groop, Leif C; Barroso, Inês; Langenberg, Claudia; Wareham, Nicholas J; O’Callaghan, Christopher A; Gloyn, Anna L; Altshuler, David; Boehnke, Michael; Teslovich, Tanya M; McCarthy, Mark I; Morris, Andrew P

    2015-01-01

    We performed fine-mapping of 39 established type 2 diabetes (T2D) loci in 27,206 cases and 57,574 controls of European ancestry. We identified 49 distinct association signals at these loci, including five mapping in/near KCNQ1. “Credible sets” of variants most likely to drive each distinct signal mapped predominantly to non-coding sequence, implying that T2D association is mediated through gene regulation. Credible set variants were enriched for overlap with FOXA2 chromatin immunoprecipitation binding sites in human islet and liver cells, including at MTNR1B, where fine-mapping implicated rs10830963 as driving T2D association. We confirmed that this T2D-risk allele increases FOXA2-bound enhancer activity in islet- and liver-derived cells. We observed allele-specific differences in NEUROD1 binding in islet-derived cells, consistent with evidence that the T2D-risk allele increases islet MTNR1B expression. Our study demonstrates how integration of genetic and genomic information can define molecular mechanisms through which variants underlying association signals exert their effects on disease. PMID:26551672

  13. Three-dimensional spatial analysis of missense variants in RTEL1 identifies pathogenic variants in patients with Familial Interstitial Pneumonia.

    PubMed

    Sivley, R Michael; Sheehan, Jonathan H; Kropski, Jonathan A; Cogan, Joy; Blackwell, Timothy S; Phillips, John A; Bush, William S; Meiler, Jens; Capra, John A

    2018-01-23

    Next-generation sequencing of individuals with genetic diseases often detects candidate rare variants in numerous genes, but determining which are causal remains challenging. We hypothesized that the spatial distribution of missense variants in protein structures contains information about function and pathogenicity that can help prioritize variants of unknown significance (VUS) and elucidate the structural mechanisms leading to disease. To illustrate this approach in a clinical application, we analyzed 13 candidate missense variants in regulator of telomere elongation helicase 1 (RTEL1) identified in patients with Familial Interstitial Pneumonia (FIP). We curated pathogenic and neutral RTEL1 variants from the literature and public databases. We then used homology modeling to construct a 3D structural model of RTEL1 and mapped known variants into this structure. We next developed a pathogenicity prediction algorithm based on proximity to known disease causing and neutral variants and evaluated its performance with leave-one-out cross-validation. We further validated our predictions with segregation analyses, telomere lengths, and mutagenesis data from the homologous XPD protein. Our algorithm for classifying RTEL1 VUS based on spatial proximity to pathogenic and neutral variation accurately distinguished 7 known pathogenic from 29 neutral variants (ROC AUC = 0.85) in the N-terminal domains of RTEL1. Pathogenic proximity scores were also significantly correlated with effects on ATPase activity (Pearson r = -0.65, p = 0.0004) in XPD, a related helicase. Applying the algorithm to 13 VUS identified from sequencing of RTEL1 from patients predicted five out of six disease-segregating VUS to be pathogenic. We provide structural hypotheses regarding how these mutations may disrupt RTEL1 ATPase and helicase function. Spatial analysis of missense variation accurately classified candidate VUS in RTEL1 and suggests how such variants cause disease. Incorporating spatial proximity analyses into other pathogenicity prediction tools may improve accuracy for other genes and genetic diseases.

  14. Fast progressive lower motor neuron disease is an ALS variant: A two-centre tract of interest-based MRI data analysis.

    PubMed

    Müller, Hans-Peter; Agosta, Federica; Riva, Nilo; Spinelli, Edoardo G; Comi, Giancarlo; Ludolph, Albert C; Filippi, Massimo; Kassubek, Jan

    2018-01-01

    The criteria for assessing upper motor neuron pathology in pure lower motor neuron disease (LMND) still remain a major issue of debate with respect to the clinical classification as an amyotrophic lateral sclerosis (ALS) variant. The study was designed to investigate white matter damage by a hypothesis-guided tract-of-interest-based approach in patients with LMND compared with healthy controls and ´classical´ ALS patients in order to identify in vivo brain structural changes according to the neuropathologically defined ALS affectation pattern. Data were pooled from two previous studies at two different study sites (Ulm, Germany and Milano, Italy). DTI-based white matter integrity mapping was performed by voxelwise statistical comparison and by a tractwise analysis of fractional anisotropy (FA) maps according to the ALS-staging pattern for 65 LMND patients (clinically differentiated in fast and slow progressors) vs. 92 matched controls and 101 ALS patients with a 'classical' phenotype to identify white matter structural alterations. The analysis of white matter structural connectivity by regional FA reductions demonstrated the characteristic alteration patterns along the CST and also in frontal and prefrontal brain areas in LMND patients compared to controls and ALS. Fast progressing LMND showed substantial involvement, like in ALS, while slow progressors showed less severe alterations. In the tract-specific analysis according to the ALS-staging pattern, fast progressing LMND showed significant alterations of ALS-related tract systems as compared to slow progressors and controls. This study showed an affectation pattern for corticoefferent fibers in LMND with fast disease progression as defined for ALS, that way confirming the hypothesis that fast progressing LMND is a phenotypical variant of ALS.

  15. Exome sequencing reveals novel genetic loci influencing obesity-related traits in Hispanic children

    USDA-ARS?s Scientific Manuscript database

    To perform whole exome sequencing in 928 Hispanic children and identify variants and genes associated with childhood obesity.Single-nucleotide variants (SNVs) were identified from Illumina whole exome sequencing data using integrated read mapping, variant calling, and an annotation pipeline (Mercury...

  16. Approaches in Characterizing Genetic Structure and Mapping in a Rice Multiparental Population.

    PubMed

    Raghavan, Chitra; Mauleon, Ramil; Lacorte, Vanica; Jubay, Monalisa; Zaw, Hein; Bonifacio, Justine; Singh, Rakesh Kumar; Huang, B Emma; Leung, Hei

    2017-06-07

    Multi-parent Advanced Generation Intercross (MAGIC) populations are fast becoming mainstream tools for research and breeding, along with the technology and tools for analysis. This paper demonstrates the analysis of a rice MAGIC population from data filtering to imputation and processing of genetic data to characterizing genomic structure, and finally quantitative trait loci (QTL) mapping. In this study, 1316 S6:8 indica MAGIC (MI) lines and the eight founders were sequenced using Genotyping by Sequencing (GBS). As the GBS approach often includes missing data, the first step was to impute the missing SNPs. The observable number of recombinations in the population was then explored. Based on this case study, a general outline of procedures for a MAGIC analysis workflow is provided, as well as for QTL mapping of agronomic traits and biotic and abiotic stress, using the results from both association and interval mapping approaches. QTL for agronomic traits (yield, flowering time, and plant height), physical (grain length and grain width) and cooking properties (amylose content) of the rice grain, abiotic stress (submergence tolerance), and biotic stress (brown spot disease) were mapped. Through presenting this extensive analysis in the MI population in rice, we highlight important considerations when choosing analytical approaches. The methods and results reported in this paper will provide a guide to future genetic analysis methods applied to multi-parent populations. Copyright © 2017 Raghavan et al.

  17. Genome-Wide Linkage and Regional Association Study of Blood Pressure Response to the Cold Pressor Test in Han Chinese: The GenSalt Study

    PubMed Central

    Yang, Xueli; Gu, Dongfeng; He, Jiang; Hixson, James E.; Rao, Dabeeru C.; Lu, Fanghong; Mu, Jianjun; Jaquish, Cashell E.; Chen, Jing; Huang, Jianfeng; Shimmin, Lawrence C.; Rice, Treva K.; Chen, Jichun; Wu, Xigui; Liu, Depei; Kelly, Tanika N.

    2014-01-01

    Background Blood pressure (BP) response to cold pressor test (CPT) is associated with increased risk of cardiovascular disease. We performed a genome-wide linkage scan and regional association analysis to identify genetic determinants of BP response to CPT. Methods and Results A total of 1,961 Chinese participants completed the CPT. Multipoint quantitative trait linkage analysis was performed, followed by single-marker and gene-based analyses of variants in promising linkage regions (logarithm of odds, LOD ≥ 2). A suggestive linkage signal was identified for systolic BP (SBP) response to CPT at 20p13-20p12.3, with a maximum multipoint LOD score of 2.37. Based on regional association analysis with 1,351 SNPs in the linkage region, we found that marker rs2326373 at 20p13 was significantly associated with mean arterial pressure (MAP) responses to CPT (P = 8.8×10−6) after FDR adjustment for multiple comparisons. A similar trend was also observed for SBP response (P = 0.03) and DBP response (P = 4.6×10−5). Results of gene-based analyses showed that variants in genes MCM8 and SLC23A2 were associated with SBP response to CPT (P = 4.0×10−5 and 2.7×10−4, respectively), and variants in genes MCM8 and STK35 were associated with MAP response to CPT (P = 1.5×10−5 and 5.0×10−5, respectively). Conclusions Within a suggestive linkage region on chromosome 20, we identified a novel variant associated with BP responses to CPT. We also found gene-based associations of MCM8, SLC23A2 and STK35 in this region. Further work is warranted to confirm these findings. Clinical Trial Registration URL: http://www.clinicaltrials.gov; Unique identifier: NCT00721721. PMID:25028485

  18. Computer assisted detection and analysis of tall cell variant papillary thyroid carcinoma in histological images

    NASA Astrophysics Data System (ADS)

    Kim, Edward; Baloch, Zubair; Kim, Caroline

    2015-03-01

    The number of new cases of thyroid cancer are dramatically increasing as incidences of this cancer have more than doubled since the early 1970s. Tall cell variant (TCV-PTC) papillary thyroid carcinoma is one type of thyroid cancer that is more aggressive and usually associated with higher local recurrence and distant metastasis. This variant can be identified through visual characteristics of cells in histological images. Thus, we created a fully automatic algorithm that is able to segment cells using a multi-stage approach. Our method learns the statistical characteristics of nuclei and cells during the segmentation process and utilizes this information for a more accurate result. Furthermore, we are able to analyze the detected regions and extract characteristic cell data that can be used to assist in clinical diagnosis.

  19. Consortium analysis of gene and gene-folate interactions in purine and pyrimidine metabolism pathways with ovarian carcinoma risk

    PubMed Central

    Kelemen, Linda E.; Terry, Kathryn L.; Goodman, Marc T.; Webb, Penelope M.; Bandera, Elisa V.; McGuire, Valerie; Rossing, Mary Anne; Wang, Qinggang; Dicks, Ed; Tyrer, Jonathan P.; Song, Honglin; Kupryjanczyk, Jolanta; Dansonka-Mieszkowska, Agnieszka; Plisiecka-Halasa, Joanna; Timorek, Agnieszka; Menon, Usha; Gentry-Maharaj, Aleksandra; Gayther, Simon A.; Ramus, Susan J.; Narod, Steven A.; Risch, Harvey A.; McLaughlin, John R.; Siddiqui, Nadeem; Glasspool, Rosalind; Paul, James; Carty, Karen; Gronwald, Jacek; Lubiński, Jan; Jakubowska, Anna; Cybulski, Cezary; Kiemeney, Lambertus A.; Massuger, Leon F. A. G.; van Altena, Anne M.; Aben, Katja K. H.; Olson, Sara H.; Orlow, Irene; Cramer, Daniel W.; Levine, Douglas A.; Bisogna, Maria; Giles, Graham G.; Southey, Melissa C.; Bruinsma, Fiona; Kjær, Susanne Krüger; Høgdall, Estrid; Jensen, Allan; Høgdall, Claus K.; Lundvall, Lene; Engelholm, Svend-Aage; Heitz, Florian; du Bois, Andreas; Harter, Philipp; Schwaab, Ira; Butzow, Ralf; Nevanlinna, Heli; Pelttari, Liisa M.; Leminen, Arto; Thompson, Pamela J.; Lurie, Galina; Wilkens, Lynne R.; Lambrechts, Diether; Van Nieuwenhuysen, Els; Lambrechts, Sandrina; Vergote, Ignace; Beesley, Jonathan; Fasching, Peter A.; Beckmann, Matthias W.; Hein, Alexander; Ekici, Arif B.; Doherty, Jennifer A.; Wu, Anna H.; Pearce, Celeste L.; Pike, Malcolm C.; Stram, Daniel; Chang-Claude, Jenny; Rudolph, Anja; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo B.; Bogdanova, Natalia; Antonenkova, Natalia; Odunsi, Kunle; Edwards, Robert P.; Kelley, Joseph L.; Modugno, Francesmary; Ness, Roberta B.; Karlan, Beth Y.; Walsh, Christine; Lester, Jenny; Orsulic, Sandra; Fridley, Brooke L.; Vierkant, Robert A.; Cunningham, Julie M.; Wu, Xifeng; Lu, Karen; Liang, Dong; Hildebrandt, Michelle A.T.; Weber, Rachel Palmieri; Iversen, Edwin S.; Tworoger, Shelley S.; Poole, Elizabeth M.; Salvesen, Helga B.; Krakstad, Camilla; Bjorge, Line; Tangen, Ingvild L.; Pejovic, Tanja; Bean, Yukie; Kellar, Melissa; Wentzensen, Nicolas; Brinton, Louise A.; Lissowska, Jolanta; Garcia-Closas, Montserrat; Campbell, Ian G.; Eccles, Diana; Whittemore, Alice S.; Sieh, Weiva; Rothstein, Joseph H.; Anton-Culver, Hoda; Ziogas, Argyrios; Phelan, Catherine M.; Moysich, Kirsten B.; Goode, Ellen L.; Schildkraut, Joellen M.; Berchuck, Andrew; Pharoah, Paul D.P.; Sellers, Thomas A.; Brooks-Wilson, Angela; Cook, Linda S.; Le, Nhu D.

    2014-01-01

    Scope We re-evaluated previously reported associations between variants in pathways of one-carbon (folate) transfer genes and ovarian carcinoma (OC) risk, and in related pathways of purine and pyrimidine metabolism, and assessed interactions with folate intake. Methods and Results Odds ratios (OR) for 446 genetic variants were estimated among 13,410 OC cases and 22,635 controls and among 2,281 cases and 3,444 controls with folate information. Following multiple testing correction, the most significant main effect associations were for DPYD variants rs11587873 (OR=0.92, P=6x10−5) and rs828054 (OR=1.06, P=1x10−4). Thirteen variants in the pyrimidine metabolism genes, DPYD, DPYS, PPAT and TYMS, also interacted significantly with folate in a multi-variant analysis (corrected P=9.9x10−6) but collectively explained only 0.2% of OC risk. Although no other associations were significant after multiple testing correction, variants in SHMT1 in one-carbon transfer, previously reported with OC, suggested lower risk at higher folate (Pinteraction=0.03-0.006). Conclusions Variation in pyrimidine metabolism genes, particularly DPYD, which was previously reported to be associated with OC, may influence risk; however, stratification by folate intake is unlikely to modify disease risk appreciably in these women. SHMT1 SNP-byfolate interactions are plausible but require further validation. Polymorphisms in selected genes in purine metabolism were not associated with OC. PMID:25066213

  20. Mobile Interspersed Repeats Are Major Structural Variants in the Human Genome

    PubMed Central

    Huang, Cheng Ran Lisa; Schneider, Anna M.; Lu, Yunqi; Niranjan, Tejasvi; Shen, Peilin; Robinson, Matoya A.; Steranka, Jared P.; Valle, David; Civin, Curt I.; Wang, Tao; Wheelan, Sarah J.; Ji, Hongkai; Boeke, Jef D.; Burns, Kathleen H.

    2010-01-01

    Summary Characterizing structural variants in the human genome is of great importance, but a genome wide analysis to detect interspersed repeats has not been done. Thus, the degree to which mobile DNAs contribute to genetic diversity, heritable disease, and oncogenesis remains speculative. We perform transposon insertion profiling by microarray (TIP-chip) to map human L1(Ta) retrotransposons (LINE-1 s) genome-wide. This identified numerous novel human L1(Ta) insertional polymorphisms with highly variant allelic frequencies. We also explored TIP-chip's usefulness to identify candidate alleles associated with different phenotypes in clinical cohorts. Our data suggest that the occurrence of new insertions is twice as high as previously estimated, and that these repeats are under-recognized as sources of human genomic and phenotypic diversity. We have just begun to probe the universe of human L1(Ta) polymorphisms, and as TIP-chip is applied to other insertions such as Alu SINEs, it will expand the catalog of genomic variants even further. PMID:20602999

  1. Expansion of the spectrum of ITGB6-related disorders to adolescent alopecia, dentogingival abnormalities and intellectual disability.

    PubMed

    Ansar, Muhammad; Jan, Abid; Santos-Cortez, Regie Lyn P; Wang, Xin; Suliman, Muhammad; Acharya, Anushree; Habib, Rabia; Abbe, Izoduwa; Ali, Ghazanfar; Lee, Kwanghyuk; Smith, Joshua D; Nickerson, Deborah A; Shendure, Jay; Bamshad, Michael J; Ahmad, Wasim; Leal, Suzanne M

    2016-08-01

    Alopecia with mental retardation (APMR) is a very rare disorder. In this study, we report on a consanguineous Pakistani family (AP91) with mild-to-moderate intellectual disability, adolescent alopecia and dentogingival abnormalities. Using homozygosity mapping, linkage analysis and exome sequencing, we identified a novel rare missense variant c.898G>A (p.(Glu300Lys)) in ITGB6, which co-segregates with the phenotype within the family and is predicted to be deleterious. Structural modeling shows that Glu300 lies in the β-propeller domain, and is surrounded by several residues that are important for heterodimerization with α integrin. Previous studies showed that ITGB6 variants can cause amelogenesis imperfecta in humans, but patients from family AP91 who are homozygous for the c.898G>A variant present with neurological and dermatological features, indicating a role for ITGB6 beyond enamel formation. Our study demonstrates that a rare deleterious variant within ITGB6 causes not only dentogingival anomalies but also intellectual disability and alopecia.

  2. Integrating multi-criteria decision analysis for a GIS-based hazardous waste landfill sitting in Kurdistan Province, western Iran

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sharifi, Mozafar; Hadidi, Mosslem; Vessali, Elahe

    2009-10-15

    The evaluation of a hazardous waste disposal site is a complicated process because it requires data from diverse social and environmental fields. These data often involve processing of a significant amount of spatial information which can be used by GIS as an important tool for land use suitability analysis. This paper presents a multi-criteria decision analysis alongside with a geospatial analysis for the selection of hazardous waste landfill sites in Kurdistan Province, western Iran. The study employs a two-stage analysis to provide a spatial decision support system for hazardous waste management in a typically under developed region. The purpose ofmore » GIS was to perform an initial screening process to eliminate unsuitable land followed by utilization of a multi-criteria decision analysis (MCDA) to identify the most suitable sites using the information provided by the regional experts with reference to new chosen criteria. Using 21 exclusionary criteria, as input layers, masked maps were prepared. Creating various intermediate or analysis map layers a final overlay map was obtained representing areas for hazardous waste landfill sites. In order to evaluate different landfill sites produced by the overlaying a landfill suitability index system was developed representing cumulative effects of relative importance (weights) and suitability values of 14 non-exclusionary criteria including several criteria resulting from field observation. Using this suitability index 15 different sites were visited and based on the numerical evaluation provided by MCDA most suitable sites were determined.« less

  3. Integrating multi-criteria decision analysis for a GIS-based hazardous waste landfill sitting in Kurdistan Province, western Iran.

    PubMed

    Sharifi, Mozafar; Hadidi, Mosslem; Vessali, Elahe; Mosstafakhani, Parasto; Taheri, Kamal; Shahoie, Saber; Khodamoradpour, Mehran

    2009-10-01

    The evaluation of a hazardous waste disposal site is a complicated process because it requires data from diverse social and environmental fields. These data often involve processing of a significant amount of spatial information which can be used by GIS as an important tool for land use suitability analysis. This paper presents a multi-criteria decision analysis alongside with a geospatial analysis for the selection of hazardous waste landfill sites in Kurdistan Province, western Iran. The study employs a two-stage analysis to provide a spatial decision support system for hazardous waste management in a typically under developed region. The purpose of GIS was to perform an initial screening process to eliminate unsuitable land followed by utilization of a multi-criteria decision analysis (MCDA) to identify the most suitable sites using the information provided by the regional experts with reference to new chosen criteria. Using 21 exclusionary criteria, as input layers, masked maps were prepared. Creating various intermediate or analysis map layers a final overlay map was obtained representing areas for hazardous waste landfill sites. In order to evaluate different landfill sites produced by the overlaying a landfill suitability index system was developed representing cumulative effects of relative importance (weights) and suitability values of 14 non-exclusionary criteria including several criteria resulting from field observation. Using this suitability index 15 different sites were visited and based on the numerical evaluation provided by MCDA most suitable sites were determined.

  4. Mapping Infected Area after a Flash-Flooding Storm Using Multi Criteria Analysis and Spectral Indices

    NASA Astrophysics Data System (ADS)

    Al-Akad, S.; Akensous, Y.; Hakdaoui, M.

    2017-11-01

    This research article is summarize the applications of remote sensing and GIS to study the urban floods risk in Al Mukalla. Satellite acquisition of a flood event on October 2015 in Al Mukalla (Yemen) by using flood risk mapping techniques illustrate the potential risk present in this city. Satellite images (The Landsat and DEM images data were atmospherically corrected, radiometric corrected, and geometric and topographic distortions rectified.) are used for flood risk mapping to afford a hazard (vulnerability) map. This map is provided by applying image-processing techniques and using geographic information system (GIS) environment also the application of NDVI, NDWI index, and a method to estimate the flood-hazard areas. Four factors were considered in order to estimate the spatial distribution of the hazardous areas: flow accumulation, slope, land use, geology and elevation. The multi-criteria analysis, allowing to deal with vulnerability to flooding, as well as mapping areas at the risk of flooding of the city Al Mukalla. The main object of this research is to provide a simple and rapid method to reduce and manage the risks caused by flood in Yemen by take as example the city of Al Mukalla.

  5. STORMSeq: an open-source, user-friendly pipeline for processing personal genomics data in the cloud.

    PubMed

    Karczewski, Konrad J; Fernald, Guy Haskin; Martin, Alicia R; Snyder, Michael; Tatonetti, Nicholas P; Dudley, Joel T

    2014-01-01

    The increasing public availability of personal complete genome sequencing data has ushered in an era of democratized genomics. However, read mapping and variant calling software is constantly improving and individuals with personal genomic data may prefer to customize and update their variant calls. Here, we describe STORMSeq (Scalable Tools for Open-Source Read Mapping), a graphical interface cloud computing solution that does not require a parallel computing environment or extensive technical experience. This customizable and modular system performs read mapping, read cleaning, and variant calling and annotation. At present, STORMSeq costs approximately $2 and 5-10 hours to process a full exome sequence and $30 and 3-8 days to process a whole genome sequence. We provide this open-access and open-source resource as a user-friendly interface in Amazon EC2.

  6. Family-based association testing strongly implicates DRD2 as a risk gene for schizophrenia in Han Chinese from Taiwan

    PubMed Central

    Glatt, SJ; Faraone, SV; Lasky-Su, JA; Kanazawa, T; Hwu, H-G; Tsuang, MT

    2009-01-01

    The gene that codes for dopamine receptor D2 (DRD2 on chromosome 11q23) has long been a prime functional and positional candidate risk gene for schizophrenia. Collectively, prior case–control studies found a reliable effect of the Ser311Cys DRD2 polymorphism (rs1801028) on risk for schizophrenia, but few other polymorphisms in the gene had ever been evaluated and no adequately powered family-based association study has been performed to date. Our objective was to test 21 haplotype-tagging and all three known nonsynonymous single-nucleotide polymorphisms (SNPs) in DRD2 for association with schizophrenia in a family-based study of 2408 Han Chinese, including 1214 affected individuals from 616 families. We did not find a significant effect of rs1801028, but we did find significant evidence for association of schizophrenia with two multi-marker haplotypes spanning blocks of strong linkage disequilibrium (LD) and nine individual SNPs (Ps < 0.05). Importantly, two SNPs (rs1079727 and rs2283265) and both multi-marker haplotypes spanning entire LD blocks (including one that contained rs1801028) remained significant after correcting for multiple testing. These results further add to the body of data implicating DRD2 as a schizophrenia risk gene; however, a causal variant(s) in DRD2 remains to be elucidated by further fine mapping of the gene, with particular attention given to the area surrounding the third through fifth exons. PMID:18332877

  7. Multi-ethnic genome-wide association study identifies novel locus for type 2 diabetes susceptibility

    PubMed Central

    Cook, James P; Morris, Andrew P

    2016-01-01

    Genome-wide association studies (GWAS) have traditionally been undertaken in homogeneous populations from the same ancestry group. However, with the increasing availability of GWAS in large-scale multi-ethnic cohorts, we have evaluated a framework for detecting association of genetic variants with complex traits, allowing for population structure, and developed a powerful test of heterogeneity in allelic effects between ancestry groups. We have applied the methodology to identify and characterise loci associated with susceptibility to type 2 diabetes (T2D) using GWAS data from the Resource for Genetic Epidemiology on Adult Health and Aging, a large multi-ethnic population-based cohort, created for investigating the genetic and environmental basis of age-related diseases. We identified a novel locus for T2D susceptibility at genome-wide significance (P<5 × 10−8) that maps to TOMM40-APOE, a region previously implicated in lipid metabolism and Alzheimer's disease. We have also confirmed previous reports that single-nucleotide polymorphisms at the TCF7L2 locus demonstrate the greatest extent of heterogeneity in allelic effects between ethnic groups, with the lowest risk observed in populations of East Asian ancestry. PMID:27189021

  8. Evaluating the association of common APOA2 variants with type 2 diabetes

    PubMed Central

    Duesing, Konsta; Charpentier, Guillaume; Marre, Michel; Tichet, Jean; Hercberg, Serge; Balkau, Beverley; Froguel, Philippe; Gibson, Fernando

    2009-01-01

    Background APOA2 is a positional and biological candidate gene for type 2 diabetes at the chromosome 1q21-q24 susceptibility locus. The aim of this study was to examine if HapMap phase II tag SNPs in APOA2 are associated with type 2 diabetes and quantitative traits in French Caucasian subjects. Methods We genotyped the three HapMap phase II tagging SNPs (rs6413453, rs5085 and rs5082) required to capture the common variation spanning the APOA2 locus in our type 2 diabetes case-control cohort comprising 3,093 French Caucasian subjects. The association between these variants and quantitative traits was also examined in the normoglycaemic adults of the control cohort. In addition, meta-analysis of publicly available whole genome association data was performed. Results None of the APOA2 tag SNPs were associated with type 2 diabetes in the French Caucasian case-control cohort (rs6413453, P = 0.619; rs5085, P = 0.245; rs5082, P = 0.591). However, rs5082 was marginally associated with total cholesterol levels (P = 0.026) and waist-to-hip ratio (P = 0.029). The meta-analysis of data from 12,387 subjects confirmed our finding that common variation at the APOA2 locus is not associated with type 2 diabetes. Conclusion The available data does not support a role for common variants in APOA2 on type 2 diabetes susceptibility or related quantitative traits in Northern Europeans. PMID:19216768

  9. Evaluating the association of common APOA2 variants with type 2 diabetes.

    PubMed

    Duesing, Konsta; Charpentier, Guillaume; Marre, Michel; Tichet, Jean; Hercberg, Serge; Balkau, Beverley; Froguel, Philippe; Gibson, Fernando

    2009-02-13

    APOA2 is a positional and biological candidate gene for type 2 diabetes at the chromosome 1q21-q24 susceptibility locus. The aim of this study was to examine if HapMap phase II tag SNPs in APOA2 are associated with type 2 diabetes and quantitative traits in French Caucasian subjects. We genotyped the three HapMap phase II tagging SNPs (rs6413453, rs5085 and rs5082) required to capture the common variation spanning the APOA2 locus in our type 2 diabetes case-control cohort comprising 3,093 French Caucasian subjects. The association between these variants and quantitative traits was also examined in the normoglycaemic adults of the control cohort. In addition, meta-analysis of publicly available whole genome association data was performed. None of the APOA2 tag SNPs were associated with type 2 diabetes in the French Caucasian case-control cohort (rs6413453, P = 0.619; rs5085, P = 0.245; rs5082, P = 0.591). However, rs5082 was marginally associated with total cholesterol levels (P = 0.026) and waist-to-hip ratio (P = 0.029). The meta-analysis of data from 12,387 subjects confirmed our finding that common variation at the APOA2 locus is not associated with type 2 diabetes. The available data does not support a role for common variants in APOA2 on type 2 diabetes susceptibility or related quantitative traits in Northern Europeans.

  10. Exome genotyping arrays to identify rare and low frequency variants associated with epithelial ovarian cancer risk

    PubMed Central

    Permuth, Jennifer B.; Pirie, Ailith; Ann Chen, Y.; Lin, Hui-Yi; Reid, Brett M.; Chen, Zhihua; Monteiro, Alvaro; Dennis, Joe; Mendoza-Fandino, Gustavo; Anton-Culver, Hoda; Bandera, Elisa V.; Bisogna, Maria; Brinton, Louise; Brooks-Wilson, Angela; Carney, Michael E.; Chenevix-Trench, Georgia; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; D’Aloisio, Aimee A.; Anne Doherty, Jennifer; Earp, Madalene; Edwards, Robert P.; Fridley, Brooke L.; Gayther, Simon A.; Gentry-Maharaj, Aleksandra; Goodman, Marc T.; Gronwald, Jacek; Hogdall, Estrid; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Karlan, Beth Y.; Kelemen, Linda E.; Kjaer, Suzanne K.; Kraft, Peter; Le, Nhu D.; Levine, Douglas A.; Lissowska, Jolanta; Lubinski, Jan; Matsuo, Keitaro; Menon, Usha; Modugno, Rosemary; Moysich, Kirsten B.; Nakanishi, Toru; Ness, Roberta B.; Olson, Sara; Orlow, Irene; Pearce, Celeste L.; Pejovic, Tanja; Poole, Elizabeth M.; Ramus, Susan J.; Anne Rossing, Mary; Sandler, Dale P.; Shu, Xiao-Ou; Song, Honglin; Taylor, Jack A.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J.; Tworoger, Shelley S.; Webb, Penelope M.; Wentzensen, Nicolas; Wilkens, Lynne R.; Winham, Stacey; Woo, Yin-Ling; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Phelan, Catherine M.; Schildkraut, Joellen M.; Berchuck, Andrew; Goode, Ellen L.; Pharoah, Paul D. P.; Sellers, Thomas A.

    2016-01-01

    Rare and low frequency variants are not well covered in most germline genotyping arrays and are understudied in relation to epithelial ovarian cancer (EOC) risk. To address this gap, we used genotyping arrays targeting rarer protein-coding variation in 8,165 EOC cases and 11,619 controls from the international Ovarian Cancer Association Consortium (OCAC). Pooled association analyses were conducted at the variant and gene level for 98,543 variants directly genotyped through two exome genotyping projects. Only common variants that represent or are in strong linkage disequilibrium (LD) with previously-identified signals at established loci reached traditional thresholds for exome-wide significance (P < 5.0 × 10 − 7). One of the most significant signals (Pall histologies = 1.01 × 10 − 13;Pserous = 3.54 × 10 − 14) occurred at 3q25.31 for rs62273959, a missense variant mapping to the LEKR1 gene that is in LD (r2 = 0.90) with a previously identified ‘best hit’ (rs7651446) mapping to an intron of TIPARP. Suggestive associations (5.0 × 10 − 5 > P≥5.0 ×10 − 7) were detected for rare and low-frequency variants at 16 novel loci. Four rare missense variants were identified (ACTBL2 rs73757391 (5q11.2), BTD rs200337373 (3p25.1), KRT13 rs150321809 (17q21.2) and MC2R rs104894658 (18p11.21)), but only MC2R rs104894668 had a large effect size (OR = 9.66). Genes most strongly associated with EOC risk included ACTBL2 (PAML = 3.23 × 10 − 5; PSKAT-o = 9.23 × 10 − 4) and KRT13 (PAML = 1.67 × 10 − 4; PSKAT-o = 1.07 × 10 − 5), reaffirming variant-level analysis. In summary, this large study identified several rare and low-frequency variants and genes that may contribute to EOC susceptibility, albeit with possible small effects. Future studies that integrate epidemiology, sequencing, and functional assays are needed to further unravel the unexplained heritability and biology of this disease. PMID:27378695

  11. CRISPR-directed mitotic recombination enables genetic mapping without crosses.

    PubMed

    Sadhu, Meru J; Bloom, Joshua S; Day, Laura; Kruglyak, Leonid

    2016-05-27

    Linkage and association studies have mapped thousands of genomic regions that contribute to phenotypic variation, but narrowing these regions to the underlying causal genes and variants has proven much more challenging. Resolution of genetic mapping is limited by the recombination rate. We developed a method that uses CRISPR (clustered, regularly interspaced, short palindromic repeats) to build mapping panels with targeted recombination events. We tested the method by generating a panel with recombination events spaced along a yeast chromosome arm, mapping trait variation, and then targeting a high density of recombination events to the region of interest. Using this approach, we fine-mapped manganese sensitivity to a single polymorphism in the transporter Pmr1. Targeting recombination events to regions of interest allows us to rapidly and systematically identify causal variants underlying trait differences. Copyright © 2016, American Association for the Advancement of Science.

  12. An Automated Pipeline for Engineering Many-Enzyme Pathways: Computational Sequence Design, Pathway Expression-Flux Mapping, and Scalable Pathway Optimization.

    PubMed

    Halper, Sean M; Cetnar, Daniel P; Salis, Howard M

    2018-01-01

    Engineering many-enzyme metabolic pathways suffers from the design curse of dimensionality. There are an astronomical number of synonymous DNA sequence choices, though relatively few will express an evolutionary robust, maximally productive pathway without metabolic bottlenecks. To solve this challenge, we have developed an integrated, automated computational-experimental pipeline that identifies a pathway's optimal DNA sequence without high-throughput screening or many cycles of design-build-test. The first step applies our Operon Calculator algorithm to design a host-specific evolutionary robust bacterial operon sequence with maximally tunable enzyme expression levels. The second step applies our RBS Library Calculator algorithm to systematically vary enzyme expression levels with the smallest-sized library. After characterizing a small number of constructed pathway variants, measurements are supplied to our Pathway Map Calculator algorithm, which then parameterizes a kinetic metabolic model that ultimately predicts the pathway's optimal enzyme expression levels and DNA sequences. Altogether, our algorithms provide the ability to efficiently map the pathway's sequence-expression-activity space and predict DNA sequences with desired metabolic fluxes. Here, we provide a step-by-step guide to applying the Pathway Optimization Pipeline on a desired multi-enzyme pathway in a bacterial host.

  13. GRL-09510, a Unique P2-Crown-Tetrahydrofuranylurethane -Containing HIV-1 Protease Inhibitor, Maintains Its Favorable Antiviral Activity against Highly-Drug-Resistant HIV-1 Variants in vitro

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Amano, Masayuki; Miguel Salcedo-Gómez, Pedro; Yedidi, Ravikiran S.

    We report that GRL-09510, a novel HIV-1 protease inhibitor (PI) containing a newly-generated P2-crown-tetrahydrofuranylurethane (Crwn-THF), a P2'-methoxybenzene, and a sulfonamide isostere, is highly active against laboratory and primary clinical HIV-1 isolates (EC50: 0.0014–0.0028 μM) with minimal cytotoxicity (CC50: 39.0 μM). Similarly, GRL-09510 efficiently blocked the replication of HIV-1NL4-3 variants, which were capable of propagating at high-concentrations of atazanavir, lopinavir, and amprenavir (APV). GRL-09510 was also potent against multi-drug-resistant clinical HIV-1 variants and HIV-2ROD. Under the selection condition, where HIV-1NL4-3 rapidly acquired significant resistance to APV, an integrase inhibitor raltegravir, and a GRL-09510 congener (GRL-09610), no variants highly resistant against GRL-09510more » emerged over long-term in vitro passage of the virus. Crystallographic analysis demonstrated that the Crwn-THF moiety of GRL-09510 forms strong hydrogen-bond-interactions with HIV-1 protease (PR) active-site amino acids and is bulkier with a larger contact surface, making greater van der Waals contacts with PR than the bis-THF moiety of darunavir. The present data demonstrate that GRL-09510 has favorable features for treating patients infected with wild-type and/or multi-drug-resistant HIV-1 variants, that the newly generated P2-Crwn-THF moiety confers highly desirable anti-HIV-1 potency. The use of the novel Crwn-THF moiety sheds lights in the design of novel PIs.« less

  14. VESsel GENeration Analysis (VESGEN): Innovative Vascular Mappings for Astronaut Exploration Health Risks and Human Terrestrial Medicine

    NASA Technical Reports Server (NTRS)

    Parsons-Wingerter, Patricia; Kao, David; Valizadegan, Hamed; Martin, Rodney; Murray, Matthew C.; Ramesh, Sneha; Sekaran, Srinivaas

    2017-01-01

    Currently, astronauts face significant health risks in future long-duration exploration missions such as colonizing the Moon and traveling to Mars. Numerous risks include greatly increased radiation exposures beyond the low earth orbit (LEO) of the ISS, and visual and ocular impairments in response to microgravity environments. The cardiovascular system is a key mediator in human physiological responses to radiation and microgravity. Moreover, blood vessels are necessarily involved in the progression and treatment of vascular-dependent terrestrial diseases such as cancer, coronary vessel disease, wound-healing, reproductive disorders, and diabetes. NASA developed an innovative, globally requested beta-level software, VESsel GENeration Analysis (VESGEN) to map and quantify vascular remodeling for application to astronaut and terrestrial health challenges. VESGEN mappings of branching vascular trees and networks are based on a weighted multi-parametric analysis derived from vascular physiological branching rules. Complex vascular branching patterns are determined by biological signaling mechanisms together with the fluid mechanics of multi-phase laminar blood flow.

  15. Association-heterogeneity mapping identifies an Asian-specific association of the GTF2I locus with rheumatoid arthritis

    PubMed Central

    Kim, Kwangwoo; Bang, So-Young; Ikari, Katsunori; Yoo, Dae Hyun; Cho, Soo-Kyung; Choi, Chan-Bum; Sung, Yoon-Kyoung; Kim, Tae-Hwan; Jun, Jae-Bum; Kang, Young Mo; Suh, Chang-Hee; Shim, Seung-Cheol; Lee, Shin-Seok; Lee, Jisoo; Chung, Won Tae; Kim, Seong-Kyu; Choe, Jung-Yoon; Momohara, Shigeki; Taniguchi, Atsuo; Yamanaka, Hisashi; Nath, Swapan K.; Lee, Hye-Soon; Bae, Sang-Cheol

    2016-01-01

    Considerable sharing of disease alleles among populations is well-characterized in autoimmune disorders (e.g., rheumatoid arthritis), but there are some exceptional loci showing heterogenic association among populations. Here we investigated genetic variants with distinct effects on the development of rheumatoid arthritis in Asian and European populations. Ancestry-related association heterogeneity was examined using Cochran’s homogeneity tests for the disease association data from large Asian (n = 14,465; 9,299 discovery subjects and 5,166 validation subjects; 4 collections) and European (n = 45,790; 11 collections) rheumatoid arthritis case-control cohorts with Immunochip and genome-wide SNP array data. We identified significant heterogeneity between the two ancestries for the common variants in the GTF2I locus (PHeterogeneity = 9.6 × 10−9 at rs73366469) and showed that this heterogeneity was due to an Asian-specific association effect (ORMeta = 1.37 and PMeta = 4.2 × 10−13 in Asians; ORMeta = 1.00 and PMeta = 1.00 in Europeans). Trans-ancestral comparison and bioinfomatics analysis revealed a plausibly causal or disease-variant-tagging SNP (rs117026326; in linkage disequilibrium with rs73366469), whose minor allele is common in Asians but rare in Europeans. In conclusion, we identified largest-ever effect on Asian rheumatoid arthritis across human non-HLA regions at GTF2I by heterogeneity mapping followed by replication studies, and pinpointed a possible causal variant. PMID:27272985

  16. Ataxia telangiectasia presenting as dopa-responsive cervical dystonia

    PubMed Central

    Mohire, Mahavir D.; Schneider, Susanne A.; Stamelou, Maria; Wood, Nicholas W.; Bhatia, Kailash P.

    2013-01-01

    Objective: To identify the cause of cervical dopa-responsive dystonia (DRD) in a Muslim Indian family inherited in an apparently autosomal recessive fashion, as previously described in this journal. Methods: Previous testing for mutations in the genes known to cause DRD (GCH1, TH, and SPR) had been negative. Whole exome sequencing was performed on all 3 affected individuals for whom DNA was available to identify potentially pathogenic shared variants. Genotyping data obtained for all 3 affected individuals using the OmniExpress single nucleotide polymorphism chip (Illumina, San Diego, CA) were used to perform linkage analysis, autozygosity mapping, and copy number variation analysis. Sanger sequencing was used to confirm all variants. Results: After filtering of the variants, exome sequencing revealed 2 genes harboring potentially pathogenic compound heterozygous variants (ATM and LRRC16A). Of these, the variants in ATM segregated perfectly with the cervical DRD. Both mutations detected in ATM have been shown to be pathogenic, and α-fetoprotein, a marker of ataxia telangiectasia, was increased in all affected individuals. Conclusion: Biallelic mutations in ATM can cause DRD, and mutations in this gene should be considered in the differential diagnosis of unexplained DRD, particularly if the dystonia is cervical and if there is a recessive family history. ATM has previously been reported to cause isolated cervical dystonia, but never, to our knowledge, DRD. Individuals with dystonia related to ataxia telangiectasia may benefit from a trial of levodopa. PMID:23946315

  17. Fine Mapping and Functional Analysis Reveal a Role of SLC22A1 in Acylcarnitine Transport.

    PubMed

    Kim, Hye In; Raffler, Johannes; Lu, Wenyun; Lee, Jung-Jin; Abbey, Deepti; Saleheen, Danish; Rabinowitz, Joshua D; Bennett, Michael J; Hand, Nicholas J; Brown, Christopher; Rader, Daniel J

    2017-10-05

    Genome-wide association studies have identified a signal at the SLC22A1 locus for serum acylcarnitines, intermediate metabolites of mitochondrial oxidation whose plasma levels associate with metabolic diseases. Here, we refined the association signal, performed conditional analyses, and examined the linkage structure to find coding variants of SLC22A1 that mediate independent association signals at the locus. We also employed allele-specific expression analysis to find potential regulatory variants of SLC22A1 and demonstrated the effect of one variant on the splicing of SLC22A1. SLC22A1 encodes a hepatic plasma membrane transporter whose role in acylcarnitine physiology has not been described. By targeted metabolomics and isotope tracing experiments in loss- and gain-of-function cell and mouse models of Slc22a1, we uncovered a role of SLC22A1 in the efflux of acylcarnitines from the liver to the circulation. We further validated the impacts of human variants on SLC22A1-mediated acylcarnitine efflux in vitro, explaining their association with serum acylcarnitine levels. Our findings provide the detailed molecular mechanisms of the GWAS association for serum acylcarnitines at the SLC22A1 locus by functionally validating the impact of SLC22A1 and its variants on acylcarnitine transport. Copyright © 2017. Published by Elsevier Inc.

  18. Multi-Temporal Multi-Sensor Analysis of Urbanization and Environmental/Climate Impact in China for Sustainable Urban Development

    NASA Astrophysics Data System (ADS)

    Ban, Yifang; Gong, Peng; Gamba, Paolo; Taubenbock, Hannes; Du, Peijun

    2016-08-01

    The overall objective of this research is to investigate multi-temporal, multi-scale, multi-sensor satellite data for analysis of urbanization and environmental/climate impact in China to support sustainable planning. Multi- temporal multi-scale SAR and optical data have been evaluated for urban information extraction using innovative methods and algorithms, including KTH- Pavia Urban Extractor, Pavia UEXT, and an "exclusion- inclusion" framework for urban extent extraction, and KTH-SEG, a novel object-based classification method for detailed urban land cover mapping. Various pixel- based and object-based change detection algorithms were also developed to extract urban changes. Several Chinese cities including Beijing, Shanghai and Guangzhou are selected as study areas. Spatio-temporal urbanization patterns and environmental impact at regional, metropolitan and city core were evaluated through ecosystem service, landscape metrics, spatial indices, and/or their combinations. The relationship between land surface temperature and land-cover classes was also analyzed.The urban extraction results showed that urban areas and small towns could be well extracted using multitemporal SAR data with the KTH-Pavia Urban Extractor and UEXT. The fusion of SAR data at multiple scales from multiple sensors was proven to improve urban extraction. For urban land cover mapping, the results show that the fusion of multitemporal SAR and optical data could produce detailed land cover maps with improved accuracy than that of SAR or optical data alone. Pixel-based and object-based change detection algorithms developed with the project were effective to extract urban changes. Comparing the urban land cover results from mulitemporal multisensor data, the environmental impact analysis indicates major losses for food supply, noise reduction, runoff mitigation, waste treatment and global climate regulation services through landscape structural changes in terms of decreases in service area, edge contamination and fragmentation. In terms ofclimate impact, the results indicate that land surface temperature can be related to land use/land cover classes.

  19. A chromosome 10 variant with a 12 Mb inversion [inv(10)(q11.22q21.1)] identical by descent and frequent in the Swedish population.

    PubMed

    Entesarian, Miriam; Carlsson, Birgit; Mansouri, Mahmoud Reza; Stattin, Eva-Lena; Holmberg, Eva; Golovleva, Irina; Stefansson, Hreinn; Klar, Joakim; Dahl, Niklas

    2009-03-01

    We identified a paracentric inversion of chromosome 10 [inv(10)(q11.22q21.1)] in 0.20% of Swedish individuals (15/7,439) referred for cytogenetic analysis. A retrospective analysis of 8,896 karyotypes from amniocenteses in Sweden revealed a carrier frequency of 0.079% (7/8,896) for the inversion. Cloning and detailed analysis of the inversion breakpoint regions show enrichment for interspersed repeat elements and AT-stretches. The centromeric breakpoint coincides with that of a predicted inversion from HapMap data, which suggests that this region is involved in several chromosome 10 variants. No known gene or predicted transcript are disrupted by the inversion which spans approximately 12 Mb. Carriers from four non-related Swedish families have identical inversion breakpoints and haplotype analysis confirmed that the rearrangement is identical by descent. Diagnosis was retrieved in 6 out of the 15 carriers referred for cytogenetic analysis. No consistent phenotype was found to be associated with the inversion. Our study demonstrates that the inv(10)(q11.22q21.1) is a rare and inherited chromosome variant with a broad geographical distribution in Sweden. 2009 Wiley-Liss, Inc.

  20. Biphasic responses in multi-site phosphorylation systems.

    PubMed

    Suwanmajo, Thapanar; Krishnan, J

    2013-12-06

    Multi-site phosphorylation systems are repeatedly encountered in cellular biology and multi-site modification is a basic building block of post-translational modification. In this paper, we demonstrate how distributive multi-site modification mechanisms by a single kinase/phosphatase pair can lead to biphasic/partial biphasic dose-response characteristics for the maximally phosphorylated substrate at steady state. We use simulations and analysis to uncover a hidden competing effect which is responsible for this and analyse how it may be accentuated. We build on this to analyse different variants of multi-site phosphorylation mechanisms showing that some mechanisms are intrinsically not capable of displaying this behaviour. This provides both a consolidated understanding of how and under what conditions biphasic responses are obtained in multi-site phosphorylation and a basis for discriminating between different mechanisms based on this. We also demonstrate how this behaviour may be combined with other behaviour such as threshold and bistable responses, demonstrating the capacity of multi-site phosphorylation systems to act as complex molecular signal processors.

  1. Mutation of ATF6 causes autosomal recessive achromatopsia.

    PubMed

    Ansar, Muhammad; Santos-Cortez, Regie Lyn P; Saqib, Muhammad Arif Nadeem; Zulfiqar, Fareeha; Lee, Kwanghyuk; Ashraf, Naeem Mahmood; Ullah, Ehsan; Wang, Xin; Sajid, Sundus; Khan, Falak Sher; Amin-ud-Din, Muhammad; Smith, Joshua D; Shendure, Jay; Bamshad, Michael J; Nickerson, Deborah A; Hameed, Abdul; Riazuddin, Saima; Ahmed, Zubair M; Ahmad, Wasim; Leal, Suzanne M

    2015-09-01

    Achromatopsia (ACHM) is an early-onset retinal dystrophy characterized by photophobia, nystagmus, color blindness and severely reduced visual acuity. Currently mutations in five genes CNGA3, CNGB3, GNAT2, PDE6C and PDE6H have been implicated in ACHM. We performed homozygosity mapping and linkage analysis in a consanguineous Pakistani ACHM family and mapped the locus to a 15.12-Mb region on chromosome 1q23.1-q24.3 with a maximum LOD score of 3.6. A DNA sample from an affected family member underwent exome sequencing. Within the ATF6 gene, a single-base insertion variant c.355_356dupG (p.Glu119Glyfs*8) was identified, which completely segregates with the ACHM phenotype within the family. The frameshift variant was absent in public variant databases, in 130 exomes from unrelated Pakistani individuals, and in 235 ethnically matched controls. The variant is predicted to result in a truncated protein that lacks the DNA binding and transmembrane domains and therefore affects the function of ATF6 as a transcription factor that initiates the unfolded protein response during endoplasmic reticulum (ER) stress. Immunolabeling with anti-ATF6 antibodies showed localization throughout the mouse neuronal retina, including retinal pigment epithelium, photoreceptor cells, inner nuclear layer, inner and outer plexiform layers, with a more prominent signal in retinal ganglion cells. In contrast to cytoplasmic expression of wild-type protein, in heterologous cells ATF6 protein with the p.Glu119Glyfs*8 variant is mainly confined to the nucleus. Our results imply that response to ER stress as mediated by the ATF6 pathway is essential for color vision in humans.

  2. Averaged ratio between complementary profiles for evaluating shape distortions of map projections and spherical hierarchical tessellations

    NASA Astrophysics Data System (ADS)

    Yan, Jin; Song, Xiao; Gong, Guanghong

    2016-02-01

    We describe a metric named averaged ratio between complementary profiles to represent the distortion of map projections, and the shape regularity of spherical cells derived from map projections or non-map-projection methods. The properties and statistical characteristics of our metric are investigated. Our metric (1) is a variable of numerical equivalence to both scale component and angular deformation component of Tissot indicatrix, and avoids the invalidation when using Tissot indicatrix and derived differential calculus for evaluating non-map-projection based tessellations where mathematical formulae do not exist (e.g., direct spherical subdivisions), (2) exhibits simplicity (neither differential nor integral calculus) and uniformity in the form of calculations, (3) requires low computational cost, while maintaining high correlation with the results of differential calculus, (4) is a quasi-invariant under rotations, and (5) reflects the distortions of map projections, distortion of spherical cells, and the associated distortions of texels. As an indicator of quantitative evaluation, we investigated typical spherical tessellation methods, some variants of tessellation methods, and map projections. The tessellation methods we evaluated are based on map projections or direct spherical subdivisions. The evaluation involves commonly used Platonic polyhedrons, Catalan polyhedrons, etc. Quantitative analyses based on our metric of shape regularity and an essential metric of area uniformity implied that (1) Uniform Spherical Grids and its variant show good qualities in both area uniformity and shape regularity, and (2) Crusta, Unicube map, and a variant of Unicube map exhibit fairly acceptable degrees of area uniformity and shape regularity.

  3. Site Map | USDA Plant Hardiness Zone Map

    Science.gov Websites

    Acknowledgments & Citation Copyright Map & Data Downloads Map Downloads Geography (GIS) Downloads Multi ; Citation Copyright Map & Data Downloads Map Downloads Geography (GIS) Downloads Multi-ZIP Code Finder

  4. Application of the High Resolution Melting analysis for genetic mapping of Sequence Tagged Site markers in narrow-leafed lupin (Lupinus angustifolius L.).

    PubMed

    Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech

    2015-01-01

    Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.

  5. Use of Genome Sequence Information for Meat Quality Trait QTL Mining for Causal Genes and Mutations on Pig Chromosome 17

    PubMed Central

    Hu, Zhi-Liang; Ramos, Antonio M.; Humphray, Sean J.; Rogers, Jane; Reecy, James M.; Rothschild, Max F.

    2011-01-01

    The newly available pig genome sequence has provided new information to fine map quantitative trait loci (QTL) in order to eventually identify causal variants. With targeted genomic sequencing efforts, we were able to obtain high quality BAC sequences that cover a region on pig chromosome 17 where a number of meat quality QTL have been previously discovered. Sequences from 70 BAC clones were assembled to form an 8-Mbp contig. Subsequently, we successfully mapped five previously identified QTL, three for meat color and two for lactate related traits, to the contig. With an additional 25 genetic markers that were identified by sequence comparison, we were able to carry out further linkage disequilibrium analysis to narrow down the genomic locations of these QTL, which allowed identification of the chromosomal regions that likely contain the causative variants. This research has provided one practical approach to combine genetic and molecular information for QTL mining. PMID:22303339

  6. GENEASE: Real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.

    PubMed

    Ghandikota, Sudhir; Hershey, Gurjit K Khurana; Mersha, Tesfaye B

    2018-03-24

    Advances in high-throughput sequencing technologies have made it possible to generate multiple omics data at an unprecedented rate and scale. The accumulation of these omics data far outpaces the rate at which biologists can mine and generate new hypothesis to test experimentally. There is an urgent need to develop a myriad of powerful tools to efficiently and effectively search and filter these resources to address specific post-GWAS functional genomics questions. However, to date, these resources are scattered across several databases and often lack a unified portal for data annotation and analytics. In addition, existing tools to analyze and visualize these databases are highly fragmented, resulting researchers to access multiple applications and manual interventions for each gene or variant in an ad hoc fashion until all the questions are answered. In this study, we present GENEASE, a web-based one-stop bioinformatics tool designed to not only query and explore multi-omics and phenotype databases (e.g., GTEx, ClinVar, dbGaP, GWAS Catalog, ENCODE, Roadmap Epigenomics, KEGG, Reactome, Gene and Phenotype Ontology) in a single web interface but also to perform seamless post genome-wide association downstream functional and overlap analysis for non-coding regulatory variants. GENEASE accesses over 50 different databases in public domain including model organism-specific databases to facilitate gene/variant and disease exploration, enrichment and overlap analysis in real time. It is a user-friendly tool with point-and-click interface containing links for support information including user manual and examples. GENEASE can be accessed freely at http://research.cchmc.org/mershalab/genease_new/login.html. Tesfaye.Mersha@cchmc.org, Sudhir.Ghandikota@cchmc.org. Supplementary data are available at Bioinformatics online.

  7. Rare Variant Association Test with Multiple Phenotypes

    PubMed Central

    Lee, Selyeong; Won, Sungho; Kim, Young Jin; Kim, Yongkang; Kim, Bong-Jo; Park, Taesung

    2016-01-01

    Although genome-wide association studies (GWAS) have now discovered thousands of genetic variants associated with common traits, such variants cannot explain the large degree of “missing heritability,” likely due to rare variants. The advent of next generation sequencing technology has allowed rare variant detection and association with common traits, often by investigating specific genomic regions for rare variant effects on a trait. Although multiply correlated phenotypes are often concurrently observed in GWAS, most studies analyze only single phenotypes, which may lessen statistical power. To increase power, multivariate analyses, which consider correlations between multiple phenotypes, can be used. However, few existing multi-variant analyses can identify rare variants for assessing multiple phenotypes. Here, we propose Multivariate Association Analysis using Score Statistics (MAAUSS), to identify rare variants associated with multiple phenotypes, based on the widely used Sequence Kernel Association Test (SKAT) for a single phenotype. We applied MAAUSS to Whole Exome Sequencing (WES) data from a Korean population of 1,058 subjects, to discover genes associated with multiple traits of liver function. We then assessed validation of those genes by a replication study, using an independent dataset of 3,445 individuals. Notably, we detected the gene ZNF620 among five significant genes. We then performed a simulation study to compare MAAUSS's performance with existing methods. Overall, MAAUSS successfully conserved type 1 error rates and in many cases, had a higher power than the existing methods. This study illustrates a feasible and straightforward approach for identifying rare variants correlated with multiple phenotypes, with likely relevance to missing heritability. PMID:28039885

  8. Mapping moderate-scale land-cover over very large geographic areas within a collaborative framework: A case study of the Southwest Regional Gap Analysis Project (SWReGAP)

    USGS Publications Warehouse

    Lowry, J.; Ramsey, R.D.; Thomas, K.; Schrupp, D.; Sajwaj, T.; Kirby, J.; Waller, E.; Schrader, S.; Falzarano, S.; Langs, L.; Manis, G.; Wallace, C.; Schulz, K.; Comer, P.; Pohs, K.; Rieth, W.; Velasquez, C.; Wolk, B.; Kepner, W.; Boykin, K.; O'Brien, L.; Bradford, D.; Thompson, B.; Prior-Magee, J.

    2007-01-01

    Land-cover mapping efforts within the USGS Gap Analysis Program have traditionally been state-centered; each state having the responsibility of implementing a project design for the geographic area within their state boundaries. The Southwest Regional Gap Analysis Project (SWReGAP) was the first formal GAP project designed at a regional, multi-state scale. The project area comprises the southwestern states of Arizona, Colorado, Nevada, New Mexico, and Utah. The land-cover map/dataset was generated using regionally consistent geospatial data (Landsat ETM+ imagery (1999-2001) and DEM derivatives), similar field data collection protocols, a standardized land-cover legend, and a common modeling approach (decision tree classifier). Partitioning of mapping responsibilities amongst the five collaborating states was organized around ecoregion-based "mapping zones". Over the course of 21/2 field seasons approximately 93,000 reference samples were collected directly, or obtained from other contemporary projects, for the land-cover modeling effort. The final map was made public in 2004 and contains 125 land-cover classes. An internal validation of 85 of the classes, representing 91% of the land area was performed. Agreement between withheld samples and the validated dataset was 61% (KHAT = .60, n = 17,030). This paper presents an overview of the methodologies used to create the regional land-cover dataset and highlights issues associated with large-area mapping within a coordinated, multi-institutional management framework. ?? 2006 Elsevier Inc. All rights reserved.

  9. Manifold absolute pressure estimation using neural network with hybrid training algorithm

    PubMed Central

    Selamat, Hazlina; Alimin, Ahmad Jais; Haniff, Mohamad Fadzli

    2017-01-01

    In a modern small gasoline engine fuel injection system, the load of the engine is estimated based on the measurement of the manifold absolute pressure (MAP) sensor, which took place in the intake manifold. This paper present a more economical approach on estimating the MAP by using only the measurements of the throttle position and engine speed, resulting in lower implementation cost. The estimation was done via two-stage multilayer feed-forward neural network by combining Levenberg-Marquardt (LM) algorithm, Bayesian Regularization (BR) algorithm and Particle Swarm Optimization (PSO) algorithm. Based on the results found in 20 runs, the second variant of the hybrid algorithm yields a better network performance than the first variant of hybrid algorithm, LM, LM with BR and PSO by estimating the MAP closely to the simulated MAP values. By using a valid experimental training data, the estimator network that trained with the second variant of the hybrid algorithm showed the best performance among other algorithms when used in an actual retrofit fuel injection system (RFIS). The performance of the estimator was also validated in steady-state and transient condition by showing a closer MAP estimation to the actual value. PMID:29190779

  10. Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci.

    PubMed

    Gaulton, Kyle J; Ferreira, Teresa; Lee, Yeji; Raimondo, Anne; Mägi, Reedik; Reschen, Michael E; Mahajan, Anubha; Locke, Adam; Rayner, N William; Robertson, Neil; Scott, Robert A; Prokopenko, Inga; Scott, Laura J; Green, Todd; Sparso, Thomas; Thuillier, Dorothee; Yengo, Loic; Grallert, Harald; Wahl, Simone; Frånberg, Mattias; Strawbridge, Rona J; Kestler, Hans; Chheda, Himanshu; Eisele, Lewin; Gustafsson, Stefan; Steinthorsdottir, Valgerdur; Thorleifsson, Gudmar; Qi, Lu; Karssen, Lennart C; van Leeuwen, Elisabeth M; Willems, Sara M; Li, Man; Chen, Han; Fuchsberger, Christian; Kwan, Phoenix; Ma, Clement; Linderman, Michael; Lu, Yingchang; Thomsen, Soren K; Rundle, Jana K; Beer, Nicola L; van de Bunt, Martijn; Chalisey, Anil; Kang, Hyun Min; Voight, Benjamin F; Abecasis, Gonçalo R; Almgren, Peter; Baldassarre, Damiano; Balkau, Beverley; Benediktsson, Rafn; Blüher, Matthias; Boeing, Heiner; Bonnycastle, Lori L; Bottinger, Erwin P; Burtt, Noël P; Carey, Jason; Charpentier, Guillaume; Chines, Peter S; Cornelis, Marilyn C; Couper, David J; Crenshaw, Andrew T; van Dam, Rob M; Doney, Alex S F; Dorkhan, Mozhgan; Edkins, Sarah; Eriksson, Johan G; Esko, Tonu; Eury, Elodie; Fadista, João; Flannick, Jason; Fontanillas, Pierre; Fox, Caroline; Franks, Paul W; Gertow, Karl; Gieger, Christian; Gigante, Bruna; Gottesman, Omri; Grant, George B; Grarup, Niels; Groves, Christopher J; Hassinen, Maija; Have, Christian T; Herder, Christian; Holmen, Oddgeir L; Hreidarsson, Astradur B; Humphries, Steve E; Hunter, David J; Jackson, Anne U; Jonsson, Anna; Jørgensen, Marit E; Jørgensen, Torben; Kao, Wen-Hong L; Kerrison, Nicola D; Kinnunen, Leena; Klopp, Norman; Kong, Augustine; Kovacs, Peter; Kraft, Peter; Kravic, Jasmina; Langford, Cordelia; Leander, Karin; Liang, Liming; Lichtner, Peter; Lindgren, Cecilia M; Lindholm, Eero; Linneberg, Allan; Liu, Ching-Ti; Lobbens, Stéphane; Luan, Jian'an; Lyssenko, Valeriya; Männistö, Satu; McLeod, Olga; Meyer, Julia; Mihailov, Evelin; Mirza, Ghazala; Mühleisen, Thomas W; Müller-Nurasyid, Martina; Navarro, Carmen; Nöthen, Markus M; Oskolkov, Nikolay N; Owen, Katharine R; Palli, Domenico; Pechlivanis, Sonali; Peltonen, Leena; Perry, John R B; Platou, Carl G P; Roden, Michael; Ruderfer, Douglas; Rybin, Denis; van der Schouw, Yvonne T; Sennblad, Bengt; Sigurðsson, Gunnar; Stančáková, Alena; Steinbach, Gerald; Storm, Petter; Strauch, Konstantin; Stringham, Heather M; Sun, Qi; Thorand, Barbara; Tikkanen, Emmi; Tonjes, Anke; Trakalo, Joseph; Tremoli, Elena; Tuomi, Tiinamaija; Wennauer, Roman; Wiltshire, Steven; Wood, Andrew R; Zeggini, Eleftheria; Dunham, Ian; Birney, Ewan; Pasquali, Lorenzo; Ferrer, Jorge; Loos, Ruth J F; Dupuis, Josée; Florez, Jose C; Boerwinkle, Eric; Pankow, James S; van Duijn, Cornelia; Sijbrands, Eric; Meigs, James B; Hu, Frank B; Thorsteinsdottir, Unnur; Stefansson, Kari; Lakka, Timo A; Rauramaa, Rainer; Stumvoll, Michael; Pedersen, Nancy L; Lind, Lars; Keinanen-Kiukaanniemi, Sirkka M; Korpi-Hyövälti, Eeva; Saaristo, Timo E; Saltevo, Juha; Kuusisto, Johanna; Laakso, Markku; Metspalu, Andres; Erbel, Raimund; Jöcke, Karl-Heinz; Moebus, Susanne; Ripatti, Samuli; Salomaa, Veikko; Ingelsson, Erik; Boehm, Bernhard O; Bergman, Richard N; Collins, Francis S; Mohlke, Karen L; Koistinen, Heikki; Tuomilehto, Jaakko; Hveem, Kristian; Njølstad, Inger; Deloukas, Panagiotis; Donnelly, Peter J; Frayling, Timothy M; Hattersley, Andrew T; de Faire, Ulf; Hamsten, Anders; Illig, Thomas; Peters, Annette; Cauchi, Stephane; Sladek, Rob; Froguel, Philippe; Hansen, Torben; Pedersen, Oluf; Morris, Andrew D; Palmer, Collin N A; Kathiresan, Sekar; Melander, Olle; Nilsson, Peter M; Groop, Leif C; Barroso, Inês; Langenberg, Claudia; Wareham, Nicholas J; O'Callaghan, Christopher A; Gloyn, Anna L; Altshuler, David; Boehnke, Michael; Teslovich, Tanya M; McCarthy, Mark I; Morris, Andrew P

    2015-12-01

    We performed fine mapping of 39 established type 2 diabetes (T2D) loci in 27,206 cases and 57,574 controls of European ancestry. We identified 49 distinct association signals at these loci, including five mapping in or near KCNQ1. 'Credible sets' of the variants most likely to drive each distinct signal mapped predominantly to noncoding sequence, implying that association with T2D is mediated through gene regulation. Credible set variants were enriched for overlap with FOXA2 chromatin immunoprecipitation binding sites in human islet and liver cells, including at MTNR1B, where fine mapping implicated rs10830963 as driving T2D association. We confirmed that the T2D risk allele for this SNP increases FOXA2-bound enhancer activity in islet- and liver-derived cells. We observed allele-specific differences in NEUROD1 binding in islet-derived cells, consistent with evidence that the T2D risk allele increases islet MTNR1B expression. Our study demonstrates how integration of genetic and genomic information can define molecular mechanisms through which variants underlying association signals exert their effects on disease.

  11. Difference in C3-C4 metabolism underlies tradeoff between growth rate and biomass yield in Methylobacterium extorquens AM1.

    PubMed

    Fu, Yanfen; Beck, David A C; Lidstrom, Mary E

    2016-07-19

    Two variants of Methylobacterium extorquens AM1 demonstrated a trade-off between growth rate and biomass yield. In addition, growth rate and biomass yield were also affected by supplementation of growth medium with different amounts of cobalt. The metabolism changes relating to these growth phenomena as well as the trade-off were investigated in this study. (13)C metabolic flux analysis was used to generate a detailed central carbon metabolic flux map with both absolute and normalized flux values. The major differences between the two variants occurred at the formate node as well as within C3-C4 inter-conversion pathways. Higher relative fluxes through formyltetrahydrofolate ligase, phosphoenolpyruvate carboxylase, and malic enzyme led to higher biomass yield, while higher relative fluxes through pyruvate kinase and pyruvate dehydrogenase led to higher growth rate. These results were then tested by phenotypic studies on three mutants (null pyk, null pck mutant and null dme mutant) in both variants, which agreed with the model prediction. In this study, (13)C metabolic flux analysis for two strain variants of M. extorquens AM1 successfully identified metabolic pathways contributing to the trade-off between cell growth and biomass yield. Phenotypic analysis of mutants deficient in corresponding genes supported the conclusion that C3-C4 inter-conversion strategies were the major response to the trade-off.

  12. Characterization of BRCA1 and BRCA2 variants in multi-ethnic Asian cohort from a Malaysian case-control study.

    PubMed

    Lai, Kah Nyin; Ho, Weang Kee; Kang, In Nee; Kang, Peter Choon Eng; Phuah, Sze Yee; Mariapun, Shivaani; Yip, Cheng-Har; Mohd Taib, Nur Aishah; Teo, Soo-Hwang

    2017-02-22

    Genetic testing for BRCA1 and BRCA2 has led to the accurate identification of individuals at higher risk of cancer and the development of new therapies. Approximately 10-20% of the genetic testing for BRCA1 and BRCA2 leads to the identification of variants of uncertain significance (VUS), with higher proportions in Asians. We investigated the functional significance of 7 BRCA1 and 25 BRCA2 variants in a multi-ethnic Asian cohort using a case-control approach. The MassARRAY genotyping was conducted in 1,394 Chinese, 406 Malay and 310 Indian breast cancer cases and 1,071 Chinese, 167 Malay and 255 Indian healthy controls. The association of individual variant with breast cancer risk was analyzed using logistic regression model adjusted for ethnicity, age and family history. Our study confirmed BRCA2 p.Ile3412Val is presented in >2% of unaffected women and is likely benign, and BRCA2 p.Ala1996Thr which is predicted to be likely pathogenic by in-silico models is presented in 2% of healthy Indian women suggesting that it may not be associated with breast cancer risk. Single-variant analysis suggests that BRCA1 p.Arg762Ser may be associated with breast cancer risk (OR = 7.4; 95% CI, 0.9-62.3; p = 0.06). Our study shows that BRCA2 p.Ile3412Val and p.Ala1996Thr are likely benign and highlights the need for population-specific studies to determine the likely functional significance of population-specific variants. Our study also suggests that BRCA1 p.Arg762Ser may be associated with increased risk of breast cancer but other methods or larger studies are required to determine a more precise estimate of breast cancer risk.

  13. Trans-ancestry Fine Mapping and Molecular Assays Identify Regulatory Variants at the ANGPTL8 HDL-C GWAS Locus

    PubMed Central

    Cannon, Maren E.; Duan, Qing; Wu, Ying; Zeynalzadeh, Monica; Xu, Zheng; Kangas, Antti J.; Soininen, Pasi; Ala-Korpela, Mika; Civelek, Mete; Lusis, Aldons J.; Kuusisto, Johanna; Collins, Francis S.; Boehnke, Michael; Tang, Hua; Laakso, Markku; Li, Yun; Mohlke, Karen L.

    2017-01-01

    Recent genome-wide association studies (GWAS) have identified variants associated with high-density lipoprotein cholesterol (HDL-C) located in or near the ANGPTL8 gene. Given the extensive sharing of GWAS loci across populations, we hypothesized that at least one shared variant at this locus affects HDL-C. The HDL-C–associated variants are coincident with expression quantitative trait loci for ANGPTL8 and DOCK6 in subcutaneous adipose tissue; however, only ANGPTL8 expression levels are associated with HDL-C levels. We identified a 400-bp promoter region of ANGPTL8 and enhancer regions within 5 kb that contribute to regulating expression in liver and adipose. To identify variants functionally responsible for the HDL-C association, we performed fine-mapping analyses and selected 13 candidate variants that overlap putative regulatory regions to test for allelic differences in regulatory function. Of these variants, rs12463177-G increased transcriptional activity (1.5-fold, P = 0.004) and showed differential protein binding. Six additional variants (rs17699089, rs200788077, rs56322906, rs3760782, rs737337, and rs3745683) showed evidence of allelic differences in transcriptional activity and/or protein binding. Taken together, these data suggest a regulatory mechanism at the ANGPTL8 HDL-C GWAS locus involving tissue-selective expression and at least one functional variant. PMID:28754724

  14. STORMSeq: An Open-Source, User-Friendly Pipeline for Processing Personal Genomics Data in the Cloud

    PubMed Central

    Karczewski, Konrad J.; Fernald, Guy Haskin; Martin, Alicia R.; Snyder, Michael; Tatonetti, Nicholas P.; Dudley, Joel T.

    2014-01-01

    The increasing public availability of personal complete genome sequencing data has ushered in an era of democratized genomics. However, read mapping and variant calling software is constantly improving and individuals with personal genomic data may prefer to customize and update their variant calls. Here, we describe STORMSeq (Scalable Tools for Open-Source Read Mapping), a graphical interface cloud computing solution that does not require a parallel computing environment or extensive technical experience. This customizable and modular system performs read mapping, read cleaning, and variant calling and annotation. At present, STORMSeq costs approximately $2 and 5–10 hours to process a full exome sequence and $30 and 3–8 days to process a whole genome sequence. We provide this open-access and open-source resource as a user-friendly interface in Amazon EC2. PMID:24454756

  15. DangerTrack: A scoring system to detect difficult-to-assess regions.

    PubMed

    Dolgalev, Igor; Sedlazeck, Fritz; Busby, Ben

    2017-01-01

    Over recent years, multiple groups have shown that a large number of structural variants, repeats, or problems with the underlying genome assembly have dramatic effects on the mapping, calling, and overall reliability of single nucleotide polymorphism calls. This project endeavored to develop an easy-to-use track for looking at structural variant and repeat regions. This track, DangerTrack, can be displayed alongside the existing Genome Reference Consortium assembly tracks to warn clinicians and biologists when variants of interest may be incorrectly called, of dubious quality, or on an insertion or copy number expansion. While mapping and variant calling can be automated, it is our opinion that when these regions are of interest to a particular clinical or research group, they warrant a careful examination, potentially involving localized reassembly. DangerTrack is available at https://github.com/DCGenomics/DangerTrack.

  16. Association and haplotype analysis of the insulin-degrading enzyme (IDE) gene, a strong positional and biological candidate for type 2 diabetes susceptibility.

    PubMed

    Groves, Christopher J; Wiltshire, Steven; Smedley, Damian; Owen, Katherine R; Frayling, Timothy M; Walker, Mark; Hitman, Graham A; Levy, Jonathan C; O'Rahilly, Stephen; Menzel, Stephan; Hattersley, Andrew T; McCarthy, Mark I

    2003-05-01

    The gene for insulin-degrading enzyme (IDE) represents a strong positional and biological candidate for type 2 diabetes susceptibility. IDE maps to chromosome 10q23.3, a region linked to diabetes in several populations; the rat homolog has been directly implicated in diabetes susceptibility; and known functions of IDE support an important role in glucose homeostasis. We sought evidence for association between IDE variation and diabetes by mutation screening, defining local haplotype structure, and genotyping variants delineating common haplotypic diversity. An initial case-control analysis (628 diabetic probands from multiplex sibships and 604 control subjects) found no haplotypic associations, although one variant (IDE2, -179T-->C) showed modest association with diabetes (odds ratio [OR]1.25, P = 0.03). Linkage partitioning analyses failed to support this association, but provided borderline evidence for a different variant (IDE10, IVS20-405A-->G) (P = 0.06). Neither variant was associated with diabetes when replication was sought in 377 early onset diabetic subjects and 825 control subjects, though combined analysis of all typed cohorts indicated a nominally significant effect at IDE2 (OR 1.21 [1.04-1.40], P = 0.013). In the absence of convincing support for this association from linkage partitioning or analyses of continuous measures of glycemia, we conclude that analysis of over 2,400 samples provides no compelling evidence that variation in IDE contributes to diabetes susceptibility in humans.

  17. Exploring the molecular mechanisms of Traditional Chinese Medicine components using gene expression signatures and connectivity map.

    PubMed

    Yoo, Minjae; Shin, Jimin; Kim, Hyunmin; Kim, Jihye; Kang, Jaewoo; Tan, Aik Choon

    2018-04-04

    Traditional Chinese Medicine (TCM) has been practiced over thousands of years in China and other Asian countries for treating various symptoms and diseases. However, the underlying molecular mechanisms of TCM are poorly understood, partly due to the "multi-component, multi-target" nature of TCM. To uncover the molecular mechanisms of TCM, we perform comprehensive gene expression analysis using connectivity map. We interrogated gene expression signatures obtained 102 TCM components using the next generation Connectivity Map (CMap) resource. We performed systematic data mining and analysis on the mechanism of action (MoA) of these TCM components based on the CMap results. We clustered the 102 TCM components into four groups based on their MoAs using next generation CMap resource. We performed gene set enrichment analysis on these components to provide additional supports for explaining these molecular mechanisms. We also provided literature evidence to validate the MoAs identified through this bioinformatics analysis. Finally, we developed the Traditional Chinese Medicine Drug Repurposing Hub (TCM Hub) - a connectivity map resource to facilitate the elucidation of TCM MoA for drug repurposing research. TCMHub is freely available in http://tanlab.ucdenver.edu/TCMHub. Molecular mechanisms of TCM could be uncovered by using gene expression signatures and connectivity map. Through this analysis, we identified many of the TCM components possess diverse MoAs, this may explain the applications of TCM in treating various symptoms and diseases. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  18. Multi-scale statistical analysis of coronal solar activity

    DOE PAGES

    Gamborino, Diana; del-Castillo-Negrete, Diego; Martinell, Julio J.

    2016-07-08

    Multi-filter images from the solar corona are used to obtain temperature maps that are analyzed using techniques based on proper orthogonal decomposition (POD) in order to extract dynamical and structural information at various scales. Exploring active regions before and after a solar flare and comparing them with quiet regions, we show that the multi-scale behavior presents distinct statistical properties for each case that can be used to characterize the level of activity in a region. Information about the nature of heat transport is also to be extracted from the analysis.

  19. Chaotic attractors of relaxation oscillators

    NASA Astrophysics Data System (ADS)

    Guckenheimer, John; Wechselberger, Martin; Young, Lai-Sang

    2006-03-01

    We develop a general technique for proving the existence of chaotic attractors for three-dimensional vector fields with two time scales. Our results connect two important areas of dynamical systems: the theory of chaotic attractors for discrete two-dimensional Henon-like maps and geometric singular perturbation theory. Two-dimensional Henon-like maps are diffeomorphisms that limit on non-invertible one-dimensional maps. Wang and Young formulated hypotheses that suffice to prove the existence of chaotic attractors in these families. Three-dimensional singularly perturbed vector fields have return maps that are also two-dimensional diffeomorphisms limiting on one-dimensional maps. We describe a generic mechanism that produces folds in these return maps and demonstrate that the Wang-Young hypotheses are satisfied. Our analysis requires a careful study of the convergence of the return maps to their singular limits in the Ck topology for k >= 3. The theoretical results are illustrated with a numerical study of a variant of the forced van der Pol oscillator.

  20. In silico analysis of SIGMAR1 variant (rs4879809) segregating in a consanguineous Pakistani family showing amyotrophic lateral sclerosis without frontotemporal lobar dementia.

    PubMed

    Ullah, Muhammad Ikram; Ahmad, Arsalan; Raza, Syed Irfan; Amar, Ali; Ali, Amjad; Bhatti, Attya; John, Peter; Mohyuddin, Aisha; Ahmad, Wasim; Hassan, Muhammad Jawad

    2015-10-01

    Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disorder affecting upper motor neurons in the brain and lower motor neurons in the brain stem and spinal cord, resulting in fatal paralysis. It has been found to be associated with frontotemporal lobar degeneration (FTLD). In the present study, we have described homozygosity mapping and gene sequencing in a consanguineous autosomal recessive Pakistani family showing non-juvenile ALS without signs of FTLD. Gene mapping was carried out in all recruited family members using microsatellite markers, and linkage was established with sigma non-opioid intracellular receptor 1 (SIGMAR1) gene at chromosome 9p13.2. Gene sequencing of SIGMAR1 revealed a novel 3'-UTR nucleotide variation c.672*31A>G (rs4879809) segregating with disease in this family. The C9ORF72 repeat region in intron 1, previously implicated in a related phenotype, was excluded through linkage, and further confirmation of exclusion was obtained by amplifying intron 1 of C9ORF72 with multiple primers in affected individuals and controls. In silico analysis was carried out to explore the possible role of 3'-UTR variant of SIGMAR1 in ALS. The Regulatory RNA motif and Element Finder program revealed disturbance in miRNA (hsa-miR-1205) binding site due to this variation. ESEFinder analysis showed new SRSF1 and SRSF1-IgM-BRCA1 binding sites with significant scores due to this variation. Our results indicate that the 3'-UTR SIGMAR1 variant c.672*31A>G may have a role in the pathogenesis of ALS in this family.

  1. Rapid Determination of Mineral Abundance by X-ray Microfluorescence Mapping and Multispectral Image Analysis

    NASA Astrophysics Data System (ADS)

    Moscati, R. J.; Marshall, B. D.

    2005-12-01

    X-ray microfluorescence (XRMF) spectrometry is a rapid, accurate technique to map element abundances of rock surfaces (such as thin-section billets, the block remaining when a thin section is prepared). Scanning a specimen with a collimated primary X-ray beam (100 μm diameter) generates characteristic secondary X-rays that yield the relative chemical abundances for the major rock-/mineral-forming analytes (such as Si, Al, K, Ca, and Fe). When Cu-rich epoxy is used to impregnate billets, XRMF also can determine porosity from the Cu abundance. Common billet scan size is 30 x 15 mm and the typical mapping time rarely exceeds 2.5 hrs (much faster than traditional point-counting). No polishing or coating is required for the billets, although removing coarse striations or gross irregularities on billet surfaces should improve the spatial accuracy of the maps. Background counts, spectral artifacts, and diffraction peaks typically are inconsequential for maps of major elements. An operational check is performed after every 10 analyses on a standard that contains precisely measured areas of Mn and Mo. Reproducibility of the calculated area ratio of Mn:Mo is consistently within 5% of the known value. For each billet, the single element maps (TIFF files) generated by XRMF are imported into MultiSpec© (a program developed at Purdue University for analysis of multispectral image data, available from http://dynamo.ecn.purdue.edu/~biehl/MultiSpec/) where mineral phases can be spectrally identified and their relative abundances quantified. The element maps for each billet are layered to produce a multi-element file for mineral classification and statistical processing, including modal estimates of mineral abundance. Although mineral identification is possible even if the mineralogy is unknown, prior petrographic examination of the corresponding thin section yields more accurate maps because the software can be set to identify all similar pixels. Caution is needed when using MultiSpec© to distinguish mineral phases with similar chemistry (for example, opal and quartz) and minerals that occupy very small surface areas (<10 pixels). In either case, careful petrography and informed use of the software will allow rapid use of MultiSpec© to create accurate mineral maps of rock and thin-section billet surfaces. This technique, for example, has allowed quantitative estimates of calcite and silica abundances to be determined on about 200 samples of secondary mineral coatings from the unsaturated zone at Yucca Mountain, Nevada.

  2. NaviCom: a web application to create interactive molecular network portraits using multi-level omics data.

    PubMed

    Dorel, Mathurin; Viara, Eric; Barillot, Emmanuel; Zinovyev, Andrei; Kuperstein, Inna

    2017-01-01

    Human diseases such as cancer are routinely characterized by high-throughput molecular technologies, and multi-level omics data are accumulated in public databases at increasing rate. Retrieval and visualization of these data in the context of molecular network maps can provide insights into the pattern of regulation of molecular functions reflected by an omics profile. In order to make this task easy, we developed NaviCom, a Python package and web platform for visualization of multi-level omics data on top of biological network maps. NaviCom is bridging the gap between cBioPortal, the most used resource of large-scale cancer omics data and NaviCell, a data visualization web service that contains several molecular network map collections. NaviCom proposes several standardized modes of data display on top of molecular network maps, allowing addressing specific biological questions. We illustrate how users can easily create interactive network-based cancer molecular portraits via NaviCom web interface using the maps of Atlas of Cancer Signalling Network (ACSN) and other maps. Analysis of these molecular portraits can help in formulating a scientific hypothesis on the molecular mechanisms deregulated in the studied disease. NaviCom is available at https://navicom.curie.fr. © The Author(s) 2017. Published by Oxford University Press.

  3. Divergent biophysical properties, gating mechanisms, and possible functions of the two skeletal muscle Ca(V)1.1 calcium channel splice variants.

    PubMed

    Tuluc, Petronel; Flucher, Bernhard E

    2011-12-01

    Voltage-gated calcium channels are multi-subunit protein complexes that specifically allow calcium ions to enter the cell in response to membrane depolarization. But, for many years it seemed that the skeletal muscle calcium channel Ca(V)1.1 is the exception. The classical splice variant Ca(V)1.1a activates slowly, has a very small current amplitude and poor voltage sensitivity. In fact adult muscle fibers work perfectly well even in the absence of calcium influx. Recently a new splice variant of the skeletal muscle calcium channel Ca(V)1.1e has been characterized. The lack of the 19 amino acid exon 29 in this splice variant results in a rapidly activating calcium channel with high current amplitude and good voltage sensitivity. Ca(V)1.1e is the dominant channel in embryonic muscle, where the expression of this high calcium-conducting Ca(V)1.1 isoform readily explains developmental processes depending on L-type calcium currents. Moreover, the availability of these two structurally similar but functionally distinct channel variants facilitates the analysis of the molecular mechanisms underlying the unique current properties of the classical Ca(V)1.1a channel.

  4. Mapping of land cover in northern California with simulated hyperspectral satellite imagery

    NASA Astrophysics Data System (ADS)

    Clark, Matthew L.; Kilham, Nina E.

    2016-09-01

    Land-cover maps are important science products needed for natural resource and ecosystem service management, biodiversity conservation planning, and assessing human-induced and natural drivers of land change. Analysis of hyperspectral, or imaging spectrometer, imagery has shown an impressive capacity to map a wide range of natural and anthropogenic land cover. Applications have been mostly with single-date imagery from relatively small spatial extents. Future hyperspectral satellites will provide imagery at greater spatial and temporal scales, and there is a need to assess techniques for mapping land cover with these data. Here we used simulated multi-temporal HyspIRI satellite imagery over a 30,000 km2 area in the San Francisco Bay Area, California to assess its capabilities for mapping classes defined by the international Land Cover Classification System (LCCS). We employed a mapping methodology and analysis framework that is applicable to regional and global scales. We used the Random Forests classifier with three sets of predictor variables (reflectance, MNF, hyperspectral metrics), two temporal resolutions (summer, spring-summer-fall), two sample scales (pixel, polygon) and two levels of classification complexity (12, 20 classes). Hyperspectral metrics provided a 16.4-21.8% and 3.1-6.7% increase in overall accuracy relative to MNF and reflectance bands, respectively, depending on pixel or polygon scales of analysis. Multi-temporal metrics improved overall accuracy by 0.9-3.1% over summer metrics, yet increases were only significant at the pixel scale of analysis. Overall accuracy at pixel scales was 72.2% (Kappa 0.70) with three seasons of metrics. Anthropogenic and homogenous natural vegetation classes had relatively high confidence and producer and user accuracies were over 70%; in comparison, woodland and forest classes had considerable confusion. We next focused on plant functional types with relatively pure spectra by removing open-canopy shrublands, woodlands and mixed forests from the classification. This 12-class map had significantly improved accuracy of 85.1% (Kappa 0.83) and most classes had over 70% producer and user accuracies. Finally, we summarized important metrics from the multi-temporal Random Forests to infer the underlying chemical and structural properties that best discriminated our land-cover classes across seasons.

  5. Adaptation of tick-borne encephalitis virus from human brain to different cell cultures induces multiple genomic substitutions.

    PubMed

    Ponomareva, Eugenia P; Ternovoi, Vladimir A; Mikryukova, Tamara P; Protopopova, Elena V; Gladysheva, Anastasia V; Shvalov, Alexander N; Konovalova, Svetlana N; Chausov, Eugene V; Loktev, Valery B

    2017-10-01

    The C11-13 strain from the Siberian subtype of tick-borne encephalitis virus (TBEV) was isolated from human brain using pig embryo kidney (PEK), 293, and Neuro-2a cells. Analysis of the complete viral genome of the C11-13 variants during six passages in these cells revealed that the cell-adapted C11-13 variants had multiple amino acid substitutions as compared to TBEV from human brain. Seven out of eight amino acids substitutions in the high-replicating C11-13(PEK) variant mapped to non-structural proteins; 13 out of 14 substitutions in the well-replicating C11-13(293) variant, and all four substitutions in the low-replicating C11-13(Neuro-2a) variant were also localized in non-structural proteins, predominantly in the NS2a (2), NS3 (6) and NS5 (3) proteins. The substitutions NS2a 1067 (Asn → Asp), NS2a 1168 (Leu → Val) in the N-terminus of NS2a and NS3 1745 (His → Gln) in the helicase domain of NS3 were found in all selected variants. We postulate that multiple substitutions in the NS2a, NS3 and NS5 genes play a key role in adaptation of TBEV to different cells.

  6. A 16 kb naturally occurring genomic deletion including mce and PPE genes in Mycobacterium avium subspecies paratuberculosis isolates from goats with Johne's disease.

    PubMed

    Castellanos, Elena; Aranaz, Alicia; de Juan, Lucia; Dominguez, Lucas; Linedale, Richard; Bull, Tim J

    2012-09-14

    In this study we characterise the genomic and transcriptomic variability of a natural deletion strain of Mycobacterium avium subspecies paratuberculosis (MAP) prevalent in Spanish Guadarrama goats. Using a pan-genome microarray including MAP and M. avium subspecies hominissuis 104 genomes (MAPAC) we demonstrate the genotype to be MAP Type II with a single deletion of 19 contiguous ORFs (16 kb) including a complete mammalian cell entry (mce7_1) operon and adjacent proline-glutamic acid (PE)/proline-proline-glutamic acid (PPE) genes. A deletion specific PCR test was developed and a subsequent screening identified four goat herds infected with the variant strain. Each was located in central Spain and showed epidemiological links suggestive of transmission between herds. A majority of animals infected with the variant manifested a paucibacillary form of the disease. Comparisons between virulent complete genome compliment strains isolated from multibacillary diseased goats and the MAP variant strain during entry into activated macrophages demonstrated an increased sensitivity in the variant to intracellular killing in human and ovine macrophages. As PPE and mce genes are associated with mycobacterial virulence and pathogenesis we investigated the interplay of these gene sets during cell entry using the MAPAC array. This showed significant differential transcriptome profiles compared to full genome complement MAP controls that included changes in other undeleted mce operons and PE/PPE genes, esx-like signalling operons and stress response/fatty acid metabolism pathways. This strain represents the first report of a MAP Type II genotype with significant natural genomic deletions which remains able to cause disease and is transmissible in goats. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. A Functional ATG16L1 (T300A) Variant is Associated with Necrotizing Enterocolitis in Premature Infants

    PubMed Central

    Sampath, Venkatesh; Bhandari, Vineet; Berger, Jessica; Merchant, Daniel; Zhang, Liyun; Ladd, Mihoko; Menden, Heather; Garland, Jeffery; Ambalavanan, Namasivayam; Mulrooney, Neil; Quasney, Michael; Dagle, John; Lavoie, Pascal M; Simpson, Pippa; Dahmer, Mary

    2017-01-01

    Background The genetic basis of dysfunctional immune responses in necrotizing enterocolitis (NEC) remains unknown. We hypothesized that variants in Nucleotide binding and Oligomerization Domain (NOD)-Like Receptors (NLRs) and Autophagy (ATG) genes modulate vulnerability to NEC. Methods We genotyped a multi-center cohort of premature infants with and without NEC for NOD1, NOD2, ATG16L1, CARD8 and NLRP3 variants. Chi-square tests and logistic regression were used for statistical analysis. Results In our primary cohort (n=1015), 86 (8.5%) infants developed NEC. The A allele of the ATG16L1 (Thr300Ala) variant was associated with increased NEC (AA vs. AG vs. GG; 11.3% vs. 8.4% vs. 4.8%, p=0.009). In regression models for NEC that adjusted for epidemiological confounders, GA (p=0.033) and the AA genotype (p=0.038) of ATG16L1 variant were associated with NEC. The association between the A allele of the ATG16L1 variant and NEC remained significant among Caucasian infants (p=0.02). In a replication cohort (n=259), NEC rates were highest among infants with the AA genotype but did not reach statistical significance. Conclusion We report a novel association between a hypomorphic variant in an autophagy gene (ATG16L1) and NEC in premature infants. Our data suggest that decreased autophagy arising from genetic variants may confer protection against NEC. PMID:27893720

  8. Protein side chain rotational isomerization: A minimum perturbation mapping study

    NASA Astrophysics Data System (ADS)

    Haydock, Christopher

    1993-05-01

    A theory of the rotational isomerization of the indole side chain of tryptophan-47 of variant-3 scorpion neurotoxin is presented. The isomerization potential energy, entropic part of the isomerization free energy, isomer probabilities, transition state theory reaction rates, and indole order parameters are calculated from a minimum perturbation mapping over tryptophan-47 χ1×χ2 torsion space. A new method for calculating the fluorescence anisotropy from molecular dynamics simulations is proposed. The method is based on an expansion that separates transition dipole orientation from chromophore dynamics. The minimum perturbation potential energy map is inverted and applied as a bias potential for a 100 ns umbrella sampling simulation. The entropic part of the isomerization free energy as calculated by minimum perturbation mapping and umbrella sampling are in fairly close agreement. Throughout, the approximation is made that two glutamine and three tyrosine side chains neighboring tryptophan-47 are truncated at the Cβ atom. Comparison with the previous combination thermodynamic perturbation and umbrella sampling study suggests that this truncated neighbor side chain approximation leads to at least a qualitatively correct theory of tryptophan-47 rotational isomerization in the wild type variant-3 scorpion neurotoxin. Analysis of van der Waals interactions in a transition state region indicates that for the simulation of barrier crossing trajectories a linear combination of three specially defined dihedral angles will be superior to a simple side chain dihedral reaction coordinate.

  9. Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases

    PubMed Central

    Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.

    2012-01-01

    The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524

  10. A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome

    USDA-ARS?s Scientific Manuscript database

    Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a mor...

  11. RadMAP: The Radiological Multi-sensor Analysis Platform

    NASA Astrophysics Data System (ADS)

    Bandstra, Mark S.; Aucott, Timothy J.; Brubaker, Erik; Chivers, Daniel H.; Cooper, Reynold J.; Curtis, Joseph C.; Davis, John R.; Joshi, Tenzing H.; Kua, John; Meyer, Ross; Negut, Victor; Quinlan, Michael; Quiter, Brian J.; Srinivasan, Shreyas; Zakhor, Avideh; Zhang, Richard; Vetter, Kai

    2016-12-01

    The variability of gamma-ray and neutron background during the operation of a mobile detector system greatly limits the ability of the system to detect weak radiological and nuclear threats. The natural radiation background measured by a mobile detector system is the result of many factors, including the radioactivity of nearby materials, the geometric configuration of those materials and the system, the presence of absorbing materials, and atmospheric conditions. Background variations tend to be highly non-Poissonian, making it difficult to set robust detection thresholds using knowledge of the mean background rate alone. The Radiological Multi-sensor Analysis Platform (RadMAP) system is designed to allow the systematic study of natural radiological background variations and to serve as a development platform for emerging concepts in mobile radiation detection and imaging. To do this, RadMAP has been used to acquire extensive, systematic background measurements and correlated contextual data that can be used to test algorithms and detector modalities at low false alarm rates. By combining gamma-ray and neutron detector systems with data from contextual sensors, the system enables the fusion of data from multiple sensors into novel data products. The data are curated in a common format that allows for rapid querying across all sensors, creating detailed multi-sensor datasets that are used to study correlations between radiological and contextual data, and develop and test novel techniques in mobile detection and imaging. In this paper we will describe the instruments that comprise the RadMAP system, the effort to curate and provide access to multi-sensor data, and some initial results on the fusion of contextual and radiological data.

  12. Common variants in Mendelian kidney disease genes and their association with renal function.

    PubMed

    Parsa, Afshin; Fuchsberger, Christian; Köttgen, Anna; O'Seaghdha, Conall M; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M; Borecki, Ingrid; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Bochud, Murielle; Heid, Iris M; Siscovick, David S; Fox, Caroline S; Kao, W Linda; Böger, Carsten A

    2013-12-01

    Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research.

  13. Genotypic variants at 2q33 and risk of esophageal squamous cell carcinoma in China: a meta-analysis of genome-wide association studies

    PubMed Central

    Abnet, Christian C.; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D.; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M.; Liao, Linda M.; Lee, Maxwell P.; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C.; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B.; Giffen, Carol A.; Burdett, Laurie; Fraumeni, Joseph F.; Tucker, Margaret A.; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M.; Yuan, Zhi-Qing; Chanock, Stephen J.; Zhang, Xue-Jun; Taylor, Philip R.; Wang, Li-Dong

    2012-01-01

    Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10−8, and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19–1.40) and P= 7.63 × 10−10. An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants. PMID:22323360

  14. Genotypic variants at 2q33 and risk of esophageal squamous cell carcinoma in China: a meta-analysis of genome-wide association studies.

    PubMed

    Abnet, Christian C; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M; Liao, Linda M; Lee, Maxwell P; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B; Giffen, Carol A; Burdett, Laurie; Fraumeni, Joseph F; Tucker, Margaret A; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M; Yuan, Zhi-Qing; Chanock, Stephen J; Zhang, Xue-Jun; Taylor, Philip R; Wang, Li-Dong

    2012-05-01

    Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10(-8), and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19-1.40) and P= 7.63 × 10(-10). An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants.

  15. Missense variants in hMLH1 identified in patients from the German HNPCC consortium and functional studies.

    PubMed

    Hardt, Karin; Heick, Sven Boris; Betz, Beate; Goecke, Timm; Yazdanparast, Haniyeh; Küppers, Robin; Servan, Kati; Steinke, Verena; Rahner, Nils; Morak, Monika; Holinski-Feder, Elke; Engel, Christoph; Möslein, Gabriela; Schackert, Hans-Konrad; von Knebel Doeberitz, Magnus; Pox, Christian; Hegemann, Johannes H; Royer-Pokora, Brigitte

    2011-06-01

    Missense mutations of the DNA mismatch repair gene MLH1 are found in a significant fraction of patients with Lynch syndrome (hereditary nonpolyposis colorectal cancer, HNPCC) and their pathogenicity often remains unclear. We report here all 88 MLH1 missense variants identified in families from the German HNPCC consortium with clinical details of these patients/families. We investigated 23 MLH1 missense variants by two functional in vivo assays in yeast; seven map to the ATPase and 16 to the protein interaction domain. In the yeast-2-hybrid (Y2H) assay three variants in the ATPase and twelve variants in the interaction domain showed no or a reduced interaction with PMS2; seven showed a normal and one a significantly higher interaction. Using the Lys2A (14) reporter system to study the dominant negative mutator effect (DNE), 16 variants showed no or a low mutator effect, suggesting that these are nonfunctional, three were intermediate and four wild type in this assay. The DNE and Y2H results were concordant for all variants in the interaction domain, whereas slightly divergent results were obtained for variants in the ATPase domain. Analysis of the stability of the missense proteins in yeast and human embryonic kidney cells (293T) revealed a very low expression for seven of the variants in yeast and for nine in human cells. In total 15 variants were classified as deleterious, five were classified as variants of unclassified significance (VUS) and three were basically normal in the functional assays, P603R, K618R, Q689R, suggesting that these are neutral.

  16. Assembly and diploid architecture of an individual human genome via single-molecule technologies

    PubMed Central

    Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

    2015-01-01

    We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality. PMID:26121404

  17. Assembly and diploid architecture of an individual human genome via single-molecule technologies.

    PubMed

    Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

    2015-08-01

    We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

  18. Cloning and characterization of human immunodeficiency virus type 1 variants diminished in the ability to induce syncytium-independent cytolysis.

    PubMed Central

    Stevenson, M; Haggerty, S; Lamonica, C; Mann, A M; Meier, C; Wasiak, A

    1990-01-01

    The phenomenon of interference was exploited to isolate low-abundance noncytopathic human immunodeficiency virus type 1 (HIV-1) variants from a primary HIV-1 isolate from an asymptomatic HIV-1-seropositive hemophiliac. Successive rounds of virus infection of a cytolysis-susceptible CD4+ cell line and isolation of surviving cells resulted in selective amplification of an HIV-1 variant reduced in the ability to induce cytolysis. The presence of a PvuII polymorphism facilitated subsequent amplification and cloning of cytopathic and noncytopathic HIV-1 variants from the primary isolate. Cloned virus stocks from cytopathic and noncytopathic variants exhibited similar replication kinetics, infectivity, and syncytium induction in susceptible host cells. The noncytopathic HIV-1 variant was unable, however, to induce single-cell killing in susceptible host cells. Construction of viral hybrids in which regions of cytopathic and noncytopathic variants were exchanged indicated that determinants for the noncytopathic phenotype map to the envelope glycoprotein. Sequence analysis of the envelope coding regions indicated the absence of two highly conserved N-linked glycosylation sites in the noncytopathic HIV-1 variant, which accompanied differences in processing of precursor gp160 envelope glycoprotein. These results demonstrate that determinants for syncytium-independent single-cell killing are located within the envelope glycoprotein and suggest that single-cell killing is profoundly influenced by alterations in envelope sequence which affect posttranslational processing of HIV-1 envelope glycoprotein within the infected cell. Images PMID:1695254

  19. Fine-mapping of the HNF1B multicancer locus identifies candidate variants that mediate endometrial cancer risk.

    PubMed

    Painter, Jodie N; O'Mara, Tracy A; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S; Kaufmann, Susanne; Hillman, Kristine M; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W; Webb, Penelope M; Scott, Rodney J; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G; Nyholt, Dale R; Henders, Anjali K; Fasching, Peter A; Hein, Alexander; Beckmann, Matthias W; Renner, Stefan P; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C; Goode, Ellen L; Teoman, Attila; Salvesen, Helga B; Trovik, Jone; Njolstad, Tormund S; Werner, Henrica M J; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L; Southey, Melissa C; Ekici, Arif B; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J; Olson, Janet E; Giles, Graham G; Bruinsma, Fiona; Cunningham, Julie M; Fridley, Brooke L; Børresen-Dale, Anne-Lise; Kristensen, Vessela N; Cox, Angela; Swerdlow, Anthony J; Orr, Nicholas; Bolla, Manjeet K; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D; Pharoah, Paul D P; Dunning, Alison M; Tomlinson, Ian; Easton, Douglas F; Edwards, Stacey L; Thompson, Deborah J; Spurdle, Amanda B

    2015-03-01

    Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10(-14), odds ratio = 0.86, 95% confidence interval = 0.82-0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  20. Fine-mapping of the HNF1B multicancer locus identifies candidate variants that mediate endometrial cancer risk

    PubMed Central

    Painter, Jodie N.; O'Mara, Tracy A.; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A.; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P.; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S.; Kaufmann, Susanne; Hillman, Kristine M.; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma. Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R.; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W.; Webb, Penelope M.; Scott, Rodney J.; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G.; Nyholt, Dale R.; Henders, Anjali K.; Fasching, Peter A.; Hein, Alexander; Beckmann, Matthias W.; Renner, Stefan P.; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C.; Goode, Ellen L.; Teoman, Attila; Salvesen, Helga B.; Trovik, Jone; Njolstad, Tormund S.; Werner, Henrica M.J.; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L.; Southey, Melissa C.; Ekici, Arif B.; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K.; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; Bruinsma, Fiona; Cunningham, Julie M.; Fridley, Brooke L.; Børresen-Dale, Anne-Lise; Kristensen, Vessela N.; Cox, Angela; Swerdlow, Anthony J.; Orr, Nicholas; Bolla, Manjeet K.; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D.; Pharoah, Paul D.P.; Dunning, Alison M.; Tomlinson, Ian; Easton, Douglas F.; Edwards, Stacey L.; Thompson, Deborah J.; Spurdle, Amanda B.

    2015-01-01

    Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10−14, odds ratio = 0.86, 95% confidence interval = 0.82–0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. PMID:25378557

  1. Linear Subpixel Learning Algorithm for Land Cover Classification from WELD using High Performance Computing

    NASA Technical Reports Server (NTRS)

    Kumar, Uttam; Nemani, Ramakrishna R.; Ganguly, Sangram; Kalia, Subodh; Michaelis, Andrew

    2017-01-01

    In this work, we use a Fully Constrained Least Squares Subpixel Learning Algorithm to unmix global WELD (Web Enabled Landsat Data) to obtain fractions or abundances of substrate (S), vegetation (V) and dark objects (D) classes. Because of the sheer nature of data and compute needs, we leveraged the NASA Earth Exchange (NEX) high performance computing architecture to optimize and scale our algorithm for large-scale processing. Subsequently, the S-V-D abundance maps were characterized into 4 classes namely, forest, farmland, water and urban areas (with NPP-VIIRS-national polar orbiting partnership visible infrared imaging radiometer suite nighttime lights data) over California, USA using Random Forest classifier. Validation of these land cover maps with NLCD (National Land Cover Database) 2011 products and NAFD (North American Forest Dynamics) static forest cover maps showed that an overall classification accuracy of over 91 percent was achieved, which is a 6 percent improvement in unmixing based classification relative to per-pixel-based classification. As such, abundance maps continue to offer an useful alternative to high-spatial resolution data derived classification maps for forest inventory analysis, multi-class mapping for eco-climatic models and applications, fast multi-temporal trend analysis and for societal and policy-relevant applications needed at the watershed scale.

  2. Linear Subpixel Learning Algorithm for Land Cover Classification from WELD using High Performance Computing

    NASA Astrophysics Data System (ADS)

    Ganguly, S.; Kumar, U.; Nemani, R. R.; Kalia, S.; Michaelis, A.

    2017-12-01

    In this work, we use a Fully Constrained Least Squares Subpixel Learning Algorithm to unmix global WELD (Web Enabled Landsat Data) to obtain fractions or abundances of substrate (S), vegetation (V) and dark objects (D) classes. Because of the sheer nature of data and compute needs, we leveraged the NASA Earth Exchange (NEX) high performance computing architecture to optimize and scale our algorithm for large-scale processing. Subsequently, the S-V-D abundance maps were characterized into 4 classes namely, forest, farmland, water and urban areas (with NPP-VIIRS - national polar orbiting partnership visible infrared imaging radiometer suite nighttime lights data) over California, USA using Random Forest classifier. Validation of these land cover maps with NLCD (National Land Cover Database) 2011 products and NAFD (North American Forest Dynamics) static forest cover maps showed that an overall classification accuracy of over 91% was achieved, which is a 6% improvement in unmixing based classification relative to per-pixel based classification. As such, abundance maps continue to offer an useful alternative to high-spatial resolution data derived classification maps for forest inventory analysis, multi-class mapping for eco-climatic models and applications, fast multi-temporal trend analysis and for societal and policy-relevant applications needed at the watershed scale.

  3. Evaluating the Effectiveness of Flood Control Strategies in Contrasting Urban Watersheds and Implications for Houston's Future Flood Vulnerability

    NASA Astrophysics Data System (ADS)

    Ganguly, S.; Kumar, U.; Nemani, R. R.; Kalia, S.; Michaelis, A.

    2016-12-01

    In this work, we use a Fully Constrained Least Squares Subpixel Learning Algorithm to unmix global WELD (Web Enabled Landsat Data) to obtain fractions or abundances of substrate (S), vegetation (V) and dark objects (D) classes. Because of the sheer nature of data and compute needs, we leveraged the NASA Earth Exchange (NEX) high performance computing architecture to optimize and scale our algorithm for large-scale processing. Subsequently, the S-V-D abundance maps were characterized into 4 classes namely, forest, farmland, water and urban areas (with NPP-VIIRS - national polar orbiting partnership visible infrared imaging radiometer suite nighttime lights data) over California, USA using Random Forest classifier. Validation of these land cover maps with NLCD (National Land Cover Database) 2011 products and NAFD (North American Forest Dynamics) static forest cover maps showed that an overall classification accuracy of over 91% was achieved, which is a 6% improvement in unmixing based classification relative to per-pixel based classification. As such, abundance maps continue to offer an useful alternative to high-spatial resolution data derived classification maps for forest inventory analysis, multi-class mapping for eco-climatic models and applications, fast multi-temporal trend analysis and for societal and policy-relevant applications needed at the watershed scale.

  4. Generating Multi-Destination Maps.

    PubMed

    Zhang, Junsong; Fan, Jiepeng; Luo, Zhenshan

    2017-08-01

    Multi-destination maps are a kind of navigation maps aimed to guide visitors to multiple destinations within a region, which can be of great help to urban visitors. However, they have not been developed in the current online map service. To address this issue, we introduce a novel layout model designed especially for generating multi-destination maps, which considers the global and local layout of a multi-destination map. We model the layout problem as a graph drawing that satisfies a set of hard and soft constraints. In the global layout phase, we balance the scale factor between ROIs. In the local layout phase, we make all edges have good visibility and optimize the map layout to preserve the relative length and angle of roads. We also propose a perturbation-based optimization method to find an optimal layout in the complex solution space. The multi-destination maps generated by our system are potential feasible on the modern mobile devices and our result can show an overview and a detail view of the whole map at the same time. In addition, we perform a user study to evaluate the effectiveness of our method, and the results prove that the multi-destination maps achieve our goals well.

  5. Multi-template tensor-based morphometry: Application to analysis of Alzheimer's disease

    PubMed Central

    Koikkalainen, Juha; Lötjönen, Jyrki; Thurfjell, Lennart; Rueckert, Daniel; Waldemar, Gunhild; Soininen, Hilkka

    2012-01-01

    In this paper methods for using multiple templates in tensor-based morphometry (TBM) are presented and comparedtothe conventional single-template approach. TBM analysis requires non-rigid registrations which are often subject to registration errors. When using multiple templates and, therefore, multiple registrations, it can be assumed that the registration errors are averaged and eventually compensated. Four different methods are proposed for multi-template TBM. The methods were evaluated using magnetic resonance (MR) images of healthy controls, patients with stable or progressive mild cognitive impairment (MCI), and patients with Alzheimer's disease (AD) from the ADNI database (N=772). The performance of TBM features in classifying images was evaluated both quantitatively and qualitatively. Classification results show that the multi-template methods are statistically significantly better than the single-template method. The overall classification accuracy was 86.0% for the classification of control and AD subjects, and 72.1%for the classification of stable and progressive MCI subjects. The statistical group-level difference maps produced using multi-template TBM were smoother, formed larger continuous regions, and had larger t-values than the maps obtained with single-template TBM. PMID:21419228

  6. An evaluation of the utility of ERTS-1 data for mapping and developing natural resources of Iran

    NASA Technical Reports Server (NTRS)

    Ebtehadj, K. (Principal Investigator)

    1973-01-01

    The author has identified the following significant results. Significant results are reported in the creation of an Iranian photomosaic from ERTS-1 imagery; in tectonic and structural mapping and interpretation, including discovery of significant new fault patterns in Iran; in river and lake mapping; in wetlands and fisheries nursery delineation and mapping; in range and agricultural surveys and inventories using multi-stage sample methods; and in the computer analysis of ERTS-1 digital tapes for urban land use.

  7. QTL mapping of fruit mineral contents provides new chances for molecular breeding of tomato nutritional traits.

    PubMed

    Capel, Carmen; Yuste-Lisbona, Fernando J; López-Casado, Gloria; Angosto, Trinidad; Heredia, Antonio; Cuartero, Jesús; Fernández-Muñoz, Rafael; Lozano, Rafael; Capel, Juan

    2017-05-01

    Agronomical characterization of a RIL population for fruit mineral contents allowed for the identification of QTL controlling these fruit quality traits, flanked by co-dominant markers useful for marker-assisted breeding. Tomato quality is a multi-variant attribute directly depending on fruit chemical composition, which in turn determines the benefits of tomato consumption for human health. Commercially available tomato varieties possess limited variability in fruit quality traits. Wild species, such as Solanum pimpinellifolium, could provide different nutritional advantages and can be used for tomato breeding to improve overall fruit quality. Determining the genetic basis of the inheritance of all the traits that contribute to tomato fruit quality will increase the efficiency of the breeding program necessary to take advantage of the wild species variability. A high-density linkage map has been constructed from a recombinant inbred line (RIL) population derived from a cross between tomato Solanum lycopersicum and the wild-relative species S. pimpinellifolium. The RIL population was evaluated for fruit mineral contents during three consecutive growing seasons. The data obtained allowed for the identification of main QTL and novel epistatic interaction among QTL controlling fruit mineral contents on the basis of a multiple-environment analysis. Most of the QTL were flanked by candidate genes providing valuable information for both tomato breeding for new varieties with novel nutritional properties and the starting point to identify the genes underlying these QTL, which will help to reveal the genetic basis of tomato fruit nutritional properties.

  8. A comparison of digital multi-spectral imagery versus conventional photography for mapping seagrass in Indian River Lagoon, Florida

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Virnstein, R.; Tepera, M.; Beazley, L.

    1997-06-01

    A pilot study is very briefly summarized in the article. The study tested the potential of multi-spectral digital imagery for discrimination of seagrass densities and species, algae, and bottom types. Imagery was obtained with the Compact Airborne Spectral Imager (casi) and two flight lines flown with hyper-spectral mode. The photogrammetric method used allowed interpretation of the highest quality product, eliminating limitations caused by outdated or poor quality base maps and the errors associated with transfer of polygons. Initial image analysis indicates that the multi-spectral imagery has several advantages, including sophisticated spectral signature recognition and classification, ease of geo-referencing, and rapidmore » mosaicking.« less

  9. Assessing groundwater accessibility in the Kharga Basin, Egypt: A remote sensing approach

    NASA Astrophysics Data System (ADS)

    Parks, Shawna; Byrnes, Jeffrey; Abdelsalam, Mohamed G.; Laó Dávila, Daniel A.; Atekwana, Estella A.; Atya, Magdy A.

    2017-12-01

    We used multi-map analysis of remote sensing and ancillary data to identify potentially accessible sites for groundwater resources in the Kharga Basin in the Western Desert of Egypt. This basin is dominated by Cretaceous sandstone formations and extends within the Nubian Sandstone Aquifer. It is dissected by N-S and E-W trending faults, possibly acting as conduits for upward migration of groundwater. Analysis of paleo-drainage using Digital Elevation Model (DEM) generated from the Shuttle Radar Topography Mission (SRTM) data shows that the Kharga was a closed basin that might have been the site of a paleo-lake. Lake water recharged the Nubian Sandstone Aquifer during the wetter Holocene time. We generated the following layers for the multi-map analysis: (1) Fracture density map from the interpretation of Landsat Operational Land Imager (OLI), SRTM DEM, and RADARSAT data. (2) Thermal Inertia (TI) map (for moisture content imaging) from the Moderate Resolution Imaging Spectroradiometer (MODIS) data. (3) Hydraulic conductivity map from mapping lithological units using the Landsat OLI and previously published data. (4) Aquifer thickness map from previously published data. We quantitatively ranked the Kharga Basin by considering that regions of high fracture density, high TI, thicker aquifer, and high hydraulic conductivity have higher potential for groundwater accessibility. Our analysis shows that part of the southern Kharga Basin is suitable for groundwater extraction. This region is where N-S and E-W trending faults intersect, has relatively high TI and it is underlain by thick aquifer. However, the suitability of this region for groundwater use will be reduced significantly when considering the changes in land suitability and economic depth to groundwater extraction in the next 50 years.

  10. Identification of RNF213 as a susceptibility gene for moyamoya disease and its possible role in vascular development.

    PubMed

    Liu, Wanyang; Morito, Daisuke; Takashima, Seiji; Mineharu, Yohei; Kobayashi, Hatasu; Hitomi, Toshiaki; Hashikata, Hirokuni; Matsuura, Norio; Yamazaki, Satoru; Toyoda, Atsushi; Kikuta, Ken-ichiro; Takagi, Yasushi; Harada, Kouji H; Fujiyama, Asao; Herzig, Roman; Krischek, Boris; Zou, Liping; Kim, Jeong Eun; Kitakaze, Masafumi; Miyamoto, Susumu; Nagata, Kazuhiro; Hashimoto, Nobuo; Koizumi, Akio

    2011-01-01

    Moyamoya disease is an idiopathic vascular disorder of intracranial arteries. Its susceptibility locus has been mapped to 17q25.3 in Japanese families, but the susceptibility gene is unknown. Genome-wide linkage analysis in eight three-generation families with moyamoya disease revealed linkage to 17q25.3 (P<10(-4)). Fine mapping demonstrated a 1.5-Mb disease locus bounded by D17S1806 and rs2280147. We conducted exome analysis of the eight index cases in these families, with results filtered through Ng criteria. There was a variant of p.N321S in PCMTD1 and p.R4810K in RNF213 in the 1.5-Mb locus of the eight index cases. The p.N321S variant in PCMTD1 could not be confirmed by the Sanger method. Sequencing RNF213 in 42 index cases confirmed p.R4810K and revealed it to be the only unregistered variant. Genotyping 39 SNPs around RNF213 revealed a founder haplotype transmitted in 42 families. Sequencing the 260-kb region covering the founder haplotype in one index case did not show any coding variants except p.R4810K. A case-control study demonstrated strong association of p.R4810K with moyamoya disease in East Asian populations (251 cases and 707 controls) with an odds ratio of 111.8 (P = 10(-119)). Sequencing of RNF213 in East Asian cases revealed additional novel variants: p.D4863N, p.E4950D, p.A5021V, p.D5160E, and p.E5176G. Among Caucasian cases, variants p.N3962D, p.D4013N, p.R4062Q and p.P4608S were identified. RNF213 encodes a 591-kDa cytosolic protein that possesses two functional domains: a Walker motif and a RING finger domain. These exhibit ATPase and ubiquitin ligase activities. Although the mutant alleles (p.R4810K or p.D4013N in the RING domain) did not affect transcription levels or ubiquitination activity, knockdown of RNF213 in zebrafish caused irregular wall formation in trunk arteries and abnormal sprouting vessels. We provide evidence suggesting, for the first time, the involvement of RNF213 in genetic susceptibility to moyamoya disease.

  11. Expansion of the spectrum of ITGB6-related disorders to adolescent alopecia, dentogingival abnormalities and intellectual disability

    PubMed Central

    Ansar, Muhammad; Jan, Abid; Santos-Cortez, Regie Lyn P; Wang, Xin; Suliman, Muhammad; Acharya, Anushree; Habib, Rabia; Abbe, Izoduwa; Ali, Ghazanfar; Lee, Kwanghyuk; Smith, Joshua D; Bamshad, Michael J; Shendure, Jay; Nickerson, Deborah A; Abecasis, Gonçalo R; Anderson, Peter; Annable, Marcus; Beightol, Mallory; Browning, Brian L; Buckingham, Kati J; Chen, Christina; Chin, Jennifer; Chong, Jessica X; Cooper, Gregory M; Davis, Colleen; Felker, Lindsay; Frazar, Christopher; Hanna, David; He, Zongxiao; Jain, Preti; Jarvik, Gail P; Johanson, Eric; Jun, Goo; Kircher, Martin; Kolar, Tom; Leal, Suzanne M; Luksic, Daniel; McMillin, Margaret J; McGee, Sean; Munson, Brenton; O'Roak, Brian J; Paeper, Bryan; Patterson, Karynne; Phillips, Eric; Pijoan, Jessica; Poel, Christa; Robertson, Peggy D; Santos-Cortez, Regie Lyn P; Shaffer, Tristan; Shephard, Cindy; Siegel, Deborah L; Smith, Joshua D; Staples, Jeffrey C; Tabor, Holly K; Tackett, Monica; Wang, Gao T; Yi, Qian; Nickerson, Deborah A; Shendure, Jay; Bamshad, Michael J; Ahmad, Wasim; Leal, Suzanne M

    2016-01-01

    Alopecia with mental retardation (APMR) is a very rare disorder. In this study, we report on a consanguineous Pakistani family (AP91) with mild-to-moderate intellectual disability, adolescent alopecia and dentogingival abnormalities. Using homozygosity mapping, linkage analysis and exome sequencing, we identified a novel rare missense variant c.898G>A (p.(Glu300Lys)) in ITGB6, which co-segregates with the phenotype within the family and is predicted to be deleterious. Structural modeling shows that Glu300 lies in the β-propeller domain, and is surrounded by several residues that are important for heterodimerization with α integrin. Previous studies showed that ITGB6 variants can cause amelogenesis imperfecta in humans, but patients from family AP91 who are homozygous for the c.898G>A variant present with neurological and dermatological features, indicating a role for ITGB6 beyond enamel formation. Our study demonstrates that a rare deleterious variant within ITGB6 causes not only dentogingival anomalies but also intellectual disability and alopecia. PMID:26695873

  12. Variants in Nebulin (NEB) Are Linked to the Development of Familial Primary Angle Closure Glaucoma in Basset Hounds

    PubMed Central

    Ahram, Dina F.; Grozdanic, Sinisa D.; Kecova, Helga; Henkes, Arjen; Collin, Rob W. J.; Kuehn, Markus H.

    2015-01-01

    Several dog breeds are susceptible to developing primary angle closure glaucoma (PACG), which suggests a genetic basis for the disease. We have identified a four-generation Basset Hound pedigree with characteristic autosomal recessive PACG that closely recapitulates PACG in humans. Our aim is to utilize gene mapping and whole exome sequencing approaches to identify PACG-causing sequence variants in the Basset. Extensive clinical phenotyping of all pedigree members was conducted. SNP-chip genotyping was carried out in 9 affected and 15 unaffected pedigree members. Two-point and multipoint linkage analyses of genome-wide SNP data were performed using Superlink-Online SNP-1.1 and a locus was mapped to chromosome 19q with a maximum LOD score of 3.24. The locus contains 12 Ensemble predicted canine genes and is syntenic to a region on chromosome 2 in the human genome. Using exome-sequencing analysis, a possibly damaging, non-synonymous variant in the gene Nebulin (NEB) was found to segregate with PACG which alters a phylogenetically conserved Lysine residue. The association of this variants with PACG was confirmed in a secondary cohort of unrelated Basset Hounds (p = 3.4 × 10-4, OR = 15.3 for homozygosity). Nebulin, a protein that promotes the contractile function of sarcomeres, was found to be prominently expressed in the ciliary muscles of the anterior segment. Our findings may provide insight into the molecular mechanisms that underlie PACG. The phenotypic similarities of disease presentation in dogs and humans may enable the translation of findings made in this study to patients with PACG. PMID:25938837

  13. Variants in Nebulin (NEB) Are Linked to the Development of Familial Primary Angle Closure Glaucoma in Basset Hounds.

    PubMed

    Ahram, Dina F; Grozdanic, Sinisa D; Kecova, Helga; Henkes, Arjen; Collin, Rob W J; Kuehn, Markus H

    2015-01-01

    Several dog breeds are susceptible to developing primary angle closure glaucoma (PACG), which suggests a genetic basis for the disease. We have identified a four-generation Basset Hound pedigree with characteristic autosomal recessive PACG that closely recapitulates PACG in humans. Our aim is to utilize gene mapping and whole exome sequencing approaches to identify PACG-causing sequence variants in the Basset. Extensive clinical phenotyping of all pedigree members was conducted. SNP-chip genotyping was carried out in 9 affected and 15 unaffected pedigree members. Two-point and multipoint linkage analyses of genome-wide SNP data were performed using Superlink-Online SNP-1.1 and a locus was mapped to chromosome 19q with a maximum LOD score of 3.24. The locus contains 12 Ensemble predicted canine genes and is syntenic to a region on chromosome 2 in the human genome. Using exome-sequencing analysis, a possibly damaging, non-synonymous variant in the gene Nebulin (NEB) was found to segregate with PACG which alters a phylogenetically conserved Lysine residue. The association of this variants with PACG was confirmed in a secondary cohort of unrelated Basset Hounds (p = 3.4 × 10-4, OR = 15.3 for homozygosity). Nebulin, a protein that promotes the contractile function of sarcomeres, was found to be prominently expressed in the ciliary muscles of the anterior segment. Our findings may provide insight into the molecular mechanisms that underlie PACG. The phenotypic similarities of disease presentation in dogs and humans may enable the translation of findings made in this study to patients with PACG.

  14. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions

    PubMed Central

    Han, Ying; Hazelett, Dennis J.; Wiklund, Fredrik; Schumacher, Fredrick R.; Stram, Daniel O.; Berndt, Sonja I.; Wang, Zhaoming; Rand, Kristin A.; Hoover, Robert N.; Machiela, Mitchell J.; Yeager, Merideth; Burdette, Laurie; Chung, Charles C.; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C.; Key, Timothy J.; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L.; Kolb, Suzanne; Gapstur, Susan M.; Diver, W. Ryan; Stevens, Victoria L.; Strom, Sara S.; Pettaway, Curtis A.; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A.; Yeboah, Edward D.; Tettey, Yao; Biritwum, Richard B.; Adjei, Andrew A.; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P.; Isaacs, William B.; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L.; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M.; Ingles, Sue A.; Kittles, Rick A.; Murphy, Adam B.; Blot, William J.; Signorello, Lisa B.; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M. Cristina; Wu, Suh-Yuh; Hennis, Anselm J. M.; Rybicki, Benjamin A.; Neslund-Dudas, Christine; Hsing, Ann W.; Chu, Lisa; Goodman, Phyllis J.; Klein, Eric A.; Zheng, S. Lilly; Witte, John S.; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L.; Hunter, David J.; Gronberg, Henrik; Cook, Michael B.; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J.; Easton, Douglas F.; Henderson, Brian E.; Coetzee, Gerhard A.; Conti, David V.; Haiman, Christopher A.

    2015-01-01

    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10−4–5.6 × 10−3) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10−6) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation. PMID:26162851

  15. Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions.

    PubMed

    Han, Ying; Hazelett, Dennis J; Wiklund, Fredrik; Schumacher, Fredrick R; Stram, Daniel O; Berndt, Sonja I; Wang, Zhaoming; Rand, Kristin A; Hoover, Robert N; Machiela, Mitchell J; Yeager, Merideth; Burdette, Laurie; Chung, Charles C; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C; Key, Timothy J; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L; Kolb, Suzanne; Gapstur, Susan M; Diver, W Ryan; Stevens, Victoria L; Strom, Sara S; Pettaway, Curtis A; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A; Yeboah, Edward D; Tettey, Yao; Biritwum, Richard B; Adjei, Andrew A; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P; Isaacs, William B; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M; Ingles, Sue A; Kittles, Rick A; Murphy, Adam B; Blot, William J; Signorello, Lisa B; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M Cristina; Wu, Suh-Yuh; Hennis, Anselm J M; Rybicki, Benjamin A; Neslund-Dudas, Christine; Hsing, Ann W; Chu, Lisa; Goodman, Phyllis J; Klein, Eric A; Zheng, S Lilly; Witte, John S; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L; Hunter, David J; Gronberg, Henrik; Cook, Michael B; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J; Easton, Douglas F; Henderson, Brian E; Coetzee, Gerhard A; Conti, David V; Haiman, Christopher A

    2015-10-01

    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10(-4)-5.6 × 10(-3)) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10(-6)) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  16. graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture.

    PubMed

    Chung, Dongjun; Kim, Hang J; Zhao, Hongyu

    2017-02-01

    Genome-wide association studies (GWAS) have identified tens of thousands of genetic variants associated with hundreds of phenotypes and diseases, which have provided clinical and medical benefits to patients with novel biomarkers and therapeutic targets. However, identification of risk variants associated with complex diseases remains challenging as they are often affected by many genetic variants with small or moderate effects. There has been accumulating evidence suggesting that different complex traits share common risk basis, namely pleiotropy. Recently, several statistical methods have been developed to improve statistical power to identify risk variants for complex traits through a joint analysis of multiple GWAS datasets by leveraging pleiotropy. While these methods were shown to improve statistical power for association mapping compared to separate analyses, they are still limited in the number of phenotypes that can be integrated. In order to address this challenge, in this paper, we propose a novel statistical framework, graph-GPA, to integrate a large number of GWAS datasets for multiple phenotypes using a hidden Markov random field approach. Application of graph-GPA to a joint analysis of GWAS datasets for 12 phenotypes shows that graph-GPA improves statistical power to identify risk variants compared to statistical methods based on smaller number of GWAS datasets. In addition, graph-GPA also promotes better understanding of genetic mechanisms shared among phenotypes, which can potentially be useful for the development of improved diagnosis and therapeutics. The R implementation of graph-GPA is currently available at https://dongjunchung.github.io/GGPA/.

  17. Comprehensive framework for visualizing and analyzing spatio-temporal dynamics of racial diversity in the entire United States

    PubMed Central

    Netzel, Pawel

    2017-01-01

    The United States is increasingly becoming a multi-racial society. To understand multiple consequences of this overall trend to our neighborhoods we need a methodology capable of spatio-temporal analysis of racial diversity at the local level but also across the entire U.S. Furthermore, such methodology should be accessible to stakeholders ranging from analysts to decision makers. In this paper we present a comprehensive framework for visualizing and analyzing diversity data that fulfills such requirements. The first component of our framework is a U.S.-wide, multi-year database of race sub-population grids which is freely available for download. These 30 m resolution grids have being developed using dasymetric modeling and are available for 1990-2000-2010. We summarize numerous advantages of gridded population data over commonly used Census tract-aggregated data. Using these grids frees analysts from constructing their own and allows them to focus on diversity analysis. The second component of our framework is a set of U.S.-wide, multi-year diversity maps at 30 m resolution. A diversity map is our product that classifies the gridded population into 39 communities based on their degrees of diversity, dominant race, and population density. It provides spatial information on diversity in a single, easy-to-understand map that can be utilized by analysts and end users alike. Maps based on subsequent Censuses provide information about spatio-temporal dynamics of diversity. Diversity maps are accessible through the GeoWeb application SocScape (http://sil.uc.edu/webapps/socscape_usa/) for an immediate online exploration. The third component of our framework is a proposal to quantitatively analyze diversity maps using a set of landscape metrics. Because of its form, a grid-based diversity map could be thought of as a diversity “landscape” and analyzed quantitatively using landscape metrics. We give a brief summary of most pertinent metrics and demonstrate how they can be applied to diversity maps. PMID:28358862

  18. Multi-breed and multi-trait co-association analysis of meat tenderness and other meat quality traits in three French beef cattle breeds.

    PubMed

    Ramayo-Caldas, Yuliaxis; Renand, Gilles; Ballester, Maria; Saintilan, Romain; Rocha, Dominique

    2016-04-23

    Studies to identify markers associated with beef tenderness have focused on Warner-Bratzler shear force (WBSF) but the interplay between the genes associated with WBSF has not been explored. We used the association weight matrix (AWM), a systems biology approach, to identify a set of interacting genes that are co-associated with tenderness and other meat quality traits, and shared across the Charolaise, Limousine and Blonde d'Aquitaine beef cattle breeds. Genome-wide association studies were performed using ~500K single nucleotide polymorphisms (SNPs) and 17 phenotypes measured on more than 1000 animals for each breed. First, this multi-trait approach was applied separately for each breed across 17 phenotypes and second, between- and across-breed comparisons at the AWM and functional levels were performed. Genetic heterogeneity was observed, and most of the variants that were associated with WBSF segregated within rather than across breeds. We identified 206 common candidate genes associated with WBSF across the three breeds. SNPs in these common genes explained between 28 and 30 % of the phenotypic variance for WBSF. A reduced number of common SNPs mapping to the 206 common genes were identified, suggesting that different mutations may target the same genes in a breed-specific manner. Therefore, it is likely that, depending on allele frequencies and linkage disequilibrium patterns, a SNP that is identified for one breed may not be informative for another unrelated breed. Well-known candidate genes affecting beef tenderness were identified. In addition, some of the 206 common genes are located within previously reported quantitative trait loci for WBSF in several cattle breeds. Moreover, the multi-breed co-association analysis detected new candidate genes, regulators and metabolic pathways that are likely involved in the determination of meat tenderness and other meat quality traits in beef cattle. Our results suggest that systems biology approaches that explore associations of correlated traits increase statistical power to identify candidate genes beyond the one-dimensional approach. Further studies on the 206 common genes, their pathways, regulators and interactions will expand our knowledge on the molecular basis of meat tenderness and could lead to the discovery of functional mutations useful for genomic selection in a multi-breed beef cattle context.

  19. Identification of missing variants by combining multiple analytic pipelines.

    PubMed

    Ren, Yingxue; Reddy, Joseph S; Pottier, Cyril; Sarangi, Vivekananda; Tian, Shulan; Sinnwell, Jason P; McDonnell, Shannon K; Biernacka, Joanna M; Carrasquillo, Minerva M; Ross, Owen A; Ertekin-Taner, Nilüfer; Rademakers, Rosa; Hudson, Matthew; Mainzer, Liudmila Sergeevna; Asmann, Yan W

    2018-04-16

    After decades of identifying risk factors using array-based genome-wide association studies (GWAS), genetic research of complex diseases has shifted to sequencing-based rare variants discovery. This requires large sample sizes for statistical power and has brought up questions about whether the current variant calling practices are adequate for large cohorts. It is well-known that there are discrepancies between variants called by different pipelines, and that using a single pipeline always misses true variants exclusively identifiable by other pipelines. Nonetheless, it is common practice today to call variants by one pipeline due to computational cost and assume that false negative calls are a small percent of total. We analyzed 10,000 exomes from the Alzheimer's Disease Sequencing Project (ADSP) using multiple analytic pipelines consisting of different read aligners and variant calling strategies. We compared variants identified by using two aligners in 50,100, 200, 500, 1000, and 1952 samples; and compared variants identified by adding single-sample genotyping to the default multi-sample joint genotyping in 50,100, 500, 2000, 5000 and 10,000 samples. We found that using a single pipeline missed increasing numbers of high-quality variants correlated with sample sizes. By combining two read aligners and two variant calling strategies, we rescued 30% of pass-QC variants at sample size of 2000, and 56% at 10,000 samples. The rescued variants had higher proportions of low frequency (minor allele frequency [MAF] 1-5%) and rare (MAF < 1%) variants, which are the very type of variants of interest. In 660 Alzheimer's disease cases with earlier onset ages of ≤65, 4 out of 13 (31%) previously-published rare pathogenic and protective mutations in APP, PSEN1, and PSEN2 genes were undetected by the default one-pipeline approach but recovered by the multi-pipeline approach. Identification of the complete variant set from sequencing data is the prerequisite of genetic association analyses. The current analytic practice of calling genetic variants from sequencing data using a single bioinformatics pipeline is no longer adequate with the increasingly large projects. The number and percentage of quality variants that passed quality filters but are missed by the one-pipeline approach rapidly increased with sample size.

  20. Targeted Deep Resequencing Identifies Coding Variants in the PEAR1 Gene That Play a Role in Platelet Aggregation

    PubMed Central

    Kim, Yoonhee; Suktitipat, Bhoom; Yanek, Lisa R.; Faraday, Nauder; Wilson, Alexander F.; Becker, Diane M.; Becker, Lewis C.; Mathias, Rasika A.

    2013-01-01

    Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1) gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13) selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate). Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5%) were noted in African Americans compared to European Americans (108 vs. 45). The common intronic GWAS-identified variant (rs12041331) demonstrated the most significant association signal in African Americans (p = 4.020×10−4); no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331). Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965) supports the results noted in the sequenced discovery sample: p = 3.56×10−4, 2.27×10−7, 5.20×10−5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans, and show that exonic variants play an additional role in platelet aggregation in European Americans. PMID:23704978

  1. Multi-variants synthesis of Petri nets for FPGA devices

    NASA Astrophysics Data System (ADS)

    Bukowiec, Arkadiusz; Doligalski, Michał

    2015-09-01

    There is presented new method of synthesis of application specific logic controllers for FPGA devices. The specification of control algorithm is made with use of control interpreted Petri net (PT type). It allows specifying parallel processes in easy way. The Petri net is decomposed into state-machine type subnets. In this case, each subnet represents one parallel process. For this purpose there are applied algorithms of coloring of Petri nets. There are presented two approaches of such decomposition: with doublers of macroplaces or with one global wait place. Next, subnets are implemented into two-level logic circuit of the controller. The levels of logic circuit are obtained as a result of its architectural decomposition. The first level combinational circuit is responsible for generation of next places and second level decoder is responsible for generation output symbols. There are worked out two variants of such circuits: with one shared operational memory or with many flexible distributed memories as a decoder. Variants of Petri net decomposition and structures of logic circuits can be combined together without any restrictions. It leads to existence of four variants of multi-variants synthesis.

  2. Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error.

    PubMed

    Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Alejandro Q; Musolf, Anthony; Matise, Tara C; Finch, Stephen J; Gordon, Derek

    2012-01-01

    As with any new technology, next-generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to those data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single-variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p value, no matter how many loci. Copyright © 2013 S. Karger AG, Basel.

  3. Single variant and multi-variant trend tests for genetic association with next generation sequencing that are robust to sequencing error

    PubMed Central

    Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Andrew; Musolf, Anthony; Matise, Tara C.; Finch, Stephen J.; Gordon, Derek

    2013-01-01

    As with any new technology, next generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model, based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to that data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p-value, no matter how many loci. PMID:23594495

  4. Genome-wide Association Mapping of Qualitatively Inherited Traits in a Germplasm Collection.

    PubMed

    Bandillo, Nonoy B; Lorenz, Aaron J; Graef, George L; Jarquin, Diego; Hyten, David L; Nelson, Randall L; Specht, James E

    2017-07-01

    Genome-wide association (GWA) has been used as a tool for dissecting the genetic architecture of quantitatively inherited traits. We demonstrate here that GWA can also be highly useful for detecting many major genes governing categorically defined phenotype variants that exist for qualitatively inherited traits in a germplasm collection. Genome-wide association mapping was applied to categorical phenotypic data available for 10 descriptive traits in a collection of ∼13,000 soybean [ (L.) Merr.] accessions that had been genotyped with a 50,000 single nucleotide polymorphism (SNP) chip. A GWA on a panel of accessions of this magnitude can offer substantial statistical power and mapping resolution, and we found that GWA mapping resulted in the identification of strong SNP signals for 24 classical genes as well as several heretofore unknown genes controlling the phenotypic variants in those traits. Because some of these genes had been cloned, we were able to show that the narrow GWA mapping SNP signal regions that we detected for the phenotypic variants had chromosomal bp spans that, with just one exception, overlapped the bp region of the cloned genes, despite local variation in SNP number and nonuniform SNP distribution in the chip set. Copyright © 2017 Crop Science Society of America.

  5. Exome genotyping arrays to identify rare and low frequency variants associated with epithelial ovarian cancer risk.

    PubMed

    Permuth, Jennifer B; Pirie, Ailith; Ann Chen, Y; Lin, Hui-Yi; Reid, Brett M; Chen, Zhihua; Monteiro, Alvaro; Dennis, Joe; Mendoza-Fandino, Gustavo; Anton-Culver, Hoda; Bandera, Elisa V; Bisogna, Maria; Brinton, Louise; Brooks-Wilson, Angela; Carney, Michael E; Chenevix-Trench, Georgia; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; D'Aloisio, Aimee A; Anne Doherty, Jennifer; Earp, Madalene; Edwards, Robert P; Fridley, Brooke L; Gayther, Simon A; Gentry-Maharaj, Aleksandra; Goodman, Marc T; Gronwald, Jacek; Hogdall, Estrid; Iversen, Edwin S; Jakubowska, Anna; Jensen, Allan; Karlan, Beth Y; Kelemen, Linda E; Kjaer, Suzanne K; Kraft, Peter; Le, Nhu D; Levine, Douglas A; Lissowska, Jolanta; Lubinski, Jan; Matsuo, Keitaro; Menon, Usha; Modugno, Rosemary; Moysich, Kirsten B; Nakanishi, Toru; Ness, Roberta B; Olson, Sara; Orlow, Irene; Pearce, Celeste L; Pejovic, Tanja; Poole, Elizabeth M; Ramus, Susan J; Anne Rossing, Mary; Sandler, Dale P; Shu, Xiao-Ou; Song, Honglin; Taylor, Jack A; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Tworoger, Shelley S; Webb, Penelope M; Wentzensen, Nicolas; Wilkens, Lynne R; Winham, Stacey; Woo, Yin-Ling; Wu, Anna H; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Phelan, Catherine M; Schildkraut, Joellen M; Berchuck, Andrew; Goode, Ellen L; Pharoah, Paul D P; Sellers, Thomas A

    2016-08-15

    Rare and low frequency variants are not well covered in most germline genotyping arrays and are understudied in relation to epithelial ovarian cancer (EOC) risk. To address this gap, we used genotyping arrays targeting rarer protein-coding variation in 8,165 EOC cases and 11,619 controls from the international Ovarian Cancer Association Consortium (OCAC). Pooled association analyses were conducted at the variant and gene level for 98,543 variants directly genotyped through two exome genotyping projects. Only common variants that represent or are in strong linkage disequilibrium (LD) with previously-identified signals at established loci reached traditional thresholds for exome-wide significance (P < 5.0 × 10  -   7 ). One of the most significant signals (P all histologies  =   1.01 × 10  -   13 ;P serous  =   3.54 × 10  -   14 ) occurred at 3q25.31 for rs62273959, a missense variant mapping to the LEKR1 gene that is in LD (r 2  =   0.90) with a previously identified 'best hit' (rs7651446) mapping to an intron of TIPARP. Suggestive associations (5.0 × 10  -   5  >   P≥5.0 ×10  -   7 ) were detected for rare and low-frequency variants at 16 novel loci. Four rare missense variants were identified (ACTBL2 rs73757391 (5q11.2), BTD rs200337373 (3p25.1), KRT13 rs150321809 (17q21.2) and MC2R rs104894658 (18p11.21)), but only MC2R rs104894668 had a large effect size (OR = 9.66). Genes most strongly associated with EOC risk included ACTBL2 (P AML  =   3.23 × 10  -   5 ; P SKAT-o  =   9.23 × 10  -   4 ) and KRT13 (P AML  =   1.67 × 10  -   4 ; P SKAT-o  =   1.07 × 10  -   5 ), reaffirming variant-level analysis. In summary, this large study identified several rare and low-frequency variants and genes that may contribute to EOC susceptibility, albeit with possible small effects. Future studies that integrate epidemiology, sequencing, and functional assays are needed to further unravel the unexplained heritability and biology of this disease. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  6. Genetic Modifiers of Neurofibromatosis Type 1-Associated Café-au-Lait Macule Count Identified Using Multi-platform Analysis

    PubMed Central

    Pemov, Alexander; Sung, Heejong; Hyland, Paula L.; Sloan, Jennifer L.; Ruppert, Sarah L.; Baldwin, Andrea M.; Boland, Joseph F.; Bass, Sara E.; Lee, Hyo Jung; Jones, Kristine M.; Zhang, Xijun; Mullikin, James C.; Widemann, Brigitte C.; Wilson, Alexander F.; Stewart, Douglas R.

    2014-01-01

    Neurofibromatosis type 1 (NF1) is an autosomal dominant, monogenic disorder of dysregulated neurocutaneous tissue growth. Pleiotropy, variable expressivity and few NF1 genotype-phenotype correlates limit clinical prognostication in NF1. Phenotype complexity in NF1 is hypothesized to derive in part from genetic modifiers unlinked to the NF1 locus. In this study, we hypothesized that normal variation in germline gene expression confers risk for certain phenotypes in NF1. In a set of 79 individuals with NF1, we examined the association between gene expression in lymphoblastoid cell lines with NF1-associated phenotypes and sequenced select genes with significant phenotype/expression correlations. In a discovery cohort of 89 self-reported European-Americans with NF1 we examined the association between germline sequence variants of these genes with café-au-lait macule (CALM) count, a tractable, tumor-like phenotype in NF1. Two correlated, common SNPs (rs4660761 and rs7161) between DPH2 and ATP6V0B were significantly associated with the CALM count. Analysis with tiled regression also identified SNP rs4660761 as significantly associated with CALM count. SNP rs1800934 and 12 rare variants in the mismatch repair gene MSH6 were also associated with CALM count. Both SNPs rs7161 and rs4660761 (DPH2 and ATP6V0B) were highly significant in a mega-analysis in a combined cohort of 180 self-reported European-Americans; SNP rs1800934 (MSH6) was near-significant in a meta-analysis assuming dominant effect of the minor allele. SNP rs4660761 is predicted to regulate ATP6V0B, a gene associated with melanosome biology. Individuals with homozygous mutations in MSH6 can develop an NF1-like phenotype, including multiple CALMs. Through a multi-platform approach, we identified variants that influence NF1 CALM count. PMID:25329635

  7. Cis-Regulatory Variants Affect CHRNA5 mRNA Expression in Populations of African and European Ancestry

    PubMed Central

    Wang, Jen-Chyong; Spiegel, Noah; Bertelsen, Sarah; Le, Nhung; McKenna, Nicholas; Budde, John P.; Harari, Oscar; Kapoor, Manav; Brooks, Andrew; Hancock, Dana; Tischfield, Jay; Foroud, Tatiana; Bierut, Laura J.; Steinbach, Joe Henry; Edenberg, Howard J.; Traynor, Bryan J.; Goate, Alison M.

    2013-01-01

    Variants within the gene cluster encoding α3, α5, and β4 nicotinic receptor subunits are major risk factors for substance dependence. The strongest impact on risk is associated with variation in the CHRNA5 gene, where at least two mechanisms are at work: amino acid variation and altered mRNA expression levels. The risk allele of the non-synonymous variant (rs16969968; D398N) primarily occurs on the haplotype containing the low mRNA expression allele. In populations of European ancestry, there are approximately 50 highly correlated variants in the CHRNA5-CHRNA3-CHRNB4 gene cluster and the adjacent PSMA4 gene region that are associated with CHRNA5 mRNA levels. It is not clear which of these variants contribute to the changes in CHRNA5 transcript level. Because populations of African ancestry have reduced linkage disequilibrium among variants spanning this gene cluster, eQTL mapping in subjects of African ancestry could potentially aid in defining the functional variants that affect CHRNA5 mRNA levels. We performed quantitative allele specific gene expression using frontal cortices derived from 49 subjects of African ancestry and 111 subjects of European ancestry. This method measures allele-specific transcript levels in the same individual, which eliminates other biological variation that occurs when comparing expression levels between different samples. This analysis confirmed that substance dependence associated variants have a direct cis-regulatory effect on CHRNA5 transcript levels in human frontal cortices of African and European ancestry and identified 10 highly correlated variants, located in a 9 kb region, that are potential functional variants modifying CHRNA5 mRNA expression levels. PMID:24303001

  8. VLBI Analysis with the Multi-Technique Software GEOSAT

    NASA Technical Reports Server (NTRS)

    Kierulf, Halfdan Pascal; Andersen, Per-Helge; Boeckmann, Sarah; Kristiansen, Oddgeir

    2010-01-01

    GEOSAT is a multi-technique geodetic analysis software developed at Forsvarets Forsknings Institutt (Norwegian defense research establishment). The Norwegian Mapping Authority has now installed the software and has, together with Forsvarets Forsknings Institutt, adapted the software to deliver datum-free normal equation systems in SINEX format. The goal is to be accepted as an IVS Associate Analysis Center and to provide contributions to the IVS EOP combination on a routine basis. GEOSAT is based on an upper diagonal factorized Kalman filter which allows estimation of time variable parameters like the troposphere and clocks as stochastic parameters. The tropospheric delays in various directions are mapped to tropospheric zenith delay using ray-tracing. Meteorological data from ECMWF with a resolution of six hours is used to perform the ray-tracing which depends both on elevation and azimuth. Other models are following the IERS and IVS conventions. The Norwegian Mapping Authority has submitted test SINEX files produced with GEOSAT to IVS. The results have been compared with the existing IVS combined products. In this paper the outcome of these comparisons is presented.

  9. Development of Molecular Markers Linked to Powdery Mildew Resistance Gene Pm4b by Combining SNP Discovery from Transcriptome Sequencing Data with Bulked Segregant Analysis (BSR-Seq) in Wheat.

    PubMed

    Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie

    2018-01-01

    Powdery mildew resistance gene Pm4b , originating from Triticum persicum , is effective against the prevalent Blumeria graminis f. sp. tritici ( Bgt ) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F 2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7 ∗ Bainong 3217 F 4 (carrying Pm4b ) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F 2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b -linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13 , Xics43 , and Xics76 , were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with disease resistance were predicted in this collinear genomic region, which included C2 domain protein, peroxidase activity protein, protein kinases of PKc_like super family, Mlo family protein, and catalytic domain of the serine/threonine kinases (STKc_IRAK like super family). The markers developed in the present study facilitate identification of Pm4b during its MAS practice.

  10. Development of Molecular Markers Linked to Powdery Mildew Resistance Gene Pm4b by Combining SNP Discovery from Transcriptome Sequencing Data with Bulked Segregant Analysis (BSR-Seq) in Wheat

    PubMed Central

    Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie

    2018-01-01

    Powdery mildew resistance gene Pm4b, originating from Triticum persicum, is effective against the prevalent Blumeria graminis f. sp. tritici (Bgt) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7∗Bainong 3217 F4 (carrying Pm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13, Xics43, and Xics76, were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. Seven genes associated with disease resistance were predicted in this collinear genomic region, which included C2 domain protein, peroxidase activity protein, protein kinases of PKc_like super family, Mlo family protein, and catalytic domain of the serine/threonine kinases (STKc_IRAK like super family). The markers developed in the present study facilitate identification of Pm4b during its MAS practice. PMID:29491869

  11. A comprehensive collection of annotations to interpret sequence variation in human mitochondrial transfer RNAs.

    PubMed

    Diroma, Maria Angela; Lubisco, Paolo; Attimonelli, Marcella

    2016-11-08

    The abundance of biological data characterizing the genomics era is contributing to a comprehensive understanding of human mitochondrial genetics. Nevertheless, many aspects are still unclear, specifically about the variability of the 22 human mitochondrial transfer RNA (tRNA) genes and their involvement in diseases. The complex enrichment and isolation of tRNAs in vitro leads to an incomplete knowledge of their post-transcriptional modifications and three-dimensional folding, essential for correct tRNA functioning. An accurate annotation of mitochondrial tRNA variants would be definitely useful and appreciated by mitochondrial researchers and clinicians since the most of bioinformatics tools for variant annotation and prioritization available so far cannot shed light on the functional role of tRNA variations. To this aim, we updated our MToolBox pipeline for mitochondrial DNA analysis of high throughput and Sanger sequencing data by integrating tRNA variant annotations in order to identify and characterize relevant variants not only in protein coding regions, but also in tRNA genes. The annotation step in the pipeline now provides detailed information for variants mapping onto the 22 mitochondrial tRNAs. For each mt-tRNA position along the entire genome, the relative tRNA numbering, tRNA type, cloverleaf secondary domains (loops and stems), mature nucleotide and interactions in the three-dimensional folding were reported. Moreover, pathogenicity predictions for tRNA and rRNA variants were retrieved from the literature and integrated within the annotations provided by MToolBox, both in the stand-alone version and web-based tool at the Mitochondrial Disease Sequence Data Resource (MSeqDR) website. All the information available in the annotation step of MToolBox were exploited to generate custom tracks which can be displayed in the GBrowse instance at MSeqDR website. To the best of our knowledge, specific data regarding mitochondrial variants in tRNA genes were introduced for the first time in a tool for mitochondrial genome analysis, supporting the interpretation of genetic variants in specific genomic contexts.

  12. 1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function

    PubMed Central

    Gorski, Mathias; van der Most, Peter J.; Teumer, Alexander; Chu, Audrey Y.; Li, Man; Mijatovic, Vladan; Nolte, Ilja M.; Cocca, Massimiliano; Taliun, Daniel; Gomez, Felicia; Li, Yong; Tayo, Bamidele; Tin, Adrienne; Feitosa, Mary F.; Aspelund, Thor; Attia, John; Biffar, Reiner; Bochud, Murielle; Boerwinkle, Eric; Borecki, Ingrid; Bottinger, Erwin P.; Chen, Ming-Huei; Chouraki, Vincent; Ciullo, Marina; Coresh, Josef; Cornelis, Marilyn C.; Curhan, Gary C.; d’Adamo, Adamo Pio; Dehghan, Abbas; Dengler, Laura; Ding, Jingzhong; Eiriksdottir, Gudny; Endlich, Karlhans; Enroth, Stefan; Esko, Tõnu; Franco, Oscar H.; Gasparini, Paolo; Gieger, Christian; Girotto, Giorgia; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Hancock, Stephen J.; Harris, Tamara B.; Helmer, Catherine; Höllerer, Simon; Hofer, Edith; Hofman, Albert; Holliday, Elizabeth G.; Homuth, Georg; Hu, Frank B.; Huth, Cornelia; Hutri-Kähönen, Nina; Hwang, Shih-Jen; Imboden, Medea; Johansson, Åsa; Kähönen, Mika; König, Wolfgang; Kramer, Holly; Krämer, Bernhard K.; Kumar, Ashish; Kutalik, Zoltan; Lambert, Jean-Charles; Launer, Lenore J.; Lehtimäki, Terho; de Borst, Martin; Navis, Gerjan; Swertz, Morris; Liu, Yongmei; Lohman, Kurt; Loos, Ruth J. F.; Lu, Yingchang; Lyytikäinen, Leo-Pekka; McEvoy, Mark A.; Meisinger, Christa; Meitinger, Thomas; Metspalu, Andres; Metzger, Marie; Mihailov, Evelin; Mitchell, Paul; Nauck, Matthias; Oldehinkel, Albertine J.; Olden, Matthias; WJH Penninx, Brenda; Pistis, Giorgio; Pramstaller, Peter P.; Probst-Hensch, Nicole; Raitakari, Olli T.; Rettig, Rainer; Ridker, Paul M.; Rivadeneira, Fernando; Robino, Antonietta; Rosas, Sylvia E.; Ruderfer, Douglas; Ruggiero, Daniela; Saba, Yasaman; Sala, Cinzia; Schmidt, Helena; Schmidt, Reinhold; Scott, Rodney J.; Sedaghat, Sanaz; Smith, Albert V.; Sorice, Rossella; Stengel, Benedicte; Stracke, Sylvia; Strauch, Konstantin; Toniolo, Daniela; Uitterlinden, Andre G.; Ulivi, Sheila; Viikari, Jorma S.; Völker, Uwe; Vollenweider, Peter; Völzke, Henry; Vuckovic, Dragana; Waldenberger, Melanie; Jin Wang, Jie; Yang, Qiong; Chasman, Daniel I.; Tromp, Gerard; Snieder, Harold; Heid, Iris M.; Fox, Caroline S.; Köttgen, Anna; Pattaro, Cristian; Böger, Carsten A.; Fuchsberger, Christian

    2017-01-01

    HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10−8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples. PMID:28452372

  13. 1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function.

    PubMed

    Gorski, Mathias; van der Most, Peter J; Teumer, Alexander; Chu, Audrey Y; Li, Man; Mijatovic, Vladan; Nolte, Ilja M; Cocca, Massimiliano; Taliun, Daniel; Gomez, Felicia; Li, Yong; Tayo, Bamidele; Tin, Adrienne; Feitosa, Mary F; Aspelund, Thor; Attia, John; Biffar, Reiner; Bochud, Murielle; Boerwinkle, Eric; Borecki, Ingrid; Bottinger, Erwin P; Chen, Ming-Huei; Chouraki, Vincent; Ciullo, Marina; Coresh, Josef; Cornelis, Marilyn C; Curhan, Gary C; d'Adamo, Adamo Pio; Dehghan, Abbas; Dengler, Laura; Ding, Jingzhong; Eiriksdottir, Gudny; Endlich, Karlhans; Enroth, Stefan; Esko, Tõnu; Franco, Oscar H; Gasparini, Paolo; Gieger, Christian; Girotto, Giorgia; Gottesman, Omri; Gudnason, Vilmundur; Gyllensten, Ulf; Hancock, Stephen J; Harris, Tamara B; Helmer, Catherine; Höllerer, Simon; Hofer, Edith; Hofman, Albert; Holliday, Elizabeth G; Homuth, Georg; Hu, Frank B; Huth, Cornelia; Hutri-Kähönen, Nina; Hwang, Shih-Jen; Imboden, Medea; Johansson, Åsa; Kähönen, Mika; König, Wolfgang; Kramer, Holly; Krämer, Bernhard K; Kumar, Ashish; Kutalik, Zoltan; Lambert, Jean-Charles; Launer, Lenore J; Lehtimäki, Terho; de Borst, Martin; Navis, Gerjan; Swertz, Morris; Liu, Yongmei; Lohman, Kurt; Loos, Ruth J F; Lu, Yingchang; Lyytikäinen, Leo-Pekka; McEvoy, Mark A; Meisinger, Christa; Meitinger, Thomas; Metspalu, Andres; Metzger, Marie; Mihailov, Evelin; Mitchell, Paul; Nauck, Matthias; Oldehinkel, Albertine J; Olden, Matthias; Wjh Penninx, Brenda; Pistis, Giorgio; Pramstaller, Peter P; Probst-Hensch, Nicole; Raitakari, Olli T; Rettig, Rainer; Ridker, Paul M; Rivadeneira, Fernando; Robino, Antonietta; Rosas, Sylvia E; Ruderfer, Douglas; Ruggiero, Daniela; Saba, Yasaman; Sala, Cinzia; Schmidt, Helena; Schmidt, Reinhold; Scott, Rodney J; Sedaghat, Sanaz; Smith, Albert V; Sorice, Rossella; Stengel, Benedicte; Stracke, Sylvia; Strauch, Konstantin; Toniolo, Daniela; Uitterlinden, Andre G; Ulivi, Sheila; Viikari, Jorma S; Völker, Uwe; Vollenweider, Peter; Völzke, Henry; Vuckovic, Dragana; Waldenberger, Melanie; Jin Wang, Jie; Yang, Qiong; Chasman, Daniel I; Tromp, Gerard; Snieder, Harold; Heid, Iris M; Fox, Caroline S; Köttgen, Anna; Pattaro, Cristian; Böger, Carsten A; Fuchsberger, Christian

    2017-04-28

    HapMap imputed genome-wide association studies (GWAS) have revealed >50 loci at which common variants with minor allele frequency >5% are associated with kidney function. GWAS using more complete reference sets for imputation, such as those from The 1000 Genomes project, promise to identify novel loci that have been missed by previous efforts. To investigate the value of such a more complete variant catalog, we conducted a GWAS meta-analysis of kidney function based on the estimated glomerular filtration rate (eGFR) in 110,517 European ancestry participants using 1000 Genomes imputed data. We identified 10 novel loci with p-value < 5 × 10 -8 previously missed by HapMap-based GWAS. Six of these loci (HOXD8, ARL15, PIK3R1, EYA4, ASTN2, and EPB41L3) are tagged by common SNPs unique to the 1000 Genomes reference panel. Using pathway analysis, we identified 39 significant (FDR < 0.05) genes and 127 significantly (FDR < 0.05) enriched gene sets, which were missed by our previous analyses. Among those, the 10 identified novel genes are part of pathways of kidney development, carbohydrate metabolism, cardiac septum development and glucose metabolism. These results highlight the utility of re-imputing from denser reference panels, until whole-genome sequencing becomes feasible in large samples.

  14. Trace metal mapping by laser-induced breakdown spectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kaiser, Jozef; Novotny, Dr. Karel; Hrdlicka, A

    2012-01-01

    Abstract: Laser-Induced Breakdown Spectroscopy (LIBS) is a sensitive optical technique capable of fast multi-elemental analysis of solid, gaseous and liquid samples. The potential applications of lasers for spectrochemical analysis were developed shortly after its invention; however the massive development of LIBS is connected with the availability of powerful pulsed laser sources. Since the late 80s of 20th century LIBS dominated the analytical atomic spectroscopy scene and its application are developed continuously. Here we review the utilization of LIBS for trace elements mapping in different matrices. The main emphasis is on trace metal mapping in biological samples.

  15. Methodology in the Assessment of Construction and Development Investment Projects, Including the Graphic Multi-Criteria Analysis - a Systemic Approach

    NASA Astrophysics Data System (ADS)

    Szafranko, Elżbieta

    2017-10-01

    Assessment of variant solutions developed for a building investment project needs to be made at the stage of planning. While considering alternative solutions, the investor defines various criteria, but a direct evaluation of the degree of their fulfilment by developed variant solutions can be very difficult. In practice, there are different methods which enable the user to include a large number of parameters into an analysis, but their implementation can be challenging. Some methods require advanced mathematical computations, preceded by complicating input data processing, and the generated results may not lend themselves easily to interpretation. Hence, during her research, the author has developed a systemic approach, which involves several methods and whose goal is to compare their outcome. The final stage of the proposed method consists of graphic interpretation of results. The method has been tested on a variety of building and development projects.

  16. Difference in C3–C4 metabolism underlies tradeoff between growth rate and biomass yield in Methylobacterium extorquens AM1

    DOE PAGES

    Fu, Yanfen; Beck, David A. C.; Lidstrom, Mary E.

    2016-07-19

    In this study, two variants of Methylobacterium extorquens AM1 demonstrated a trade-off between growth rate and biomass yield. In addition, growth rate and biomass yield were also affected by supplementation of growth medium with different amounts of cobalt. The metabolism changes relating to these growth phenomena as well as the trade-off were investigated in this study. 13C metabolic flux analysis was used to generate a detailed central carbon metabolic flux map with both absolute and normalized flux values. As a result, the major differences between the two variants occurred at the formate node as well as within C3-C4 inter-conversion pathways.more » Higher relative fluxes through formyltetrahydrofolate ligase, phosphoenolpyruvate carboxylase, and malic enzyme led to higher biomass yield, while higher relative fluxes through pyruvate kinase and pyruvate dehydrogenase led to higher growth rate. These results were then tested by phenotypic studies on three mutants (null pyk, null pck mutant and null dme mutant) in both variants, which agreed with the model prediction. In this study, 13C metabolic flux analysis for two strain variants of M. extorquens AM1 successfully identified metabolic pathways contributing to the trade-off between cell growth and biomass yield. Phenotypic analysis of mutants deficient in corresponding genes supported the conclusion that C3-C4 inter-conversion strategies were the major response to the trade-off.« less

  17. Difference in C3–C4 metabolism underlies tradeoff between growth rate and biomass yield in Methylobacterium extorquens AM1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fu, Yanfen; Beck, David A. C.; Lidstrom, Mary E.

    In this study, two variants of Methylobacterium extorquens AM1 demonstrated a trade-off between growth rate and biomass yield. In addition, growth rate and biomass yield were also affected by supplementation of growth medium with different amounts of cobalt. The metabolism changes relating to these growth phenomena as well as the trade-off were investigated in this study. 13C metabolic flux analysis was used to generate a detailed central carbon metabolic flux map with both absolute and normalized flux values. As a result, the major differences between the two variants occurred at the formate node as well as within C3-C4 inter-conversion pathways.more » Higher relative fluxes through formyltetrahydrofolate ligase, phosphoenolpyruvate carboxylase, and malic enzyme led to higher biomass yield, while higher relative fluxes through pyruvate kinase and pyruvate dehydrogenase led to higher growth rate. These results were then tested by phenotypic studies on three mutants (null pyk, null pck mutant and null dme mutant) in both variants, which agreed with the model prediction. In this study, 13C metabolic flux analysis for two strain variants of M. extorquens AM1 successfully identified metabolic pathways contributing to the trade-off between cell growth and biomass yield. Phenotypic analysis of mutants deficient in corresponding genes supported the conclusion that C3-C4 inter-conversion strategies were the major response to the trade-off.« less

  18. GIS Data Downloads | USDA Plant Hardiness Zone Map

    Science.gov Websites

    Acknowledgments & Citation Copyright Map & Data Downloads Map Downloads Geography (GIS) Downloads Multi & Data Downloads / GIS Data Downloads Topics Map Downloads Geography (GIS) Downloads Multi-Zip Code

  19. Distinct [18F]THK5351 binding patterns in primary progressive aphasia variants.

    PubMed

    Schaeverbeke, Jolien; Evenepoel, Charlotte; Declercq, Lieven; Gabel, Silvy; Meersmans, Karen; Bruffaerts, Rose; Adamczuk, Kate; Dries, Eva; Van Bouwel, Karen; Sieben, Anne; Pijnenburg, Yolande; Peeters, Ronald; Bormans, Guy; Van Laere, Koen; Koole, Michel; Dupont, Patrick; Vandenberghe, Rik

    2018-06-26

    To assess the binding of the PET tracer [ 18 F]THK5351 in patients with different primary progressive aphasia (PPA) variants and its correlation with clinical deficits. The majority of patients with nonfluent variant (NFV) and logopenic variant (LV) PPA have underlying tauopathy of the frontotemporal lobar or Alzheimer disease type, respectively, while patients with the semantic variant (SV) have predominantly transactive response DNA binding protein 43-kDa pathology. The study included 20 PPA patients consecutively recruited through a memory clinic (12 NFV, 5 SV, 3 LV), and 20 healthy controls. All participants received an extensive neurolinguistic assessment, magnetic resonance imaging and amyloid biomarker tests. [ 18 F]THK5351 binding patterns were assessed on standardized uptake value ratio (SUVR) images with the cerebellar grey matter as the reference using statistical parametric mapping. Whole-brain voxel-wise regression analysis was performed to evaluate the association between [ 18 F]THK5351 SUVR images and neurolinguistic scores. Analyses were performed with and without partial volume correction. Patients with NFV showed increased binding in the supplementary motor area, left premotor cortex, thalamus, basal ganglia and midbrain compared with controls and patients with SV. Patients with SV had increased binding in the temporal lobes bilaterally and in the right ventromedial frontal cortex compared with controls and patients with NFV. The whole-brain voxel-wise regression analysis revealed a correlation between agrammatism and motor speech impairment, and [ 18 F]THK5351 binding in the left supplementary motor area and left postcentral gyrus. Analysis of [ 18 F]THK5351 scans without partial volume correction revealed similar results. [ 18 F]THK5351 imaging shows a topography closely matching the anatomical distribution of predicted underlying pathology characteristic of NFV and SV PPA. [ 18 F]THK5351 binding correlates with the severity of clinical impairment.

  20. Mapping copy number variation by population-scale genome sequencing.

    PubMed

    Mills, Ryan E; Walter, Klaudia; Stewart, Chip; Handsaker, Robert E; Chen, Ken; Alkan, Can; Abyzov, Alexej; Yoon, Seungtai Chris; Ye, Kai; Cheetham, R Keira; Chinwalla, Asif; Conrad, Donald F; Fu, Yutao; Grubert, Fabian; Hajirasouliha, Iman; Hormozdiari, Fereydoun; Iakoucheva, Lilia M; Iqbal, Zamin; Kang, Shuli; Kidd, Jeffrey M; Konkel, Miriam K; Korn, Joshua; Khurana, Ekta; Kural, Deniz; Lam, Hugo Y K; Leng, Jing; Li, Ruiqiang; Li, Yingrui; Lin, Chang-Yun; Luo, Ruibang; Mu, Xinmeng Jasmine; Nemesh, James; Peckham, Heather E; Rausch, Tobias; Scally, Aylwyn; Shi, Xinghua; Stromberg, Michael P; Stütz, Adrian M; Urban, Alexander Eckehart; Walker, Jerilyn A; Wu, Jiantao; Zhang, Yujun; Zhang, Zhengdong D; Batzer, Mark A; Ding, Li; Marth, Gabor T; McVean, Gil; Sebat, Jonathan; Snyder, Michael; Wang, Jun; Ye, Kenny; Eichler, Evan E; Gerstein, Mark B; Hurles, Matthew E; Lee, Charles; McCarroll, Steven A; Korbel, Jan O

    2011-02-03

    Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.

  1. Surrogate based wind farm layout optimization using manifold mapping

    NASA Astrophysics Data System (ADS)

    Kaja Kamaludeen, Shaafi M.; van Zuijle, Alexander; Bijl, Hester

    2016-09-01

    High computational cost associated with the high fidelity wake models such as RANS or LES serves as a primary bottleneck to perform a direct high fidelity wind farm layout optimization (WFLO) using accurate CFD based wake models. Therefore, a surrogate based multi-fidelity WFLO methodology (SWFLO) is proposed. The surrogate model is built using an SBO method referred as manifold mapping (MM). As a verification, optimization of spacing between two staggered wind turbines was performed using the proposed surrogate based methodology and the performance was compared with that of direct optimization using high fidelity model. Significant reduction in computational cost was achieved using MM: a maximum computational cost reduction of 65%, while arriving at the same optima as that of direct high fidelity optimization. The similarity between the response of models, the number of mapping points and its position, highly influences the computational efficiency of the proposed method. As a proof of concept, realistic WFLO of a small 7-turbine wind farm is performed using the proposed surrogate based methodology. Two variants of Jensen wake model with different decay coefficients were used as the fine and coarse model. The proposed SWFLO method arrived at the same optima as that of the fine model with very less number of fine model simulations.

  2. New mutations in BBS genes in small consanguineous families with Bardet-Biedl syndrome: Detection of candidate regions by homozygosity mapping

    PubMed Central

    Pereiro, Ines; Piñeiro-Gallego, Teresa; Baiget, Montserrat; Borrego, Salud; Ayuso, Carmen; Searby, Charles; Nishimura, Darryl

    2010-01-01

    Purpose Bardet-Biedl syndrome (BBS, OMIM 209900) is a rare multi-organ disorder in which BBS patients manifest a variable phenotype that includes retinal dystrophy, polydactyly, mental delay, obesity, and also reproductive tract and renal abnormalities. Mutations in 14 genes (BBS1–BBS14) are found in 70% of the patients, indicating that additional mutations in known and new BBS genes remain to be identified. Therefore, the molecular diagnosis of this complex disorder is a challenging task. Methods In this study we show the use of the genome-wide homozygosity mapping strategy in the mutation detection of nine Caucasian BBS families, eight of them consanguineous and one from the same geographic area with no proven consanguinity. Results We identified the disease-causing mutation in six of the families studied, five of which had novel sequence variants in BBS3, BBS6, and BBS12. This is the first null mutation reported in BBS3. Furthermore, this approach defined homozygous candidate regions that could harbor potential candidate genes for BBS in three of the families. Conclusions These findings further underline the importance of homozygosity mapping as a useful technology for diagnosis in small consanguineous families with a complex disease like BBS. PMID:20142850

  3. A radial map of multi-whisker correlation selectivity in the rat barrel cortex

    PubMed Central

    Estebanez, Luc; Bertherat, Julien; Shulz, Daniel E.; Bourdieu, Laurent; Léger, Jean- François

    2016-01-01

    In the barrel cortex, several features of single-whisker stimuli are organized in functional maps. The barrel cortex also encodes spatio-temporal correlation patterns of multi-whisker inputs, but so far the cortical mapping of neurons tuned to such input statistics is unknown. Here we report that layer 2/3 of the rat barrel cortex contains an additional functional map based on neuronal tuning to correlated versus uncorrelated multi-whisker stimuli: neuron responses to uncorrelated multi-whisker stimulation are strongest above barrel centres, whereas neuron responses to correlated and anti-correlated multi-whisker stimulation peak above the barrel–septal borders, forming rings of multi-whisker synchrony-preferring cells. PMID:27869114

  4. A radial map of multi-whisker correlation selectivity in the rat barrel cortex.

    PubMed

    Estebanez, Luc; Bertherat, Julien; Shulz, Daniel E; Bourdieu, Laurent; Léger, Jean-François

    2016-11-21

    In the barrel cortex, several features of single-whisker stimuli are organized in functional maps. The barrel cortex also encodes spatio-temporal correlation patterns of multi-whisker inputs, but so far the cortical mapping of neurons tuned to such input statistics is unknown. Here we report that layer 2/3 of the rat barrel cortex contains an additional functional map based on neuronal tuning to correlated versus uncorrelated multi-whisker stimuli: neuron responses to uncorrelated multi-whisker stimulation are strongest above barrel centres, whereas neuron responses to correlated and anti-correlated multi-whisker stimulation peak above the barrel-septal borders, forming rings of multi-whisker synchrony-preferring cells.

  5. Optimizing the construction of devices to control inaccesible surfaces - case study

    NASA Astrophysics Data System (ADS)

    Niţu, E. L.; Costea, A.; Iordache, M. D.; Rizea, A. D.; Babă, Al

    2017-10-01

    The modern concept for the evolution of manufacturing systems requires multi-criteria optimization of technological processes and equipments, prioritizing associated criteria according to their importance. Technological preparation of the manufacturing can be developed, depending on the volume of production, to the limit of favourable economical effects related to the recovery of the costs for the design and execution of the technological equipment. Devices, as subsystems of the technological system, in the general context of modernization and diversification of machines, tools, semi-finished products and drives, are made in a multitude of constructive variants, which in many cases do not allow their identification, study and improvement. This paper presents a case study in which the multi-criteria analysis of some structures, based on a general optimization method, of novelty character, is used in order to determine the optimal construction variant of a control device. The rational construction of the control device confirms that the optimization method and the proposed calculation methods are correct and determine a different system configuration, new features and functions, and a specific method of working to control inaccessible surfaces.

  6. Multi-criteria decision making to support waste management: A critical review of current practices and methods.

    PubMed

    Goulart Coelho, Lineker M; Lange, Liséte C; Coelho, Hosmanny Mg

    2017-01-01

    Solid waste management is a complex domain involving the interaction of several dimensions; thus, its analysis and control impose continuous challenges for decision makers. In this context, multi-criteria decision-making models have become important and convenient supporting tools for solid waste management because they can handle problems involving multiple dimensions and conflicting criteria. However, the selection of the multi-criteria decision-making method is a hard task since there are several multi-criteria decision-making approaches, each one with a large number of variants whose applicability depends on information availability and the aim of the study. Therefore, to support researchers and decision makers, the objectives of this article are to present a literature review of multi-criteria decision-making applications used in solid waste management, offer a critical assessment of the current practices, and provide suggestions for future works. A brief review of fundamental concepts on this topic is first provided, followed by the analysis of 260 articles related to the application of multi-criteria decision making in solid waste management. These studies were investigated in terms of the methodology, including specific steps such as normalisation, weighting, and sensitivity analysis. In addition, information related to waste type, the study objective, and aspects considered was recorded. From the articles analysed it is noted that studies using multi-criteria decision making in solid waste management are predominantly addressed to problems related to municipal solid waste involving facility location or management strategy.

  7. Alternative transitions between existing representations in multi-scale maps

    NASA Astrophysics Data System (ADS)

    Dumont, Marion; Touya, Guillaume; Duchêne, Cécile

    2018-05-01

    Map users may have issues to achieve multi-scale navigation tasks, as cartographic objects may have various representations across scales. We assume that adding intermediate representations could be one way to reduce the differences between existing representations, and to ease the transitions across scales. We consider an existing multiscale map on the scale range from 1 : 25k to 1 : 100k scales. Based on hypotheses about intermediate representations design, we build custom multi-scale maps with alternative transitions. We will conduct in a next future a user evaluation to compare the efficiency of these alternative maps for multi-scale navigation. This paper discusses the hypotheses and production process of these alternative maps.

  8. Fine-scale mapping of 8q24 locus identifies multiple independent risk variants for breast cancer

    PubMed Central

    Zheng, Wei; Michailidou, Kyriaki; Ghoussaini, Maya; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Lush, Michael; Milne, Roger L.; Shu, Xiao-Ou; Beesley, Jonathan; Kar, Siddhartha; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Zhao, Zhiguo; Guo, Xingyi; Benitez, Javier; Beeghly-Fadiel, Alicia; Blot, William; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Hui; Canisius, Sander; Chang-Claude, Jenny; Choi, Ji-Yeob; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Darabi, Hatef; Devilee, Peter; Droit, Arnaud; Dork, Thilo; Fasching, Peter A.; Fletcher, Olivia; Flyger, Henrik; Fostira, Florentia; Gaborieau, Valerie; García-Closas, Montserrat; Giles, Graham G.; Guenel, Pascal; Haiman, Christopher A.; Hamann, Ute; Hartman, Mikael; Miao, Hui; Hollestelle, Antoinette; Hopper, John L.; Hsiung, Chia-Ni; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Torres, Diana; Kabisch, Maria; Kang, Daehee; Khan, Sofia; Knight, Julia A.; Kosma, Veli-Matti; Lambrechts, Diether; Li, Jingmei; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Le Marchand, Loic; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McLean, Catriona; Meindl, Alfons; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Nord, Silje; Børresen-Dale, Anne-Lise; Olson, Janet E.; Orr, Nick; van den Ouweland, Ans M.W.; Peterlongo, Paolo; Putti, Thomas Choudary; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Shen, Chen-Yang; Hou, Ming-Feng; Shrubsole, Matha J; Southey, Melissa C.; Swerdlow, Anthony; Teo, Soo Hwang; Thienpont, Bernard; Toland, Amanda E.; Tollenaar, Robert A.E.M.; Tomlinson, Ian; Truong, Therese; Tseng, Chiu-chen; Wen, Wanqing; Winqvist, Robert; Wu, Anna H.; Yip, Cheng Har; Zamora, Pilar M.; Zheng, Ying; Floris, Giuseppe; Cheng, Ching-Yu; Hooning, Maartje J.; Martens, John W.M.; Seynaeve, Caroline; Kristensen, Vessela N.; Hall, Per; Pharoah, Paul D.P.; Simard, Jacques; Chenevix-Trench, Georgia; Dunning, Alison M.; Antoniou, Antonis C.; Easton, Douglas F.; Cai, Qiuyin; Long, Jirong

    2016-01-01

    Previous genome-wide association studies among women of European ancestry identified two independent breast cancer susceptibility loci represented by single nucleotide polymorphisms (SNPs) rs13281615 and rs11780156 at 8q24. We conducted a fine-mapping study across 2.06 Mb (chr8:127,561,724 −129,624,067, hg19) in 55,540 breast cancer cases and 51,168 controls within the Breast Cancer Association Consortium. We found three additional independent association signals in women of European ancestry, represented by rs35961416 (OR = 0.95, 95% CI = 0.93-0.97, conditional P = 5.8 × 10−6), rs7815245 (OR = 0.94, 95% CI = 0.91-0.96, conditional P = 1.1 × 10−6), and rs2033101 (OR = 1.05, 95% CI = 1.02-1.07, conditional P = 1.1 × 10−4). Integrative analysis using functional genomic data from the Roadmap Epigenomics, the Encyclopedia of DNA Elements project, the Cancer Genome Atlas, and other public resources implied that SNPs rs7815245 in Signal 3, and rs1121948 in Signal 5 (in linkage disequilibrium with rs11780156, r2 = 0.77), were putatively functional variants for two of the five independent association signals. Our results highlight multiple 8q24 variants associated with breast cancer susceptibility in women of European ancestry. PMID:27087578

  9. Using self-organizing maps to determine observation threshold limit predictions in highly variant data

    USGS Publications Warehouse

    Paganoni, C.A.; Chang, K.C.; Robblee, M.B.

    2006-01-01

    A significant data quality challenge for highly variant systems surrounds the limited ability to quantify operationally reasonable limits on the data elements being collected and provide reasonable threshold predictions. In many instances, the number of influences that drive a resulting value or operational range is too large to enable physical sampling for each influencer, or is too complicated to accurately model in an explicit simulation. An alternative method to determine reasonable observation thresholds is to employ an automation algorithm that would emulate a human analyst visually inspecting data for limits. Using the visualization technique of self-organizing maps (SOM) on data having poorly understood relationships, a methodology for determining threshold limits was developed. To illustrate this approach, analysis of environmental influences that drive the abundance of a target indicator species (the pink shrimp, Farfantepenaeus duorarum) provided a real example of applicability. The relationship between salinity and temperature and abundance of F. duorarum is well documented, but the effect of changes in water quality upstream on pink shrimp abundance is not well understood. The highly variant nature surrounding catch of a specific number of organisms in the wild, and the data available from up-stream hydrology measures for salinity and temperature, made this an ideal candidate for the approach to provide a determination about the influence of changes in hydrology on populations of organisms.

  10. Using self-organizing maps to determine observation threshold limit predictions in highly variant data

    NASA Astrophysics Data System (ADS)

    Paganoni, Christopher A.; Chang, K. C.; Robblee, Michael B.

    2006-05-01

    A significant data quality challenge for highly variant systems surrounds the limited ability to quantify operationally reasonable limits on the data elements being collected and provide reasonable threshold predictions. In many instances, the number of influences that drive a resulting value or operational range is too large to enable physical sampling for each influencer, or is too complicated to accurately model in an explicit simulation. An alternative method to determine reasonable observation thresholds is to employ an automation algorithm that would emulate a human analyst visually inspecting data for limits. Using the visualization technique of self-organizing maps (SOM) on data having poorly understood relationships, a methodology for determining threshold limits was developed. To illustrate this approach, analysis of environmental influences that drive the abundance of a target indicator species (the pink shrimp, Farfantepenaeus duorarum) provided a real example of applicability. The relationship between salinity and temperature and abundance of F. duorarum is well documented, but the effect of changes in water quality upstream on pink shrimp abundance is not well understood. The highly variant nature surrounding catch of a specific number of organisms in the wild, and the data available from up-stream hydrology measures for salinity and temperature, made this an ideal candidate for the approach to provide a determination about the influence of changes in hydrology on populations of organisms.

  11. A dataset of multiresolution functional brain parcellations in an elderly population with no or mild cognitive impairment.

    PubMed

    Tam, Angela; Dansereau, Christian; Badhwar, AmanPreet; Orban, Pierre; Belleville, Sylvie; Chertkow, Howard; Dagher, Alain; Hanganu, Alexandru; Monchi, Oury; Rosa-Neto, Pedro; Shmuel, Amir; Breitner, John; Bellec, Pierre

    2016-12-01

    We present group eight resolutions of brain parcellations for clusters generated from resting-state functional magnetic resonance images for 99 cognitively normal elderly persons and 129 patients with mild cognitive impairment, pooled from four independent datasets. This dataset was generated as part of the following study: Common Effects of Amnestic Mild Cognitive Impairment on Resting-State Connectivity Across Four Independent Studies (Tam et al., 2015) [1]. The brain parcellations have been registered to both symmetric and asymmetric MNI brain templates and generated using a method called bootstrap analysis of stable clusters (BASC) (Bellec et al., 2010) [2]. We present two variants of these parcellations. One variant contains bihemisphereic parcels (4, 6, 12, 22, 33, 65, 111, and 208 total parcels across eight resolutions). The second variant contains spatially connected regions of interest (ROIs) that span only one hemisphere (10, 17, 30, 51, 77, 199, and 322 total ROIs across eight resolutions). We also present maps illustrating functional connectivity differences between patients and controls for four regions of interest (striatum, dorsal prefrontal cortex, middle temporal lobe, and medial frontal cortex). The brain parcels and associated statistical maps have been publicly released as 3D volumes, available in .mnc and .nii file formats on figshare and on Neurovault. Finally, the code used to generate this dataset is available on Github.

  12. Fine-scale mapping of 8q24 locus identifies multiple independent risk variants for breast cancer.

    PubMed

    Shi, Jiajun; Zhang, Yanfeng; Zheng, Wei; Michailidou, Kyriaki; Ghoussaini, Maya; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Lush, Michael; Milne, Roger L; Shu, Xiao-Ou; Beesley, Jonathan; Kar, Siddhartha; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Zhao, Zhiguo; Guo, Xingyi; Benitez, Javier; Beeghly-Fadiel, Alicia; Blot, William; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Hui; Canisius, Sander; Chang-Claude, Jenny; Choi, Ji-Yeob; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Darabi, Hatef; Devilee, Peter; Droit, Arnaud; Dork, Thilo; Fasching, Peter A; Fletcher, Olivia; Flyger, Henrik; Fostira, Florentia; Gaborieau, Valerie; García-Closas, Montserrat; Giles, Graham G; Guenel, Pascal; Haiman, Christopher A; Hamann, Ute; Hartman, Mikael; Miao, Hui; Hollestelle, Antoinette; Hopper, John L; Hsiung, Chia-Ni; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Torres, Diana; Kabisch, Maria; Kang, Daehee; Khan, Sofia; Knight, Julia A; Kosma, Veli-Matti; Lambrechts, Diether; Li, Jingmei; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Le Marchand, Loic; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McLean, Catriona; Meindl, Alfons; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Nord, Silje; Børresen-Dale, Anne-Lise; Olson, Janet E; Orr, Nick; van den Ouweland, Ans M W; Peterlongo, Paolo; Putti, Thomas Choudary; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Shen, Chen-Yang; Hou, Ming-Feng; Shrubsole, Matha J; Southey, Melissa C; Swerdlow, Anthony; Teo, Soo Hwang; Thienpont, Bernard; Toland, Amanda E; Tollenaar, Robert A E M; Tomlinson, Ian; Truong, Therese; Tseng, Chiu-Chen; Wen, Wanqing; Winqvist, Robert; Wu, Anna H; Yip, Cheng Har; Zamora, Pilar M; Zheng, Ying; Floris, Giuseppe; Cheng, Ching-Yu; Hooning, Maartje J; Martens, John W M; Seynaeve, Caroline; Kristensen, Vessela N; Hall, Per; Pharoah, Paul D P; Simard, Jacques; Chenevix-Trench, Georgia; Dunning, Alison M; Antoniou, Antonis C; Easton, Douglas F; Cai, Qiuyin; Long, Jirong

    2016-09-15

    Previous genome-wide association studies among women of European ancestry identified two independent breast cancer susceptibility loci represented by single nucleotide polymorphisms (SNPs) rs13281615 and rs11780156 at 8q24. A fine-mapping study across 2.06 Mb (chr8:127,561,724-129,624,067, hg19) in 55,540 breast cancer cases and 51,168 controls within the Breast Cancer Association Consortium was conducted. Three additional independent association signals in women of European ancestry, represented by rs35961416 (OR = 0.95, 95% CI = 0.93-0.97, conditional p = 5.8 × 10(-6) ), rs7815245 (OR = 0.94, 95% CI = 0.91-0.96, conditional p = 1.1 × 10(-6) ) and rs2033101 (OR = 1.05, 95% CI = 1.02-1.07, conditional p = 1.1 × 10(-4) ) were found. Integrative analysis using functional genomic data from the Roadmap Epigenomics, the Encyclopedia of DNA Elements project, the Cancer Genome Atlas and other public resources implied that SNPs rs7815245 in Signal 3, and rs1121948 in Signal 5 (in linkage disequilibrium with rs11780156, r(2)  = 0.77), were putatively functional variants for two of the five independent association signals. The results highlighted multiple 8q24 variants associated with breast cancer susceptibility in women of European ancestry. © 2016 UICC.

  13. The Model Analyst’s Toolkit: Scientific Model Development, Analysis, and Validation

    DTIC Science & Technology

    2013-11-20

    Granger causality F-test validation 3.1.2. Dynamic time warping for uneven temporal relationships Many causal relationships are imperfectly...mapping for dynamic feedback models Granger causality and DTW can identify causal relationships and consider complex temporal factors. However, many ...variant of the tf-idf algorithm (Manning, Raghavan, Schutze et al., 2008), typically used in search engines, to “score” features. The (-log tf) in

  14. Whole-exome sequencing identifies common and rare variant metabolic QTLs in a Middle Eastern population.

    PubMed

    Yousri, Noha A; Fakhro, Khalid A; Robay, Amal; Rodriguez-Flores, Juan L; Mohney, Robert P; Zeriri, Hassina; Odeh, Tala; Kader, Sara Abdul; Aldous, Eman K; Thareja, Gaurav; Kumar, Manish; Al-Shakaki, Alya; Chidiac, Omar M; Mohamoud, Yasmin A; Mezey, Jason G; Malek, Joel A; Crystal, Ronald G; Suhre, Karsten

    2018-01-23

    Metabolomics-genome-wide association studies (mGWAS) have uncovered many metabolic quantitative trait loci (mQTLs) influencing human metabolic individuality, though predominantly in European cohorts. By combining whole-exome sequencing with a high-resolution metabolomics profiling for a highly consanguineous Middle Eastern population, we discover 21 common variant and 12 functional rare variant mQTLs, of which 45% are novel altogether. We fine-map 10 common variant mQTLs to new metabolite ratio associations, and 11 common variant mQTLs to putative protein-altering variants. This is the first work to report common and rare variant mQTLs linked to diseases and/or pharmacological targets in a consanguineous Arab cohort, with wide implications for precision medicine in the Middle East.

  15. Genetic Analysis of Aspartate Aminotransferase Isozymes from Hybrids between DROSOPHILA MELANOGASTER and DROSOPHILA SIMULANS and Mutagen-Induced Isozyme Variants

    PubMed Central

    Grell, E. H.

    1976-01-01

    The aspartate aminotransferases (designated GOT1 and GOT2) are two enzymes of Drosophila melanogaster for which naturally occurring electrophoretic variants were not found. There is an electrophoretic difference between D. melanogaster and D. simulans. Since the F 1 hybrid offspring of these species are sterile, a genetic analysis of the ordinary type cannot be done on differences between the two species. A method was devised to make "partial hybrids" in which one chromosome arm is homozygous for melanogaster genes in an otherwise hybrid background. By using this method, Got1 was localized to 2R and Got2 to 2L. Once a gene can be assigned to a chromosome, it may be followed in crossing schemes and mutations from mutagen treatments may be looked for. At the locus of Got1 a mutation with low activity was recovered and designated Got1lo. It was located at a genetic map position of 75 on 2R. A Got2 mutant with a greater migration to the anode was recovered and designated Got2 J. It was located at a genetic map position of 3.0, and in the salivary chromosome was between 22B1 and 22B4 inclusive. PMID:823072

  16. Fine-scale mapping of the FGFR2 breast cancer risk locus: putative functional variants differentially bind FOXA1 and E2F1.

    PubMed

    Meyer, Kerstin B; O'Reilly, Martin; Michailidou, Kyriaki; Carlebur, Saskia; Edwards, Stacey L; French, Juliet D; Prathalingham, Radhika; Dennis, Joe; Bolla, Manjeet K; Wang, Qin; de Santiago, Ines; Hopper, John L; Tsimiklis, Helen; Apicella, Carmel; Southey, Melissa C; Schmidt, Marjanka K; Broeks, Annegien; Van 't Veer, Laura J; Hogervorst, Frans B; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A; Lux, Michael P; Ekici, Arif B; Beckmann, Matthias W; Peto, Julian; Dos Santos Silva, Isabel; Fletcher, Olivia; Johnson, Nichola; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L; Zamora, M Pilar; Arias, Jose I; Benitez, Javier; Neuhausen, Susan; Anton-Culver, Hoda; Ziogas, Argyrios; Dur, Christina C; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Schmutzler, Rita K; Engel, Christoph; Ditsch, Nina; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Dörk, Thilo; Helbig, Sonja; Bogdanova, Natalia V; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Chenevix-Trench, Georgia; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Lambrechts, Diether; Thienpont, Bernard; Christiaens, Marie-Rose; Smeets, Ann; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Bernard, Loris; Couch, Fergus J; Olson, Janet E; Wang, Xianshu; Purrington, Kristen; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; McLean, Catriona; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Teo, Soo-Hwang; Yip, Cheng-Har; Phuah, Sze-Yee; Kristensen, Vessela; Grenaker Alnæs, Grethe; Børresen-Dale, Anne-Lise; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline M; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Czene, Kamila; Darabi, Hartef; Eriksson, Kimael; Hooning, Maartje J; Martens, John W M; van den Ouweland, Ans M W; van Deurzen, Carolien H M; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Reed, Malcolm W R; Blot, William; Signorello, Lisa B; Cai, Qiuyin; Pharoah, Paul D P; Ghoussaini, Maya; Harrington, Patricia; Tyrer, Jonathan; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Hartman, Mikael; Hui, Miao; Lim, Wei-Yen; Buhari, Shaik A; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Vachon, Celine; Slager, Susan; Fostira, Florentia; Pilarski, Robert; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Hou, Ming-Feng; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J; Ponder, Bruce A J; Dunning, Alison M; Easton, Douglas F

    2013-12-05

    The 10q26 locus in the second intron of FGFR2 is the locus most strongly associated with estrogen-receptor-positive breast cancer in genome-wide association studies. We conducted fine-scale mapping in case-control studies genotyped with a custom chip (iCOGS), comprising 41 studies (n = 89,050) of European ancestry, 9 Asian ancestry studies (n = 13,983), and 2 African ancestry studies (n = 2,028) from the Breast Cancer Association Consortium. We identified three statistically independent risk signals within the locus. Within risk signals 1 and 3, genetic analysis identified five and two variants, respectively, highly correlated with the most strongly associated SNPs. By using a combination of genetic fine mapping, data on DNase hypersensitivity, and electrophoretic mobility shift assays to study protein-DNA binding, we identified rs35054928, rs2981578, and rs45631563 as putative functional SNPs. Chromatin immunoprecipitation showed that FOXA1 preferentially bound to the risk-associated allele (C) of rs2981578 and was able to recruit ERα to this site in an allele-specific manner, whereas E2F1 preferentially bound the risk variant of rs35054928. The risk alleles were preferentially found in open chromatin and bound by Ser5 phosphorylated RNA polymerase II, suggesting that the risk alleles are associated with changes in transcription. Chromatin conformation capture demonstrated that the risk region was able to interact with the promoter of FGFR2, the likely target gene of this risk region. A role for FOXA1 in mediating breast cancer susceptibility at this locus is consistent with the finding that the FGFR2 risk locus primarily predisposes to estrogen-receptor-positive disease. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  17. Fine-Scale Mapping of the FGFR2 Breast Cancer Risk Locus: Putative Functional Variants Differentially Bind FOXA1 and E2F1

    PubMed Central

    Meyer, Kerstin B.; O’Reilly, Martin; Michailidou, Kyriaki; Carlebur, Saskia; Edwards, Stacey L.; French, Juliet D.; Prathalingham, Radhika; Dennis, Joe; Bolla, Manjeet K.; Wang, Qin; de Santiago, Ines; Hopper, John L.; Tsimiklis, Helen; Apicella, Carmel; Southey, Melissa C.; Schmidt, Marjanka K.; Broeks, Annegien; Van ’t Veer, Laura J.; Hogervorst, Frans B.; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A.; Lux, Michael P.; Ekici, Arif B.; Beckmann, Matthias W.; Peto, Julian; dos Santos Silva, Isabel; Fletcher, Olivia; Johnson, Nichola; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Milne, Roger L.; Zamora, M. Pilar; Arias, Jose I.; Benitez, Javier; Neuhausen, Susan; Anton-Culver, Hoda; Ziogas, Argyrios; Dur, Christina C.; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Schmutzler, Rita K.; Engel, Christoph; Ditsch, Nina; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Dörk, Thilo; Helbig, Sonja; Bogdanova, Natalia V.; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Chenevix-Trench, Georgia; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Lambrechts, Diether; Thienpont, Bernard; Christiaens, Marie-Rose; Smeets, Ann; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Bernard, Loris; Couch, Fergus J.; Olson, Janet E.; Wang, Xianshu; Purrington, Kristen; Giles, Graham G.; Severi, Gianluca; Baglietto, Laura; McLean, Catriona; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Teo, Soo-Hwang; Yip, Cheng-Har; Phuah, Sze-Yee; Kristensen, Vessela; Grenaker Alnæs, Grethe; Børresen-Dale, Anne-Lise; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline M.; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Czene, Kamila; Darabi, Hartef; Eriksson, Kimael; Hooning, Maartje J.; Martens, John W.M.; van den Ouweland, Ans M.W.; van Deurzen, Carolien H.M.; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Reed, Malcolm W.R.; Blot, William; Signorello, Lisa B.; Cai, Qiuyin; Pharoah, Paul D.P.; Ghoussaini, Maya; Harrington, Patricia; Tyrer, Jonathan; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Hartman, Mikael; Hui, Miao; Lim, Wei-Yen; Buhari, Shaik A.; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Vachon, Celine; Slager, Susan; Fostira, Florentia; Pilarski, Robert; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Hou, Ming-Feng; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J.; Ponder, Bruce A.J.; Dunning, Alison M.; Easton, Douglas F.

    2013-01-01

    The 10q26 locus in the second intron of FGFR2 is the locus most strongly associated with estrogen-receptor-positive breast cancer in genome-wide association studies. We conducted fine-scale mapping in case-control studies genotyped with a custom chip (iCOGS), comprising 41 studies (n = 89,050) of European ancestry, 9 Asian ancestry studies (n = 13,983), and 2 African ancestry studies (n = 2,028) from the Breast Cancer Association Consortium. We identified three statistically independent risk signals within the locus. Within risk signals 1 and 3, genetic analysis identified five and two variants, respectively, highly correlated with the most strongly associated SNPs. By using a combination of genetic fine mapping, data on DNase hypersensitivity, and electrophoretic mobility shift assays to study protein-DNA binding, we identified rs35054928, rs2981578, and rs45631563 as putative functional SNPs. Chromatin immunoprecipitation showed that FOXA1 preferentially bound to the risk-associated allele (C) of rs2981578 and was able to recruit ERα to this site in an allele-specific manner, whereas E2F1 preferentially bound the risk variant of rs35054928. The risk alleles were preferentially found in open chromatin and bound by Ser5 phosphorylated RNA polymerase II, suggesting that the risk alleles are associated with changes in transcription. Chromatin conformation capture demonstrated that the risk region was able to interact with the promoter of FGFR2, the likely target gene of this risk region. A role for FOXA1 in mediating breast cancer susceptibility at this locus is consistent with the finding that the FGFR2 risk locus primarily predisposes to estrogen-receptor-positive disease. PMID:24290378

  18. A computer software system for integration and analysis of grid-based remote sensing data with other natural resource data. Remote Sensing Project

    NASA Technical Reports Server (NTRS)

    Tilmann, S. E.; Enslin, W. R.; Hill-Rowley, R.

    1977-01-01

    A computer-based information system is described designed to assist in the integration of commonly available spatial data for regional planning and resource analysis. The Resource Analysis Program (RAP) provides a variety of analytical and mapping phases for single factor or multi-factor analyses. The unique analytical and graphic capabilities of RAP are demonstrated with a study conducted in Windsor Township, Eaton County, Michigan. Soil, land cover/use, topographic and geological maps were used as a data base to develope an eleven map portfolio. The major themes of the portfolio are land cover/use, non-point water pollution, waste disposal, and ground water recharge.

  19. Characterization of the basic charge variants of a human IgG1

    PubMed Central

    Lu, Franklin; Derfus, Gayle; Kluck, Brian; Nogal, Bartek; Emery, Craig; Summers, Christie; Zheng, Kai; Bayer, Robert; Amanullah, Ashraf

    2011-01-01

    We report a case study of an IgG1 with a unique basic charge variant profile caused by C-terminal proline amidation on either one or two heavy chains. The proline amidation was sensitive to copper ion concentration in the production media during cell culture: the higher the Cu2+ ion concentration, the higher the level of proline amidation detected. This conclusion was supported by the analysis of samples that revealed direct correlation between the proline amidation level observed from peptide maps and the level of basic peaks measured by imaged capillary isoelectric focusing and a pH gradient ion-exchange chromatography method. The importance of these observations to therapeutic antibody production is discussed. PMID:22123059

  20. Characterization of the basic charge variants of a human IgG1: effect of copper concentration in cell culture media.

    PubMed

    Kaschak, Timothy; Boyd, Daniel; Lu, Franklin; Derfus, Gayle; Kluck, Brian; Nogal, Bartek; Emery, Craig; Summers, Christie; Zheng, Kai; Bayer, Robert; Amanullah, Ashraf; Yan, Boxu

    2011-01-01

    We report a case study of an IgG1 with a unique basic charge variant profile caused by C-terminal proline amidation on either one or two heavy chains. The proline amidation was sensitive to copper ion concentration in the production media during cell culture: the higher the Cu ( 2+) ion concentration, the higher the level of proline amidation detected. This conclusion was supported by the analysis of samples that revealed direct correlation between the proline amidation level observed from peptide maps and the level of basic peaks measured by imaged capillary isoelectric focusing and a pH gradient ion-exchange chromatography method. The importance of these observations to therapeutic antibody production is discussed.

  1. Structural Variation Shapes the Landscape of Recombination in Mouse

    PubMed Central

    Morgan, Andrew P.; Gatti, Daniel M.; Najarian, Maya L.; Keane, Thomas M.; Galante, Raymond J.; Pack, Allan I.; Mott, Richard; Churchill, Gary A.; de Villena, Fernando Pardo-Manuel

    2017-01-01

    Meiotic recombination is an essential feature of sexual reproduction that ensures faithful segregation of chromosomes and redistributes genetic variants in populations. Multiparent populations such as the Diversity Outbred (DO) mouse stock accumulate large numbers of crossover (CO) events between founder haplotypes, and thus present a unique opportunity to study the role of genetic variation in shaping the recombination landscape. We obtained high-density genotype data from 6886 DO mice, and localized 2.2 million CO events to intervals with a median size of 28 kb. The resulting sex-averaged genetic map of the DO population is highly concordant with large-scale (order 10 Mb) features of previously reported genetic maps for mouse. To examine fine-scale (order 10 kb) patterns of recombination in the DO, we overlaid putative recombination hotspots onto our CO intervals. We found that CO intervals are enriched in hotspots compared to the genomic background. However, as many as 26% of CO intervals do not overlap any putative hotspots, suggesting that our understanding of hotspots is incomplete. We also identified coldspots encompassing 329 Mb, or 12% of observable genome, in which there is little or no recombination. In contrast to hotspots, which are a few kilobases in size, and widely scattered throughout the genome, coldspots have a median size of 2.1 Mb and are spatially clustered. Coldspots are strongly associated with copy-number variant (CNV) regions, especially multi-allelic clusters, identified from whole-genome sequencing of 228 DO mice. Genes in these regions have reduced expression, and epigenetic features of closed chromatin in male germ cells, which suggests that CNVs may repress recombination by altering chromatin structure in meiosis. Our findings demonstrate how multiparent populations, by bridging the gap between large-scale and fine-scale genetic mapping, can reveal new features of the recombination landscape. PMID:28592499

  2. Structural Variation Shapes the Landscape of Recombination in Mouse.

    PubMed

    Morgan, Andrew P; Gatti, Daniel M; Najarian, Maya L; Keane, Thomas M; Galante, Raymond J; Pack, Allan I; Mott, Richard; Churchill, Gary A; de Villena, Fernando Pardo-Manuel

    2017-06-01

    Meiotic recombination is an essential feature of sexual reproduction that ensures faithful segregation of chromosomes and redistributes genetic variants in populations. Multiparent populations such as the Diversity Outbred (DO) mouse stock accumulate large numbers of crossover (CO) events between founder haplotypes, and thus present a unique opportunity to study the role of genetic variation in shaping the recombination landscape. We obtained high-density genotype data from [Formula: see text] DO mice, and localized 2.2 million CO events to intervals with a median size of 28 kb. The resulting sex-averaged genetic map of the DO population is highly concordant with large-scale (order 10 Mb) features of previously reported genetic maps for mouse. To examine fine-scale (order 10 kb) patterns of recombination in the DO, we overlaid putative recombination hotspots onto our CO intervals. We found that CO intervals are enriched in hotspots compared to the genomic background. However, as many as [Formula: see text] of CO intervals do not overlap any putative hotspots, suggesting that our understanding of hotspots is incomplete. We also identified coldspots encompassing 329 Mb, or [Formula: see text] of observable genome, in which there is little or no recombination. In contrast to hotspots, which are a few kilobases in size, and widely scattered throughout the genome, coldspots have a median size of 2.1 Mb and are spatially clustered. Coldspots are strongly associated with copy-number variant (CNV) regions, especially multi-allelic clusters, identified from whole-genome sequencing of 228 DO mice. Genes in these regions have reduced expression, and epigenetic features of closed chromatin in male germ cells, which suggests that CNVs may repress recombination by altering chromatin structure in meiosis. Our findings demonstrate how multiparent populations, by bridging the gap between large-scale and fine-scale genetic mapping, can reveal new features of the recombination landscape. Copyright © 2017 by the Genetics Society of America.

  3. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.

    DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a trulymore » genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. Here we found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.« less

  4. A comprehensive examination of breast cancer risk loci in African American women

    PubMed Central

    Feng, Ye; Stram, Daniel O.; Rhie, Suhn Kyong; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Olshan, Andrew F.; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Palmer, Julie R.; Olopade, Olufunmilayo I.; Huo, Dezheng; Adebamowo, Clement A.; Ogundiran, Temidayo; Chen, Gary K.; Stram, Alex; Park, Karen; Rand, Kristin A.; Chanock, Stephen J.; Le Marchand, Loic; Kolonel, Laurence N.; Conti, David V.; Easton, Douglas; Henderson, Brian E.; Haiman, Christopher A.

    2014-01-01

    Genome-wide association studies have identified 73 breast cancer risk variants mainly in European populations. Given considerable differences in linkage disequilibrium structure between populations of European and African ancestry, the known risk variants may not be informative for risk in African ancestry populations. In a previous fine-mapping investigation of 19 breast cancer loci, we were able to identify SNPs in four regions that better captured risk associations in African American women. In this study of breast cancer in African American women (3016 cases, 2745 controls), we tested an additional 54 novel breast cancer risk variants. Thirty-eight variants (70%) were found to have an association with breast cancer in the same direction as previously reported, with eight (15%) replicating at P < 0.05. Through fine-mapping, in three regions (1q32, 3p24, 10q25), we identified variants that better captured associations with overall breast cancer or estrogen receptor positive disease. We also observed suggestive associations with variants (at P < 5 × 10−6) in three separate regions (6q25, 14q13, 22q12) that may represent novel risk variants. Directional consistency of association observed for ∼65–70% of currently known genetic variants for breast cancer in women of African ancestry implies a shared functional common variant at most loci. To validate and enhance the spectrum of alleles that define associations at the known breast cancer risk loci, as well as genome-wide, will require even larger collaborative efforts in women of African ancestry. PMID:24852375

  5. Quantitative X-ray mapping, scatter diagrams and the generation of correction maps to obtain more information about your material

    NASA Astrophysics Data System (ADS)

    Wuhrer, R.; Moran, K.

    2014-03-01

    Quantitative X-ray mapping with silicon drift detectors and multi-EDS detector systems have become an invaluable analysis technique and one of the most useful methods of X-ray microanalysis today. The time to perform an X-ray map has reduced considerably with the ability to map minor and trace elements very accurately due to the larger detector area and higher count rate detectors. Live X-ray imaging can now be performed with a significant amount of data collected in a matter of minutes. A great deal of information can be obtained from X-ray maps. This includes; elemental relationship or scatter diagram creation, elemental ratio mapping, chemical phase mapping (CPM) and quantitative X-ray maps. In obtaining quantitative x-ray maps, we are able to easily generate atomic number (Z), absorption (A), fluorescence (F), theoretical back scatter coefficient (η), and quantitative total maps from each pixel in the image. This allows us to generate an image corresponding to each factor (for each element present). These images allow the user to predict and verify where they are likely to have problems in our images, and are especially helpful to look at possible interface artefacts. The post-processing techniques to improve the quantitation of X-ray map data and the development of post processing techniques for improved characterisation are covered in this paper.

  6. Multi-Scale Surface Descriptors

    PubMed Central

    Cipriano, Gregory; Phillips, George N.; Gleicher, Michael

    2010-01-01

    Local shape descriptors compactly characterize regions of a surface, and have been applied to tasks in visualization, shape matching, and analysis. Classically, curvature has be used as a shape descriptor; however, this differential property characterizes only an infinitesimal neighborhood. In this paper, we provide shape descriptors for surface meshes designed to be multi-scale, that is, capable of characterizing regions of varying size. These descriptors capture statistically the shape of a neighborhood around a central point by fitting a quadratic surface. They therefore mimic differential curvature, are efficient to compute, and encode anisotropy. We show how simple variants of mesh operations can be used to compute the descriptors without resorting to expensive parameterizations, and additionally provide a statistical approximation for reduced computational cost. We show how these descriptors apply to a number of uses in visualization, analysis, and matching of surfaces, particularly to tasks in protein surface analysis. PMID:19834190

  7. Empirical characteristics of family-based linkage to a complex trait: the ADIPOQ region and adiponectin levels.

    PubMed

    Hellwege, Jacklyn N; Palmer, Nicholette D; Mark Brown, W; Brown, Mark W; Ziegler, Julie T; Sandy An, S; An, Sandy S; Guo, Xiuqing; Ida Chen, Y-D; Chen, Ida Y-D; Taylor, Kent; Hawkins, Gregory A; Ng, Maggie C Y; Speliotes, Elizabeth K; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Wagenknecht, Lynne E; Langefeld, Carl D; Bowden, Donald W

    2015-02-01

    We previously identified a low-frequency (1.1 %) coding variant (G45R; rs200573126) in the adiponectin gene (ADIPOQ) which was the basis for a multipoint microsatellite linkage signal (LOD = 8.2) for plasma adiponectin levels in Hispanic families. We have empirically evaluated the ability of data from targeted common variants, exome chip genotyping, and genome-wide association study data to detect linkage and association to adiponectin protein levels at this locus. Simple two-point linkage and association analyses were performed in 88 Hispanic families (1,150 individuals) using 10,958 SNPs on chromosome 3. Approaches were compared for their ability to map the functional variant, G45R, which was strongly linked (two-point LOD = 20.98) and powerfully associated (p value = 8.1 × 10(-50)). Over 450 SNPs within a broad 61 Mb interval around rs200573126 showed nominal evidence of linkage (LOD > 3) but only four other SNPs in this region were associated with p values < 1.0 × 10(-4). When G45R was accounted for, the maximum LOD score across the interval dropped to 4.39 and the best p value was 1.1 × 10(-5). Linked and/or associated variants ranged in frequency (0.0018-0.50) and type (coding, non-coding) and had little detectable linkage disequilibrium with rs200573126 (r (2) < 0.20). In addition, the two-point linkage approach empirically outperformed multipoint microsatellite and multipoint SNP analysis. In the absence of data for rs200573126, family-based linkage analysis using a moderately dense SNP dataset, including both common and low-frequency variants, resulted in stronger evidence for an adiponectin locus than association data alone. Thus, linkage analysis can be a useful tool to facilitate identification of high-impact genetic variants.

  8. In-depth Investigation of Genetic Region Identifies Mechanism that Contributes to Cancer Risk

    Cancer.gov

    Investigators in the Laboratory of Translational Genomics have identified a genetic variant in a multi-cancer risk locus at chromosome 5p15.33 that explains, at least in part, the molecular mechanism through which this variant influences cancer risk.

  9. Clinical Applications of Molecular Genetic Discoveries

    PubMed Central

    Marian, A.J.

    2015-01-01

    Genome-wide association studies (GWAS) of complex traits have mapped more than 15,000 common single nucleotide variants (SNVs). Likewise, applications of massively parallel nucleic acid sequencing technologies often referred to as Next Generation Sequencing, to molecular genetic studies of complex traits have catalogued a large number of rare variants (population frequency of <0.01) in cases with complex traits. Moreover, high throughput nucleic acid sequencing, variant burden analysis, and linkage studies are illuminating the presence of large number of SNVs in cases and families with single gene disorders. The plethora of the genetic variants has exposed the formidable challenge of identifying the causal and pathogenic variants from the enormous number of innocuous common and rare variants that exist in the population as well as in an individual genome. The arduous task of identifying the causal and pathogenic variants is further compounded by the pleiotropic effects of the variants, complexity of cis and trans interactions in the genome, variability in phenotypic expression of the disease, as well as phenotypic plasticity, and the multifarious determinants of the phenotype. Population genetic studies offer the initial roadmaps and have the potential to elucidate novel pathways involved in the pathogenesis of the disease. However, the genome of an individual is unique, rendering unambiguous identification of the causal or pathogenic variant in a single individual exceedingly challenging. Yet, the focus of the practice of medicine is on the individual, as Sir William Osler elegantly expressed in his insightful quotation: “The good physician treats the disease; the great physician treats the patient who has the disease.” The daunting task facing physicians, patients, and researchers alike is to apply the modern genetic discoveries to care of the individual with or at risk of the disease. PMID:26548329

  10. Simple and efficient identification of rare recessive pathologically important sequence variants from next generation exome sequence data.

    PubMed

    Carr, Ian M; Morgan, Joanne; Watson, Christopher; Melnik, Svitlana; Diggle, Christine P; Logan, Clare V; Harrison, Sally M; Taylor, Graham R; Pena, Sergio D J; Markham, Alexander F; Alkuraya, Fowzan S; Black, Graeme C M; Ali, Manir; Bonthron, David T

    2013-07-01

    Massively parallel ("next generation") DNA sequencing (NGS) has quickly become the method of choice for seeking pathogenic mutations in rare uncharacterized monogenic diseases. Typically, before DNA sequencing, protein-coding regions are enriched from patient genomic DNA, representing either the entire genome ("exome sequencing") or selected mapped candidate loci. Sequence variants, identified as differences between the patient's and the human genome reference sequences, are then filtered according to various quality parameters. Changes are screened against datasets of known polymorphisms, such as dbSNP and the 1000 Genomes Project, in the effort to narrow the list of candidate causative variants. An increasing number of commercial services now offer to both generate and align NGS data to a reference genome. This potentially allows small groups with limited computing infrastructure and informatics skills to utilize this technology. However, the capability to effectively filter and assess sequence variants is still an important bottleneck in the identification of deleterious sequence variants in both research and diagnostic settings. We have developed an approach to this problem comprising a user-friendly suite of programs that can interactively analyze, filter and screen data from enrichment-capture NGS data. These programs ("Agile Suite") are particularly suitable for small-scale gene discovery or for diagnostic analysis. © 2013 WILEY PERIODICALS, INC.

  11. The Loss and Gain of Functional Amino Acid Residues Is a Common Mechanism Causing Human Inherited Disease

    PubMed Central

    Lugo-Martinez, Jose; Pejaver, Vikas; Pagel, Kymberleigh A.; Mort, Matthew; Cooper, David N.; Mooney, Sean D.; Radivojac, Predrag

    2016-01-01

    Elucidating the precise molecular events altered by disease-causing genetic variants represents a major challenge in translational bioinformatics. To this end, many studies have investigated the structural and functional impact of amino acid substitutions. Most of these studies were however limited in scope to either individual molecular functions or were concerned with functional effects (e.g. deleterious vs. neutral) without specifically considering possible molecular alterations. The recent growth of structural, molecular and genetic data presents an opportunity for more comprehensive studies to consider the structural environment of a residue of interest, to hypothesize specific molecular effects of sequence variants and to statistically associate these effects with genetic disease. In this study, we analyzed data sets of disease-causing and putatively neutral human variants mapped to protein 3D structures as part of a systematic study of the loss and gain of various types of functional attribute potentially underlying pathogenic molecular alterations. We first propose a formal model to assess probabilistically function-impacting variants. We then develop an array of structure-based functional residue predictors, evaluate their performance, and use them to quantify the impact of disease-causing amino acid substitutions on catalytic activity, metal binding, macromolecular binding, ligand binding, allosteric regulation and post-translational modifications. We show that our methodology generates actionable biological hypotheses for up to 41% of disease-causing genetic variants mapped to protein structures suggesting that it can be reliably used to guide experimental validation. Our results suggest that a significant fraction of disease-causing human variants mapping to protein structures are function-altering both in the presence and absence of stability disruption. PMID:27564311

  12. The Loss and Gain of Functional Amino Acid Residues Is a Common Mechanism Causing Human Inherited Disease.

    PubMed

    Lugo-Martinez, Jose; Pejaver, Vikas; Pagel, Kymberleigh A; Jain, Shantanu; Mort, Matthew; Cooper, David N; Mooney, Sean D; Radivojac, Predrag

    2016-08-01

    Elucidating the precise molecular events altered by disease-causing genetic variants represents a major challenge in translational bioinformatics. To this end, many studies have investigated the structural and functional impact of amino acid substitutions. Most of these studies were however limited in scope to either individual molecular functions or were concerned with functional effects (e.g. deleterious vs. neutral) without specifically considering possible molecular alterations. The recent growth of structural, molecular and genetic data presents an opportunity for more comprehensive studies to consider the structural environment of a residue of interest, to hypothesize specific molecular effects of sequence variants and to statistically associate these effects with genetic disease. In this study, we analyzed data sets of disease-causing and putatively neutral human variants mapped to protein 3D structures as part of a systematic study of the loss and gain of various types of functional attribute potentially underlying pathogenic molecular alterations. We first propose a formal model to assess probabilistically function-impacting variants. We then develop an array of structure-based functional residue predictors, evaluate their performance, and use them to quantify the impact of disease-causing amino acid substitutions on catalytic activity, metal binding, macromolecular binding, ligand binding, allosteric regulation and post-translational modifications. We show that our methodology generates actionable biological hypotheses for up to 41% of disease-causing genetic variants mapped to protein structures suggesting that it can be reliably used to guide experimental validation. Our results suggest that a significant fraction of disease-causing human variants mapping to protein structures are function-altering both in the presence and absence of stability disruption.

  13. Population-based structural variation discovery with Hydra-Multi.

    PubMed

    Lindberg, Michael R; Hall, Ira M; Quinlan, Aaron R

    2015-04-15

    Current strategies for SNP and INDEL discovery incorporate sequence alignments from multiple individuals to maximize sensitivity and specificity. It is widely accepted that this approach also improves structural variant (SV) detection. However, multisample SV analysis has been stymied by the fundamental difficulties of SV calling, e.g. library insert size variability, SV alignment signal integration and detecting long-range genomic rearrangements involving disjoint loci. Extant tools suffer from poor scalability, which limits the number of genomes that can be co-analyzed and complicates analysis workflows. We have developed an approach that enables multisample SV analysis in hundreds to thousands of human genomes using commodity hardware. Here, we describe Hydra-Multi and measure its accuracy, speed and scalability using publicly available datasets provided by The 1000 Genomes Project and by The Cancer Genome Atlas (TCGA). Hydra-Multi is written in C++ and is freely available at https://github.com/arq5x/Hydra. aaronquinlan@gmail.com or ihall@genome.wustl.edu Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.

  14. The multi-level perspective analysis: Indonesia geothermal energy transition study

    NASA Astrophysics Data System (ADS)

    Wisaksono, A.; Murphy, J.; Sharp, J. H.; Younger, P. L.

    2018-01-01

    The study adopts a multi-level perspective in technology transition to analyse how the transition process in the development of geothermal energy in Indonesia is able to compete against the incumbent fossil-fuelled energy sources. Three levels of multi-level perspective are socio-technical landscape (ST-landscape), socio-technical regime (ST-regime) and niche innovations in Indonesia geothermal development. The identification, mapping and analysis of the dynamic relationship between each level are the important pillars of the multi-level perspective framework. The analysis considers the set of rules, actors and controversies that may arise in the technological transition process. The identified geothermal resource risks are the basis of the emerging geothermal technological innovations in Indonesian geothermal. The analysis of this study reveals the transition pathway, which yields a forecast for the Indonesian geothermal technology transition in the form of scenarios and probable impacts.

  15. Genetic Analyses of the NF1 Gene in Turkish Neurofibromatosis Type I Patients and Definition of three Novel Variants

    PubMed Central

    Ulusal, SD; Gürkan, H; Atlı, E; Özal, SA; Çiftdemir, M; Tozkır, H; Karal, Y; Güçlü, H; Eker, D; Görker, I

    2017-01-01

    Abstract Neurofibromatosis Type I (NF1) is a multi systemic autosomal dominant neurocutaneous disorder predisposing patients to have benign and/or malignant lesions predominantly of the skin, nervous system and bone. Loss of function mutations or deletions of the NF1 gene is responsible for NF1 disease. Involvement of various pathogenic variants, the size of the gene and presence of pseudogenes makes it difficult to analyze. We aimed to report the results of 2 years of multiplex ligation-dependent probe amplification (MLPA) and next generation sequencing (NGS) for genetic diagnosis of NF1 applied at our genetic diagnosis center. The MLPA, semiconductor sequencing and Sanger sequencing were performed in genomic DNA samples from 24 unrelated patients and their affected family members referred to our center suspected of having NF1. In total, three novel and 12 known pathogenic variants and a whole gene deletion were determined. We suggest that next generation sequencing is a practical tool for genetic analysis of NF1. Deletion/duplication analysis with MLPA may also be helpful for patients clinically diagnosed to carry NF1 but do not have a detectable mutation in NGS. PMID:28924536

  16. Polymorphisms of large effect explain the majority of the host genetic contribution to variation of HIV-1 virus load

    PubMed Central

    Coulonges, Cedric; Bartha, István; Lenz, Tobias L.; Deutsch, Aaron J.; Bashirova, Arman; Buchbinder, Susan; Carrington, Mary N.; Cossarizza, Andrea; Dalmau, Judith; De Luca, Andrea; Goedert, James J.; Gurdasani, Deepti; Haas, David W.; Herbeck, Joshua T.; Johnson, Eric O.; Kirk, Gregory D.; Lambotte, Olivier; Luo, Ma; Mallal, Simon; van Manen, Daniëlle; Martinez-Picado, Javier; Meyer, Laurence; Miro, José M.; Mullins, James I.; Obel, Niels; Poli, Guido; Sandhu, Manjinder S.; Schuitemaker, Hanneke; Shea, Patrick R.; Theodorou, Ioannis; Walker, Bruce D.; Weintrob, Amy C.; Winkler, Cheryl A.; Wolinsky, Steven M.; Raychaudhuri, Soumya; Goldstein, David B.; Telenti, Amalio; de Bakker, Paul I. W.; Zagury, Jean-François; Fellay, Jacques

    2015-01-01

    Previous genome-wide association studies (GWAS) of HIV-1–infected populations have been underpowered to detect common variants with moderate impact on disease outcome and have not assessed the phenotypic variance explained by genome-wide additive effects. By combining the majority of available genome-wide genotyping data in HIV-infected populations, we tested for association between ∼8 million variants and viral load (HIV RNA copies per milliliter of plasma) in 6,315 individuals of European ancestry. The strongest signal of association was observed in the HLA class I region that was fully explained by independent effects mapping to five variable amino acid positions in the peptide binding grooves of the HLA-B and HLA-A proteins. We observed a second genome-wide significant association signal in the chemokine (C-C motif) receptor (CCR) gene cluster on chromosome 3. Conditional analysis showed that this signal could not be fully attributed to the known protective CCR5Δ32 allele and the risk P1 haplotype, suggesting further causal variants in this region. Heritability analysis demonstrated that common human genetic variation—mostly in the HLA and CCR5 regions—explains 25% of the variability in viral load. This study suggests that analyses in non-European populations and of variant classes not assessed by GWAS should be priorities for the field going forward. PMID:26553974

  17. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ruggles, Kelly V.; Tang, Zuojian; Wang, Xuya

    Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations and splice variants identified in cancer cells are translated. Herein we therefore describe a proteogenomic data integration tool (QUILTS) and illustrate its application to whole genome, transcriptome and global MS peptide sequence datasets generated from a pair of luminal and basal-like breast cancer patient derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS process replicates. Despite over thirty sample replicates, only about 10% of all SNV (somatic andmore » germline) were detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNV without a detectable mRNA transcript were also observed demonstrating the transcriptome coverage was also incomplete (~80%). In contrast to germ-line variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than the luminal tumor raising the possibility of differential translation or protein degradation effects. In conclusion, the QUILTS program integrates DNA, RNA and peptide sequencing to assess the degree to which somatic mutations are translated and therefore biologically active. By identifying gaps in sequence coverage QUILTS benchmarks current technology and assesses progress towards whole cancer proteome and transcriptome analysis.« less

  18. Common Variants in Mendelian Kidney Disease Genes and Their Association with Renal Function

    PubMed Central

    Fuchsberger, Christian; Köttgen, Anna; O’Seaghdha, Conall M.; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I.; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J.; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V.; O’Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H.-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M.; Borecki, Ingrid; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M.; Bochud, Murielle; Heid, Iris M.; Siscovick, David S.; Fox, Caroline S.; Kao, W. Linda; Böger, Carsten A.

    2013-01-01

    Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research. PMID:24029420

  19. Chosen aspects of multi-criteria analysis applied to support the choice of materials for building structures

    NASA Astrophysics Data System (ADS)

    Szafranko, E.

    2017-08-01

    When planning a building structure, dilemmas arise as to what construction and material solutions are feasible. The decisions are not always obvious. A procedure for selecting the variant that will best satisfy the expectations of the investor and future users of a structure must be founded on mathematical methods. The following deserve special attention: the MCE methods, Hierarchical Analysis Methods and Weighting Methods. Another interesting solution, particularly useful when dealing with evaluations which take into account negative values, is the Indicator Method. MCE methods are relatively popular owing to the simplicity of the calculations and ease of the interpretation of the results. Having prepared the input data properly, they enable the user to compare them on the same level. In a situation where an analysis involves a large number of data, it is more convenient to divide them into groups according to main criteria and subcriteria. This option is provided by hierarchical analysis methods. They are based on ordered sets of criteria, which are evaluated in groups. In some cases, this approach yields the results that are superior and easier to read. If an analysis encompasses direct and indirect effects, an Indicator Method seems to be a justified choice for selecting the right solution. The Indicator Method is different in character and relies on weights and assessments of effects. It allows the user to evaluate effectively the analyzed variants. This article explains the methodology of conducting a multi-criteria analysis, showing its advantages and disadvantages. An example of calculations contained in the article shows what problems can be encountered when making an assessment of various solutions regarding building materials and structures. For comparison, an analysis based on graphical methods developed by the author was presented.

  20. Identification of new susceptibility loci for type 2 diabetes and shared etiological pathways with coronary heart disease

    PubMed Central

    Zhao, Wei; Rasheed, Asif; Tikkanen, Emmi; Lee, Jung-Jin; Butterworth, Adam S; Howson, Joanna MM; Assimes, Themistocles L; Chowdhury, Rajiv; Orho-Melander, Marju; Damrauer, Scott; Small, Aeron; Asma, Senay; Imamura, Minako; Yamauch, Toshimasa; Chambers, John C; Chen, Peng; Sapkota, Bishwa R; Shah, Nabi; Jabeen, Sehrish; Surendran, Praveen; Lu, Yingchang; Zhang, Weihua; Imran, Atif; Abbas, Shahid; Majeed, Faisal; Trindade, Kevin; Qamar, Nadeem; Mallick, Nadeem Hayyat; Yaqoob, Zia; Saghir, Tahir; Rizvi, Syed Nadeem Hasan; Memon, Anis; Rasheed, Syed Zahed; Memon, Fazal-ur-Rehman; Mehmood, Khalid; Ahmed, Naveeduddin; Qureshi, Irshad Hussain; Tanveer-us-Salam; Iqbal, Wasim; Malik, Uzma; Mehra, Narinder; Kuo, Jane Z; Sheu, Wayne H-H; Guo, Xiuqing; Hsiung, Chao A; Juang, Jyh-Ming J; Taylor, Kent D; Hung, Yi-Jen; Lee, Wen-Jane; Quertermous, Thomas; Lee, I-Te; Hsu, Chih-Cheng; Bottinger, Erwin P.; Ralhan, Sarju; Teo, Yik Ying; Wang, Tzung-Dau; Alam, Dewan S; Di Angelantonio, Emanuele; Epstein, Steve; Nielsen, Sune F; Nordestgaard, Børge G; Tybjaerg-Hansen, Anne; Young, Robin; Benn, Marianne; Frikke-Schmidt, Ruth; Kamstrup, Pia R; Biobank, Michigan; Jukema, J Wouter; Sattar, Naveed; Smit, Roelof; Chung, Ren-Hua; Liang, Kae-Woei; Anand, Sonia; Sanghera, Dharambir K; Ripatti, Samuli; Loos, Ruth J.F.; Kooner, Jaspal S; Tai, E Shyong; Rotter, Jerome I; Chen, Yii-Der Ida; Frossard, Philippe; Maeda, Shiro; Kadowaki, Takashi; Reilly, Muredach; Pare, Guillaume; Melander, Olle; Salomaa, Veikko; Rader, Daniel J; Danesh, John; Voight, Benjamin F; Saleheen, Danish

    2018-01-01

    To evaluate the shared genetic etiology of type-2 diabetes (T2D) and coronary heart disease (CHD), we conducted a multi-ethnic study of genetic variation genome-wide for both diseases in up to 265,678 subjects for T2D and 260,365 subjects for CHD. We identify 16 previously unreported loci for T2D and one for CHD, including a novel T2D association at a missense variant in HLA-DRB5 (OR=1.29). We show that genetically mediated increase in T2D risk also confers higher CHD risk. Joint analysis of T2D loci demonstrated that 24% are associated with CHD, highlighting eight variants - two of which are coding - where T2D and CHD associations appear to co-localize, and a novel joint T2D/CHD association which also replicated for T2D. Variants associated with both outcomes implicate several novel pathways including cellular proliferation and cardiovascular development. PMID:28869590

  1. Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency.

    PubMed

    Breitfeld, Jana; Martens, Susanne; Klammt, Jürgen; Schlicke, Marina; Pfäffle, Roland; Krause, Kerstin; Weidle, Kerstin; Schleinitz, Dorit; Stumvoll, Michael; Führer, Dagmar; Kovacs, Peter; Tönjes, Anke

    2013-12-01

    The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD.

  2. Genetic analyses of bone morphogenetic protein 2, 4 and 7 in congenital combined pituitary hormone deficiency

    PubMed Central

    2013-01-01

    Background The complex process of development of the pituitary gland is regulated by a number of signalling molecules and transcription factors. Mutations in these factors have been identified in rare cases of congenital hypopituitarism but for most subjects with combined pituitary hormone deficiency (CPHD) genetic causes are unknown. Bone morphogenetic proteins (BMPs) affect induction and growth of the pituitary primordium and thus represent plausible candidates for mutational screening of patients with CPHD. Methods We sequenced BMP2, 4 and 7 in 19 subjects with CPHD. For validation purposes, novel genetic variants were genotyped in 1046 healthy subjects. Additionally, potential functional relevance for most promising variants has been assessed by phylogenetic analyses and prediction of effects on protein structure. Results Sequencing revealed two novel variants and confirmed 30 previously known polymorphisms and mutations in BMP2, 4 and 7. Although phylogenetic analyses indicated that these variants map within strongly conserved gene regions, there was no direct support for their impact on protein structure when applying predictive bioinformatics tools. Conclusions A mutation in the BMP4 coding region resulting in an amino acid exchange (p.Arg300Pro) appeared most interesting among the identified variants. Further functional analyses are required to ultimately map the relevance of these novel variants in CPHD. PMID:24289245

  3. A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing.

    PubMed

    Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv

    2018-01-01

    Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.

  4. New forest vegetation maps facilitate assessment of biodiversity indicators over large, multi-ownership regions.

    Treesearch

    Janet L. Ohmann

    2003-01-01

    Natural resource policy analysis and conservation planning are best served by broad-scale information about vegetation that is detailed, spatially complete, and consistent across land ownerships and allocations. In this paper I describe how a new generation of forest vegetation maps can be used to assess the distribution of vegetation biodiversity among land ownerships...

  5. Mapping healthcare systems: a policy relevant analytic tool

    PubMed Central

    Sekhri Feachem, Neelam; Afshar, Ariana; Pruett, Cristina; Avanceña, Anton L.V.

    2017-01-01

    Abstract Background In the past decade, an international consensus on the value of well-functioning systems has driven considerable health systems research. This research falls into two broad categories. The first provides conceptual frameworks that take complex healthcare systems and create simplified constructs of interactions and functions. The second focuses on granular inputs and outputs. This paper presents a novel translational mapping tool – the University of California, San Francisco mapping tool (the Tool) - which bridges the gap between these two areas of research, creating a platform for multi-country comparative analysis. Methods Using the Murray-Frenk framework, we create a macro-level representation of a country's structure, focusing on how it finances and delivers healthcare. The map visually depicts the fundamental policy questions in healthcare system design: funding sources and amount spent through each source, purchasers, populations covered, provider categories; and the relationship between these entities. Results We use the Tool to provide a macro-level comparative analysis of the structure of India's and Thailand's healthcare systems. Conclusions As part of the systems strengthening arsenal, the Tool can stimulate debate about the merits and consequences of different healthcare systems structural designs, using a common framework that fosters multi-country comparative analyses. PMID:28541518

  6. A generalized parametric response mapping method for analysis of multi-parametric imaging: A feasibility study with application to glioblastoma.

    PubMed

    Lausch, Anthony; Yeung, Timothy Pok-Chi; Chen, Jeff; Law, Elton; Wang, Yong; Urbini, Benedetta; Donelli, Filippo; Manco, Luigi; Fainardi, Enrico; Lee, Ting-Yim; Wong, Eugene

    2017-11-01

    Parametric response map (PRM) analysis of functional imaging has been shown to be an effective tool for early prediction of cancer treatment outcomes and may also be well-suited toward guiding personalized adaptive radiotherapy (RT) strategies such as sub-volume boosting. However, the PRM method was primarily designed for analysis of longitudinally acquired pairs of single-parameter image data. The purpose of this study was to demonstrate the feasibility of a generalized parametric response map analysis framework, which enables analysis of multi-parametric data while maintaining the key advantages of the original PRM method. MRI-derived apparent diffusion coefficient (ADC) and relative cerebral blood volume (rCBV) maps acquired at 1 and 3-months post-RT for 19 patients with high-grade glioma were used to demonstrate the algorithm. Images were first co-registered and then standardized using normal tissue image intensity values. Tumor voxels were then plotted in a four-dimensional Cartesian space with coordinate values equal to a voxel's image intensity in each of the image volumes and an origin defined as the multi-parametric mean of normal tissue image intensity values. Voxel positions were orthogonally projected onto a line defined by the origin and a pre-determined response vector. The voxels are subsequently classified as positive, negative or nil, according to whether projected positions along the response vector exceeded a threshold distance from the origin. The response vector was selected by identifying the direction in which the standard deviation of tumor image intensity values was maximally different between responding and non-responding patients within a training dataset. Voxel classifications were visualized via familiar three-class response maps and then the fraction of tumor voxels associated with each of the classes was investigated for predictive utility analogous to the original PRM method. Independent PRM and MPRM analyses of the contrast-enhancing lesion (CEL) and a 1 cm shell of surrounding peri-tumoral tissue were performed. Prediction using tumor volume metrics was also investigated. Leave-one-out cross validation (LOOCV) was used in combination with permutation testing to assess preliminary predictive efficacy and estimate statistically robust P-values. The predictive endpoint was overall survival (OS) greater than or equal to the median OS of 18.2 months. Single-parameter PRM and multi-parametric response maps (MPRMs) were generated for each patient and used to predict OS via the LOOCV. Tumor volume metrics (P ≥ 0.071 ± 0.01) and single-parameter PRM analyses (P ≥ 0.170 ± 0.01) were not found to be predictive of OS within this study. MPRM analysis of the peri-tumoral region but not the CEL was found to be predictive of OS with a classification sensitivity, specificity and accuracy of 80%, 100%, and 89%, respectively (P = 0.001 ± 0.01). The feasibility of a generalized MPRM analysis framework was demonstrated with improved prediction of overall survival compared to the original single-parameter method when applied to a glioblastoma dataset. The proposed algorithm takes the spatial heterogeneity in multi-parametric response into consideration and enables visualization. MPRM analysis of peri-tumoral regions was shown to have predictive potential supporting further investigation of a larger glioblastoma dataset. © 2017 American Association of Physicists in Medicine.

  7. Fine mapping of Xq28: both MECP2 and IRAK1 contribute to risk for systemic lupus erythematosus in multiple ancestral groups.

    PubMed

    Kaufman, Kenneth M; Zhao, Jian; Kelly, Jennifer A; Hughes, Travis; Adler, Adam; Sanchez, Elena; Ojwang, Joshua O; Langefeld, Carl D; Ziegler, Julie T; Williams, Adrienne H; Comeau, Mary E; Marion, Miranda C; Glenn, Stuart B; Cantor, Rita M; Grossman, Jennifer M; Hahn, Bevra H; Song, Yeong Wook; Yu, Chack-Yung; James, Judith A; Guthridge, Joel M; Brown, Elizabeth E; Alarcón, Graciela S; Kimberly, Robert P; Edberg, Jeffrey C; Ramsey-Goldman, Rosalind; Petri, Michelle A; Reveille, John D; Vilá, Luis M; Anaya, Juan-Manuel; Boackle, Susan A; Stevens, Anne M; Freedman, Barry I; Criswell, Lindsey A; Pons Estel, Bernardo A; Lee, Joo-Hyun; Lee, Ji-Seon; Chang, Deh-Ming; Scofield, R Hal A; Gilkeson, Gary S; Merrill, Joan T; Niewold, Timothy B; Vyse, Timothy James; Bae, Sang-Cheol; Alarcón-Riquelme, Marta E; Jacob, Chaim O; Moser Sivils, Kathy; Gaffney, Patrick M; Harley, John B; Sawalha, Amr H; Tsao, Betty P

    2013-03-01

    The Xq28 region containing IRAK1 and MECP2 has been identified as a risk locus for systemic lupus erythematosus (SLE) in previous genetic association studies. However, due to the strong linkage disequilibrium between IRAK1 and MECP2, it remains unclear which gene is affected by the underlying causal variant(s) conferring risk of SLE. We fine-mapped ≥136 SNPs in a ∼227 kb region on Xq28, containing IRAK1, MECP2 and seven adjacent genes (L1CAM, AVPR2, ARHGAP4, NAA10, RENBP, HCFC1 and TMEM187), for association with SLE in 15 783 case-control subjects derived from four different ancestral groups. Multiple SNPs showed strong association with SLE in European Americans, Asians and Hispanics at p<5×10(-8) with consistent association in subjects with African ancestry. Of these, six SNPs located in the TMEM187-IRAK1-MECP2 region captured the underlying causal variant(s) residing in a common risk haplotype shared by all four ancestral groups. Among them, rs1059702 best explained the Xq28 association signals in conditional testings and exhibited the strongest p value in transancestral meta-analysis (p(meta )= 1.3×10(-27), OR=1.43), and thus was considered to be the most likely causal variant. The risk allele of rs1059702 results in the amino acid substitution S196F in IRAK1 and had previously been shown to increase NF-κB activity in vitro. We also found that the homozygous risk genotype of rs1059702 was associated with lower mRNA levels of MECP2, but not IRAK1, in SLE patients (p=0.0012) and healthy controls (p=0.0064). These data suggest contributions of both IRAK1 and MECP2 to SLE susceptibility.

  8. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus

    PubMed Central

    Carbonell-Caballero, Jose; Alonso, Roberto; Ibañez, Victoria; Terol, Javier; Talon, Manuel; Dopazo, Joaquin

    2015-01-01

    Citrus genus includes some of the most important cultivated fruit trees worldwide. Despite being extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of its domestication still remain an open question. Here, we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model was used to estimate divergence times between the major citrus groups. Additionally, a complete map of the variability across the genome of different citrus species was produced, including single nucleotide variants, heteroplasmic positions, indels (insertions and deletions), and large structural variants. The distribution of all these variants provided further independent support to the phylogeny obtained. An unexpected finding was the high level of heteroplasmy found in several of the analyzed genomes. The use of the complete chloroplast DNA not only paves the way for a better understanding of the phylogenetic relationships within the Citrus genus but also provides original insights into other elusive evolutionary processes, such as chloroplast inheritance, heteroplasmy, and gene selection. PMID:25873589

  9. Multi-component intrinsic brain activities as a safe, alternative to cortical stimulation for sensori-motor mapping in neurosurgery.

    PubMed

    Neshige, Shuichiro; Matsuhashi, Masao; Kobayashi, Katsuya; Sakurai, Takeyo; Shimotake, Akihiro; Hitomi, Takefumi; Kikuchi, Takayuki; Yoshida, Kazumichi; Kunieda, Takeharu; Matsumoto, Riki; Takahashi, Ryosuke; Miyamoto, Susumu; Maruyama, Hirofumi; Matsumoto, Masayasu; Ikeda, Akio

    2018-06-18

    To assess the feasibility of multi-component electrocorticography (ECoG)-based mapping using "wide-spectrum, intrinsic-brain activities" for identifying the primary sensori-motor area (S1-M1) by comparing that using electrical cortical stimulation (ECS). We evaluated 14 epilepsy patients with 1514 subdural electrodes implantation covering the perirolandic cortices at Kyoto University Hospital between 2011 and 2016. We performed multi-component, ECoG-based mapping (band-pass filter, 0.016-300/600 Hz) involving combined analyses of the single components: movement-related cortical potential (<0.5-1 Hz), event-related synchronization (76-200 Hz), and event-related de-synchronization (8-24 Hz) to identify the S1-M1. The feasibility of multi-component mapping was assessed through comparisons with single-component mapping and ECS. Among 54 functional areas evaluation, ECoG-based maps showed significantly higher rate of localization concordances with ECS maps when the three single-component maps were consistent than when those were inconsistent with each other (p < 0.001 in motor, and p = 0.02 in sensory mappings). Multi-component mapping revealed high sensitivity (89-90%) and specificity (94-97%) as compared with ECS. Wide-spectrum, multi-component ECoG-based mapping is feasible, having high sensitivity/specificity relative to ECS. This safe (non-stimulus) mapping strategy, alternative to ECS, would allow clinicians to rule in/out the possibility of brain function prior to resection surgery. Copyright © 2018 International Federation of Clinical Neurophysiology. Published by Elsevier B.V. All rights reserved.

  10. The PHF21B gene is associated with major depression and modulates the stress response.

    PubMed

    Wong, M-L; Arcos-Burgos, M; Liu, S; Vélez, J I; Yu, C; Baune, B T; Jawahar, M C; Arolt, V; Dannlowski, U; Chuah, A; Huttley, G A; Fogarty, R; Lewis, M D; Bornstein, S R; Licinio, J

    2017-07-01

    Major depressive disorder (MDD) affects around 350 million people worldwide; however, the underlying genetic basis remains largely unknown. In this study, we took into account that MDD is a gene-environment disorder, in which stress is a critical component, and used whole-genome screening of functional variants to investigate the 'missing heritability' in MDD. Genome-wide association studies (GWAS) using single- and multi-locus linear mixed-effect models were performed in a Los Angeles Mexican-American cohort (196 controls, 203 MDD) and in a replication European-ancestry cohort (499 controls, 473 MDD). Our analyses took into consideration the stress levels in the control populations. The Mexican-American controls, comprised primarily of recent immigrants, had high levels of stress due to acculturation issues and the European-ancestry controls with high stress levels were given higher weights in our analysis. We identified 44 common and rare functional variants associated with mild to moderate MDD in the Mexican-American cohort (genome-wide false discovery rate, FDR, <0.05), and their pathway analysis revealed that the three top overrepresented Gene Ontology (GO) processes were innate immune response, glutamate receptor signaling and detection of chemical stimulus in smell sensory perception. Rare variant analysis replicated the association of the PHF21B gene in the ethnically unrelated European-ancestry cohort. The TRPM2 gene, previously implicated in mood disorders, may also be considered replicated by our analyses. Whole-genome sequencing analyses of a subset of the cohorts revealed that European-ancestry individuals have a significantly reduced (50%) number of single nucleotide variants compared with Mexican-American individuals, and for this reason the role of rare variants may vary across populations. PHF21b variants contribute significantly to differences in the levels of expression of this gene in several brain areas, including the hippocampus. Furthermore, using an animal model of stress, we found that Phf21b hippocampal gene expression is significantly decreased in animals resilient to chronic restraint stress when compared with non-chronically stressed animals. Together, our results reveal that including stress level data enables the identification of novel rare functional variants associated with MDD.

  11. An extensive analysis of the hereditary hemochromatosis gene HFE and neighboring histone genes: associations with childhood leukemia.

    PubMed

    Davis, Charronne F; Dorak, M Tevfik

    2010-04-01

    The most common mutation of the HFE gene C282Y has shown a risk association with childhood acute lymphoblastic leukemia (ALL) in Welsh and Scottish case-control studies. This finding has not been replicated outside Britain. Here, we present a thorough analysis of the HFE gene in a panel of HLA homozygous reference cell lines and in the original population sample from South Wales (117 childhood ALL cases and 414 newborn controls). The 21 of 24 variants analyzed were from the HFE gene region extending 52 kb from the histone gene HIST1H1C to HIST1H1T. We identified the single-nucleotide polymorphism (SNP) rs807212 as a tagging SNP for the most common HFE region haplotype, which contains wild-type alleles of all HFE variants examined. This intergenic SNP rs807212 yielded a strong male-specific protective association (per allele OR = 0.38, 95% CI = 0.22-0.64, P (trend) = 0.0002; P = 0.48 in females), which accounted for the original C282Y risk association. In the HapMap project data, rs807212 was in strong linkage disequilibrium with 25 other SNPs spanning 151 kb around HFE. Minor alleles of these 26 SNPs characterized the most common haplotype for the HFE region, which lacked all disease-associated HFE variants. The HapMap data suggested positive selection in this region even in populations where the HFE C282Y mutation is absent. These results have implications for the sex-specific associations observed in this region and suggest the inclusion of rs807212 in future studies of the HFE gene and the extended HLA class I region.

  12. Merging of multi-string BWTs with applications

    PubMed Central

    Holt, James; McMillan, Leonard

    2014-01-01

    Motivation: The throughput of genomic sequencing has increased to the point that is overrunning the rate of downstream analysis. This, along with the desire to revisit old data, has led to a situation where large quantities of raw, and nearly impenetrable, sequence data are rapidly filling the hard drives of modern biology labs. These datasets can be compressed via a multi-string variant of the Burrows–Wheeler Transform (BWT), which provides the side benefit of searches for arbitrary k-mers within the raw data as well as the ability to reconstitute arbitrary reads as needed. We propose a method for merging such datasets for both increased compression and downstream analysis. Results: We present a novel algorithm that merges multi-string BWTs in O(LCS×N) time where LCS is the length of their longest common substring between any of the inputs, and N is the total length of all inputs combined (number of symbols) using O(N×log2(F)) bits where F is the number of multi-string BWTs merged. This merged multi-string BWT is also shown to have a higher compressibility compared with the input multi-string BWTs separately. Additionally, we explore some uses of a merged multi-string BWT for bioinformatics applications. Availability and implementation: The MSBWT package is available through PyPI with source code located at https://code.google.com/p/msbwt/. Contact: holtjma@cs.unc.edu PMID:25172922

  13. Hemoglobin Brigham (α2Aβ2100 Pro→Leu). HEMOGLOBIN VARIANT ASSOCIATED WITH FAMILIAL ERYTHROCYTOSIS

    PubMed Central

    Lokich, Jacob J.; Moloney, William C.; Bunn, H. Franklin; Bruckheimer, Sally M.; Ranney, Helen M.

    1973-01-01

    Erythrocytosis associated with the presence of a hemoglobin with increased oxygen affinity has been reported for 10 hemoglobin variants, most of which demonstrate altered electrophoretic mobility. Several members of a family were found to have erythrocytosis, and both the whole blood and the hemoglobin exhibited increased oxygen affinity. Phosphate-free hemoglobin solutions had a normal Bohr effect and reactivity to 2,3-diphosphoglycerate. The electrophoretic properties of the hemoglobin were normal, but on peptide mapping of a tryptic digest of the isolated β-chains, a normal βT11 peptide and an abnormal βT11 with greater Rf were seen. Analysis of the abnormal peptide showed the substitution of leucine for the normal proline at β100 (helical residue G2). The hemoglobin variant, designated Hb Brigham, serves to emphasize the necessity for detailed evaluation of the structure and function of hemoglobin in familial erythrocytosis even with electrophoretically “normal” hemoglobin. PMID:4719677

  14. Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis

    PubMed Central

    van Rheenen, Wouter; Shatunov, Aleksey; Dekker, Annelot M; McLaughlin, Russell L; Diekstra, Frank P; Pulit, Sara L; van der Spek, Rick A A; Võsa, Urmo; de Jong, Simone; Robinson, Matthew R; Yang, Jian; Fogh, Isabella; van Doormaal, Perry TC; Tazelaar, Gijs H P; Koppers, Max; Blokhuis, Anna M; Sproviero, William; Jones, Ashley R; Kenna, Kevin P; van Eijk, Kristel R; Harschnitz, Oliver; Schellevis, Raymond D; Brands, William J; Medic, Jelena; Menelaou, Androniki; Vajda, Alice; Ticozzi, Nicola; Lin, Kuang; Rogelj, Boris; Vrabec, Katarina; Ravnik-Glavač, Metka; Koritnik, Blaž; Zidar, Janez; Leonardis, Lea; Grošelj, Leja Dolenc; Millecamps, Stéphanie; Salachas, François; Meininger, Vincent; de Carvalho, Mamede; Pinto, Susana; Mora, Jesus S; Rojas-García, Ricardo; Polak, Meraida; Chandran, Siddharthan; Colville, Shuna; Swingler, Robert; Morrison, Karen E; Shaw, Pamela J; Hardy, John; Orrell, Richard W; Pittman, Alan; Sidle, Katie; Fratta, Pietro; Malaspina, Andrea; Topp, Simon; Petri, Susanne; Abdulla, Susanne; Drepper, Carsten; Sendtner, Michael; Meyer, Thomas; Ophoff, Roel A; Staats, Kim A; Wiedau-Pazos, Martina; Lomen-Hoerth, Catherine; Van Deerlin, Vivianna M; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Basak, A Nazli; Tunca, Ceren; Hamzeiy, Hamid; Parman, Yesim; Meitinger, Thomas; Lichtner, Peter; Radivojkov-Blagojevic, Milena; Andres, Christian R; Maurel, Cindy; Bensimon, Gilbert; Landwehrmeyer, Bernhard; Brice, Alexis; Payan, Christine A M; Saker-Delye, Safaa; Dürr, Alexandra; Wood, Nicholas W; Tittmann, Lukas; Lieb, Wolfgang; Franke, Andre; Rietschel, Marcella; Cichon, Sven; Nöthen, Markus M; Amouyel, Philippe; Tzourio, Christophe; Dartigues, Jean-François; Uitterlinden, Andre G; Rivadeneira, Fernando; Estrada, Karol; Hofman, Albert; Curtis, Charles; Blauw, Hylke M; van der Kooi, Anneke J; de Visser, Marianne; Goris, An; Weber, Markus; Shaw, Christopher E; Smith, Bradley N; Pansarasa, Orietta; Cereda, Cristina; Bo, Roberto Del; Comi, Giacomo P; D’Alfonso, Sandra; Bertolin, Cinzia; Sorarù, Gianni; Mazzini, Letizia; Pensato, Viviana; Gellera, Cinzia; Tiloca, Cinzia; Ratti, Antonia; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Arcuti, Simona; Capozzo, Rosa; Zecca, Chiara; Lunetta, Christian; Penco, Silvana; Riva, Nilo; Padovani, Alessandro; Filosto, Massimiliano; Muller, Bernard; Stuit, Robbert Jan; Blair, Ian; Zhang, Katharine; McCann, Emily P; Fifita, Jennifer A; Nicholson, Garth A; Rowe, Dominic B; Pamphlett, Roger; Kiernan, Matthew C; Grosskreutz, Julian; Witte, Otto W; Ringer, Thomas; Prell, Tino; Stubendorff, Beatrice; Kurth, Ingo; Hübner, Christian A; Leigh, P Nigel; Casale, Federico; Chio, Adriano; Beghi, Ettore; Pupillo, Elisabetta; Tortelli, Rosanna; Logroscino, Giancarlo; Powell, John; Ludolph, Albert C; Weishaupt, Jochen H; Robberecht, Wim; Van Damme, Philip; Franke, Lude; Pers, Tune H; Brown, Robert H; Glass, Jonathan D; Landers, John E; Hardiman, Orla; Andersen, Peter M; Corcia, Philippe; Vourc’h, Patrick; Silani, Vincenzo; Wray, Naomi R; Visscher, Peter M; de Bakker, Paul I W; van Es, Michael A; Pasterkamp, R Jeroen; Lewis, Cathryn M; Breen, Gerome; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan H

    2017-01-01

    To elucidate the genetic architecture of amyotrophic lateral sclerosis (ALS) and find associated loci, we assembled a custom imputation reference panel from whole-genome-sequenced patients with ALS and matched controls (n = 1,861). Through imputation and mixed-model association analysis in 12,577 cases and 23,475 controls, combined with 2,579 cases and 2,767 controls in an independent replication cohort, we fine-mapped a new risk locus on chromosome 21 and identified C21orf2 as a gene associated with ALS risk. In addition, we identified MOBP and SCFD1 as new associated risk loci. We established evidence of ALS being a complex genetic trait with a polygenic architecture. Furthermore, we estimated the SNP-based heritability at 8.5%, with a distinct and important role for low-frequency variants (frequency 1–10%). This study motivates the interrogation of larger samples with full genome coverage to identify rare causal variants that underpin ALS risk. PMID:27455348

  15. Rex and a Suppressor of Rex Are Repeated Neomorphic Loci in the Drosophila Melanogaster Ribosomal DNA

    PubMed Central

    Rasooly, R. S.; Robbins, L. G.

    1991-01-01

    The Rex locus of Drosophila melanogaster induces a high frequency of mitotic exchange between two separated ribosomal DNA arrays on a single chromosome. The exchanges take place in the progeny of Rex mothers and occur very early, before the third mitotic division. A number of common laboratory stocks have also been found to carry dominant suppressors of Rex (Su(Rex)). Rex was mapped to the X centric heterochromatin, proximal to su(f), by genetic and molecular analysis of two spontaneous recombinants. Using deficiencies and duplications of the heterochromatin, both Rex and one Su(Rex) were shown to behave as neomorphs. Rex-induced exchange in a target chromosome bearing both Rex and Su(Rex) was then used to map these functions to the bb locus itself. Molecular analysis of the recombinants, using length variants of the ribosomal DNA intergenic spacer as genetic markers, mapped Su(Rex) and Rex within the bb locus and demonstrated that both are repeated elements. PMID:1936953

  16. PXK locus in systemic lupus erythematosus: fine mapping and functional analysis reveals novel susceptibility gene ABHD6.

    PubMed

    Oparina, Nina Y; Delgado-Vega, Angelica M; Martinez-Bueno, Manuel; Magro-Checa, César; Fernández, Concepción; Castro, Rafaela Ortega; Pons-Estel, Bernardo A; D'Alfonso, Sandra; Sebastiani, Gian Domenico; Witte, Torsten; Lauwerys, Bernard R; Endreffy, Emoke; Kovács, László; Escudero, Alejandro; López-Pedrera, Chary; Vasconcelos, Carlos; da Silva, Berta Martins; Frostegård, Johan; Truedsson, Lennart; Martin, Javier; Raya, Enrique; Ortego-Centeno, Norberto; de Los Angeles Aguirre, Maria; de Ramón Garrido, Enrique; Palma, María-Jesús Castillo; Alarcon-Riquelme, Marta E; Kozyrev, Sergey V

    2015-03-01

    To perform fine mapping of the PXK locus associated with systemic lupus erythematosus (SLE) and study functional effects that lead to susceptibility to the disease. Linkage disequilibrium (LD) mapping was conducted by using 1251 SNPs (single nucleotide polymorphism) covering a 862 kb genomic region on 3p14.3 comprising the PXK locus in 1467 SLE patients and 2377 controls of European origin. Tag SNPs and genotypes imputed with IMPUTE2 were tested for association by using SNPTEST and PLINK. The expression QTLs data included three independent datasets for lymphoblastoid cells of European donors: HapMap3, MuTHER and the cross-platform eQTL catalogue. Correlation analysis of eQTLs was performed using Vassarstats. Alternative splicing for the PXK gene was analysed on mRNA from PBMCs. Fine mapping revealed long-range LD (>200 kb) extended over the ABHD6, RPP14, PXK, and PDHB genes on 3p14.3. The highly correlated variants tagged an SLE-associated haplotype that was less frequent in the patients compared with the controls (OR=0.89, p=0.00684). A robust correlation between the association with SLE and enhanced expression of ABHD6 gene was revealed, while neither expression, nor splicing alterations associated with SLE susceptibility were detected for PXK. The SNP allele frequencies as well as eQTL pattern analysed in the CEU and CHB HapMap3 populations indicate that the SLE association and the effect on ABHD6 expression are specific to Europeans. These results confirm the genetic association of the locus 3p14.3 with SLE in Europeans and point to the ABHD6 and not PXK, as the major susceptibility gene in the region. We suggest a pathogenic mechanism mediated by the upregulation of ABHD6 in individuals carrying the SLE-risk variants. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  17. G-protein-coupled receptor kinase 4 polymorphisms and blood pressure response to metoprolol among African Americans: sex-specificity and interactions.

    PubMed

    Bhatnagar, Vibha; O'Connor, Daniel T; Brophy, Victoria H; Schork, Nicholas J; Richard, Erin; Salem, Rany M; Nievergelt, Caroline M; Bakris, George L; Middleton, John P; Norris, Keith C; Wright, Jackson; Hiremath, Leena; Contreras, Gabriel; Appel, Lawrence J; Lipkowitz, Michael S

    2009-03-01

    African Americans have a disproportionate burden of hypertension and comorbid disease. Pharmacogenetic markers of blood pressure response have yet to be defined clearly. This study explores the association between G-protein-coupled receptor kinase type 4 (GRK4) variants and blood pressure response to metoprolol among African Americans with early hypertensive nephrosclerosis. Participants from the African American Study of Kidney Disease and Hypertension (AASK) trial were genotyped at three GRK4 polymorphisms: R65L, A142V, and A486V. A Cox proportional hazards model, stratified by gender, was used to determine the relationship between GRK4 variants and time to reach a mean arterial pressure (MAP) of 107 mm Hg, adjusted for other predictors of blood pressure response. Potential interactions between the three polymorphisms were explored by analyzing the effects of gene haplotypes and by stratifying the analysis by neighboring sites. The hazard ratio with 95% confidence interval by A142V among men randomized to a usual MAP (102-107 mm Hg) was 1.54 (1.11-2.44; P = 0.0009). The hazard ratio by A142V with R65/L65 or L65/L65 was 2.14 (1.35-3.39; P = 0.001). Haplotype analyses were consistent but inconclusive. There was no association between A142V and blood pressure response among women. Results suggest a sex-specific relationship between GRK4 A142V and blood pressure response among African-American men with early hypertensive nephrosclerosis. Men with a GRK4 A142 were less responsive to metoprolol if they had a GRK4 L65 variant. The effect of GRK4 variants and blood pressure response to metoprolol should be studied in larger clinical trials.

  18. Molecular Basis for Necitumumab Inhibition of EGFR Variants Associated with Acquired Cetuximab Resistance.

    PubMed

    Bagchi, Atrish; Haidar, Jaafar N; Eastman, Scott W; Vieth, Michal; Topper, Michael; Iacolina, Michelle D; Walker, Jason M; Forest, Amelie; Shen, Yang; Novosiadly, Ruslan D; Ferguson, Kathryn M

    2018-02-01

    Acquired resistance to cetuximab, an antibody that targets the EGFR, impacts clinical benefit in head and neck, and colorectal cancers. One of the mechanisms of resistance to cetuximab is the acquisition of mutations that map to the cetuximab epitope on EGFR and prevent drug binding. We find that necitumumab, another FDA-approved EGFR antibody, can bind to EGFR that harbors the most common cetuximab-resistant substitution, S468R (or S492R, depending on the amino acid numbering system). We determined an X-ray crystal structure to 2.8 Å resolution of the necitumumab Fab bound to an S468R variant of EGFR domain III. The arginine is accommodated in a large, preexisting cavity in the necitumumab paratope. We predict that this paratope shape will be permissive to other epitope substitutions, and show that necitumumab binds to most cetuximab- and panitumumab-resistant EGFR variants. We find that a simple computational approach can predict with high success which EGFR epitope substitutions abrogate antibody binding. This computational method will be valuable to determine whether necitumumab will bind to EGFR as new epitope resistance variants are identified. This method could also be useful for rapid evaluation of the effect on binding of alterations in other antibody/antigen interfaces. Together, these data suggest that necitumumab may be active in patients who are resistant to cetuximab or panitumumab through EGFR epitope mutation. Furthermore, our analysis leads us to speculate that antibodies with large paratope cavities may be less susceptible to resistance due to mutations mapping to the antigen epitope. Mol Cancer Ther; 17(2); 521-31. ©2017 AACR . ©2017 American Association for Cancer Research.

  19. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Deshmukh, Ranjit; Wu, Grace

    The MapRE (Multi-criteria Analysis for Planning Renewable Energy) GIS (Geographic Information Systems) Tools are a set of ArcGIS tools to a) conduct site suitability analysis for wind and solar resources using inclusion and exclusion criteria, and create resource maps, b) create project opportunity areas and compute various attributes such as cost, distances to existing and planned infrastructure. and environmental impact factors; and c) calculate and update various attributes for already processed renewable energy zones. In addition, MapRE data sets are geospatial data of renewable energy project opportunity areas and zones with pre-calculated attributes for several countries. These tools and datamore » are available at mapre.lbl.gov.« less

  20. Discovery and fine-mapping of adiposity loci using high density imputation of genome-wide association studies in individuals of African ancestry: African Ancestry Anthropometry Genetics Consortium.

    PubMed

    Ng, Maggie C Y; Graff, Mariaelisa; Lu, Yingchang; Justice, Anne E; Mudgal, Poorva; Liu, Ching-Ti; Young, Kristin; Yanek, Lisa R; Feitosa, Mary F; Wojczynski, Mary K; Rand, Kristin; Brody, Jennifer A; Cade, Brian E; Dimitrov, Latchezar; Duan, Qing; Guo, Xiuqing; Lange, Leslie A; Nalls, Michael A; Okut, Hayrettin; Tajuddin, Salman M; Tayo, Bamidele O; Vedantam, Sailaja; Bradfield, Jonathan P; Chen, Guanjie; Chen, Wei-Min; Chesi, Alessandra; Irvin, Marguerite R; Padhukasahasram, Badri; Smith, Jennifer A; Zheng, Wei; Allison, Matthew A; Ambrosone, Christine B; Bandera, Elisa V; Bartz, Traci M; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Bottinger, Erwin P; Carpten, John; Chanock, Stephen J; Chen, Yii-Der Ida; Conti, David V; Cooper, Richard S; Fornage, Myriam; Freedman, Barry I; Garcia, Melissa; Goodman, Phyllis J; Hsu, Yu-Han H; Hu, Jennifer; Huff, Chad D; Ingles, Sue A; John, Esther M; Kittles, Rick; Klein, Eric; Li, Jin; McKnight, Barbara; Nayak, Uma; Nemesure, Barbara; Ogunniyi, Adesola; Olshan, Andrew; Press, Michael F; Rohde, Rebecca; Rybicki, Benjamin A; Salako, Babatunde; Sanderson, Maureen; Shao, Yaming; Siscovick, David S; Stanford, Janet L; Stevens, Victoria L; Stram, Alex; Strom, Sara S; Vaidya, Dhananjay; Witte, John S; Yao, Jie; Zhu, Xiaofeng; Ziegler, Regina G; Zonderman, Alan B; Adeyemo, Adebowale; Ambs, Stefan; Cushman, Mary; Faul, Jessica D; Hakonarson, Hakon; Levin, Albert M; Nathanson, Katherine L; Ware, Erin B; Weir, David R; Zhao, Wei; Zhi, Degui; Arnett, Donna K; Grant, Struan F A; Kardia, Sharon L R; Oloapde, Olufunmilayo I; Rao, D C; Rotimi, Charles N; Sale, Michele M; Williams, L Keoki; Zemel, Babette S; Becker, Diane M; Borecki, Ingrid B; Evans, Michele K; Harris, Tamara B; Hirschhorn, Joel N; Li, Yun; Patel, Sanjay R; Psaty, Bruce M; Rotter, Jerome I; Wilson, James G; Bowden, Donald W; Cupples, L Adrienne; Haiman, Christopher A; Loos, Ruth J F; North, Kari E

    2017-04-01

    Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10-8: seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal. As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations.

  1. Non-rigid multi-frame registration of cell nuclei in live cell fluorescence microscopy image data.

    PubMed

    Tektonidis, Marco; Kim, Il-Han; Chen, Yi-Chun M; Eils, Roland; Spector, David L; Rohr, Karl

    2015-01-01

    The analysis of the motion of subcellular particles in live cell microscopy images is essential for understanding biological processes within cells. For accurate quantification of the particle motion, compensation of the motion and deformation of the cell nucleus is required. We introduce a non-rigid multi-frame registration approach for live cell fluorescence microscopy image data. Compared to existing approaches using pairwise registration, our approach exploits information from multiple consecutive images simultaneously to improve the registration accuracy. We present three intensity-based variants of the multi-frame registration approach and we investigate two different temporal weighting schemes. The approach has been successfully applied to synthetic and live cell microscopy image sequences, and an experimental comparison with non-rigid pairwise registration has been carried out. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Mapping Migratory Bird Prevalence Using Remote Sensing Data Fusion

    PubMed Central

    Swatantran, Anu; Dubayah, Ralph; Goetz, Scott; Hofton, Michelle; Betts, Matthew G.; Sun, Mindy; Simard, Marc; Holmes, Richard

    2012-01-01

    Background Improved maps of species distributions are important for effective management of wildlife under increasing anthropogenic pressures. Recent advances in lidar and radar remote sensing have shown considerable potential for mapping forest structure and habitat characteristics across landscapes. However, their relative efficacies and integrated use in habitat mapping remain largely unexplored. We evaluated the use of lidar, radar and multispectral remote sensing data in predicting multi-year bird detections or prevalence for 8 migratory songbird species in the unfragmented temperate deciduous forests of New Hampshire, USA. Methodology and Principal Findings A set of 104 predictor variables describing vegetation vertical structure and variability from lidar, phenology from multispectral data and backscatter properties from radar data were derived. We tested the accuracies of these variables in predicting prevalence using Random Forests regression models. All data sets showed more than 30% predictive power with radar models having the lowest and multi-sensor synergy (“fusion”) models having highest accuracies. Fusion explained between 54% and 75% variance in prevalence for all the birds considered. Stem density from discrete return lidar and phenology from multispectral data were among the best predictors. Further analysis revealed different relationships between the remote sensing metrics and bird prevalence. Spatial maps of prevalence were consistent with known habitat preferences for the bird species. Conclusion and Significance Our results highlight the potential of integrating multiple remote sensing data sets using machine-learning methods to improve habitat mapping. Multi-dimensional habitat structure maps such as those generated from this study can significantly advance forest management and ecological research by facilitating fine-scale studies at both stand and landscape level. PMID:22235254

  3. Mapping migratory bird prevalence using remote sensing data fusion.

    PubMed

    Swatantran, Anu; Dubayah, Ralph; Goetz, Scott; Hofton, Michelle; Betts, Matthew G; Sun, Mindy; Simard, Marc; Holmes, Richard

    2012-01-01

    Improved maps of species distributions are important for effective management of wildlife under increasing anthropogenic pressures. Recent advances in lidar and radar remote sensing have shown considerable potential for mapping forest structure and habitat characteristics across landscapes. However, their relative efficacies and integrated use in habitat mapping remain largely unexplored. We evaluated the use of lidar, radar and multispectral remote sensing data in predicting multi-year bird detections or prevalence for 8 migratory songbird species in the unfragmented temperate deciduous forests of New Hampshire, USA. A set of 104 predictor variables describing vegetation vertical structure and variability from lidar, phenology from multispectral data and backscatter properties from radar data were derived. We tested the accuracies of these variables in predicting prevalence using Random Forests regression models. All data sets showed more than 30% predictive power with radar models having the lowest and multi-sensor synergy ("fusion") models having highest accuracies. Fusion explained between 54% and 75% variance in prevalence for all the birds considered. Stem density from discrete return lidar and phenology from multispectral data were among the best predictors. Further analysis revealed different relationships between the remote sensing metrics and bird prevalence. Spatial maps of prevalence were consistent with known habitat preferences for the bird species. Our results highlight the potential of integrating multiple remote sensing data sets using machine-learning methods to improve habitat mapping. Multi-dimensional habitat structure maps such as those generated from this study can significantly advance forest management and ecological research by facilitating fine-scale studies at both stand and landscape level.

  4. The MIND PALACE: A Multi-Spectral Imaging and Spectroscopy Database for Planetary Science

    NASA Astrophysics Data System (ADS)

    Eshelman, E.; Doloboff, I.; Hara, E. K.; Uckert, K.; Sapers, H. M.; Abbey, W.; Beegle, L. W.; Bhartia, R.

    2017-12-01

    The Multi-Instrument Database (MIND) is the web-based home to a well-characterized set of analytical data collected by a suite of deep-UV fluorescence/Raman instruments built at the Jet Propulsion Laboratory (JPL). Samples derive from a growing body of planetary surface analogs, mineral and microbial standards, meteorites, spacecraft materials, and other astrobiologically relevant materials. In addition to deep-UV spectroscopy, datasets stored in MIND are obtained from a variety of analytical techniques obtained over multiple spatial and spectral scales including electron microscopy, optical microscopy, infrared spectroscopy, X-ray fluorescence, and direct fluorescence imaging. Multivariate statistical analysis techniques, primarily Principal Component Analysis (PCA), are used to guide interpretation of these large multi-analytical spectral datasets. Spatial co-referencing of integrated spectral/visual maps is performed using QGIS (geographic information system software). Georeferencing techniques transform individual instrument data maps into a layered co-registered data cube for analysis across spectral and spatial scales. The body of data in MIND is intended to serve as a permanent, reliable, and expanding database of deep-UV spectroscopy datasets generated by this unique suite of JPL-based instruments on samples of broad planetary science interest.

  5. High-Resolution Sequence-Function Mapping of Full-Length Proteins

    PubMed Central

    Kowalsky, Caitlin A.; Klesmith, Justin R.; Stapleton, James A.; Kelly, Vince; Reichkitzer, Nolan; Whitehead, Timothy A.

    2015-01-01

    Comprehensive sequence-function mapping involves detailing the fitness contribution of every possible single mutation to a gene by comparing the abundance of each library variant before and after selection for the phenotype of interest. Deep sequencing of library DNA allows frequency reconstruction for tens of thousands of variants in a single experiment, yet short read lengths of current sequencers makes it challenging to probe genes encoding full-length proteins. Here we extend the scope of sequence-function maps to entire protein sequences with a modular, universal sequence tiling method. We demonstrate the approach with both growth-based selections and FACS screening, offer parameters and best practices that simplify design of experiments, and present analytical solutions to normalize data across independent selections. Using this protocol, sequence-function maps covering full sequences can be obtained in four to six weeks. Best practices introduced in this manuscript are fully compatible with, and complementary to, other recently published sequence-function mapping protocols. PMID:25790064

  6. Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds.

    PubMed

    Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

    2017-01-01

    Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.

  7. Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds

    PubMed Central

    Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

    2017-01-01

    Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836

  8. IsoMAP (Isoscape Modeling, Analysis, and Prediction)

    NASA Astrophysics Data System (ADS)

    Miller, C. C.; Bowen, G. J.; Zhang, T.; Zhao, L.; West, J. B.; Liu, Z.; Rapolu, N.

    2009-12-01

    IsoMAP is a TeraGrid-based web portal aimed at building the infrastructure that brings together distributed multi-scale and multi-format geospatial datasets to enable statistical analysis and modeling of environmental isotopes. A typical workflow enabled by the portal includes (1) data source exploration and selection, (2) statistical analysis and model development; (3) predictive simulation of isotope distributions using models developed in (1) and (2); (4) analysis and interpretation of simulated spatial isotope distributions (e.g., comparison with independent observations, pattern analysis). The gridded models and data products created by one user can be shared and reused among users within the portal, enabling collaboration and knowledge transfer. This infrastructure and the research it fosters can lead to fundamental changes in our knowledge of the water cycle and ecological and biogeochemical processes through analysis of network-based isotope data, but it will be important A) that those with whom the data and models are shared can be sure of the origin, quality, inputs, and processing history of these products, and B) the system is agile and intuitive enough to facilitate this sharing (rather than just ‘allow’ it). IsoMAP researchers are therefore building into the portal’s architecture several components meant to increase the amount of metadata about users’ products and to repurpose those metadata to make sharing and discovery more intuitive and robust to both expected, professional users as well as unforeseeable populations from other sectors.

  9. Bovine Polledness – An Autosomal Dominant Trait with Allelic Heterogeneity

    PubMed Central

    Medugorac, Ivica; Seichter, Doris; Graf, Alexander; Russ, Ingolf; Blum, Helmut; Göpel, Karl Heinrich; Rothammer, Sophie; Förster, Martin; Krebs, Stefan

    2012-01-01

    The persistent horns are an important trait of speciation for the family Bovidae with complex morphogenesis taking place briefly after birth. The polledness is highly favourable in modern cattle breeding systems but serious animal welfare issues urge for a solution in the production of hornless cattle other than dehorning. Although the dominant inhibition of horn morphogenesis was discovered more than 70 years ago, and the causative mutation was mapped almost 20 years ago, its molecular nature remained unknown. Here, we report allelic heterogeneity of the POLLED locus. First, we mapped the POLLED locus to a ∼381-kb interval in a multi-breed case-control design. Targeted re-sequencing of an enlarged candidate interval (547 kb) in 16 sires with known POLLED genotype did not detect a common allele associated with polled status. In eight sires of Alpine and Scottish origin (four polled versus four horned), we identified a single candidate mutation, a complex 202 bp insertion-deletion event that showed perfect association to the polled phenotype in various European cattle breeds, except Holstein-Friesian. The analysis of the same candidate interval in eight Holsteins identified five candidate variants which segregate as a 260 kb haplotype also perfectly associated with the POLLED gene without recombination or interference with the 202 bp insertion-deletion. We further identified bulls which are progeny tested as homozygous polled but bearing both, 202 bp insertion-deletion and Friesian haplotype. The distribution of genotypes of the two putative POLLED alleles in large semi-random sample (1,261 animals) supports the hypothesis of two independent mutations. PMID:22737241

  10. A Genome-wide Admixture Scan for Ancestry-linked Genes Predisposing to Sarcoidosis in African Americans

    PubMed Central

    Rybicki, Benjamin A.; Levin, Albert M.; McKeigue, Paul; Datta, Indrani; Gray-McGuire, Courtney; Colombo, Marco; Reich, David; Burke, Robert R.; Iannuzzi, Michael C.

    2010-01-01

    Genome-wide linkage and association studies have uncovered variants associated with sarcoidosis, a multi-organ granulomatous inflammatory disease. African ancestry may influence disease pathogenesis since African Americans are more commonly affected by sarcoidosis. Therefore, we conducted the first sarcoidosis genome-wide ancestry scan using a map of 1,384 highly ancestry informative single nucleotide polymorphisms genotyped on 1,357 sarcoidosis cases and 703 unaffected controls self-identified as African American. The most significant ancestry association was at marker rs11966463 on chromosome 6p22.3 (ancestry association risk ratio (aRR)= 1.90; p=0.0002). When we restricted the analysis to biopsy-confirmed cases, the aRR for this marker increased to 2.01; p=0.00007. Among the eight other markers that demonstrated suggestive ancestry associations with sarcoidosis were rs1462906 on chromosome 8p12 which had the most significant association with European ancestry (aRR=0.65; p=0.002), and markers on chromosomes 5p13 (aRR=1.46; p=0.005) and 5q31 (aRR=0.67; p=0.005), which correspond to regions we previously identified through sib pair linkage analyses. Overall, the most significant ancestry association for Scadding stage IV cases was to marker rs7919137 on chromosome 10p11.22 (aRR=0.27; p=2×10−5), a region not associated with disease susceptibility. In summary, through admixture mapping of sarcoidosis we have confirmed previous genetic linkages and identified several novel putative candidate loci for sarcoidosis. PMID:21179114

  11. Association analysis for udder index and milking speed with imputed whole-genome sequence variants in Nordic Holstein cattle.

    PubMed

    Jardim, Júlia Gazzoni; Guldbrandtsen, Bernt; Lund, Mogens Sandø; Sahana, Goutam

    2018-03-01

    Genome-wide association testing facilitates the identification of genetic variants associated with complex traits. Mapping genes that promote genetic resistance to mastitis could reduce the cost of antibiotic use and enhance animal welfare and milk production by improving outcomes of breeding for udder health. Using imputed whole-genome sequence variants, we carried out association studies for 2 traits related to udder health, udder index, and milking speed in Nordic Holstein cattle. A total of 4,921 bulls genotyped with the BovineSNP50 BeadChip array were imputed to high-density genotypes (Illumina BovineHD BeadChip, Illumina, San Diego, CA) and, subsequently, to whole-genome sequence variants. An association analysis was carried out using a linear mixed model. Phenotypes used in the association analyses were deregressed breeding values. Multitrait meta-analysis was carried out for these 2 traits. We identified 10 and 8 chromosomes harboring markers that were significantly associated with udder index and milking speed, respectively. Strongest association signals were observed on chromosome 20 for udder index and chromosome 19 for milking speed. Multitrait meta-analysis identified 13 chromosomes harboring associated markers for the combination of udder index and milking speed. The associated region on chromosome 20 overlapped with earlier reported quantitative trait loci for similar traits in other cattle populations. Moreover, this region was located close to the FYB gene, which is involved in platelet activation and controls IL-2 expression; FYB is a strong candidate gene for udder health and worthy of further investigation. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  12. A comprehensive examination of breast cancer risk loci in African American women.

    PubMed

    Feng, Ye; Stram, Daniel O; Rhie, Suhn Kyong; Millikan, Robert C; Ambrosone, Christine B; John, Esther M; Bernstein, Leslie; Zheng, Wei; Olshan, Andrew F; Hu, Jennifer J; Ziegler, Regina G; Nyante, Sarah; Bandera, Elisa V; Ingles, Sue A; Press, Michael F; Deming, Sandra L; Rodriguez-Gil, Jorge L; Palmer, Julie R; Olopade, Olufunmilayo I; Huo, Dezheng; Adebamowo, Clement A; Ogundiran, Temidayo; Chen, Gary K; Stram, Alex; Park, Karen; Rand, Kristin A; Chanock, Stephen J; Le Marchand, Loic; Kolonel, Laurence N; Conti, David V; Easton, Douglas; Henderson, Brian E; Haiman, Christopher A

    2014-10-15

    Genome-wide association studies have identified 73 breast cancer risk variants mainly in European populations. Given considerable differences in linkage disequilibrium structure between populations of European and African ancestry, the known risk variants may not be informative for risk in African ancestry populations. In a previous fine-mapping investigation of 19 breast cancer loci, we were able to identify SNPs in four regions that better captured risk associations in African American women. In this study of breast cancer in African American women (3016 cases, 2745 controls), we tested an additional 54 novel breast cancer risk variants. Thirty-eight variants (70%) were found to have an association with breast cancer in the same direction as previously reported, with eight (15%) replicating at P < 0.05. Through fine-mapping, in three regions (1q32, 3p24, 10q25), we identified variants that better captured associations with overall breast cancer or estrogen receptor positive disease. We also observed suggestive associations with variants (at P < 5 × 10(-6)) in three separate regions (6q25, 14q13, 22q12) that may represent novel risk variants. Directional consistency of association observed for ∼65-70% of currently known genetic variants for breast cancer in women of African ancestry implies a shared functional common variant at most loci. To validate and enhance the spectrum of alleles that define associations at the known breast cancer risk loci, as well as genome-wide, will require even larger collaborative efforts in women of African ancestry. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  13. Identification of independent association signals and putative functional variants for breast cancer risk through fine-scale mapping of the 12p11 locus.

    PubMed

    Zeng, Chenjie; Guo, Xingyi; Long, Jirong; Kuchenbaecker, Karoline B; Droit, Arnaud; Michailidou, Kyriaki; Ghoussaini, Maya; Kar, Siddhartha; Freeman, Adam; Hopper, John L; Milne, Roger L; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Agata, Simona; Ahmed, Shahana; Aittomäki, Kristiina; Andrulis, Irene L; Anton-Culver, Hoda; Antonenkova, Natalia N; Arason, Adalgeir; Arndt, Volker; Arun, Banu K; Arver, Brita; Bacot, Francois; Barrowdale, Daniel; Baynes, Caroline; Beeghly-Fadiel, Alicia; Benitez, Javier; Bermisheva, Marina; Blomqvist, Carl; Blot, William J; Bogdanova, Natalia V; Bojesen, Stig E; Bonanni, Bernardo; Borresen-Dale, Anne-Lise; Brand, Judith S; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Buys, Saundra S; Cai, Qiuyin; Caldes, Trinidad; Campbell, Ian; Carpenter, Jane; Chang-Claude, Jenny; Choi, Ji-Yeob; Claes, Kathleen B M; Clarke, Christine; Cox, Angela; Cross, Simon S; Czene, Kamila; Daly, Mary B; de la Hoya, Miguel; De Leeneer, Kim; Devilee, Peter; Diez, Orland; Domchek, Susan M; Doody, Michele; Dorfling, Cecilia M; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Dwek, Miriam; Dworniczak, Bernd; Egan, Kathleen; Eilber, Ursula; Einbeigi, Zakaria; Ejlertsen, Bent; Ellis, Steve; Frost, Debra; Lalloo, Fiona; Fasching, Peter A; Figueroa, Jonine; Flyger, Henrik; Friedlander, Michael; Friedman, Eitan; Gambino, Gaetana; Gao, Yu-Tang; Garber, Judy; García-Closas, Montserrat; Gehrig, Andrea; Damiola, Francesca; Lesueur, Fabienne; Mazoyer, Sylvie; Stoppa-Lyonnet, Dominique; Giles, Graham G; Godwin, Andrew K; Goldgar, David E; González-Neira, Anna; Greene, Mark H; Guénel, Pascal; Haeberle, Lothar; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hansen, Thomas V O; Hart, Steven; Hartikainen, Jaana M; Hartman, Mikael; Hassan, Norhashimah; Healey, Sue; Hogervorst, Frans B L; Verhoef, Senno; Hendricks, Carolyn B; Hillemanns, Peter; Hollestelle, Antoinette; Hulick, Peter J; Hunter, David J; Imyanitov, Evgeny N; Isaacs, Claudine; Ito, Hidemi; Jakubowska, Anna; Janavicius, Ramunas; Jaworska-Bieniek, Katarzyna; Jensen, Uffe Birk; John, Esther M; Joly Beauparlant, Charles; Jones, Michael; Kabisch, Maria; Kang, Daehee; Karlan, Beth Y; Kauppila, Saila; Kerin, Michael J; Khan, Sofia; Khusnutdinova, Elza; Knight, Julia A; Konstantopoulou, Irene; Kraft, Peter; Kwong, Ava; Laitman, Yael; Lambrechts, Diether; Lazaro, Conxi; Le Marchand, Loic; Lee, Chuen Neng; Lee, Min Hyuk; Lester, Jenny; Li, Jingmei; Liljegren, Annelie; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mai, Phuong L; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McGuffog, Lesley; Meindl, Alfons; Menegaux, Florence; Montagna, Marco; Muir, Kenneth; Mulligan, Anna Marie; Nathanson, Katherine L; Neuhausen, Susan L; Nevanlinna, Heli; Newcomb, Polly A; Nord, Silje; Nussbaum, Robert L; Offit, Kenneth; Olah, Edith; Olopade, Olufunmilayo I; Olswold, Curtis; Osorio, Ana; Papi, Laura; Park-Simon, Tjoung-Won; Paulsson-Karlsson, Ylva; Peeters, Stephanie; Peissel, Bernard; Peterlongo, Paolo; Peto, Julian; Pfeiler, Georg; Phelan, Catherine M; Presneau, Nadege; Radice, Paolo; Rahman, Nazneen; Ramus, Susan J; Rashid, Muhammad Usman; Rennert, Gad; Rhiem, Kerstin; Rudolph, Anja; Salani, Ritu; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Schoemaker, Minouk J; Schürmann, Peter; Seynaeve, Caroline; Shen, Chen-Yang; Shrubsole, Martha J; Shu, Xiao-Ou; Sigurdson, Alice; Singer, Christian F; Slager, Susan; Soucy, Penny; Southey, Melissa; Steinemann, Doris; Swerdlow, Anthony; Szabo, Csilla I; Tchatchou, Sandrine; Teixeira, Manuel R; Teo, Soo H; Terry, Mary Beth; Tessier, Daniel C; Teulé, Alex; Thomassen, Mads; Tihomirova, Laima; Tischkowitz, Marc; Toland, Amanda E; Tung, Nadine; Turnbull, Clare; van den Ouweland, Ans M W; van Rensburg, Elizabeth J; Ven den Berg, David; Vijai, Joseph; Wang-Gohrke, Shan; Weitzel, Jeffrey N; Whittemore, Alice S; Winqvist, Robert; Wong, Tien Y; Wu, Anna H; Yannoukakos, Drakoulis; Yu, Jyh-Cherng; Pharoah, Paul D P; Hall, Per; Chenevix-Trench, Georgia; Dunning, Alison M; Simard, Jacques; Couch, Fergus J; Antoniou, Antonis C; Easton, Douglas F; Zheng, Wei

    2016-06-21

    Multiple recent genome-wide association studies (GWAS) have identified a single nucleotide polymorphism (SNP), rs10771399, at 12p11 that is associated with breast cancer risk. We performed a fine-scale mapping study of a 700 kb region including 441 genotyped and more than 1300 imputed genetic variants in 48,155 cases and 43,612 controls of European descent, 6269 cases and 6624 controls of East Asian descent and 1116 cases and 932 controls of African descent in the Breast Cancer Association Consortium (BCAC; http://bcac.ccge.medschl.cam.ac.uk/ ), and in 15,252 BRCA1 mutation carriers in the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA). Stepwise regression analyses were performed to identify independent association signals. Data from the Encyclopedia of DNA Elements project (ENCODE) and the Cancer Genome Atlas (TCGA) were used for functional annotation. Analysis of data from European descendants found evidence for four independent association signals at 12p11, represented by rs7297051 (odds ratio (OR) = 1.09, 95 % confidence interval (CI) = 1.06-1.12; P = 3 × 10(-9)), rs805510 (OR = 1.08, 95 % CI = 1.04-1.12, P = 2 × 10(-5)), and rs1871152 (OR = 1.04, 95 % CI = 1.02-1.06; P = 2 × 10(-4)) identified in the general populations, and rs113824616 (P = 7 × 10(-5)) identified in the meta-analysis of BCAC ER-negative cases and BRCA1 mutation carriers. SNPs rs7297051, rs805510 and rs113824616 were also associated with breast cancer risk at P < 0.05 in East Asians, but none of the associations were statistically significant in African descendants. Multiple candidate functional variants are located in putative enhancer sequences. Chromatin interaction data suggested that PTHLH was the likely target gene of these enhancers. Of the six variants with the strongest evidence of potential functionality, rs11049453 was statistically significantly associated with the expression of PTHLH and its nearby gene CCDC91 at P < 0.05. This study identified four independent association signals at 12p11 and revealed potentially functional variants, providing additional insights into the underlying biological mechanism(s) for the association observed between variants at 12p11 and breast cancer risk.

  14. Fine-mapping analysis of the MHC region for vitiligo based on a new Han-MHC reference panel.

    PubMed

    Yang, Chao; Wu, Juan; Zhang, Xuelei; Wen, Leilei; Sun, Jingying; Cheng, Yuyan; Tang, Xianfa; Liang, Bo; Chen, Gang; Zhou, Fusheng; Cui, Yong; Zhang, Anping; Zhang, Xuejun; Zheng, Xiaodong; Yang, Sen; Sun, Liangdan

    2018-03-30

    Vitiligo is an immune-related disease with patchy depigmentation of skin and hair caused by selective destruction of melanocytes. In recent decades, many studies have shown the association between vitiligo and HLA genes; however, the results of Han Chinese are scarce. In this study, we performed a fine-mapping analysis of the MHC region in 2818 Han Chinese subjects through a widely used HLA imputation method with a newly built large-scale Han-MHC reference panel. Three new four-digit HLA alleles (HLA-DQB1 ∗ 02:02, HLA-DQA1 ∗ 02:01 and HLA-DPB1 ∗ 17:01) were identified to be associated with the risk of vitiligo, and four previously reported alleles were confirmed. Further conditional analysis revealed that two important variants, HLA-DQβ1 amino acid position 135 (OR = 1.79, P = 1.87 × 10 -11 ) and HLA-B amino acid positions 45-46 (OR = 1.44, P = 5.61 × 10 -11 ), conferred most of the MHC associations. Three-dimension ribbon models showed that the former is located within the β2 domain of the HLA-DQβ1 molecule, and the latter lies in the α1 domain of the HLA-B molecule, while both are involved in specific antigen presenting process. Finally, we summarized all significant signals in the MHC region to clarify their complex relationships, and 8.60% of phenotypic variance could be explained based on all reported variants in Han Chinese so far. Our findings highlight the complex genetic architecture of the MHC region for vitiligo in Han Chinese population and expand our understanding of the roles of HLA coding variants in the etiology of vitiligo. Copyright © 2018. Published by Elsevier B.V.

  15. Testing cross-phenotype effects of rare variants in longitudinal studies of complex traits.

    PubMed

    Rudra, Pratyaydipta; Broadaway, K Alaine; Ware, Erin B; Jhun, Min A; Bielak, Lawrence F; Zhao, Wei; Smith, Jennifer A; Peyser, Patricia A; Kardia, Sharon L R; Epstein, Michael P; Ghosh, Debashis

    2018-06-01

    Many gene mapping studies of complex traits have identified genes or variants that influence multiple phenotypes. With the advent of next-generation sequencing technology, there has been substantial interest in identifying rare variants in genes that possess cross-phenotype effects. In the presence of such effects, modeling both the phenotypes and rare variants collectively using multivariate models can achieve higher statistical power compared to univariate methods that either model each phenotype separately or perform separate tests for each variant. Several studies collect phenotypic data over time and using such longitudinal data can further increase the power to detect genetic associations. Although rare-variant approaches exist for testing cross-phenotype effects at a single time point, there is no analogous method for performing such analyses using longitudinal outcomes. In order to fill this important gap, we propose an extension of Gene Association with Multiple Traits (GAMuT) test, a method for cross-phenotype analysis of rare variants using a framework based on the distance covariance. The approach allows for both binary and continuous phenotypes and can also adjust for covariates. Our simple adjustment to the GAMuT test allows it to handle longitudinal data and to gain power by exploiting temporal correlation. The approach is computationally efficient and applicable on a genome-wide scale due to the use of a closed-form test whose significance can be evaluated analytically. We use simulated data to demonstrate that our method has favorable power over competing approaches and also apply our approach to exome chip data from the Genetic Epidemiology Network of Arteriopathy. © 2018 WILEY PERIODICALS, INC.

  16. Characterizing Genetic Susceptibility to Breast Cancer in Women of African Ancestry.

    PubMed

    Feng, Ye; Rhie, Suhn Kyong; Huo, Dezheng; Ruiz-Narvaez, Edward A; Haddad, Stephen A; Ambrosone, Christine B; John, Esther M; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J; Ziegler, Regina G; Nyante, Sarah; Bandera, Elisa V; Ingles, Sue A; Press, Michael F; Deming, Sandra L; Rodriguez-Gil, Jorge L; Zheng, Yonglan; Yao, Song; Han, Yoo-Jeong; Ogundiran, Temidayo O; Rebbeck, Timothy R; Adebamowo, Clement; Ojengbede, Oladosu; Falusi, Adeyinka G; Hennis, Anselm; Nemesure, Barbara; Ambs, Stefan; Blot, William; Cai, Qiuyin; Signorello, Lisa; Nathanson, Katherine L; Lunetta, Kathryn L; Sucheston-Campbell, Lara E; Bensen, Jeannette T; Chanock, Stephen J; Marchand, Loic Le; Olshan, Andrew F; Kolonel, Laurence N; Conti, David V; Coetzee, Gerhard A; Stram, Daniel O; Olopade, Olufunmilayo I; Palmer, Julie R; Haiman, Christopher A

    2017-07-01

    Background: Genome-wide association studies have identified approximately 100 common genetic variants associated with breast cancer risk, the majority of which were discovered in women of European ancestry. Because of different patterns of linkage disequilibrium, many of these genetic markers may not represent signals in populations of African ancestry. Methods: We tested 74 breast cancer risk variants and conducted fine-mapping of these susceptibility regions in 6,522 breast cancer cases and 7,643 controls of African ancestry from three genetic consortia (AABC, AMBER, and ROOT). Results: Fifty-four of the 74 variants (73%) were found to have ORs that were directionally consistent with those previously reported, of which 12 were nominally statistically significant ( P < 0.05). Through fine-mapping, in six regions ( 3p24, 12p11, 14q13, 16q12/FTO, 16q23, 19p13 ), we observed seven markers that better represent the underlying risk variant for overall breast cancer or breast cancer subtypes, whereas in another two regions ( 11q13, 16q12/TOX3 ), we identified suggestive evidence of signals that are independent of the reported index variant. Overlapping chromatin features and regulatory elements suggest that many of the risk alleles lie in regions with biological functionality. Conclusions: Through fine-mapping of known susceptibility regions, we have revealed alleles that better characterize breast cancer risk in women of African ancestry. Impact: The risk alleles identified represent genetic markers for modeling and stratifying breast cancer risk in women of African ancestry. Cancer Epidemiol Biomarkers Prev; 26(7); 1016-26. ©2017 AACR . ©2017 American Association for Cancer Research.

  17. Common genetic variants and subclinical atherosclerosis: The Multi-Ethnic Study of Atherosclerosis (MESA).

    PubMed

    Vargas, Jose D; Manichaikul, Ani; Wang, Xin-Qun; Rich, Stephen S; Rotter, Jerome I; Post, Wendy S; Polak, Joseph F; Budoff, Matthew J; Bluemke, David A

    2016-02-01

    Subclinical atherosclerosis (sCVD), measured by coronary artery calcium (CAC) and carotid intima media thickness (CIMT) is associated with cardiovascular disease (CVD). Genome-Wide Association Studies (GWAS) of sCVD and CVD have focused primarily on Caucasian populations. We hypothesized that these associations may differ in populations from distinct genetic backgrounds. The associations between sCVD and 66 single nucleotide polymorphisms (SNPs) from published GWAS of sCVD and CVD were tested in 8224 Multi-Ethnic Study of Atherosclerosis (MESA) and MESA Family participants [2329 Caucasians (EUA), 691 Chinese (CHN), 2482 African Americans (AFA), and 2012 Hispanic (HIS)] using an additive model adjusting for CVD risk factors, with SNP significance defined by a Bonferroni-corrected p < 7.6 × 10(-4) (0.05/66). In EUA there were significant associations for CAC with SNPs in 9p21 (rs1333049, P = 2 × 10(-9); rs4977574, P = 4 × 10(-9)), COL4A1 (rs9515203, P = 9 × 10(-6)), and PHACTR1 (rs9349379, P = 4 × 10(-4)). In HIS, CAC was associated with SNPs in 9p21 (rs1333049, P = 8 × 10(-5); rs4977574, P = 5 × 10(-5)), APOA5 (rs964184, P = 2 × 10(-4)), and ADAMTS7 (rs7173743, P = 4 × 10(-4)). There were no associations between CAC and 9p21 SNPs for AFA and CHN. Fine mapping of the 9p21 region revealed SNPs with robust associations with CAC in EUA and HIS but no significant associations in AFA and CHN. Our results suggest some shared genetic architecture for sCVD across ethnic groups, while also underscoring the possibility of novel variants and/or pathways in risk of CVD in ethnically diverse populations. Published by Elsevier Ireland Ltd.

  18. Genetic Architecture of Vitamin B12 and Folate Levels Uncovered Applying Deeply Sequenced Large Datasets

    PubMed Central

    Thorleifsson, Gudmar; Ahluwalia, Tarunveer S.; Steinthorsdottir, Valgerdur; Bjarnason, Helgi; Gudbjartsson, Daniel F.; Magnusson, Olafur T.; Sparsø, Thomas; Albrechtsen, Anders; Kong, Augustine; Masson, Gisli; Tian, Geng; Cao, Hongzhi; Nie, Chao; Kristiansen, Karsten; Husemoen, Lise Lotte; Thuesen, Betina; Li, Yingrui; Nielsen, Rasmus; Linneberg, Allan; Olafsson, Isleifur; Eyjolfsson, Gudmundur I.; Jørgensen, Torben; Wang, Jun; Hansen, Torben; Thorsteinsdottir, Unnur; Stefánsson, Kari; Pedersen, Oluf

    2013-01-01

    Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B12 (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B12 and folate measurements, respectively. We found six novel loci associating with serum B12 (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B12 and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B12 or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations. PMID:23754956

  19. Multi-channel Analysis of Passive Surface Waves (MAPS)

    NASA Astrophysics Data System (ADS)

    Xia, J.; Cheng, F. Mr; Xu, Z.; Wang, L.; Shen, C.; Liu, R.; Pan, Y.; Mi, B.; Hu, Y.

    2017-12-01

    Urbanization is an inevitable trend in modernization of human society. In the end of 2013 the Chinese Central Government launched a national urbanization plan—"Three 100 Million People", which aggressively and steadily pushes forward urbanization. Based on the plan, by 2020, approximately 100 million people from rural areas will permanently settle in towns, dwelling conditions of about 100 million people in towns and villages will be improved, and about 100 million people in the central and western China will permanently settle in towns. China's urbanization process will run at the highest speed in the urbanization history of China. Environmentally friendly, non-destructive and non-invasive geophysical assessment method has played an important role in the urbanization process in China. Because human noise and electromagnetic field due to industrial life, geophysical methods already used in urban environments (gravity, magnetics, electricity, seismic) face great challenges. But humanity activity provides an effective source of passive seismic methods. Claerbout pointed out that wavefileds that are received at one point with excitation at the other point can be reconstructed by calculating the cross-correlation of noise records at two surface points. Based on this idea (cross-correlation of two noise records) and the virtual source method, we proposed Multi-channel Analysis of Passive Surface Waves (MAPS). MAPS mainly uses traffic noise recorded with a linear receiver array. Because Multi-channel Analysis of Surface Waves can produces a shear (S) wave velocity model with high resolution in shallow part of the model, MPAS combines acquisition and processing of active source and passive source data in a same flow, which does not require to distinguish them. MAPS is also of ability of real-time quality control of noise recording that is important for near-surface applications in urban environment. The numerical and real-world examples demonstrated that MAPS can be used for accurate and fast imaging of high-frequency surface wave energy, and some examples also show that high quality imaging similar to those with active sources can be generated only by the use of a few minutes of noise. The use of cultural noise in town, MAPS can image S-wave velocity structure from the ground surface to hundreds of meters depth.

  20. Characterizing complex structural variation in germline and somatic genomes

    PubMed Central

    Quinlan, Aaron R.; Hall, Ira M.

    2011-01-01

    Genome structural variation (SV) is a major source of genetic diversity in mammals and a hallmark of cancer. While SV is typically defined by its canonical forms – duplication, deletion, insertion, inversion and translocation – recent breakpoint mapping studies have revealed a surprising number of “complex” variants that evade simple classification. Complex SVs are defined by clustered breakpoints that arose through a single mutation but cannot be explained by one simple end-joining or recombination event. Some complex variants exhibit profoundly complicated rearrangements between distinct loci from multiple chromosomes, while others involve more subtle alterations at a single locus. These diverse and unpredictable features present a challenge for SV mapping experiments. Here, we review current knowledge of complex SV in mammals, and outline techniques for identifying and characterizing complex variants using next-generation DNA sequencing. PMID:22094265

  1. Molecular and geographic analyses of vampire bat-transmitted cattle rabies in central Brazil

    PubMed Central

    Kobayashi, Yuki; Sato, Go; Mochizuki, Nobuyuki; Hirano, Shinji; Itou, Takuya; Carvalho, Adolorata AB; Albas, Avelino; Santos, Hamilton P; Ito, Fumio H; Sakai, Takeo

    2008-01-01

    Background Vampire bats are important rabies virus vectors, causing critical problems in both the livestock industry and public health sector in Latin America. In order to assess the epidemiological characteristics of vampire bat-transmitted rabies, the authors conducted phylogenetic and geographical analyses using sequence data of a large number of cattle rabies isolates collected from a wide geographical area in Brazil. Methods Partial nucleoprotein genes of rabies viruses isolated from 666 cattle and 18 vampire bats between 1987 and 2006 were sequenced and used for phylogenetic analysis. The genetic variants were plotted on topographical maps of Brazil. Results In this study, 593 samples consisting of 24 genetic variants were analyzed. Regional localization of variants was observed, with the distribution of several variants found to be delimited by mountain ranges which served as geographic boundaries. The geographical distributions of vampire-bat and cattle isolates that were classified as the identical phylogenetic group were found to overlap with high certainty. Most of the samples analyzed in this study were isolated from adjacent areas linked by rivers. Conclusion This study revealed the existence of several dozen regional variants associated with vampire bats in Brazil, with the distribution patterns of these variants found to be affected by mountain ranges and rivers. These results suggest that epidemiological characteristics of vampire bat-related rabies appear to be associated with the topographical and geographical characteristics of areas where cattle are maintained, and the factors affecting vampire bat ecology. PMID:18983685

  2. Multiscale recurrence analysis of spatio-temporal data

    NASA Astrophysics Data System (ADS)

    Riedl, M.; Marwan, N.; Kurths, J.

    2015-12-01

    The description and analysis of spatio-temporal dynamics is a crucial task in many scientific disciplines. In this work, we propose a method which uses the mapogram as a similarity measure between spatially distributed data instances at different time points. The resulting similarity values of the pairwise comparison are used to construct a recurrence plot in order to benefit from established tools of recurrence quantification analysis and recurrence network analysis. In contrast to other recurrence tools for this purpose, the mapogram approach allows the specific focus on different spatial scales that can be used in a multi-scale analysis of spatio-temporal dynamics. We illustrate this approach by application on mixed dynamics, such as traveling parallel wave fronts with additive noise, as well as more complicate examples, pseudo-random numbers and coupled map lattices with a semi-logistic mapping rule. Especially the complicate examples show the usefulness of the multi-scale consideration in order to take spatial pattern of different scales and with different rhythms into account. So, this mapogram approach promises new insights in problems of climatology, ecology, or medicine.

  3. Multiscale recurrence analysis of spatio-temporal data.

    PubMed

    Riedl, M; Marwan, N; Kurths, J

    2015-12-01

    The description and analysis of spatio-temporal dynamics is a crucial task in many scientific disciplines. In this work, we propose a method which uses the mapogram as a similarity measure between spatially distributed data instances at different time points. The resulting similarity values of the pairwise comparison are used to construct a recurrence plot in order to benefit from established tools of recurrence quantification analysis and recurrence network analysis. In contrast to other recurrence tools for this purpose, the mapogram approach allows the specific focus on different spatial scales that can be used in a multi-scale analysis of spatio-temporal dynamics. We illustrate this approach by application on mixed dynamics, such as traveling parallel wave fronts with additive noise, as well as more complicate examples, pseudo-random numbers and coupled map lattices with a semi-logistic mapping rule. Especially the complicate examples show the usefulness of the multi-scale consideration in order to take spatial pattern of different scales and with different rhythms into account. So, this mapogram approach promises new insights in problems of climatology, ecology, or medicine.

  4. Groundwater assessment in Salboni Block, West Bengal (India) using remote sensing, geographical information system and multi-criteria decision analysis techniques

    NASA Astrophysics Data System (ADS)

    Jha, Madan K.; Chowdary, V. M.; Chowdhury, Alivia

    2010-11-01

    An approach is presented for the evaluation of groundwater potential using remote sensing, geographic information system, geoelectrical, and multi-criteria decision analysis techniques. The approach divides the available hydrologic and hydrogeologic data into two groups, exogenous (hydrologic) and endogenous (subsurface). A case study in Salboni Block, West Bengal (India), uses six thematic layers of exogenous parameters and four thematic layers of endogenous parameters. These thematic layers and their features were assigned suitable weights which were normalized by analytic hierarchy process and eigenvector techniques. The layers were then integrated using ArcGIS software to generate two groundwater potential maps. The hydrologic parameters-based groundwater potential zone map indicated that the `good' groundwater potential zone covers 27.14% of the area, the `moderate' zone 45.33%, and the `poor' zone 27.53%. A comparison of this map with the groundwater potential map based on subsurface parameters revealed that the hydrologic parameters-based map accurately delineates groundwater potential zones in about 59% of the area, and hence it is dependable to a certain extent. More than 80% of the study area has moderate-to-poor groundwater potential, which necessitates efficient groundwater management for long-term water security. Overall, the integrated technique is useful for the assessment of groundwater resources at a basin or sub-basin scale.

  5. Pareto-optimal reversed-phase chromatography separation of three insulin variants with a solubility constraint.

    PubMed

    Arkell, Karolina; Knutson, Hans-Kristian; Frederiksen, Søren S; Breil, Martin P; Nilsson, Bernt

    2018-01-12

    With the shift of focus of the regulatory bodies, from fixed process conditions towards flexible ones based on process understanding, model-based optimization is becoming an important tool for process development within the biopharmaceutical industry. In this paper, a multi-objective optimization study of separation of three insulin variants by reversed-phase chromatography (RPC) is presented. The decision variables were the load factor, the concentrations of ethanol and KCl in the eluent, and the cut points for the product pooling. In addition to the purity constraints, a solubility constraint on the total insulin concentration was applied. The insulin solubility is a function of the ethanol concentration in the mobile phase, and the main aim was to investigate the effect of this constraint on the maximal productivity. Multi-objective optimization was performed with and without the solubility constraint, and visualized as Pareto fronts, showing the optimal combinations of the two objectives productivity and yield for each case. Comparison of the constrained and unconstrained Pareto fronts showed that the former diverges when the constraint becomes active, because the increase in productivity with decreasing yield is almost halted. Consequently, we suggest the operating point at which the total outlet concentration of insulin reaches the solubility limit as the most suitable one. According to the results from the constrained optimizations, the maximal productivity on the C 4 adsorbent (0.41 kg/(m 3  column h)) is less than half of that on the C 18 adsorbent (0.87 kg/(m 3  column h)). This is partly caused by the higher selectivity between the insulin variants on the C 18 adsorbent, but the main reason is the difference in how the solubility constraint affects the processes. Since the optimal ethanol concentration for elution on the C 18 adsorbent is higher than for the C 4 one, the insulin solubility is also higher, allowing a higher pool concentration. An alternative method of finding the suggested operating point was also evaluated, and it was shown to give very satisfactory results for well-mapped Pareto fronts. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. The High Resolution Stereo Camera (HRSC) of Mars Express and its approach to science analysis and mapping for Mars and its satellites

    NASA Astrophysics Data System (ADS)

    Gwinner, K.; Jaumann, R.; Hauber, E.; Hoffmann, H.; Heipke, C.; Oberst, J.; Neukum, G.; Ansan, V.; Bostelmann, J.; Dumke, A.; Elgner, S.; Erkeling, G.; Fueten, F.; Hiesinger, H.; Hoekzema, N. M.; Kersten, E.; Loizeau, D.; Matz, K.-D.; McGuire, P. C.; Mertens, V.; Michael, G.; Pasewaldt, A.; Pinet, P.; Preusker, F.; Reiss, D.; Roatsch, T.; Schmidt, R.; Scholten, F.; Spiegel, M.; Stesky, R.; Tirsch, D.; van Gasselt, S.; Walter, S.; Wählisch, M.; Willner, K.

    2016-07-01

    The High Resolution Stereo Camera (HRSC) of ESA's Mars Express is designed to map and investigate the topography of Mars. The camera, in particular its Super Resolution Channel (SRC), also obtains images of Phobos and Deimos on a regular basis. As HRSC is a push broom scanning instrument with nine CCD line detectors mounted in parallel, its unique feature is the ability to obtain along-track stereo images and four colors during a single orbital pass. The sub-pixel accuracy of 3D points derived from stereo analysis allows producing DTMs with grid size of up to 50 m and height accuracy on the order of one image ground pixel and better, as well as corresponding orthoimages. Such data products have been produced systematically for approximately 40% of the surface of Mars so far, while global shape models and a near-global orthoimage mosaic could be produced for Phobos. HRSC is also unique because it bridges between laser altimetry and topography data derived from other stereo imaging instruments, and provides geodetic reference data and geological context to a variety of non-stereo datasets. This paper, in addition to an overview of the status and evolution of the experiment, provides a review of relevant methods applied for 3D reconstruction and mapping, and respective achievements. We will also review the methodology of specific approaches to science analysis based on joint analysis of DTM and orthoimage information, or benefitting from high accuracy of co-registration between multiple datasets, such as studies using multi-temporal or multi-angular observations, from the fields of geomorphology, structural geology, compositional mapping, and atmospheric science. Related exemplary results from analysis of HRSC data will be discussed. After 10 years of operation, HRSC covered about 70% of the surface by panchromatic images at 10-20 m/pixel, and about 97% at better than 100 m/pixel. As the areas with contiguous coverage by stereo data are increasingly abundant, we also present original data related to the analysis of image blocks and address methodology aspects of newly established procedures for the generation of multi-orbit DTMs and image mosaics. The current results suggest that multi-orbit DTMs with grid spacing of 50 m can be feasible for large parts of the surface, as well as brightness-adjusted image mosaics with co-registration accuracy of adjacent strips on the order of one pixel, and at the highest image resolution available. These characteristics are demonstrated by regional multi-orbit data products covering the MC-11 (East) quadrangle of Mars, representing the first prototype of a new HRSC data product level.

  7. Characterization of Smoc-1 uncovers two transcript variants showing differential tissue and age specific expression in Bubalus bubalis

    PubMed Central

    Srivastava, Jyoti; Premi, Sanjay; Kumar, Sudhir; Parwez, Iqbal; Ali, Sher

    2007-01-01

    Background Secreted modular calcium binding protein-1 (Smoc-1) belongs to the BM-40 family which has been implicated with tissue remodeling, angiogenesis and bone mineralization. Besides its anticipated role in embryogenesis, Smoc-1 has been characterized only in a few mammalian species. We made use of the consensus sequence (5' CACCTCTCCACCTGCC 3') of 33.15 repeat loci to explore the buffalo transcriptome and uncovered the Smoc-1 transcript tagged with this repeat. The main objective of this study was to gain an insight into its structural and functional organization, and expressional status of Smoc-1 in water buffalo, Bubalus bubalis. Results We cloned and characterized the buffalo Smoc-1, including its copy number status, in-vitro protein expression, tissue & age specific transcription/translation, chromosomal mapping and localization to the basement membrane zone. Buffalo Smoc-1 was found to encode a secreted matricellular glycoprotein containing two EF-hand calcium binding motifs homologous to that of BM-40/SPARC family. In buffalo, this single copy gene consisted of 12 exons and was mapped onto the acrocentric chromosome 11. Though this gene was found to be evolutionarily conserved, the buffalo Smoc-1 showed conspicuous nucleotide/amino acid changes altering its secondary structure compared to that in other mammals. In silico analysis of the Smoc-1 proposed its glycoprotein nature with a calcium dependent conformation. Further, we unveiled two transcript variants of this gene, varying in their 3'UTR lengths but both coding for identical protein(s). Smoc-1 evinced highest expression of both the variants in liver and modest to negligible in other tissues. The relative expression of variant-02 was markedly higher compared to that of variant-01 in all the tissues examined. Moreover, expression of Smoc-1, though modest during the early ages, was conspicuously enhanced after 1 year and remained consistently higher during the entire life span of buffalo with gradual increment in expression of variant-02. Immunohistochemically, Smoc-1 was localized in the basement membrane zones and extracellular matrices of various tissues. Conclusion These data added to our understandings about the tissue, age and species specific functions of the Smoc-1. It also enabled us to demonstrate varying expression of the two transcript variants of Smoc-1 amongst different somatic tissues/gonads and ages, in spite of their identical coding frames. Pursuance of these variants for their roles in various disease phenotypes such as hepatocellular carcinoma and angiogenesis is envisaged to establish broader biological significance of this gene. PMID:18042303

  8. High-Speed Real-Time Resting-State fMRI Using Multi-Slab Echo-Volumar Imaging

    PubMed Central

    Posse, Stefan; Ackley, Elena; Mutihac, Radu; Zhang, Tongsheng; Hummatov, Ruslan; Akhtari, Massoud; Chohan, Muhammad; Fisch, Bruce; Yonas, Howard

    2013-01-01

    We recently demonstrated that ultra-high-speed real-time fMRI using multi-slab echo-volumar imaging (MEVI) significantly increases sensitivity for mapping task-related activation and resting-state networks (RSNs) compared to echo-planar imaging (Posse et al., 2012). In the present study we characterize the sensitivity of MEVI for mapping RSN connectivity dynamics, comparing independent component analysis (ICA) and a novel seed-based connectivity analysis (SBCA) that combines sliding-window correlation analysis with meta-statistics. This SBCA approach is shown to minimize the effects of confounds, such as movement, and CSF and white matter signal changes, and enables real-time monitoring of RSN dynamics at time scales of tens of seconds. We demonstrate highly sensitive mapping of eloquent cortex in the vicinity of brain tumors and arterio-venous malformations, and detection of abnormal resting-state connectivity in epilepsy. In patients with motor impairment, resting-state fMRI provided focal localization of sensorimotor cortex compared with more diffuse activation in task-based fMRI. The fast acquisition speed of MEVI enabled segregation of cardiac-related signal pulsation using ICA, which revealed distinct regional differences in pulsation amplitude and waveform, elevated signal pulsation in patients with arterio-venous malformations and a trend toward reduced pulsatility in gray matter of patients compared with healthy controls. Mapping cardiac pulsation in cortical gray matter may carry important functional information that distinguishes healthy from diseased tissue vasculature. This novel fMRI methodology is particularly promising for mapping eloquent cortex in patients with neurological disease, having variable degree of cooperation in task-based fMRI. In conclusion, ultra-high-real-time speed fMRI enhances the sensitivity of mapping the dynamics of resting-state connectivity and cerebro-vascular pulsatility for clinical and neuroscience research applications. PMID:23986677

  9. The choice of primary energy source including PV installation for providing electric energy to a public utility building - a case study

    NASA Astrophysics Data System (ADS)

    Radomski, Bartosz; Ćwiek, Barbara; Mróz, Tomasz M.

    2017-11-01

    The paper presents multicriteria decision aid analysis of the choice of PV installation providing electric energy to a public utility building. From the energy management point of view electricity obtained by solar radiation has become crucial renewable energy source. Application of PV installations may occur a profitable solution from energy, economic and ecologic point of view for both existing and newly erected buildings. Featured variants of PV installations have been assessed by multicriteria analysis based on ANP (Analytic Network Process) method. Technical, economical, energy and environmental criteria have been identified as main decision criteria. Defined set of decision criteria has an open character and can be modified in the dialog process between the decision-maker and the expert - in the present case, an expert in planning of development of energy supply systems. The proposed approach has been used to evaluate three variants of PV installation acceptable for existing educational building located in Poznań, Poland - the building of Faculty of Chemical Technology, Poznań University of Technology. Multi-criteria analysis based on ANP method and the calculation software Super Decisions has proven to be an effective tool for energy planning, leading to the indication of the recommended variant of PV installation in existing and newly erected public buildings. Achieved results show prospects and possibilities of rational renewable energy usage as complex solution to public utility buildings.

  10. Candidate Sequence Variants and Fetal Hemoglobin in Children with Sickle Cell Disease Treated with Hydroxyurea

    PubMed Central

    Green, Nancy S.; Ender, Katherine L.; Pashankar, Farzana; Driscoll, Catherine; Giardina, Patricia J.; Mullen, Craig A.; Clark, Lorraine N.; Manwani, Deepa; Crotty, Jennifer; Kisselev, Sergey; Neville, Kathleen A.; Hoppe, Carolyn; Barral, Sandra

    2013-01-01

    Background Fetal hemoglobin level is a heritable complex trait that strongly correlates swith the clinical severity of sickle cell disease. Only few genetic loci have been identified as robustly associated with fetal hemoglobin in patients with sickle cell disease, primarily adults. The sole approved pharmacologic therapy for this disease is hydroxyurea, with effects largely attributable to induction of fetal hemoglobin. Methodology/Principal Findings In a multi-site observational analysis of children with sickle cell disease, candidate single nucleotide polymorphisms associated with baseline fetal hemoglobin levels in adult sickle cell disease were examined in children at baseline and induced by hydroxyurea therapy. For baseline levels, single marker analysis demonstrated significant association with BCL11A and the beta and epsilon globin loci (HBB and HBE, respectively), with an additive attributable variance from these loci of 23%. Among a subset of children on hydroxyurea, baseline fetal hemoglobin levels explained 33% of the variance in induced levels. The variant in HBE accounted for an additional 13% of the variance in induced levels, while variants in the HBB and BCL11A loci did not contribute beyond baseline levels. Conclusions/Significance These findings clarify the overlap between baseline and hydroxyurea-induced fetal hemoglobin levels in pediatric disease. Studies assessing influences of specific sequence variants in these and other genetic loci in larger populations and in unusual hydroxyurea responders are needed to further understand the maintenance and therapeutic induction of fetal hemoglobin in pediatric sickle cell disease. PMID:23409025

  11. (Semi-)Automated landform mapping of the alpine valley Gradental (Austria) based on LiDAR data

    NASA Astrophysics Data System (ADS)

    Strasser, T.; Eisank, C.

    2012-04-01

    Alpine valleys are typically characterised as complex, hierarchical structured systems with rapid landform changes. Detection of landform changes can be supported by automated geomorphological mapping. Especially, the analysis over short time scales require a method for standardised, unbiased geomorphological map reproduction, which is delivered by automated mapping techniques. In general, digital geomorphological mapping is a challenging task, since knowledge about landforms with respect to their natural boundaries as well as their hierarchical and scaling relationships, has to be integrated in an objective way. A combination of very-high spatial resolution data (VHSR) such as LiDAR and new methods like object based image analysis (OBIA) allow for a more standardised production of geomorphological maps. In OBIA the processing units are spatially configured objects that are created by multi-scale segmentation. Therefore, not only spectral information can be used for assigning the objects to geomorphological classes, but also spatial and topological properties can be exploited. In this study we focus on the detection of landforms, especially bedrock sediment deposits (alluvion, debris cone, talus, moraine, rockglacier), as well as glaciers. The study site Gradental [N 46°58'29.1"/ E 12°48'53.8"] is located in the Schobergruppe (Austria, Carinthia) and is characterised by heterogenic geology conditions and high process activity. The area is difficult to access and dominated by steep slopes, thus hindering a fast and detailed geomorphological field mapping. Landforms are identified using aerial and terrestrial LiDAR data (1 m spatial resolution). These DEMs are analysed by an object based hierarchical approach, which is structured in three main steps. The first step is to define occurring landforms by basic land surface parameters (LSPs), topology and hierarchy relations. Based on those definitions a semantic model is created. Secondly, a multi-scale segmentation is performed on a three-band LSP that integrates slope, aspect and plan curvature, which expresses the driving forces of geomorphological processes. In the third step, the generated multi-level object structures are classified in order to produce the geomorphological map. The classification rules are derived from the semantic model. Due to landform type-specific scale dependencies of LSPs, the values of LSPs used in the classification are calculated in a multi-scale manner by constantly enlarging the size of the moving window. In addition, object form properties (density, compactness, rectangular fit) are utilised as additional information for landform characterisation. Validation of classification is performed by intersecting a visually interpreted reference map with the classification output map and calculating accuracy matrices. Validation shows an overall accuracy of 78.25 % and a Kappa of 0.65. The natural borders of landforms can be easily detected by the use of slope, aspect and plan curvature. This study illustrates the potential of OBIA for a more standardised and automated mapping of surface units (landforms, landcover). Therefore, the presented methodology features a prospective automated geomorphological mapping approach for alpine regions.

  12. dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms.

    PubMed

    Puritz, Jonathan B; Hollenbeck, Christopher M; Gold, John R

    2014-01-01

    Restriction-site associated DNA sequencing (RADseq) has become a powerful and useful approach for population genomics. Currently, no software exists that utilizes both paired-end reads from RADseq data to efficiently produce population-informative variant calls, especially for non-model organisms with large effective population sizes and high levels of genetic polymorphism. dDocent is an analysis pipeline with a user-friendly, command-line interface designed to process individually barcoded RADseq data (with double cut sites) into informative SNPs/Indels for population-level analyses. The pipeline, written in BASH, uses data reduction techniques and other stand-alone software packages to perform quality trimming and adapter removal, de novo assembly of RAD loci, read mapping, SNP and Indel calling, and baseline data filtering. Double-digest RAD data from population pairings of three different marine fishes were used to compare dDocent with Stacks, the first generally available, widely used pipeline for analysis of RADseq data. dDocent consistently identified more SNPs shared across greater numbers of individuals and with higher levels of coverage. This is due to the fact that dDocent quality trims instead of filtering, incorporates both forward and reverse reads (including reads with INDEL polymorphisms) in assembly, mapping, and SNP calling. The pipeline and a comprehensive user guide can be found at http://dDocent.wordpress.com.

  13. dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms

    PubMed Central

    Hollenbeck, Christopher M.; Gold, John R.

    2014-01-01

    Restriction-site associated DNA sequencing (RADseq) has become a powerful and useful approach for population genomics. Currently, no software exists that utilizes both paired-end reads from RADseq data to efficiently produce population-informative variant calls, especially for non-model organisms with large effective population sizes and high levels of genetic polymorphism. dDocent is an analysis pipeline with a user-friendly, command-line interface designed to process individually barcoded RADseq data (with double cut sites) into informative SNPs/Indels for population-level analyses. The pipeline, written in BASH, uses data reduction techniques and other stand-alone software packages to perform quality trimming and adapter removal, de novo assembly of RAD loci, read mapping, SNP and Indel calling, and baseline data filtering. Double-digest RAD data from population pairings of three different marine fishes were used to compare dDocent with Stacks, the first generally available, widely used pipeline for analysis of RADseq data. dDocent consistently identified more SNPs shared across greater numbers of individuals and with higher levels of coverage. This is due to the fact that dDocent quality trims instead of filtering, incorporates both forward and reverse reads (including reads with INDEL polymorphisms) in assembly, mapping, and SNP calling. The pipeline and a comprehensive user guide can be found at http://dDocent.wordpress.com. PMID:24949246

  14. Insight into Genotype-Phenotype Associations through eQTL Mapping in Multiple Cell Types in Health and Immune-Mediated Disease

    PubMed Central

    Peters, James E.; Lyons, Paul A.; Lee, James C.; Richard, Arianne C.; Fortune, Mary D.; Newcombe, Paul J.; Richardson, Sylvia; Smith, Kenneth G. C.

    2016-01-01

    Genome-wide association studies (GWAS) have transformed our understanding of the genetics of complex traits such as autoimmune diseases, but how risk variants contribute to pathogenesis remains largely unknown. Identifying genetic variants that affect gene expression (expression quantitative trait loci, or eQTLs) is crucial to addressing this. eQTLs vary between tissues and following in vitro cellular activation, but have not been examined in the context of human inflammatory diseases. We performed eQTL mapping in five primary immune cell types from patients with active inflammatory bowel disease (n = 91), anti-neutrophil cytoplasmic antibody-associated vasculitis (n = 46) and healthy controls (n = 43), revealing eQTLs present only in the context of active inflammatory disease. Moreover, we show that following treatment a proportion of these eQTLs disappear. Through joint analysis of expression data from multiple cell types, we reveal that previous estimates of eQTL immune cell-type specificity are likely to have been exaggerated. Finally, by analysing gene expression data from multiple cell types, we find eQTLs not previously identified by database mining at 34 inflammatory bowel disease-associated loci. In summary, this parallel eQTL analysis in multiple leucocyte subsets from patients with active disease provides new insights into the genetic basis of immune-mediated diseases. PMID:27015630

  15. A novel syndrome of hypohidrosis and intellectual disability is linked to COG6 deficiency.

    PubMed

    Shaheen, Ranad; Ansari, Shinu; Alshammari, Muneera J; Alkhalidi, Hisham; Alrukban, Hadeel; Eyaid, Wafaa; Alkuraya, Fowzan S

    2013-07-01

    Numerous syndromic forms of intellectual disability have been described including those with abnormal sweating pattern. To describe the clinical and molecular analysis of a large multiplex consanguineous Saudi family with an unusual constellation of severe intellectual disability, hypohidrosis, abnormal teeth, and acquired microcephaly. Clinical evaluation, autozygosity mapping, exome sequencing, and expression analysis. Autozygosity mapping revealed a single critical locus corresponding to chr13:39 338 062-40 857 430. Exome sequencing uncovered a deep intronic (NM_020751.2:c.1167-24A>G) variant in COG6 that largely replaces the consensus acceptor site, resulting in pronounced reduction of the normal transcript and consequent deficiency of COG6 protein. Patient cells also exhibited pronounced deficiency of STX6, consistent with the established stabilising effect of COG6 on STX6. Four additional patients representing two families of the same tribal origin as the original family were found to have the same mutation, confirming a founder effect. Remarkably, none of the patients displayed any detectable abnormality in the glycosylation pattern of transferrin, which contradicts a previously published report of a patient whose abnormal glycosylation pattern was presumed to be caused by a missense variant in COG6. Our data implicate COG6 in the pathogenesis of a novel hypohidrotic disorder in humans that is distinct from congenital disorders of glycosylation.

  16. Single Nucleotide Polymorphism Markers for Genetic Mapping in Drosophila melanogaster

    PubMed Central

    Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed; Mapa, Felipa A.; Ruddy, David A.; Ryan, Jessica J.; Young, Lynn M.; Wells, Trent; Kopczynski, Casey; Ellis, Michael C.

    2001-01-01

    For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that recently have revolutionized human, mouse, and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila by using a sequence tagged site-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that span the genome. Most of these markers are single nucleotide polymorphisms and sequences for these variants are provided in an accessible format. The average density of the new markers is one per 225 kb on the autosomes and one per megabase on the X chromosome. We include in this survey a set of P-element strains that provide additional use for high-resolution mapping. We show one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community. PMID:11381036

  17. Single nucleotide polymorphism markers for genetic mapping in Drosophila melanogaster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed

    2001-04-16

    For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that have recently revolutionized human, mouse and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila using an STS-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that the genome. The majority of these markers are single nucleotide polymorphisms (SNPs) and sequences for these variants are provided in an accessible format. The average density of the new markersmore » is 1 marker per 225 kb on the autosomes and 1 marker per 1 Mb on the X chromosome. We include in this survey a set of P-element strains that provide additional utility for high-resolution mapping. We demonstrate one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community.« less

  18. A GIS-BASED MULTI-CRITERIA EVALUATION SYSTEM FOR SELECTION OF LANDFILL SITES: a case study from Abu Dhabi, United Arab Emirates

    NASA Astrophysics Data System (ADS)

    Issa, S. M.; Shehhi, B. Al

    2012-07-01

    Landfill sites receive 92% of total annual solid waste produced by municipalities in the emirate of Abu Dhabi. In this study, candidate sites for an appropriate landfill location for the Abu Dhabi municipal area are determined by integrating geographic information systems (GIS) and multi-criteria evaluation (MCE) analysis. To identify appropriate landfill sites, eight input map layers including proximity to urban areas, proximity to wells and water table depth, geology and topography, proximity to touristic and archeological sites, distance from roads network, distance from drainage networks, and land slope are used in constraint mapping. A final map was generated which identified potential areas showing suitability for the location of the landfill site. Results revealed that 30% of the study area was identified as highly suitable, 25% as suitable, and 45% as unsuitable. The selection of the final landfill site, however, requires further field research.

  19. Purinergic receptors P2RX4 and P2RX7 in familial multiple sclerosis

    PubMed Central

    Sadovnick, A Dessa; Gu, Ben J; Traboulsee, Anthony L; Bernales, Cecily Q; Encarnacion, Mary; Yee, Irene M; Criscuoli, Maria G; Huang, Xin; Ou, Amber; Milligan, Carol J; Petrou, Steven; Wiley, James S; Vilariño-Güell, Carles

    2017-01-01

    Genetic variants in the purinergic receptors P2RX4 and P2RX7 have been shown to affect susceptibility to multiple sclerosis (MS). In this study we set out to evaluate whether rare coding variants of major effect could also be identified in these purinergic receptors. Sequencing analysis of P2RX4 and P2RX7 in 193 MS patients and 100 controls led to the identification of a rare three variant haplotype (P2RX7 rs140915863:C>T (p.T205M), P2RX7 rs201921967:A>G (p.N361S) and P2RX4 rs765866317:G>A (p.G135S)) segregating with disease in a multi-incident family with six family members diagnosed with MS (LOD=3.07). Functional analysis of this haplotype in HEK293 cells revealed impaired P2X7 surface expression (p<0.01), resulting in over 95% inhibition of ATP-induced pore function (p<0.001) and a marked reduction in phagocytic ability (p<0.05). In addition, transfected cells showed 40% increased peak ATP-induced inward current (p<0.01), and a greater Ca2+ response to the P2X4 135S variant compared to wild type (p<0.0001). Our study nominates rare genetic variants in P2RX4 and P2RX7 as major genetic contributors to disease, further supporting a role for these purinergic receptors in MS and suggesting the disruption of transmembrane cation channels leading to impairment of phagocytosis as the pathological mechanisms of disease. PMID:28326637

  20. cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing.

    PubMed

    Takeuchi, Toshiki; Yamada, Atsuo; Aoki, Takashi; Nishimura, Kunihiro

    2016-01-01

    Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required. We have developed cljam using the Clojure programming language, which simplifies parallel programming, to handle SAM/BAM data. Cljam can run in a Java runtime environment (e.g., Windows, Linux, Mac OS X) with Clojure. Cljam can process and analyze SAM/BAM files in parallel and at high speed. The execution time with cljam is almost the same as with SAMtools. The cljam code is written in Clojure and has fewer lines than other similar tools.

  1. Multidimensional brain activity dictated by winner-take-all mechanisms.

    PubMed

    Tozzi, Arturo; Peters, James F

    2018-06-21

    A novel demon-based architecture is introduced to elucidate brain functions such as pattern recognition during human perception and mental interpretation of visual scenes. Starting from the topological concepts of invariance and persistence, we introduce a Selfridge pandemonium variant of brain activity that takes into account a novel feature, namely, demons that recognize short straight-line segments, curved lines and scene shapes, such as shape interior, density and texture. Low-level representations of objects can be mapped to higher-level views (our mental interpretations): a series of transformations can be gradually applied to a pattern in a visual scene, without affecting its invariant properties. This makes it possible to construct a symbolic multi-dimensional representation of the environment. These representations can be projected continuously to an object that we have seen and continue to see, thanks to the mapping from shapes in our memory to shapes in Euclidean space. Although perceived shapes are 3-dimensional (plus time), the evaluation of shape features (volume, color, contour, closeness, texture, and so on) leads to n-dimensional brain landscapes. Here we discuss the advantages of our parallel, hierarchical model in pattern recognition, computer vision and biological nervous system's evolution. Copyright © 2018 Elsevier B.V. All rights reserved.

  2. Natural Resource Assessments in Afghanistan Through High Resolution Digital Elevation Modeling and Multi-spectral Image Analysis

    NASA Technical Reports Server (NTRS)

    Chirico, Peter G.

    2007-01-01

    This viewgraph presentation provides USGS/USAID natural resource assessments in Afghanistan through the mapping of coal, oil and natural gas, minerals, hydrologic resources and earthquake and flood hazards.

  3. Expressed sequence tag analysis of human RPE/choroid for the NEIBank Project: over 6000 non-redundant transcripts, novel genes and splice variants.

    PubMed

    Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Fariss, Robert N; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

    2002-06-15

    The retinal pigment epithelium (RPE) and choroid comprise a functional unit of the eye that is essential to normal retinal health and function. Here we describe expressed sequence tag (EST) analysis of human RPE/choroid as part of a project for ocular bioinformatics. A cDNA library (cs) was made from human RPE/choroid and sequenced. Data were analyzed and assembled using the program GRIST (GRouping and Identification of Sequence Tags). Complete sequencing, Northern and Western blots, RH mapping, peptide antibody synthesis and immunofluorescence (IF) have been used to examine expression patterns and genome location for selected transcripts and proteins. Ten thousand individual sequence reads yield over 6300 unique gene clusters of which almost half have no matches with named genes. One of the most abundant transcripts is from a gene (named "alpha") that maps to the BBS1 region of chromosome 11. A number of tissue preferred transcripts are common to both RPE/choroid and iris. These include oculoglycan/opticin, for which an alternative splice form is detected in RPE/choroid, and "oculospanin" (Ocsp), a novel tetraspanin that maps to chromosome 17q. Antiserum to Ocsp detects expression in RPE, iris, ciliary body, and retinal ganglion cells by IF. A newly identified gene for a zinc-finger protein (TIRC) maps to 19q13.4. Variant transcripts of several genes were also detected. Most notably, the predominant form of Bestrophin represented in cs contains a longer open reading frame as a result of splice junction skipping. The unamplified cs library gives a view of the transcriptional repertoire of the adult RPE/choroid. A large number of potentially novel genes and splice forms and candidates for genetic diseases are revealed. Clones from this collection are being included in a large, nonredundant set for cDNA microarray construction.

  4. Synthesis multi-projector content for multi-projector three dimension display using a layered representation

    NASA Astrophysics Data System (ADS)

    Qin, Chen; Ren, Bin; Guo, Longfei; Dou, Wenhua

    2014-11-01

    Multi-projector three dimension display is a promising multi-view glass-free three dimension (3D) display technology, can produce full colour high definition 3D images on its screen. One key problem of multi-projector 3D display is how to acquire the source images of projector array while avoiding pseudoscopic problem. This paper analysis the displaying characteristics of multi-projector 3D display first and then propose a projector content synthetic method using tetrahedral transform. A 3D video format that based on stereo image pair and associated disparity map is presented, it is well suit for any type of multi-projector 3D display and has advantage in saving storage usage. Experiment results show that our method solved the pseudoscopic problem.

  5. LLMapReduce: Multi-Level Map-Reduce for High Performance Data Analysis

    DTIC Science & Technology

    2016-05-23

    LLMapReduce works with several schedulers such as SLURM, Grid Engine and LSF. Keywords—LLMapReduce; map-reduce; performance; scheduler; Grid Engine ...SLURM; LSF I. INTRODUCTION Large scale computing is currently dominated by four ecosystems: supercomputing, database, enterprise , and big data [1...interconnects [6]), High performance math libraries (e.g., BLAS [7, 8], LAPACK [9], ScaLAPACK [10]) designed to exploit special processing hardware, High

  6. Mutation of Phe413 to Tyr in catalase KatE from Escherichia coli leads to side chain damage and main chain cleavage.

    PubMed

    Jha, Vikash; Donald, Lynda J; Loewen, Peter C

    2012-09-15

    The monofunctional catalase KatE of Esherichia coli exhibits exceptional resistance to heat denaturation and proteolytic degradation. During an investigation of subtle conformation changes in Arg111 and Phe413 on the proximal side of the heme induced by H(2)O(2), variants at position R111, T115 and F413 were constructed. Because the residues are not situated in the distal side heme cavity where catalysis occurs, significant changes in reactivity were not expected and indeed, only small changes in the kinetic characteristics were observed in all of the variants. However, the F413Y variant was found to have undergone main chain cleavage whereas the R111A, T115A, F413E and F413K variants had not. Two sites of cleavage were identified in the crystal structure and by mass spectrometry at residues 111 and 115. In addition to main chain cleavage, modifications to the side chains of Tyr413, Thr115 and Arg111 were suggested by differences in the electron density maps compared to maps of the native and inactive variant H128N/F413Y. The inactive variant H128N/F413Y and the active variant T115A/F413Y both did not exhibit main chain cleavage and the R11A/F413Y variant exhibited less cleavage. In addition, the apparent modification of three side chains was largely absent in these variants. It is also significant that all three F413 single variants contained heme b suggesting that the fidelity of the phenyl group was important for mediating heme b oxidation to heme d. The reactions are attributed to the introduction of a new reactive center possibly involving a transient radical on Tyr413 formed during catalytic turn over. Copyright © 2011 Elsevier Inc. All rights reserved.

  7. Mapping healthcare systems: a policy relevant analytic tool.

    PubMed

    Sekhri Feachem, Neelam; Afshar, Ariana; Pruett, Cristina; Avanceña, Anton L V

    2017-07-01

    In the past decade, an international consensus on the value of well-functioning systems has driven considerable health systems research. This research falls into two broad categories. The first provides conceptual frameworks that take complex healthcare systems and create simplified constructs of interactions and functions. The second focuses on granular inputs and outputs. This paper presents a novel translational mapping tool - the University of California, San Francisco mapping tool (the Tool) - which bridges the gap between these two areas of research, creating a platform for multi-country comparative analysis. Using the Murray-Frenk framework, we create a macro-level representation of a country's structure, focusing on how it finances and delivers healthcare. The map visually depicts the fundamental policy questions in healthcare system design: funding sources and amount spent through each source, purchasers, populations covered, provider categories; and the relationship between these entities. We use the Tool to provide a macro-level comparative analysis of the structure of India's and Thailand's healthcare systems. As part of the systems strengthening arsenal, the Tool can stimulate debate about the merits and consequences of different healthcare systems structural designs, using a common framework that fosters multi-country comparative analyses. © The Author 2017. Published by Oxford University Press on behalf of Royal Society of Tropical Medicine and Hygiene.

  8. Computer-composite mapping for geologists

    USGS Publications Warehouse

    van Driel, J.N.

    1980-01-01

    A computer program for overlaying maps has been tested and evaluated as a means for producing geologic derivative maps. Four maps of the Sugar House Quadrangle, Utah, were combined, using the Multi-Scale Data Analysis and Mapping Program, in a single composite map that shows the relative stability of the land surface during earthquakes. Computer-composite mapping can provide geologists with a powerful analytical tool and a flexible graphic display technique. Digitized map units can be shown singly, grouped with different units from the same map, or combined with units from other source maps to produce composite maps. The mapping program permits the user to assign various values to the map units and to specify symbology for the final map. Because of its flexible storage, easy manipulation, and capabilities of graphic output, the composite-mapping technique can readily be applied to mapping projects in sedimentary and crystalline terranes, as well as to maps showing mineral resource potential. ?? 1980 Springer-Verlag New York Inc.

  9. MutAid: Sanger and NGS Based Integrated Pipeline for Mutation Identification, Validation and Annotation in Human Molecular Genetics.

    PubMed

    Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas

    2016-01-01

    Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid.

  10. MutAid: Sanger and NGS Based Integrated Pipeline for Mutation Identification, Validation and Annotation in Human Molecular Genetics

    PubMed Central

    Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas

    2016-01-01

    Traditional Sanger sequencing as well as Next-Generation Sequencing have been used for the identification of disease causing mutations in human molecular research. The majority of currently available tools are developed for research and explorative purposes and often do not provide a complete, efficient, one-stop solution. As the focus of currently developed tools is mainly on NGS data analysis, no integrative solution for the analysis of Sanger data is provided and consequently a one-stop solution to analyze reads from both sequencing platforms is not available. We have therefore developed a new pipeline called MutAid to analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and analysis of Sanger and NGS data under a single platform. It is capable of analyzing reads from multiple patients in a single run to create a list of potential disease causing base substitutions as well as insertions and deletions. MutAid has been developed for expert and non-expert users and supports four sequencing platforms including Sanger, Illumina, 454 and Ion Torrent. Furthermore, for NGS data analysis, five read mappers including BWA, TMAP, Bowtie, Bowtie2 and GSNAP and four variant callers including GATK-HaplotypeCaller, SAMTOOLS, Freebayes and VarScan2 pipelines are supported. MutAid is freely available at https://sourceforge.net/projects/mutaid. PMID:26840129

  11. Overexpression of the Cytokine BAFF and Autoimmunity Risk.

    PubMed

    Steri, Maristella; Orrù, Valeria; Idda, M Laura; Pitzalis, Maristella; Pala, Mauro; Zara, Ilenia; Sidore, Carlo; Faà, Valeria; Floris, Matteo; Deiana, Manila; Asunis, Isadora; Porcu, Eleonora; Mulas, Antonella; Piras, Maria G; Lobina, Monia; Lai, Sandra; Marongiu, Mara; Serra, Valentina; Marongiu, Michele; Sole, Gabriella; Busonero, Fabio; Maschio, Andrea; Cusano, Roberto; Cuccuru, Gianmauro; Deidda, Francesca; Poddie, Fausto; Farina, Gabriele; Dei, Mariano; Virdis, Francesca; Olla, Stefania; Satta, Maria A; Pani, Mario; Delitala, Alessandro; Cocco, Eleonora; Frau, Jessica; Coghe, Giancarlo; Lorefice, Lorena; Fenu, Giuseppe; Ferrigno, Paola; Ban, Maria; Barizzone, Nadia; Leone, Maurizio; Guerini, Franca R; Piga, Matteo; Firinu, Davide; Kockum, Ingrid; Lima Bomfim, Izaura; Olsson, Tomas; Alfredsson, Lars; Suarez, Ana; Carreira, Patricia E; Castillo-Palma, Maria J; Marcus, Joseph H; Congia, Mauro; Angius, Andrea; Melis, Maurizio; Gonzalez, Antonio; Alarcón Riquelme, Marta E; da Silva, Berta M; Marchini, Maurizio; Danieli, Maria G; Del Giacco, Stefano; Mathieu, Alessandro; Pani, Antonello; Montgomery, Stephen B; Rosati, Giulio; Hillert, Jan; Sawcer, Stephen; D'Alfonso, Sandra; Todd, John A; Novembre, John; Abecasis, Gonçalo R; Whalen, Michael B; Marrosu, Maria G; Meloni, Alessandra; Sanna, Serena; Gorospe, Myriam; Schlessinger, David; Fiorillo, Edoardo; Zoledziewska, Magdalena; Cucca, Francesco

    2017-04-27

    Genomewide association studies of autoimmune diseases have mapped hundreds of susceptibility regions in the genome. However, only for a few association signals has the causal gene been identified, and for even fewer have the causal variant and underlying mechanism been defined. Coincident associations of DNA variants affecting both the risk of autoimmune disease and quantitative immune variables provide an informative route to explore disease mechanisms and drug-targetable pathways. Using case-control samples from Sardinia, Italy, we performed a genomewide association study in multiple sclerosis followed by TNFSF13B locus-specific association testing in systemic lupus erythematosus (SLE). Extensive phenotyping of quantitative immune variables, sequence-based fine mapping, cross-population and cross-phenotype analyses, and gene-expression studies were used to identify the causal variant and elucidate its mechanism of action. Signatures of positive selection were also investigated. A variant in TNFSF13B, encoding the cytokine and drug target B-cell activating factor (BAFF), was associated with multiple sclerosis as well as SLE. The disease-risk allele was also associated with up-regulated humoral immunity through increased levels of soluble BAFF, B lymphocytes, and immunoglobulins. The causal variant was identified: an insertion-deletion variant, GCTGT→A (in which A is the risk allele), yielded a shorter transcript that escaped microRNA inhibition and increased production of soluble BAFF, which in turn up-regulated humoral immunity. Population genetic signatures indicated that this autoimmunity variant has been evolutionarily advantageous, most likely by augmenting resistance to malaria. A TNFSF13B variant was associated with multiple sclerosis and SLE, and its effects were clarified at the population, cellular, and molecular levels. (Funded by the Italian Foundation for Multiple Sclerosis and others.).

  12. Modeling Land Use/Cover Changes in an African Rural Landscape

    NASA Astrophysics Data System (ADS)

    Kamusoko, C.; Aniya, M.

    2006-12-01

    Land use/cover changes are analyzed in the Bindura district of Zimbabwe, Africa through the integration of data from a time series of Landsat imagery (1973, 1989 and 2000), a household survey and GIS coverages. We employed a hybrid supervised/unsupervised classification approach to generate land use/cover maps from which landscape metrics were calculated. Population and other household variables were derived from a sample of surveyed villages, while road accessibility and slope were obtained from topographic maps and digital elevation model, respectively. Markov-cellular automata modeling approach that incorporates Markov chain analysis, cellular automata and multi-criteria evaluation (MCE) / multi-objective allocation (MOLA) procedures was used to simulate land use/cover changes. A GIS-based MCE technique computed transition potential maps, whereas transition areas were derived from the 1973-2000 land use/cover maps using the Markov chain analysis. A 5 x 5 cellular automata filter was used to develop a spatially explicit contiguity- weighting factor to change the cells based on its previous state and those of its neighbors, while MOLA resolved land use/cover class allocation conflicts. The kappa index of agreement was used for model validation. Observed trends in land use/cover changes indicate that deforestation and the encroachment of cultivation in woodland areas is a continuous trend in the study area. This suggests that economic activities driven by agricultural expansion were the main causes of landscape fragmentation, leading to landscape degradation. Rigorous calibration of transition potential maps done by a MCE algorithm and Markovian transition probabilities produced accurate inputs for the simulation of land use/cover changes. Overall standard kappa index of agreement ranged from 0.73 to 0.83, which is sufficient for simulating land use/cover changes in the study area. Land use/cover simulations under the 1989 and 2000 scenario indicated further landscape degradation in the rural areas of the Bindura district. Keywords: Zimbabwe, land use/cover changes, landscape fragmentation, GIS, land use/cover change modeling, multi-criteria evaluation/multi-objective allocation procedures, Markov-cellular automata

  13. Germline mutations in the proof-reading domains of POLE and POLD1 predispose to colorectal adenomas and carcinomas

    PubMed Central

    Palles, Claire; Cazier, Jean-Baptiste; Howarth, Kimberley M; Domingo, Enric; Jones, Angela M.; Broderick, Peter; Kemp, Zoe; Spain, Sarah L; Almeida, Estrella Guarino; Salguero, Israel; Sherborne, Amy; Chubb, Daniel; Carvajal-Carmona, Luis G; Ma, Yusanne; Kaur, Kulvinder; Dobbins, Sara; Barclay, Ella; Gorman, Maggie; Martin, Lynn; Kovac, Michal B; Humphray, Sean; Lucassen, Anneke; Holmes, Christopher; Bentley, David; Donnelly, Peter; Taylor, Jenny; Petridis, Christos; Roylance, Rebecca; Sawyer, Elinor J; Kerr, David J.; Clark, Susan; Grimes, Jonathan; Kearsey, Stephen E; Thomas, Huw JW; McVean, Gilean; Houlston, Richard S; Tomlinson, Ian

    2013-01-01

    Many individuals with multiple or large colorectal adenomas, or early-onset colorectal cancer (CRC), have no detectable germline mutations in the known cancer predisposition genes. Using whole-genome sequencing, supplemented by linkage and association analysis, we identified specific heterozygous POLE or POLD1 germline variants in several multiple adenoma and/or CRC cases, but in no controls. The susceptibility variants appear to have high penetrance. POLD1 is also associated with endometrial cancer predisposition. The mutations map to equivalent sites in the proof-reading (exonuclease) domain of DNA polymerases ε and δ, and are predicted to impair correction of mispaired bases inserted during DNA replication. In agreement with this prediction, mutation carriers’ tumours were microsatellite-stable, but tended to acquire base substitution mutations, as confirmed by yeast functional assays. Further analysis of published data showed that the recently-described group of hypermutant, microsatellite-stable CRCs is likely to be caused by somatic POLE exonuclease domain mutations. PMID:23263490

  14. Genome sequencing reveals loci under artificial selection that underlie disease phenotypes in the laboratory rat.

    PubMed

    Atanur, Santosh S; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R; Kaisaki, Pamela J; Otto, Georg W; Ma, Man Chun John; Keane, Thomas M; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J

    2013-08-01

    Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Application of Nexus copy number software for CNV detection and analysis.

    PubMed

    Darvishi, Katayoon

    2010-04-01

    Among human structural genomic variation, copy number variants (CNVs) are the most frequently known component, comprised of gains/losses of DNA segments that are generally 1 kb in length or longer. Array-based comparative genomic hybridization (aCGH) has emerged as a powerful tool for detecting genomic copy number variants (CNVs). With the rapid increase in the density of array technology and with the adaptation of new high-throughput technology, a reliable and computationally scalable method for accurate mapping of recurring DNA copy number aberrations has become a main focus in research. Here we introduce Nexus Copy Number software, a platform-independent tool, to analyze the output files of all types of commercial and custom-made comparative genomic hybridization (CGH) and single-nucleotide polymorphism (SNP) arrays, such as those manufactured by Affymetrix, Agilent Technologies, Illumina, and Roche NimbleGen. It also supports data generated by various array image-analysis software tools such as GenePix, ImaGene, and BlueFuse. (c) 2010 by John Wiley & Sons, Inc.

  16. Genome Sequencing Reveals Loci under Artificial Selection that Underlie Disease Phenotypes in the Laboratory Rat

    PubMed Central

    Atanur, Santosh S.; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R.; Kaisaki, Pamela J.; Otto, Georg W.; Ma, Man Chun John; Keane, Thomas M.; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R.; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J.; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J.

    2013-01-01

    Summary Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. PaperClip PMID:23890820

  17. Dynamic labelling of neural connections in multiple colours by trans-synaptic fluorescence complementation

    PubMed Central

    Macpherson, Lindsey J.; Zaharieva, Emanuela E.; Kearney, Patrick J.; Alpert, Michael H.; Lin, Tzu-Yang; Turan, Zeynep; Lee, Chi-Hon; Gallio, Marco

    2015-01-01

    Determining the pattern of activity of individual connections within a neural circuit could provide insights into the computational processes that underlie brain function. Here, we develop new strategies to label active synapses by trans-synaptic fluorescence complementation in Drosophila. First, we demonstrate that a synaptobrevin-GRASP chimera functions as a powerful activity-dependent marker for synapses in vivo. Next, we create cyan and yellow variants, achieving activity-dependent, multi-colour fluorescence reconstitution across synapses (X-RASP). Our system allows for the first time retrospective labelling of synapses (rather than whole neurons) based on their activity, in multiple colours, in the same animal. As individual synapses often act as computational units in the brain, our method will promote the design of experiments that are not possible using existing techniques. Moreover, our strategies are easily adaptable to circuit mapping in any genetic system. PMID:26635273

  18. Novel Loci for Metabolic Networks and Multi-Tissue Expression Studies Reveal Genes for Atherosclerosis

    PubMed Central

    Inouye, Michael; Ripatti, Samuli; Kettunen, Johannes; Lyytikäinen, Leo-Pekka; Oksala, Niku; Laurila, Pirkka-Pekka; Kangas, Antti J.; Soininen, Pasi; Savolainen, Markku J.; Viikari, Jorma; Kähönen, Mika; Perola, Markus; Salomaa, Veikko; Raitakari, Olli; Lehtimäki, Terho; Taskinen, Marja-Riitta; Järvelin, Marjo-Riitta; Ala-Korpela, Mika; Palotie, Aarno; de Bakker, Paul I. W.

    2012-01-01

    Association testing of multiple correlated phenotypes offers better power than univariate analysis of single traits. We analyzed 6,600 individuals from two population-based cohorts with both genome-wide SNP data and serum metabolomic profiles. From the observed correlation structure of 130 metabolites measured by nuclear magnetic resonance, we identified 11 metabolic networks and performed a multivariate genome-wide association analysis. We identified 34 genomic loci at genome-wide significance, of which 7 are novel. In comparison to univariate tests, multivariate association analysis identified nearly twice as many significant associations in total. Multi-tissue gene expression studies identified variants in our top loci, SERPINA1 and AQP9, as eQTLs and showed that SERPINA1 and AQP9 expression in human blood was associated with metabolites from their corresponding metabolic networks. Finally, liver expression of AQP9 was associated with atherosclerotic lesion area in mice, and in human arterial tissue both SERPINA1 and AQP9 were shown to be upregulated (6.3-fold and 4.6-fold, respectively) in atherosclerotic plaques. Our study illustrates the power of multi-phenotype GWAS and highlights candidate genes for atherosclerosis. PMID:22916037

  19. Association of Germline CHEK2 Gene Variants with Risk and Prognosis of Non-Hodgkin Lymphoma

    PubMed Central

    Havranek, Ondrej; Kleiblova, Petra; Hojny, Jan; Lhota, Filip; Soucek, Pavel; Trneny, Marek; Kleibl, Zdenek

    2015-01-01

    The checkpoint kinase 2 gene (CHEK2) codes for the CHK2 protein, an important mediator of the DNA damage response pathway. The CHEK2 gene has been recognized as a multi-cancer susceptibility gene; however, its role in non-Hodgkin lymphoma (NHL) remains unclear. We performed mutation analysis of the entire CHEK2 coding sequence in 340 NHL patients using denaturing high-performance liquid chromatography (DHPLC) and multiplex ligation-dependent probe amplification (MLPA). Identified hereditary variants were genotyped in 445 non-cancer controls. The influence of CHEK2 variants on disease risk was statistically evaluated. Identified CHEK2 germline variants included four truncating mutations (found in five patients and no control; P = 0.02) and nine missense variants (found in 21 patients and 12 controls; P = 0.02). Carriers of non-synonymous variants had an increased risk of NHL development [odds ratio (OR) 2.86; 95% confidence interval (CI) 1.42–5.79] and an unfavorable prognosis [hazard ratio (HR) of progression-free survival (PFS) 2.1; 95% CI 1.12–4.05]. In contrast, the most frequent intronic variant c.319+43dupA (identified in 22% of patients and 31% of controls) was associated with a decreased NHL risk (OR = 0.62; 95% CI 0.45–0.86), but its positive prognostic effect was limited to NHL patients with diffuse large B-cell lymphoma (DLBCL) treated by conventional chemotherapy without rituximab (HR-PFS 0.4; 94% CI 0.17–0.74). Our results show that germ-line CHEK2 mutations affecting protein coding sequence confer a moderately-increased risk of NHL, they are associated with an unfavorable NHL prognosis, and they may represent a valuable predictive biomarker for patients with DLBCL. PMID:26506619

  20. Association of Germline CHEK2 Gene Variants with Risk and Prognosis of Non-Hodgkin Lymphoma.

    PubMed

    Havranek, Ondrej; Kleiblova, Petra; Hojny, Jan; Lhota, Filip; Soucek, Pavel; Trneny, Marek; Kleibl, Zdenek

    2015-01-01

    The checkpoint kinase 2 gene (CHEK2) codes for the CHK2 protein, an important mediator of the DNA damage response pathway. The CHEK2 gene has been recognized as a multi-cancer susceptibility gene; however, its role in non-Hodgkin lymphoma (NHL) remains unclear. We performed mutation analysis of the entire CHEK2 coding sequence in 340 NHL patients using denaturing high-performance liquid chromatography (DHPLC) and multiplex ligation-dependent probe amplification (MLPA). Identified hereditary variants were genotyped in 445 non-cancer controls. The influence of CHEK2 variants on disease risk was statistically evaluated. Identified CHEK2 germline variants included four truncating mutations (found in five patients and no control; P = 0.02) and nine missense variants (found in 21 patients and 12 controls; P = 0.02). Carriers of non-synonymous variants had an increased risk of NHL development [odds ratio (OR) 2.86; 95% confidence interval (CI) 1.42-5.79] and an unfavorable prognosis [hazard ratio (HR) of progression-free survival (PFS) 2.1; 95% CI 1.12-4.05]. In contrast, the most frequent intronic variant c.319+43dupA (identified in 22% of patients and 31% of controls) was associated with a decreased NHL risk (OR = 0.62; 95% CI 0.45-0.86), but its positive prognostic effect was limited to NHL patients with diffuse large B-cell lymphoma (DLBCL) treated by conventional chemotherapy without rituximab (HR-PFS 0.4; 94% CI 0.17-0.74). Our results show that germ-line CHEK2 mutations affecting protein coding sequence confer a moderately-increased risk of NHL, they are associated with an unfavorable NHL prognosis, and they may represent a valuable predictive biomarker for patients with DLBCL.

  1. Analysis of Shared Haplotypes amongst Palauans Maps Loci for Psychotic Disorders to 4q28 and 5q23-q31.

    PubMed

    Bodea, Corneliu A; Middleton, Frank A; Melhem, Nadine M; Klei, Lambertus; Song, Youeun; Tiobech, Josepha; Marumoto, Pearl; Yano, Victor; Faraone, Stephen V; Roeder, Kathryn; Myles-Worsley, Marina; Devlin, Bernie; Byerley, William

    2017-02-01

    To localize genetic variation affecting risk for psychotic disorders in the population of Palau, we genotyped DNA samples from 203 Palauan individuals diagnosed with psychotic disorders, broadly defined, and 125 control subjects using a genome-wide single nucleotide polymorphism array. Palau has unique features advantageous for this study: due to its population history, Palauans are substantially interrelated; affected individuals often, but not always, cluster in families; and we have essentially complete ascertainment of affected individuals. To localize risk variants to genomic regions, we evaluated long-shared haplotypes, ≥10 Mb, identifying clusters of affected individuals who share such haplotypes. This extensive sharing, typically identical by descent, was significantly greater in cases than population controls, even after controlling for relatedness. Several regions of the genome exhibited substantial excess of shared haplotypes for affected individuals, including 3p21, 3p12, 4q28, and 5q23-q31. Two of these regions, 4q28 and 5q23-q31, showed significant linkage by traditional LOD score analysis and could harbor variants of more sizeable risk for psychosis or a multiplicity of risk variants. The pattern of haplotype sharing in 4q28 highlights PCDH10 , encoding a cadherin-related neuronal receptor, as possibly involved in risk.

  2. aes, the gene encoding the esterase B in Escherichia coli, is a powerful phylogenetic marker of the species.

    PubMed

    Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand

    2009-12-29

    Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.

  3. TANK SPACE ALTERNATIVES ANALYSIS REPORT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    TURNER DA; KIRCH NW; WASHENFELDER DJ

    2010-04-27

    This report addresses the projected shortfall of double-shell tank (DST) space starting in 2018. Using a multi-variant methodology, a total of eight new-term options and 17 long-term options for recovering DST space were evaluated. These include 11 options that were previously evaluated in RPP-7702, Tank Space Options Report (Rev. 1). Based on the results of this evaluation, two near-term and three long-term options have been identified as being sufficient to overcome the shortfall of DST space projected to occur between 2018 and 2025.

  4. Discovery and fine-mapping of adiposity loci using high density imputation of genome-wide association studies in individuals of African ancestry: African Ancestry Anthropometry Genetics Consortium

    PubMed Central

    Lu, Yingchang; Justice, Anne E.; Mudgal, Poorva; Liu, Ching-Ti; Young, Kristin; Feitosa, Mary F.; Rand, Kristin; Dimitrov, Latchezar; Duan, Qing; Guo, Xiuqing; Lange, Leslie A.; Nalls, Michael A.; Okut, Hayrettin; Tayo, Bamidele O.; Vedantam, Sailaja; Bradfield, Jonathan P.; Chen, Guanjie; Chesi, Alessandra; Irvin, Marguerite R.; Padhukasahasram, Badri; Zheng, Wei; Allison, Matthew A.; Ambrosone, Christine B.; Bandera, Elisa V.; Berndt, Sonja I.; Blot, William J.; Bottinger, Erwin P.; Carpten, John; Chanock, Stephen J.; Chen, Yii-Der Ida; Conti, David V.; Cooper, Richard S.; Fornage, Myriam; Freedman, Barry I.; Garcia, Melissa; Goodman, Phyllis J.; Hsu, Yu-Han H.; Hu, Jennifer; Huff, Chad D.; Ingles, Sue A.; John, Esther M.; Kittles, Rick; Klein, Eric; Li, Jin; McKnight, Barbara; Nayak, Uma; Nemesure, Barbara; Olshan, Andrew; Salako, Babatunde; Sanderson, Maureen; Shao, Yaming; Siscovick, David S.; Stanford, Janet L.; Strom, Sara S.; Witte, John S.; Yao, Jie; Zhu, Xiaofeng; Ziegler, Regina G.; Zonderman, Alan B.; Ambs, Stefan; Cushman, Mary; Faul, Jessica D.; Hakonarson, Hakon; Levin, Albert M.; Nathanson, Katherine L.; Weir, David R.; Zhi, Degui; Arnett, Donna K.; Kardia, Sharon L. R.; Oloapde, Olufunmilayo I.; Rao, D. C.; Williams, L. Keoki; Becker, Diane M.; Borecki, Ingrid B.; Evans, Michele K.; Harris, Tamara B.; Hirschhorn, Joel N.; Psaty, Bruce M.; Wilson, James G.; Bowden, Donald W.; Cupples, L. Adrienne; Haiman, Christopher A.; Loos, Ruth J. F.; North, Kari E.

    2017-01-01

    Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10−8: seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal. As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations. PMID:28430825

  5. An alternative to Rasch analysis using triadic comparisons and multi-dimensional scaling

    NASA Astrophysics Data System (ADS)

    Bradley, C.; Massof, R. W.

    2016-11-01

    Rasch analysis is a principled approach for estimating the magnitude of some shared property of a set of items when a group of people assign ordinal ratings to them. In the general case, Rasch analysis not only estimates person and item measures on the same invariant scale, but also estimates the average thresholds used by the population to define rating categories. However, Rasch analysis fails when there is insufficient variance in the observed responses because it assumes a probabilistic relationship between person measures, item measures and the rating assigned by a person to an item. When only a single person is rating all items, there may be cases where the person assigns the same rating to many items no matter how many times he rates them. We introduce an alternative to Rasch analysis for precisely these situations. Our approach leverages multi-dimensional scaling (MDS) and requires only rank orderings of items and rank orderings of pairs of distances between items to work. Simulations show one variant of this approach - triadic comparisons with non-metric MDS - provides highly accurate estimates of item measures in realistic situations.

  6. Utilization of gene mapping and candidate gene mutation screening for diagnosing clinically equivocal conditions: a Norrie disease case study.

    PubMed

    Chini, Vasiliki; Stambouli, Danai; Nedelea, Florina Mihaela; Filipescu, George Alexandru; Mina, Diana; Kambouris, Marios; El-Shantil, Hatem

    2014-06-01

    Prenatal diagnosis was requested for an undiagnosed eye disease showing X-linked inheritance in a family. No medical records existed for the affected family members. Mapping of the X chromosome and candidate gene mutation screening identified a c.C267A[p.F89L] mutation in NPD previously described as possibly causing Norrie disease. The detection of the c.C267A[p.F89L] variant in another unrelated family confirms the pathogenic nature of the mutation for the Norrie disease phenotype. Gene mapping, haplotype analysis, and candidate gene screening have been previously utilized in research applications but were applied here in a diagnostic setting due to the scarcity of available clinical information. The clinical diagnosis and mutation identification were critical for providing proper genetic counseling and prenatal diagnosis for this family.

  7. A new national mosaic of state landcover data

    USGS Publications Warehouse

    Thomas, I.; Handley, Lawrence R.; D'Erchia, Frank J.; Charron, Tammy M.

    2000-01-01

    This presentation reviewed current landcover mapping efforts and presented a new preliminary, national mosaic of Gap Analysis Program (GAP) and Multi-Resolution Land Characteristics Consortium (MRLC) landcover data with a discussion of techniques, problems faced, and future refinements.

  8. Identification of new susceptibility loci for type 2 diabetes and shared etiological pathways with coronary heart disease.

    PubMed

    Zhao, Wei; Rasheed, Asif; Tikkanen, Emmi; Lee, Jung-Jin; Butterworth, Adam S; Howson, Joanna M M; Assimes, Themistocles L; Chowdhury, Rajiv; Orho-Melander, Marju; Damrauer, Scott; Small, Aeron; Asma, Senay; Imamura, Minako; Yamauch, Toshimasa; Chambers, John C; Chen, Peng; Sapkota, Bishwa R; Shah, Nabi; Jabeen, Sehrish; Surendran, Praveen; Lu, Yingchang; Zhang, Weihua; Imran, Atif; Abbas, Shahid; Majeed, Faisal; Trindade, Kevin; Qamar, Nadeem; Mallick, Nadeem Hayyat; Yaqoob, Zia; Saghir, Tahir; Rizvi, Syed Nadeem Hasan; Memon, Anis; Rasheed, Syed Zahed; Memon, Fazal-Ur-Rehman; Mehmood, Khalid; Ahmed, Naveeduddin; Qureshi, Irshad Hussain; Tanveer-Us-Salam; Iqbal, Wasim; Malik, Uzma; Mehra, Narinder; Kuo, Jane Z; Sheu, Wayne H-H; Guo, Xiuqing; Hsiung, Chao A; Juang, Jyh-Ming J; Taylor, Kent D; Hung, Yi-Jen; Lee, Wen-Jane; Quertermous, Thomas; Lee, I-Te; Hsu, Chih-Cheng; Bottinger, Erwin P; Ralhan, Sarju; Teo, Yik Ying; Wang, Tzung-Dau; Alam, Dewan S; Di Angelantonio, Emanuele; Epstein, Steve; Nielsen, Sune F; Nordestgaard, Børge G; Tybjaerg-Hansen, Anne; Young, Robin; Benn, Marianne; Frikke-Schmidt, Ruth; Kamstrup, Pia R; Jukema, J Wouter; Sattar, Naveed; Smit, Roelof; Chung, Ren-Hua; Liang, Kae-Woei; Anand, Sonia; Sanghera, Dharambir K; Ripatti, Samuli; Loos, Ruth J F; Kooner, Jaspal S; Tai, E Shyong; Rotter, Jerome I; Chen, Yii-Der Ida; Frossard, Philippe; Maeda, Shiro; Kadowaki, Takashi; Reilly, Muredach; Pare, Guillaume; Melander, Olle; Salomaa, Veikko; Rader, Daniel J; Danesh, John; Voight, Benjamin F; Saleheen, Danish

    2017-10-01

    To evaluate the shared genetic etiology of type 2 diabetes (T2D) and coronary heart disease (CHD), we conducted a genome-wide, multi-ancestry study of genetic variation for both diseases in up to 265,678 subjects for T2D and 260,365 subjects for CHD. We identify 16 previously unreported loci for T2D and 1 locus for CHD, including a new T2D association at a missense variant in HLA-DRB5 (odds ratio (OR) = 1.29). We show that genetically mediated increase in T2D risk also confers higher CHD risk. Joint T2D-CHD analysis identified eight variants-two of which are coding-where T2D and CHD associations appear to colocalize, including a new joint T2D-CHD association at the CCDC92 locus that also replicated for T2D. The variants associated with both outcomes implicate new pathways as well as targets of existing drugs, including icosapent ethyl and adipocyte fatty-acid-binding protein.

  9. Hybrid time-variant reliability estimation for active control structures under aleatory and epistemic uncertainties

    NASA Astrophysics Data System (ADS)

    Wang, Lei; Xiong, Chuang; Wang, Xiaojun; Li, Yunlong; Xu, Menghui

    2018-04-01

    Considering that multi-source uncertainties from inherent nature as well as the external environment are unavoidable and severely affect the controller performance, the dynamic safety assessment with high confidence is of great significance for scientists and engineers. In view of this, the uncertainty quantification analysis and time-variant reliability estimation corresponding to the closed-loop control problems are conducted in this study under a mixture of random, interval, and convex uncertainties. By combining the state-space transformation and the natural set expansion, the boundary laws of controlled response histories are first confirmed with specific implementation of random items. For nonlinear cases, the collocation set methodology and fourth Rounge-Kutta algorithm are introduced as well. Enlightened by the first-passage model in random process theory as well as by the static probabilistic reliability ideas, a new definition of the hybrid time-variant reliability measurement is provided for the vibration control systems and the related solution details are further expounded. Two engineering examples are eventually presented to demonstrate the validity and applicability of the methodology developed.

  10. Multi-Locus Candidate Gene Analyses of Lipid Levels in a Pediatric Turkish Cohort: Lessons Learned on LPL, CETP, LIPC, ABCA1, and SHBG

    PubMed Central

    Eren, Fatih; Agirbasli, Deniz; White, Marquitta J.; Williams, Scott M

    2013-01-01

    Abstract Cardiovascular risk factors and atherosclerosis precursors were examined in 365 Turkish children and adolescents. Study participants were recruited at five different state schools. We tested single and multi-locus effects of six polymorphisms from five candidate genes, chosen based on prior known association with lipid levels in adults, for association with low (≤10th percentile) high density lipoprotein cholesterol (HDL-C) and high (≥90th percentile) triglycerides (TG), and the related continuous outcomes. We observed an association between CETP variant rs708272 and low HDL-C (allelic p=0.020, genotypic p=0.046), which was supported by an independent analysis, PRAT (PRAT control p=0.027). Sex-stratified logistic regression analysis showed that the B2 allele of rs708272 decreased odds of being in the lower tenth percentile of HDL-C measurements (OR=0.36, p=0.02) in girls; this direction of effect was also seen in boys but was not significant (OR=0.64, p=0.21). Logistic regression analysis also revealed that the T allele of rs6257 (SHBG) decreased odds of being in the top tenth percentile of TG measurements in boys (OR=0.43, p=0.03). Analysis of lipid levels as a continuous trait revealed a significant association between rs708272 (CETP) and LDL-C levels in males (p=0.02) with the B2B2 genotype group having the lowest mean LDL-C; the same direction of effect was also seen in females (p=0.05). An effect was also seen between rs708272 and HDL-C levels in girls (p=0.01), with the B2B2 genotype having the highest mean HDL-C levels. Multi-locus analysis, using quantitative multifactor dimensionality reduction (qMDR) identified the previously mentioned CETP variant as the best single locus model, and overall model, for predicting HDL-C levels in children. This study provides evidence for association between CETP and low HDL-C phenotype in children, but the results appear to be weaker in children than previous results in adults and may also be subject to gender effects. PMID:23988150

  11. Group-level spatio-temporal pattern recovery in MEG decoding using multi-task joint feature learning.

    PubMed

    Kia, Seyed Mostafa; Pedregosa, Fabian; Blumenthal, Anna; Passerini, Andrea

    2017-06-15

    The use of machine learning models to discriminate between patterns of neural activity has become in recent years a standard analysis approach in neuroimaging studies. Whenever these models are linear, the estimated parameters can be visualized in the form of brain maps which can aid in understanding how brain activity in space and time underlies a cognitive function. However, the recovered brain maps often suffer from lack of interpretability, especially in group analysis of multi-subject data. To facilitate the application of brain decoding in group-level analysis, we present an application of multi-task joint feature learning for group-level multivariate pattern recovery in single-trial magnetoencephalography (MEG) decoding. The proposed method allows for recovering sparse yet consistent patterns across different subjects, and therefore enhances the interpretability of the decoding model. Our experimental results demonstrate that the mutli-task joint feature learning framework is capable of recovering more meaningful patterns of varying spatio-temporally distributed brain activity across individuals while still maintaining excellent generalization performance. We compare the performance of the multi-task joint feature learning in terms of generalization, reproducibility, and quality of pattern recovery against traditional single-subject and pooling approaches on both simulated and real MEG datasets. These results can facilitate the usage of brain decoding for the characterization of fine-level distinctive patterns in group-level inference. Considering the importance of group-level analysis, the proposed approach can provide a methodological shift towards more interpretable brain decoding models. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Rosetta:MSF: a modular framework for multi-state computational protein design.

    PubMed

    Löffler, Patrick; Schmitz, Samuel; Hupfeld, Enrico; Sterner, Reinhard; Merkl, Rainer

    2017-06-01

    Computational protein design (CPD) is a powerful technique to engineer existing proteins or to design novel ones that display desired properties. Rosetta is a software suite including algorithms for computational modeling and analysis of protein structures and offers many elaborate protocols created to solve highly specific tasks of protein engineering. Most of Rosetta's protocols optimize sequences based on a single conformation (i. e. design state). However, challenging CPD objectives like multi-specificity design or the concurrent consideration of positive and negative design goals demand the simultaneous assessment of multiple states. This is why we have developed the multi-state framework MSF that facilitates the implementation of Rosetta's single-state protocols in a multi-state environment and made available two frequently used protocols. Utilizing MSF, we demonstrated for one of these protocols that multi-state design yields a 15% higher performance than single-state design on a ligand-binding benchmark consisting of structural conformations. With this protocol, we designed de novo nine retro-aldolases on a conformational ensemble deduced from a (βα)8-barrel protein. All variants displayed measurable catalytic activity, testifying to a high success rate for this concept of multi-state enzyme design.

  13. Rosetta:MSF: a modular framework for multi-state computational protein design

    PubMed Central

    Hupfeld, Enrico; Sterner, Reinhard

    2017-01-01

    Computational protein design (CPD) is a powerful technique to engineer existing proteins or to design novel ones that display desired properties. Rosetta is a software suite including algorithms for computational modeling and analysis of protein structures and offers many elaborate protocols created to solve highly specific tasks of protein engineering. Most of Rosetta’s protocols optimize sequences based on a single conformation (i. e. design state). However, challenging CPD objectives like multi-specificity design or the concurrent consideration of positive and negative design goals demand the simultaneous assessment of multiple states. This is why we have developed the multi-state framework MSF that facilitates the implementation of Rosetta’s single-state protocols in a multi-state environment and made available two frequently used protocols. Utilizing MSF, we demonstrated for one of these protocols that multi-state design yields a 15% higher performance than single-state design on a ligand-binding benchmark consisting of structural conformations. With this protocol, we designed de novo nine retro-aldolases on a conformational ensemble deduced from a (βα)8-barrel protein. All variants displayed measurable catalytic activity, testifying to a high success rate for this concept of multi-state enzyme design. PMID:28604768

  14. Random Plant Viral Variants Attain Temporal Advantages During Systemic Infections and in Turn Resist other Variants of the Same Virus.

    PubMed

    Zhang, Xiao-Feng; Guo, Jiangbo; Zhang, Xiuchun; Meulia, Tea; Paul, Pierce; Madden, Laurence V; Li, Dawei; Qu, Feng

    2015-10-20

    Infection of plants with viruses containing multiple variants frequently leads to dominance by a few random variants in the systemically infected leaves (SLs), for which a plausible explanation is lacking. We show here that SL dominance by a given viral variant is adequately explained by its fortuitous lead in systemic spread, coupled with its resistance to superinfection by other variants. We analyzed the fate of a multi-variant turnip crinkle virus (TCV) population in Arabidopsis and N. benthamiana plants. Both wild-type and RNA silencing-defective plants displayed a similar pattern of random dominance by a few variant genotypes, thus discounting a prominent role for RNA silencing. When introduced to plants sequentially as two subpopulations, a twelve-hour head-start was sufficient for the first set to dominate. Finally, SLs of TCV-infected plants became highly resistant to secondary invasions of another TCV variant. We propose that random distribution of variant foci on inoculated leaves allows different variants to lead systemic movement in different plants. The leading variants then colonize large areas of SLs, and resist the superinfection of lagging variants in the same areas. In conclusion, superinfection resistance is the primary driver of random enrichment of viral variants in systemically infected plants.

  15. A reliable simultaneous representation of seismic hazard and of ground shaking recurrence

    NASA Astrophysics Data System (ADS)

    Peresan, A.; Panza, G. F.; Magrin, A.; Vaccari, F.

    2015-12-01

    Different earthquake hazard maps may be appropriate for different purposes - such as emergency management, insurance and engineering design. Accounting for the lower occurrence rate of larger sporadic earthquakes may allow to formulate cost-effective policies in some specific applications, provided that statistically sound recurrence estimates are used, which is not typically the case of PSHA (Probabilistic Seismic Hazard Assessment). We illustrate the procedure to associate the expected ground motions from Neo-deterministic Seismic Hazard Assessment (NDSHA) to an estimate of their recurrence. Neo-deterministic refers to a scenario-based approach, which allows for the construction of a broad range of earthquake scenarios via full waveforms modeling. From the synthetic seismograms the estimates of peak ground acceleration, velocity and displacement, or any other parameter relevant to seismic engineering, can be extracted. NDSHA, in its standard form, defines the hazard computed from a wide set of scenario earthquakes (including the largest deterministically or historically defined credible earthquake, MCE) and it does not supply the frequency of occurrence of the expected ground shaking. A recent enhanced variant of NDSHA that reliably accounts for recurrence has been developed and it is applied to the Italian territory. The characterization of the frequency-magnitude relation can be performed by any statistically sound method supported by data (e.g. multi-scale seismicity model), so that a recurrence estimate is associated to each of the pertinent sources. In this way a standard NDSHA map of ground shaking is obtained simultaneously with the map of the corresponding recurrences. The introduction of recurrence estimates in NDSHA naturally allows for the generation of ground shaking maps at specified return periods. This permits a straightforward comparison between NDSHA and PSHA maps.

  16. Genetic association of marbling score with intragenic nucleotide variants at selection signals of the bovine genome.

    PubMed

    Ryu, J; Lee, C

    2016-04-01

    Selection signals of Korean cattle might be attributed largely to artificial selection for meat quality. Rapidly increased intragenic markers of newly annotated genes in the bovine genome would help overcome limited findings of genetic markers associated with meat quality at the selection signals in a previous study. The present study examined genetic associations of marbling score (MS) with intragenic nucleotide variants at selection signals of Korean cattle. A total of 39 092 nucleotide variants of 407 Korean cattle were utilized in the association analysis. A total of 129 variants were selected within newly annotated genes in the bovine genome. Their genetic associations were analyzed using the mixed model with random polygenic effects based on identical-by-state genetic relationships among animals in order to control for spurious associations produced by population structure. Genetic associations of MS were found (P<3.88×10-4) with six intragenic nucleotide variants on bovine autosomes 3 (cache domain containing 1, CACHD1), 5 (like-glycosyltransferase, LARGE), 16 (cell division cycle 42 binding protein kinase alpha, CDC42BPA) and 21 (snurportin 1, SNUPN; protein tyrosine phosphatase, non-receptor type 9, PTPN9; chondroitin sulfate proteoglycan 4, CSPG4). In particular, the genetic associations with CDC42BPA and LARGE were confirmed using an independent data set of Korean cattle. The results implied that allele frequencies of functional variants and their proximity variants have been augmented by directional selection for greater MS and remain selection signals in the bovine genome. Further studies of fine mapping would be useful to incorporate favorable alleles in marker-assisted selection for MS of Korean cattle.

  17. MEG/EEG Source Reconstruction, Statistical Evaluation, and Visualization with NUTMEG

    PubMed Central

    Dalal, Sarang S.; Zumer, Johanna M.; Guggisberg, Adrian G.; Trumpis, Michael; Wong, Daniel D. E.; Sekihara, Kensuke; Nagarajan, Srikantan S.

    2011-01-01

    NUTMEG is a source analysis toolbox geared towards cognitive neuroscience researchers using MEG and EEG, including intracranial recordings. Evoked and unaveraged data can be imported to the toolbox for source analysis in either the time or time-frequency domains. NUTMEG offers several variants of adaptive beamformers, probabilistic reconstruction algorithms, as well as minimum-norm techniques to generate functional maps of spatiotemporal neural source activity. Lead fields can be calculated from single and overlapping sphere head models or imported from other software. Group averages and statistics can be calculated as well. In addition to data analysis tools, NUTMEG provides a unique and intuitive graphical interface for visualization of results. Source analyses can be superimposed onto a structural MRI or headshape to provide a convenient visual correspondence to anatomy. These results can also be navigated interactively, with the spatial maps and source time series or spectrogram linked accordingly. Animations can be generated to view the evolution of neural activity over time. NUTMEG can also display brain renderings and perform spatial normalization of functional maps using SPM's engine. As a MATLAB package, the end user may easily link with other toolboxes or add customized functions. PMID:21437174

  18. MEG/EEG source reconstruction, statistical evaluation, and visualization with NUTMEG.

    PubMed

    Dalal, Sarang S; Zumer, Johanna M; Guggisberg, Adrian G; Trumpis, Michael; Wong, Daniel D E; Sekihara, Kensuke; Nagarajan, Srikantan S

    2011-01-01

    NUTMEG is a source analysis toolbox geared towards cognitive neuroscience researchers using MEG and EEG, including intracranial recordings. Evoked and unaveraged data can be imported to the toolbox for source analysis in either the time or time-frequency domains. NUTMEG offers several variants of adaptive beamformers, probabilistic reconstruction algorithms, as well as minimum-norm techniques to generate functional maps of spatiotemporal neural source activity. Lead fields can be calculated from single and overlapping sphere head models or imported from other software. Group averages and statistics can be calculated as well. In addition to data analysis tools, NUTMEG provides a unique and intuitive graphical interface for visualization of results. Source analyses can be superimposed onto a structural MRI or headshape to provide a convenient visual correspondence to anatomy. These results can also be navigated interactively, with the spatial maps and source time series or spectrogram linked accordingly. Animations can be generated to view the evolution of neural activity over time. NUTMEG can also display brain renderings and perform spatial normalization of functional maps using SPM's engine. As a MATLAB package, the end user may easily link with other toolboxes or add customized functions.

  19. Mapping Quantitative Traits in Unselected Families: Algorithms and Examples

    PubMed Central

    Dupuis, Josée; Shi, Jianxin; Manning, Alisa K.; Benjamin, Emelia J.; Meigs, James B.; Cupples, L. Adrienne; Siegmund, David

    2009-01-01

    Linkage analysis has been widely used to identify from family data genetic variants influencing quantitative traits. Common approaches have both strengths and limitations. Likelihood ratio tests typically computed in variance component analysis can accommodate large families but are highly sensitive to departure from normality assumptions. Regression-based approaches are more robust but their use has primarily been restricted to nuclear families. In this paper, we develop methods for mapping quantitative traits in moderately large pedigrees. Our methods are based on the score statistic which in contrast to the likelihood ratio statistic, can use nonparametric estimators of variability to achieve robustness of the false positive rate against departures from the hypothesized phenotypic model. Because the score statistic is easier to calculate than the likelihood ratio statistic, our basic mapping methods utilize relatively simple computer code that performs statistical analysis on output from any program that computes estimates of identity-by-descent. This simplicity also permits development and evaluation of methods to deal with multivariate and ordinal phenotypes, and with gene-gene and gene-environment interaction. We demonstrate our methods on simulated data and on fasting insulin, a quantitative trait measured in the Framingham Heart Study. PMID:19278016

  20. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence.

    PubMed

    Savage, Jeanne E; Jansen, Philip R; Stringer, Sven; Watanabe, Kyoko; Bryois, Julien; de Leeuw, Christiaan A; Nagel, Mats; Awasthi, Swapnil; Barr, Peter B; Coleman, Jonathan R I; Grasby, Katrina L; Hammerschlag, Anke R; Kaminski, Jakob A; Karlsson, Robert; Krapohl, Eva; Lam, Max; Nygaard, Marianne; Reynolds, Chandra A; Trampush, Joey W; Young, Hannah; Zabaneh, Delilah; Hägg, Sara; Hansell, Narelle K; Karlsson, Ida K; Linnarsson, Sten; Montgomery, Grant W; Muñoz-Manchado, Ana B; Quinlan, Erin B; Schumann, Gunter; Skene, Nathan G; Webb, Bradley T; White, Tonya; Arking, Dan E; Avramopoulos, Dimitrios; Bilder, Robert M; Bitsios, Panos; Burdick, Katherine E; Cannon, Tyrone D; Chiba-Falek, Ornit; Christoforou, Andrea; Cirulli, Elizabeth T; Congdon, Eliza; Corvin, Aiden; Davies, Gail; Deary, Ian J; DeRosse, Pamela; Dickinson, Dwight; Djurovic, Srdjan; Donohoe, Gary; Conley, Emily Drabant; Eriksson, Johan G; Espeseth, Thomas; Freimer, Nelson A; Giakoumaki, Stella; Giegling, Ina; Gill, Michael; Glahn, David C; Hariri, Ahmad R; Hatzimanolis, Alex; Keller, Matthew C; Knowles, Emma; Koltai, Deborah; Konte, Bettina; Lahti, Jari; Le Hellard, Stephanie; Lencz, Todd; Liewald, David C; London, Edythe; Lundervold, Astri J; Malhotra, Anil K; Melle, Ingrid; Morris, Derek; Need, Anna C; Ollier, William; Palotie, Aarno; Payton, Antony; Pendleton, Neil; Poldrack, Russell A; Räikkönen, Katri; Reinvang, Ivar; Roussos, Panos; Rujescu, Dan; Sabb, Fred W; Scult, Matthew A; Smeland, Olav B; Smyrnis, Nikolaos; Starr, John M; Steen, Vidar M; Stefanis, Nikos C; Straub, Richard E; Sundet, Kjetil; Tiemeier, Henning; Voineskos, Aristotle N; Weinberger, Daniel R; Widen, Elisabeth; Yu, Jin; Abecasis, Goncalo; Andreassen, Ole A; Breen, Gerome; Christiansen, Lene; Debrabant, Birgit; Dick, Danielle M; Heinz, Andreas; Hjerling-Leffler, Jens; Ikram, M Arfan; Kendler, Kenneth S; Martin, Nicholas G; Medland, Sarah E; Pedersen, Nancy L; Plomin, Robert; Polderman, Tinca J C; Ripke, Stephan; van der Sluis, Sophie; Sullivan, Patrick F; Vrieze, Scott I; Wright, Margaret J; Posthuma, Danielle

    2018-06-25

    Intelligence is highly heritable 1 and a major determinant of human health and well-being 2 . Recent genome-wide meta-analyses have identified 24 genomic loci linked to variation in intelligence 3-7 , but much about its genetic underpinnings remains to be discovered. Here, we present a large-scale genetic association study of intelligence (n = 269,867), identifying 205 associated genomic loci (190 new) and 1,016 genes (939 new) via positional mapping, expression quantitative trait locus (eQTL) mapping, chromatin interaction mapping, and gene-based association analysis. We find enrichment of genetic effects in conserved and coding regions and associations with 146 nonsynonymous exonic variants. Associated genes are strongly expressed in the brain, specifically in striatal medium spiny neurons and hippocampal pyramidal neurons. Gene set analyses implicate pathways related to nervous system development and synaptic structure. We confirm previous strong genetic correlations with multiple health-related outcomes, and Mendelian randomization analysis results suggest protective effects of intelligence for Alzheimer's disease and ADHD and bidirectional causation with pleiotropic effects for schizophrenia. These results are a major step forward in understanding the neurobiology of cognitive function as well as genetically related neurological and psychiatric disorders.

  1. Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.)

    PubMed Central

    2009-01-01

    Background Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel based methods have been reported for the detection of sequence variants, however they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. Results A set of 418 EST based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. Conclusion The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction of a transcript map and given their high conservation between species allowed synteny comparisons to be made to sequenced genomes. This synteny analysis may support positional cloning of target genes in common bean through the use of genomic information from these other legumes. PMID:20030833

  2. Mapping Genetic Variants Underlying Differences in the Central Nitrogen Metabolism in Fermenter Yeasts

    PubMed Central

    García, Verónica; Salinas, Francisco; Aguilera, Omayra; Liti, Gianni; Martínez, Claudio

    2014-01-01

    Different populations within a species represent a rich reservoir of allelic variants, corresponding to an evolutionary signature of withstood environmental constraints. Saccharomyces cerevisiae strains are widely utilised in the fermentation of different kinds of alcoholic beverages, such as, wine and sake, each of them derived from must with distinct nutrient composition. Importantly, adequate nitrogen levels in the medium are essential for the fermentation process, however, a comprehensive understanding of the genetic variants determining variation in nitrogen consumption is lacking. Here, we assessed the genetic factors underlying variation in nitrogen consumption in a segregating population derived from a cross between two main fermenter yeasts, a Wine/European and a Sake isolate. By linkage analysis we identified 18 main effect QTLs for ammonium and amino acids sources. Interestingly, majority of QTLs were involved in more than a single trait, grouped based on amino acid structure and indicating high levels of pleiotropy across nitrogen sources, in agreement with the observed patterns of phenotypic co-variation. Accordingly, we performed reciprocal hemizygosity analysis validating an effect for three genes, GLT1, ASI1 and AGP1. Furthermore, we detected a widespread pleiotropic effect on these genes, with AGP1 affecting seven amino acids and nine in the case of GLT1 and ASI1. Based on sequence and comparative analysis, candidate causative mutations within these genes were also predicted. Altogether, the identification of these variants demonstrate how Sake and Wine/European genetic backgrounds differentially consume nitrogen sources, in part explaining independently evolved preferences for nitrogen assimilation and representing a niche of genetic diversity for the implementation of practical approaches towards more efficient strains for nitrogen metabolism. PMID:24466135

  3. SeqHBase: a big data toolset for family based sequencing data analysis.

    PubMed

    He, Min; Person, Thomas N; Hebbring, Scott J; Heinzen, Ethan; Ye, Zhan; Schrodi, Steven J; McPherson, Elizabeth W; Lin, Simon M; Peissig, Peggy L; Brilliant, Murray H; O'Rawe, Jason; Robison, Reid J; Lyon, Gholson J; Wang, Kai

    2015-04-01

    Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis. Hadoop is a framework for reliable, scalable, distributed processing of large data sets using MapReduce programming models. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequencing data to detect de novo, inherited homozygous, or compound heterozygous mutations that may contribute to disease manifestations. SeqHBase takes as input BAM files (for coverage at every site), variant call format (VCF) files (for variant calls) and functional annotations (for variant prioritisation). We applied SeqHBase to a 5-member nuclear family and a 10-member 3-generation family with WGS data, as well as a 4-member nuclear family with WES data. Analysis times were almost linearly scalable with number of data nodes. With 20 data nodes, SeqHBase took about 5 secs to analyse WES familial data and approximately 1 min to analyse WGS familial data. These results demonstrate SeqHBase's high efficiency and scalability, which is necessary as WGS and WES are rapidly becoming standard methods to study the genetics of familial disorders. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  4. A frameshift mutation in MOCOS is associated with familial renal syndrome (xanthinuria) in Tyrolean Grey cattle.

    PubMed

    Murgiano, Leonardo; Jagannathan, Vidhya; Piffer, Christian; Diez-Prieto, Inmaculada; Bolcato, Marilena; Gentile, Arcangelo; Drögemüller, Cord

    2016-12-05

    Renal syndromes are occasionally reported in domestic animals. Two identical twin Tyrolean Grey calves exhibited weight loss, skeletal abnormalities and delayed development associated with kidney abnormalities and formation of uroliths. These signs resembled inherited renal tubular dysplasia found in Japanese Black cattle which is associated with mutations in the claudin 16 gene. Despite demonstrating striking phenotypic similarities, no obvious presence of pathogenic variants of this candidate gene were found. Therefore further analysis was required to decipher the genetic etiology of the condition. The family history of the cases suggested the possibility of an autosomal recessive inheritance. Homozygosity mapping combined with sequencing of the whole genome of one case detected two associated non-synonymous private coding variants: A homozygous missense variant in the uncharacterized KIAA2026 gene (g.39038055C > G; c.926C > G), located in a 15 Mb sized region of homozygosity on BTA 8; and a homozygous 1 bp deletion in the molybdenum cofactor sulfurase (MOCOS) gene (g.21222030delC; c.1881delG and c.1782delG), located in an 11 Mb region of homozygosity on BTA 24. Pathogenic variants in MOCOS have previously been associated with inherited metabolic syndromes and xanthinuria in different species including Japanese Black cattle. Genotyping of two additional clinically suspicious cases confirmed the association with the MOCOS variant, as both animals had a homozygous mutant genotype and did not show the variant KIAA2026 allele. The identified genomic deletion is predicted to be highly disruptive, creating a frameshift and premature termination of translation, resulting in severely truncated MOCOS proteins that lack two functionally essential domains. The variant MOCOS allele was absent from cattle of other breeds and approximately 4% carriers were detected among more than 1200 genotyped Tyrolean Grey cattle. Biochemical urolith analysis of one case revealed the presence of approximately 95% xanthine. The identified MOCOS loss of function variant is highly likely to cause the renal syndrome in the affected animals. The results suggest that the phenotypic features of the renal syndrome were related to an early onset form of xanthinuria, which is highly likely to lead to the progressive defects. The identification of the candidate causative mutation thus enables selection against this pathogenic variant in Tyrolean Grey cattle.

  5. Comparative analysis of the folding dynamics and kinetics of an engineered knotted protein and its variants derived from HP0242 of Helicobacter pylori

    NASA Astrophysics Data System (ADS)

    Wang, Liang-Wei; Liu, Yu-Nan; Lyu, Ping-Chiang; Jackson, Sophie E.; Hsu, Shang-Te Danny

    2015-09-01

    Understanding the mechanism by which a polypeptide chain thread itself spontaneously to attain a knotted conformation has been a major challenge in the field of protein folding. HP0242 is a homodimeric protein from Helicobacter pylori with intertwined helices to form a unique pseudo-knotted folding topology. A tandem HP0242 repeat has been constructed to become the first engineered trefoil-knotted protein. Its small size renders it a model system for computational analyses to examine its folding and knotting pathways. Here we report a multi-parametric study on the folding stability and kinetics of a library of HP0242 variants, including the trefoil-knotted tandem HP0242 repeat, using far-UV circular dichroism and fluorescence spectroscopy. Equilibrium chemical denaturation of HP0242 variants shows the presence of highly populated dimeric and structurally heterogeneous folding intermediates. Such equilibrium folding intermediates retain significant amount of helical structures except those at the N- and C-terminal regions in the native structure. Stopped-flow fluorescence measurements of HP0242 variants show that spontaneous refolding into knotted structures can be achieved within seconds, which is several orders of magnitude faster than previously observed for other knotted proteins. Nevertheless, the complex chevron plots indicate that HP0242 variants are prone to misfold into kinetic traps, leading to severely rolled-over refolding arms. The experimental observations are in general agreement with the previously reported molecular dynamics simulations. Based on our results, kinetic folding pathways are proposed to qualitatively describe the complex folding processes of HP0242 variants.

  6. Integrated change detection and temporal trajectory analysis of coastal wetlands using high spatial resolution Korean Multi-Purpose Satellite series imagery

    NASA Astrophysics Data System (ADS)

    Nguyen, Hoang Hai; Tran, Hien; Sunwoo, Wooyeon; Yi, Jong-hyuk; Kim, Dongkyun; Choi, Minha

    2017-04-01

    A series of multispectral high-resolution Korean Multi-Purpose Satellite (KOMPSAT) images was used to detect the geographical changes in four different tidal flats between the Yellow Sea and the west coast of South Korea. The method of unsupervised classification was used to generate a series of land use/land cover (LULC) maps from satellite images, which were then used as input for temporal trajectory analysis to detect the temporal change of coastal wetlands and its association with natural and anthropogenic activities. The accurately classified LULC maps of KOMPSAT images, with overall accuracy ranging from 83.34% to 95.43%, indicate that these multispectral high-resolution satellite data are highly applicable to the generation of high-quality thematic maps for extracting wetlands. The result of the trajectory analysis showed that, while the variation of the tidal flats in the Gyeonggi and Jeollabuk provinces was well correlated with the regular tidal regimes, the reductive trajectory of the wetland areas belonging to the Saemangeum province was caused by a high degree of human-induced activities including large reclamation and urbanization. The conservation of the Jeungdo Wetland Protected Area in the Jeollanam province revealed that effective social and environmental policies could help in protecting coastal wetlands from degradation.

  7. An integrated map of structural variation in 2,504 human genomes.

    PubMed

    Sudmant, Peter H; Rausch, Tobias; Gardner, Eugene J; Handsaker, Robert E; Abyzov, Alexej; Huddleston, John; Zhang, Yan; Ye, Kai; Jun, Goo; Fritz, Markus Hsi-Yang; Konkel, Miriam K; Malhotra, Ankit; Stütz, Adrian M; Shi, Xinghua; Casale, Francesco Paolo; Chen, Jieming; Hormozdiari, Fereydoun; Dayama, Gargi; Chen, Ken; Malig, Maika; Chaisson, Mark J P; Walter, Klaudia; Meiers, Sascha; Kashin, Seva; Garrison, Erik; Auton, Adam; Lam, Hugo Y K; Mu, Xinmeng Jasmine; Alkan, Can; Antaki, Danny; Bae, Taejeong; Cerveira, Eliza; Chines, Peter; Chong, Zechen; Clarke, Laura; Dal, Elif; Ding, Li; Emery, Sarah; Fan, Xian; Gujral, Madhusudan; Kahveci, Fatma; Kidd, Jeffrey M; Kong, Yu; Lameijer, Eric-Wubbo; McCarthy, Shane; Flicek, Paul; Gibbs, Richard A; Marth, Gabor; Mason, Christopher E; Menelaou, Androniki; Muzny, Donna M; Nelson, Bradley J; Noor, Amina; Parrish, Nicholas F; Pendleton, Matthew; Quitadamo, Andrew; Raeder, Benjamin; Schadt, Eric E; Romanovitch, Mallory; Schlattl, Andreas; Sebra, Robert; Shabalin, Andrey A; Untergasser, Andreas; Walker, Jerilyn A; Wang, Min; Yu, Fuli; Zhang, Chengsheng; Zhang, Jing; Zheng-Bradley, Xiangqun; Zhou, Wanding; Zichner, Thomas; Sebat, Jonathan; Batzer, Mark A; McCarroll, Steven A; Mills, Ryan E; Gerstein, Mark B; Bashir, Ali; Stegle, Oliver; Devine, Scott E; Lee, Charles; Eichler, Evan E; Korbel, Jan O

    2015-10-01

    Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.

  8. Applications of the 1000 Genomes Project resources

    PubMed Central

    Zheng-Bradley, Xiangqun

    2017-01-01

    Abstract The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. PMID:27436001

  9. Complex and multi-allelic copy number variation in human disease

    PubMed Central

    McCarroll, Steven A.

    2015-01-01

    Hundreds of copy number variants are complex and multi-allelic, in that they have many structural alleles and have rearranged multiple times in the ancestors who contributed chromosomes to current humans. Not only are the relationships of these multi-allelic CNVs (mCNVs) to phenotypes generally unknown, but many mCNVs have not yet been described at the basic levels—alleles, allele frequencies, structural features—that support genetic investigation. To date, most reported disease associations to these variants have been ascertained through candidate gene studies. However, only a few associations have reached the level of acceptance defined by durable replications in many cohorts. This likely stems from longstanding challenges in making precise molecular measurements of the alleles individuals have at these loci. However, approaches for mCNV analysis are improving quickly, and some of the unique characteristics of mCNVs may assist future association studies. Their various structural alleles are likely to have different magnitudes of effect, creating a natural allelic series of growing phenotypic impact and giving investigators a set of natural predictions and testable hypotheses about the extent to which each allele of an mCNV predisposes to a phenotype. Also, mCNVs’ low-to-modest correlation to individual single-nucleotide polymorphisms (SNPs) may make it easier to distinguish between mCNVs and nearby SNPs as the drivers of an association signal, and perhaps, make it possible to preliminarily screen candidate loci, or the entire genome, for the many mCNV–disease relationships that remain to be discovered. PMID:26163405

  10. High density genetic mapping identifies new susceptibility loci for rheumatoid arthritis

    PubMed Central

    Eyre, Steve; Bowes, John; Diogo, Dorothée; Lee, Annette; Barton, Anne; Martin, Paul; Zhernakova, Alexandra; Stahl, Eli; Viatte, Sebastien; McAllister, Kate; Amos, Christopher I.; Padyukov, Leonid; Toes, Rene E.M.; Huizinga, Tom W.J.; Wijmenga, Cisca; Trynka, Gosia; Franke, Lude; Westra, Harm-Jan; Alfredsson, Lars; Hu, Xinli; Sandor, Cynthia; de Bakker, Paul I.W.; Davila, Sonia; Khor, Chiea Chuen; Heng, Khai Koon; Andrews, Robert; Edkins, Sarah; Hunt, Sarah E; Langford, Cordelia; Symmons, Deborah; Concannon, Pat; Onengut-Gumuscu, Suna; Rich, Stephen S; Deloukas, Panos; Gonzalez-Gay, Miguel A.; Rodriguez-Rodriguez, Luis; Ärlsetig, Lisbeth; Martin, Javier; Rantapää-Dahlqvist, Solbritt; Plenge, Robert; Raychaudhuri, Soumya; Klareskog, Lars; Gregersen, Peter K; Worthington, Jane

    2012-01-01

    Summary Using the Immunochip custom single nucleotide polymorphism (SNP) array, designed for dense genotyping of 186 genome wide association study (GWAS) confirmed loci we analysed 11,475 rheumatoid arthritis cases of European ancestry and 15,870 controls for 129,464 markers. The data were combined in meta-analysis with GWAS data from additional independent cases (n=2,363) and controls (n=17,872). We identified fourteen novel loci; nine were associated with rheumatoid arthritis overall and 5 specifically in anti-citrillunated peptide antibody positive disease, bringing the number of confirmed European ancestry rheumatoid arthritis loci to 46. We refined the peak of association to a single gene for 19 loci, identified secondary independent effects at six loci and association to low frequency variants (minor allele frequency <0.05) at 4 loci. Bioinformatic analysis of the data generated strong hypotheses for the causal SNP at seven loci. This study illustrates the advantages of dense SNP mapping analysis to inform subsequent functional investigations. PMID:23143596

  11. Functional annotation of HOT regions in the human genome: implications for human disease and cancer

    PubMed Central

    Li, Hao; Chen, Hebing; Liu, Feng; Ren, Chao; Wang, Shengqi; Bo, Xiaochen; Shu, Wenjie

    2015-01-01

    Advances in genome-wide association studies (GWAS) and large-scale sequencing studies have resulted in an impressive and growing list of disease- and trait-associated genetic variants. Most studies have emphasised the discovery of genetic variation in coding sequences, however, the noncoding regulatory effects responsible for human disease and cancer biology have been substantially understudied. To better characterise the cis-regulatory effects of noncoding variation, we performed a comprehensive analysis of the genetic variants in HOT (high-occupancy target) regions, which are considered to be one of the most intriguing findings of recent large-scale sequencing studies. We observed that GWAS variants that map to HOT regions undergo a substantial net decrease and illustrate development-specific localisation during haematopoiesis. Additionally, genetic risk variants are disproportionally enriched in HOT regions compared with LOT (low-occupancy target) regions in both disease-relevant and cancer cells. Importantly, this enrichment is biased toward disease- or cancer-specific cell types. Furthermore, we observed that cancer cells generally acquire cancer-specific HOT regions at oncogenes through diverse mechanisms of cancer pathogenesis. Collectively, our findings demonstrate the key roles of HOT regions in human disease and cancer and represent a critical step toward further understanding disease biology, diagnosis, and therapy. PMID:26113264

  12. Functional annotation of HOT regions in the human genome: implications for human disease and cancer.

    PubMed

    Li, Hao; Chen, Hebing; Liu, Feng; Ren, Chao; Wang, Shengqi; Bo, Xiaochen; Shu, Wenjie

    2015-06-26

    Advances in genome-wide association studies (GWAS) and large-scale sequencing studies have resulted in an impressive and growing list of disease- and trait-associated genetic variants. Most studies have emphasised the discovery of genetic variation in coding sequences, however, the noncoding regulatory effects responsible for human disease and cancer biology have been substantially understudied. To better characterise the cis-regulatory effects of noncoding variation, we performed a comprehensive analysis of the genetic variants in HOT (high-occupancy target) regions, which are considered to be one of the most intriguing findings of recent large-scale sequencing studies. We observed that GWAS variants that map to HOT regions undergo a substantial net decrease and illustrate development-specific localisation during haematopoiesis. Additionally, genetic risk variants are disproportionally enriched in HOT regions compared with LOT (low-occupancy target) regions in both disease-relevant and cancer cells. Importantly, this enrichment is biased toward disease- or cancer-specific cell types. Furthermore, we observed that cancer cells generally acquire cancer-specific HOT regions at oncogenes through diverse mechanisms of cancer pathogenesis. Collectively, our findings demonstrate the key roles of HOT regions in human disease and cancer and represent a critical step toward further understanding disease biology, diagnosis, and therapy.

  13. Role of protein surface charge in monellin sweetness.

    PubMed

    Xue, Wei-Feng; Szczepankiewicz, Olga; Thulin, Eva; Linse, Sara; Carey, Jannette

    2009-03-01

    A small number of proteins have the unusual property of tasting intensely sweet. Despite many studies aimed at identifying their sweet taste determinants, the molecular basis of protein sweetness is not fully understood. Recent mutational studies of monellin have implicated positively charged residues in sweetness. In the present work, the effect of overall net charge was investigated using the complementary approach of negative charge alterations. Multiple substitutions of Asp/Asn and Glu/Gln residues radically altered the surface charge of single-chain monellin by removing six negative charges or adding four negative charges. Biophysical characterization using circular dichroism, fluorescence, and two-dimensional NMR demonstrates that the native fold of monellin is preserved in the variant proteins under physiological solution conditions although their stability toward chemical denaturation is altered. A human taste test was employed to determine the sweetness detection threshold of the variants. Removal of negative charges preserves monellin sweetness, whereas added negative charge has a large negative impact on sweetness. Meta-analysis of published charge variants of monellin and other sweet proteins reveals a general trend toward increasing sweetness with increasing positive net charge. Structural mapping of monellin variants identifies a hydrophobic surface predicted to face the receptor where introduced positive or negative charge reduces sweetness, and a polar surface where charges modulate long-range electrostatic complementarity.

  14. GenProBiS: web server for mapping of sequence variants to protein binding sites.

    PubMed

    Konc, Janez; Skrlj, Blaz; Erzen, Nika; Kunej, Tanja; Janezic, Dusanka

    2017-07-03

    Discovery of potentially deleterious sequence variants is important and has wide implications for research and generation of new hypotheses in human and veterinary medicine, and drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein-protein, protein-nucleic acid, protein-compound, and protein-metal ion binding sites. The concept of a protein-compound binding site is understood in the broadest sense, which includes glycosylation and other post-translational modification sites. Binding sites were defined by local structural comparisons of whole protein structures using the Protein Binding Sites (ProBiS) algorithm and transposition of ligands from the similar binding sites found to the query protein using the ProBiS-ligands approach with new improvements introduced in GenProBiS. Binding site surfaces were generated as three-dimensional grids encompassing the space occupied by predicted ligands. The server allows intuitive visual exploration of comprehensively mapped variants, such as human somatic mis-sense mutations related to cancer and non-synonymous single nucleotide polymorphisms from 21 species, within the predicted binding sites regions for about 80 000 PDB protein structures using fast WebGL graphics. The GenProBiS web server is open and free to all users at http://genprobis.insilab.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Performance of Multi-chaotic PSO on a shifted benchmark functions set

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pluhacek, Michal; Senkerik, Roman; Zelinka, Ivan

    2015-03-10

    In this paper the performance of Multi-chaotic PSO algorithm is investigated using two shifted benchmark functions. The purpose of shifted benchmark functions is to simulate the time-variant real-world problems. The results of chaotic PSO are compared with canonical version of the algorithm. It is concluded that using the multi-chaotic approach can lead to better results in optimization of shifted functions.

  16. Beyond mind-reading: multi-voxel pattern analysis of fMRI data.

    PubMed

    Norman, Kenneth A; Polyn, Sean M; Detre, Greg J; Haxby, James V

    2006-09-01

    A key challenge for cognitive neuroscience is determining how mental representations map onto patterns of neural activity. Recently, researchers have started to address this question by applying sophisticated pattern-classification algorithms to distributed (multi-voxel) patterns of functional MRI data, with the goal of decoding the information that is represented in the subject's brain at a particular point in time. This multi-voxel pattern analysis (MVPA) approach has led to several impressive feats of mind reading. More importantly, MVPA methods constitute a useful new tool for advancing our understanding of neural information processing. We review how researchers are using MVPA methods to characterize neural coding and information processing in domains ranging from visual perception to memory search.

  17. The OncoArray Consortium: A Network for Understanding the Genetic Architecture of Common Cancers.

    PubMed

    Amos, Christopher I; Dennis, Joe; Wang, Zhaoming; Byun, Jinyoung; Schumacher, Fredrick R; Gayther, Simon A; Casey, Graham; Hunter, David J; Sellers, Thomas A; Gruber, Stephen B; Dunning, Alison M; Michailidou, Kyriaki; Fachal, Laura; Doheny, Kimberly; Spurdle, Amanda B; Li, Yafang; Xiao, Xiangjun; Romm, Jane; Pugh, Elizabeth; Coetzee, Gerhard A; Hazelett, Dennis J; Bojesen, Stig E; Caga-Anan, Charlisse; Haiman, Christopher A; Kamal, Ahsan; Luccarini, Craig; Tessier, Daniel; Vincent, Daniel; Bacot, François; Van Den Berg, David J; Nelson, Stefanie; Demetriades, Stephen; Goldgar, David E; Couch, Fergus J; Forman, Judith L; Giles, Graham G; Conti, David V; Bickeböller, Heike; Risch, Angela; Waldenberger, Melanie; Brüske-Hohlfeld, Irene; Hicks, Belynda D; Ling, Hua; McGuffog, Lesley; Lee, Andrew; Kuchenbaecker, Karoline; Soucy, Penny; Manz, Judith; Cunningham, Julie M; Butterbach, Katja; Kote-Jarai, Zsofia; Kraft, Peter; FitzGerald, Liesel; Lindström, Sara; Adams, Marcia; McKay, James D; Phelan, Catherine M; Benlloch, Sara; Kelemen, Linda E; Brennan, Paul; Riggan, Marjorie; O'Mara, Tracy A; Shen, Hongbing; Shi, Yongyong; Thompson, Deborah J; Goodman, Marc T; Nielsen, Sune F; Berchuck, Andrew; Laboissiere, Sylvie; Schmit, Stephanie L; Shelford, Tameka; Edlund, Christopher K; Taylor, Jack A; Field, John K; Park, Sue K; Offit, Kenneth; Thomassen, Mads; Schmutzler, Rita; Ottini, Laura; Hung, Rayjean J; Marchini, Jonathan; Amin Al Olama, Ali; Peters, Ulrike; Eeles, Rosalind A; Seldin, Michael F; Gillanders, Elizabeth; Seminara, Daniela; Antoniou, Antonis C; Pharoah, Paul D P; Chenevix-Trench, Georgia; Chanock, Stephen J; Simard, Jacques; Easton, Douglas F

    2017-01-01

    Common cancers develop through a multistep process often including inherited susceptibility. Collaboration among multiple institutions, and funding from multiple sources, has allowed the development of an inexpensive genotyping microarray, the OncoArray. The array includes a genome-wide backbone, comprising 230,000 SNPs tagging most common genetic variants, together with dense mapping of known susceptibility regions, rare variants from sequencing experiments, pharmacogenetic markers, and cancer-related traits. The OncoArray can be genotyped using a novel technology developed by Illumina to facilitate efficient genotyping. The consortium developed standard approaches for selecting SNPs for study, for quality control of markers, and for ancestry analysis. The array was genotyped at selected sites and with prespecified replicate samples to permit evaluation of genotyping accuracy among centers and by ethnic background. The OncoArray consortium genotyped 447,705 samples. A total of 494,763 SNPs passed quality control steps with a sample success rate of 97% of the samples. Participating sites performed ancestry analysis using a common set of markers and a scoring algorithm based on principal components analysis. Results from these analyses will enable researchers to identify new susceptibility loci, perform fine-mapping of new or known loci associated with either single or multiple cancers, assess the degree of overlap in cancer causation and pleiotropic effects of loci that have been identified for disease-specific risk, and jointly model genetic, environmental, and lifestyle-related exposures. Ongoing analyses will shed light on etiology and risk assessment for many types of cancer. Cancer Epidemiol Biomarkers Prev; 26(1); 126-35. ©2016 AACR. ©2016 American Association for Cancer Research.

  18. The OncoArray Consortium: a Network for Understanding the Genetic Architecture of Common Cancers

    PubMed Central

    Amos, Christopher I.; Dennis, Joe; Wang, Zhaoming; Byun, Jinyoung; Schumacher, Fredrick R.; Gayther, Simon A.; Casey, Graham; Hunter, David J.; Sellers, Thomas A.; Gruber, Stephen B.; Dunning, Alison M.; Michailidou, Kyriaki; Fachal, Laura; Doheny, Kimberly; Spurdle, Amanda B.; Li, Yafang; Xiao, Xiangjun; Romm, Jane; Pugh, Elizabeth; Coetzee, Gerhard A.; Hazelett, Dennis J.; Bojesen, Stig E.; Caga-Anan, Charlisse; Haiman, Christopher A.; Kamal, Ahsan; Luccarini, Craig; Tessier, Daniel; Vincent, Daniel; Bacot, François; Van Den Berg, David J.; Nelson, Stefanie; Demetriades, Stephen; Goldgar, David E.; Couch, Fergus J.; Forman, Judith L.; Giles, Graham G.; Conti, David V.; Bickeböller, Heike; Risch, Angela; Waldenberger, Melanie; Brüske, Irene; Hicks, Belynda D.; Ling, Hua; McGuffog, Lesley; Lee, Andrew; Kuchenbaecker, Karoline B.; Soucy, Penny; Manz, Judith; Cunningham, Julie M.; Butterbach, Katja; Kote-Jarai, Zsofia; Kraft, Peter; FitzGerald, Liesel M.; Lindström, Sara; Adams, Marcia; McKay, James D.; Phelan, Catherine M.; Benlloch, Sara; Kelemen, Linda E.; Brennan, Paul; Riggan, Marjorie; O’Mara, Tracy A.; Shen, Hongbin; Shi, Yongyong; Thompson, Deborah J.; Goodman, Marc T.; Nielsen, Sune F.; Berchuck, Andrew; Laboissiere, Sylvie; Schmit, Stephanie L.; Shelford, Tameka; Edlund, Christopher K.; Taylor, Jack A.; Field, John K.; Park, Sue K.; Offit, Kenneth; Thomassen, Mads; Schmutzler, Rita; Ottini, Laura; Hung, Rayjean J.; Marchini, Jonathan; Al Olama, Ali Amin; Peters, Ulrike; Eeles, Rosalind A.; Seldin, Michael F.; Gillanders, Elizabeth; Seminara, Daniela; Antoniou, Antonis C.; Pharoah, Paul D.; Chenevix-Trench, Georgia; Chanock, Stephen J.; Simard, Jacques; Easton, Douglas F.

    2016-01-01

    Background Common cancers develop through a multistep process often including inherited susceptibility. Collaboration among multiple institutions, and funding from multiple sources, has allowed the development of an inexpensive genotyping microarray, the OncoArray. The array includes a genome-wide backbone, comprising 230,000 SNPs tagging most common genetic variants, together with dense mapping of known susceptibility regions, rare variants from sequencing experiments, pharmacogenetic markers and cancer related traits. Methods The OncoArray can be genotyped using a novel technology developed by Illumina to facilitate efficient genotyping. The consortium developed standard approaches for selecting SNPs for study, for quality control of markers and for ancestry analysis. The array was genotyped at selected sites and with prespecified replicate samples to permit evaluation of genotyping accuracy among centers and by ethnic background. Results The OncoArray consortium genotyped 447,705 samples. A total of 494,763 SNPs passed quality control steps with a sample success rate of 97% of the samples. Participating sites performed ancestry analysis using a common set of markers and a scoring algorithm based on principal components analysis. Conclusions Results from these analyses will enable researchers to identify new susceptibility loci, perform fine mapping of new or known loci associated with either single or multiple cancers, assess the degree of overlap in cancer causation and pleiotropic effects of loci that have been identified for disease-specific risk, and jointly model genetic, environmental and lifestyle related exposures. Impact Ongoing analyses will shed light on etiology and risk assessment for many types of cancer. PMID:27697780

  19. Genome-wide meta-analysis of observational studies shows common genetic variants associated with macronutrient intake.

    PubMed

    Tanaka, Toshiko; Ngwa, Julius S; van Rooij, Frank J A; Zillikens, M Carola; Wojczynski, Mary K; Frazier-Wood, Alexis C; Houston, Denise K; Kanoni, Stavroula; Lemaitre, Rozenn N; Luan, Jian'an; Mikkilä, Vera; Renstrom, Frida; Sonestedt, Emily; Zhao, Jing Hua; Chu, Audrey Y; Qi, Lu; Chasman, Daniel I; de Oliveira Otto, Marcia C; Dhurandhar, Emily J; Feitosa, Mary F; Johansson, Ingegerd; Khaw, Kay-Tee; Lohman, Kurt K; Manichaikul, Ani; McKeown, Nicola M; Mozaffarian, Dariush; Singleton, Andrew; Stirrups, Kathleen; Viikari, Jorma; Ye, Zheng; Bandinelli, Stefania; Barroso, Inês; Deloukas, Panos; Forouhi, Nita G; Hofman, Albert; Liu, Yongmei; Lyytikäinen, Leo-Pekka; North, Kari E; Dimitriou, Maria; Hallmans, Goran; Kähönen, Mika; Langenberg, Claudia; Ordovas, Jose M; Uitterlinden, André G; Hu, Frank B; Kalafati, Ioanna-Panagiota; Raitakari, Olli; Franco, Oscar H; Johnson, Andrew; Emilsson, Valur; Schrack, Jennifer A; Semba, Richard D; Siscovick, David S; Arnett, Donna K; Borecki, Ingrid B; Franks, Paul W; Kritchevsky, Stephen B; Lehtimäki, Terho; Loos, Ruth J F; Orho-Melander, Marju; Rotter, Jerome I; Wareham, Nicholas J; Witteman, Jacqueline C M; Ferrucci, Luigi; Dedoussis, George; Cupples, L Adrienne; Nettleton, Jennifer A

    2013-06-01

    Macronutrient intake varies substantially between individuals, and there is evidence that this variation is partly accounted for by genetic variants. The objective of the study was to identify common genetic variants that are associated with macronutrient intake. We performed 2-stage genome-wide association (GWA) meta-analysis of macronutrient intake in populations of European descent. Macronutrients were assessed by using food-frequency questionnaires and analyzed as percentages of total energy consumption from total fat, protein, and carbohydrate. From the discovery GWA (n = 38,360), 35 independent loci associated with macronutrient intake at P < 5 × 10(-6) were identified and taken forward to replication in 3 additional cohorts (n = 33,533) from the DietGen Consortium. For one locus, fat mass obesity-associated protein (FTO), cohorts with Illumina MetaboChip genotype data (n = 7724) provided additional replication data. A variant in the chromosome 19 locus (rs838145) was associated with higher carbohydrate (β ± SE: 0.25 ± 0.04%; P = 1.68 × 10(-8)) and lower fat (β ± SE: -0.21 ± 0.04%; P = 1.57 × 10(-9)) consumption. A candidate gene in this region, fibroblast growth factor 21 (FGF21), encodes a fibroblast growth factor involved in glucose and lipid metabolism. The variants in this locus were associated with circulating FGF21 protein concentrations (P < 0.05) but not mRNA concentrations in blood or brain. The body mass index (BMI)-increasing allele of the FTO variant (rs1421085) was associated with higher protein intake (β ± SE: 0.10 ± 0.02%; P = 9.96 × 10(-10)), independent of BMI (after adjustment for BMI, β ± SE: 0.08 ± 0.02%; P = 3.15 × 10(-7)). Our results indicate that variants in genes involved in nutrient metabolism and obesity are associated with macronutrient consumption in humans. Trials related to this study were registered at clinicaltrials.gov as NCT00005131 (Atherosclerosis Risk in Communities), NCT00005133 (Cardiovascular Health Study), NCT00005136 (Family Heart Study), NCT00005121 (Framingham Heart Study), NCT00083369 (Genetic and Environmental Determinants of Triglycerides), NCT01331512 (InCHIANTI Study), and NCT00005487 (Multi-Ethnic Study of Atherosclerosis).

  20. Multi-scale functional mapping of tidal marsh vegetation for restoration monitoring

    NASA Astrophysics Data System (ADS)

    Tuxen Bettman, Karin

    2007-12-01

    Nearly half of the world's natural wetlands have been destroyed or degraded, and in recent years, there have been significant endeavors to restore wetland habitat throughout the world. Detailed mapping of restoring wetlands can offer valuable information about changes in vegetation and geomorphology, which can inform the restoration process and ultimately help to improve chances of restoration success. I studied six tidal marshes in the San Francisco Estuary, CA, US, between 2003 and 2004 in order to develop techniques for mapping tidal marshes at multiple scales by incorporating specific restoration objectives for improved longer term monitoring. I explored a "pixel-based" remote sensing image analysis method for mapping vegetation in restored and natural tidal marshes, describing the benefits and limitations of this type of approach (Chapter 2). I also performed a multi-scale analysis of vegetation pattern metrics for a recently restored tidal marsh in order to target the metrics that are consistent across scales and will be robust measures of marsh vegetation change (Chapter 3). Finally, I performed an "object-based" image analysis using the same remotely sensed imagery, which maps vegetation type and specific wetland functions at multiple scales (Chapter 4). The combined results of my work highlight important trends and management implications for monitoring wetland restoration using remote sensing, and will better enable restoration ecologists to use remote sensing for tidal marsh monitoring. Several findings important for tidal marsh restoration monitoring were made. Overall results showed that pixel-based methods are effective at quantifying landscape changes in composition and diversity in recently restored marshes, but are limited in their use for quantifying smaller, more fine-scale changes. While pattern metrics can highlight small but important changes in vegetation composition and configuration across years, scientists should exercise caution when using metrics in their studies or to validate restoration management decisions, and multi-scale analyses should be performed before metrics are used in restoration science for important management decisions. Lastly, restoration objectives, ecosystem function, and scale can each be integrated into monitoring techniques using remote sensing for improved restoration monitoring.

  1. Chaotic map clustering algorithm for EEG analysis

    NASA Astrophysics Data System (ADS)

    Bellotti, R.; De Carlo, F.; Stramaglia, S.

    2004-03-01

    The non-parametric chaotic map clustering algorithm has been applied to the analysis of electroencephalographic signals, in order to recognize the Huntington's disease, one of the most dangerous pathologies of the central nervous system. The performance of the method has been compared with those obtained through parametric algorithms, as K-means and deterministic annealing, and supervised multi-layer perceptron. While supervised neural networks need a training phase, performed by means of data tagged by the genetic test, and the parametric methods require a prior choice of the number of classes to find, the chaotic map clustering gives a natural evidence of the pathological class, without any training or supervision, thus providing a new efficient methodology for the recognition of patterns affected by the Huntington's disease.

  2. A de novo missense mutation of FGFR2 causes facial dysplasia syndrome in Holstein cattle.

    PubMed

    Agerholm, Jørgen S; McEvoy, Fintan J; Heegaard, Steffen; Charlier, Carole; Jagannathan, Vidhya; Drögemüller, Cord

    2017-08-02

    Surveillance for bovine genetic diseases in Denmark identified a hitherto unreported congenital syndrome occurring among progeny of a Holstein sire used for artificial breeding. A genetic aetiology due to a dominant inheritance with incomplete penetrance or a mosaic germline mutation was suspected as all recorded cases were progeny of the same sire. Detailed investigations were performed to characterize the syndrome and to reveal its cause. Seven malformed calves were submitted examination. All cases shared a common morphology with the most striking lesions being severe facial dysplasia and complete prolapse of the eyes. Consequently the syndrome was named facial dysplasia syndrome (FDS). Furthermore, extensive brain malformations, including microencephaly, hydrocephalus, lobation of the cerebral hemispheres and compression of the brain were present. Subsequent data analysis of progeny of the sire revealed that around 0.5% of his offspring suffered from FDS. High density single nucleotide polymorphism (SNP) genotyping data of the seven cases and their parents were used to map the defect in the bovine genome. Significant genetic linkage was obtained for three regions, including chromosome 26 where whole genome sequencing of a case-parent trio revealed two de novo variants perfectly associated with the disease: an intronic SNP in the DMBT1 gene and a single non-synonymous variant in the FGFR2 gene. This FGFR2 missense variant (c.927G>T) affects a gene encoding a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and across species. It is predicted to change an evolutionary conserved tryptophan into a cysteine residue (p.Trp309Cys). Both variant alleles were proven to result from de novo mutation events in the germline of the sire. FDS is a novel genetic disorder of Holstein cattle. Mutations in the human FGFR2 gene are associated with various dominant inherited craniofacial dysostosis syndromes. Given the phenotypic similarities in FDS affected calves, the genetic mapping and absence of further high impact variants in the critical genome regions, it is highly likely that the missense mutation in the FGFR2 gene caused the FDS phenotype in a dominant mode of inheritance.

  3. Hyperspectral remote sensing exploration of carbonatite - an example from Epembe, Kunene region, Namibia

    NASA Astrophysics Data System (ADS)

    Zimmermann, Robert; Brandmeier, Melanie; Andreani, Louis; Gloaguen, Richard

    2015-04-01

    Remote sensing data can provide valuable information about ore deposits and their alteration zones at surface level. High spectral and spatial resolution of the data is essential for detailed mapping of mineral abundances and related structures. Carbonatites are well known for hosting economic enrichments in REE, Ta, Nb and P (Jones et al. 2013). These make them a preferential target for exploration for those critical elements. In this study we show how combining geomorphic, textural and spectral data improves classification result. We selected a site with a well-known occurrence in northern Namibia: the Epembe dyke. For analysis LANDSAT 8, SRTM and airborne hyperspectral (HyMap) data were chosen. The overlapping data allows a multi-scale and multi-resolution approach. Results from data analysis were validated during fieldwork in 2014. Data was corrected for atmospherical and geometrical effects. Image classification, mineral mapping and tectonic geomorphology allow a refinement of the geological map by lithological mapping in a second step. Detailed mineral abundance maps were computed using spectral unmixing techniques. These techniques are well suited to map abundances of carbonate minerals, but not to discriminate the carbonatite itself from surrounding rocks with similar spectral signatures. Thus, geometric indices were calculated using tectonic geomorphology and textures. For this purpose the TecDEM-toolbox (SHAHZAD & GLOAGUEN 2011) was applied to the SRTM-data for geomorphic analysis. Textural indices (e.g. uniformity, entropy, angular second moment) were derived from HyMap and SRTM by a grey-level co-occurrence matrix (CLAUSI 2002). The carbonatite in the study area is ridge-forming and shows a narrow linear feature in the textural bands. Spectral and geometric information were combined using kohonen Self-Organizing Maps (SOM) for unsupervised clustering. The resulting class spectra were visually compared and interpreted. Classes with similar signatures were merged according to geological context. The major conclusions are: 1. Carbonate minerals can be mapped using spectral unmixing techniques. 2. Carbonatites are associated with specific geometric pattern 3. The combination of spectral and geometric information improves classification result and reduces misclassification. References Clausi, D. A. (2002): An analysis of co-occurrence texture statistics as a function of grey-level quantization. - Canadian Journal of Remote Sensing, 28 (1), 45-62 Jones, A. P., Genge, M. and Carmody, L (2013): Carbonate Melts and Carbonatites. - Reviews in Mineralogy & Geochemistry, 75, 289-322 Shahzad, F. & Gloaguen, R. (2011): TecDEM: A MATLAB based toolbox for tectonic geomorphology, Part 2: Surface dynamics and basin analysis. - Computers and Geosciences, 37 (2), 261-271

  4. Anatomy assisted PET image reconstruction incorporating multi-resolution joint entropy

    NASA Astrophysics Data System (ADS)

    Tang, Jing; Rahmim, Arman

    2015-01-01

    A promising approach in PET image reconstruction is to incorporate high resolution anatomical information (measured from MR or CT) taking the anato-functional similarity measures such as mutual information or joint entropy (JE) as the prior. These similarity measures only classify voxels based on intensity values, while neglecting structural spatial information. In this work, we developed an anatomy-assisted maximum a posteriori (MAP) reconstruction algorithm wherein the JE measure is supplied by spatial information generated using wavelet multi-resolution analysis. The proposed wavelet-based JE (WJE) MAP algorithm involves calculation of derivatives of the subband JE measures with respect to individual PET image voxel intensities, which we have shown can be computed very similarly to how the inverse wavelet transform is implemented. We performed a simulation study with the BrainWeb phantom creating PET data corresponding to different noise levels. Realistically simulated T1-weighted MR images provided by BrainWeb modeling were applied in the anatomy-assisted reconstruction with the WJE-MAP algorithm and the intensity-only JE-MAP algorithm. Quantitative analysis showed that the WJE-MAP algorithm performed similarly to the JE-MAP algorithm at low noise level in the gray matter (GM) and white matter (WM) regions in terms of noise versus bias tradeoff. When noise increased to medium level in the simulated data, the WJE-MAP algorithm started to surpass the JE-MAP algorithm in the GM region, which is less uniform with smaller isolated structures compared to the WM region. In the high noise level simulation, the WJE-MAP algorithm presented clear improvement over the JE-MAP algorithm in both the GM and WM regions. In addition to the simulation study, we applied the reconstruction algorithms to real patient studies involving DPA-173 PET data and Florbetapir PET data with corresponding T1-MPRAGE MRI images. Compared to the intensity-only JE-MAP algorithm, the WJE-MAP algorithm resulted in comparable regional mean values to those from the maximum likelihood algorithm while reducing noise. Achieving robust performance in various noise-level simulation and patient studies, the WJE-MAP algorithm demonstrates its potential in clinical quantitative PET imaging.

  5. Systematic pharmacogenomics analysis of a Malay whole genome: proof of concept for personalized medicine.

    PubMed

    Salleh, Mohd Zaki; Teh, Lay Kek; Lee, Lian Shien; Ismet, Rose Iszati; Patowary, Ashok; Joshi, Kandarp; Pasha, Ayesha; Ahmed, Azni Zain; Janor, Roziah Mohd; Hamzah, Ahmad Sazali; Adam, Aishah; Yusoff, Khalid; Hoh, Boon Peng; Hatta, Fazleen Haslinda Mohd; Ismail, Mohamad Izwan; Scaria, Vinod; Sivasubbu, Sridhar

    2013-01-01

    With a higher throughput and lower cost in sequencing, second generation sequencing technology has immense potential for translation into clinical practice and in the realization of pharmacogenomics based patient care. The systematic analysis of whole genome sequences to assess patient to patient variability in pharmacokinetics and pharmacodynamics responses towards drugs would be the next step in future medicine in line with the vision of personalizing medicine. Genomic DNA obtained from a 55 years old, self-declared healthy, anonymous male of Malay descent was sequenced. The subject's mother died of lung cancer and the father had a history of schizophrenia and deceased at the age of 65 years old. A systematic, intuitive computational workflow/pipeline integrating custom algorithm in tandem with large datasets of variant annotations and gene functions for genetic variations with pharmacogenomics impact was developed. A comprehensive pathway map of drug transport, metabolism and action was used as a template to map non-synonymous variations with potential functional consequences. Over 3 million known variations and 100,898 novel variations in the Malay genome were identified. Further in-depth pharmacogenetics analysis revealed a total of 607 unique variants in 563 proteins, with the eventual identification of 4 drug transport genes, 2 drug metabolizing enzyme genes and 33 target genes harboring deleterious SNVs involved in pharmacological pathways, which could have a potential role in clinical settings. The current study successfully unravels the potential of personal genome sequencing in understanding the functionally relevant variations with potential influence on drug transport, metabolism and differential therapeutic outcomes. These will be essential for realizing personalized medicine through the use of comprehensive computational pipeline for systematic data mining and analysis.

  6. Face recognition using 3D facial shape and color map information: comparison and combination

    NASA Astrophysics Data System (ADS)

    Godil, Afzal; Ressler, Sandy; Grother, Patrick

    2004-08-01

    In this paper, we investigate the use of 3D surface geometry for face recognition and compare it to one based on color map information. The 3D surface and color map data are from the CAESAR anthropometric database. We find that the recognition performance is not very different between 3D surface and color map information using a principal component analysis algorithm. We also discuss the different techniques for the combination of the 3D surface and color map information for multi-modal recognition by using different fusion approaches and show that there is significant improvement in results. The effectiveness of various techniques is compared and evaluated on a dataset with 200 subjects in two different positions.

  7. Multi-system Component Phenotypes of Bipolar Disorder for Genetic Investigations of Extended Pedigrees

    PubMed Central

    Fears, Scott C.; Service, Susan K.; Kremeyer, Barbara; Araya, Carmen; Araya, Xinia; Bejarano, Julio; Ramirez, Margarita; Castrillón, Gabriel; Gomez-Franco, Juliana; Lopez, Maria C.; Montoya, Gabriel; Montoya, Patricia; Aldana, Ileana; Teshiba, Terri M.; Abaryan, Zvart; Al-Sharif, Noor B.; Ericson, Marissa; Jalbrzikowski, Maria; Luykx, Jurjen J.; Navarro, Linda; Tishler, Todd A.; Altshuler, Lori; Bartzokis, George; Escobar, Javier; Glahn, David C.; Ospina-Duque, Jorge; Risch, Neil; Ruiz-Linares, Andrés; Thompson, Paul M.; Cantor, Rita M.; Lopez-Jaramillo, Carlos; Macaya, Gabriel; Molina, Julio; Reus, Victor I.; Sabatti, Chiara; Freimer, Nelson B.; Bearden, Carrie E.

    2014-01-01

    IMPORTANCE Genetic factors contribute to risk for bipolar disorder (BP), yet its pathogenesis remains poorly understood. A focus on measuring multi-system quantitative traits that may be components of BP psychopathology may enable genetic dissection of this complex disorder, and investigation of extended pedigrees from genetically isolated populations may facilitate the detection of specific genetic variants that impact on BP as well as its component phenotypes. OBJECTIVE To identify quantitative neurocognitive, temperament-related, and neuroanatomic phenotypes that appear heritable and associated with severe bipolar disorder (BP-I), and therefore suitable for genetic linkage and association studies aimed at identifying variants contributing to BP-I risk. DESIGN Multi-generational pedigree study in two closely related, genetically isolated populations: the Central Valley of Costa Rica (CVCR) and Antioquia, Colombia (ANT). PARTICIPANTS 738 individuals, all from CVCR and ANT pedigrees, of whom 181 are affected with BP-I. MAIN OUTCOME MEASURE Familial aggregation (heritability) and association with BP-I of 169 quantitative neurocognitive, temperament, magnetic resonance imaging (MRI) and diffusion tensor imaging (DTI) phenotypes. RESULTS Seventy-five percent (126) of the phenotypes investigated were significantly heritable, and 31% (53) were associated with BP-I. About 1/4 of the phenotypes, including measures from each phenotype domain, were both heritable and associated with BP-I. Neuroimaging phenotypes, particularly cortical thickness in prefrontal and temporal regions, and volume and microstructural integrity of the corpus callosum, represented the most promising candidate traits for genetic mapping related to BP based on strong heritability and association with disease. Analyses of phenotypic and genetic covariation identified substantial correlations among the traits, at least some of which share a common underlying genetic architecture. CONCLUSIONS AND RELEVANCE This is the most extensive investigation of BP-relevant component phenotypes to date. Our results identify brain and behavioral quantitative traits that appear to be genetically influenced and show a pattern of BP-I-association within families that is consistent with expectations from case-control studies. Together these phenotypes provide a basis for identifying loci contributing to BP-I risk and for genetic dissection of the disorder. PMID:24522887

  8. Compound RYR1 heterozygosity resulting in a complex phenotype of malignant hyperthermia susceptibility and a core myopathy.

    PubMed

    Kraeva, N; Heytens, L; Jungbluth, H; Treves, S; Voermans, N; Kamsteeg, E; Ceuterick-de Groote, C; Baets, J; Riazi, S

    2015-07-01

    Malignant hyperthermia (MH) is a potentially fatal pharmacogenetic myopathy triggered by exposure to volatile anesthetics and/or depolarizing muscle relaxants. Susceptibility to MH is primarily associated with dominant mutations in the ryanodine receptor type 1 gene (RYR1). Recent genetic studies have shown that RYR1 variants are the most common cause of dominant and recessive congenital myopathies - central core and multi-minicore disease, congenital fiber type disproportion, and centronuclear myopathy. However, the MH status of many patients, especially with recessive RYR1-related myopathies, remains uncertain. We report the occurrence of a triplet of RYR1 variants, c.4711A>G (p.Ile1571Val), c.10097G>A (p.Arg3366His), c.11798A>G (p.Tyr3933Cys), found in cis in four unrelated families, one from Belgium, one from The Netherlands and two from Canada. Phenotype-genotype correlation analysis indicates that the presence of the triplet allele alone confers susceptibility to MH, and that the presence of this allele in a compound heterozygous state with the MH-associated RYR1 variant c.14545G>A (p.Val4849Ile) results in the MH susceptibility phenotype and a congenital myopathy with cores and rods. Our study underlines the notion that assigning pathogenicity to individual RYR1 variants or combination of variants, and counseling in RYR1-related myopathies may require integration of clinical, histopathological, in vitro contracture testing, MRI and genetic findings. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs)

    PubMed Central

    Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Alonso, M. Rosario; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Benitez, Javier; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M.; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F.; Fasching, Peter A.; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G.; Goldberg, Mark S.; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A.; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L.; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L.; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C.; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H.; Tessier, Daniel C.; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M.; Vincent, Daniel; Winqvist, Robert; Wu, Anna H.; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D. P.; Hall, Per; Edwards, Stacey L.; Simard, Jacques; French, Juliet D.; Chenevix-Trench, Georgia; Dunning, Alison M.

    2016-01-01

    Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90–0.94; P = 8.96 × 10−15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10−09, r2 = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10−11, r2 = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus. PMID:27600471

  10. The hutterite variant of Treacher Collins syndrome: a 28-year-old story solved.

    PubMed

    Caluseriu, Oana; Lowry, Brian R; McLeod, Ross; Lamont, Ryan; Parboosingh, Jillian S; Bernier, Francois P; Innes, A Micheil

    2013-11-01

    Treacher Collins syndrome (TCS), the best known form of mandibulofacial dysostosis (MFD) comprises a recognizable pattern of anomalies. In 1985, Lowry et al. reported on two Hutterite sisters born to apparently unaffected parents with TCS, raising the possibility of an autosomal recessive (AR) variant of TCS, subsequently given a unique Mendelian Inheritance of Man (MIM) number (248390). Recently, biallelic mutations in POLR1C were found in TCS patients, confirming AR TCS as a distinct entity. The Hutterites, an endogamous Anabaptist group, like other genetically isolated populations, provide a powerful resource for mapping AR disorders. We elected to study the molecular basis of TCS in the Hutterite population including the original kindred described in 1985, and another unrelated Hutterite patient. Prior to starting this study, a TCOF1 mutation had apparently been excluded in the original family at two outside institutions. We hypothesized that an AR variant of TCS was present in the three Hutterite patients, but homozygosity mapping did not show convincing evidence of shared regions between the affected individuals. TCOF1 analysis was undertaken and mutations were found in the three affected patients and an unaffected parent. These data show that the initial Hutterite family reported with AR TCS in fact has classic TCS due to a TCOF1 mutation, despite recent data confirming the existence of AR TCS in other populations. These results have significant counseling implications for the affected families in the Hutterite population and in the population at large. © 2013 Wiley Periodicals, Inc. © 2013 Wiley Periodicals, Inc.

  11. Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs).

    PubMed

    Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Alonso, M Rosario; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Benitez, Javier; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F; Fasching, Peter A; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G; Goldberg, Mark S; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H; Tessier, Daniel C; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M; Vincent, Daniel; Winqvist, Robert; Wu, Anna H; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D P; Hall, Per; Edwards, Stacey L; Simard, Jacques; French, Juliet D; Chenevix-Trench, Georgia; Dunning, Alison M

    2016-09-07

    Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10(-15))) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10(-09), r(2) = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10(-11), r(2) = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.

  12. Community-Based Services that Facilitate Interoperability and Intercomparison of Precipitation Datasets from Multiple Sources

    NASA Technical Reports Server (NTRS)

    Liu, Zhong; Kempler, Steven; Teng, William; Leptoukh, Gregory; Ostrenga, Dana

    2010-01-01

    Over the past 12 years, large volumes of precipitation data have been generated from space-based observatories (e.g., TRMM), merging of data products (e.g., gridded 3B42), models (e.g., GMAO), climatologies (e.g., Chang SSM/I derived rain indices), field campaigns, and ground-based measuring stations. The science research, applications, and education communities have greatly benefited from the unrestricted availability of these data from the Goddard Earth Sciences Data and Information Services Center (GES DISC) and, in particular, the services tailored toward precipitation data access and usability. In addition, tools and services that are responsive to the expressed evolving needs of the precipitation data user communities have been developed at the Precipitation Data and Information Services Center (PDISC) (http://disc.gsfc.nasa.gov/precipitation or google NASA PDISC), located at the GES DISC, to provide users with quick data exploration and access capabilities. In recent years, data management and access services have become increasingly sophisticated, such that they now afford researchers, particularly those interested in multi-data set science analysis and/or data validation, the ability to homogenize data sets, in order to apply multi-variant, comparison, and evaluation functions. Included in these services is the ability to capture data quality and data provenance. These interoperability services can be directly applied to future data sets, such as those from the Global Precipitation Measurement (GPM) mission. This presentation describes the data sets and services at the PDISC that are currently used by precipitation science and applications researchers, and which will be enhanced in preparation for GPM and associated multi-sensor data research. Specifically, the GES-DISC Interactive Online Visualization ANd aNalysis Infrastructure (Giovanni) will be illustrated. Giovanni enables scientific exploration of Earth science data without researchers having to perform the complicated data access and match-up processes. In addition, PDISC tool and service capabilities being adapted for GPM data will be described, including the Google-like Mirador data search and access engine; semantic technology to help manage large amounts of multi-sensor data and their relationships; data access through various Web services (e.g., OPeNDAP, GDS, WMS, WCS); conversion to various formats (e.g., netCDF, HDF, KML (for Google Earth)); visualization and analysis of Level 2 data profiles and maps; parameter and spatial subsetting; time and temporal aggregation; regridding; data version control and provenance; continuous archive verification; and expertise in data-related standards and interoperability. The goal of providing these services is to further the progress towards a common framework by which data analysis/validation can be more easily accomplished.

  13. Digital map and situation surface: a team-oriented multidisplay workspace for network enabled situation analysis

    NASA Astrophysics Data System (ADS)

    Peinsipp-Byma, E.; Geisler, Jürgen; Bader, Thomas

    2009-05-01

    System concepts for network enabled image-based ISR (intelligence, surveillance, reconnaissance) is the major mission of Fraunhofer IITB's applied research in the area of defence and security solutions. For the TechDemo08 as part of the NATO CNAD POW Defence against terrorism Fraunhofer IITB advanced a new multi display concept to handle the shear amount and high complexity of ISR data acquired by networked, distributed surveillance systems with the objective to support the generation of a common situation picture. Amount and Complexity of ISR data demands an innovative man-machine interface concept for humans to deal with it. The IITB's concept is the Digital Map & Situation Surface. This concept offers to the user a coherent multi display environment combining a horizontal surface for the situation overview from the bird's eye view, an attached vertical display for collateral information and so-called foveatablets as personalized magic lenses in order to obtain high resolved and role-specific information about a focused areaof- interest and to interact with it. In the context of TechDemo08 the Digital Map & Situation Surface served as workspace for team-based situation visualization and analysis. Multiple sea- and landside surveillance components were connected to the system.

  14. Determination of the Impact of Urbanization on Agricultural Lands using Multi-temporal Satellite Sensor Images

    NASA Astrophysics Data System (ADS)

    Kaya, S.; Alganci, U.; Sertel, E.; Ustundag, B.

    2015-12-01

    Throughout the history, agricultural activities have been performed close to urban areas. Main reason behind this phenomenon is the need of fast marketing of the agricultural production to urban residents and financial provision. Thus, using the areas nearby cities for agricultural activities brings out advantage of easy transportation of productions and fast marketing. For decades, heavy migration to cities has directly and negatively affected natural grasslands, forests and agricultural lands. This pressure has caused agricultural lands to be changed into urban areas. Dense urbanization causes increase in impervious surfaces, heat islands and many other problems in addition to destruction of agricultural lands. Considering the negative impacts of urbanization on agricultural lands and natural resources, a periodic monitoring of these changes becomes indisputably important. At this point, satellite images are known to be good data sources for land cover / use change monitoring with their fast data acquisition, large area coverages and temporal resolution properties. Classification of the satellite images provides thematic the land cover / use maps of the earth surface and changes can be determined with GIS based analysis multi-temporal maps. In this study, effects of heavy urbanization over agricultural lands in Istanbul, metropolitan city of Turkey, were investigated with use of multi-temporal Landsat TM satellite images acquired between 1984 and 2011. Images were geometrically registered to each other and classified using supervised maximum likelihood classification algorithm. Resulting thematic maps were exported to GIS environment and destructed agricultural lands by urbanization were determined using spatial analysis.

  15. Mapping Ribonucleotides Incorporated into DNA by Hydrolytic End-Sequencing.

    PubMed

    Orebaugh, Clinton D; Lujan, Scott A; Burkholder, Adam B; Clausen, Anders R; Kunkel, Thomas A

    2018-01-01

    Ribonucleotides embedded within DNA render the DNA sensitive to the formation of single-stranded breaks under alkali conditions. Here, we describe a next-generation sequencing method called hydrolytic end sequencing (HydEn-seq) to map ribonucleotides inserted into the genome of Saccharomyce cerevisiae strains deficient in ribonucleotide excision repair. We use this method to map several genomic features in wild-type and replicase variant yeast strains.

  16. Mapping of disease-associated variants in admixed populations

    PubMed Central

    2011-01-01

    Recent developments in high-throughput genotyping and whole-genome sequencing will enhance the identification of disease loci in admixed populations. We discuss how a more refined estimation of ancestry benefits both admixture mapping and association mapping, making disease loci identification in admixed populations more powerful. High-throughput genotyping and sequencing will enable refined estimation of ancestry, thus enhancing disease loci identification in admixed populations PMID:21635713

  17. Impact of scale on morphological spatial pattern of forest

    Treesearch

    Katarzyna Ostapowicz; Peter Vogt; Kurt H. Riitters; Jacek Kozak; Christine Estreguil

    2008-01-01

    Assessing and monitoring landscape pattern structure from multi-scale land-cover maps can utilize morphological spatial pattern analysis (MSPA), only if various influences of scale are known and taken into account. This paper lays part of the foundation for applying MSPA analysis in landscape monitoring by quantifying scale effects on six classes of spatial patterns...

  18. Inlining 3d Reconstruction, Multi-Source Texture Mapping and Semantic Analysis Using Oblique Aerial Imagery

    NASA Astrophysics Data System (ADS)

    Frommholz, D.; Linkiewicz, M.; Poznanska, A. M.

    2016-06-01

    This paper proposes an in-line method for the simplified reconstruction of city buildings from nadir and oblique aerial images that at the same time are being used for multi-source texture mapping with minimal resampling. Further, the resulting unrectified texture atlases are analyzed for façade elements like windows to be reintegrated into the original 3D models. Tests on real-world data of Heligoland/ Germany comprising more than 800 buildings exposed a median positional deviation of 0.31 m at the façades compared to the cadastral map, a correctness of 67% for the detected windows and good visual quality when being rendered with GPU-based perspective correction. As part of the process building reconstruction takes the oriented input images and transforms them into dense point clouds by semi-global matching (SGM). The point sets undergo local RANSAC-based regression and topology analysis to detect adjacent planar surfaces and determine their semantics. Based on this information the roof, wall and ground surfaces found get intersected and limited in their extension to form a closed 3D building hull. For texture mapping the hull polygons are projected into each possible input bitmap to find suitable color sources regarding the coverage and resolution. Occlusions are detected by ray-casting a full-scale digital surface model (DSM) of the scene and stored in pixel-precise visibility maps. These maps are used to derive overlap statistics and radiometric adjustment coefficients to be applied when the visible image parts for each building polygon are being copied into a compact texture atlas without resampling whenever possible. The atlas bitmap is passed to a commercial object-based image analysis (OBIA) tool running a custom rule set to identify windows on the contained façade patches. Following multi-resolution segmentation and classification based on brightness and contrast differences potential window objects are evaluated against geometric constraints and conditionally grown, fused and filtered morphologically. The output polygons are vectorized and reintegrated into the previously reconstructed buildings by sparsely ray-tracing their vertices. Finally the enhanced 3D models get stored as textured geometry for visualization and semantically annotated "LOD-2.5" CityGML objects for GIS applications.

  19. Satellite SAR interferometric techniques applied to emergency mapping

    NASA Astrophysics Data System (ADS)

    Stefanova Vassileva, Magdalena; Riccardi, Paolo; Lecci, Daniele; Giulio Tonolo, Fabio; Boccardo Boccardo, Piero; Chiesa, Giuliana; Angeluccetti, Irene

    2017-04-01

    This paper aim to investigate the capabilities of the currently available SAR interferometric algorithms in the field of emergency mapping. Several tests have been performed exploiting the Copernicus Sentinel-1 data using the COTS software ENVI/SARscape 5.3. Emergency Mapping can be defined as "creation of maps, geo-information products and spatial analyses dedicated to providing situational awareness emergency management and immediate crisis information for response by means of extraction of reference (pre-event) and crisis (post-event) geographic information/data from satellite or aerial imagery". The conventional differential SAR interferometric technique (DInSAR) and the two currently available multi-temporal SAR interferometric approaches, i.e. Permanent Scatterer Interferometry (PSI) and Small BAseline Subset (SBAS), have been applied to provide crisis information useful for the emergency management activities. Depending on the considered Emergency Management phase, it may be distinguished between rapid mapping, i.e. fast provision of geospatial data regarding the area affected for the immediate emergency response, and monitoring mapping, i.e. detection of phenomena for risk prevention and mitigation activities. In order to evaluate the potential and limitations of the aforementioned SAR interferometric approaches for the specific rapid and monitoring mapping application, five main factors have been taken into account: crisis information extracted, input data required, processing time and expected accuracy. The results highlight that DInSAR has the capacity to delineate areas affected by large and sudden deformations and fulfills most of the immediate response requirements. The main limiting factor of interferometry is the availability of suitable SAR acquisition immediately after the event (e.g. Sentinel-1 mission characterized by 6-day revisiting time may not always satisfy the immediate emergency request). PSI and SBAS techniques are suitable to produce monitoring maps for risk prevention and mitigation purposes. Nevertheless, multi-temporal techniques require large SAR temporal datasets, i.e. 20 and more images. Being the Sentinel-1 missions operational only since April 2014, multi-mission SAR datasets should be therefore exploited to carry out historical analysis.

  20. mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications.

    PubMed

    Hach, Faraz; Sarrafi, Iman; Hormozdiari, Farhad; Alkan, Can; Eichler, Evan E; Sahinalp, S Cenk

    2014-07-01

    High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the 'best' mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such as structural variation detection, where it is important to report multiple mapping loci for each read. For this purpose we introduce mrsFAST-Ultra, a fast, cache oblivious, SNP-aware aligner that can handle the multi-mapping of HTS reads very efficiently. mrsFAST-Ultra improves mrsFAST, our first cache oblivious read aligner capable of handling multi-mapping reads, through new and compact index structures that reduce not only the overall memory usage but also the number of CPU operations per alignment. In fact the size of the index generated by mrsFAST-Ultra is 10 times smaller than that of mrsFAST. As importantly, mrsFAST-Ultra introduces new features such as being able to (i) obtain the best mapping loci for each read, and (ii) return all reads that have at most n mapping loci (within an error threshold), together with these loci, for any user specified n. Furthermore, mrsFAST-Ultra is SNP-aware, i.e. it can map reads to reference genome while discounting the mismatches that occur at common SNP locations provided by db-SNP; this significantly increases the number of reads that can be mapped to the reference genome. Notice that all of the above features are implemented within the index structure and are not simple post-processing steps and thus are performed highly efficiently. Finally, mrsFAST-Ultra utilizes multiple available cores and processors and can be tuned for various memory settings. Our results show that mrsFAST-Ultra is roughly five times faster than its predecessor mrsFAST. In comparison to newly enhanced popular tools such as Bowtie2, it is more sensitive (it can report 10 times or more mappings per read) and much faster (six times or more) in the multi-mapping mode. Furthermore, mrsFAST-Ultra has an index size of 2GB for the entire human reference genome, which is roughly half of that of Bowtie2. mrsFAST-Ultra is open source and it can be accessed at http://mrsfast.sourceforge.net. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Top