Sample records for bayesian multilocus association

  1. Fast genomic predictions via Bayesian G-BLUP and multilocus models of threshold traits including censored Gaussian data.

    PubMed

    Kärkkäinen, Hanni P; Sillanpää, Mikko J

    2013-09-04

    Because of the increased availability of genome-wide sets of molecular markers along with reduced cost of genotyping large samples of individuals, genomic estimated breeding values have become an essential resource in plant and animal breeding. Bayesian methods for breeding value estimation have proven to be accurate and efficient; however, the ever-increasing data sets are placing heavy demands on the parameter estimation algorithms. Although a commendable number of fast estimation algorithms are available for Bayesian models of continuous Gaussian traits, there is a shortage for corresponding models of discrete or censored phenotypes. In this work, we consider a threshold approach of binary, ordinal, and censored Gaussian observations for Bayesian multilocus association models and Bayesian genomic best linear unbiased prediction and present a high-speed generalized expectation maximization algorithm for parameter estimation under these models. We demonstrate our method with simulated and real data. Our example analyses suggest that the use of the extra information present in an ordered categorical or censored Gaussian data set, instead of dichotomizing the data into case-control observations, increases the accuracy of genomic breeding values predicted by Bayesian multilocus association models or by Bayesian genomic best linear unbiased prediction. Furthermore, the example analyses indicate that the correct threshold model is more accurate than the directly used Gaussian model with a censored Gaussian data, while with a binary or an ordinal data the superiority of the threshold model could not be confirmed.

  2. Fast Genomic Predictions via Bayesian G-BLUP and Multilocus Models of Threshold Traits Including Censored Gaussian Data

    PubMed Central

    Kärkkäinen, Hanni P.; Sillanpää, Mikko J.

    2013-01-01

    Because of the increased availability of genome-wide sets of molecular markers along with reduced cost of genotyping large samples of individuals, genomic estimated breeding values have become an essential resource in plant and animal breeding. Bayesian methods for breeding value estimation have proven to be accurate and efficient; however, the ever-increasing data sets are placing heavy demands on the parameter estimation algorithms. Although a commendable number of fast estimation algorithms are available for Bayesian models of continuous Gaussian traits, there is a shortage for corresponding models of discrete or censored phenotypes. In this work, we consider a threshold approach of binary, ordinal, and censored Gaussian observations for Bayesian multilocus association models and Bayesian genomic best linear unbiased prediction and present a high-speed generalized expectation maximization algorithm for parameter estimation under these models. We demonstrate our method with simulated and real data. Our example analyses suggest that the use of the extra information present in an ordered categorical or censored Gaussian data set, instead of dichotomizing the data into case-control observations, increases the accuracy of genomic breeding values predicted by Bayesian multilocus association models or by Bayesian genomic best linear unbiased prediction. Furthermore, the example analyses indicate that the correct threshold model is more accurate than the directly used Gaussian model with a censored Gaussian data, while with a binary or an ordinal data the superiority of the threshold model could not be confirmed. PMID:23821618

  3. pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies.

    PubMed

    Zhang, J; Feng, J-Y; Ni, Y-L; Wen, Y-J; Niu, Y; Tamba, C L; Yue, C; Song, Q; Zhang, Y-M

    2017-06-01

    Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus model in GWAS is still difficult. In this study, we integrated least angle regression with empirical Bayes to perform multilocus GWAS under polygenic background control. We used an algorithm of model transformation that whitened the covariance matrix of the polygenic matrix K and environmental noise. Markers on one chromosome were included simultaneously in a multilocus model and least angle regression was used to select the most potentially associated single-nucleotide polymorphisms (SNPs), whereas the markers on the other chromosomes were used to calculate kinship matrix as polygenic background control. The selected SNPs in multilocus model were further detected for their association with the trait by empirical Bayes and likelihood ratio test. We herein refer to this method as the pLARmEB (polygenic-background-control-based least angle regression plus empirical Bayes). Results from simulation studies showed that pLARmEB was more powerful in QTN detection and more accurate in QTN effect estimation, had less false positive rate and required less computing time than Bayesian hierarchical generalized linear model, efficient mixed model association (EMMA) and least angle regression plus empirical Bayes. pLARmEB, multilocus random-SNP-effect mixed linear model and fast multilocus random-SNP-effect EMMA methods had almost equal power of QTN detection in simulation experiments. However, only pLARmEB identified 48 previously reported genes for 7 flowering time-related traits in Arabidopsis thaliana.

  4. A Bayesian Method for Evaluating and Discovering Disease Loci Associations

    PubMed Central

    Jiang, Xia; Barmada, M. Michael; Cooper, Gregory F.; Becich, Michael J.

    2011-01-01

    Background A genome-wide association study (GWAS) typically involves examining representative SNPs in individuals from some population. A GWAS data set can concern a million SNPs and may soon concern billions. Researchers investigate the association of each SNP individually with a disease, and it is becoming increasingly commonplace to also analyze multi-SNP associations. Techniques for handling so many hypotheses include the Bonferroni correction and recently developed Bayesian methods. These methods can encounter problems. Most importantly, they are not applicable to a complex multi-locus hypothesis which has several competing hypotheses rather than only a null hypothesis. A method that computes the posterior probability of complex hypotheses is a pressing need. Methodology/Findings We introduce the Bayesian network posterior probability (BNPP) method which addresses the difficulties. The method represents the relationship between a disease and SNPs using a directed acyclic graph (DAG) model, and computes the likelihood of such models using a Bayesian network scoring criterion. The posterior probability of a hypothesis is computed based on the likelihoods of all competing hypotheses. The BNPP can not only be used to evaluate a hypothesis that has previously been discovered or suspected, but also to discover new disease loci associations. The results of experiments using simulated and real data sets are presented. Our results concerning simulated data sets indicate that the BNPP exhibits both better evaluation and discovery performance than does a p-value based method. For the real data sets, previous findings in the literature are confirmed and additional findings are found. Conclusions/Significance We conclude that the BNPP resolves a pressing problem by providing a way to compute the posterior probability of complex multi-locus hypotheses. A researcher can use the BNPP to determine the expected utility of investigating a hypothesis further. 
    Furthermore, we conclude that the BNPP is a promising method for discovering disease loci associations. PMID:21853025

  5. In situ genetic differentiation in a Hispaniolan lizard (Ameiva chrysolaema): a multilocus perspective.

    PubMed

    Gifford, Matthew E; Larson, Allan

    2008-10-01

    A previous phylogeographic study of mitochondrial haplotypes for the Hispaniolan lizard Ameiva chrysolaema revealed deep genetic structure associated with seawater inundation during the late Pliocene/early Pleistocene and evidence of subsequent population expansion into formerly inundated areas. We revisit hypotheses generated by our previous study using increased geographic sampling of populations and analysis of three nuclear markers (alpha-enolase intron 8, alpha-cardiac-actin intron 4, and beta-actin intron 3) in addition to mitochondrial haplotypes (ND2). Large genetic discontinuities correspond spatially and temporally with historical barriers to gene flow (sea inundations). NCPA cross-validation analysis and Bayesian multilocus analyses of divergence times (IMa and MCMCcoal) reveal two separate episodes of fragmentation associated with Pliocene and Pleistocene sea inundations, separating the species into historically separate Northern, East-Central, West-Central, and Southern population lineages. Multilocus Bayesian analysis using IMa indicates asymmetrical migration from the East-Central to the West-Central populations following secondary contact, consistent with expectations from the more pervasive sea inundation in the western region. The West-Central lineage has a genetic signature of population growth consistent with the expectation of geographic expansion into formerly inundated areas. Within each lineage, significant spatial genetic structure indicates isolation by distance at comparable temporal scales. This study adds to the growing body of evidence that vicariant speciation may be the prevailing source of lineage accumulation on oceanic islands. Thus, prior theories of island biogeography generally underestimate the role and temporal scale of intra-island vicariant processes.

  6. Major clades of Agaricales: a multilocus phylogenetic overview.

    Treesearch

    P. Brandon Matheny; Judd M. Curtis; Valerie Hofstetter; M. Catherine Aime; Jean-Marc Moncalvo; Zai-Wei Ge; Zhu-Liang Yang; Joseph F. Ammirati; Timothy J. Baroni; Neale L. Bougher; Karen W. Hughes; D. Jean Lodge; Richard W. Kerrigan; Michelle T. Seidl; Duur K. Aanen; Matthew DeNitis; Graciela M. Daniele; Dennis E. Desjardin; Bradley R. Kropp; Lorelei L. Norvell; Andrew Parker; Else C. Vellinga; Rytas Vilgalys; David S. Hibbett

    2006-01-01

    An overview of the phylogeny of the Agaricales is presented based on a multilocus analysis of a six-gene region supermatrix. Bayesian analyses of 5611 nucleotide characters of rpb1, rpb1-intron 2, rpb2 and 18S, 25S, and 5.8S ribosomal RNA genes recovered six major clades, which are recognized informally and labeled the Agaricoid, Tricholomatoid, Marasmioid, Pluteoid,...

  7. Detection of Epistasis for Flowering Time Using Bayesian Multilocus Estimation in a Barley MAGIC Population

    PubMed Central

    Mathew, Boby; Léon, Jens; Sannemann, Wiebke; Sillanpää, Mikko J.

    2018-01-01

    Gene-by-gene interactions, also known as epistasis, regulate many complex traits in different species. With the availability of low-cost genotyping it is now possible to study epistasis on a genome-wide scale. However, identifying genome-wide epistasis is a high-dimensional multiple regression problem and needs the application of dimensionality reduction techniques. Flowering Time (FT) in crops is a complex trait that is known to be influenced by many interacting genes and pathways in various crops. In this study, we successfully apply Sure Independence Screening (SIS) for dimensionality reduction to identify two-way and three-way epistasis for the FT trait in a Multiparent Advanced Generation Inter-Cross (MAGIC) barley population using the Bayesian multilocus model. The MAGIC barley population was generated from intercrossing among eight parental lines and thus, offered greater genetic diversity to detect higher-order epistatic interactions. Our results suggest that SIS is an efficient dimensionality reduction approach to detect high-order interactions in a Bayesian multilocus model. We also observe that many of our findings (genomic regions with main or higher-order epistatic effects) overlap with known candidate genes that have been already reported in barley and closely related species for the FT trait. PMID:29254994

  8. Multilocus approach to clarify species status and the divergence history of the Bemisia tabaci (Hemiptera: Aleyrodidae) species complex.

    PubMed

    Hsieh, Chia-Hung; Ko, Chiun-Cheng; Chung, Cheng-Han; Wang, Hurng-Yi

    2014-07-01

    The sweet potato whitefly, Bemisia tabaci, is a highly differentiated species complex. Despite consisting of several morphologically indistinguishable entities and frequent invasions on all continents with important associated economic losses, the phylogenetic relationships, species status, and evolutionary history of this species complex are still debated. We sequenced and analyzed one mitochondrial and three single-copy nuclear genes from 9 of the 12 genetic groups of B. tabaci and 5 closely related species. Bayesian species delimitation was applied to investigate the speciation events of B. tabaci. The species statuses of the different genetic groups were strongly supported under different prior settings and phylogenetic scenarios. Divergence histories were estimated by a multispecies coalescence approach implemented in *BEAST. Based on the mitochondrial locus, B. tabaci originated 6.47 million years ago (MYA). Nevertheless, the time was 1.25 MYA based on nuclear loci. According to the method of approximate Bayesian computation, this difference is probably due to different degrees of migration among loci; i.e., although the mitochondrial locus had differentiated, gene flow at nuclear loci was still possible, a scenario similar to a parapatric mode of speciation. This is the first study in whiteflies using multilocus data and incorporating Bayesian coalescence approaches, both of which provide a more biologically realistic framework for delimiting species status and delineating the divergence history of B. tabaci. Our study illustrates that gene flow during species divergence should not be overlooked and has a great impact on divergence time estimation. Copyright © 2014 Elsevier Inc. All rights reserved.

  9. Functional Multi-Locus QTL Mapping of Temporal Trends in Scots Pine Wood Traits

    PubMed Central

    Li, Zitong; Hallingbäck, Henrik R.; Abrahamsson, Sara; Fries, Anders; Gull, Bengt Andersson; Sillanpää, Mikko J.; García-Gil, M. Rosario

    2014-01-01

    Quantitative trait loci (QTL) mapping of wood properties in conifer species has focused on single time point measurements or on trait means based on heterogeneous wood samples (e.g., increment cores), thus ignoring systematic within-tree trends. In this study, functional QTL mapping was performed for a set of important wood properties in increment cores from a 17-yr-old Scots pine (Pinus sylvestris L.) full-sib family with the aim of detecting wood trait QTL for general intercepts (means) and for linear slopes by increasing cambial age. Two multi-locus functional QTL analysis approaches were proposed and their performances were compared on trait datasets comprising 2 to 9 time points, 91 to 455 individual tree measurements and genotype datasets of amplified fragment length polymorphism (AFLP) and single nucleotide polymorphism (SNP) markers. The first method was a multilevel LASSO analysis whereby trend parameter estimation and QTL mapping were conducted consecutively; the second method was our Bayesian linear mixed model whereby trends and underlying genetic effects were estimated simultaneously. We also compared several different hypothesis testing methods under either the LASSO or the Bayesian framework to perform QTL inference. In total, five and four significant QTL were observed for the intercepts and slopes, respectively, across wood traits such as earlywood percentage, wood density, radial fiberwidth, and spiral grain angle. Four of these QTL were represented by candidate gene SNPs, thus providing promising targets for future research in QTL mapping and molecular function. Bayesian and LASSO methods both detected similar sets of QTL given datasets that comprised large numbers of individuals. PMID:25305041

  10. Functional multi-locus QTL mapping of temporal trends in Scots pine wood traits.

    PubMed

    Li, Zitong; Hallingbäck, Henrik R; Abrahamsson, Sara; Fries, Anders; Gull, Bengt Andersson; Sillanpää, Mikko J; García-Gil, M Rosario

    2014-10-09

    Quantitative trait loci (QTL) mapping of wood properties in conifer species has focused on single time point measurements or on trait means based on heterogeneous wood samples (e.g., increment cores), thus ignoring systematic within-tree trends. In this study, functional QTL mapping was performed for a set of important wood properties in increment cores from a 17-yr-old Scots pine (Pinus sylvestris L.) full-sib family with the aim of detecting wood trait QTL for general intercepts (means) and for linear slopes by increasing cambial age. Two multi-locus functional QTL analysis approaches were proposed and their performances were compared on trait datasets comprising 2 to 9 time points, 91 to 455 individual tree measurements and genotype datasets of amplified fragment length polymorphism (AFLP) and single nucleotide polymorphism (SNP) markers. The first method was a multilevel LASSO analysis whereby trend parameter estimation and QTL mapping were conducted consecutively; the second method was our Bayesian linear mixed model whereby trends and underlying genetic effects were estimated simultaneously. We also compared several different hypothesis testing methods under either the LASSO or the Bayesian framework to perform QTL inference. In total, five and four significant QTL were observed for the intercepts and slopes, respectively, across wood traits such as earlywood percentage, wood density, radial fiberwidth, and spiral grain angle. Four of these QTL were represented by candidate gene SNPs, thus providing promising targets for future research in QTL mapping and molecular function. Bayesian and LASSO methods both detected similar sets of QTL given datasets that comprised large numbers of individuals. Copyright © 2014 Li et al.

  11. Multilocus phylogeographic assessment of the California Mountain Kingsnake (Lampropeltis zonata) suggests alternative patterns of diversification for the California Floristic Province.

    PubMed

    Myers, E A; Rodríguez-Robles, J A; Denardo, D F; Staub, R E; Stropoli, A; Ruane, S; Burbrink, F T

    2013-11-01

    Phylogeographic inference can determine the timing of population divergence, historical demographic processes, patterns of migration, and when extended to multiple species, the history of communities. Single-locus analyses can mislead interpretations of the evolutionary history of taxa and comparative analyses. It is therefore important to revisit previous single-locus phylogeographic studies, particularly those that have been used to propose general patterns for regional biotas and the processes responsible for generating inferred patterns. Here, we employ a multilocus statistical approach to re-examine the phylogeography of Lampropeltis zonata. Using nonparametric and Bayesian species delimitation, we determined that there are two well-supported species within L. zonata. Ecological niche modelling supports the delimitation of these taxa, suggesting that the two species inhabit distinct climatic environments. Gene flow between the two taxa is low and appears to occur unidirectionally. Further, our data suggest that gene flow was mediated by females, a rare pattern in snakes. In contrast to previous analyses, we determined that the divergence between the two lineages occurred in the late Pliocene (c. 2.07 Ma). Spatially and temporally, the divergence of these lineages is associated with the inundation of central California by the Monterey Bay. The effective population sizes of the two species appear to have been unaffected by Pleistocene glaciation. Our increased sampling of loci for L. zonata, combined with previously published multilocus analyses of other sympatric species, suggests that previous conclusions reached by comparative phylogeographic studies conducted within the California Floristic Province should be reassessed. © 2013 John Wiley & Sons Ltd.

  12. Integration of least angle regression with empirical Bayes for multi-locus genome-wide association studies

    USDA-ARS's Scientific Manuscript database

    Multi-locus genome-wide association studies have become the state-of-the-art procedure to identify quantitative trait loci (QTL) associated with traits simultaneously. However, implementation of multi-locus model is still difficult. In this study, we integrated least angle regression with empirical B...

  13. Response to comment on "Nuclear genomic sequences reveal that polar bears are an old and distinct bear lineage".

    PubMed

    Hailer, Frank; Kutschera, Verena E; Hallström, Björn M; Fain, Steven R; Leonard, Jennifer A; Arnason, Ulfur; Janke, Axel

    2013-03-29

    Nakagome et al. reanalyzed some of our data and assert that we cannot refute the mitochondrial DNA-based scenario for polar bear evolution. Their single-locus test statistic is strongly affected by introgression and incomplete lineage sorting, whereas our multilocus approaches are better suited to recover the true species relationships. Indeed, our sister-lineage model receives high support in a Bayesian model comparison.

  14. Estimation of selection intensity under overdominance by Bayesian methods.

    PubMed

    Buzbas, Erkan Ozge; Joyce, Paul; Abdo, Zaid

    2009-01-01

    A balanced pattern in the allele frequencies of polymorphic loci is a potential sign of selection, particularly of overdominance. Although this type of selection is of some interest in population genetics, there exists no likelihood based approaches specifically tailored to make inference on selection intensity. To fill this gap, we present Bayesian methods to estimate selection intensity under k-allele models with overdominance. Our model allows for an arbitrary number of loci and alleles within a locus. The neutral and selected variability within each locus are modeled with corresponding k-allele models. To estimate the posterior distribution of the mean selection intensity in a multilocus region, a hierarchical setup between loci is used. The methods are demonstrated with data at the Human Leukocyte Antigen loci from world-wide populations.

  15. Origin, Migration Routes and Worldwide Population Genetic Structure of the Wheat Yellow Rust Pathogen Puccinia striiformis f.sp. tritici

    PubMed Central

    Ali, Sajid; Gladieux, Pierre; Leconte, Marc; Gautier, Angélique; Justesen, Annemarie F.; Hovmøller, Mogens S.; Enjalbert, Jérôme; de Vallavieille-Pope, Claude

    2014-01-01

    Analyses of large-scale population structure of pathogens enable the identification of migration patterns, diversity reservoirs or longevity of populations, the understanding of current evolutionary trajectories and the anticipation of future ones. This is particularly important for long-distance migrating fungal pathogens such as Puccinia striiformis f.sp. tritici (PST), capable of rapid spread to new regions and crop varieties. Although a range of recent PST invasions at continental scales are well documented, the worldwide population structure and the center of origin of the pathogen were still unknown. In this study, we used multilocus microsatellite genotyping to infer worldwide population structure of PST and the origin of new invasions based on 409 isolates representative of distribution of the fungus on six continents. Bayesian and multivariate clustering methods partitioned the set of multilocus genotypes into six distinct genetic groups associated with their geographical origin. Analyses of linkage disequilibrium and genotypic diversity indicated a strong regional heterogeneity in levels of recombination, with clear signatures of recombination in the Himalayan (Nepal and Pakistan) and near-Himalayan regions (China) and a predominant clonal population structure in other regions. The higher genotypic diversity, recombinant population structure and high sexual reproduction ability in the Himalayan and neighboring regions suggests this area as the putative center of origin of PST. We used clustering methods and approximate Bayesian computation (ABC) to compare different competing scenarios describing ancestral relationship among ancestral populations and more recently founded populations. 
    Our analyses confirmed the Middle East-East Africa as the most likely source of newly spreading, high-temperature-adapted strains; Europe as the source of South American, North American and Australian populations; and Mediterranean-Central Asian populations as the origin of South African populations. Although most geographic populations are not markedly affected by recent dispersal events, this study emphasizes the influence of human activities on recent long-distance spread of the pathogen. PMID:24465211

  16. Is the extremely rare Iberian endemic plant species Castrilanthemum debeauxii (Compositae, Anthemideae) a 'living fossil'? Evidence from a multi-locus species tree reconstruction.

    PubMed

    Tomasello, Salvatore; Álvarez, Inés; Vargas, Pablo; Oberprieler, Christoph

    2015-01-01

    The present study provides results of multi-species coalescent species tree analyses of DNA sequences sampled from multiple nuclear and plastid regions to infer the phylogenetic relationships among the members of the subtribe Leucanthemopsidinae (Compositae, Anthemideae), to which besides the annual Castrilanthemum debeauxii (Degen, Hervier & É.Rev.) Vogt & Oberp., one of the rarest flowering plant species of the Iberian Peninsula, two other unispecific genera (Hymenostemma, Prolongoa), and the polyploid complex of the genus Leucanthemopsis belong. Based on sequence information from two single- to low-copy nuclear regions (C16, D35, characterised by Chapman et al. (2007)), the multi-copy region of the nrDNA internal transcribed spacer regions ITS1 and ITS2, and two intergenic spacer regions of the cpDNA, gene trees were reconstructed using Bayesian inference methods. For the reconstruction of a multi-locus species tree we applied three different methods: (a) analysis of concatenated sequences using Bayesian inference (MrBayes), (b) a tree reconciliation approach by minimizing the number of deep coalescences (PhyloNet), and (c) a coalescent-based species-tree method in a Bayesian framework (*BEAST). All three species tree reconstruction methods unequivocally support the close relationship of the subtribe with the hitherto unclassified genus Phalacrocarpum, the sister-group relationship of Castrilanthemum with the three remaining genera of the subtribe, and the further sister-group relationship of the clade of Hymenostemma+Prolongoa with a monophyletic genus Leucanthemopsis.
    Dating of the *BEAST phylogeny supports the long-lasting (Early Miocene, 15-22 Ma) taxonomical independence and the switch from the plesiomorphic perennial to the apomorphic annual life-form assumed for the Castrilanthemum lineage that may have occurred not earlier than in the Pliocene (3 Ma) when the establishment of a Mediterranean climate with summer droughts triggered evolution towards annuality. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Molecular Phylogenetics and Systematics of the Bivalve Family Ostreidae Based on rRNA Sequence-Structure Models and Multilocus Species Tree

    PubMed Central

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663

  18. Molecular phylogenetics and systematics of the bivalve family Ostreidae based on rRNA sequence-structure models and multilocus species tree.

    PubMed

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.

  19. Analysis of multilocus zygotic associations.

    PubMed

    Yang, Rong-Cai

    2002-05-01

    While nonrandom associations between zygotes at different loci (zygotic associations) frequently occur in Hardy-Weinberg disequilibrium populations, statistical analysis of such associations has received little attention. In this article, we describe the joint distributions of zygotes at multiple loci, which are completely characterized by heterozygosities at individual loci and various multilocus zygotic associations. These zygotic associations are defined in the same fashion as the usual multilocus linkage (gametic) disequilibria on the basis of gametic and allelic frequencies. The estimation and test procedures are described with details being given for three loci. The sampling properties of the estimates are examined through Monte Carlo simulation. The estimates of three-locus associations are not free of bias due to the presence of two-locus associations and vice versa. The power of detecting the zygotic associations is small unless different loci are strongly associated and/or sample sizes are large (>100). The analysis of zygotic associations not only offers an effective means of packaging numerous genic disequilibria required for a complete characterization of multilocus structure, but also provides opportunities for making inference about evolutionary and demographic processes through a comparative assessment of zygotic association vs. gametic disequilibrium for the same set of loci in nonequilibrium populations.
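
    The parallel drawn above between gametic disequilibrium and zygotic association can be illustrated with a minimal two-locus sketch. It assumes the zygotic association is taken as the covariance of heterozygosity indicators, P(het at both loci) minus H1*H2, mirroring D = P(AB) - P(A)P(B); that form, the function names, and the input encoding are illustrative assumptions, not the article's estimators or tests.

```python
def gametic_disequilibrium(haplotypes):
    """Two-locus gametic (linkage) disequilibrium D = P(AB) - P(A) * P(B).

    haplotypes: list of two-character strings such as "AB", "Ab", "aB", "ab".
    """
    n = len(haplotypes)
    p_ab = sum(h == "AB" for h in haplotypes) / n
    p_a = sum(h[0] == "A" for h in haplotypes) / n
    p_b = sum(h[1] == "B" for h in haplotypes) / n
    return p_ab - p_a * p_b


def zygotic_association(genotypes):
    """Assumed analogue for zygotes: P(het at both loci) - H1 * H2.

    genotypes: list of (locus1, locus2) pairs, each genotype written as
    two allele characters, e.g. ("Aa", "BB").
    """
    n = len(genotypes)

    def is_het(g):
        return g[0] != g[1]

    h1 = sum(is_het(g1) for g1, _ in genotypes) / n
    h2 = sum(is_het(g2) for _, g2 in genotypes) / n
    h12 = sum(is_het(g1) and is_het(g2) for g1, g2 in genotypes) / n
    return h12 - h1 * h2
```

    Both quantities are zero when the loci are independent at the gametic or zygotic level, respectively, which is what makes a side-by-side comparison for the same loci informative.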

  20. Defining and Evaluating a Core Genome Multilocus Sequence Typing Scheme for Genome-Wide Typing of Clostridium difficile.

    PubMed

    Bletz, Stefan; Janezic, Sandra; Harmsen, Dag; Rupnik, Maja; Mellmann, Alexander

    2018-06-01

    Clostridium difficile, recently renamed Clostridioides difficile, is the most common cause of antibiotic-associated nosocomial gastrointestinal infections worldwide. To differentiate endogenous infections and transmission events, highly discriminatory subtyping is necessary. Today, methods based on whole-genome sequencing data are increasingly used to subtype bacterial pathogens; however, frequently a standardized methodology and typing nomenclature are missing. Here we report a core genome multilocus sequence typing (cgMLST) approach developed for C. difficile. Initially, we determined the breadth of the C. difficile population based on all available MLST sequence types with Bayesian inference (BAPS). The resulting BAPS partitions were used in combination with C. difficile clade information to select representative isolates that were subsequently used to define cgMLST target genes. Finally, we evaluated the novel cgMLST scheme with genomes from 3,025 isolates. BAPS grouping (n = 6 groups) together with the clade information led to a total of 11 representative isolates that were included for cgMLST definition and resulted in 2,270 cgMLST genes that were present in all isolates. Overall, 2,184 to 2,268 cgMLST targets were detected in the genome sequences of 70 outbreak-associated and reference strains, and on average 99.3% cgMLST targets (1,116 to 2,270 targets) were present in 2,954 genomes downloaded from the NCBI database, underlining the representativeness of the cgMLST scheme. Moreover, reanalyses of different cluster scenarios with cgMLST were concordant with published single nucleotide variant analyses. In conclusion, the novel cgMLST is representative of the whole C. difficile population, is highly discriminatory in outbreak situations, and provides a unique nomenclature facilitating interlaboratory exchange. Copyright © 2018 American Society for Microbiology.
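
    As a rough illustration of how cgMLST profiles are typically compared, the sketch below counts allele differences between two isolates and reports the fraction of targets called in one genome. The profile layout (a dict from target name to allele number, with None for an undetected target), the target names, and the rule of skipping targets missing in either isolate are assumptions for this example, not details taken from the scheme described above.

```python
def called_fraction(profile):
    """Fraction of scheme targets with an allele call in one genome."""
    called = sum(allele is not None for allele in profile.values())
    return called / len(profile)


def allele_distance(profile_a, profile_b):
    """Number of shared, called targets whose allele numbers differ.

    Targets missing (None) or absent in either profile are skipped,
    a pairwise-ignore convention assumed here for simplicity.
    """
    distance = 0
    for target, allele_a in profile_a.items():
        allele_b = profile_b.get(target)
        if allele_a is None or allele_b is None:
            continue
        if allele_a != allele_b:
            distance += 1
    return distance


# Hypothetical four-target profiles for two isolates.
isolate_1 = {"t1": 1, "t2": 2, "t3": None, "t4": 5}
isolate_2 = {"t1": 1, "t2": 3, "t3": 4, "t4": 7}
```

    Here allele_distance(isolate_1, isolate_2) compares only t1, t2, and t4, because t3 was not detected in the first isolate.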

  1. Multi-locus phylogeny and divergence time estimates of Enallagma damselflies (Odonata: Coenagrionidae).

    PubMed

    Callahan, Melissa S; McPeek, Mark A

    2016-01-01

    Reconstructing evolutionary patterns of species and populations provides a framework for asking questions about the impacts of climate change. Here we use a multilocus dataset to estimate gene trees under maximum likelihood and Bayesian models to obtain a robust estimate of relationships for a genus of North American damselflies, Enallagma. Using a relaxed molecular clock, we estimate the divergence times for this group. Furthermore, to account for the fact that gene tree analyses can overestimate ages of population divergences, we use a multi-population coalescent model to gain a more accurate estimate of divergence times. We also infer diversification rates using a method that allows for variation in diversification rate through time and among lineages. Our results reveal a complex evolutionary history of Enallagma, in which divergence events both predate and occur during Pleistocene climate fluctuations. There is also evidence of diversification rate heterogeneity across the tree. These divergence time estimates provide a foundation for addressing the relative significance of historical climatic events in the diversification of this genus. Copyright © 2015 Elsevier Inc. All rights reserved.

  2. A “Shallow Phylogeny” of Shallow Barnacles (Chthamalus)

    PubMed Central

    Wares, John P.; Pankey, M. Sabrina; Pitombo, Fabio; Daglio, Liza Gómez; Achituv, Yair

    2009-01-01

    Background We present a multi-locus phylogenetic analysis of the shallow water (high intertidal) barnacle genus Chthamalus, focusing on member species in the western hemisphere. Understanding the phylogeny of this group improves interpretation of classical ecological work on competition, distributional changes associated with climate change, and the morphological evolution of complex cirripede phenotypes. Methodology and Findings We use traditional and Bayesian phylogenetic and ‘deep coalescent’ approaches to identify a phylogeny that supports the monophyly of the mostly American ‘fissus group’ of Chthamalus, but that also supports a need for taxonomic revision of Chthamalus and Microeuraphia. Two deep phylogeographic breaks were also found within the range of two tropical American taxa (C. angustitergum and C. southwardorum) as well. Conclusions Our data, which include two novel gene regions for phylogenetic analysis of cirripedes, suggest that much more evaluation of the morphological evolutionary history and taxonomy of Chthamalid barnacles is necessary. These data and associated analyses also indicate that the radiation of species in the late Pliocene and Pleistocene was very rapid, and may provide new insights toward speciation via transient allopatry or ecological barriers. PMID:19440543

  3. A Multilocus Molecular Phylogeny of the Parrots (Psittaciformes): Support for a Gondwanan Origin during the Cretaceous

    PubMed Central

    Schirtzinger, Erin E.; Matsumoto, Tania; Eberhard, Jessica R.; Graves, Gary R.; Sanchez, Juan J.; Capelli, Sara; Müller, Heinrich; Scharpegge, Julia; Chambers, Geoffrey K.; Fleischer, Robert C.

    2008-01-01

    The question of when modern birds (Neornithes) first diversified has generated much debate among avian systematists. Fossil evidence generally supports a Tertiary diversification, whereas estimates based on molecular dating favor an earlier diversification in the Cretaceous period. In this study, we used an alternate approach, the inference of historical biogeographic patterns, to test the hypothesis that the initial radiation of the Order Psittaciformes (the parrots and cockatoos) originated on the Gondwana supercontinent during the Cretaceous. We utilized broad taxonomic sampling (representatives of 69 of the 82 extant genera and 8 outgroup taxa) and multilocus molecular character sampling (3,941 bp from mitochondrial DNA (mtDNA) genes cytochrome oxidase I and NADH dehydrogenase 2 and nuclear introns of rhodopsin intron 1, tropomyosin alpha-subunit intron 5, and transforming growth factor ß-2) to generate phylogenetic hypotheses for the Psittaciformes. Analyses of the combined character partitions using maximum parsimony, maximum likelihood, and Bayesian criteria produced well-resolved and topologically similar trees in which the New Zealand taxa Strigops and Nestor (Psittacidae) were sister to all other psittaciforms and the cockatoo clade (Cacatuidae) was sister to a clade containing all remaining parrots (Psittacidae). Within this large clade of Psittacidae, some traditionally recognized tribes and subfamilies were monophyletic (e.g., Arini, Psittacini, and Loriinae), whereas several others were polyphyletic (e.g., Cyclopsittacini, Platycercini, Psittaculini, and Psittacinae). Ancestral area reconstructions using our Bayesian phylogenetic hypothesis and current distributions of genera supported the hypothesis of an Australasian origin for the Psittaciformes. 
    Separate analyses of the timing of parrot diversification constructed with both Bayesian relaxed-clock and penalized likelihood approaches showed better agreement between geologic and diversification events in the chronograms based on a Cretaceous dating of the basal split within parrots than the chronograms based on a Tertiary dating of this split, although these data are more equivocal. Taken together, our results support a Cretaceous origin of Psittaciformes in Gondwana after the separation of Africa and the India/Madagascar block with subsequent diversification through both vicariance and dispersal. These well-resolved molecular phylogenies will be of value for comparative studies of behavior, ecology, and life history in parrots. PMID:18653733

  4. Multilocus Bayesian Estimates of Intra-Oceanic Genetic Differentiation, Connectivity, and Admixture in Atlantic Swordfish (Xiphias gladius L.)

    PubMed Central

    Smith, Brad L.; Lu, Ching-Ping; García-Cortés, Blanca; Viñas, Jordi; Yeh, Shean-Ya; Alvarado Bremer, Jaime R.

    2015-01-01

    Previous genetic studies of Atlantic swordfish (Xiphias gladius L.) revealed significant differentiation among Mediterranean, North Atlantic and South Atlantic populations using both mitochondrial and nuclear DNA data. However, limitations in geographic sampling coverage, and the use of single loci, precluded an accurate placement of boundaries and of estimates of admixture. In this study, we present multilocus analyses of 26 single nucleotide polymorphisms (SNPs) within 10 nuclear genes to estimate population differentiation and admixture based on the characterization of 774 individuals representing North Atlantic, South Atlantic, and Mediterranean swordfish populations. Pairwise FST values, AMOVA, PCoA, and Bayesian individual assignments support the differentiation of swordfish inhabiting these three basins, but not the current placement of the boundaries that separate them. Specifically, the range of the South Atlantic population extends beyond the 5°N management boundary to 20°N-25°N from 45°W. Likewise, the Mediterranean population extends beyond the current management boundary at the Strait of Gibraltar to approximately 10°W. Further, admixture zones, characterized by asymmetric contributions of adjacent populations within samples, are confined to the Northeast Atlantic. While South Atlantic and Mediterranean migrants were identified within these Northeast Atlantic admixture zones, no North Atlantic migrants were identified respectively in these two neighboring basins. Owing to both the characterization of a larger number of loci and broader spatial sampling coverage, it was possible to provide a finer resolution of the boundaries separating Atlantic swordfish populations than previous studies. Finally, the patterns of population structure and admixture are discussed in the light of the reproductive biology, the known patterns of dispersal, and oceanographic features that may act as barriers to gene flow in Atlantic swordfish. PMID:26057382
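
    For readers unfamiliar with the pairwise differentiation statistic mentioned above, here is a minimal multilocus sketch for biallelic SNPs. It assumes the simple heterozygosity-based form FST = (HT - HS) / HT with equal population weights, combined across loci as a ratio of sums and without any sample-size correction; the abstract does not say which estimator was used, so this is an illustration rather than a reproduction.

```python
def pairwise_fst(freqs_pop1, freqs_pop2):
    """Multilocus FST between two populations from SNP allele frequencies.

    freqs_pop1, freqs_pop2: per-locus frequencies of the same reference
    allele, in the same locus order.
    """
    numerator = 0.0
    denominator = 0.0
    for p1, p2 in zip(freqs_pop1, freqs_pop2):
        p_bar = (p1 + p2) / 2
        # Expected heterozygosity in the pooled population.
        h_total = 2 * p_bar * (1 - p_bar)
        # Mean expected heterozygosity within the two populations.
        h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
        numerator += h_total - h_within
        denominator += h_total
    # Loci fixed for the same allele in both populations carry no signal.
    return numerator / denominator if denominator > 0 else 0.0
```

    Identical frequencies give 0 and alternately fixed alleles give 1, so values between those bounds indicate partial differentiation.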

  5. Deep divergence and structure in the Tropical Oceanic Pacific: a multilocus phylogeography of a widespread gekkonid lizard (Squamata: Gekkonidae: Gehyra oceanica)

    USGS Publications Warehouse

    Tonione, Maria A.; Fisher, Robert N.; Zhu, Catherine; Moritz, Craig

    2016-01-01

    Aim The islands of the Tropical Oceanic Pacific (TOP) host both local radiations and widespread, colonizing species. The few phylogeographical analyses of widespread species often point to recent human-aided expansions through the Pacific, suggesting that the communities are recently assembled. Here we apply multilocus data to infer biogeographical history of the gekkonid lizard, Gehyra oceanica, which is widespread, but for which prior analyses suggested a pre-human history and in situ diversification. Location Tropical Oceanic Pacific. Methods We generated a data set including mtDNA and diagnostic SNPs for 173 individuals of G. oceanica spanning Micronesia, Melanesia, and Polynesia. For a subset of these individuals, we also sequenced nuclear loci. From these data, we performed maximum likelihood and Bayesian inference to reveal major clades. We also performed Bayesian clustering analyses and coalescence–based species delimitation tests to infer the number of species in this area. Results We found evidence for six independent evolutionary lineages (candidate species) within G. oceanica that diverged between the Pliocene and the early Pleistocene, with high diversity through northern Melanesia, and pairing of northern Melanesian endemic taxa with widespread lineages across Micronesia and Polynesia. Main conclusions The islands of northern Melanesia not only have unrecognized diversity, but also were the source of independent expansions of lineages through the more remote northern and eastern Pacific. These results highlight the very different evolutionary histories of island faunas on remote archipelagos versus those across Melanesia and point to the need for more intensive studies of fauna within Melanesia if we are to understand the evolution of diversity across the tropical Pacific.

  6. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia

    PubMed Central

    Iwata, Hiroyoshi; Hayashi, Takeshi; Terakami, Shingo; Takada, Norio; Sawamura, Yutaka; Yamamoto, Toshiya

    2013-01-01

    Although the potential of marker-assisted selection (MAS) in fruit tree breeding has been reported, bi-parental QTL mapping before MAS has hindered the introduction of MAS to fruit tree breeding programs. Genome-wide association studies (GWAS) are an alternative to bi-parental QTL mapping in long-lived perennials. Selection based on genomic predictions of breeding values (genomic selection: GS) is another alternative for MAS. This study examined the potential of GWAS and GS in pear breeding with 76 Japanese pear cultivars to detect significant associations of 162 markers with nine agronomic traits. We applied multilocus Bayesian models accounting for ordinal categorical phenotypes for GWAS and GS model training. Significant associations were detected at harvest time, black spot resistance and the number of spurs and two of the associations were closely linked to known loci. Genome-wide predictions for GS were accurate at the highest level (0.75) in harvest time, at medium levels (0.38–0.61) in resistance to black spot, firmness of flesh, fruit shape in longitudinal section, fruit size, acid content and number of spurs and at low levels (<0.2) in all soluble solid content and vigor of tree. Results suggest the potential of GWAS and GS for use in future breeding programs in Japanese pear. PMID:23641189

  7. Species-Level Phylogeny and Polyploid Relationships in Hordeum (Poaceae) Inferred by Next-Generation Sequencing and In Silico Cloning of Multiple Nuclear Loci.

    PubMed

    Brassac, Jonathan; Blattner, Frank R

    2015-09-01

    Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-group species. Amplicons from each of 96 individuals were pooled, sheared, labeled with individual-specific barcodes and sequenced in a single run on a 454 platform. Reference sequences were obtained by cloning and Sanger sequencing of all loci for nine supplementary individuals. The 454 reads were assembled into contigs representing the 13 loci and, for polyploids, also homoeologues. Phylogenetic analyses were conducted for all loci separately and for a concatenated data matrix of all loci. For diploid taxa, a Bayesian concordance analysis and a coalescent-based dated species tree was inferred from all gene trees. Chloroplast matK was used to determine the maternal parent in allopolyploid taxa. The relative performance of different multilocus analyses in the presence of incomplete lineage sorting and hybridization was also assessed. The resulting multilocus phylogeny reveals for the first time species phylogeny and progenitor-derivative relationships of all di- and polyploid Hordeum taxa within a single analysis. Our study proves that it is possible to obtain a multilocus species-level phylogeny for di- and polyploid taxa by combining PCR with next-generation sequencing, without cloning and without creating a heavy load of sequence data. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  8. Genome-wide associations for milk production and somatic cell score in Holstein-Friesian cattle in Ireland

    PubMed Central

    2012-01-01

    Background Contemporary dairy breeding goals have broadened to include, along with milk production traits, a number of non-production-related traits in an effort to improve the overall functionality of the dairy cow. Increased indirect selection for resistance to mastitis, one of the most important production-related diseases in the dairy sector, via selection for reduced somatic cell count has been part of these broadened goals. A number of genome-wide association studies have identified genetic variants associated with milk production traits and mastitis resistance; however, the majority of these studies have been based on animals which were predominantly kept in confinement and fed a concentrate-based diet (i.e. high-input production systems). This genome-wide association study aims to detect associations using genotypic and phenotypic data from Irish Holstein-Friesian cattle fed predominantly grazed grass in a pasture-based production system (low-input). Results Significant associations were detected for milk yield, fat yield, protein yield, fat percentage, protein percentage and somatic cell score using separate single-locus, frequentist and multi-locus, Bayesian approaches. These associations were detected using two separate populations of Holstein-Friesian sires and cows. In total, 1,529 and 37 associations were detected in the sires using a single SNP regression and a Bayesian method, respectively. There were 103 associations in common between the sires and cows across all the traits. As well as detecting associations within known QTL regions, a number of novel associations were detected; the most notable of these was a region of chromosome 13 associated with milk yield in the population of Holstein-Friesian sires. Conclusions A total of 276 novel SNPs were detected in the sires using a single SNP regression approach. Although obvious candidate genes may not be initially forthcoming, this study provides a preliminary framework upon which to identify the causal mechanisms underlying the various milk production traits and somatic cell score. Consequently this will deepen our understanding of how these traits are expressed. PMID:22449276
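
    The single SNP regression mentioned above can be sketched in its simplest form as an ordinary least-squares fit of a phenotype on allele dosage (0, 1, or 2 copies), repeated independently for each marker. The function names and data layout are hypothetical, and the study's actual model (for example any fixed effects, weights, or significance thresholds) is not reproduced here.

```python
def snp_regression(dosages, phenotypes):
    """Least-squares slope and intercept of phenotype on allele dosage.

    Returns None for a monomorphic SNP, where the slope is undefined.
    """
    n = len(dosages)
    mean_x = sum(dosages) / n
    mean_y = sum(phenotypes) / n
    sxx = sum((x - mean_x) ** 2 for x in dosages)
    if sxx == 0:
        return None
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dosages, phenotypes))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def single_snp_scan(genotypes, phenotypes):
    """Fit each SNP separately; genotypes maps SNP id -> list of dosages."""
    return {snp: snp_regression(d, phenotypes) for snp, d in genotypes.items()}
```

    Because every marker is fitted on its own, markers in linkage disequilibrium with the same causal variant can each produce a signal, which is one plausible reason a single-locus scan reports far more associations than a multi-locus Bayesian model.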

  9. Measuring and partitioning the high-order linkage disequilibrium by multiple order Markov chains.

    PubMed

    Kim, Yunjung; Feng, Sheng; Zeng, Zhao-Bang

    2008-05-01

    A map of the background levels of disequilibrium between nearby markers can be useful for association mapping studies. In order to assess the background levels of linkage disequilibrium (LD), multilocus LD measures are more advantageous than pairwise LD measures because the combined analysis of pairwise LD measures is not adequate to detect simultaneous allele associations among multiple markers. Various multilocus LD measures based on haplotypes have been proposed. However, most of these measures provide a single index of association among multiple markers and do not reveal the complex patterns and different levels of LD structure. In this paper, we employ non-homogeneous, multiple order Markov Chain models as a statistical framework to measure and partition the LD among multiple markers into components due to different orders of marker associations. Using a sliding window of multiple markers on phased haplotype data, we compute corresponding likelihoods for different Markov Chain (MC) orders in each window. The log-likelihood difference between the lowest MC order model (MC0) and the highest MC order model in each window is used as a measure of the total LD or the overall deviation from the gametic equilibrium for the window. Then, we partition the total LD into lower order disequilibria and estimate the effects from two-, three-, and higher order disequilibria. The relationship between different orders of LD and the log-likelihood difference involving two different orders of MC models are explored. By applying our method to the phased haplotype data in the ENCODE regions of the HapMap project, we are able to identify high/low multilocus LD regions. Our results reveal that most LD in the HapMap data is attributed to the LD between adjacent pairs of markers across the whole region. LD between adjacent pairs of markers appears to be more significant in high multilocus LD regions than in low multilocus LD regions. We also find that as the multilocus total LD increases, the effects of high-order LD tend to get weaker due to the lack of observed multilocus haplotypes. The overall estimates of first, second, third, and fourth order LD across the ENCODE regions are 64, 23, 9, and 3%.
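
    The log-likelihood comparison described above can be illustrated for the lowest two orders only. The sketch below fits a position-specific independence model (MC0) and a non-homogeneous first-order chain (MC1) to the phased haplotypes in one window using plug-in maximum-likelihood frequencies, and returns their log-likelihood difference. Stopping at first order, the plug-in estimation, and the function names are simplifying assumptions; the paper's full partition into higher orders is not reproduced.

```python
import math
from collections import Counter


def loglik_mc0(haplotypes):
    """Log-likelihood when every marker is independent of the others."""
    n = len(haplotypes)
    total = 0.0
    for j in range(len(haplotypes[0])):
        counts = Counter(h[j] for h in haplotypes)
        total += sum(c * math.log(c / n) for c in counts.values())
    return total


def loglik_mc1(haplotypes):
    """Log-likelihood when each marker depends only on the previous one."""
    n = len(haplotypes)
    first = Counter(h[0] for h in haplotypes)
    total = sum(c * math.log(c / n) for c in first.values())
    for j in range(1, len(haplotypes[0])):
        previous = Counter(h[j - 1] for h in haplotypes)
        pairs = Counter((h[j - 1], h[j]) for h in haplotypes)
        # Position-specific transition probabilities P(allele_j | allele_{j-1}).
        total += sum(c * math.log(c / previous[a]) for (a, _), c in pairs.items())
    return total


def adjacent_pair_ld(haplotypes):
    """Gain in log-likelihood from MC0 to MC1 within one window."""
    return loglik_mc1(haplotypes) - loglik_mc0(haplotypes)
```

    The difference is zero when adjacent markers are independent in the sample and grows as adjacent alleles become more predictable from one another, which corresponds to the adjacent-pair component that the abstract reports as dominant.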

  10. Large-scale genomic analyses reveal the population structure and evolutionary trends of Streptococcus agalactiae strains in Brazilian fish farms.

    PubMed

    Barony, Gustavo M; Tavares, Guilherme C; Pereira, Felipe L; Carvalho, Alex F; Dorella, Fernanda A; Leal, Carlos A G; Figueiredo, Henrique C P

    2017-10-19

    Streptococcus agalactiae is a major pathogen and a hindrance on tilapia farming worldwide. The aims of this work were to analyze the genomic evolution of Brazilian strains of S. agalactiae and to establish spatial and temporal relations between strains isolated from different outbreaks of streptococcosis. A total of 39 strains were obtained from outbreaks and their whole genomes were sequenced and annotated for comparative analysis of multilocus sequence typing, genomic similarity and whole genome multilocus sequence typing (wgMLST). The Brazilian strains presented two sequence types, including a newly described ST, and a non-typeable lineage. The use of wgMLST could differentiate each strain in a single clone and was used to establish temporal and geographical correlations among strains. Bayesian phylogenomic analysis suggests that the studied Brazilian population was co-introduced in the country with their host, approximately 60 years ago. Brazilian strains of S. agalactiae were shown to be heterogeneous in their genome sequences and were distributed in different regions of the country according to their genotype, which allowed the use of wgMLST analysis to track each outbreak event individually.

  11. "Contrasting patterns of selection at Pinus pinaster Ait. Drought stress candidate genes as revealed by genetic differentiation analyses".

    PubMed

    Eveno, Emmanuelle; Collada, Carmen; Guevara, M Angeles; Léger, Valérie; Soto, Alvaro; Díaz, Luis; Léger, Patrick; González-Martínez, Santiago C; Cervera, M Teresa; Plomion, Christophe; Garnier-Géré, Pauline H

    2008-02-01

    The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher F(ST) values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower F(ST) values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. 
    Outlier patterns are discussed in relation to the different genes' putative involvement in drought tolerance responses, from published results in transcriptomics and association mapping in P. pinaster and other related species. These genes clearly constitute relevant candidates for future association studies in P. pinaster.

  12. A multilocus perspective on the speciation history of a North American aridland toad (Anaxyrus punctatus).

    PubMed

    Bryson, Robert W; Jaeger, Jef R; Lemos-Espinal, Julio A; Lazcano, David

    2012-09-01

    Interpretations of phylogeographic patterns can change when analyses shift from single gene-tree to multilocus coalescent analyses. Using multilocus coalescent approaches, a species tree and divergence times can be estimated from a set of gene trees while accounting for gene-tree stochasticity. We utilized the conceptual strengths of a multilocus coalescent approach coupled with complete range-wide sampling to examine the speciation history of a broadly distributed, North American warm-desert toad, Anaxyrus punctatus. Phylogenetic analyses provided strong support for three major lineages within A. punctatus. Each lineage broadly corresponded to one of three desert regions. Early speciation in A. punctatus appeared linked to late Miocene-Pliocene development of the Baja California peninsula. This event was likely followed by a Pleistocene divergence associated with the separation of the Chihuahuan and Sonoran Deserts. Our multilocus coalescent-based reconstruction provides an informative contrast to previous single gene-tree estimates of the evolutionary history of A. punctatus. Copyright © 2012 Elsevier Inc. All rights reserved.

  13. Assessing Species Boundaries Using Multilocus Species Delimitation in a Morphologically Conserved Group of Neotropical Freshwater Fishes, the Poecilia sphenops Species Complex (Poeciliidae)

    PubMed Central

    Bagley, Justin C.; Alda, Fernando; Breitman, M. Florencia; Bermingham, Eldredge; van den Berghe, Eric P.; Johnson, Jerald B.

    2015-01-01

    Accurately delimiting species is fundamentally important for understanding species diversity and distributions and devising effective strategies to conserve biodiversity. However, species delimitation is problematic in many taxa, including ‘non-adaptive radiations’ containing morphologically cryptic lineages. Fortunately, coalescent-based species delimitation methods hold promise for objectively estimating species limits in such radiations, using multilocus genetic data. Using coalescent-based approaches, we delimit species and infer evolutionary relationships in a morphologically conserved group of Central American freshwater fishes, the Poecilia sphenops species complex. Phylogenetic analyses of multiple genetic markers (sequences of two mitochondrial DNA genes and five nuclear loci) from 10/15 species and genetic lineages recognized in the group support the P. sphenops species complex as monophyletic with respect to outgroups, with eight mitochondrial ‘major-lineages’ diverged by ≥2% pairwise genetic distances. From general mixed Yule-coalescent models, we discovered (conservatively) 10 species within our concatenated mitochondrial DNA dataset, 9 of which were strongly supported by subsequent multilocus Bayesian species delimitation and species tree analyses. Results suggested species-level diversity is underestimated or overestimated by at least ~15% in different lineages in the complex. Nonparametric statistics and coalescent simulations indicate genealogical discordance among our gene tree results has mainly derived from interspecific hybridization in the nuclear genome. However, mitochondrial DNA show little evidence for introgression, and our species delimitation results appear robust to effects of this process. Overall, our findings support the utility of combining multiple lines of genetic evidence and broad phylogeographical sampling to discover and validate species using coalescent-based methods. 
    Our study also highlights the importance of testing for hybridization versus incomplete lineage sorting, which aids inference of not only species limits but also evolutionary processes influencing genetic diversity. PMID:25849959

  14. Assessing species boundaries using multilocus species delimitation in a morphologically conserved group of neotropical freshwater fishes, the Poecilia sphenops species complex (Poeciliidae).

    PubMed

    Bagley, Justin C; Alda, Fernando; Breitman, M Florencia; Bermingham, Eldredge; van den Berghe, Eric P; Johnson, Jerald B

    2015-01-01

    Accurately delimiting species is fundamentally important for understanding species diversity and distributions and devising effective strategies to conserve biodiversity. However, species delimitation is problematic in many taxa, including 'non-adaptive radiations' containing morphologically cryptic lineages. Fortunately, coalescent-based species delimitation methods hold promise for objectively estimating species limits in such radiations, using multilocus genetic data. Using coalescent-based approaches, we delimit species and infer evolutionary relationships in a morphologically conserved group of Central American freshwater fishes, the Poecilia sphenops species complex. Phylogenetic analyses of multiple genetic markers (sequences of two mitochondrial DNA genes and five nuclear loci) from 10/15 species and genetic lineages recognized in the group support the P. sphenops species complex as monophyletic with respect to outgroups, with eight mitochondrial 'major-lineages' diverged by ≥2% pairwise genetic distances. From general mixed Yule-coalescent models, we discovered (conservatively) 10 species within our concatenated mitochondrial DNA dataset, 9 of which were strongly supported by subsequent multilocus Bayesian species delimitation and species tree analyses. Results suggested species-level diversity is underestimated or overestimated by at least ~15% in different lineages in the complex. Nonparametric statistics and coalescent simulations indicate genealogical discordance among our gene tree results has mainly derived from interspecific hybridization in the nuclear genome. However, mitochondrial DNA show little evidence for introgression, and our species delimitation results appear robust to effects of this process. Overall, our findings support the utility of combining multiple lines of genetic evidence and broad phylogeographical sampling to discover and validate species using coalescent-based methods. 
Our study also highlights the importance of testing for hybridization versus incomplete lineage sorting, which aids inference of not only species limits but also evolutionary processes influencing genetic diversity.

  15. Spiders on a Hot Volcanic Roof: Colonisation Pathways and Phylogeography of the Canary Islands Endemic Trap-Door Spider Titanidiops canariensis (Araneae, Idiopidae).

    PubMed

    Opatova, Vera; Arnedo, Miquel A

    2014-01-01

    Studies conducted on volcanic islands have greatly contributed to our current understanding of how organisms diversify. The Canary Islands archipelago, located northwest of the coast of northern Africa, harbours a large number of endemic taxa. Because of their low vagility, mygalomorph spiders are usually absent from oceanic islands. The spider Titanidiops canariensis, which inhabits the easternmost islands of the archipelago, constitutes an exception to this rule. Here, we use a multi-locus approach that combines three mitochondrial and four nuclear genes to investigate the origins and phylogeography of this remarkable trap-door spider. We provide a timeframe for the colonisation of the Canary Islands using two alternative approaches: concatenation and species tree inference in a Bayesian relaxed clock framework. Additionally, we investigate the existence of cryptic species on the islands by means of a Bayesian multi-locus species delimitation method. Our results indicate that T. canariensis colonised the Canary Islands once, most likely during the Miocene, although discrepancies between the timeframes from different approaches make the exact timing uncertain. A complex evolutionary history for the species in the archipelago is revealed, which involves two independent colonisations of Fuerteventura from the ancestral range of T. canariensis in northern Lanzarote and a possible back colonisation of southern Lanzarote. The data further corroborate a previously proposed volcanic refugium, highlighting the impact of the dynamic volcanic history of the island on the phylogeographic patterns of the endemic taxa. T. canariensis includes at least two different species, one inhabiting the Jandia peninsula and central Fuerteventura and one spanning from central Fuerteventura to Lanzarote. Our data suggest that the extant northern African Titanidiops lineages may have expanded to the region after the islands were colonised and, hence, are not the source of colonisation. 
In addition, T. maroccanus may harbour several cryptic species.

  16. Spiders on a Hot Volcanic Roof: Colonisation Pathways and Phylogeography of the Canary Islands Endemic Trap-Door Spider Titanidiops canariensis (Araneae, Idiopidae)

    PubMed Central

    Opatova, Vera; Arnedo, Miquel A.

    2014-01-01

    Studies conducted on volcanic islands have greatly contributed to our current understanding of how organisms diversify. The Canary Islands archipelago, located northwest of the coast of northern Africa, harbours a large number of endemic taxa. Because of their low vagility, mygalomorph spiders are usually absent from oceanic islands. The spider Titanidiops canariensis, which inhabits the easternmost islands of the archipelago, constitutes an exception to this rule. Here, we use a multi-locus approach that combines three mitochondrial and four nuclear genes to investigate the origins and phylogeography of this remarkable trap-door spider. We provide a timeframe for the colonisation of the Canary Islands using two alternative approaches: concatenation and species tree inference in a Bayesian relaxed clock framework. Additionally, we investigate the existence of cryptic species on the islands by means of a Bayesian multi-locus species delimitation method. Our results indicate that T. canariensis colonised the Canary Islands once, most likely during the Miocene, although discrepancies between the timeframes from different approaches make the exact timing uncertain. A complex evolutionary history for the species in the archipelago is revealed, which involves two independent colonisations of Fuerteventura from the ancestral range of T. canariensis in northern Lanzarote and a possible back colonisation of southern Lanzarote. The data further corroborate a previously proposed volcanic refugium, highlighting the impact of the dynamic volcanic history of the island on the phylogeographic patterns of the endemic taxa. T. canariensis includes at least two different species, one inhabiting the Jandia peninsula and central Fuerteventura and one spanning from central Fuerteventura to Lanzarote. Our data suggest that the extant northern African Titanidiops lineages may have expanded to the region after the islands were colonised and, hence, are not the source of colonisation. 
In addition, T. maroccanus may harbour several cryptic species. PMID:25494329

  17. Discrimination of Anopheles species of the Arribalzagia series in Colombia using a multilocus approach.

    PubMed

    Álvarez, Natalí; Gómez, Giovan F; Naranjo-Díaz, Nelson; Correa, Margarita M

    2018-06-18

    The Arribalzagia Series of the Anopheles Subgenus comprises morphologically similar species or members of species complexes, which makes correct species identification difficult. Therefore, the aim of this work was to discriminate the morphospecies of the Arribalzagia Series present in Colombia using a multilocus approach based on ITS2, COI and CAD sequences. Specimens of the Arribalzagia Series collected at 32 localities in nine departments were allocated to seven species. Individual and concatenated Bayesian analyses showed high support for each of the species and reinforced the previous report of the Apicimacula species Complex with distribution in the Pacific Coast and northwestern Colombia. In addition, a new molecular operational taxonomic unit (MOTU) was identified, herein denominated near Anopheles peryassui, providing support for the existence of a Peryassui species Complex. Further, the CAD gene, just recently used for Anopheles taxonomy and phylogeny, demonstrated its power in resolving phylogenetic relationships among species of the Arribalzagia Series. The divergence times for these species correspond to the early Pliocene and the Miocene. Considering the epidemiological importance of some species of the Series and their co-occurrence in malaria endemic regions of Colombia, their discrimination constitutes an important step for vector incrimination and control in the country. Copyright © 2018. Published by Elsevier B.V.

  18. Multilocus Analyses Reveal Postglacial Demographic Shrinkage of Juniperus morrisonicola (Cupressaceae), a Dominant Alpine Species in Taiwan

    PubMed Central

    Chiu, Chi-Te; Huang, Chao-Li; Hung, Kuo-Hsiang; Chiang, Tzen-Yuh

    2016-01-01

    Postglacial climate changes alter geographical distributions and diversity of species. Such ongoing changes often force species to migrate along the latitude/altitude. Altitudinal gradients represent assemblage of environmental, especially climatic, variable factors that influence the plant distributions. Global warming that triggered upward migrations has therefore impacted the alpine plants on an island. In this study, we examined the genetic structure of Juniperus morrisonicola, a dominant alpine species in Taiwan, and inferred historical, demographic dynamics based on multilocus analyses. Lower levels of genetic diversity in north indicated that populations at higher latitudes were vulnerable to climate change, possibly related to historical alpine glaciers. Neither organellar DNA nor nuclear genes displayed geographical subdivisions, indicating that populations were likely interconnected before migrating upward to isolated mountain peaks, providing low possibilities of seed/pollen dispersal across mountain ranges. Bayesian skyline plots suggested steady population growth of J. morrisonicola followed by recent demographic contraction. In contrast, most lower-elevation plants experienced recent demographic expansion as a result of global warming. The endemic alpine conifer may have experienced dramatic climate changes over the alternation of glacial and interglacial periods, as indicated by a trend showing decreasing genetic diversity with the altitudinal gradient, plus a fact of upward migration. PMID:27561108

  19. Extensive cryptic species diversity and fine-scale endemism in the marine red alga Portieria in the Philippines

    PubMed Central

    Payo, Dioli Ann; Leliaert, Frederik; Verbruggen, Heroen; D'hondt, Sofie; Calumpong, Hilconida P.; De Clerck, Olivier

    2013-01-01

    We investigated species diversity and distribution patterns of the marine red alga Portieria in the Philippine archipelago. Species boundaries were tested based on mitochondrial, plastid and nuclear encoded loci, using a general mixed Yule-coalescent (GMYC) model-based approach and a Bayesian multilocus species delimitation method. The outcome of the GMYC analysis of the mitochondrial encoded cox2-3 dataset was highly congruent with the multilocus analysis. In stark contrast with the current morphology-based assumption that the genus includes a single, widely distributed species in the Indo-West Pacific (Portieria hornemannii), DNA-based species delimitation resulted in the recognition of 21 species within the Philippines. Species distributions were found to be highly structured with most species restricted to island groups within the archipelago. These extremely narrow species ranges and high levels of intra-archipelagic endemism contrast with the wide-held belief that marine organisms generally have large geographical ranges and that endemism is at most restricted to the archipelagic level. Our results indicate that speciation in the marine environment may occur at spatial scales smaller than 100 km, comparable with some terrestrial systems. Our finding of fine-scale endemism has important consequences for marine conservation and management. PMID:23269854

  20. Multilocus Analyses Reveal Postglacial Demographic Shrinkage of Juniperus morrisonicola (Cupressaceae), a Dominant Alpine Species in Taiwan.

    PubMed

    Huang, Chi-Chun; Hsu, Tsai-Wen; Wang, Hao-Ven; Liu, Zin-Huang; Chen, Yi-Yen; Chiu, Chi-Te; Huang, Chao-Li; Hung, Kuo-Hsiang; Chiang, Tzen-Yuh

    2016-01-01

    Postglacial climate changes alter geographical distributions and diversity of species. Such ongoing changes often force species to migrate along the latitude/altitude. Altitudinal gradients represent assemblage of environmental, especially climatic, variable factors that influence the plant distributions. Global warming that triggered upward migrations has therefore impacted the alpine plants on an island. In this study, we examined the genetic structure of Juniperus morrisonicola, a dominant alpine species in Taiwan, and inferred historical, demographic dynamics based on multilocus analyses. Lower levels of genetic diversity in north indicated that populations at higher latitudes were vulnerable to climate change, possibly related to historical alpine glaciers. Neither organellar DNA nor nuclear genes displayed geographical subdivisions, indicating that populations were likely interconnected before migrating upward to isolated mountain peaks, providing low possibilities of seed/pollen dispersal across mountain ranges. Bayesian skyline plots suggested steady population growth of J. morrisonicola followed by recent demographic contraction. In contrast, most lower-elevation plants experienced recent demographic expansion as a result of global warming. The endemic alpine conifer may have experienced dramatic climate changes over the alternation of glacial and interglacial periods, as indicated by a trend showing decreasing genetic diversity with the altitudinal gradient, plus a fact of upward migration.

  1. Multilocus Genotypes of Relevance for Drug Metabolizing Enzymes and Therapy with Thiopurines in Patients with Acute Lymphoblastic Leukemia

    PubMed Central

    Stocco, Gabriele; Franca, Raffaella; Verzegnassi, Federico; Londero, Margherita; Rabusin, Marco; Decorti, Giuliana

    2013-01-01

    Multilocus genotypes have been shown to be of relevance for using pharmacogenomic principles to individualize drug therapy. As it relates to thiopurine therapy, genetic polymorphisms of TPMT are strongly associated with the pharmacokinetics and clinical effects of thiopurines (mercaptopurine and azathioprine), influencing their toxicity and efficacy. We have recently demonstrated that TPMT and ITPA genotypes constitute a multilocus genotype of pharmacogenetic relevance for children with acute lymphoblastic leukemia (ALL) receiving thiopurine therapy. The use of high-throughput genomic analysis allows identification of additional candidate genetic factors associated with pharmacogenetic phenotypes, such as TPMT enzymatic activity: PACSIN2 polymorphisms have been identified by a genome-wide analysis, combining evaluation of polymorphisms and gene expression, as a significant determinant of TPMT activity in the HapMap CEU cell lines and the effects of PACSIN2 on TPMT activity and mercaptopurine induced adverse effects were confirmed in children with ALL. Combination of genetic factors of relevance for thiopurine metabolizing enzyme activity, based on the growing understanding of their association with drug metabolism and efficacy, is particularly promising for patients with pediatric ALL. The knowledge basis and clinical applications for multilocus genotypes of importance for therapy with mercaptopurine in pediatric ALL is discussed in the present review. PMID:23335936

  2. Single-Locus versus Multilocus Patterns of Local Adaptation to Climate in Eastern White Pine (Pinus strobus, Pinaceae)

    PubMed Central

    Zinck, John W. R.

    2016-01-01

    Natural plant populations are often adapted to their local climate and environmental conditions, and populations of forest trees offer some of the best examples of this pattern. However, little empirical work has focused on the relative contribution of single-locus versus multilocus effects to the genetic architecture of local adaptation in plants/forest trees. Here, we employ eastern white pine (Pinus strobus) to test the hypothesis that it is the inter-genic effects that primarily drive climate-induced local adaptation. The genetic structure of 29 range-wide natural populations of eastern white pine was determined in relation to local climatic factors using both a reference set of SSR markers, and SNPs located in candidate genes putatively involved in adaptive response to climate. Comparisons were made between marker sets using standard single-locus outlier analysis, single-locus and multilocus environment association analyses and a novel implementation of Population Graphs. Magnitudes of population structure were similar between the two marker sets. Outlier loci consistent with diversifying selection were rare for both SNPs and SSRs. However, genetic distances based on the multilocus among population covariances (cGD) were significantly more correlated to climate, even after correcting for spatial effects, for SNPs as compared to SSRs. Coalescent simulations confirmed that the differences in mutation rates between SSRs and SNPs did not affect the topologies of the Population Graphs, and hence values of cGD and their correlations with associated climate variables. We conclude that the multilocus covariances among populations primarily reflect adaptation to local climate and environment in eastern white pine. This result highlights the complexity of the genetic architecture of adaptive traits, as well as the need to consider multilocus effects in studies of local adaptation. PMID:27387485

  3. Pneumocystis jirovecii multilocus gene sequencing: findings and implications.

    PubMed

    Matos, Olga; Esteves, Francisco

    2010-08-01

    Pneumocystis jirovecii pneumonia (PcP) remains a major cause of respiratory illness among immunocompromised patients, especially patients infected with HIV, but it has also been isolated from immunocompetent persons. This article discusses the application of multilocus genotyping analysis to the study of the genetic diversity of P. jirovecii and its epidemiological and clinical parameters, and the important concepts achieved to date with these approaches. The multilocus typing studies performed until now have shown that there is an important genetic diversity of stable and ubiquitous P. jirovecii genotypes; infection with P. jirovecii is not necessarily clonal, recombination between some P. jirovecii multilocus genotypes has been suggested. P. jirovecii-specific multilocus genotypes can be associated with severity of PcP. Patients infected with P. jirovecii, regardless of the form of infection they present with, are part of a common human reservoir for future infections. The CYB, DHFR, DHPS, mtLSU rRNA, SOD and the ITS loci are suitable genetic targets to be used in further epidemiological studies focused on the identification and characterization of P. jirovecii haplotypes correlated with drug resistance and PcP outcome.

  4. Molecular Epidemiology of Human Oral Chagas Disease Outbreaks in Colombia

    PubMed Central

    Ramírez, Juan David; Montilla, Marleny; Cucunubá, Zulma M.; Floréz, Astrid Carolina; Zambrano, Pilar; Guhl, Felipe

    2013-01-01

    Background: Trypanosoma cruzi, the causative agent of Chagas disease, displays significant genetic variability revealed by six Discrete Typing Units (TcI-TcVI). In this pathology, oral transmission represents an emerging epidemiological scenario where different outbreaks associated with food/beverage consumption have been reported in Argentina, Bolivia, Brazil, Ecuador and Venezuela. In Colombia, six human oral outbreaks have been reported, corroborating the importance of this transmission route. Molecular epidemiology of oral outbreaks is barely known observing the incrimination of TcI, TcII, TcIV and TcV genotypes. Methodology and Principal Findings: High-throughput molecular characterization was conducted performing MLMT (Multilocus Microsatellite Typing) and mtMLST (mitochondrial Multilocus Sequence Typing) strategies on 50 clones from ten isolates. Results allowed observing the occurrence of TcI, TcIV and mixed infection of distinct TcI genotypes. Thus, a majority of specific mitochondrial haplotypes and allelic multilocus genotypes associated to the sylvatic cycle of transmission were detected in the dataset with the foreseen presence of mitochondrial haplotypes and allelic multilocus genotypes associated to the domestic cycle of transmission. Conclusions: These findings suggest the incrimination of sylvatic genotypes in the oral outbreaks that occurred in Colombia. We observed patterns of super-infection and/or co-infection with a tailored association with the severe forms of myocarditis in the acute phase of the disease. The transmission dynamics of this infection route based on molecular epidemiology evidence was unraveled and the clinical and biological implications are discussed. PMID:23437405

  5. Multilocus family-based association analysis of seven candidate polymorphisms with essential hypertension in an African-derived semi-isolated Brazilian population.

    PubMed

    Kimura, L; Angeli, C B; Auricchio, M T B M; Fernandes, G R; Pereira, A C; Vicente, J P; Pereira, T V; Mingroni-Netto, R C

    2012-01-01

    Background. It has been widely suggested that analyses considering multilocus effects would be crucial to characterize the relationship between gene variability and essential hypertension (EH). Objective. To test for the presence of multilocus effects between/among seven polymorphisms (six genes) on blood pressure-related traits in African-derived semi-isolated Brazilian populations (quilombos). Methods. Analyses were carried out using a family-based design in a sample of 652 participants (97 families). Seven variants were investigated: ACE (rs1799752), AGT (rs669), ADD2 (rs3755351), NOS3 (rs1799983), GNB3 (rs5441 and rs5443), and GRK4 (rs1801058). Sensitivity analyses were further performed under a case-control design with unrelated participants only. Results. None of the investigated variants were associated individually with both systolic and diastolic BP levels (SBP and DBP, respectively) or EH (as a binary outcome). Multifactor dimensionality reduction-based techniques revealed a marginal association of the combined effect of both GNB3 variants on DBP levels in a family-based design (P = 0.040), whereas a putative NOS3-GRK4 interaction also in relation to DBP levels was observed in the case-control design only (P = 0.004). Conclusion. Our results provide limited support for the hypothesis of multilocus effects between/among the studied variants on blood pressure in quilombos. Further larger studies are needed to validate our findings.

  6. Nuclear genomic sequences reveal that polar bears are an old and distinct bear lineage.

    PubMed

    Hailer, Frank; Kutschera, Verena E; Hallström, Björn M; Klassert, Denise; Fain, Steven R; Leonard, Jennifer A; Arnason, Ulfur; Janke, Axel

    2012-04-20

    Recent studies have shown that the polar bear matriline (mitochondrial DNA) evolved from a brown bear lineage since the late Pleistocene, potentially indicating rapid speciation and adaption to arctic conditions. Here, we present a high-resolution data set from multiple independent loci across the nuclear genomes of a broad sample of polar, brown, and black bears. Bayesian coalescent analyses place polar bears outside the brown bear clade and date the divergence much earlier, in the middle Pleistocene, about 600 (338 to 934) thousand years ago. This provides more time for polar bear evolution and confirms previous suggestions that polar bears carry introgressed brown bear mitochondrial DNA due to past hybridization. Our results highlight that multilocus genomic analyses are crucial for an accurate understanding of evolutionary history.

  7. Epidemiological analysis of Leishmania tropica strains and giemsa-stained smears from Syrian and Turkish leishmaniasis patients using multilocus microsatellite typing (MLMT)

    PubMed Central

    Nasereddin, Abed; Onay, Hüseyin; Karaca, Emin; Özkeklikçi, Ahmet; Jaffe, Charles L.; Kuhls, Katrin; Özbilgin, Ahmet; Ertabaklar, Hatice; Demir, Samiye; Özbel, Yusuf; Töz, Seray

    2017-01-01

    Turkey is located in an important geographical location, in terms of the epidemiology of vector-borne diseases, linking Asia and Europe. Cutaneous leishmaniasis (CL) is one of the endemic diseases in Turkey and, according to the Ministry of Health of Turkey, 45% of CL patients originate from Şanlıurfa province located in southeastern Turkey. Herein, the epidemiological status of CL, caused by L. tropica, in Turkey was examined using multilocus microsatellite typing (MLMT) of strains obtained from Turkish and Syrian patients. A total of 38 cryopreserved strains and 20 Giemsa-stained smears were included in the present study. MLMT was performed using 12 highly specific microsatellite markers. Delta K (ΔK) calculation and Bayesian statistics were used to determine the population structure. Three main populations (POP A, B and C) were identified and further examination revealed the presence of three subpopulations for POP B and C. Combined analysis was performed using the data of previously typed L. tropica strains, and Mediterranean and Şanlıurfa populations were identified. This finding suggests that the epidemiological status of L. tropica is more complicated than expected when compared to previous studies. A new population, comprised of Syrian L. tropica samples, was reported for the first time in Turkey, and the data presented here will provide new epidemiological information for further studies. PMID:28403153
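
    The abstract above names a Delta K (ΔK) calculation without giving details. The sketch below shows one common reading of that statistic: the absolute second difference of the mean log-likelihood across successive values of K, divided by the standard deviation of the log-likelihoods at K. This form is an assumption, as are the function name `delta_k` and the replicate log-likelihoods, which are invented for illustration and are not data from the study.

```python
from statistics import mean, stdev

def delta_k(loglik_by_k):
    """Return {K: delta K} for every K that has both neighbours K-1 and K+1.

    loglik_by_k maps K to a list of log-likelihoods from replicate runs
    (at least two runs per K, with non-zero spread).
    """
    avg = {k: mean(v) for k, v in loglik_by_k.items()}
    result = {}
    for k in sorted(loglik_by_k):
        if k - 1 in avg and k + 1 in avg:
            # Absolute second difference of the mean log-likelihood at K.
            second_diff = abs(avg[k + 1] - 2 * avg[k] + avg[k - 1])
            result[k] = second_diff / stdev(loglik_by_k[k])
    return result

# Hypothetical replicate log-likelihoods, invented for illustration only.
toy_runs = {
    1: [-1000, -1002, -998],
    2: [-950, -952, -948],
    3: [-850, -851, -849],
    4: [-840, -845, -835],
    5: [-835, -840, -830],
}
dk = delta_k(toy_runs)
best_k = max(dk, key=dk.get)  # K with the largest delta K
```

    The statistic is undefined for the smallest and largest K tested, which is why the function only returns values for K with both neighbours present.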

  8. Species limits in the Morelet's Alligator lizard (Anguidae: Gerrhonotinae).

    PubMed

    Solano-Zavaleta, Israel; Nieto-Montes de Oca, Adrián

    2018-03-01

    The widely distributed, Central American anguid lizard Mesaspis moreletii is currently recognized as a polytypic species with five subspecies (M. m. fulvus, M. m. moreletii, M. m. rafaeli, M. m. salvadorensis, and M. m. temporalis). We reevaluated the species limits within Mesaspis moreletii using DNA sequences of one mitochondrial and three nuclear genes. The multi-locus data set included samples of all of the subspecies of M. moreletii, the other species of Mesaspis in Central America (M. cuchumatanus and M. monticola), and some populations assignable to M. moreletii but of uncertain subspecific identity from Honduras and Nicaragua. We first used a tree-based method for delimiting species based on mtDNA data to identify potential evolutionary independent lineages, and then analyzed the multilocus dataset with two species delimitation methods that use the multispecies coalescent model to evaluate different competing species delimitation models: the Bayes factors species delimitation method (BFD) implemented in *BEAST, and the Bayesian Phylogenetics and Phylogeography (BP&P) method. Our results suggest that M. m. moreletii, M. m. rafaeli, M. m. salvadorensis, and M. m. temporalis represent distinct evolutionary independent lineages, and that the populations of uncertain status from Honduras and Nicaragua may represent additional undescribed species. Our results also suggest that M. m. fulvus is a synonym of M. m. moreletii. The biogeography of the Central American lineages of Mesaspis is discussed. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Maternal heterozygosity and progeny fitness association in an inbred Scots pine population.

    PubMed

    Abrahamsson, S; Ahlinder, J; Waldmann, P; García-Gil, M R

    2013-03-01

    Associations between heterozygosity and fitness traits have typically been investigated in populations characterized by low levels of inbreeding. We investigated the associations between standardized multilocus heterozygosity (stMLH) in mother trees (obtained from 12 nuclear microsatellite markers) and five fitness traits measured in progenies from an inbred Scots pine population. The traits studied were proportion of sound seed, mean seed weight, germination rate, mean family height of one-year old seedlings under greenhouse conditions (GH) and mean family height of three-year old seedlings under field conditions (FH). The relatively high average inbreeding coefficient (F) in the population under study corresponds to a mixture of trees with different levels of co-ancestry, potentially resulting from a recent bottleneck. We used both frequentist and Bayesian methods of polynomial regression to investigate the presence of linear and non-linear relations between stMLH and each of the fitness traits. No significant associations were found for any of the traits except for GH, which displayed a negative linear effect with stMLH. A negative heterozygosity-fitness correlation (HFC) for GH could potentially be explained by the effect of heterosis caused by mating of two inbred mother trees (Lippman and Zamir 2006), or outbreeding depression in the most heterozygous trees and its negative impact on the fitness of the progeny, while their simultaneous action is also possible (Lynch 1991). However, since this effect was not detected for FH, we also cannot rule out that the greenhouse conditions introduce artificial effects that disappear under more realistic field conditions.
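
    The abstract above does not say how stMLH was computed. The sketch below uses one widely used definition: the proportion of an individual's typed loci that are heterozygous, divided by the mean observed heterozygosity in the population at those same loci. Both that definition and the function name `standardized_mlh` are assumptions here. The toy genotypes (two loci, four trees) are hypothetical and are not data from the study.

```python
def standardized_mlh(individual, population):
    """Standardized multilocus heterozygosity for one individual.

    individual: dict mapping locus name -> (allele1, allele2), or None if untyped.
    population: list of such dicts (normally including the individual itself).
    """
    typed = [loc for loc, gt in individual.items() if gt is not None]
    if not typed:
        raise ValueError("individual has no typed loci")
    # Proportion of the individual's typed loci that are heterozygous.
    ind_het = sum(individual[loc][0] != individual[loc][1] for loc in typed) / len(typed)
    # Mean observed population heterozygosity at the same loci.
    locus_het = []
    for loc in typed:
        calls = [p[loc] for p in population if p.get(loc) is not None]
        locus_het.append(sum(a != b for a, b in calls) / len(calls))
    mean_het = sum(locus_het) / len(locus_het)
    if mean_het == 0:
        raise ValueError("no heterozygotes at the typed loci")
    return ind_het / mean_het

# Hypothetical microsatellite genotypes (allele sizes), for illustration only.
trees = [
    {"L1": (100, 102), "L2": (200, 200)},
    {"L1": (100, 100), "L2": (200, 204)},
    {"L1": (100, 102), "L2": (200, 204)},
    {"L1": (102, 102), "L2": (204, 204)},
]
st_mlh = [standardized_mlh(t, trees) for t in trees]
```

    Standardizing by the population heterozygosity of the typed loci keeps individuals comparable when some of them have missing genotypes at highly variable markers.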

  10. Wing patterning genes and coevolution of Müllerian mimicry in Heliconius butterflies: Support from phylogeography, cophylogeny, and divergence times.

    PubMed

    Hoyal Cuthill, Jennifer F; Charleston, Michael

    2015-12-01

    Examples of long-term coevolution are rare among free-living organisms. Müllerian mimicry in Heliconius butterflies had been suggested as a key example of coevolution by early genetic studies. However, research over the last two decades has been dominated by the idea that the best-studied comimics, H. erato and H. melpomene, did not coevolve at all. Recently sequenced genes associated with wing color pattern phenotype offer a new opportunity to resolve this controversy. Here, we test the hypothesis of coevolution between H. erato and H. melpomene using Bayesian multilocus analysis of five color pattern genes and five neutral genetic markers. We first explore the extent of phylogenetic agreement versus conflict between the different genes. Coevolution is then tested against three aspects of the mimicry diversifications: phylogenetic branching patterns, divergence times, and, for the first time, phylogeographic histories. We show that all three lines of evidence are compatible with strict coevolution of the diverse mimicry wing patterns, contrary to some recent suggestions. Instead, these findings tally with a coevolutionary diversification driven primarily by the ecological force of Müllerian mimicry. © 2015 The Author(s). Evolution © 2015 The Society for the Study of Evolution.

  11. Multilocus Microsatellite Typing (MLMT) of Strains from Turkey and Cyprus Reveals a Novel Monophyletic L. donovani Sensu Lato Group

    PubMed Central

    Amro, Ahmad; Mentis, Andreas; Pratlong, Francine; Dedet, Jean-Pierre; Votypka, Jan; Volf, Petr; Ozensoy Toz, Seray; Kuhls, Katrin; Schönian, Gabriele; Soteriadou, Ketty

    2012-01-01

    Background: New foci of human CL caused by strains of the Leishmania donovani (L. donovani) complex have been recently described in Cyprus and the Çukurova region in Turkey (L. infantum) situated 150 km north of Cyprus. Cypriot strains were typed by Multilocus Enzyme Electrophoresis (MLEE) using the Montpellier (MON) system as L. donovani zymodeme MON-37. However, multilocus microsatellite typing (MLMT) has shown that this zymodeme is paraphyletic; composed of distantly related genetic subgroups of different geographical origin. Consequently the origin of the Cypriot strains remained enigmatic. Methodology/Principal Findings: The Cypriot strains were compared with a set of Turkish isolates obtained from a CL patient and sand fly vectors in south-east Turkey (Çukurova region; CUK strains) and from a VL patient in the south-west (Kuşadasi; EP59 strain). These Turkish strains were initially analyzed using the K26-PCR assay that discriminates MON-1 strains by their amplicon size. In line with previous DNA-based data, the strains were inferred to the L. donovani complex and characterized as non MON-1. For these strains MLEE typing revealed two novel zymodemes; L. donovani MON-309 (CUK strains) and MON-308 (EP59). A population genetic analysis of the Turkish isolates was performed using 14 hyper-variable microsatellite loci. The genotypic profiles of 68 previously analyzed L. donovani complex strains from major endemic regions were included for comparison. Population structures were inferred by combination of Bayesian model-based and distance-based approaches. MLMT placed the Turkish and Cypriot strains in a subclade of a newly discovered, genetically distinct L. infantum monophyletic group, suggesting that the Cypriot strains may originate from Turkey. Conclusion: The discovery of a genetically distinct L. infantum monophyletic group in the south-eastern Mediterranean stresses the importance of species genetic characterization towards better understanding, monitoring and controlling the spread of leishmaniasis in this region. PMID:22348162

  12. Detecting genotypic changes associated with selective mortality at sea in Atlantic salmon: polygenic multilocus analysis surpasses genome scan.

    PubMed

    Bourret, Vincent; Dionne, Mélanie; Bernatchez, Louis

    2014-09-01

    Wild populations of Atlantic salmon have declined worldwide. While the causes for this decline may be complex and numerous, increased mortality at sea is predicted to be one of the major contributing factors. Examining the potential changes occurring in the genome-wide composition of populations during this migration has the potential to tease apart some of the factors influencing marine mortality. Here, we genotyped 5568 SNPs in Atlantic salmon populations representing two distinct regional genetic groups and across two cohorts to test for differential allelic and genotypic frequencies between juveniles (smolts) migrating to sea and adults (grilses) returning to freshwater after 1 year at sea. Given the complexity of the traits potentially associated with sea mortality, we contrasted the outcomes of a single-locus F(ST) based genome scan method with a new multilocus framework to test for genetically based differential mortality at sea. While numerous outliers were identified by the single-locus analysis, no evidence for parallel, temporally repeated selection was found. In contrast, the multilocus approach detected repeated patterns of selection for a multilocus group of 34 covarying SNPs in one of the two populations. No significant pattern of selective mortality was detected in the other population, suggesting different causes of mortality among populations. These results first support the hypothesis that selection mainly causes small changes in allele frequencies among many covarying loci rather than a small number of changes in loci with large effects. They also point out that moving away from a strict 'selective sweep paradigm' towards a multilocus genetics framework may be a more useful approach for studying the genomic signatures of natural selection on complex traits in wild populations. © 2014 John Wiley & Sons Ltd.

  13. Population sub-structuring among Trypanosoma evansi stocks.

    PubMed

    Njiru, Z K; Constantine, C C

    2007-10-01

    To investigate the population genetic structure of Trypanosoma evansi from domesticated animals, we have analysed 112 stocks from camels, buffaloes, cattle and horses using the tandemly repeated coding sequence (MORF2) and minisatellite markers 292 and cysteine-rich acidic integral membrane protein (CRAM). We recorded a total of six alleles at the MORF2 locus, seven at 292 and 12 at the CRAM loci. Nei's genetic distance showed reduced allelic diversity between buffaloes and cattle stocks (1.2) as compared to the diversity between camels and buffaloes (3.75) and camels and cattle stock (1.69). The mean index of association (IA=0.92) significantly deviated from zero, and the average number of multilocus genotypes (G/N ratio) was 0.21. Twenty-four multilocus genotypes were defined from the combination of alleles at the three loci. The Kenyan sub-populations showed Fst=0.28 and analysis of molecular variance showed significant divergence (22.7%) between the Laikipia, Kulal and Galana regions. The regional and host distribution of multi-locus genotypes, significant population differentiation and high Nei's genetic distances suggest the existence of genetic sub-structuring within T. evansi stocks, while the few multi-locus genotypes and deviation of association index from zero indicate the lack of recombination. In conclusion, this study reveals that some genetic sub-structuring does occur within T. evansi, which has a clonal population structure.
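
    As a quick arithmetic check on the figures quoted in this abstract, the G/N ratio can be recomputed from the 24 multilocus genotypes and the 112 stocks. This is only a sketch: the variable names are illustrative, and it assumes G/N is simply the number of distinct genotypes divided by the number of stocks analysed.

```python
# Values quoted in the abstract above.
n_multilocus_genotypes = 24   # G: distinct multilocus genotypes
n_stocks = 112                # N: stocks analysed

# Assumed definition: G/N = distinct genotypes per stock sampled.
g_over_n = n_multilocus_genotypes / n_stocks

print(f"G/N = {g_over_n:.2f}")  # compare with the reported 0.21
```

    A low G/N ratio means many stocks share the same multilocus genotype, which is the pattern the abstract reads as evidence for clonality.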

  14. Application of multilocus variable number tandem repeat analysis to monitor Verocytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales: emergence of a profile associated with a national outbreak.

    PubMed

    Perry, N; Cheasty, T; Dallman, T; Launders, N; Willshaw, G

    2013-10-01

    Evaluation of multilocus variable number tandem repeat analysis (MLVA) to subtype all isolates of Vero cytotoxin-producing Escherichia coli O157 phage type 8 in England and Wales. Over a 13 month period from December 2010, 483 isolates of VTEC O157 PT8 were tested by MLVA; 39% were received in the first 4 months of 2011, when infections are generally low. One profile, or single locus variants of it, was present in 249 (52%) isolates but was not common previously. These cases represented a national increase in PT8, associated epidemiologically with soil-contaminated vegetables. Most of the 177 other MLVA profiles were unique to a single isolate. Profiles shared by >1 isolate included cases from two small community, food-borne outbreaks and 11 households. Several shared profiles were found among 23 isolates without known links. Apart from one group, isolates linked to travel abroad had very diverse profiles. Multilocus variable number tandem repeat analysis discriminated apparent sporadic isolates of the same PT and assisted in detection of cases in an emerging national outbreak. Multilocus variable number tandem repeat analysis is an epidemiologically valid complement to surveillance and applicable as a rapid, practical test for large numbers of isolates. © 2013 The Society for Applied Microbiology.

  15. An improved approximate-Bayesian model-choice method for estimating shared evolutionary history

    PubMed Central

    2014-01-01

    Background: To understand biological diversification, it is important to account for large-scale processes that affect the evolutionary history of groups of co-distributed populations of organisms. Such events predict temporally clustered divergence times, a pattern that can be estimated using genetic data from co-distributed species. I introduce a new approximate-Bayesian method for comparative phylogeographical model-choice that estimates the temporal distribution of divergences across taxa from multi-locus DNA sequence data. The model is an extension of that implemented in msBayes. Results: By reparameterizing the model, introducing more flexible priors on demographic and divergence-time parameters, and implementing a non-parametric Dirichlet-process prior over divergence models, I improved the robustness, accuracy, and power of the method for estimating shared evolutionary history across taxa. Conclusions: The results demonstrate the improved performance of the new method is due to (1) more appropriate priors on divergence-time and demographic parameters that avoid prohibitively small marginal likelihoods for models with more divergence events, and (2) the Dirichlet-process providing a flexible prior on divergence histories that does not strongly disfavor models with intermediate numbers of divergence events. The new method yields more robust estimates of posterior uncertainty, and thus greatly reduces the tendency to incorrectly estimate models of shared evolutionary history with strong support. PMID:24992937

  16. Toward the resolution of an explosive radiation--a multilocus phylogeny of oceanic dolphins (Delphinidae).

    PubMed

    McGowen, Michael R

    2011-09-01

    Oceanic dolphins (Delphinidae) are the product of a rapid radiation that yielded ∼36 extant species of small to medium-sized cetaceans that first emerged in the Late Miocene. Although they are a charismatic group of organisms that have become poster children for marine conservation, many phylogenetic relationships within Delphinidae remain elusive due to the slow molecular evolution of the group and the difficulty of resolving short branches from successive cladogenic events. Here I combine existing and newly generated sequences from four mitochondrial (mt) genes and 20 nuclear (nu) genes to reconstruct a well-supported phylogenetic hypothesis for Delphinidae. This study compares maximum-likelihood and Bayesian inference methods of several data sets including mtDNA, combined nuDNA, gene trees of individual nuDNA loci, and concatenated mtDNA+nuDNA. In addition, I contrast these standard phylogenetic analyses with the species tree reconstruction method of Bayesian concordance analysis (BCA). Despite finding discordance between mtDNA and individual nuDNA loci, the concatenated matrix recovers a completely resolved and robustly supported phylogeny that is also broadly congruent with BCA trees. This study strongly supports groupings such as Delphininae, Lissodelphininae, Globicephalinae, Sotalia+Delphininae, Steno+Orcaella+Globicephalinae, and Leucopleurus acutus, Lagenorhynchus albirostris, and Orcinus orca as basal delphinid taxa. Copyright © 2011 Elsevier Inc. All rights reserved.

  17. Multilocus approaches for the measurement of selection on correlated genetic loci.

    PubMed

    Gompert, Zachariah; Egan, Scott P; Barrett, Rowan D H; Feder, Jeffrey L; Nosil, Patrik

    2017-01-01

    The study of ecological speciation is inherently linked to the study of selection. Methods for estimating phenotypic selection within a generation based on associations between trait values and fitness (e.g. survival) of individuals are established. These methods attempt to disentangle selection acting directly on a trait from indirect selection caused by correlations with other traits via multivariate statistical approaches (i.e. inference of selection gradients). The estimation of selection on genotypic or genomic variation could also benefit from disentangling direct and indirect selection on genetic loci. However, achieving this goal is difficult with genomic data because the number of potentially correlated genetic loci (p) is very large relative to the number of individuals sampled (n). In other words, the number of model parameters exceeds the number of observations (p ≫ n). We present simulations examining the utility of whole-genome regression approaches (i.e. Bayesian sparse linear mixed models) for quantifying direct selection in cases where p ≫ n. Such models have been used for genome-wide association mapping and are common in artificial breeding. Our results show they hold promise for studies of natural selection in the wild and thus of ecological speciation. But we also demonstrate important limitations to the approach and discuss study designs required for more robust inferences. © 2016 John Wiley & Sons Ltd.

  18. Joint genotype- and ancestry-based genome-wide association studies in admixed populations.

    PubMed

    Szulc, Piotr; Bogdan, Malgorzata; Frommlet, Florian; Tang, Hua

    2017-09-01

    In genome-wide association studies (GWAS) genetic loci that influence complex traits are localized by inspecting associations between genotypes of genetic markers and the values of the trait of interest. On the other hand, admixture mapping, which is performed in case of populations consisting of a recent mix of two ancestral groups, relies on the ancestry information at each locus (locus-specific ancestry). Recently it has been proposed to jointly model genotype and locus-specific ancestry within the framework of single marker tests. Here, we extend this approach for population-based GWAS in the direction of multimarker models. A modified version of the Bayesian information criterion is developed for building a multilocus model that accounts for the differential correlation structure due to linkage disequilibrium (LD) and admixture LD. Simulation studies and a real data example illustrate the advantages of this new approach compared to single-marker analysis or modern model selection strategies based on separately analyzing genotype and ancestry data, as well as to single-marker analysis combining genotypic and ancestry information. Depending on the signal strength, our procedure automatically chooses whether genotypic or locus-specific ancestry markers are added to the model. This results in a good compromise between the power to detect causal mutations and the precision of their localization. The proposed method has been implemented in R and is available at http://www.math.uni.wroc.pl/~mbogdan/admixtures/. © 2017 WILEY PERIODICALS, INC.

  19. Diversification of the silverspot butterflies (Nymphalidae) in the Neotropics inferred from multi-locus DNA sequences.

    PubMed

    Massardo, Darli; Fornel, Rodrigo; Kronforst, Marcus; Gonçalves, Gislene Lopes; Moreira, Gilson Rudinei Pires

    2015-01-01

    The tribe Heliconiini (Lepidoptera: Nymphalidae) is a diverse group of butterflies distributed throughout the Neotropics, which has been studied extensively, in particular the genus Heliconius. However, most of the other lineages, such as Dione, which are less diverse and considered basal within the group, have received little attention. Basic information, such as species limits and geographical distributions, remains uncertain for this genus. Here we used multilocus DNA sequence data and the geographical distribution analysis across the entire range of Dione in the Neotropical region in order to make inferences on the evolutionary history of this poorly explored lineage. Bayesian time-tree reconstruction allows inferring two major diversification events in this tribe around 25 mya. Lineages thought to be ancient, such as Dione and Agraulis, are as recent as Heliconius. Dione formed a monophyletic clade, sister to the genus Agraulis. Dione juno, D. glycera and D. moneta were reciprocally monophyletic and formed genetic clusters, with the first two more closely related to each other than to the third. Divergence time estimates support the hypothesis that speciation in Dione coincided with both the rise of Passifloraceae (the host plants) and the uplift of the Andes. Since the sister species D. glycera and D. moneta are specialized feeders on passion-vine lineages that are endemic to areas located either within or adjacent to the Andes, we inferred that they co-speciated with their host plants during this vicariant event. Copyright © 2014 Elsevier Inc. All rights reserved.

  20. Multilocus methylation analysis in a large cohort of 11p15-related foetal growth disorders (Russell Silver and Beckwith Wiedemann syndromes) reveals simultaneous loss of methylation at paternal and maternal imprinted loci.

    PubMed

    Azzi, Salah; Rossignol, Sylvie; Steunou, Virginie; Sas, Theo; Thibaud, Nathalie; Danton, Fabienne; Le Jule, Maryline; Heinrichs, Claudine; Cabrol, Sylvie; Gicquel, Christine; Le Bouc, Yves; Netchine, Irene

    2009-12-15

    Genomic imprinting plays an important role in mammalian development. Loss of imprinting (LOI) through loss (LOM) or gain (GOM) of methylation is involved in many human disorders and cancers. The imprinted 11p15 region is crucial for the control of foetal growth and LOI at this locus is implicated in two clinically opposite disorders: Beckwith Wiedemann syndrome (BWS) with foetal overgrowth associated with an enhanced tumour risk and Russell-Silver syndrome (RSS) with intrauterine and postnatal growth restriction. So far, only a few studies have assessed multilocus LOM in human imprinting diseases. To investigate multilocus LOI syndrome, we studied the methylation status of five maternally and two paternally methylated loci in a large series (n = 167) of patients with 11p15-related foetal growth disorders. We found that 9.5% of RSS and 24% of BWS patients showed multilocus LOM at regions other than ICR1 and ICR2 11p15, respectively. Moreover, over two thirds of multilocus LOM RSS patients also had LOM at a second paternally methylated locus, DLK1/GTL2 IG-DMR. No additional clinical features due to LOM of other loci were found, suggesting an (epi)dominant effect of the 11p15 LOM on the clinical phenotype for this series of patients. Surprisingly, four patients displayed LOM at both ICR1 and ICR2 11p15. Three of them had an RSS and one a BWS phenotype. Our results show for the first time that multilocus LOM can also concern RSS patients. Moreover, LOM can involve both paternally and maternally methylated loci in the same patient.

  1. Detection and validation of genomic regions associated with resistance to rust diseases in a worldwide hexaploid wheat landrace collection using BayesR and mixed linear model approaches.

    PubMed

    Pasam, Raj K; Bansal, Urmil; Daetwyler, Hans D; Forrest, Kerrie L; Wong, Debbie; Petkowski, Joanna; Willey, Nicholas; Randhawa, Mandeep; Chhetri, Mumta; Miah, Hanif; Tibbits, Josquin; Bariana, Harbans; Hayden, Matthew J

    2017-04-01

    BayesR and MLM association mapping approaches in common wheat landraces were used to identify genomic regions conferring resistance to Yr, Lr, and Sr diseases. Deployment of rust resistant cultivars is the most economically effective and environmentally friendly strategy to control rust diseases in wheat. However, the highly evolving nature of wheat rust pathogens demands continued identification, characterization, and transfer of new resistance alleles into new varieties to achieve durable rust control. In this study, we undertook genome-wide association studies (GWAS) using a mixed linear model (MLM) and the Bayesian multilocus method (BayesR) to identify QTL contributing to leaf rust (Lr), stem rust (Sr), and stripe rust (Yr) resistance. Our study included 676 pre-Green Revolution common wheat landrace accessions collected in the 1920-1930s by A.E. Watkins. We show that both methods produce similar results, although BayesR had reduced background signals, enabling clearer definition of QTL positions. For the three rust diseases, we found 5 (Lr), 14 (Yr), and 11 (Sr) SNPs significant in both methods above stringent false-discovery rate thresholds. Validation of marker-trait associations with known rust QTL from the literature and additional genotypic and phenotypic characterisation of biparental populations showed that the landraces harbour both previously mapped and potentially new genes for resistance to rust diseases. Our results demonstrate that pre-Green Revolution landraces provide a rich source of genes to increase genetic diversity for rust resistance to facilitate the development of wheat varieties with more durable rust resistance.

  2. Addictive behaviors and addiction-prone personality traits: associations with a dopamine multilocus genetic profile.

    PubMed

    Davis, Caroline; Loxton, Natalie J

    2013-07-01

    The purpose of this study was to examine reward-related genetic risk for addictive behaviors in a healthy community sample (n=217) of men and women. We tested a mediation model predicting that a quantitative multilocus genetic profile score - reflecting the additive effects of alleles known to confer relatively increased dopamine signaling in the ventral striatum - would relate positively to a composite measure of addictive behaviors, and that this association would be mediated by personality traits consistently associated with addiction disorders. Our model was strongly supported by the data, and accounted for 24% of the variance in addictive behaviors. These data suggest that brain reward processes tend to exert their influence on addiction risk by their role in the development of relatively stable personality traits associated with addictive behaviors. Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. Influence of gene flow on divergence dating - implications for the speciation history of Takydromus grass lizards.

    PubMed

    Tseng, Shu-Ping; Li, Shou-Hsien; Hsieh, Chia-Hung; Wang, Hurng-Yi; Lin, Si-Min

    2014-10-01

    Dating the time of divergence and understanding speciation processes are central to the study of the evolutionary history of organisms but are notoriously difficult. The difficulty is largely rooted in variations in the ancestral population size or in the genealogy variation across loci. To depict the speciation processes and divergence histories of three monophyletic Takydromus species endemic to Taiwan, we sequenced 20 nuclear loci and combined with one mitochondrial locus published in GenBank. They were analysed by a multispecies coalescent approach within a Bayesian framework. Divergence dating based on the gene tree approach showed high variation among loci, and the divergence was estimated at an earlier date than when derived by the species-tree approach. To test whether variations in the ancestral population size accounted for the majority of this variation, we conducted computer inferences using isolation-with-migration (IM) and approximate Bayesian computation (ABC) frameworks. The results revealed that gene flow during the early stage of speciation was strongly favoured over the isolation model, and the initiation of the speciation process was far earlier than the dates estimated by gene- and species-based divergence dating. Due to their limited dispersal ability, it is suggested that geographical isolation may have played a major role in the divergence of these Takydromus species. Nevertheless, this study reveals a more complex situation and demonstrates that gene flow during the speciation process cannot be overlooked and may have a great impact on divergence dating. By using multilocus data and incorporating Bayesian coalescence approaches, we provide a more biologically realistic framework for delineating the divergence history of Takydromus. © 2014 John Wiley & Sons Ltd.

  4. A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction

    PubMed Central

    De Oliveira Martins, Leonardo; Mallo, Diego; Posada, David

    2016-01-01

    Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models. PMID:25281847

  5. Multilocus Association Mapping Using Variable-Length Markov Chains

    PubMed Central

    Browning, Sharon R.

    2006-01-01

    I propose a new method for association-based gene mapping that makes powerful use of multilocus data, is computationally efficient, and is straightforward to apply over large genomic regions. The approach is based on the fitting of variable-length Markov chain models, which automatically adapt to the degree of linkage disequilibrium (LD) between markers to create a parsimonious model for the LD structure. Edges of the fitted graph are tested for association with trait status. This approach can be thought of as haplotype testing with sophisticated windowing that accounts for extent of LD to reduce degrees of freedom and number of tests while maximizing information. I present analyses of two published data sets that show that this approach can have better power than single-marker tests or sliding-window haplotypic tests. PMID:16685642

  6. Multilocus association mapping using variable-length Markov chains.

    PubMed

    Browning, Sharon R

    2006-06-01

    I propose a new method for association-based gene mapping that makes powerful use of multilocus data, is computationally efficient, and is straightforward to apply over large genomic regions. The approach is based on the fitting of variable-length Markov chain models, which automatically adapt to the degree of linkage disequilibrium (LD) between markers to create a parsimonious model for the LD structure. Edges of the fitted graph are tested for association with trait status. This approach can be thought of as haplotype testing with sophisticated windowing that accounts for extent of LD to reduce degrees of freedom and number of tests while maximizing information. I present analyses of two published data sets that show that this approach can have better power than single-marker tests or sliding-window haplotypic tests.

  7. Genetic characterization of Colombian Brahman cattle using microsatellites markers.

    PubMed

    Gómez, Y M; Fernandez, M; Rivera, D; Gómez, G; Bernal, J E

    2013-07-01

    Genetic structure and diversity of 3789 animals of the Brahman breed from 23 Colombian regions were assessed. Considering the Brahman Zebu cattle as a single population, the multilocus test based on Hardy-Weinberg (HW) equilibrium showed significant differences (P < 0.001). Genetic characterization of the cattle population allowed the genetic variability to be examined, giving H(o) = 0.6621. The Brahman population in Colombia showed a small subdivision within populations (F(it) = 0.045) and an almost non-existent geographic subdivision, or low differentiation (F(st) = 0.003). The calculated F(is) (0.042) indicates no detriment to the variability in the population despite the narrow mating that takes place, or that some force sustains the variability so that inbreeding does not actually affect the cattle population. The outcomes of multivariate analyses, Bayesian inferences and interindividual genetic distances suggested that there is no genetic sub-structure in the population, because of the high rate of animal migration among regions.
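
    The three F-statistics quoted in this abstract can be cross-checked against the standard Wright identity (1 - F(it)) = (1 - F(is)) * (1 - F(st)). The sketch below only re-derives F(it) from the reported F(is) and F(st). It is not the authors' analysis, and the variable names are illustrative.

```python
# Values quoted in the abstract above.
f_is = 0.042  # within-subpopulation inbreeding coefficient
f_st = 0.003  # differentiation among regions

# Wright's identity: (1 - F_it) = (1 - F_is) * (1 - F_st)
f_it_implied = 1.0 - (1.0 - f_is) * (1.0 - f_st)

print(f"implied F_it = {f_it_implied:.3f}")  # prints 0.045, matching the reported value
```

    The agreement shows the reported values are internally consistent: nearly all of the (small) total deficit of heterozygotes comes from within regions, not from differences among regions.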

  8. pKWmEB: integration of Kruskal-Wallis test with empirical Bayes under polygenic background control for multi-locus genome-wide association study.

    PubMed

    Ren, Wen-Long; Wen, Yang-Jun; Dunwell, Jim M; Zhang, Yuan-Ming

    2018-03-01

    Although nonparametric methods in genome-wide association studies (GWAS) are robust in quantitative trait nucleotide (QTN) detection, the absence of polygenic background control in single-marker association in genome-wide scans results in a high false positive rate. To overcome this issue, we proposed an integrated nonparametric method for multi-locus GWAS. First, a new model transformation was used to whiten the covariance matrix of polygenic matrix K and environmental noise. Using the transformed model, the Kruskal-Wallis test along with least angle regression was then used to select all the markers that were potentially associated with the trait. Finally, all the selected markers were placed into a multi-locus model, their effects were estimated by empirical Bayes, and all the nonzero effects were further identified by a likelihood ratio test for true QTN detection. This method, named pKWmEB, was validated by a series of Monte Carlo simulation studies. As a result, pKWmEB effectively controlled false positive rate, although a less stringent significance criterion was adopted. More importantly, pKWmEB retained the high power of the Kruskal-Wallis test, and provided QTN effect estimates. To further validate pKWmEB, we re-analyzed four flowering time related traits in Arabidopsis thaliana, and detected some previously reported genes that were not identified by the other methods.
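
    To make the single-marker screening step concrete, here is a minimal sketch of the Kruskal-Wallis H statistic for one marker, with phenotypes grouped by genotype class. It is not pKWmEB: the whitening transformation, least angle regression and empirical Bayes steps are omitted, no tie correction is applied, and the function names and toy data are hypothetical.

```python
from collections import defaultdict


def average_ranks(values):
    """Return 1-based ranks, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def kruskal_wallis_h(phenotypes, genotypes):
    """Kruskal-Wallis H statistic (without tie correction) for one marker."""
    ranks = average_ranks(phenotypes)
    rank_sums = defaultdict(float)
    counts = defaultdict(int)
    for rank, genotype in zip(ranks, genotypes):
        rank_sums[genotype] += rank
        counts[genotype] += 1
    n = len(phenotypes)
    between = sum(rank_sums[g] ** 2 / counts[g] for g in counts)
    return 12.0 / (n * (n + 1)) * between - 3.0 * (n + 1)


# Hypothetical toy data: six individuals, two genotype classes at one marker.
phenotypes = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
genotypes = ["AA", "AA", "AA", "aa", "aa", "aa"]
h = kruskal_wallis_h(phenotypes, genotypes)
print(f"H = {h:.3f}")
```

    Under the null hypothesis, H is approximately chi-square distributed with (number of genotype classes - 1) degrees of freedom. A genome-wide scan would repeat this for every marker, and this per-marker scan without background control is the setting the abstract describes as prone to false positives.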

  9. Spatio-temporal Genetic Structuring of Leishmania major in Tunisia by Microsatellite Analysis

    PubMed Central

    Harrabi, Myriam; Bettaieb, Jihène; Ghawar, Wissem; Toumi, Amine; Zaâtour, Amor; Yazidi, Rihab; Chaâbane, Sana; Chalghaf, Bilel; Hide, Mallorie; Bañuls, Anne-Laure; Ben Salah, Afif

    2015-01-01

    In Tunisia, cases of zoonotic cutaneous leishmaniasis caused by Leishmania major are increasing and spreading from the south-west to new areas in the center. To improve the current knowledge on L. major evolution and population dynamics, we performed multi-locus microsatellite typing of human isolates from Tunisian governorates where the disease is endemic (Gafsa, Kairouan and Sidi Bouzid governorates) and collected during two periods: 1991–1992 and 2008–2012. Analysis (F-statistics and Bayesian model-based approach) of the genotyping results of isolates collected in Sidi Bouzid in 1991–1992 and 2008–2012 shows that, over two decades, in the same area, Leishmania parasites evolved by generating genetically differentiated populations. The genetic patterns of 2008–2012 isolates from the three governorates indicate that L. major populations did not spread gradually from the south to the center of Tunisia, according to a geographical gradient, suggesting that human activities might be the source of the disease expansion. The genotype analysis also suggests previous (Bayesian model-based approach) and current (F-statistics) flows of genotypes between governorates and districts. Human activities as well as reservoir dynamics and the effects of environmental changes could explain how the disease progresses. This study provides new insights into the evolution and spread of L. major in Tunisia that might improve our understanding of the parasite flow between geographically and temporally distinct populations. PMID:26302440

  10. Non-monophyly and intricate morphological evolution within the avian family Cettiidae revealed by multilocus analysis of a taxonomically densely sampled dataset

    PubMed Central

    2011-01-01

    Background: The avian family Cettiidae, including the genera Cettia, Urosphena, Tesia, Abroscopus and Tickellia and Orthotomus cucullatus, has recently been proposed based on analysis of a small number of loci and species. The close relationship of most of these taxa was unexpected, and called for a comprehensive study based on multiple loci and dense taxon sampling. In the present study, we infer the relationships of all except one of the species in this family using one mitochondrial and three nuclear loci. We use traditional gene tree methods (Bayesian inference, maximum likelihood bootstrapping, parsimony bootstrapping), as well as a recently developed Bayesian species tree approach (*BEAST) that accounts for lineage sorting processes that might produce discordance between gene trees. We also analyse mitochondrial DNA for a larger sample, comprising multiple individuals and a large number of subspecies of polytypic species. Results: There are many topological incongruences among the single-locus trees, although none of these is strongly supported. The multi-locus tree inferred using concatenated sequences and the species tree agree well with each other, and are overall well resolved and well supported by the data. The main discrepancy between these trees concerns the most basal split. Both methods infer the genus Cettia to be highly non-monophyletic, as it is scattered across the entire family tree. Deep intraspecific divergences are revealed, and one or two species and one subspecies are inferred to be non-monophyletic (differences between methods). Conclusions: The molecular phylogeny presented here is strongly inconsistent with the traditional, morphology-based classification. The remarkably high degree of non-monophyly in the genus Cettia is likely to be one of the most extraordinary examples of misconceived relationships in an avian genus. The phylogeny suggests instances of parallel evolution, as well as highly unequal rates of morphological divergence in different lineages. This complex morphological evolution apparently misled earlier taxonomists. These results underscore the well-known but still often neglected problem of basing classifications on overall morphological similarity. Based on the molecular data, a revised taxonomy is proposed. Although the traditional and species tree methods inferred much the same tree in the present study, the assumption by species tree methods that all species are monophyletic is a limitation in these methods, as some currently recognized species might have more complex histories. PMID:22142197

  11. Heterozygosity and fitness: No strong association in Great Lakes populations of the zebra mussel, Dreissena Polymorpha (Pallas)

    USGS Publications Warehouse

    Lewis, K.M.; Feder, J.L.; Horvath, T.G.; Lamberti, G.A.

    2000-01-01

    A number of studies have found positive associations between allozyme heterozygosity and fitness surrogates (e.g., body size and growth rate) for marine molluscs. We investigated whether similar relationships exist for freshwater populations of the zebra mussel, Dreissena polymorpha. Only one significant correlation between multi-locus heterozygosity and shell length was observed for a total of 22 D. polymorpha populations surveyed from midwestern U.S.A. lakes and streams, and the result was not significant on a table-wide basis. Meta-analysis revealed a significant common correlation coefficient (effect magnitude) between multi-locus heterozygosity and shell length across all 22 sites (rc = 0.052, P = 0.019, 1557 df). However, the variance in shell length explained by multi-locus heterozygosity was small (rc² = 0.0027), implying a weak causal relationship if any. Also, we saw no relationship between heterozygosity and growth rate in a one-year field enclosure experiment. A significant heterozygosity-shell length correlation previously reported for a zebra mussel population at Put-in-Bay, Lake Erie, Ohio, may have been the product of unique population dynamics, rather than natural selection. Similar demographic considerations may contribute to inconsistencies in heterozygosity-fitness correlations seen for other molluscs.
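
    The "variance explained" figure in this abstract is the square of the common correlation coefficient, and the arithmetic is easy to confirm. The short sketch below uses only the value quoted above, and the variable names are illustrative.

```python
# Common correlation coefficient quoted in the abstract above.
r_c = 0.052

# Proportion of variance in shell length explained by heterozygosity.
variance_explained = r_c ** 2

print(f"{variance_explained:.4f}")             # 0.0027
print(f"{100 * variance_explained:.2f}% of variance")  # 0.27% of variance
```

    A correlation can therefore be statistically significant with 1557 degrees of freedom while accounting for well under 1% of the variance, which is the distinction the abstract draws.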

  12. sourceR: Classification and source attribution of infectious agents among heterogeneous populations

    PubMed Central

    French, Nigel

    2017-01-01

    Zoonotic diseases are a major cause of morbidity and productivity losses in both human and animal populations. Identifying the source of food-borne zoonoses (e.g. an animal reservoir or food product) is crucial for the identification and prioritisation of food safety interventions. For many zoonotic diseases it is difficult to attribute human cases to sources of infection because there is little epidemiological information on the cases. However, microbial strain typing allows zoonotic pathogens to be categorised, and the relative frequencies of the strain types among the sources and in human cases allows inference on the likely source of each infection. We introduce sourceR, an R package for quantitative source attribution, aimed at food-borne diseases. It implements a Bayesian model using strain-typed surveillance data from both human cases and source samples, capable of identifying important sources of infection. The model measures the force of infection from each source, allowing for varying survivability, pathogenicity and virulence of pathogen strains, and varying abilities of the sources to act as vehicles of infection. A Bayesian non-parametric (Dirichlet process) approach is used to cluster pathogen strain types by epidemiological behaviour, avoiding model overfitting and allowing detection of strain types associated with potentially high “virulence”. sourceR is demonstrated using Campylobacter jejuni isolate data collected in New Zealand between 2005 and 2008. Chicken from a particular poultry supplier was identified as the major source of campylobacteriosis, which is qualitatively similar to results of previous studies using the same dataset. Additionally, the software identifies a cluster of 9 multilocus sequence types with abnormally high ‘virulence’ in humans. sourceR enables straightforward attribution of cases of zoonotic infection to putative sources of infection. As sourceR develops, we intend it to become an important and flexible resource for food-borne disease attribution studies. PMID:28558033

  13. Systematics of Fusaria associated with Ambrosia beetles

    USDA-ARS's Scientific Manuscript database

    Here, I summarize research efforts directed at characterizing ambrosia beetle-associated fusaria, including the species responsible for avocado wilt in Israel (Mendel et al., Phytoparasitica 2012) and branch dieback in California (Eskalen et al., Pl. Dis. 2012). Our multilocus molecular phylogenetic...

  14. Complex tissue-specific epigenotypes in Russell-Silver Syndrome associated with 11p15 ICR1 hypomethylation.

    PubMed

    Azzi, Salah; Blaise, Annick; Steunou, Virginie; Harbison, Madeleine D; Salem, Jennifer; Brioude, Frédéric; Rossignol, Sylvie; Habib, Walid Abi; Thibaud, Nathalie; Neves, Cristina Das; Jule, Marilyne Le; Brachet, Cécile; Heinrichs, Claudine; Bouc, Yves Le; Netchine, Irène

    2014-10-01

    Russell-Silver Syndrome (RSS) is a prenatal and postnatal growth retardation syndrome caused mainly by 11p15 ICR1 hypomethylation. Clinical presentation is heterogeneous in RSS patients with 11p15 ICR1 hypomethylation. We previously identified a subset of RSS patients with 11p15 ICR1 and multilocus hypomethylation. Here, we examine the relationships between IGF2 expression, 11p15 ICR1 methylation, and multilocus imprinting defects in various cell types from 39 RSS patients with 11p15 ICR1 hypomethylation in leukocyte DNA. 11p15 ICR1 hypomethylation was more pronounced in leukocytes than in buccal mucosa cells. Skin fibroblast IGF2 expression was correlated with the degree of ICR1 hypomethylation. Different tissue-specific multilocus methylation defects coexisted in 38% of cases, with some loci hypomethylated and others hypermethylated within the same cell type in some cases. Our new results suggest that tissue-specific epigenotypes may lead to clinical heterogeneity in RSS. © 2014 WILEY PERIODICALS, INC.

  15. GAMETES: a fast, direct algorithm for generating pure, strict, epistatic models with random architectures.

    PubMed

    Urbanowicz, Ryan J; Kiralis, Jeff; Sinnott-Armstrong, Nicholas A; Heberling, Tamra; Fisher, Jonathan M; Moore, Jason H

    2012-10-01

    Geneticists who look beyond single locus disease associations require additional strategies for the detection of complex multi-locus effects. Epistasis, a multi-locus masking effect, presents a particular challenge, and has been the target of bioinformatic development. Thorough evaluation of new algorithms calls for simulation studies in which known disease models are sought. To date, the best methods for generating simulated multi-locus epistatic models rely on genetic algorithms. However, such methods are computationally expensive, difficult to adapt to multiple objectives, and unlikely to yield models with a precise form of epistasis which we refer to as pure and strict. Purely and strictly epistatic models constitute the worst case in terms of detecting disease associations, since such associations may only be observed if all n loci are included in the disease model. This makes them an attractive gold standard for simulation studies considering complex multi-locus effects. We introduce GAMETES, a user-friendly software package and algorithm which generates complex biallelic single nucleotide polymorphism (SNP) disease models for simulation studies. GAMETES rapidly and precisely generates random, pure, strict n-locus models with specified genetic constraints. These constraints include heritability, minor allele frequencies of the SNPs, and population prevalence. GAMETES also includes a simple dataset simulation strategy which may be utilized to rapidly generate an archive of simulated datasets for given genetic models. We highlight the utility and limitations of GAMETES with an example simulation study using MDR, an algorithm designed to detect epistasis. GAMETES is a fast, flexible, and precise tool for generating complex n-locus models with random architectures. While GAMETES has a limited ability to generate models with higher heritabilities, it is proficient at generating the lower heritability models typically used in simulation studies evaluating new algorithms. In addition, the GAMETES modeling strategy may be flexibly combined with any dataset simulation strategy. Beyond dataset simulation, GAMETES could be employed to pursue theoretical characterization of genetic models and epistasis.
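
    To make the constraints named in this abstract concrete (heritability, minor allele frequency, prevalence, and the absence of single-locus effects), the following is a minimal sketch of how they could be evaluated for a two-locus penetrance table. It is not the GAMETES algorithm. The penetrance values are hypothetical, Hardy-Weinberg proportions and independent loci are assumed, and heritability is taken to be the variance of the penetrances divided by K(1 - K), one common definition that is an assumption here rather than something stated in the abstract.

```python
from itertools import product


def genotype_freqs(maf):
    """Hardy-Weinberg frequencies for genotypes (AA, Aa, aa), minor allele frequency maf."""
    q = maf
    p = 1.0 - q
    return [p * p, 2.0 * p * q, q * q]


def prevalence(pen, fa, fb):
    """Population prevalence K: frequency-weighted mean penetrance."""
    return sum(fa[i] * fb[j] * pen[i][j] for i, j in product(range(3), repeat=2))


def heritability(pen, fa, fb):
    """Assumed definition: variance of penetrance divided by K * (1 - K)."""
    k = prevalence(pen, fa, fb)
    var = sum(fa[i] * fb[j] * (pen[i][j] - k) ** 2
              for i, j in product(range(3), repeat=2))
    return var / (k * (1.0 - k))


def marginal_penetrances(pen, fa, fb):
    """Single-locus penetrances obtained by averaging over the other locus."""
    rows = [sum(fb[j] * pen[i][j] for j in range(3)) for i in range(3)]
    cols = [sum(fa[i] * pen[i][j] for i in range(3)) for j in range(3)]
    return rows, cols


# Hypothetical penetrance table (rows: locus A genotype, columns: locus B genotype).
PENETRANCE = [
    [0.0, 0.0, 1.0],
    [0.0, 0.5, 0.0],
    [1.0, 0.0, 0.0],
]

fa = genotype_freqs(0.5)
fb = genotype_freqs(0.5)
K = prevalence(PENETRANCE, fa, fb)
h2 = heritability(PENETRANCE, fa, fb)
rows, cols = marginal_penetrances(PENETRANCE, fa, fb)

# No marginal effects: every single-locus penetrance equals the prevalence.
is_pure = all(abs(m - K) < 1e-12 for m in rows + cols)

print(f"K = {K:.3f}, h2 = {h2:.3f}, pure = {is_pure}")
```

    For this table every single-locus genotype carries the same risk as the population as a whole, so neither SNP shows an association when examined alone, which is the property that makes such models a hard test case for detection algorithms.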

  16. Multilocus analyses of seven candidate genes suggest interacting pathways for obesity-related traits in Brazilian populations.

    PubMed

    Angeli, Cláudia B; Kimura, Lilian; Auricchio, Maria T; Vicente, João P; Mattevi, Vanessa S; Zembrzuski, Verônica M; Hutz, Mara H; Pereira, Alexandre C; Pereira, Tiago V; Mingroni-Netto, Regina C

    2011-06-01

    We investigated whether variants in major candidate genes for food intake and body weight regulation contribute to obesity-related traits under a multilocus perspective. We studied 375 Brazilian subjects from partially isolated African-derived populations (quilombos). Seven variants displaying conflicting results in previous reports and supposedly implicated in the susceptibility of obesity-related phenotypes were investigated: β2-adrenergic receptor (ADRB2) (Arg16Gly), insulin induced gene 2 (INSIG2) (rs7566605), leptin (LEP) (A19G), LEP receptor (LEPR) (Gln223Arg), perilipin (PLIN) (6209T > C), peroxisome proliferator-activated receptor-γ (PPARG) (Pro12Ala), and resistin (RETN) (-420 C > G). Regression models as well as generalized multifactor dimensionality reduction (GMDR) were employed to test the contribution of individual effects and higher-order interactions to BMI and waist-hip ratio (WHR) variation and risk of overweight/obesity. The best multilocus association signal identified in the quilombos was further examined in an independent sample of 334 Brazilian subjects of European ancestry. In quilombos, only the PPARG polymorphism displayed significant individual effects (WHR variation, P = 0.028). No association was observed either with the risk of overweight/obesity (BMI ≥ 25 kg/m²), risk of obesity alone (BMI ≥ 30 kg/m²) or BMI variation. However, GMDR analyses revealed an interaction between the LEPR and ADRB2 polymorphisms (P = 0.009) as well as a third-order effect involving the latter two variants plus INSIG2 (P = 0.034) with overweight/obesity. Assessment of the LEPR-ADRB2 interaction in the second sample indicated a marginally significant association (P = 0.0724), which was further verified to be limited to men (P = 0.0118). Together, our findings suggest evidence for a two-locus interaction between the LEPR Gln223Arg and ADRB2 Arg16Gly variants in the risk of overweight/obesity, and highlight further the importance of multilocus effects in the genetic component of obesity.

  17. A multilocus evaluation of ermine (Mustela erminea) across the Holarctic, testing hypotheses of Pleistocene diversification in response to climate change

    USGS Publications Warehouse

    Dawson, Natalie G.; Hope, Andrew G.; Talbot, Sandra L.; Cook, Joseph A.

    2013-01-01

    Aim: We examined data for ermine (Mustela erminea) to test two sets of diversification hypotheses concerning the number and location of late Pleistocene refugia, the timing and mode of diversification, and the evolutionary influence of insularization. Location: Temperate and sub-Arctic Northern Hemisphere. Methods: We used up to two mitochondrial and four nuclear loci from 237 specimens for statistical phylogeographical and demographic analyses. Coalescent species-tree estimation used a Bayesian approach for clade divergence based on external mutation rate calibrations. Approximate Bayesian methods were used to assess population size, timing of divergence and gene flow. Results: Limited structure coupled with evidence of population growth across broad regions, including previously ice-covered areas, indicated expansion from multiple centres of differentiation, but high endemism along the North Pacific coast (NPC). A bifurcating model of diversification with recent growth spanning three glacial cycles best explained the empirical data. Main conclusions: A newly identified clade in North America indicated a fourth refugial area for ermine. The shallow coalescence of all extant ermine reflects a recent history of diversification overlying a deeper fossil record. Post-glacial colonization has led to potential contact zones for multiple lineages in north-western North America. A model of diversification of ermine accompanied by recent gene flow was marginally less well supported than a model of divergence of major clades in response to the most recent glacial cycles.

  18. Relationships between functional genes in Lactobacillus delbrueckii ssp. bulgaricus isolates and phenotypic characteristics associated with fermentation time and flavor production in yogurt elucidated using multilocus sequence typing.

    PubMed

    Liu, Wenjun; Yu, Jie; Sun, Zhihong; Song, Yuqin; Wang, Xueni; Wang, Hongmei; Wuren, Tuoya; Zha, Musu; Menghe, Bilige; Heping, Zhang

    2016-01-01

    Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is well known for its worldwide application in yogurt production. Flavor production and acid production are considered the most important characteristics for starter culture screening. To our knowledge this is the first study applying functional gene sequence multilocus sequence typing technology to predict the fermentation and flavor-producing characteristics of yogurt-producing bacteria. In the present study, phenotypic characteristics of 35 L. bulgaricus strains were quantified during the fermentation of milk to yogurt and during its subsequent storage; these included fermentation time, acidification rate, pH, titratable acidity, and flavor characteristics (acetaldehyde concentration). Furthermore, multilocus sequence typing analysis of 7 functional genes associated with fermentation time, acid production, and flavor formation was done to elucidate the phylogeny and genetic evolution of the same L. bulgaricus isolates. The results showed that strains significantly differed in fermentation time, acidification rate, and acetaldehyde production. Combining functional gene sequence analysis with phenotypic characteristics demonstrated that groups of strains established using genotype data were consistent with groups identified based on their phenotypic traits. This study has established an efficient and rapid molecular genotyping method to identify strains with good fermentation traits; this has the potential to replace time-consuming conventional methods based on direct measurement of phenotypic traits. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  19. Differentiation of Xylella fastidiosa Strains via Multilocus Sequence Analysis of Environmentally Mediated Genes (MLSA-E)

    PubMed Central

    Parker, Jennifer K.; Havird, Justin C.

    2012-01-01

    Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops. PMID:22194287

  20. Differentiation of Xylella fastidiosa strains via multilocus sequence analysis of environmentally mediated genes (MLSA-E).

    PubMed

    Parker, Jennifer K; Havird, Justin C; De La Fuente, Leonardo

    2012-03-01

    Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops.

  1. Philopatry drives genetic differentiation in an island archipelago: comparative population genetics of Galapagos Nazca boobies (Sula granti) and great frigatebirds (Fregata minor)

    PubMed Central

    Levin, Iris I; Parker, Patricia G

    2012-01-01

    Seabirds are considered highly mobile, able to fly great distances with few apparent barriers to dispersal. However, it is often the case that seabird populations exhibit strong population genetic structure despite their potential vagility. Here we show that Galapagos Nazca booby (Sula granti) populations are substantially differentiated, even within the small geographic scale of this archipelago. On the other hand, Galapagos great frigatebird (Fregata minor) populations do not show any genetic structure. We characterized the genetic differentiation by sampling five colonies of both species in the Galapagos archipelago and analyzing eight microsatellite loci and three mitochondrial genes. Using an F-statistic approach on the multilocus data, we found significant differentiation between nearly all island pairs of Nazca booby populations and a Bayesian clustering analysis provided support for three distinct genetic clusters. Mitochondrial DNA showed less differentiation of Nazca booby colonies; only Nazca boobies from the island of Darwin were significantly differentiated from individuals throughout the rest of the archipelago. Great frigatebird populations showed little to no evidence for genetic differentiation at the same scale. Only two island pairs (Darwin – Wolf, N. Seymour – Wolf) were significantly differentiated using the multilocus data, and only two island pairs had statistically significant φST values (N. Seymour – Darwin, N. Seymour – Wolf) according to the mitochondrial data. There was no significant pattern of isolation by distance for either species calculated using both markers. Seven of the ten Nazca booby migration rates calculated between island pairs were in the south or southeast to north or northwest direction. The population differentiation found among Galapagos Nazca booby colonies, but not great frigatebird colonies, is most likely due to differences in natal and breeding philopatry. PMID:23170212
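
    For readers unfamiliar with the F-statistic approach mentioned in this abstract, the sketch below computes the textbook quantity FST = (HT - HS) / HT for a single biallelic locus in equally weighted populations. The allele frequencies are hypothetical, and the study itself most likely used a sample-size-corrected multilocus estimator on microsatellite data, so this illustrates the concept only and does not reconstruct the analysis.

```python
def expected_heterozygosity(p):
    """Expected heterozygosity 2p(1 - p) at a biallelic locus."""
    return 2.0 * p * (1.0 - p)


def fst_biallelic(pop_freqs):
    """FST = (HT - HS) / HT for equally weighted populations.

    pop_freqs: frequency of one allele in each population (hypothetical input).
    """
    n = len(pop_freqs)
    # HS: mean within-population expected heterozygosity.
    h_s = sum(expected_heterozygosity(p) for p in pop_freqs) / n
    # HT: expected heterozygosity of the pooled population.
    p_bar = sum(pop_freqs) / n
    h_t = expected_heterozygosity(p_bar)
    if h_t == 0.0:
        return 0.0  # locus is fixed everywhere; no differentiation to measure
    return (h_t - h_s) / h_t


# Two hypothetical colonies with very different allele frequencies.
differentiated = fst_biallelic([0.2, 0.8])
# Two hypothetical colonies with identical allele frequencies.
undifferentiated = fst_biallelic([0.5, 0.5])

print(f"differentiated: {differentiated:.2f}")      # 0.36
print(f"undifferentiated: {undifferentiated:.2f}")  # 0.00
```

    Values near zero correspond to the pattern reported for great frigatebirds, while clearly positive values correspond to the structure reported among Nazca booby colonies.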

  2. Accurate determination of genetic identity for a single cacao bean, using molecular markers with a nanofluidic system, ensures cocoa authentication.

    PubMed

    Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng

    2014-01-15

    Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principal coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.

  3. Delimiting cryptic pathogen species causing apple Valsa canker with multilocus data

    PubMed Central

    Wang, Xuli; Zang, Rui; Yin, Zhiyuan; Kang, Zhensheng; Huang, Lili

    2014-01-01

    Fungal diseases pose tremendous threats to the global economy and food safety. Among them, Valsa canker, caused by fungi of Valsa and their Cytospora anamorphs, has been a serious threat to fruit and forest trees and is one of the most destructive diseases of apple, particularly in East Asia. Accurate and robust delimitation of pathogen species is not only essential for the development of effective disease control programs but will also advance our understanding of the emergence of plant diseases. However, species delimitation is especially difficult in Valsa because of the high variability of morphological traits and, in many cases, the lack of the teleomorph. In this study, we delimited species boundaries for pathogens causing apple Valsa canker with a multifaceted approach. Based on three independent loci, the internal transcribed spacer (ITS), β-tubulin (Btu), and translation elongation factor-1 alpha (EF1α), we inferred gene trees with both maximum likelihood and Bayesian methods, estimated the species tree with Bayesian multispecies coalescent approaches, and validated the species tree with Bayesian species delimitation. Through divergence time estimation and ancestral host reconstruction, we tested the possible underlying mechanisms for fungal speciation and host-range change. Our results proved that two varieties of the former morphological species V. mali represented two distinct species, V. mali and V. pyri, which diverged about 5 million years ago, much later than the divergence of their preferred hosts, excluding a scenario of fungi–host co-speciation. The markedly different thermal preferences and contrasting pathogenicity in cross-inoculation suggest ecological divergence between the two species. Apple was the most likely ancestral host for both V. mali and V. pyri. Host-range expansion led to the occurrence of V. pyri on both pear and apple. Our results also represent an example in which ITS data might underestimate species diversity. PMID:24834333

  4. Multi-locus mixed model analysis of stem rust resistance in a worldwide collection of winter wheat

    USDA-ARS's Scientific Manuscript database

    Genome-wide association mapping is a powerful tool for dissecting the relationship between phenotypes and genetic variants in diverse populations. With improved cost efficiency of high-throughput genotyping platforms, association mapping is a desirable method to mine populations for favorable allele...

  5. Clonal Population Structure of Pseudomonas stutzeri, a Species with Exceptional Genetic Diversity

    PubMed Central

    Rius, Núria; Fusté, M. Carme; Guasp, Caterina; Lalucat, Jorge; Lorén, José G.

    2001-01-01

    Genetic diversity and genetic relationships among 42 Pseudomonas stutzeri strains belonging to several genomovars and isolated from different sources were investigated in an examination of 20 metabolic enzymes by multilocus enzyme electrophoresis analysis. Forty-two distinct allele profiles were identified, indicating that all multilocus genotypes were represented by a single strain. All 20 loci were exceptionally polymorphic, with an average of 15.9 alleles per locus. To the best of our knowledge, this P. stutzeri sample exhibited the highest mean genetic diversity (H = 0.876) found to date in all bacterial species studied by multilocus enzyme electrophoresis. A high frequency of occurrence of null alleles was identified. The index of association (IA) for the P. stutzeri strains analyzed was 1.10. The IA values were always significantly different from zero for all subgroups studied, including clinical and environmental isolates and strains classified as genomovar 1. These results suggest that the population structure of P. stutzeri is strongly clonal, indicating that there is no significant level of assortative recombination that might destroy linkage disequilibrium. PMID:11133969
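
    The mean genetic diversity H quoted in this abstract is, in multilocus enzyme electrophoresis studies, usually the average over loci of a per-locus gene diversity h = (n / (n - 1)) (1 - Σ x_i²), where x_i is the frequency of allele i and n is the number of strains. The sketch below assumes that estimator, since the abstract does not say which one was used, and the allele labels are hypothetical.

```python
from collections import Counter


def gene_diversity(alleles):
    """Per-locus gene diversity h = n/(n-1) * (1 - sum of squared allele frequencies).

    alleles: one allele label per strain at a single locus.
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two strains")
    freqs = [count / n for count in Counter(alleles).values()]
    return (n / (n - 1)) * (1.0 - sum(x * x for x in freqs))


def mean_gene_diversity(loci):
    """Mean of the per-locus diversities (H)."""
    return sum(gene_diversity(a) for a in loci) / len(loci)


# Hypothetical data for four strains typed at three loci.
loci = [
    ["a1", "a2", "a3", "a4"],  # every strain carries a different allele: h = 1
    ["b1", "b1", "b2", "b2"],  # two alleles at equal frequency: h = 2/3
    ["c1", "c1", "c1", "c1"],  # monomorphic locus: h = 0
]

for alleles in loci:
    print(f"h = {gene_diversity(alleles):.3f}")
print(f"H = {mean_gene_diversity(loci):.3f}")
```

    Under this estimator, H approaches 1 only when most loci carry many alleles at similar frequencies, which is consistent with the average of 15.9 alleles per locus reported for these 42 strains.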

  6. Imbalanced presence of Borrelia burgdorferi s.l. multilocus sequence types in clinical manifestations of Lyme borreliosis.

    PubMed

    Coipan, E Claudia; Jahfari, Setareh; Fonville, Manoj; Oei, G Anneke; Spanjaard, Lodewijk; Takumi, Katsuhisa; Hovius, Joppe W R; Sprong, Hein

    2016-08-01

    In this study we used typing based on the eight multilocus sequence typing scheme housekeeping genes (MLST) and 5S-23S rDNA intergenic spacer (IGS) to explore the population structure of Borrelia burgdorferi sensu lato isolates from patients with Lyme borreliosis (LB) and to test the association between the B. burgdorferi s.l. sequence types (ST) and the clinical manifestations they cause in humans. Isolates of B. burgdorferi from 183 LB cases across Europe, with distinct clinical manifestations, and 257 Ixodes ricinus lysates from The Netherlands, were analyzed for this study alone. For completeness, we also incorporated in our analysis 335 European B. burgdorferi s.l. MLST profiles retrieved from literature. Borrelia afzelii and Borrelia bavariensis were associated with human cases of LB while Borrelia garinii, Borrelia lusitaniae and Borrelia valaisiana were associated with questing I. ricinus ticks. B. afzelii was associated with acrodermatitis chronica atrophicans, while B. garinii and B. bavariensis were associated with neuroborreliosis. The samples in our study belonged to 251 different STs, of which 94 are newly described, adding to the overall picture of the genetic diversity of Borrelia genospecies. The fraction of STs that were isolated from human samples was significantly higher for the genospecies that are known to be maintained in enzootic cycles by mammals (B. afzelii, B. bavariensis, and Borrelia spielmanii) than for genospecies that are maintained by birds (B. garinii and B. valaisiana) or lizards (B. lusitaniae). We found six multilocus sequence types that were significantly associated with clinical manifestations in humans and five IGS haplotypes that were associated with the human LB cases. While IGS could perform just as well as the housekeeping genes in the MLST scheme for predicting the infectivity of B. burgdorferi s.l., the advantage of MLST is that it can also capture the differential invasiveness of the various STs. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  7. A Two-Step Bayesian Approach for Propensity Score Analysis: Simulations and Case Study

    ERIC Educational Resources Information Center

    Kaplan, David; Chen, Jianshen

    2012-01-01

    A two-step Bayesian propensity score approach is introduced that incorporates prior information in the propensity score equation and outcome equation without the problems associated with simultaneous Bayesian propensity score approaches. The corresponding variance estimators are also provided. The two-step Bayesian propensity score is provided for…

  8. Three Divergent Subpopulations of the Malaria Parasite Plasmodium knowlesi

    PubMed Central

    Lin, Lee C.; Rovie-Ryan, Jeffrine J.; Kadir, Khamisah A.; Anderios, Fread; Hisam, Shamilah; Sharma, Reuben S.K.; Singh, Balbir; Conway, David J.

    2017-01-01

    Multilocus microsatellite genotyping of Plasmodium knowlesi isolates previously indicated 2 divergent parasite subpopulations in humans on the island of Borneo, each associated with a different macaque reservoir host species. Geographic divergence was also apparent, and independent sequence data have indicated particularly deep divergence between parasites from mainland Southeast Asia and Borneo. To resolve the overall population structure, multilocus microsatellite genotyping was conducted on a new sample of 182 P. knowlesi infections (obtained from 134 humans and 48 wild macaques) from diverse areas of Malaysia, first analyzed separately and then in combination with previous data. All analyses confirmed 2 divergent clusters of human cases in Malaysian Borneo, associated with long-tailed macaques and pig-tailed macaques, and a third cluster in humans and most macaques in peninsular Malaysia. High levels of pairwise divergence between each of these sympatric and allopatric subpopulations have implications for the epidemiology and control of this zoonotic species. PMID:28322705

  9. Assessment of the value of a genetic risk score in improving the estimation of coronary risk

    USDA-ARS's Scientific Manuscript database

    The American Heart Association has established criteria for the evaluation of novel markers of cardiovascular risk. In accordance with these criteria, we assessed the association between a multi-locus genetic risk score (GRS) and incident coronary heart disease (CHD), and evaluated whether this GRS ...

  10. Deep phylogeographic divergence and cytonuclear discordance in the grasshopper Oedaleus decorus.

    PubMed

    Kindler, Eveline; Arlettaz, Raphaël; Heckel, Gerald

    2012-11-01

    The grasshopper Oedaleus decorus is a thermophilic insect with a large, mostly south-Palaearctic distribution range, stretching from the Mediterranean regions in Europe to Central Asia and China. In this study, we analyzed the extent of phylogenetic divergence and the recent evolutionary history of the species based on 274 specimens from 26 localities across the distribution range in Europe. Phylogenetic relationships were determined using sequences of two mitochondrial loci (ctr, ND2) with neighbour-joining and Bayesian methods. Additionally, genetic differentiation was analyzed based on mitochondrial DNA and 11 microsatellite markers using F-statistics, model-free multivariate and model-based Bayesian clustering approaches. Phylogenetic analyses consistently detected two highly divergent, allopatrically distributed lineages within O. decorus. The divergence between these Western and Eastern lineages, which meet in the region of the Alps, was similar to the divergence of each lineage from the sister species O. asiaticus. Genetic differentiation for ctr was extremely high between Western and Eastern grasshopper populations (F(ct)=0.95). Microsatellite markers detected much lower but nevertheless highly significant genetic structure among population samples. The nuclear data also demonstrated a case of cytonuclear discordance because the affiliation with mitochondrial lineages was incongruent in Northern Italy. Taken together, these results provide evidence of an ancient separation within Oedaleus and either historical introgression of mtDNA among lineages and/or ongoing sex-specific gene flow in this grasshopper. Our study stresses the importance of multilocus approaches for unravelling the history and status of taxa of uncertain evolutionary divergence. Copyright © 2012 Elsevier Inc. All rights reserved.

  11. Spatial variation in anthropogenic mortality induces a source-sink system in a hunted mesopredator.

    PubMed

    Minnie, Liaan; Zalewski, Andrzej; Zalewska, Hanna; Kerley, Graham I H

    2018-04-01

    Lethal carnivore management is a prevailing strategy to reduce livestock predation. Intensity of lethal management varies according to land-use, where carnivores are more intensively hunted on farms relative to reserves. Variations in hunting intensity may result in the formation of a source-sink system where carnivores disperse from high-density to low-density areas. Few studies quantify dispersal between supposed sources and sinks, a fundamental requirement for source-sink systems. We used the black-backed jackal (Canis mesomelas) as a model to determine if heterogeneous anthropogenic mortality induces a source-sink system. We analysed 12 microsatellite loci from 554 individuals from lightly hunted and previously unhunted reserves, as well as heavily hunted livestock- and game farms. Bayesian genotype assignment showed that jackal populations displayed a hierarchical population structure. We identified two genetically distinct populations at the regional level and nine distinct subpopulations at the local level, with each cluster corresponding to distinct land-use types separated by various dispersal barriers. Migration, estimated using Bayesian multilocus genotyping, between reserves and farms was asymmetric and heterogeneous anthropogenic mortality induced source-sink dynamics via compensatory immigration. Additionally, some heavily hunted populations also acted as source populations, exporting individuals to other heavily hunted populations. This indicates that heterogeneous anthropogenic mortality results in the formation of a complex series of interconnected sources and sinks. Thus, lethal management of mesopredators may not be an effective long-term strategy in reducing livestock predation, as dispersal and, more importantly, compensatory immigration may continue to affect population reduction efforts as long as dispersal from other areas persists.

  12. Evolutionary History of Wild Barley (Hordeum vulgare subsp. spontaneum) Analyzed Using Multilocus Sequence Data and Paleodistribution Modeling

    PubMed Central

    Jakob, Sabine S.; Rödder, Dennis; Engler, Jan O.; Shaaf, Salar; Özkan, Hakan; Blattner, Frank R.; Kilian, Benjamin

    2014-01-01

    Studies of Hordeum vulgare subsp. spontaneum, the wild progenitor of cultivated barley, have mostly relied on materials collected decades ago and maintained since then ex situ in germplasm repositories. We analyzed spatial genetic variation in wild barley populations collected rather recently, exploring sequence variations at seven single-copy nuclear loci, and inferred the relationships among these populations and toward the genepool of the crop. The wild barley collection covers the whole natural distribution area from the Mediterranean to Middle Asia. In contrast to earlier studies, Bayesian assignment analyses revealed three population clusters, in the Levant, Turkey, and east of Turkey, respectively. Genetic diversity was exceptionally high in the Levant, while eastern populations were depleted of private alleles. Species distribution modeling based on climate parameters and extant occurrence points of the taxon inferred suitable habitat conditions during the ice-age, particularly in the Levant and Turkey. Together with the ecologically wide range of habitats, they might contribute to structured but long-term stable populations in this region and their high genetic diversity. For recently collected individuals, Bayesian assignment to geographic clusters was generally unambiguous, but materials from genebanks often showed accessions that were not placed according to their assumed geographic origin or showed traces of introgression from cultivated barley. We assign this to gene flow among accessions during ex situ maintenance. Evolutionary studies based on such materials might therefore result in wrong conclusions regarding the history of the species or the origin and mode of domestication of the crop, depending on the accessions included. PMID:24586028

  13. Development and application of a multilocus sequence analysis method for the identification of genotypes within genus Bradyrhizobium and for establishing nodule occupancy of soybean (Glycine max L. Merr)

    USDA-ARS's Scientific Manuscript database

    A Multilocus Sequence Typing (MLST) method based on allelic variation of 7 chromosomal loci was developed for characterizing genotypes within the genus Bradyrhizobium. With the method 29 distinct multilocus genotypes (GTs) were identified among 191 culture collection soybean strains. The occupancy ...

  14. Application of a data-mining method based on Bayesian networks to lesion-deficit analysis

    NASA Technical Reports Server (NTRS)

    Herskovits, Edward H.; Gerring, Joan P.

    2003-01-01

    Although lesion-deficit analysis (LDA) has provided extensive information about structure-function associations in the human brain, LDA has suffered from the difficulties inherent to the analysis of spatial data, i.e., there are many more variables than subjects, and data may be difficult to model using standard distributions, such as the normal distribution. We herein describe a Bayesian method for LDA; this method is based on data-mining techniques that employ Bayesian networks to represent structure-function associations. These methods are computationally tractable, and can represent complex, nonlinear structure-function associations. When applied to the evaluation of data obtained from a study of the psychiatric sequelae of traumatic brain injury in children, this method generates a Bayesian network that demonstrates complex, nonlinear associations among lesions in the left caudate, right globus pallidus, right side of the corpus callosum, right caudate, and left thalamus, and subsequent development of attention-deficit hyperactivity disorder, confirming and extending our previous statistical analysis of these data. Furthermore, analysis of simulated data indicates that methods based on Bayesian networks may be more sensitive and specific for detecting associations among categorical variables than methods based on chi-square and Fisher exact statistics.

  15. Multilocus Sequence Typing of Bartonella henselae in the United Kingdom Indicates that Only a Few, Uncommon Sequence Types Are Associated with Zoonotic Disease

    PubMed Central

    Chaloner, Gemma L.; Harrison, Timothy G.; Coyne, Karen P.; Aanensen, David M.; Birtles, Richard J.

    2011-01-01

    Bartonella henselae is one of the most common zoonotic agents acquired from companion animals (cats) in industrialized countries. Nonetheless, although the prevalence of infections in cats is high, the number of human cases reported is relatively low. One hypothesis for this discrepancy is that B. henselae strains vary in their zoonotic potential. To test this hypothesis, we employed structured sampling to explore the population structure of B. henselae in the United Kingdom and to determine the distribution of strains associated with zoonotic disease within this structure. A total of 118 B. henselae strains were delineated into 12 sequence types (STs) using multilocus sequence typing. We observed that most (85%) of the zoonosis-associated strains belonged to only three genotypes, i.e., ST2, ST5, and ST8. Conversely, most (74%) of the feline isolates belonged to ST4, ST6, and ST7. The difference in host association of ST2, ST5, and ST8 (zoonosis associated) and ST6 (feline) was statistically significant (P < 0.05), indicating that a few, uncommon STs were responsible for the majority of symptomatic human infections. PMID:21471345

  16. Colletotrichum gloeosporioides s.l. associated with Theobroma cacao and other plants in Panama: multilocus phylogenies distinguish host-associated pathogens from asymptomatic endophytes

    USDA-ARS's Scientific Manuscript database

    Species of Colletotrichum interact with a vast but as yet undetermined number of plant species as pathogens and as asymptomatic endophytes. It is not known, however, whether these contrasting ecological modes are optional strategies exercised by individual species or whether species ecology is more ...

  17. Do Staphylococcus epidermidis Genetic Clusters Predict Isolation Sources?

    PubMed Central

    Tolo, Isaiah; Thomas, Jonathan C.; Fischer, Rebecca S. B.; Brown, Eric L.; Gray, Barry M.

    2016-01-01

    Staphylococcus epidermidis is a ubiquitous colonizer of human skin and a common cause of medical device-associated infections. The extent to which the population genetic structure of S. epidermidis distinguishes commensal from pathogenic isolates is unclear. Previously, Bayesian clustering of 437 multilocus sequence types (STs) in the international database revealed a population structure of six genetic clusters (GCs) that may reflect the species' ecology. Here, we first verified the presence of six GCs, including two (GC3 and GC5) with significant admixture, in an updated database of 578 STs. Next, a single nucleotide polymorphism (SNP) assay was developed that accurately assigned 545 (94%) of 578 STs to GCs. Finally, the hypothesis that GCs could distinguish isolation sources was tested by SNP typing and GC assignment of 154 isolates from hospital patients with bacteremia and those with blood culture contaminants and from nonhospital carriage. GC5 was isolated almost exclusively from hospital sources. GC1 and GC6 were isolated from all sources but were overrepresented in isolates from nonhospital and infection sources, respectively. GC2, GC3, and GC4 were relatively rare in this collection. No association was detected between fdh-positive isolates (GC2 and GC4) and nonhospital sources. Using a machine learning algorithm, GCs predicted hospital and nonhospital sources with 80% accuracy and predicted infection and contaminant sources with 45% accuracy, which was comparable to the results seen with a combination of five genetic markers (icaA, IS256, sesD [bhp], mecA, and arginine catabolic mobile element [ACME]). Thus, analysis of population structure with subgenomic data shows the distinction of hospital and nonhospital sources and the near-inseparability of sources within a hospital. PMID:27076664

  18. Bayesian Analysis of the Association between Family-Level Factors and Siblings' Dental Caries.

    PubMed

    Wen, A; Weyant, R J; McNeil, D W; Crout, R J; Neiswanger, K; Marazita, M L; Foxman, B

    2017-07-01

    We conducted a Bayesian analysis of the association between family-level socioeconomic status and smoking and the prevalence of dental caries among siblings (children from infant to 14 y) among children living in rural and urban Northern Appalachia using data from the Center for Oral Health Research in Appalachia (COHRA). The observed proportion of siblings sharing caries was significantly different from predicted assuming siblings' caries status was independent. Using a Bayesian hierarchical model, we found the inclusion of a household factor significantly improved the goodness of fit. Other findings showed an inverse association between parental education and siblings' caries and a positive association between households with smokers and siblings' caries. Our study strengthens existing evidence suggesting that increased parental education and decreased parental cigarette smoking are associated with reduced childhood caries in the household. Our results also demonstrate the value of a Bayesian approach, which allows us to include household as a random effect, thereby providing more accurate estimates than obtained using generalized linear mixed models.

  19. Ideal discrimination of discrete clinical endpoints using multilocus genotypes.

    PubMed

    Hahn, Lance W; Moore, Jason H

    2004-01-01

    Multifactor Dimensionality Reduction (MDR) is a method for the classification and prediction of discrete clinical endpoints using attributes constructed from multilocus genotype data. Empirical studies with both real and simulated data suggest that MDR has good power for detecting gene-gene interactions in the absence of independent main effects. The purpose of this study is to develop an objective, theory-driven approach to evaluate the strengths and limitations of MDR. To accomplish this goal, we borrow concepts from ideal observer analysis used in visual perception to evaluate the theoretical limits of classifying and predicting discrete clinical endpoints using multilocus genotype data. We conclude that MDR ideally discriminates between low risk and high risk subjects using attributes constructed from multilocus genotype data. We also show that the classification approach used once a multilocus attribute is constructed is similar to that of a naive Bayes classifier. This study provides a theoretical foundation for the continued development, evaluation, and application of the MDR as a data mining tool in the domain of statistical genetics and genetic epidemiology.
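
    The attribute construction that the abstract refers to can be illustrated with a minimal sketch. It assumes the commonly described MDR pooling rule, in which each multilocus genotype combination is labelled high risk when its case:control ratio is at least the overall case:control ratio. The function names `mdr_fit` and `mdr_predict` and the toy data are hypothetical, and the cross-validation and model-selection steps of a full MDR analysis are left out.

```python
from collections import defaultdict


def mdr_fit(genotypes, labels):
    """Return the set of multilocus genotypes pooled into the high-risk group.

    genotypes: list of tuples, one multilocus genotype per subject
    labels:    list of ints, 1 = case, 0 = control
    """
    n_cases = sum(labels)
    n_controls = len(labels) - n_cases

    # Count controls and cases observed for each genotype combination.
    counts = defaultdict(lambda: [0, 0])  # genotype -> [controls, cases]
    for genotype, label in zip(genotypes, labels):
        counts[genotype][label] += 1

    high_risk = set()
    for genotype, (controls, cases) in counts.items():
        # cases/controls >= n_cases/n_controls, cross-multiplied so that
        # cells with zero controls do not cause a division by zero.
        if cases * n_controls >= controls * n_cases:
            high_risk.add(genotype)
    return high_risk


def mdr_predict(high_risk, genotypes):
    """Classify subjects: 1 if the genotype is in the high-risk pool, else 0.

    Genotype combinations never seen during fitting default to low risk.
    """
    return [1 if genotype in high_risk else 0 for genotype in genotypes]


# Hypothetical two-locus data with a pure interaction (no main effects):
# subjects are cases only when the two loci differ.
toy_genotypes = [(0, 0), (0, 0), (0, 1), (0, 1), (1, 0), (1, 0), (1, 1), (1, 1)]
toy_labels = [0, 0, 1, 1, 1, 1, 0, 0]
toy_high_risk = mdr_fit(toy_genotypes, toy_labels)
```

    In this toy data neither locus is informative on its own, yet the pooled two-locus attribute separates cases from controls, which is the kind of interaction without main effects that the abstract mentions.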

  20. Population Structure and History in Developing Core Sets in Wild Germplasm

    USDA-ARS's Scientific Manuscript database

    Accurate inference of genetic discontinuities between populations is an essential component in studies of intraspecific biodiversity and evolution, as well as associative genetics. Multi-locus genotypes were amplified from 949 individuals representing seedling trees from 88 half-sib families from ei...

  2. mec-associated dru typing in the epidemiological analysis of ST239 MRSA in Malaysia.

    PubMed

    Ghaznavi-Rad, E; Goering, R V; Nor Shamsudin, M; Weng, P L; Sekawi, Z; Tavakol, M; van Belkum, A; Neela, V

    2011-11-01

    The usefulness of mec-associated dru typing in the epidemiological analysis of methicillin-resistant Staphylococcus aureus (MRSA) isolated in Malaysia was investigated and compared with pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and spa and SCCmec typing. The isolates studied included all MRSA types in Malaysia. Multilocus sequence type ST188 and ST1 isolates were highly clonal by all typing methods. However, the dru typing of ST239 isolates produced the clearest discrimination between SCCmec IIIa and III isolates, yielding more subtypes than any other method. Evaluation of the discriminatory power for each method identified dru typing and PFGE as the most discriminatory, with Simpson's index of diversity (SID) values over 89%, including an isolate which was non-typeable by spa, but dru-typed as dt13j. The discriminatory ability of dru typing, especially with closely related MRSA ST239 strains (e.g., Brazilian and Hungarian), underscores its utility as a tool for the epidemiological investigation of MRSA.

  3. Zinc Resistance within Swine-Associated Methicillin-Resistant Staphylococcus aureus Isolates in the United States Is Associated with Multilocus Sequence Type Lineage

    PubMed Central

    Hau, Samantha J.; Frana, Timothy; Sun, Jisun; Davies, Peter R.

    2017-01-01

    Zinc resistance in livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) sequence type 398 (ST398) is primarily mediated by the czrC gene colocated with the mecA gene, encoding methicillin resistance, within the type V staphylococcal cassette chromosome mec (SCCmec) element. Because czrC and mecA are located within the same mobile genetic element, it has been suggested that the use of zinc in feed as an antidiarrheal agent has the potential to contribute to the emergence and spread of methicillin-resistant S. aureus (MRSA) in swine, through increased selection pressure to maintain the SCCmec element in isolates obtained from pigs. In this study, we report the prevalence of the czrC gene and phenotypic zinc resistance in U.S. swine-associated LA-MRSA ST5 isolates, MRSA ST5 isolates from humans with no swine contact, and U.S. swine-associated LA-MRSA ST398 isolates. We demonstrated that the prevalence of zinc resistance in U.S. swine-associated LA-MRSA ST5 isolates was significantly lower than the prevalence of zinc resistance in MRSA ST5 isolates from humans with no swine contact and swine-associated LA-MRSA ST398 isolates, as well as prevalences from previous reports describing zinc resistance in other LA-MRSA ST398 isolates. Collectively, our data suggest that selection pressure associated with zinc supplementation in feed is unlikely to have played a significant role in the emergence of LA-MRSA ST5 in the U.S. swine population. Additionally, our data indicate that zinc resistance is associated with the multilocus sequence type lineage, suggesting a potential link between the genetic lineage and the carriage of resistance determinants. IMPORTANCE: Our data suggest that coselection thought to be associated with the use of zinc in feed as an antimicrobial agent is not playing a role in the emergence of livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) ST5 in the U.S. swine population. Additionally, our data indicate that zinc resistance is more associated with the multilocus sequence type lineage, suggesting a potential link between the genetic lineage and the carriage of resistance markers. This information is important for public health professionals, veterinarians, producers, and consumers. PMID:28526788

  4. Zinc Resistance within Swine-Associated Methicillin-Resistant Staphylococcus aureus Isolates in the United States Is Associated with Multilocus Sequence Type Lineage.

    PubMed

    Hau, Samantha J; Frana, Timothy; Sun, Jisun; Davies, Peter R; Nicholson, Tracy L

    2017-08-01

    Zinc resistance in livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) sequence type 398 (ST398) is primarily mediated by the czrC gene colocated with the mecA gene, encoding methicillin resistance, within the type V staphylococcal cassette chromosome mec (SCCmec) element. Because czrC and mecA are located within the same mobile genetic element, it has been suggested that the use of zinc in feed as an antidiarrheal agent has the potential to contribute to the emergence and spread of methicillin-resistant S. aureus (MRSA) in swine, through increased selection pressure to maintain the SCCmec element in isolates obtained from pigs. In this study, we report the prevalence of the czrC gene and phenotypic zinc resistance in U.S. swine-associated LA-MRSA ST5 isolates, MRSA ST5 isolates from humans with no swine contact, and U.S. swine-associated LA-MRSA ST398 isolates. We demonstrated that the prevalence of zinc resistance in U.S. swine-associated LA-MRSA ST5 isolates was significantly lower than the prevalence of zinc resistance in MRSA ST5 isolates from humans with no swine contact and swine-associated LA-MRSA ST398 isolates, as well as prevalences from previous reports describing zinc resistance in other LA-MRSA ST398 isolates. Collectively, our data suggest that selection pressure associated with zinc supplementation in feed is unlikely to have played a significant role in the emergence of LA-MRSA ST5 in the U.S. swine population. Additionally, our data indicate that zinc resistance is associated with the multilocus sequence type lineage, suggesting a potential link between the genetic lineage and the carriage of resistance determinants. IMPORTANCE: Our data suggest that coselection thought to be associated with the use of zinc in feed as an antimicrobial agent is not playing a role in the emergence of livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) ST5 in the U.S. swine population. Additionally, our data indicate that zinc resistance is more associated with the multilocus sequence type lineage, suggesting a potential link between the genetic lineage and the carriage of resistance markers. This information is important for public health professionals, veterinarians, producers, and consumers. Copyright © 2017 American Society for Microbiology.

  5. Population structure of Lactobacillus helveticus isolates from naturally fermented dairy products based on multilocus sequence typing.

    PubMed

    Sun, Zhihong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Yu, Jie; Bilige, Menghe; Zhang, Heping; Chen, Yongfu

    2015-05-01

    Lactobacillus helveticus is an economically important lactic acid bacterium used in industrial dairy fermentation. In the present study, the population structure of 245 isolates of L. helveticus from different naturally fermented dairy products in China and Mongolia was investigated using a multilocus sequence typing scheme with 11 housekeeping genes. A total of 108 sequence types were detected, which formed 8 clonal complexes and 27 singletons. Results from Structure, SplitsTree, and ClonalFrame software analyses demonstrated the presence of 3 subpopulations in the L. helveticus isolates used in our study, namely koumiss, kurut-tarag, and panmictic lineages. Most L. helveticus isolates from particular ecological origins had specific population structures. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  6. Multilocus Sequence Typing Has Better Discriminatory Ability for Typing Vibrio cholerae than Does Pulsed-Field Gel Electrophoresis and Provides a Measure of Phylogenetic Relatedness

    PubMed Central

    Kotetishvili, Mamuka; Stine, O. Colin; Chen, Yuansha; Kreger, Arnold; Sulakvelidze, Alexander; Sozhamannan, Shanmuga; Morris, Jr., J. Glenn

    2003-01-01

    Twenty-two Vibrio cholerae isolates, including some from “epidemic” (O1 and O139) and “nonepidemic” serogroups, were characterized by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) by using three housekeeping genes, gyrB, pgm, and recA; sequence data were also obtained for the virulence-associated genes tcpA, ctxA, and ctxB. Even with the small number of loci used, MLST had better discriminatory ability than did PFGE. On MLST analysis, there was clear clustering of epidemic serogroups; much greater diversity was seen among tcpA- and ctxAB-positive V. cholerae strains from other, nonepidemic serogroups, with a number of tcpA and ctxAB alleles identified. PMID:12734277

  7. Multi-locus phylogeny of dolphins in the subfamily Lissodelphininae: character synergy improves phylogenetic resolution

    PubMed Central

    Harlin-Cognato, April D; Honeycutt, Rodney L

    2006-01-01

    Background: Dolphins of the genus Lagenorhynchus are anti-tropically distributed in temperate to cool waters. Phylogenetic analyses of cytochrome b sequences have suggested that the genus is polyphyletic; however, many relationships were poorly resolved. In this study, we present a combined-analysis phylogenetic hypothesis for Lagenorhynchus and members of the subfamily Lissodelphininae, which is derived from two nuclear and two mitochondrial data sets and the addition of 34 individuals representing 9 species. In addition, we characterize with parsimony and Bayesian analyses the phylogenetic utility and interaction of characters with statistical measures, including the utility of highly consistent (non-homoplasious) characters as a conservative measure of phylogenetic robustness. We also explore the effects of removing sources of character conflict on phylogenetic resolution. Results: Overall, our study provides strong support for the monophyly of the subfamily Lissodelphininae and the polyphyly of the genus Lagenorhynchus. In addition, the simultaneous parsimony analysis resolved and/or improved resolution for 12 nodes including: (1) L. albirostris, L. acutus; (2) L. obscurus and L. obliquidens; and (3) L. cruciger and L. australis. In addition, the Bayesian analysis supported the monophyly of the Cephalorhynchus, and resolved ambiguities regarding the relationship of L. australis/L. cruciger to other members of the genus Lagenorhynchus. The frequency of highly consistent characters varied among data partitions, but the rate of evolution was consistent within data partitions. Although the control region was the greatest source of character conflict, removal of this data partition impeded phylogenetic resolution. Conclusion: The simultaneous analysis approach produced a more robust phylogenetic hypothesis for Lagenorhynchus than previous studies, thus supporting a phylogenetic approach employing multiple data partitions that vary in overall rate of evolution. Even in cases where there was apparent conflict among characters, our data suggest a synergistic interaction in the simultaneous analysis, and speak against a priori exclusion of data because of potential conflicts, primarily because phylogenetic results can be less robust. For example, the removal of the control region, the putative source of character conflict, produced spurious results with inconsistencies among and within topologies from parsimony and Bayesian analyses. PMID:17078887

  8. Genetic Predisposition to Ischemic Stroke

    PubMed Central

    Kamatani, Yoichiro; Takahashi, Atsushi; Hata, Jun; Furukawa, Ryohei; Shiwa, Yuh; Yamaji, Taiki; Hara, Megumi; Tanno, Kozo; Ohmomo, Hideki; Ono, Kanako; Takashima, Naoyuki; Matsuda, Koichi; Wakai, Kenji; Sawada, Norie; Iwasaki, Motoki; Yamagishi, Kazumasa; Ago, Tetsuro; Ninomiya, Toshiharu; Fukushima, Akimune; Hozawa, Atsushi; Minegishi, Naoko; Satoh, Mamoru; Endo, Ryujin; Sasaki, Makoto; Sakata, Kiyomi; Kobayashi, Seiichiro; Ogasawara, Kuniaki; Nakamura, Motoyuki; Hitomi, Jiro; Kita, Yoshikuni; Tanaka, Keitaro; Iso, Hiroyasu; Kitazono, Takanari; Kubo, Michiaki; Tanaka, Hideo; Tsugane, Shoichiro; Kiyohara, Yutaka; Yamamoto, Masayuki; Sobue, Kenji; Shimizu, Atsushi

    2017-01-01

    Background and Purpose: The prediction of genetic predispositions to ischemic stroke (IS) may allow the identification of individuals at elevated risk and thereby prevent IS in clinical practice. Previously developed weighted multilocus genetic risk scores showed limited predictive ability for IS. Here, we investigated the predictive ability of a newer method, polygenic risk score (polyGRS), based on the idea that a few strong signals, as well as several weaker signals, can be collectively informative to determine IS risk. Methods: We genotyped 13 214 Japanese individuals with IS and 26 470 controls (derivation samples) and generated both multilocus genetic risk scores and polyGRS, using the same derivation data set. The predictive abilities of each scoring system were then assessed using 2 independent sets of Japanese samples (KyushuU and JPJM data sets). Results: In both validation data sets, polyGRS was shown to be significantly associated with IS, but weighted multilocus genetic risk scores were not. Comparing the highest with the lowest polyGRS quintile, the odds ratios for IS were 1.75 (95% confidence interval, 1.33–2.31) and 1.99 (95% confidence interval, 1.19–3.33) in the KyushuU and JPJM samples, respectively. Using the KyushuU samples, the addition of polyGRS to a nongenetic risk model resulted in a significant improvement of the predictive ability (net reclassification improvement=0.151; P<0.001). Conclusions: The polyGRS was shown to be superior to weighted multilocus genetic risk scores as an IS prediction model. Thus, together with the nongenetic risk factors, polyGRS will provide valuable information for individual risk assessment and management of modifiable risk factors. PMID:28034966
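
    As an illustration of how an odds ratio with a 95% confidence interval, such as those quoted for the highest versus lowest polyGRS quintile, can be obtained from a 2x2 table, here is a minimal sketch using the standard log odds ratio (Woolf) interval. The abstract does not say how its estimates were computed (they may well come from an adjusted regression model), so the function name `odds_ratio_ci` and the counts below are purely hypothetical.

```python
import math


def odds_ratio_ci(exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls, z=1.96):
    """Odds ratio and Wald-type confidence interval from a 2x2 table.

    "Exposed" here would mean, for example, the highest score quintile and
    "unexposed" the lowest. z = 1.96 gives an approximate 95% interval.
    """
    a, b, c, d = exposed_cases, exposed_controls, unexposed_cases, unexposed_controls
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")

    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual large-sample approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from the study.
example_or, example_lower, example_upper = odds_ratio_ci(20, 10, 10, 20)
```

    An interval that excludes 1, as both intervals reported in the abstract do, corresponds to an association that is significant at roughly the 5% level under this approximation.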

  9. Multilocus sequence typing of Chlamydia trachomatis among men who have sex with men reveals cocirculating strains not associated with specific subpopulations.

    PubMed

    Bom, Reinier J M; Matser, Amy; Bruisten, Sylvia M; van Rooijen, Martijn S; Heijman, Titia; Morré, Servaas A; de Vries, Henry J C; Schim van der Loeff, Maarten F

    2013-09-01

    Previous studies identified specific Chlamydia trachomatis strains circulating among men who have sex with men (MSM). This study investigates whether distinct C. trachomatis strains circulate among subpopulations within the MSM community. Participants were recruited at the sexually transmitted infection clinic of the Public Health Service of Amsterdam from 2008 to 2009. C. trachomatis samples were typed using multilocus sequence typing. Epidemiological and clinical data were derived from questionnaires and patient records. Typing of 277 samples from 260 MSM identified distinct C. trachomatis strains circulating concurrently over time. Men with lymphogranuloma venereum (LGV)-inducing strains were more likely to be infected with human immunodeficiency virus, more often had a history of STI, and had a higher frequency of risky sexual behavior. No such associations were found for non-LGV-inducing strains. MSM infected with heterosexual-associated strains were often younger (P = .04) and more often reported sex with women (P = .03), compared with men infected with MSM-associated strains. With the exception of LGV-inducing strains, no evidence was found that different C. trachomatis strains circulated in distinct subpopulations of MSM. This indicates that no separate transmission networks for C. trachomatis among MSM existed. However, younger MSM and bisexuals were more often infected with heterosexual-associated C. trachomatis strains.

  10. Association of virulence plasmid and antibiotic resistance determinants with chromosomal multilocus genotypes in Mexican Salmonella enterica serovar Typhimurium strains

    PubMed Central

    2009-01-01

    Background: Bacterial genomes are mosaic structures composed of genes present in every strain of the same species (core genome), and genes present in some but not all strains of a species (accessory genome). The aim of this study was to compare the genetic diversity of core and accessory genes of a Salmonella enterica subspecies enterica serovar Typhimurium (Typhimurium) population isolated from food-animal and human sources in four regions of Mexico. Multilocus sequence typing (MLST) and macrorestriction fingerprints by pulsed-field gel electrophoresis (PFGE) were used to address the core genetic variation, and genes involved in pathogenesis and antibiotic resistance were selected to evaluate the accessory genome. Results: We found a low genetic diversity for both housekeeping and accessory genes. Sequence type 19 (ST19) was supported as the founder genotype of STs 213, 302 and 429. We found a temporal pattern in which the derived ST213 is replacing the founder ST19 in the four geographic regions analyzed and a geographic trend in the number of resistance determinants. The distribution of the accessory genes was not random among chromosomal genotypes. We detected strong associations among the different accessory genes and the multilocus chromosomal genotypes (STs). First, the Salmonella virulence plasmid (pSTV) was found mostly in ST19 isolates. Second, the plasmid-borne betalactamase cmy-2 was found only in ST213 isolates. Third, the most abundant integron, IP-1 (dfrA12, orfF and aadA2), was found only in ST213 isolates. Fourth, the Salmonella genomic island (SGI1) was found mainly in a subgroup of ST19 isolates carrying pSTV. The mapping of accessory genes and multilocus genotypes on the dendrogram derived from macrorestriction fingerprints allowed the establishment of genetic subgroups within the population. Conclusion: Despite the low levels of genetic diversity of core and accessory genes, the non-random distribution of the accessory genes across chromosomal backgrounds allowed us to discover genetic subgroups within the population. This study provides information about the importance of the accessory genome in generating genetic variability within a bacterial population. PMID:19573249

  11. Intra-Species Genetic Diversity and Clonal Structure of Cryptosporidium parvum in Sheep Farms in a Confined Geographical Area in Northeastern Spain.

    PubMed

    Ramo, Ana; Monteagudo, Luis V; Del Cacho, Emilio; Sánchez-Acedo, Caridad; Quílez, Joaquín

    2016-01-01

    A multilocus fragment typing approach including eleven variable-number tandem-repeat (VNTR) loci and the GP60 gene was used to investigate the intra-farm and intra-host genetic diversity of Cryptosporidium parvum in sheep farms in a confined area in northeastern Spain. Genomic DNA samples of 113 C. parvum isolates from diarrheic pre-weaned lambs collected in 49 meat-type sheep farms were analyzed. Loci exhibited various degrees of polymorphism, the finding of 7-9 alleles in the four most variable and discriminatory markers (ML2, Cgd6_5400, Cgd6_3940, and GP60) being remarkable. The combination of alleles at the twelve loci identified a total of 74 multilocus subtypes (MLTs) and provided a Hunter-Gaston discriminatory index of 0.988 (95% CI, 0.979-0.996). The finding that most MLTs (n = 64) were unique to individual farms evidenced that cryptosporidial infection is mainly transmitted within sheep flocks, with herd-to-herd transmission playing a secondary role. Limited intra-host variability was found, since only five isolates were genotypically mixed. In contrast, a significant intra-farm genetic diversity was seen, with the presence of multiple MLTs on more than half of the farms (28/46), suggesting frequent mutations or genetic exchange through recombination. Comparison with a previous study in calves in northern Spain using the same 12-loci typing approach showed differences in the identity of major alleles at most loci, with a single MLT being shared between lambs and calves. Analysis of evolutionary descent by the algorithm eBURST indicated a high degree of genetic divergence, with over 41% MLTs appearing as singletons along with a high number of clonal complexes, most of them linking only two MLTs. Bayesian Structure analysis and F statistics also revealed the genetic remoteness of most C. parvum isolates and no ancestral population size was chosen. Linkage analysis evidenced a prevalent pattern of clonality within the parasite population.
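
    The Hunter-Gaston discriminatory index reported in this abstract is the probability that two isolates drawn at random without replacement belong to different types, D = 1 - [sum over types of n_j(n_j - 1)] / [N(N - 1)]. The sketch below only illustrates that formula; the function name and the example labels are hypothetical and are not the study's data, which the abstract does not list.

```python
from collections import Counter


def hunter_gaston_index(type_labels):
    """Return the Hunter-Gaston discriminatory index for a list of type labels.

    type_labels holds one label per isolate (for example, a multilocus subtype name).
    """
    counts = Counter(type_labels)          # n_j: number of isolates of each type
    n_total = sum(counts.values())         # N: total number of isolates
    if n_total < 2:
        raise ValueError("at least two isolates are required")
    # Number of ordered isolate pairs that share the same type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical labels, not taken from the study: three types among six isolates.
example_labels = ["MLT1", "MLT1", "MLT2", "MLT3", "MLT3", "MLT3"]
print(round(hunter_gaston_index(example_labels), 3))  # 1 - 8/30, prints 0.733
```

    A value near 1, such as the 0.988 quoted above, means that almost every pair of isolates is distinguished by the typing scheme.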

  12. Inference of domain-disease associations from domain-protein, protein-disease and disease-disease relationships.

    PubMed

    Zhang, Wangshu; Coba, Marcelo P; Sun, Fengzhu

    2016-01-11

    Protein domains can be viewed as portable units of biological function that define the functional properties of proteins. Therefore, if a protein is associated with a disease, protein domains might also be associated and define disease endophenotypes. However, knowledge about such domain-disease relationships is rarely available. Thus, identification of domains associated with human diseases would greatly improve our understanding of the mechanism of human complex diseases and further improve the prevention, diagnosis and treatment of these diseases. Based on phenotypic similarities among diseases, we first group diseases into overlapping modules. We then develop a framework to infer associations between domains and diseases through known relationships between diseases and modules, domains and proteins, as well as proteins and disease modules. Different methods including Association, Maximum likelihood estimation (MLE), Domain-disease pair exclusion analysis (DPEA), Bayesian, and Parsimonious explanation (PE) approaches are developed to predict domain-disease associations. We demonstrate the effectiveness of all the five approaches via a series of validation experiments, and show the robustness of the MLE, Bayesian and PE approaches to the involved parameters. We also study the effects of disease modularization in inferring novel domain-disease associations. Through validation, the AUC (Area Under the operating characteristic Curve) scores for Bayesian, MLE, DPEA, PE, and Association approaches are 0.86, 0.84, 0.83, 0.83 and 0.79, respectively, indicating the usefulness of these approaches for predicting domain-disease relationships. Finally, we choose the Bayesian approach to infer domains associated with two common diseases, Crohn's disease and type 2 diabetes. The Bayesian approach has the best performance for the inference of domain-disease relationships. The predicted landscape between domains and diseases provides a more detailed view about the disease mechanisms.

  13. Complete Deletion of the Fucose Operon in Haemophilus influenzae Is Associated with a Cluster in Multilocus Sequence Analysis-Based Phylogenetic Group II Related to Haemophilus haemolyticus: Implications for Identification and Typing

    PubMed Central

    de Gier, Camilla; Kirkham, Lea-Ann S.

    2015-01-01

    Nonhemolytic variants of Haemophilus haemolyticus are difficult to differentiate from Haemophilus influenzae despite a wide difference in pathogenic potential. A previous investigation characterized a challenging set of 60 clinical strains using multiple PCRs for marker genes and described strains that could not be unequivocally identified as either species. We have analyzed the same set of strains by multilocus sequence analysis (MLSA) and near-full-length 16S rRNA gene sequencing. MLSA unambiguously allocated all study strains to either of the two species, while identification by 16S rRNA sequence was inconclusive for three strains. Notably, the two methods yielded conflicting identifications for two strains. Most of the “fuzzy species” strains were identified as H. influenzae that had undergone complete deletion of the fucose operon. Such strains, which are untypeable by the H. influenzae multilocus sequence type (MLST) scheme, have sporadically been reported and predominantly belong to a single branch of H. influenzae MLSA phylogenetic group II. We also found evidence of interspecies recombination between H. influenzae and H. haemolyticus within the 16S rRNA genes. Establishing an accurate method for rapid and inexpensive identification of H. influenzae is important for disease surveillance and treatment. PMID:26378279

  14. The conquering of North America: dated phylogenetic and biogeographic inference of migratory behavior in bee hummingbirds.

    PubMed

    Licona-Vera, Yuyini; Ornelas, Juan Francisco

    2017-06-05

    Geographical and temporal patterns of diversification in bee hummingbirds (Mellisugini) were assessed with respect to the evolution of migration, critical for colonization of North America. We generated a dated multilocus phylogeny of the Mellisugini based on a dense sampling using Bayesian inference, maximum-likelihood and maximum parsimony methods, and reconstructed the ancestral states of distributional areas in a Bayesian framework and migratory behavior using maximum parsimony, maximum-likelihood and re-rooting methods. All phylogenetic analyses confirmed monophyly of the Mellisugini and the inclusion of Atthis, Calothorax, Doricha, Eulidia, Mellisuga, Microstilbon, Myrmia, Tilmatura, and Thaumastura. Mellisugini consists of two clades: (1) South American species (including Tilmatura dupontii), and (2) species distributed in North and Central America and the Caribbean islands. The second clade consists of four subclades: Mexican (Calothorax, Doricha) and Caribbean (Archilochus, Calliphlox, Mellisuga) sheartails, Calypte, and Selasphorus (incl. Atthis). Coalescent-based dating places the origin of the Mellisugini in the mid-to-late Miocene, with crown ages of most subclades in the early Pliocene, and subsequent species splits in the Pleistocene. Bee hummingbirds reached western North America by the end of the Miocene and the ancestral mellisuginid (bee hummingbirds) was reconstructed as sedentary, with four independent gains of migratory behavior during the evolution of the Mellisugini. Early colonization of North America and subsequent evolution of migration best explained biogeographic and diversification patterns within the Mellisugini. The repeated evolution of long-distance migration by different lineages was critical for the colonization of North America, contributing to the radiation of bee hummingbirds. Comparative phylogeography is needed to test whether the repeated evolution of migration resulted from northward expansion of southern sedentary populations.

  15. Molecular Phylogenetic Diversity of Dermatologic and Other Human Pathogenic Fusaria from Hospitals in Northern Italy

    USDA-ARS's Scientific Manuscript database

    Fifty-eight fusaria isolated from 52 Italian patients between 2004 and 2007 were subject to multilocus DNA sequence typing to characterize the spectrum of species and circulating sequence types (STs) associated with dermatological infections, especially onychomycoses and paronychia, and other fusari...

  16. Campylobacter multi-locus sequence typing subtypes detected on chicken livers available at retail.

    USDA-ARS's Scientific Manuscript database

    Foodborne campylobacteriosis has been traced to undercooked chicken liver. It is not known what prevalence of Campylobacter to expect on fresh chicken livers available at retail. The objectives of this study were to measure prevalence of Campylobacter associated with chicken livers at retail and d...

  17. Invited commentary: Lost in estimation--searching for alternatives to Markov chains to fit complex Bayesian models.

    PubMed

    Molitor, John

    2012-03-01

    Bayesian methods have seen an increase in popularity in a wide variety of scientific fields, including epidemiology. One of the main reasons for their widespread application is the power of the Markov chain Monte Carlo (MCMC) techniques generally used to fit these models. As a result, researchers often implicitly associate Bayesian models with MCMC estimation procedures. However, Bayesian models do not always require Markov-chain-based methods for parameter estimation. This is important, as MCMC estimation methods, while generally quite powerful, are complex and computationally expensive and suffer from convergence problems related to the manner in which they generate correlated samples used to estimate probability distributions for parameters of interest. In this issue of the Journal, Cole et al. (Am J Epidemiol. 2012;175(5):368-375) present an interesting paper that discusses non-Markov-chain-based approaches to fitting Bayesian models. These methods, though limited, can overcome some of the problems associated with MCMC techniques and promise to provide simpler approaches to fitting Bayesian models. Applied researchers will find these estimation approaches intuitively appealing and will gain a deeper understanding of Bayesian models through their use. However, readers should be aware that other non-Markov-chain-based methods are currently in active development and have been widely published in other fields.

  18. Multicentric epidemiological study of Aspergillus fumigatus isolates by multilocus enzyme electrophoresis.

    PubMed Central

    Rodriguez, E; De Meeüs, T; Mallie, M; Renaud, F; Symoens, F; Mondon, P; Piens, M A; Lebeau, B; Viviani, M A; Grillot, R; Nolard, N; Chapuis, F; Tortorano, A M; Bastide, J M

    1996-01-01

    The genotypes of 63 isolates of Aspergillus fumigatus obtained from three hospitals in different geographical areas and of eight culture collection strains were determined by multilocus enzyme electrophoresis. Twelve of the 17 enzymatic loci studied were polymorphic, giving rise to 48 different electrophoretic types. The existence of fixed multilocus genotypes, significant heterozygote deficits and excesses at the different loci, and linkage disequilibria within subpopulations strongly suggests a clonal reproduction mode for A. fumigatus. Numerical analysis of the comparison and disposition of the different electrophoretic types demonstrates a significant genetic differentiation between the three sampling sites. However, no correlation could be found between geographical distances and genetic differentiation. On account of the multiple discriminatory markers, multilocus enzyme electrophoresis typing seems to be a very powerful tool for epidemiological and reproductive mode studies of A. fumigatus. PMID:8880520

  19. Modeling Associations among Multivariate Longitudinal Categorical Variables in Survey Data: A Semiparametric Bayesian Approach

    ERIC Educational Resources Information Center

    Tchumtchoua, Sylvie; Dey, Dipak K.

    2012-01-01

    This paper proposes a semiparametric Bayesian framework for the analysis of associations among multivariate longitudinal categorical variables in high-dimensional data settings. This type of data is frequent, especially in the social and behavioral sciences. A semiparametric hierarchical factor analysis model is developed in which the…

  20. A New Perspective on Listeria monocytogenes Evolution

    PubMed Central

    Ragon, Marie; Wirth, Thierry; Hollandt, Florian; Lavenir, Rachel; Lecuit, Marc; Le Monnier, Alban; Brisse, Sylvain

    2008-01-01

    Listeria monocytogenes is a model organism for cellular microbiology and host–pathogen interaction studies and an important food-borne pathogen widespread in the environment, thus representing an attractive model to study the evolution of virulence. The phylogenetic structure of L. monocytogenes was determined by sequencing internal portions of seven housekeeping genes (3,288 nucleotides) in 360 representative isolates. Fifty-eight of the 126 disclosed sequence types were grouped into seven well-demarcated clonal complexes (clones) that comprised almost 75% of clinical isolates. Each clone had a unique or dominant serotype (4b for clones 1, 2 and 4, 1/2b for clones 3 and 5, 1/2a for clone 7, and 1/2c for clone 9), with no association of clones with clinical forms of human listeriosis. Homologous recombination was extremely limited (r/m<1 for nucleotides), implying long-term genetic stability of multilocus genotypes over time. Bayesian analysis based on 438 SNPs recovered the three previously defined lineages, plus one unclassified isolate of mixed ancestry. The phylogenetic distribution of serotypes indicated that serotype 4b evolved once from 1/2b, the likely ancestral serotype of lineage I. Serotype 1/2c derived once from 1/2a, with reference strain EGDe (1/2a) likely representing an intermediate evolutionary state. In contrast to housekeeping genes, the virulence factor internalin (InlA) evolved by localized recombination resulting in a mosaic pattern, with convergent evolution indicative of natural selection towards a truncation of InlA protein. This work provides a reference evolutionary framework for future studies on L. monocytogenes epidemiology, ecology, and virulence. PMID:18773117

  1. Multi-Locus Candidate Gene Analyses of Lipid Levels in a Pediatric Turkish Cohort: Lessons Learned on LPL, CETP, LIPC, ABCA1, and SHBG

    PubMed Central

    Eren, Fatih; Agirbasli, Deniz; White, Marquitta J.; Williams, Scott M

    2013-01-01

    Cardiovascular risk factors and atherosclerosis precursors were examined in 365 Turkish children and adolescents. Study participants were recruited at five different state schools. We tested single and multi-locus effects of six polymorphisms from five candidate genes, chosen based on prior known association with lipid levels in adults, for association with low (≤10th percentile) high density lipoprotein cholesterol (HDL-C) and high (≥90th percentile) triglycerides (TG), and the related continuous outcomes. We observed an association between CETP variant rs708272 and low HDL-C (allelic p=0.020, genotypic p=0.046), which was supported by an independent analysis, PRAT (PRAT control p=0.027). Sex-stratified logistic regression analysis showed that the B2 allele of rs708272 decreased odds of being in the lower tenth percentile of HDL-C measurements (OR=0.36, p=0.02) in girls; this direction of effect was also seen in boys but was not significant (OR=0.64, p=0.21). Logistic regression analysis also revealed that the T allele of rs6257 (SHBG) decreased odds of being in the top tenth percentile of TG measurements in boys (OR=0.43, p=0.03). Analysis of lipid levels as a continuous trait revealed a significant association between rs708272 (CETP) and LDL-C levels in males (p=0.02) with the B2B2 genotype group having the lowest mean LDL-C; the same direction of effect was also seen in females (p=0.05). An effect was also seen between rs708272 and HDL-C levels in girls (p=0.01), with the B2B2 genotype having the highest mean HDL-C levels. Multi-locus analysis, using quantitative multifactor dimensionality reduction (qMDR) identified the previously mentioned CETP variant as the best single locus model, and overall model, for predicting HDL-C levels in children. This study provides evidence for association between CETP and low HDL-C phenotype in children, but the results appear to be weaker in children than previous results in adults and may also be subject to gender effects. PMID:23988150

  2. Multilocus genetics to reconstruct aeromonad evolution

    PubMed Central

    2012-01-01

    Background Aeromonas spp. are versatile bacteria that exhibit a wide variety of lifestyles. In an attempt to improve the understanding of human aeromonosis, we investigated whether clinical isolates displayed specific characteristics in terms of genetic diversity, population structure and mode of evolution among Aeromonas spp. A collection of 195 Aeromonas isolates from human, animal and environmental sources was therefore genotyped using multilocus sequence analysis (MLSA) based on the dnaK, gltA, gyrB, radA, rpoB, tsf and zipA genes. Results The MLSA showed a high level of genetic diversity among the population, and multilocus-based phylogenetic analysis (MLPA) revealed 3 major clades: the A. veronii, A. hydrophila and A. caviae clades, among the eleven clades detected. Lower genetic diversity was observed within the A. caviae clade as well as among clinical isolates compared to environmental isolates. Clonal complexes, each of which included a limited number of strains, mainly corresponded to host-associated subclusters of strains, i.e., a fish-associated subset within A. salmonicida and 11 human-associated subsets, 9 of which included only disease-associated strains. The population structure was shown to be clonal, with modes of evolution that involved mutations in general and recombination events locally. Recombination was detected in 5 genes in the MLSA scheme and concerned approximately 50% of the STs. Therefore, these recombination events could explain the observed phylogenetic incongruities and low robustness. However, the MLPA globally confirmed the current systematics of the genus Aeromonas. Conclusions Evolution in the genus Aeromonas has resulted in exceptionally high genetic diversity. Emerging from this diversity, subsets of strains appeared to be host adapted and/or “disease specialized” while the A. caviae clade displayed an atypical tempo of evolution among aeromonads. Considering that A. salmonicida has been described as a genetically uniform pathogen that has adapted to fish through evolution from a variable ancestral population, we hypothesize that the population structure of aeromonads described herein suggested an ongoing process of adaptation to specialized niches associated with different degrees of advancement according to clades and clusters. PMID:22545815

  3. A pernicious agent affecting avocado in Israel: a novel symbiotic Fusarium sp. associated with the ambrosia beetle Euwallacea fornicatus

    USDA-ARS's Scientific Manuscript database

    Since first recorded in Israel in 2009, the ambrosia beetle, Euwallacea fornicatus Eichhoff, has been shown to vector a fusarial pathogen of avocado (Persea Americana Miller) in its mandibular mycangia. Multilocus molecular phylogenetic analyses indicate the pathogen represents a novel symbiotic Fus...

  4. Multilocus genotyping of a ‘Candidatus Phytoplasma aurantifolia’-related strain associated with cauliflower phyllody disease in China

    USDA-ARS's Scientific Manuscript database

    A new cauliflower disease characterized by formation of leaf-like inflorescences and malformed flowers occurred in a seed production field located in Yunnan, a southwest province of China. In the diseased plants, floral organs in three inner whorls (petals, carpels, and stamens) were under-develope...

  5. Streptococcus agalactiae serotype Ib as an agent of meningitis in two adult nonpregnant women.

    PubMed

    Martins, E R; Florindo, C; Martins, F; Aldir, I; Borrego, M J; Brum, L; Ramirez, M; Melo-Cristino, J

    2007-11-01

    Two temporally and geographically clustered cases of meningitis caused by Streptococcus agalactiae expressing the infrequent Ib serotype are reported. Characterization by pulsed-field gel electrophoresis and multilocus sequence typing revealed that the isolates were identical and represented the widely distributed ST10/ST8 lineage associated with serotype Ib.

  6. Clonality, recombination, and hybridization in the plumbing-inhabiting human pathogen Fusarium keratoplasticum inferred from multilocus sequence typing

    USDA-ARS's Scientific Manuscript database

    Recent work has shown that Fusarium species and genotypes most commonly associated with human infections, particularly of the cornea (mycotic keratitis), are the same as those most commonly isolated from plumbing systems. The species most dominant in plumbing biofilms is Fusarium keratoplasticum, a ...

  7. Multi-locus sequence subtypes of Campylobacter detected on the surface and from internal tissue of retail chicken livers

    USDA-ARS's Scientific Manuscript database

    Foodborne campylobacteriosis has been traced to undercooked chicken liver. The objectives of this study were to measure prevalence of Campylobacter associated with chicken livers at retail and determine which subtypes are detected on the surface and inner tissue of livers. Fifteen packages of fres...

  8. Multilocus sequence analysis reveals extensive genetic variety within Tenacibaculum spp. associated with ulcers in sea-farmed fish in Norway.

    PubMed

    Olsen, Anne Berit; Gulla, Snorre; Steinum, Terje; Colquhoun, Duncan J; Nilsen, Hanne K; Duchaud, Eric

    2017-06-01

    Skin ulcer development in sea-reared salmonids, commonly associated with Tenacibaculum spp., is a significant fish welfare and economic problem in Norwegian aquaculture. A collection of 89 Tenacibaculum isolates was subjected to multilocus sequence analysis (MLSA). The isolates were retrieved from outbreaks of clinical disease in farms spread along the Norwegian coastline from seven different fish species over a period of 19 years. MLSA reveals considerable genetic diversity, but allows identification of four main clades. One clade encompasses isolates belonging to the species T. dicentrarchi, whereas three clades encompass bacteria that likely represent novel, as yet undescribed species. The study identified T. maritimum in lumpsucker, T. ovolyticum in halibut, and has extended the host and geographic range for T. soleae, isolated from wrasse. The overall lack of clonality and host specificity, with some indication of geographical range restriction, argues for local epidemics involving multiple strains. The diversity of Tenacibaculum isolates from fish displaying ulcerative disease may complicate vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Bayesian multimodel inference for dose-response studies

    USGS Publications Warehouse

    Link, W.A.; Albers, P.H.

    2007-01-01

    Statistical inference in dose-response studies is model-based: The analyst posits a mathematical model of the relation between exposure and response, estimates parameters of the model, and reports conclusions conditional on the model. Such analyses rarely include any accounting for the uncertainties associated with model selection. The Bayesian inferential system provides a convenient framework for model selection and multimodel inference. In this paper we briefly describe the Bayesian paradigm and Bayesian multimodel inference. We then present a family of models for multinomial dose-response data and apply Bayesian multimodel inferential methods to the analysis of data on the reproductive success of American kestrels (Falco sparverius) exposed to various sublethal dietary concentrations of methylmercury.
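
    As a generic illustration of Bayesian multimodel inference (not the authors' implementation), posterior model probabilities follow from Bayes' rule, P(M_k | y) proportional to p(y | M_k) P(M_k), and a model-averaged estimate weights each model's estimate by that probability. The function names and the numbers below are hypothetical; the log marginal likelihoods are assumed to have been computed elsewhere, and all prior model probabilities are assumed to be positive.

```python
import math


def posterior_model_probs(log_marginal_likelihoods, prior_probs=None):
    """Posterior model probabilities from log marginal likelihoods and positive model priors."""
    k = len(log_marginal_likelihoods)
    if prior_probs is None:
        prior_probs = [1.0 / k] * k        # assumption: equal prior weight on each model
    log_post = [lml + math.log(p)
                for lml, p in zip(log_marginal_likelihoods, prior_probs)]
    shift = max(log_post)                  # subtract the maximum for numerical stability
    weights = [math.exp(v - shift) for v in log_post]
    total = sum(weights)
    return [w / total for w in weights]


def model_averaged(estimates, probs):
    """Average per-model estimates of one quantity, weighted by model probability."""
    return sum(e * p for e, p in zip(estimates, probs))


# Hypothetical values for three candidate dose-response models.
probs = posterior_model_probs([-120.4, -121.1, -125.0])
print([round(p, 3) for p in probs])
print(model_averaged([0.42, 0.39, 0.30], probs))
```

    Reporting the averaged quantity rather than the estimate from a single selected model is one way to carry model-selection uncertainty into the conclusions.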

  10. Bayesian LASSO, scale space and decision making in association genetics.

    PubMed

    Pasanen, Leena; Holmström, Lasse; Sillanpää, Mikko J

    2015-01-01

    LASSO is a penalized regression method that facilitates model fitting in situations where there are as many, or even more explanatory variables than observations, and only a few variables are relevant in explaining the data. We focus on the Bayesian version of LASSO and consider four problems that need special attention: (i) controlling false positives, (ii) multiple comparisons, (iii) collinearity among explanatory variables, and (iv) the choice of the tuning parameter that controls the amount of shrinkage and the sparsity of the estimates. The particular application considered is association genetics, where LASSO regression can be used to find links between chromosome locations and phenotypic traits in a biological organism. However, the proposed techniques are relevant also in other contexts where LASSO is used for variable selection. We separate the true associations from false positives using the posterior distribution of the effects (regression coefficients) provided by Bayesian LASSO. We propose to solve the multiple comparisons problem by using simultaneous inference based on the joint posterior distribution of the effects. Bayesian LASSO also tends to distribute an effect among collinear variables, making detection of an association difficult. We propose to solve this problem by considering not only individual effects but also their functionals (i.e. sums and differences). Finally, whereas in Bayesian LASSO the tuning parameter is often regarded as a random variable, we adopt a scale space view and consider a whole range of fixed tuning parameters, instead. The effect estimates and the associated inference are considered for all tuning parameters in the selected range and the results are visualized with color maps that provide useful insights into data and the association problem considered. The methods are illustrated using two sets of artificial data and one real data set, all representing typical settings in association genetics.

  11. BANYAN_Sigma: Bayesian classifier for members of young stellar associations

    NASA Astrophysics Data System (ADS)

    Gagné, Jonathan; Mamajek, Eric E.; Malo, Lison; Riedel, Adric; Rodriguez, David; Lafrenière, David; Faherty, Jacqueline K.; Roy-Loubier, Olivier; Pueyo, Laurent; Robin, Annie C.; Doyon, René

    2018-01-01

    BANYAN_Sigma calculates the membership probability that a given astrophysical object belongs to one of the currently known 27 young associations within 150 pc of the Sun, using Bayesian inference. This tool uses the sky position and proper motion measurements of an object, with optional radial velocity (RV) and distance (D) measurements, to derive a Bayesian membership probability. By default, the priors are adjusted such that a probability threshold of 90% will recover 50%, 68%, 82% or 90% of true association members depending on what observables are input (only sky position and proper motion, with RV, with D, with both RV and D, respectively). The algorithm is implemented in a Python package, in IDL, and is also implemented as an interactive web page.

  12. Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions.

    PubMed

    Chatterjee, Nilanjan; Kalaylioglu, Zeynep; Moslehi, Roxana; Peters, Ulrike; Wacholder, Sholom

    2006-12-01

    In modern genetic epidemiology studies, the association between the disease and a genomic region, such as a candidate gene, is often investigated using multiple SNPs. We propose a multilocus test of genetic association that can account for genetic effects that might be modified by variants in other genes or by environmental factors. We consider use of the venerable and parsimonious Tukey's 1-degree-of-freedom model of interaction, which is natural when individual SNPs within a gene are associated with disease through a common biological mechanism; in contrast, many standard regression models are designed as if each SNP has unique functional significance. On the basis of Tukey's model, we propose a novel but computationally simple generalized test of association that can simultaneously capture both the main effects of the variants within a genomic region and their interactions with the variants in another region or with an environmental exposure. We compared performance of our method with that of two standard tests of association, one ignoring gene-gene/gene-environment interactions and the other based on a saturated model of interactions. We demonstrate major power advantages of our method both in analysis of data from a case-control study of the association between colorectal adenoma and DNA variants in the NAT2 genomic region, which are well known to be related to a common biological phenotype, and under different models of gene-gene interactions with use of simulated data.
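
    For orientation, a Tukey-type 1-degree-of-freedom interaction model for two sets of SNPs can be written as below. This is a sketch of the general form only; the notation (S_{1j} and S_{2k} for genotype codes in the two regions, beta for main effects, theta for the single interaction parameter) is assumed here and is not quoted from the paper.

```latex
\operatorname{logit} \Pr(D = 1 \mid S_1, S_2)
  = \alpha
  + \sum_{j=1}^{J} \beta_{1j} S_{1j}
  + \sum_{k=1}^{K} \beta_{2k} S_{2k}
  + \theta \Bigl( \sum_{j=1}^{J} \beta_{1j} S_{1j} \Bigr)
           \Bigl( \sum_{k=1}^{K} \beta_{2k} S_{2k} \Bigr)
```

    Under this form every pairwise interaction coefficient equals \theta \beta_{1j} \beta_{2k}, so all J x K interactions are governed by one extra parameter, in contrast to a saturated model that assigns each pair its own coefficient.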

  13. The importance of molecular analyses for understanding the genetic diversity of Histoplasma capsulatum: an overview.

    PubMed

    Vite-Garín, Tania; Estrada-Bárcenas, Daniel Alfonso; Cifuentes, Joaquín; Taylor, Maria Lucia

    2014-01-01

    Advances in the classification of the human pathogen Histoplasma capsulatum (H. capsulatum) (ascomycete) are sustained by the results of several genetic analyses that support the high diversity of this dimorphic fungus. The present mini-review highlights the great genetic plasticity of H. capsulatum. Important records with different molecular tools, mainly single- or multi-locus sequence analyses developed with this fungus, are discussed. Recent phylogenetic data with a multi-locus sequence analysis using 5 polymorphic loci support a new clade and/or phylogenetic species of H. capsulatum for the Americas, which was associated with fungal isolates obtained from the migratory bat Tadarida brasiliensis. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.

  14. Inference on the Genetic Basis of Eye and Skin Color in an Admixed Population via Bayesian Linear Mixed Models.

    PubMed

    Lloyd-Jones, Luke R; Robinson, Matthew R; Moser, Gerhard; Zeng, Jian; Beleza, Sandra; Barsh, Gregory S; Tang, Hua; Visscher, Peter M

    2017-06-01

    Genetic association studies in admixed populations are underrepresented in the genomics literature, with a key concern for researchers being the adequate control of spurious associations due to population structure. Linear mixed models (LMMs) are well suited for genome-wide association studies (GWAS) because they account for both population stratification and cryptic relatedness and achieve increased statistical power by jointly modeling all genotyped markers. Additionally, Bayesian LMMs allow for more flexible assumptions about the underlying distribution of genetic effects, and can concurrently estimate the proportion of phenotypic variance explained by genetic markers. Using three recently published Bayesian LMMs, Bayes R, BSLMM, and BOLT-LMM, we investigate an existing data set on eye (n = 625) and skin (n = 684) color from Cape Verde, an island nation off West Africa that is home to individuals with a broad range of phenotypic values for eye and skin color due to the mix of West African and European ancestry. We use simulations to demonstrate the utility of Bayesian LMMs for mapping loci and studying the genetic architecture of quantitative traits in admixed populations. The Bayesian LMMs provide evidence for two new pigmentation loci: one for eye color (AHRR) and one for skin color (DDB1). Copyright © 2017 by the Genetics Society of America.

  15. Adaptive evolution of Mediterranean pines.

    PubMed

    Grivet, Delphine; Climent, José; Zabal-Aguirre, Mario; Neale, David B; Vendramin, Giovanni G; González-Martínez, Santiago C

    2013-09-01

    Mediterranean pines represent an extremely heterogeneous assembly. Although they have evolved under similar environmental conditions, they diversified long ago, ca. 10 Mya, and present distinct biogeographic and demographic histories. Therefore, it is of special interest to understand whether and to what extent they have developed specific strategies of adaptive evolution through time and space. To explore evolutionary patterns, the Mediterranean pines' phylogeny was first reconstructed analyzing a new set of 21 low-copy nuclear genes with multilocus Bayesian tree reconstruction methods. Secondly, a phylogenetic approach was used to search for footprints of natural selection and to examine the evolution of multiple phenotypic traits. We identified two genes (involved in pines' defense and stress responses) that have likely played a role in the adaptation of Mediterranean pines to their environment. Moreover, few life-history traits showed historical or evolutionary adaptive convergence in Mediterranean lineages, while patterns of character evolution revealed various evolutionary trade-offs linking growth-development, reproduction and fire-related traits. Assessing the evolutionary path of important life-history traits, as well as the genomic basis of adaptive variation is central to understanding the past evolutionary success of Mediterranean pines and their future response to environmental changes. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. Phylogeographic History and Gene Flow Among Giant Galápagos Tortoises on Southern Isabela Island

    PubMed Central

    Ciofi, Claudio; Wilson, Gregory A.; Beheregaray, Luciano B.; Marquez, Cruz; Gibbs, James P.; Tapia, Washington; Snell, Howard L.; Caccone, Adalgisa; Powell, Jeffrey R.

    2006-01-01

    Volcanic islands represent excellent models with which to study the effect of vicariance on colonization and dispersal, particularly when the evolution of genetic diversity mirrors the sequence of geological events that led to island formation. Phylogeographic inference, however, can be particularly challenging for recent dispersal events within islands, where the antagonistic effects of land bridge formation and vicariance can affect movements of organisms with limited dispersal ability. We investigated levels of genetic divergence and recovered signatures of dispersal events for 631 Galápagos giant tortoises across the volcanoes of Sierra Negra and Cerro Azul on the island of Isabela. These volcanoes are among the most recent formations in the Galápagos (<0.7 million years), and previous studies based on genetic and morphological data could not recover a consistent pattern of lineage sorting. We integrated nested clade analysis of mitochondrial DNA control region sequences, to infer historical patterns of colonization, and a novel Bayesian multilocus genotyping method for recovering evidence of recent migration across volcanoes using eleven microsatellite loci. These genetic studies illuminate taxonomic distinctions as well as provide guidance to possible repatriation programs aimed at countering the rapid population declines of these spectacular animals. PMID:16387883

  17. Phylogeographic history and gene flow among giant Galápagos tortoises on southern Isabela Island.

    PubMed

    Ciofi, Claudio; Wilson, Gregory A; Beheregaray, Luciano B; Marquez, Cruz; Gibbs, James P; Tapia, Washington; Snell, Howard L; Caccone, Adalgisa; Powell, Jeffrey R

    2006-03-01

    Volcanic islands represent excellent models with which to study the effect of vicariance on colonization and dispersal, particularly when the evolution of genetic diversity mirrors the sequence of geological events that led to island formation. Phylogeographic inference, however, can be particularly challenging for recent dispersal events within islands, where the antagonistic effects of land bridge formation and vicariance can affect movements of organisms with limited dispersal ability. We investigated levels of genetic divergence and recovered signatures of dispersal events for 631 Galápagos giant tortoises across the volcanoes of Sierra Negra and Cerro Azul on the island of Isabela. These volcanoes are among the most recent formations in the Galápagos (<0.7 million years), and previous studies based on genetic and morphological data could not recover a consistent pattern of lineage sorting. We integrated nested clade analysis of mitochondrial DNA control region sequences, to infer historical patterns of colonization, and a novel Bayesian multilocus genotyping method for recovering evidence of recent migration across volcanoes using eleven microsatellite loci. These genetic studies illuminate taxonomic distinctions as well as provide guidance to possible repatriation programs aimed at countering the rapid population declines of these spectacular animals.

  18. An efficient Bayesian meta-analysis approach for studying cross-phenotype genetic associations

    PubMed Central

    Majumdar, Arunabha; Haldar, Tanushree; Bhattacharya, Sourabh; Witte, John S.

    2018-01-01

    Simultaneous analysis of genetic associations with multiple phenotypes may reveal shared genetic susceptibility across traits (pleiotropy). For a locus exhibiting overall pleiotropy, it is important to identify which specific traits underlie this association. We propose a Bayesian meta-analysis approach (termed CPBayes) that uses summary-level data across multiple phenotypes to simultaneously measure the evidence of aggregate-level pleiotropic association and estimate an optimal subset of traits associated with the risk locus. This method uses a unified Bayesian statistical framework based on a spike and slab prior. CPBayes performs a fully Bayesian analysis by employing the Markov Chain Monte Carlo (MCMC) technique Gibbs sampling. It takes into account heterogeneity in the size and direction of the genetic effects across traits. It can be applied to both cohort data and separate studies of multiple traits having overlapping or non-overlapping subjects. Simulations show that CPBayes can produce higher accuracy in the selection of associated traits underlying a pleiotropic signal than the subset-based meta-analysis ASSET. We used CPBayes to undertake a genome-wide pleiotropic association study of 22 traits in the large Kaiser GERA cohort and detected six independent pleiotropic loci associated with at least two phenotypes. This includes a locus at chromosomal region 1q24.2 which exhibits an association simultaneously with the risk of five different diseases: Dermatophytosis, Hemorrhoids, Iron Deficiency, Osteoporosis and Peripheral Vascular Disease. We provide an R-package ‘CPBayes’ implementing the proposed method. PMID:29432419

  19. Learning Bayesian Networks from Correlated Data

    NASA Astrophysics Data System (ADS)

    Bae, Harold; Monti, Stefano; Montano, Monty; Steinberg, Martin H.; Perls, Thomas T.; Sebastiani, Paola

    2016-05-01

    Bayesian networks are probabilistic models that represent complex distributions in a modular way and have become very popular in many fields. There are many methods to build Bayesian networks from a random sample of independent and identically distributed observations. However, many observational studies are designed using some form of clustered sampling that introduces correlations between observations within the same cluster and ignoring this correlation typically inflates the rate of false positive associations. We describe a novel parameterization of Bayesian networks that uses random effects to model the correlation within sample units and can be used for structure and parameter learning from correlated data without inflating the Type I error rate. We compare different learning metrics using simulations and illustrate the method in two real examples: an analysis of genetic and non-genetic factors associated with human longevity from a family-based study, and an example of risk factors for complications of sickle cell anemia from a longitudinal study with repeated measures.

  20. Inferences on the phylogeography of the fungal pathogen Heterobasidion annosum, including evidence of interspecific horizontal genetic transfer and of human-mediated, long-range dispersal

    Treesearch

    R.E. Linzer; W.J. Otrosina; P. Gonthier; J. Bruhn; G. Laflamme; G. Bussieres; M. Garbelotto

    2008-01-01

    Fungi in the basidiomycete species complex Heterobasidion annosum are significant root-rot pathogens of conifers throughout the northern hemisphere. We utilize a multilocus phylogenetic approach to examine hypotheses regarding the evolution and divergence of two Heterobasidion taxa associated with pines: the Eurasian H. ...

  1. Multilocus genotyping identifies a highly homogeneous phytoplasma lineage associated with sweet cherry virescence disease in China and its carriage by an erythroneurine leafhopper

    USDA-ARS's Scientific Manuscript database

    Phytoplasmas are a diverse group of insect-transmitted, cell wall-less bacteria that colonize plant phloem sieve elements and cause numerous diseases in economically important crops. Living a trans-kingdom parasitic life, phytoplasmas possess dynamic genomes and evolve rapidly toward formation of di...

  2. Molecular sequence typing reveals genotypic diversity among Escherichia coli isolates recovered from a cantaloupe packinghouse in Northwestern Mexico

    USDA-ARS's Scientific Manuscript database

    The increase in the consumption of fresh produce in the United States has correlated with a rise in the number of reported foodborne illnesses. To identify potential risk factors associated with post-harvest practices, the present study employed multilocus sequence typing (MLST) for the genotypic c...

  3. Salmonella enterica serotype enteritidis in French Polynesia, South Pacific, 2008-2013.

    PubMed

    Le Hello, Simon; Maillard, Fiona; Mallet, Henri-Pierre; Daudens, Elise; Levy, Marc; Roy, Valérie; Branaa, Philippe; Bertrand, Sophie; Fabre, Laetitia; Weill, François-Xavier

    2015-06-01

    Outbreaks of Salmonella enterica serotype Enteritidis infections associated with eggs occurred in French Polynesia during 2008-2013. Molecular analysis of isolates by using clustered regularly interspaced short palindromic repeat polymorphisms and multilocus variable-number tandem-repeat analysis was performed. This subtyping made defining the epidemic strain, finding the source, and decontaminating affected poultry flocks possible.

  4. A Bayesian network model to assess the public health risk associated with wet weather sewer overflows discharging into waterways.

    PubMed

    Goulding, R; Jayasuriya, N; Horan, E

    2012-10-15

    Overflows from sanitary sewers during wet weather, which occur when the hydraulic capacity of the sewer system is exceeded, are considered a potential threat to the ecological and public health of the waterways which receive these overflows. As a result, water retailers in Australia and internationally commit significant resources to manage and abate sewer overflows. However, whilst some studies have contributed to an increased understanding of the impacts and risks associated with these events, they are relatively few in number and there still is a general lack of knowledge in this area. A Bayesian network model to assess the public health risk associated with wet weather sewer overflows is presented in this paper. The Bayesian network approach is shown to provide significant benefits in the assessment of public health risks associated with wet weather sewer overflows. In particular, the ability for the model to account for the uncertainty inherent in sewer overflow events and subsequent impacts through the use of probabilities is a valuable function. In addition, the paper highlights the benefits of the probabilistic inference function of the Bayesian network in prioritising management options to minimise public health risks associated with sewer overflows. Copyright © 2012. Published by Elsevier Ltd.

  5. Proper joint analysis of summary association statistics requires the adjustment of heterogeneity in SNP coverage pattern.

    PubMed

    Zhang, Han; Wheeler, William; Song, Lei; Yu, Kai

    2017-07-07

    As meta-analysis results published by consortia of genome-wide association studies (GWASs) become increasingly available, many association summary statistics-based multi-locus tests have been developed to jointly evaluate multiple single-nucleotide polymorphisms (SNPs) to reveal novel genetic architectures of various complex traits. The validity of these approaches relies on the accurate estimate of z-score correlations at considered SNPs, which in turn requires knowledge on the set of SNPs assessed by each study participating in the meta-analysis. However, this exact SNP coverage information is usually unavailable from the meta-analysis results published by GWAS consortia. In the absence of the coverage information, researchers typically estimate the z-score correlations by making oversimplified coverage assumptions. We show through real studies that such a practice can generate highly inflated type I errors, and we demonstrate the proper way to incorporate correct coverage information into multi-locus analyses. We advocate that consortia should make SNP coverage information available when posting their meta-analysis results, and that investigators who develop analytic tools for joint analyses based on summary data should pay attention to the variation in SNP coverage and adjust for it appropriately. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  6. Multilocus sequence analysis of phytopathogenic species of the genus Streptomyces

    USDA-ARS's Scientific Manuscript database

    The identification and classification of species within the genus Streptomyces is difficult because there are presently 576 validly described species and this number increases every year. The value of the application of multilocus sequence analysis scheme to the systematics of Streptomyces species h...

  7. Editorial: Bayesian benefits for child psychology and psychiatry researchers.

    PubMed

    Oldehinkel, Albertine J

    2016-09-01

    For many scientists, performing statistical tests has become an almost automated routine. However, p-values are frequently used and interpreted incorrectly; and even when used appropriately, p-values tend to provide answers that do not match researchers' questions and hypotheses well. Bayesian statistics present an elegant and often more suitable alternative. The Bayesian approach has rarely been applied in child psychology and psychiatry research so far, but the development of user-friendly software packages and tutorials has placed it well within reach now. Because Bayesian analyses require a more refined definition of hypothesized probabilities of possible outcomes than the classical approach, going Bayesian may offer the additional benefit of sparking the development and refinement of theoretical models in our field. © 2016 Association for Child and Adolescent Mental Health.

  8. Bayesian network modeling applied to coastal geomorphology: lessons learned from a decade of experimentation and application

    NASA Astrophysics Data System (ADS)

    Plant, N. G.; Thieler, E. R.; Gutierrez, B.; Lentz, E. E.; Zeigler, S. L.; Van Dongeren, A.; Fienen, M. N.

    2016-12-01

    We evaluate the strengths and weaknesses of Bayesian networks that have been used to address scientific and decision-support questions related to coastal geomorphology. We will provide an overview of coastal geomorphology research that has used Bayesian networks and describe what this approach can do and when it works (or fails to work). Over the past decade, Bayesian networks have been formulated to analyze the multi-variate structure and evolution of coastal morphology and associated human and ecological impacts. The approach relates observable system variables to each other by estimating discrete correlations. The resulting Bayesian networks make predictions that propagate errors, conduct inference via Bayes rule, or both. In scientific applications, the model results are useful for hypothesis testing, using confidence estimates to gauge the strength of tests, while applications to coastal resource management are aimed at decision-support, where the probabilities of desired ecosystem outcomes are evaluated. The range of Bayesian-network applications to coastal morphology includes emulation of high-resolution wave transformation models to make oceanographic predictions, morphologic response to storms and/or sea-level rise, groundwater response to sea-level rise and morphologic variability, habitat suitability for endangered species, and assessment of monetary or human-life risk associated with storms. All of these examples are based on vast observational data sets, numerical model output, or both. We will discuss the progression of our experiments, which has included testing whether the Bayesian-network approach can be implemented and is appropriate for addressing basic and applied scientific problems and evaluating the hindcast and forecast skill of these implementations. We will present and discuss calibration/validation tests that are used to assess the robustness of Bayesian-network models and we will compare these results to tests of other models. This will demonstrate how Bayesian networks are used to extract new insights about coastal morphologic behavior, assess impacts to societal and ecological systems, and communicate probabilistic predictions to decision makers.

  9. Novel fusarium head blight pathogens from Nepal and Louisiana revealed by multilocus genealogical concordance

    USDA-ARS's Scientific Manuscript database

    This study was conducted to assess evolutionary relationships, species diversity, and trichothecene toxin potential of five Fusarium graminearum complex (FGSC) isolates identified as genetically novel during prior Fusarium head blight (FHB) surveys in Nepal and Louisiana. Results of a multilocus gen...

  10. Using Bayesian variable selection to analyze regular resolution IV two-level fractional factorial designs

    DOE PAGES

    Chipman, Hugh A.; Hamada, Michael S.

    2016-06-02

    Regular two-level fractional factorial designs have complete aliasing in which the associated columns of multiple effects are identical. Here, we show how Bayesian variable selection can be used to analyze experiments that use such designs. In addition to sparsity and hierarchy, Bayesian variable selection naturally incorporates heredity. This prior information is used to identify the most likely combinations of active terms. We also demonstrate the method on simulated and real experiments.

  11. Using Bayesian variable selection to analyze regular resolution IV two-level fractional factorial designs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chipman, Hugh A.; Hamada, Michael S.

    Regular two-level fractional factorial designs have complete aliasing in which the associated columns of multiple effects are identical. Here, we show how Bayesian variable selection can be used to analyze experiments that use such designs. In addition to sparsity and hierarchy, Bayesian variable selection naturally incorporates heredity. This prior information is used to identify the most likely combinations of active terms. We also demonstrate the method on simulated and real experiments.

  12. Genetic diversity of Flavobacterium psychrophilum isolates from three Oncorhynchus spp. in the United States, as revealed by multilocus sequence typing

    USDA-ARS's Scientific Manuscript database

    Flavobacterium psychrophilum is an important pathogen of salmonids worldwide. Multilocus sequence typing (MLST) has identified a recombinogenic population structure from which emerged a few epidemic clonal complexes particularly threatening for salmonid aquaculture. To date, MLST genotypes for this ...

  13. The rise of ampicillin-resistant Enterococcus faecium high-risk clones as a frequent intestinal colonizer in oncohaematological neutropenic patients on levofloxacin prophylaxis: a risk for bacteraemia?

    PubMed

    Sánchez-Díaz, A M; Cuartero, C; Rodríguez, J D; Lozano, S; Alonso, J M; Rodríguez-Domínguez, M; Tedim, A P; Del Campo, R; López, J; Cantón, R; Ruiz-Garbajosa, P

    2016-01-01

    Levofloxacin extended prophylaxis (LEP), recommended in oncohaematological neutropenic patients to reduce infections, might select resistant bacteria in the intestine acting as a source of endogenous infection. In a prospective observational study we evaluated intestinal emergence and persistence of ampicillin-resistant Enterococcus faecium (AREfm), a marker of hospital adapted high-risk clones. AREfm was recovered from the faeces of 52 patients with prolonged neutropenia after chemotherapy, at admission (Basal), during LEP, and twice weekly until discharge (Pos-LEP). Antibiotic susceptibility, virulence traits and population structure (pulsed-field gel electrophoresis and multilocus sequence typing) were determined and compared with bacteraemic isolates. Gut enterococcal population was monitored using a quantitative PCR quantification approach. AREfm colonized 61.4% of patients (194/482 faecal samples). Sequential AREfm acquisition (25% Basal, 36.5% LEP, 50% Pos-LEP) and high persistent colonization rates (76.9-89.5%) associated with a decrease in clonal diversity were demonstrated. Isolates were clustered into 24 PFGE-patterns within 13 sequence types, 95.8% of them belonging to hospital-associated Bayesian analysis of population structure subgroups 2.1a and 3.3a. Levofloxacin resistance and high-level streptomycin resistance were a common trait of these high-risk clones. AREfm-ST117, the most persistent clone, was dominant (60.0% isolates, 32.6% patients). It presented esp gene and caused 18.2% of all bacteraemia episodes in 21% of patients previously colonized by this clone. In AREfm-colonized patients, intestinal enrichment in the E. faecium population with a decline in total bacterial load was observed. AREfm intestinal colonization increases during hospital stay and coincides with enterococci population enrichment in the gut. Dominance and intestinal persistence of the ST117 clone might increase the risk of bacteraemia. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.

  14. Salmonella enterica Serotype Enteritidis in French Polynesia, South Pacific, 2008–2013

    PubMed Central

    Maillard, Fiona; Mallet, Henri-Pierre; Daudens, Elise; Levy, Marc; Roy, Valérie; Branaa, Philippe; Bertrand, Sophie; Fabre, Laetitia; Weill, François-Xavier

    2015-01-01

    Outbreaks of Salmonella enterica serotype Enteritidis infections associated with eggs occurred in French Polynesia during 2008–2013. Molecular analysis of isolates by using clustered regularly interspaced short palindromic repeat polymorphisms and multilocus variable-number tandem-repeat analysis was performed. This subtyping made defining the epidemic strain, finding the source, and decontaminating affected poultry flocks possible. PMID:25988406

  15. Bayesian shrinkage approach for a joint model of longitudinal and survival outcomes assuming different association structures.

    PubMed

    Andrinopoulou, Eleni-Rosalina; Rizopoulos, Dimitris

    2016-11-20

    The joint modeling of longitudinal and survival data has recently received much attention. Several extensions of the standard joint model that consists of one longitudinal and one survival outcome have been proposed including the use of different association structures between the longitudinal and the survival outcomes. However, in general, relatively little attention has been given to the selection of the most appropriate functional form to link the two outcomes. In common practice, it is assumed that the underlying value of the longitudinal outcome is associated with the survival outcome. However, it could be that different characteristics of the patients' longitudinal profiles influence the hazard: for example, not only the current value but also the slope or the area under the curve of the longitudinal outcome. The choice of which functional form to use is an important decision that needs to be investigated because it could influence the results. In this paper, we use a Bayesian shrinkage approach in order to determine the most appropriate functional forms. We propose a joint model that includes different association structures of different biomarkers and assume informative priors for the regression coefficients that correspond to the terms of the longitudinal process. Specifically, we assume Bayesian lasso, Bayesian ridge, Bayesian elastic net, and horseshoe. These methods are applied to a dataset consisting of patients with a chronic liver disease, where it is important to investigate which characteristics of the biomarkers have an influence on survival. Copyright © 2016 John Wiley & Sons, Ltd.

  16. Trichomonas vaginalis Metronidazole Resistance Is Associated with Single Nucleotide Polymorphisms in the Nitroreductase Genes ntr4Tv and ntr6Tv

    PubMed Central

    Paulish-Miller, Teresa E.; Augostini, Peter; Schuyler, Jessica A.; Smith, William L.; Mordechai, Eli; Adelson, Martin E.; Gygax, Scott E.; Secor, William E.

    2014-01-01

    Metronidazole resistance in the sexually transmitted parasite Trichomonas vaginalis is a problematic public health issue. We have identified single nucleotide polymorphisms (SNPs) in two nitroreductase genes (ntr4Tv and ntr6Tv) associated with resistance. These SNPs were associated with one of two distinct T. vaginalis populations identified by multilocus sequence typing, yet one SNP (ntr6Tv A238T), which results in a premature stop codon, was associated with resistance independent of population structure and may be of diagnostic value. PMID:24550324

  17. Bayesian modeling of flexible cognitive control

    PubMed Central

    Jiang, Jiefeng; Heller, Katherine; Egner, Tobias

    2014-01-01

    “Cognitive control” describes endogenous guidance of behavior in situations where routine stimulus-response associations are suboptimal for achieving a desired goal. The computational and neural mechanisms underlying this capacity remain poorly understood. We examine recent advances stemming from the application of a Bayesian learner perspective that provides optimal prediction for control processes. In reviewing the application of Bayesian models to cognitive control, we note that an important limitation in current models is a lack of a plausible mechanism for the flexible adjustment of control over conflict levels changing at varying temporal scales. We then show that flexible cognitive control can be achieved by a Bayesian model with a volatility-driven learning mechanism that modulates dynamically the relative dependence on recent and remote experiences in its prediction of future control demand. We conclude that the emergent Bayesian perspective on computational mechanisms of cognitive control holds considerable promise, especially if future studies can identify neural substrates of the variables encoded by these models, and determine the nature (Bayesian or otherwise) of their neural implementation. PMID:24929218

  18. Daniel Goodman’s empirical approach to Bayesian statistics

    USGS Publications Warehouse

    Gerrodette, Tim; Ward, Eric; Taylor, Rebecca L.; Schwarz, Lisa K.; Eguchi, Tomoharu; Wade, Paul; Himes Boor, Gina

    2016-01-01

    Bayesian statistics, in contrast to classical statistics, uses probability to represent uncertainty about the state of knowledge. Bayesian statistics has often been associated with the idea that knowledge is subjective and that a probability distribution represents a personal degree of belief. Dr. Daniel Goodman considered this viewpoint problematic for issues of public policy. He sought to ground his Bayesian approach in data, and advocated the construction of a prior as an empirical histogram of “similar” cases. In this way, the posterior distribution that results from a Bayesian analysis combined comparable previous data with case-specific current data, using Bayes’ formula. Goodman championed such a data-based approach, but he acknowledged that it was difficult in practice. If based on a true representation of our knowledge and uncertainty, Goodman argued that risk assessment and decision-making could be an exact science, despite the uncertainties. In his view, Bayesian statistics is a critical component of this science because a Bayesian analysis produces the probabilities of future outcomes. Indeed, Goodman maintained that the Bayesian machinery, following the rules of conditional probability, offered the best legitimate inference from available data. We give an example of an informative prior in a recent study of Steller sea lion spatial use patterns in Alaska.
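    The empirical-prior idea summarized in this record can be illustrated with a minimal sketch. It is not Goodman's own procedure or code: the binning scheme, the binomial likelihood, and the function names `empirical_prior` and `posterior` are all assumptions made for illustration. The prior is a normalized histogram of rates observed in "similar" cases, and Bayes' formula combines it with case-specific data.

```python
import math

def empirical_prior(similar_rates, bin_edges):
    """Normalized histogram of rates from 'similar' cases (hypothetical helper)."""
    n_bins = len(bin_edges) - 1
    counts = [0] * n_bins
    for r in similar_rates:
        for i in range(n_bins):
            lo, hi = bin_edges[i], bin_edges[i + 1]
            # Bins are half-open [lo, hi), except the last, which includes hi.
            if lo <= r < hi or (i == n_bins - 1 and r == hi):
                counts[i] += 1
                break
    total = sum(counts)
    if total == 0:
        raise ValueError("no similar case falls inside the bins")
    return [c / total for c in counts]

def posterior(prior, bin_edges, successes, trials):
    """Bayes' formula on the grid: posterior is proportional to prior times likelihood.

    Assumes a binomial likelihood evaluated at each bin midpoint.
    """
    mids = [(bin_edges[i] + bin_edges[i + 1]) / 2 for i in range(len(prior))]
    like = [
        math.comb(trials, successes) * m**successes * (1 - m) ** (trials - successes)
        for m in mids
    ]
    unnorm = [p * l for p, l in zip(prior, like)]
    z = sum(unnorm)
    return [u / z for u in unnorm]

# Illustrative, made-up numbers: five comparable cases, then 2 events in 10 trials.
edges = [0.0, 0.25, 0.5, 0.75, 1.0]
prior = empirical_prior([0.10, 0.15, 0.30, 0.35, 0.60], edges)
post = posterior(prior, edges, successes=2, trials=10)
```

    In this sketch a bin that contains no comparable case has zero prior mass and therefore zero posterior mass, which shows how strongly the choice of "similar" cases shapes the result.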

  19. Multilocus dataset reveals demographic histories of two peat mosses in Europe

    PubMed Central

    Szövényi, Péter; Hock, Zsófia; Schneller, Jakob J; Tóth, Zoltán

    2007-01-01

    Background: Revealing the past and present demographic history of populations is of high importance to evaluate the conservation status of species. Demographic data can be obtained by direct monitoring or by analysing data of historical and recent collections. Although these methods provide the most detailed information they are very time consuming. Another alternative way is to make use of the information accumulated in the species' DNA over its history. Recent development of the coalescent theory makes it possible to reconstruct the demographic history of species using nucleotide polymorphism data. To separate the effect of natural selection and demography, multilocus analysis is needed because these two forces can produce similar patterns of polymorphisms. In this study we investigated the amount and pattern of sequence variability of a Europe wide sample set of two peat moss species (Sphagnum fimbriatum and S. squarrosum) with similar distributions and mating systems but presumably contrasting historical demographies using 3 regions of the nuclear genome (appr. 3000 bps). We aimed to draw inferences concerning demographic, and phylogeographic histories of the species. Results: All three nuclear regions supported the presence of an Atlantic and Non-Atlantic clade of S. fimbriatum suggesting glacial survival of the species along the Atlantic coast of Europe. Contrarily, S. squarrosum haplotypes showed three clades but no geographic structure at all. Maximum likelihood, mismatch and Bayesian analyses supported a severe historical bottleneck and a relatively recent demographic expansion of the Non-Atlantic clade of S. fimbriatum, whereas size of S. squarrosum populations has probably decreased in the past. Species wide molecular diversity of the two species was nearly the same with an excess of replacement mutations in S. fimbriatum. Similar levels of molecular diversity, contrasting phylogeographic patterns and excess of replacement mutations in S. fimbriatum compared to S. squarrosum mirror unexpected differences in the demography and population history of the species. Conclusion: This study represents the first detailed European wide phylodemographic investigation on bryophytes and shows how pattern of nucleotide polymorphism can reveal unexpected differences in the population history of haploid plants with seemingly similar characteristics. PMID:17714592

  20. Potential for bias and low precision in molecular divergence time estimation of the Canopy of Life: an example from aquatic bird families

    PubMed Central

    van Tuinen, Marcel; Torres, Christopher R.

    2015-01-01

    Uncertainty in divergence time estimation is frequently studied from many angles but rarely from the perspective of phylogenetic node age. If appropriate molecular models and fossil priors are used, a multi-locus, partitioned analysis is expected to equally minimize error in accuracy and precision across all nodes of a given phylogeny. In contrast, if available models fail to completely account for rate heterogeneity, substitution saturation and incompleteness of the fossil record, uncertainty in divergence time estimation may increase with node age. While many studies have stressed this concern with regard to deep nodes in the Tree of Life, the inference that molecular divergence time estimation of shallow nodes is less sensitive to erroneous model choice has not been tested explicitly in a Bayesian framework. Because of available divergence time estimation methods that permit fossil priors across any phylogenetic node and the present increase in efficient, cheap collection of species-level genomic data, insight is needed into the performance of divergence time estimation of shallow (<10 MY) nodes. Here, we performed multiple sensitivity analyses in a multi-locus data set of aquatic birds with six fossil constraints. Comparison across divergence time analyses that varied taxon and locus sampling, number and position of fossil constraint and shape of prior distribution showed various insights. Deviation from node ages obtained from a reference analysis was generally highest for the shallowest nodes but determined more by temporal placement than number of fossil constraints. Calibration with only the shallowest nodes significantly underestimated the aquatic bird fossil record, indicating the presence of saturation. Although joint calibration with all six priors yielded ages most consistent with the fossil record, ages of shallow nodes were overestimated. This bias was found in both mtDNA and nDNA regions. 
Thus, divergence time estimation of shallow nodes may suffer from bias and low precision, even when appropriate fossil priors and best available substitution models are chosen. Much care must be taken to address the possible ramifications of substitution saturation across the entire Tree of Life. PMID:26106406

  1. Intra-Species Genetic Diversity and Clonal Structure of Cryptosporidium parvum in Sheep Farms in a Confined Geographical Area in Northeastern Spain

    PubMed Central

    Ramo, Ana; Monteagudo, Luis V.; Del Cacho, Emilio; Sánchez-Acedo, Caridad

    2016-01-01

    A multilocus fragment typing approach including eleven variable-number tandem-repeat (VNTR) loci and the GP60 gene was used to investigate the intra-farm and intra-host genetic diversity of Cryptosporidium parvum in sheep farms in a confined area in northeastern Spain. Genomic DNA samples of 113 C. parvum isolates from diarrheic pre-weaned lambs collected in 49 meat-type sheep farms were analyzed. Loci exhibited various degrees of polymorphism, the finding of 7–9 alleles in the four most variable and discriminatory markers (ML2, Cgd6_5400, Cgd6_3940, and GP60) being remarkable. The combination of alleles at the twelve loci identified a total of 74 multilocus subtypes (MLTs) and provided a Hunter-Gaston discriminatory index of 0.988 (95% CI, 0.979−0.996). The finding that most MLTs (n = 64) were unique to individual farms evidenced that cryptosporidial infection is mainly transmitted within sheep flocks, with herd-to-herd transmission playing a secondary role. Limited intra-host variability was found, since only five isolates were genotypically mixed. In contrast, a significant intra-farm genetic diversity was seen, with the presence of multiple MLTs on more than half of the farms (28/46), suggesting frequent mutations or genetic exchange through recombination. Comparison with a previous study in calves in northern Spain using the same 12-loci typing approach showed differences in the identity of major alleles at most loci, with a single MLT being shared between lambs and calves. Analysis of evolutionary descent by the algorithm eBURST indicated a high degree of genetic divergence, with over 41% MLTs appearing as singletons along with a high number of clonal complexes, most of them linking only two MLTs. Bayesian Structure analysis and F statistics also revealed the genetic remoteness of most C. parvum isolates and no ancestral population size was chosen. Linkage analysis evidenced a prevalent pattern of clonality within the parasite population. PMID:27176718
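
    The Hunter-Gaston discriminatory index reported in the abstract above is commonly defined as D = 1 - [1 / (N(N - 1))] * sum over types of n_j(n_j - 1), where N is the number of isolates and n_j is the number of isolates of type j. The following is a minimal sketch of that calculation, not the authors' code. The function name and the toy labels are hypothetical, since the per-MLT isolate counts are not given in the abstract.

```python
from collections import Counter


def hunter_gaston_index(type_labels):
    """Hunter-Gaston discriminatory index for a list of subtype labels.

    D = 1 - sum_j n_j * (n_j - 1) / (N * (N - 1))
    i.e. the probability that two isolates drawn without replacement
    belong to different subtypes.
    """
    counts = Counter(type_labels)
    n_total = sum(counts.values())
    if n_total < 2:
        raise ValueError("need at least two isolates")
    same_type_pairs = sum(n * (n - 1) for n in counts.values())
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical toy data (NOT from the study): five isolates, four subtypes.
toy_labels = ["MLT1", "MLT1", "MLT2", "MLT3", "MLT4"]
toy_index = hunter_gaston_index(toy_labels)
print(round(toy_index, 3))
```

    An index near 1 means almost every pair of isolates is told apart by the typing scheme. An index of 0 means every isolate has the same subtype.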

  2. Multilocus Genotyping and Molecular Phylogenetics Resolve a Novel Head Blight Pathogen within the Fusarium graminearum Species Complex from Ethiopia

    USDA-ARS's Scientific Manuscript database

    A survey of Fusarium head blight (FHB)-contaminated wheat in Ethiopia recovered 31 isolates resembling members of the Fusarium graminearum species complex. Results of a multilocus genotyping (MLGT) assay for FHB species and trichothecene chemotype determination suggested that 22 of these isolates m...

  3. Taxonomic evaluation of putative Streptomyces scabiei strains held in the ARS (NRRL) Culture Collection using multi-locus sequence analysis

    USDA-ARS's Scientific Manuscript database

    Multi-locus sequence analysis has been demonstrated to be a useful tool for identification of Streptomyces species and was previously applied to phylogenetically differentiate the type strains of species pathogenic on potatoes (Solanum tuberosum L.). The ARS Culture Collection (NRRL) contains 43 str...

  4. Multilocus resolution of Mugilidae phylogeny (Teleostei: Mugiliformes): Implications for the family's taxonomy.

    PubMed

    Xia, Rong; Durand, Jean-Dominique; Fu, Cuizhang

    2016-03-01

    The interrelationships among mugilids (Mugiliformes: Mugilidae) remain highly debated. Using a mitochondrial gene-based phylogeny as criterion, a revised classification with 25 genera in the Mugilidae has recently been proposed. However, phylogenetic relationships of major mitochondrial lineages remain unresolved and to gain a general acceptance the classification requires confirmation based on multilocus evidence and diagnostic morphological characters. Here, we construct a species-tree using twelve nuclear and three mitochondrial loci and infer the evolution of 71 morphological characters. Our multilocus phylogeny does not agree with previous morphology-based hypotheses for the relationships within Mugilidae, confirms the revised classification with 25 genera and further resolves their phylogenetic relationships. Using the well-resolved multilocus phylogeny as the criterion, we reclassify Mugilidae genera into three new subfamilies (Myxinae, Rhinomugilinae, and Cheloninae) and one new, recombined, subfamily (Mugilinae). The Rhinomugilinae subfamily is further divided into four tribes. The revised classification of Mugilidae is supported by morpho-anatomical synapomorphies or a combination of characters. These characters are used to erect a key to the subfamilies and genera. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. [Multilocus Sequence Typing analysis of human Campylobacter coli in Granada (Spain)].

    PubMed

    Carrillo-Ávila, J A; Sorlózano-Puerto, A; Pérez-Ruiz, M; Gutiérrez-Fernández, J

    2016-12-01

    Different subtypes of Campylobacter spp. have been associated with diarrhoea and a Multilocus Sequence Typing (MLST) method has been performed for subtyping. In the present work, MLST was used to analyse the genetic diversity of eight strains of Campylobacter coli. Nineteen genetic markers were amplified for MLST analysis: AnsB, DmsA, ggt, Cj1585c, CJJ81176-1367/1371, Tlp7, cj1321-cj1326, fucP, cj0178, cj0755/cfrA, ceuE, pldA, cstII, cstIII. After comparing the obtained sequences with the Campylobacter MLST database, the allele numbers, sequence types (STs) and clonal complexes (CCs) were assigned. The 8 C. coli isolates yielded 4 different STs belonging to 2 CCs. Seven isolates belong to the ST-828 clonal complex and only one isolate belongs to ST-21. Two samples came from the same patient, but were isolated in two different periods of time. MLST can be useful for taxonomic characterization of C. coli isolates.

  6. Adherent and Invasive Escherichia coli Is Associated with Granulomatous Colitis in Boxer Dogs

    PubMed Central

    Simpson, Kenneth W.; Dogan, Belgin; Rishniw, Mark; Goldstein, Richard E.; Klaessig, Suzanne; McDonough, Patrick L.; German, Alex J.; Yates, Robin M.; Russell, David G.; Johnson, Susan E.; Berg, Douglas E.; Harel, Josee; Bruant, Guillaume; McDonough, Sean P.; Schukken, Ynte H.

    2006-01-01

    The mucosa-associated microflora is increasingly considered to play a pivotal role in the pathogenesis of inflammatory bowel disease. This study explored the possibility that an abnormal mucosal flora is involved in the etiopathogenesis of granulomatous colitis of Boxer dogs (GCB). Colonic biopsy samples from affected dogs (n = 13) and controls (n = 38) were examined by fluorescent in situ hybridization (FISH) with a eubacterial 16S rRNA probe. Culture, 16S ribosomal DNA sequencing, and histochemistry were used to guide subsequent FISH. GCB-associated Escherichia coli isolates were evaluated for their ability to invade and persist in cultured epithelial cells and macrophages as well as for serotype, phylogenetic group, genome size, overall genotype, and presence of virulence genes. Intramucosal gram-negative coccobacilli were present in 100% of GCB samples but not controls. Invasive bacteria hybridized with FISH probes to E. coli. Three of four GCB-associated E. coli isolates adhered to, invaded, and replicated within cultured epithelial cells. Invasion triggered a “splash”-type response, was decreased by cytochalasin D, genistein, colchicine, and wortmannin, and paralleled the behavior of the Crohn's disease-associated strain E. coli LF 82. GCB E. coli and LF 82 were diverse in serotype and overall genotype but similar in phylogeny (B2 and D), in virulence gene profiles (fyuA, irp1, irp2, chuA, fepC, ibeA, kpsMII, iss), in having a larger genome size than commensal E. coli, and in the presence of novel multilocus sequence types. We conclude that GCB is associated with selective intramucosal colonization by E. coli. E. coli strains associated with GCB and Crohn's disease have an adherent and invasive phenotype and novel multilocus sequence types and resemble E. coli associated with extraintestinal disease in phylogeny and virulence gene profile. PMID:16861666

  7. A Two-Step Bayesian Approach for Propensity Score Analysis: Simulations and Case Study.

    PubMed

    Kaplan, David; Chen, Jianshen

    2012-07-01

    A two-step Bayesian propensity score approach is introduced that incorporates prior information in the propensity score equation and outcome equation without the problems associated with simultaneous Bayesian propensity score approaches. The corresponding variance estimators are also provided. The two-step Bayesian propensity score is provided for three methods of implementation: propensity score stratification, weighting, and optimal full matching. Three simulation studies and one case study are presented to elaborate the proposed two-step Bayesian propensity score approach. Results of the simulation studies reveal that greater precision in the propensity score equation yields better recovery of the frequentist-based treatment effect. A slight advantage is shown for the Bayesian approach in small samples. Results also reveal that greater precision around the wrong treatment effect can lead to seriously distorted results. However, greater precision around the correct treatment effect parameter yields quite good results, with slight improvement seen with greater precision in the propensity score equation. A comparison of coverage rates for the conventional frequentist approach and proposed Bayesian approach is also provided. The case study reveals that credible intervals are wider than frequentist confidence intervals when priors are non-informative.

  8. Multi-Locus Sequence Typing of Bartonella henselae Isolates from Three Continents Reveals Hypervirulent and Feline-Associated Clones

    PubMed Central

    Arvand, Mardjan; Feil, Edward J.; Giladi, Michael; Boulouis, Henri-Jean; Viezens, Juliane

    2007-01-01

    Bartonella henselae is a zoonotic pathogen and the causative agent of cat scratch disease and a variety of other disease manifestations in humans. Previous investigations have suggested that a limited subset of B. henselae isolates may be associated with human disease. In the present study, 182 human and feline B. henselae isolates from Europe, North America and Australia were analysed by multi-locus sequence typing (MLST) to detect any associations between sequence type (ST), host species and geographical distribution of the isolates. A total of 14 sequence types were detected, but over 66% (16/24) of the isolates recovered from human disease corresponded to a single genotype, ST1, and this type was detected in all three continents. In contrast, 27.2% (43/158) of the feline isolates corresponded to ST7, but this ST was not recovered from humans and was restricted to Europe. The difference in host association of STs 1 (human) and 7 (feline) was statistically significant (P≤0.001). eBURST analysis assigned the 14 STs to three clonal lineages, which contained two or more STs, and a singleton comprising ST7. These groups were broadly consistent with a neighbour-joining tree, although splits decomposition analysis was indicative of a history of recombination. These data indicate that B. henselae lineages differ in their virulence properties for humans and contribute to a better understanding of the population structure of B. henselae. PMID:18094753

  9. Multilocus phylogeny and antifungal susceptibility of Aspergillus section Circumdati from clinical samples and description of A. pseudosclerotiorum sp. nov.

    USDA-ARS's Scientific Manuscript database

    A multilocus phylogenetic study was carried out to assess the species distribution in a set of 34 clinical isolates of Aspergillus section Circumdati from the USA and their in vitro antifungal susceptibility were determined against eight antifungal drugs. The genetic markers used were ITS, BenA, CaM...

  10. Rickettsia asembonensis Characterization by Multilocus Sequence Typing of Complete Genes, Peru.

    PubMed

    Loyola, Steev; Flores-Mendoza, Carmen; Torre, Armando; Kocher, Claudine; Melendrez, Melanie; Luce-Fedrow, Alison; Maina, Alice N; Richards, Allen L; Leguia, Mariana

    2018-05-01

    While studying rickettsial infections in Peru, we detected Rickettsia asembonensis in fleas from domestic animals. We characterized 5 complete genomic regions (17kDa, gltA, ompA, ompB, and sca4) and conducted multilocus sequence typing and phylogenetic analyses. The molecular isolate from Peru is distinct from the original R. asembonensis strain from Kenya.

  11. Evolution of the Staphylococcus argenteus ST2250 Clone in Northeastern Thailand Is Linked with the Acquisition of Livestock-Associated Staphylococcal Genes

    PubMed Central

    Jamrozy, Dorota; Mostowy, Rafal; Anderson, Annaliesa; Nickerson, Emma K.; Thaipadungpanit, Janjira; Wuthiekanun, Vanaporn; Limmathurotsakul, Direk; Tandhavanant, Sarunporn; Wikraiphat, Chanthiwa; Wongsuvan, Gumphol; Teerawattanasook, Nittaya; Jutrakul, Yaowaruk; Srisurat, Nuttiya; Chaimanee, Prajuab; Eoin West, T.; Blane, Beth; Peacock, Sharon J.

    2017-01-01

    ABSTRACT Staphylococcus argenteus is a newly named species previously described as a divergent lineage of Staphylococcus aureus that has recently been shown to have a global distribution. Despite growing evidence of the clinical importance of this species, knowledge about its population epidemiology and genomic architecture is limited. We used whole-genome sequencing to evaluate and compare S. aureus (n = 251) and S. argenteus (n = 68) isolates from adults with staphylococcal sepsis at several hospitals in northeastern Thailand between 2006 and 2013. The majority (82%) of the S. argenteus isolates were of multilocus sequence type 2250 (ST2250). S. aureus was more diverse, although 43% of the isolates belonged to ST121. Bayesian analysis suggested an S. argenteus ST2250 substitution rate of 4.66 (95% confidence interval [CI], 3.12 to 6.38) mutations per genome per year, which was comparable to the S. aureus ST121 substitution rate of 4.07 (95% CI, 2.61 to 5.55). S. argenteus ST2250 emerged in Thailand an estimated 15 years ago, which contrasts with the S. aureus ST1, ST88, and ST121 clades that emerged around 100 to 150 years ago. Comparison of S. argenteus ST2250 genomes from Thailand and a global collection indicated a single introduction into Thailand, followed by transmission to local and more distant countries in Southeast Asia and further afield. S. argenteus and S. aureus shared around half of their core gene repertoire, indicating a high level of divergence and providing strong support for their classification as separate species. Several gene clusters were present in ST2250 isolates but absent from the other S. argenteus and S. aureus study isolates. These included multiple exotoxins and antibiotic resistance genes that have been linked previously with livestock-associated S. aureus, consistent with a livestock reservoir for S. argenteus. 
These genes appeared to be associated with plasmids and mobile genetic elements and may have contributed to the biological success of ST2250. PMID:28679748

  12. Evaluation of calibration efficacy under different levels of uncertainty

    DOE PAGES

    Heo, Yeonsook; Graziano, Diane J.; Guzowski, Leah; ...

    2014-06-10

    This study examines how calibration performs under different levels of uncertainty in model input data. It specifically assesses the efficacy of Bayesian calibration to enhance the reliability of EnergyPlus model predictions. A Bayesian approach can be used to update uncertain values of parameters, given measured energy-use data, and to quantify the associated uncertainty. We assess the efficacy of Bayesian calibration under a controlled virtual-reality setup, which enables rigorous validation of the accuracy of calibration results in terms of both calibrated parameter values and model predictions. Case studies demonstrate the performance of Bayesian calibration of base models developed from audit data with differing levels of detail in building design, usage, and operation.
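
    The idea of updating an uncertain parameter from measured data, as described in the abstract above, can be illustrated with a grid approximation of Bayes' rule. This is only a sketch under stated assumptions: the one-parameter linear "model", the uniform prior, the Gaussian noise level, and the observations are all hypothetical stand-ins, and none of it reflects EnergyPlus or the authors' actual calibration procedure.

```python
import math


def grid_posterior(grid, prior, observations, predict, noise_sd):
    """Normalized posterior weights over a parameter grid.

    Assumes independent Gaussian measurement noise with known noise_sd.
    """
    log_post = []
    for theta, p in zip(grid, prior):
        log_lik = sum(
            -0.5 * ((y - predict(theta, x)) / noise_sd) ** 2
            for x, y in observations
        )
        log_post.append(math.log(p) + log_lik)
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(log_post)
    weights = [math.exp(lp - peak) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]


def toy_model(theta, x):
    """Hypothetical stand-in for a simulation: energy use = theta * driver."""
    return theta * x


# Hypothetical setup: uniform prior on theta over [0.5, 2.0].
grid = [0.5 + 0.01 * i for i in range(151)]
prior = [1.0 / len(grid)] * len(grid)
# Hypothetical (driver, measured energy use) pairs.
observations = [(10.0, 12.1), (20.0, 23.8), (30.0, 36.3)]

posterior = grid_posterior(grid, prior, observations, toy_model, noise_sd=1.0)
post_mean = sum(t * w for t, w in zip(grid, posterior))
post_sd = math.sqrt(sum(w * (t - post_mean) ** 2 for t, w in zip(grid, posterior)))
print(round(post_mean, 2), round(post_sd, 2))
```

    The posterior mean plays the role of the calibrated parameter value, and the posterior standard deviation quantifies the remaining uncertainty. A real building-energy calibration would involve many parameters and typically a sampling method rather than a grid.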

  13. Bayesian flood forecasting methods: A review

    NASA Astrophysics Data System (ADS)

    Han, Shasha; Coulibaly, Paulin

    2017-08-01

    Over the past few decades, floods have been seen as one of the most common and largely distributed natural disasters in the world. If floods could be accurately forecasted in advance, then their negative impacts could be greatly minimized. It is widely recognized that quantification and reduction of uncertainty associated with the hydrologic forecast is of great importance for flood estimation and rational decision making. Bayesian forecasting system (BFS) offers an ideal theoretic framework for uncertainty quantification that can be developed for probabilistic flood forecasting via any deterministic hydrologic model. It provides suitable theoretical structure, empirically validated models and reasonable analytic-numerical computation method, and can be developed into various Bayesian forecasting approaches. This paper presents a comprehensive review on Bayesian forecasting approaches applied in flood forecasting from 1999 till now. The review starts with an overview of fundamentals of BFS and recent advances in BFS, followed with BFS application in river stage forecasting and real-time flood forecasting, then move to a critical analysis by evaluating advantages and limitations of Bayesian forecasting methods and other predictive uncertainty assessment approaches in flood forecasting, and finally discusses the future research direction in Bayesian flood forecasting. Results show that the Bayesian flood forecasting approach is an effective and advanced way for flood estimation, it considers all sources of uncertainties and produces a predictive distribution of the river stage, river discharge or runoff, thus gives more accurate and reliable flood forecasts. Some emerging Bayesian forecasting methods (e.g. ensemble Bayesian forecasting system, Bayesian multi-model combination) were shown to overcome limitations of single model or fixed model weight and effectively reduce predictive uncertainty. 
In recent years, various Bayesian flood forecasting approaches have been developed and widely applied, but there is still room for improvements. Future research in the context of Bayesian flood forecasting should be on assimilation of various sources of newly available information and improvement of predictive performance assessment methods.

  14. A comprehensive multilocus phylogeny for the wood-warblers and a revised classification of the Parulidae (Aves)

    USGS Publications Warehouse

    Lovette, I.J.; Perez-Eman, J. L.; Sullivan, J.P.; Banks, R.C.; Fiorentino, I.; Cordoba-Cordoba, S.; Echeverry-Galvis, M.; Barker, F.K.; Burns, K.J.; Klicka, J.; Lanyon, Scott M.; Bermingham, E.

    2010-01-01

    The birds in the family Parulidae, commonly termed the New World warblers or wood-warblers, are a classic model radiation for studies of ecological and behavioral differentiation. Although the monophyly of a 'core' wood-warbler clade is well established, no phylogenetic hypothesis for this group has included a full sampling of wood-warbler species diversity. We used parsimony, maximum likelihood, and Bayesian methods to reconstruct relationships among all genera and nearly all wood-warbler species, based on a matrix of mitochondrial DNA (5840 nucleotides) and nuclear DNA (6 loci, 4602 nucleotides) characters. The resulting phylogenetic hypotheses provide a highly congruent picture of wood-warbler relationships, and indicate that the traditional generic classification of these birds recognizes many non-monophyletic groups. We recommend a revised taxonomy in which each of 14 genera (Seiurus, Helmitheros, Mniotilta, Limnothlypis, Protonotaria, Parkesia, Vermivora, Oreothlypis, Geothlypis, Setophaga, Myioborus, Cardellina, Basileuterus, Myiothlypis) corresponds to a well-supported clade; these nomenclatural changes also involve subsuming a number of well-known, traditional wood-warbler genera (Catharopeza, Dendroica, Ergaticus, Euthlypis, Leucopeza, Oporornis, Parula, Phaeothlypis, Wilsonia). We provide a summary phylogenetic hypothesis that will be broadly applicable to investigations of the historical biogeography, processes of diversification, and evolution of trait variation in this well studied avian group. © 2010 Elsevier Inc.

  15. Investigating Processes of Neotropical Rain Forest Tree Diversification By Examining the Evolution and Historical Biogeography of the Protieae (BURSERACEAE)

    NASA Astrophysics Data System (ADS)

    Fine, P.; Zapata, F.; Daly, D.

    2014-12-01

    Andean uplift and the collision of North and South America are thought to have major implications for the diversification of the Neotropical biota. However, few studies have investigated how these geological events may have influenced diversification. We present a multilocus phylogeny of 102 Protieae taxa (73% of published species), sampled pantropically, to test hypotheses about the relative importance of dispersal, vicariance, habitat specialization, and biotic factors in the diversification of this ecologically dominant tribe of Neotropical trees. Bayesian fossil-calibrated analyses date the Protieae stem at 55 Mya. Biogeographic analyses reconstruct an initial late Oligocene/early Miocene radiation in Amazonia for Neotropical Protieae, with several subsequent late Miocene dispersal events to Central America, the Caribbean, Brazil's Atlantic Forest, and the Chocó. Regional phylogenetic structure results indicate frequent dispersal among regions throughout the Miocene and many instances of more recent regional in situ speciation. Habitat specialization to white sand or flooded soils was common, especially in Amazonia. There was one significant increase in diversification rate coincident with colonization of the Neotropics, followed by a gradual decrease consistent with models of diversity-dependent cladogenesis. Dispersal, biotic interactions, and habitat specialization are thus hypothesized to be the most important processes underlying the diversification of the Protieae.

  16. Multilocus microsatellite typing shows three different genetic clusters of Leishmania major in Iran.

    PubMed

    Mahnaz, Tashakori; Al-Jawabreh, Amer; Kuhls, Katrin; Schönian, Gabriele

    2011-10-01

    Ten polymorphic microsatellite markers were used to analyse 25 strains of Leishmania major collected from cutaneous leishmaniasis cases in different endemic areas in Iran. Nine of the markers were polymorphic, revealing 21 different genotypes. The data displayed significant microsatellite polymorphism with rare allelic heterozygosity. Bayesian statistic and distance based analyses identified three genetic clusters among the 25 strains analysed. Cluster I represented mainly strains isolated in the west and south-west of Iran, with the exception of four strains originating from central Iran. Cluster II comprised strains from the central part of Iran, and cluster III included only strains from north Iran. The geographical distribution of L. major in Iran was supported by comparing the microsatellite profiles of the 25 Iranian strains to those of 105 strains collected in 19 Asian and African countries. The Iranian clusters I and II were separated from three previously described populations comprising strains from Africa, the Middle East and Central Asia whereas cluster III grouped together with the Central Asian population. The considerable genetic variability of L. major might be related to the existence of different populations of Phlebotomus papatasi and/or to differences in reservoir host abundance in different parts of Iran. Copyright © 2011 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  17. Hybridization and population structure of the Culex pipiens complex in the islands of Macaronesia

    PubMed Central

    Gomes, Bruno; Alves, Joana; Sousa, Carla A; Santa-Ana, Marta; Vieira, Inês; Silva, Teresa L; Almeida, António PG; Donnelly, Martin J; Pinto, João

    2012-01-01

    The Culex pipiens complex includes two widespread mosquito vector species, Cx. pipiens and Cx. quinquefasciatus. The distribution of these species varies in latitude, with the former being present in temperate regions and the latter in tropical and subtropical regions. However, their distribution range overlaps in certain areas and interspecific hybridization has been documented. Genetic introgression between these species may have epidemiological repercussions for West Nile virus (WNV) transmission. Bayesian clustering analysis based on multilocus genotypes of 12 microsatellites was used to determine levels of hybridization between these two species in Macaronesian islands, the only contact zone described in West Africa. The distribution of the two species reflects both the islands' biogeography and historical aspects of human colonization. Madeira Island displayed a homogenous population of Cx. pipiens, whereas Cape Verde showed a more intriguing scenario with extensive hybridization. In the islands of Brava and Santiago, only Cx. quinquefasciatus was found, while in Fogo and Maio high hybrid rates (∼40%) between the two species were detected. Within the admixed populations, second-generation hybrids (∼50%) were identified suggesting a lack of isolation mechanisms. The observed levels of hybridization may locally potentiate the transmission to humans of zoonotic arboviruses such as WNV. PMID:22957190

  18. Re-Evaluation of Phylogenetic Relationships among Species of the Mangrove Genus Avicennia from Indo-West Pacific Based on Multilocus Analyses.

    PubMed

    Li, Xinnian; Duke, Norman C; Yang, Yuchen; Huang, Lishi; Zhu, Yuxiang; Zhang, Zhang; Zhou, Renchao; Zhong, Cairong; Huang, Yelin; Shi, Suhua

    2016-01-01

    Avicennia L. (Avicenniaceae), one of the most diverse mangrove genera, is distributed widely in tropical and subtropical intertidal zones worldwide. Five species of Avicennia in the Indo-West Pacific region have been previously described. However, their phylogenetic relationships were determined based on morphological and allozyme data. To enhance our understanding of evolutionary patterns in the clade, we carried out a molecular phylogenetic study using wide sampling and multiple loci. Our results support two monophyletic clades across all species worldwide in Avicennia: an Atlantic-East Pacific (AEP) lineage and an Indo-West Pacific (IWP) lineage. This split is in line with biogeographic distribution of the clade. Focusing on the IWP branch, we reconstructed a detailed phylogenetic tree based on sequences from 25 nuclear genes. The results identified three distinct subclades, (1) A. rumphiana and A. alba, (2) A. officinalis and A. integra, and (3) the A. marina complex, with high bootstrap support. The results strongly corresponded to two morphological traits in floral structure: stigma position in relation to the anthers and style length. Using Bayesian dating methods we estimated diversification of the IWP lineage was dated to late Miocene (c. 6.0 million years ago) and may have been driven largely by the fluctuating sea levels since that time.

  19. Re-Evaluation of Phylogenetic Relationships among Species of the Mangrove Genus Avicennia from Indo-West Pacific Based on Multilocus Analyses

    PubMed Central

    Li, Xinnian; Duke, Norman C.; Yang, Yuchen; Huang, Lishi; Zhu, Yuxiang; Zhang, Zhang; Zhou, Renchao; Zhong, Cairong; Huang, Yelin; Shi, Suhua

    2016-01-01

    Avicennia L. (Avicenniaceae), one of the most diverse mangrove genera, is distributed widely in tropical and subtropical intertidal zones worldwide. Five species of Avicennia in the Indo-West Pacific region have been previously described. However, their phylogenetic relationships were determined based on morphological and allozyme data. To enhance our understanding of evolutionary patterns in the clade, we carried out a molecular phylogenetic study using wide sampling and multiple loci. Our results support two monophyletic clades across all species worldwide in Avicennia: an Atlantic-East Pacific (AEP) lineage and an Indo-West Pacific (IWP) lineage. This split is in line with biogeographic distribution of the clade. Focusing on the IWP branch, we reconstructed a detailed phylogenetic tree based on sequences from 25 nuclear genes. The results identified three distinct subclades, (1) A. rumphiana and A. alba, (2) A. officinalis and A. integra, and (3) the A. marina complex, with high bootstrap support. The results strongly corresponded to two morphological traits in floral structure: stigma position in relation to the anthers and style length. Using Bayesian dating methods we estimated diversification of the IWP lineage was dated to late Miocene (c. 6.0 million years ago) and may have been driven largely by the fluctuating sea levels since that time. PMID:27716800

  20. Nuclear microsatellites reveal contrasting patterns of genetic structure between western and southeastern European populations of the common ash (Fraxinus excelsior L.).

    PubMed

    Heuertz, Myriam; Hausman, Jean-François; Hardy, Olivier J; Vendramin, Giovanni G; Frascaria-Lacoste, Nathalie; Vekemans, Xavier

    2004-05-01

    To determine extant patterns of population genetic structure in common ash and gain insight into postglacial recolonization processes, we applied multilocus-based Bayesian approaches to data from 36 European populations genotyped at five nuclear microsatellite loci. We identified two contrasting patterns in terms of population genetic structure: (1) a large area from the British Isles to Lithuania throughout central Europe constituted effectively a single deme, whereas (2) strong genetic differentiation occurred over short distances in Sweden and southeastern Europe. Concomitant geographical variation was observed in estimates of allelic richness and genetic diversity, which were lowest in populations from southeastern Europe, that is, in regions close to putative ice age refuges, but high in western and central Europe, that is, in more recently recolonized areas. We suggest that in southeastern Europe, restricted postglacial gene flow caused by a rapid expansion of refuge populations in a mountainous topography is responsible for the observed strong genetic structure. In contrast, admixture of previously differentiated gene pools and high gene flow at the onset of postglacial recolonization of western and central Europe would have homogenized the genetic structure and raised the levels of genetic diversity above values in the refuges.

  1. Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

    PubMed

    Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay

    2016-01-01

    There has been a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both within and across countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequence typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. The measure of genetic differentiation (FST) between Indian and rest-of-world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggests the dispersion of B. pseudomallei across the south western coastal part of India with limited geographical clustering. However, the majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.
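
The tallying step described in this abstract (assigning each seven-locus allelic profile to a sequence type and reporting its frequency) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: only the profile for ST 1368 comes from the abstract, while the lookup table, the function names, and the second profile are hypothetical.

```python
from collections import Counter

# Only this entry is taken from the abstract; a real lookup table would be far larger.
KNOWN_PROFILES = {(1, 4, 6, 4, 1, 1, 3): 1368}

def assign_st(profile, known=KNOWN_PROFILES):
    """Return the sequence type for a seven-locus profile, or None if it is not in the table."""
    return known.get(tuple(profile))

def st_frequencies(profiles):
    """Map each sequence type to (count, fraction of all isolates)."""
    counts = Counter(assign_st(p) for p in profiles)
    total = len(profiles)
    return {st: (n, n / total) for st, n in counts.items()}

# Hypothetical isolate set: 15 isolates with the ST 1368 profile, 17 with an unlisted profile.
isolates = [(1, 4, 6, 4, 1, 1, 3)] * 15 + [(9, 9, 9, 9, 9, 9, 9)] * 17
print(st_frequencies(isolates)[1368])
```

In this toy table every unlisted profile is grouped under `None`; a real analysis would assign each novel profile its own identifier.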

  2. Association of variants in innate immune genes with asthma and eczema

    PubMed Central

    Sharma, Sunita; Poon, Audrey; Himes, Blanca E.; Lasky-Su, Jessica; Sordillo, Joanne E.; Belanger, Kathleen; Milton, Donald K.; Bracken, Michael B.; Triche, Elizabeth W.; Leaderer, Brian P.; Gold, Diane R.; Litonjua, Augusto A.

    2012-01-01

    Background: The innate immune pathway is important in the pathogenesis of asthma and eczema. However, only a few variants in these genes have been associated with either disease. We investigate the association between polymorphisms of genes in the innate immune pathway with childhood asthma and eczema. In addition, we compare individual associations with those discovered using a multivariate approach. Methods: Using a novel method, case control based association testing (C2BAT), 569 single nucleotide polymorphisms (SNPs) in 44 innate immune genes were tested for association with asthma and eczema in children from the Boston Home Allergens and Asthma Study and the Connecticut Childhood Asthma Study. The screening algorithm was used to identify the top SNPs associated with asthma and eczema. We next investigated the interaction of innate immune variants with asthma and eczema risk using Bayesian networks. Results: After correction for multiple comparisons, 7 SNPs in 6 genes (CARD25, TGFB1, LY96, ACAA1, DEFB1, and IFNG) were associated with asthma (adjusted p-value < 0.02), while 5 SNPs in 3 different genes (CD80, STAT4, and IRAKI) were significantly associated with eczema (adjusted p-value < 0.02). None of these SNPs were associated with both asthma and eczema. Bayesian network analysis identified 4 SNPs that were predictive of asthma and 10 SNPs that predicted eczema. Of the genes identified using Bayesian networks, only CD80 was associated with eczema in the single-SNP study. Using novel methodology that allows for screening and replication in the same population, we have identified associations of innate immune genes with asthma and eczema. Bayesian network analysis suggests that additional SNPs influence disease susceptibility via SNP interactions. Conclusion: Our findings suggest that innate immune genes contribute to the pathogenesis of asthma and eczema, and that these diseases likely have different genetic determinants. PMID:22192168

  3. Prediction of Individual Serum Infliximab Concentrations in Inflammatory Bowel Disease by a Bayesian Dashboard System.

    PubMed

    Eser, Alexander; Primas, Christian; Reinisch, Sieglinde; Vogelsang, Harald; Novacek, Gottfried; Mould, Diane R; Reinisch, Walter

    2018-01-30

    Despite a robust exposure-response relationship of infliximab in inflammatory bowel disease (IBD), attempts to adjust dosing to individually predicted serum concentrations of infliximab (SICs) are lacking. Compared with labor-intensive conventional software for pharmacokinetic (PK) modeling (eg, NONMEM) dashboards are easy-to-use programs incorporating complex Bayesian statistics to determine individual pharmacokinetics. We evaluated various infliximab detection assays and the number of samples needed to precisely forecast individual SICs using a Bayesian dashboard. We assessed long-term infliximab retention in patients being dosed concordantly versus discordantly with Bayesian dashboard recommendations. Three hundred eighty-two serum samples from 117 adult IBD patients on infliximab maintenance therapy were analyzed by 3 commercially available assays. Data from each assay was modeled using NONMEM and a Bayesian dashboard. PK parameter precision and residual variability were assessed. Forecast concentrations from both systems were compared with observed concentrations. Infliximab retention was assessed by prediction for dose intensification via Bayesian dashboard versus real-life practice. Forecast precision of SICs varied between detection assays. At least 3 SICs from a reliable assay are needed for an accurate forecast. The Bayesian dashboard performed similarly to NONMEM to predict SICs. Patients dosed concordantly with Bayesian dashboard recommendations had a significantly longer median drug survival than those dosed discordantly (51.5 versus 4.6 months, P < .0001). The Bayesian dashboard helps to assess the diagnostic performance of infliximab detection assays. Three, not single, SICs provide sufficient information for individualized dose adjustment when incorporated into the Bayesian dashboard. Treatment adjusted to forecasted SICs is associated with longer drug retention of infliximab. © 2018, The American College of Clinical Pharmacology.

  4. A Hierarchical Bayesian Procedure for Two-Mode Cluster Analysis

    ERIC Educational Resources Information Center

    DeSarbo, Wayne S.; Fong, Duncan K. H.; Liechty, John; Saxton, M. Kim

    2004-01-01

    This manuscript introduces a new Bayesian finite mixture methodology for the joint clustering of row and column stimuli/objects associated with two-mode asymmetric proximity, dominance, or profile data. That is, common clusters are derived which partition both the row and column stimuli/objects simultaneously into the same derived set of clusters.…

  5. An empirical evaluation of two-stage species tree inference strategies using a multilocus dataset from North American pines

    Treesearch

    Michael DeGiorgio; John Syring; Andrew J. Eckert; Aaron Liston; Richard Cronn; David B. Neale; Noah A. Rosenberg

    2014-01-01

    Background: As it becomes increasingly possible to obtain DNA sequences of orthologous genes from diverse sets of taxa, species trees are frequently being inferred from multilocus data. However, the behavior of many methods for performing this inference has remained largely unexplored. Some methods have been proven to be consistent given certain evolutionary models,...

  6. Exact Bayesian p-values for a test of independence in a 2 × 2 contingency table with missing data.

    PubMed

    Lin, Yan; Lipsitz, Stuart R; Sinha, Debajyoti; Fitzmaurice, Garrett; Lipshultz, Steven

    2017-01-01

    Altham (Altham PME. Exact Bayesian analysis of a 2 × 2 contingency table, and Fisher's "exact" significance test. J R Stat Soc B 1969; 31: 261-269) showed that a one-sided p-value from Fisher's exact test of independence in a 2 × 2 contingency table is equal to the posterior probability of negative association in the 2 × 2 contingency table under a Bayesian analysis using an improper prior. We derive an extension of Fisher's exact test p-value in the presence of missing data, assuming the missing data mechanism is ignorable (i.e., missing at random or completely at random). Further, we propose Bayesian p-values for a test of independence in a 2 × 2 contingency table with missing data using alternative priors; we also present results from a simulation study exploring the Type I error rate and power of the proposed exact test p-values. An example, using data on the association between blood pressure and a cardiac enzyme, is presented to illustrate the methods.
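
As a point of reference for the quantity discussed in this abstract, the one-sided p-value of Fisher's exact test for a complete 2 × 2 table can be computed from the hypergeometric tail. The sketch below covers only that frequentist quantity for fully observed data; it does not implement the missing-data extension or the Bayesian p-values the authors propose, and the function name and example counts are assumptions made for illustration.

```python
from math import comb

def fisher_one_sided(a, b, c, d):
    """One-sided Fisher p-value P(A >= a) for the table [[a, b], [c, d]].

    All margins are treated as fixed, so the top-left cell A follows a
    hypergeometric distribution under independence.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)  # largest value the top-left cell can take
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k) for k in range(a, upper + 1))
    return tail / comb(n, col1)

# Hypothetical counts: 3 of 4 in the first row and 1 of 4 in the second row fall in column 1.
print(fisher_one_sided(3, 1, 1, 3))  # 17/70
```

`math.comb` returns 0 when the lower argument exceeds the upper one, so impossible cell values contribute nothing to the sum.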

  7. Discriminative Bayesian Dictionary Learning for Classification.

    PubMed

    Akhtar, Naveed; Shafait, Faisal; Mian, Ajmal

    2016-12-01

    We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data. The proposed approach infers probability distributions over the atoms of a discriminative dictionary using a finite approximation of the Beta Process. It also computes sets of Bernoulli distributions that associate class labels to the learned dictionary atoms. This association signifies the selection probabilities of the dictionary atoms in the expansion of class-specific data. Furthermore, the non-parametric character of the proposed approach allows it to infer the correct size of the dictionary. We exploit the aforementioned Bernoulli distributions in separately learning a linear classifier. The classifier uses the same hierarchical Bayesian model as the dictionary, which we present along with the analytical inference solution for Gibbs sampling. For classification, a test instance is first sparsely encoded over the learned dictionary and the codes are fed to the classifier. We performed experiments for face and action recognition; and object and scene-category classification using five public datasets and compared the results with state-of-the-art discriminative sparse representation approaches. Experiments show that the proposed Bayesian approach consistently outperforms the existing approaches.

  8. Bayesian sensitivity analysis methods to evaluate bias due to misclassification and missing data using informative priors and external validation data.

    PubMed

    Luta, George; Ford, Melissa B; Bondy, Melissa; Shields, Peter G; Stamey, James D

    2013-04-01

    Recent research suggests that the Bayesian paradigm may be useful for modeling biases in epidemiological studies, such as those due to misclassification and missing data. We used Bayesian methods to perform sensitivity analyses for assessing the robustness of study findings to the potential effect of these two important sources of bias. We used data from a study of the joint associations of radiotherapy and smoking with primary lung cancer among breast cancer survivors. We used Bayesian methods to provide an operational way to combine both validation data and expert opinion to account for misclassification of the two risk factors and missing data. For comparative purposes we considered a "full model" that allowed for both misclassification and missing data, along with alternative models that considered only misclassification or missing data, and the naïve model that ignored both sources of bias. We identified noticeable differences between the four models with respect to the posterior distributions of the odds ratios that described the joint associations of radiotherapy and smoking with primary lung cancer. Despite those differences we found that the general conclusions regarding the pattern of associations were the same regardless of the model used. Overall our results indicate a nonsignificantly decreased lung cancer risk due to radiotherapy among nonsmokers, and a mildly increased risk among smokers. We described easy to implement Bayesian methods to perform sensitivity analyses for assessing the robustness of study findings to misclassification and missing data. Copyright © 2012 Elsevier Ltd. All rights reserved.

  9. Bayesian techniques for analyzing group differences in the Iowa Gambling Task: A case study of intuitive and deliberate decision-makers.

    PubMed

    Steingroever, Helen; Pachur, Thorsten; Šmíra, Martin; Lee, Michael D

    2018-06-01

    The Iowa Gambling Task (IGT) is one of the most popular experimental paradigms for comparing complex decision-making across groups. Most commonly, IGT behavior is analyzed using frequentist tests to compare performance across groups, and to compare inferred parameters of cognitive models developed for the IGT. Here, we present a Bayesian alternative based on Bayesian repeated-measures ANOVA for comparing performance, and a suite of three complementary model-based methods for assessing the cognitive processes underlying IGT performance. The three model-based methods involve Bayesian hierarchical parameter estimation, Bayes factor model comparison, and Bayesian latent-mixture modeling. We illustrate these Bayesian methods by applying them to test the extent to which differences in intuitive versus deliberate decision style are associated with differences in IGT performance. The results show that intuitive and deliberate decision-makers behave similarly on the IGT, and the modeling analyses consistently suggest that both groups of decision-makers rely on similar cognitive processes. Our results challenge the notion that individual differences in intuitive and deliberate decision styles have a broad impact on decision-making. They also highlight the advantages of Bayesian methods, especially their ability to quantify evidence in favor of the null hypothesis, and that they allow model-based analyses to incorporate hierarchical and latent-mixture structures.

  10. Genetic barcodes

    DOEpatents

    Weier, Heinz -Ulrich G

    2015-08-04

    Herein are described multicolor FISH probe sets termed "genetic barcodes" targeting several cancer or disease-related loci to assess gene rearrangements and copy number changes in tumor cells. Two, three or more different fluorophores are used to detect the genetic barcode sections thus permitting unique labeling and multilocus analysis in individual cell nuclei. Gene specific barcodes can be generated and combined to provide both numerical and structural genetic information for these and other pertinent disease associated genes.

  11. 'Food addiction' and its association with a dopaminergic multilocus genetic profile.

    PubMed

    Davis, Caroline; Loxton, Natalie J; Levitan, Robert D; Kaplan, Allan S; Carter, Jacqueline C; Kennedy, James L

    2013-06-13

    Our objective was to employ a novel genetic methodology - whereby functional variants of the dopamine pathway were aggregated to reflect a polygenic liability - in the study of food addiction. We anticipated that the composite index of elevated dopamine signaling (a multilocus genetic profile score [MLGP]) would distinguish those with a designation of food addiction (according to the Yale Food Addiction Scale [YFAS] criteria), and age and weight equivalent controls. Our second aim was to assess whether this index was positively associated with eating-related sub-phenotypes of food addiction (e.g. binge eating and food cravings). Adults (n=120) recruited from the community were solicited for an overeating/overweight study. Eating-behavior questionnaires were completed and a blood sample was taken for genotyping. The YFAS identified 21 participants with food addiction. As predicted, the MLGP score was higher in those with YFAS-diagnosed food addiction, and it correlated positively with binge eating, food cravings, and emotional overeating. We then tested a multiple-mediation model proposing that reward-driven overeating facilitates the relationship between the MLGP score and food addiction. The model was statistically significant, supporting the view that the relationship between a composite genetic index of dopamine signaling and food addiction is mediated by certain aspects of reward-responsive overeating. Copyright © 2013 Elsevier Inc. All rights reserved.
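
The idea of aggregating functional variants into one composite index can be sketched as a simple count across loci. This is an illustration under stated assumptions only: the marker names, the genotypes scored as "high signaling", and the unit weighting are all hypothetical, and the abstract does not specify the study's actual scoring scheme.

```python
# Hypothetical marker panel: locus name -> genotypes scored as "high signaling".
HIGH_SIGNAL_GENOTYPES = {
    "marker_1": {"AA"},
    "marker_2": {"AG", "GG"},
    "marker_3": {"CC"},
}

def mlgp_score(genotypes, panel=HIGH_SIGNAL_GENOTYPES):
    """Count the loci at which a person carries a genotype scored as high signaling."""
    return sum(1 for locus, high in panel.items() if genotypes.get(locus) in high)

# Hypothetical participant: high-signaling genotypes at marker_1 and marker_3 only.
person = {"marker_1": "AA", "marker_2": "AA", "marker_3": "CC"}
print(mlgp_score(person))  # 2
```

A missing genotype simply contributes zero here; a real analysis would need an explicit rule for missing data.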

  12. Evaluation of a Partial Genome Screening of Two Asthma Susceptibility Regions Using Bayesian Network Based Bayesian Multilevel Analysis of Relevance

    PubMed Central

    Antal, Péter; Kiszel, Petra Sz.; Gézsi, András; Hadadi, Éva; Virág, Viktor; Hajós, Gergely; Millinghoffer, András; Nagy, Adrienne; Kiss, András; Semsei, Ágnes F.; Temesi, Gergely; Melegh, Béla; Kisfali, Péter; Széll, Márta; Bikov, András; Gálffy, Gabriella; Tamási, Lilla; Falus, András; Szalai, Csaba

    2012-01-01

    Genetic studies indicate a high number of potential factors related to asthma. Based on earlier linkage analyses we selected the 11q13 and 14q22 asthma susceptibility regions, for which we designed a partial genome screening study using 145 SNPs in 1201 individuals (436 asthmatic children and 765 controls). The results were evaluated with traditional frequentist methods and we applied a new statistical method, called Bayesian network based Bayesian multilevel analysis of relevance (BN-BMLA). This method uses Bayesian network representation to provide detailed characterization of the relevance of factors, such as joint significance, the type of dependency, and multi-target aspects. We estimated posteriors for these relations within the Bayesian statistical framework, in order to estimate the posteriors whether a variable is directly relevant or its association is only mediated. With frequentist methods one SNP (rs3751464 in the FRMD6 gene) provided evidence for an association with asthma (OR = 1.43 (1.2–1.8); p = 3×10⁻⁴). The possible role of the FRMD6 gene in asthma was also confirmed in an animal model and human asthmatics. In the BN-BMLA analysis altogether 5 SNPs in 4 genes were found relevant in connection with asthma phenotype: PRPF19 on chromosome 11, and FRMD6, PTGER2 and PTGDR on chromosome 14. In a subsequent step a partial dataset containing rhinitis and further clinical parameters was used, which allowed the analysis of relevance of SNPs for asthma and multiple targets. These analyses suggested that SNPs in the AHNAK and MS4A2 genes were indirectly associated with asthma. This paper indicates that BN-BMLA explores the relevant factors more comprehensively than traditional statistical methods and extends the scope of strong relevance based methods to include partial relevance, global characterization of relevance and multi-target relevance. PMID:22432035

  13. Haplotypes in CCR5-CCR2, CCL3 and CCL5 are associated with natural resistance to HIV-1 infection in a Colombian cohort.

    PubMed

    Vega, Jorge A; Villegas-Ospina, Simón; Aguilar-Jiménez, Wbeimar; Rugeles, María T; Bedoya, Gabriel; Zapata, Wildeman

    2017-06-01

    Variants in genes encoding HIV-1 co-receptors and their natural ligands have been individually associated with natural resistance to HIV-1 infection. However, the simultaneous presence of these variants has been poorly studied. The aim was to evaluate the association of single and multilocus haplotypes in genes coding for the viral co-receptors CCR5 and CCR2, and their ligands CCL3 and CCL5, with resistance or susceptibility to HIV-1 infection. Nine variants in CCR5-CCR2, two SNPs in CCL3 and two in CCL5 were genotyped by PCR-RFLP in 35 seropositive (cases) and 49 HIV-1-exposed seronegative Colombian individuals (controls). Haplotypes were inferred using the Arlequin software, and their frequency in individual or combined loci was compared between cases and controls by the chi-square test. A p' value <0.05 after Bonferroni correction was considered significant. Homozygosis of the human haplogroup (HH) E was absent in controls and frequent in cases, showing a tendency to susceptibility. The haplotypes C-C and T-T in CCL3 were associated with susceptibility (p'=0.016) and resistance (p'<0.0001) to HIV-1 infection, respectively. Finally, in multilocus analysis, the haplotype combinations formed by HHC in CCR5-CCR2, T-T in CCL3 and G-C in CCL5 were associated with resistance (p'=0.006). Our results suggest that specific combinations of variants in genes from the same signaling pathway can define an HIV-1 resistant phenotype. Despite our small sample size, our statistically significant associations suggest strong effects; however, these results should be further validated in larger cohorts.
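
The comparison described here, a chi-square test between cases and controls followed by a Bonferroni correction, can be sketched for a single 2 × 2 table as below. The counts are hypothetical (only the group sizes of 35 and 49 come from the abstract), the number of tests is an assumption, and the sketch uses the Pearson statistic without continuity correction, which may differ from what the authors ran.

```python
from math import erfc, sqrt

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic and p-value (1 degree of freedom) for [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom the upper tail probability is erfc(sqrt(chi2 / 2)).
    return chi2, erfc(sqrt(chi2 / 2))

def bonferroni(p, n_tests):
    """Bonferroni-adjusted p-value, capped at 1."""
    return min(1.0, p * n_tests)

# Hypothetical carrier counts: 20 of 35 cases and 10 of 49 controls carry a haplotype.
chi2, p = chi_square_2x2(20, 15, 10, 39)
print(chi2, p, bonferroni(p, n_tests=10))
```

With small expected cell counts, an exact test would usually be preferred over the chi-square approximation.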

  14. Population forecasts for Bangladesh, using a Bayesian methodology.

    PubMed

    Mahsin, Md; Hossain, Syed Shahadat

    2012-12-01

    Population projection for many developing countries can be quite challenging for demographers, mostly because of a lack of reliable data. The objective of this paper is to present an overview of the existing methods for population forecasting and to propose an alternative based on the Bayesian statistics, combining the formality of inference. The analysis has been made using Markov Chain Monte Carlo (MCMC) technique for Bayesian methodology available with the software WinBUGS. Convergence diagnostic techniques available with the WinBUGS software have been applied to ensure the convergence of the chains necessary for the implementation of MCMC. The Bayesian approach allows for the use of observed data and expert judgements by means of appropriate priors, and more realistic population forecasts, along with associated uncertainty, have been possible.

  15. Additive Genetic Variability and the Bayesian Alphabet

    PubMed Central

    Gianola, Daniel; de los Campos, Gustavo; Hill, William G.; Manfredi, Eduardo; Fernando, Rohan

    2009-01-01

    The use of all available molecular markers in statistical models for prediction of quantitative traits has led to what could be termed a genomic-assisted selection paradigm in animal and plant breeding. This article provides a critical review of some theoretical and statistical concepts in the context of genomic-assisted genetic evaluation of animals and crops. First, relationships between the (Bayesian) variance of marker effects in some regression models and additive genetic variance are examined under standard assumptions. Second, the connection between marker genotypes and resemblance between relatives is explored, and linkages between a marker-based model and the infinitesimal model are reviewed. Third, issues associated with the use of Bayesian models for marker-assisted selection, with a focus on the role of the priors, are examined from a theoretical angle. The sensitivity of a Bayesian specification that has been proposed (called “Bayes A”) with respect to priors is illustrated with a simulation. Methods that can solve potential shortcomings of some of these Bayesian regression procedures are discussed briefly. PMID:19620397

  16. Applications of Bayesian Statistics to Problems in Gamma-Ray Bursts

    NASA Technical Reports Server (NTRS)

    Meegan, Charles A.

    1997-01-01

    This presentation will describe two applications of Bayesian statistics to Gamma-Ray Bursts (GRBs). The first attempts to quantify the evidence for a cosmological versus galactic origin of GRBs using only the observations of the dipole and quadrupole moments of the angular distribution of bursts. The cosmological hypothesis predicts isotropy, while the galactic hypothesis is assumed to produce a uniform probability distribution over positive values for these moments. The observed isotropic distribution indicates that the Bayes factor for the cosmological hypothesis over the galactic hypothesis is about 300. Another application of Bayesian statistics is in the estimation of chance associations of optical counterparts with galaxies. The Bayesian approach is preferred to frequentist techniques here because the Bayesian approach easily accounts for galaxy mass distributions and because one can incorporate three disjoint hypotheses: (1) bursts come from galactic centers, (2) bursts come from galaxies in proportion to luminosity, and (3) bursts do not come from external galaxies. This technique was used in the analysis of the optical counterpart to GRB970228.
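
To make the reported Bayes factor concrete, the sketch below converts a Bayes factor into a posterior probability using the standard relation posterior odds = Bayes factor × prior odds. The value 300 is taken from the abstract; the prior odds of 1 (equal prior weight on the two hypotheses) and the function name are assumptions made for illustration.

```python
def posterior_probability(bayes_factor, prior_odds=1.0):
    """Posterior probability of hypothesis 1 when exactly two hypotheses are compared."""
    posterior_odds = bayes_factor * prior_odds
    return posterior_odds / (1.0 + posterior_odds)

# Bayes factor of about 300 from the abstract, with assumed equal prior odds.
print(posterior_probability(300))  # 300/301, roughly 0.9967
```

Under a sceptical prior with odds of 1 to 100 against the cosmological hypothesis, the same Bayes factor would give posterior odds of only 3 to 1, which shows how strongly the conclusion depends on the prior.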

  17. Itô-SDE MCMC method for Bayesian characterization of errors associated with data limitations in stochastic expansion methods for uncertainty quantification

    NASA Astrophysics Data System (ADS)

    Arnst, M.; Abello Álvarez, B.; Ponthot, J.-P.; Boman, R.

    2017-11-01

    This paper is concerned with the characterization and the propagation of errors associated with data limitations in polynomial-chaos-based stochastic methods for uncertainty quantification. Such an issue can arise in uncertainty quantification when only a limited amount of data is available. When the available information does not suffice to accurately determine the probability distributions that must be assigned to the uncertain variables, the Bayesian method for assigning these probability distributions becomes attractive because it allows the stochastic model to account explicitly for insufficiency of the available information. In previous work, such applications of the Bayesian method had already been implemented by using the Metropolis-Hastings and Gibbs Markov Chain Monte Carlo (MCMC) methods. In this paper, we present an alternative implementation, which uses an alternative MCMC method built around an Itô stochastic differential equation (SDE) that is ergodic for the Bayesian posterior. We draw together from the mathematics literature a number of formal properties of this Itô SDE that lend support to its use in the implementation of the Bayesian method, and we describe its discretization, including the choice of the free parameters, by using the implicit Euler method. We demonstrate the proposed methodology on a problem of uncertainty quantification in a complex nonlinear engineering application relevant to metal forming.

  18. Multi-locus variable number tandem repeat analysis for Escherichia coli causing extraintestinal infections.

    PubMed

    Manges, Amee R; Tellis, Patricia A; Vincent, Caroline; Lifeso, Kimberley; Geneau, Geneviève; Reid-Smith, Richard J; Boerlin, Patrick

    2009-11-01

    Discriminatory genotyping methods for the analysis of Escherichia coli other than O157:H7 are necessary for public health-related activities. A new multi-locus variable number tandem repeat analysis protocol is presented; this method achieves an index of discrimination of 99.5% and is reproducible and valid when tested on a collection of 836 diverse E. coli.
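
An index of discrimination for a typing method is commonly computed as Simpson's index of diversity in the Hunter-Gaston form, D = 1 − Σ n_j(n_j − 1) / (N(N − 1)), where n_j is the number of isolates of type j and N is the total. The abstract does not state which formula was used, so treating it as this index is an assumption; the function name and example types are also hypothetical.

```python
from collections import Counter

def discrimination_index(types):
    """Simpson's index of diversity (Hunter-Gaston form) for a list of type labels."""
    n = len(types)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(types)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))

# Hypothetical typing result: four isolates, two of which share a type.
print(discrimination_index(["A", "A", "B", "C"]))  # 1 - 2/12
```

The value is the probability that two isolates drawn at random without replacement receive different types, so 1.0 means every isolate was distinguished.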

  19. Quantum state estimation when qubits are lost: a no-data-left-behind approach

    DOE PAGES

    Williams, Brian P.; Lougovski, Pavel

    2017-04-06

    We present an approach to Bayesian mean estimation of quantum states using hyperspherical parametrization and an experiment-specific likelihood which allows utilization of all available data, even when qubits are lost. With this method, we report the first closed-form Bayesian mean and maximum likelihood estimates for the ideal single qubit. Due to computational constraints, we utilize numerical sampling to determine the Bayesian mean estimate for a photonic two-qubit experiment in which our novel analysis reduces burdens associated with experimental asymmetries and inefficiencies. This method can be applied to quantum states of any dimension and experimental complexity.

  20. Structural mapping in statistical word problems: A relational reasoning approach to Bayesian inference.

    PubMed

    Johnson, Eric D; Tubau, Elisabet

    2017-06-01

    Presenting natural frequencies facilitates Bayesian inferences relative to using percentages. Nevertheless, many people, including highly educated and skilled reasoners, still fail to provide Bayesian responses to these computationally simple problems. We show that the complexity of relational reasoning (e.g., the structural mapping between the presented and requested relations) can help explain the remaining difficulties. With a non-Bayesian inference that required identical arithmetic but afforded a more direct structural mapping, performance was universally high. Furthermore, reducing the relational demands of the task through questions that directed reasoners to use the presented statistics, as compared with questions that prompted the representation of a second, similar sample, also significantly improved reasoning. Distinct error patterns were also observed between these presented- and similar-sample scenarios, which suggested differences in relational-reasoning strategies. On the other hand, while higher numeracy was associated with better Bayesian reasoning, higher-numerate reasoners were not immune to the relational complexity of the task. Together, these findings validate the relational-reasoning view of Bayesian problem solving and highlight the importance of considering not only the presented task structure, but also the complexity of the structural alignment between the presented and requested relations.
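
The contrast between natural frequencies and percentages can be illustrated with a small calculation in which both formats yield the same posterior. The numbers are hypothetical and not taken from the study: out of 1000 people, 10 have a condition and 8 of those test positive, while 95 of the 990 without it also test positive.

```python
def posterior_from_frequencies(true_positives, false_positives):
    """Natural-frequency format: the share of positive tests that are true positives."""
    return true_positives / (true_positives + false_positives)

def posterior_from_rates(prior, sensitivity, false_positive_rate):
    """Percentage format: Bayes' rule applied to a base rate and two conditional rates."""
    numerator = prior * sensitivity
    return numerator / (numerator + (1 - prior) * false_positive_rate)

# The same hypothetical problem stated in both formats.
print(posterior_from_frequencies(8, 95))            # 8 / 103
print(posterior_from_rates(0.01, 0.8, 95 / 990))    # same value, more arithmetic
```

The frequency version needs a single division over counts that are already given, whereas the percentage version requires combining three rates.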

  1. Using molecular tools to identify the geographical origin of a case of human brucellosis.

    PubMed

    Muchowski, J K; Koylass, M S; Dainty, A C; Stack, J A; Perrett, L; Whatmore, A M; Perrier, C; Chircop, S; Demicoli, N; Gatt, A B; Caruana, P A; Gopaul, K K

    2015-10-01

    Although Malta is historically linked with the zoonosis brucellosis, there had not been a case of the disease in either the human or livestock population for several years. However, in July 2013 a case of human brucellosis was identified on the island. To determine whether this recent case originated in Malta, four isolates from this case were subjected to molecular analysis. Molecular profiles generated using multilocus sequence analysis and multilocus variable number tandem repeat for the recent human case isolates and 11 Brucella melitensis strains of known Maltese origin were compared with others held on in-house and global databases. While the 11 isolates of Maltese origin formed a distinct cluster, the recent human isolation was not associated with these strains but instead clustered with isolates originating from the Horn of Africa. These data were congruent with epidemiological trace-back, which showed that the individual had travelled to Malta from Eritrea. This work highlights the potential of using molecular typing data to aid in epidemiological trace-back of Brucella isolations and assist in monitoring of the effectiveness of brucellosis control schemes.

  2. Developmental Stability Covaries with Genome-Wide and Single-Locus Heterozygosity in House Sparrows

    PubMed Central

    Vangestel, Carl; Mergeay, Joachim; Dawson, Deborah A.; Vandomme, Viki; Lens, Luc

    2011-01-01

    Fluctuating asymmetry (FA), a measure of developmental instability, has been hypothesized to increase with genetic stress. Despite numerous studies providing empirical evidence for associations between FA and genome-wide properties such as multi-locus heterozygosity, support for single-locus effects remains scant. Here we test if, and to what extent, FA co-varies with single- and multilocus markers of genetic diversity in house sparrow (Passer domesticus) populations along an urban gradient. In line with theoretical expectations, FA was inversely correlated with genetic diversity estimated at genome level. However, this relationship was largely driven by variation at a single key locus. Contrary to our expectations, relationships between FA and genetic diversity were not stronger in individuals from urban populations that experience higher nutritional stress. We conclude that loss of genetic diversity adversely affects developmental stability in P. domesticus, and more generally, that the molecular basis of developmental stability may involve complex interactions between local and genome-wide effects. Further study on the relative effects of single-locus and genome-wide effects on the developmental stability of populations with different genetic properties is therefore needed. PMID:21747940

  3. Molecular characterization of the probiotic strain Bacillus cereus var. toyoi NCIMB 40112 and differentiation from food poisoning strains.

    PubMed

    Klein, Günter

    2011-07-01

    Bacillus cereus var. toyoi strain NCIMB 40112 (Toyocerin), a probiotic authorized in the European Union as feed additive for swine, bovines, poultry, and rabbits, was characterized by DNA fingerprinting applying pulsed-field gel electrophoresis and multilocus sequence typing and was compared with reference strains (of clinical and environmental origins). The probiotic strain was clearly characterized by pulsed-field gel electrophoresis using the restriction enzymes Apa I and Sma I resulting in unique DNA patterns. The comparison to the clinical reference strain B. cereus DSM 4312 was done with the same restriction enzymes, and again a clear differentiation of the two strains was possible by the resulting DNA patterns. The use of the restriction enzymes Apa I and Sma I is recommended for further studies. Furthermore, multilocus sequence typing analysis revealed a sequence type (ST 111) that was different from all known STs of B. cereus strains from food poisoning incidents. Thus, a strain characterization and differentiation from food poisoning strains for the probiotic strain was possible. Copyright ©, International Association for Food Protection

  4. Population genetic characterization of Cyclospora cayetanensis from discrete geographical regions.

    PubMed

    Guo, Yaqiong; Li, Na; Ortega, Ynes R; Zhang, Longxian; Roellig, Dawn M; Feng, Yaoyu; Xiao, Lihua

    2018-01-01

    Cyclospora cayetanensis is an emerging pathogen that is endemic in developing countries and responsible for many large foodborne cyclosporiasis outbreaks in North America since the 1990s. Because of the lack of typing targets, the genetic diversity and population genetics of C. cayetanensis have not been investigated. In this study, we undertook a population genetic analysis of multilocus sequence typing data we recently collected from 64 C. cayetanensis specimens. Despite the extensive genetic heterogeneity in the overall C. cayetanensis population, there were significant intra- and inter-genic linkage disequilibria (LD). A disappearance of LD was observed when only multilocus genotypes were included in the population genetic analysis, indicative of an epidemic nature of C. cayetanensis. Geographical segregation-associated sub-structuring was observed between specimens from China and those from Peru and the United States. The two subpopulations had reduced LD, indicating the likely occurrence of genetic exchange among isolates in endemic areas. Further analyses of specimens from other geographical regions are necessary to fully understand the population genetics of C. cayetanensis. Copyright © 2017 Elsevier Inc. All rights reserved.

  5. Multilocus enzyme electrophoresis and cytochrome B gene sequencing-based identification of Leishmania isolates from different foci of cutaneous leishmaniasis in Pakistan.

    PubMed

    Marco, Jorge D; Bhutto, Abdul M; Soomro, Farooq R; Baloch, Javed H; Barroso, Paola A; Kato, Hirotomo; Uezato, Hiroshi; Katakura, Ken; Korenaga, Masataka; Nonaka, Shigeo; Hashiguchi, Yoshihisa

    2006-08-01

    Seventeen Leishmania stocks isolated from cutaneous lesions of Pakistani patients were studied by multilocus enzyme electrophoresis and by polymerase chain reaction amplification and sequencing of the cytochrome b (Cyt b) gene. Eleven stocks that expressed nine zymodemes were assigned to L. (Leishmania) major. All of them were isolated from patients in the lowlands of Larkana district and Sibi city in Sindh and Balochistan provinces, respectively. The remaining six, distributed in two zymodemes (five and one), isolated from the highland of Quetta city, Balochistan, were identified as L. (L.) tropica. The same result at species level was obtained by the Cyt b sequencing for all the stocks examined. No clear-cut association between the clinical features (wet or dry type lesions) and the Leishmania species involved was found. Leishmania (L.) major was highly polymorphic compared with L. (L.) tropica. This difference may be explained by the fact that humans may act as a sole reservoir of L. (L.) tropica in anthroponotic cycles; however, many wild mammals can be reservoirs of L. (L.) major in zoonotic cycles.

  6. Bayesian forecasting and uncertainty quantifying of stream flows using Metropolis-Hastings Markov Chain Monte Carlo algorithm

    NASA Astrophysics Data System (ADS)

    Wang, Hongrui; Wang, Cheng; Wang, Ying; Gao, Xiong; Yu, Chen

    2017-06-01

    This paper presents a Bayesian approach using the Metropolis-Hastings Markov Chain Monte Carlo algorithm and applies this method to daily river flow rate forecasting and uncertainty quantification for the Zhujiachuan River, using data collected from Qiaotoubao Gage Station and 13 other gage stations in the Zhujiachuan watershed in China. The proposed method is also compared with the conventional maximum likelihood estimation (MLE) for parameter estimation and quantification of associated uncertainties. While the Bayesian method performs similarly in estimating the mean value of daily flow rate, it outperforms the conventional MLE method on uncertainty quantification, providing a relatively narrower reliable interval than the MLE confidence interval and thus more precise estimation by using the related information from regional gage stations. The Bayesian MCMC method might be more favorable in the uncertainty analysis and risk management.
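    As a minimal sketch of the sampler named in this abstract, the following random-walk Metropolis-Hastings routine draws from the posterior of a single mean parameter. It assumes a Gaussian likelihood with known standard deviation and a flat prior, and the five "flow" values are made up for illustration; the paper's actual streamflow model, priors, and data are not reproduced here.

```python
import math
import random


def log_posterior(mu, data, sigma=1.0):
    """Log posterior of mu (flat prior, Gaussian likelihood), up to a constant."""
    return -sum((x - mu) ** 2 for x in data) / (2.0 * sigma ** 2)


def metropolis_hastings(data, n_samples=5000, step=0.5, seed=0):
    """Random-walk Metropolis-Hastings for the mean parameter mu."""
    rng = random.Random(seed)
    mu = 0.0                                  # arbitrary starting value
    current = log_posterior(mu, data)
    samples = []
    for _ in range(n_samples):
        proposal = mu + rng.gauss(0.0, step)  # symmetric proposal
        candidate = log_posterior(proposal, data)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, candidate - current)):
            mu, current = proposal, candidate
        samples.append(mu)
    return samples


# Hypothetical observations (illustrative only, not from the paper).
data = [2.1, 1.9, 2.4, 2.0, 2.2]
samples = metropolis_hastings(data)

kept = sorted(samples[1000:])                 # discard burn-in draws
posterior_mean = sum(kept) / len(kept)
lower = kept[int(0.025 * len(kept))]          # 95% interval from the draws
upper = kept[int(0.975 * len(kept))]
print(posterior_mean, lower, upper)
```

    The interval is read directly off the sorted draws, which is the sense in which an MCMC approach quantifies uncertainty without relying on an asymptotic approximation.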

  7. Multi-Virulence-Locus Sequence Typing of Staphylococcus lugdunensis Generates Results Consistent with a Clonal Population Structure and Is Reliable for Epidemiological Typing

    PubMed Central

    Didi, Jennifer; Lemée, Ludovic; Gibert, Laure; Pons, Jean-Louis

    2014-01-01

    Staphylococcus lugdunensis is an emergent virulent coagulase-negative staphylococcus responsible for severe infections similar to those caused by Staphylococcus aureus. To understand its potentially pathogenic capacity and have further detailed knowledge of the molecular traits of this organism, 93 isolates from various geographic origins were analyzed by multi-virulence-locus sequence typing (MVLST), targeting seven known or putative virulence-associated loci (atlLR2, atlLR3, hlb, isdJ, SLUG_09050, SLUG_16930, and vwbl). The polymorphisms of the putative virulence-associated loci were moderate and comparable to those of the housekeeping genes analyzed by multilocus sequence typing (MLST). However, the MVLST scheme generated 43 virulence types (VTs) compared to 20 sequence types (STs) based on MLST, indicating that MVLST was significantly more discriminating (Simpson's index [D], 0.943). No hypervirulent lineage or cluster specific to carriage strains was defined. The results of multilocus sequence analysis of known and putative virulence-associated loci are consistent with a clonal population structure for S. lugdunensis, suggesting a coevolution of these genes with housekeeping genes. Indeed, the nonsynonymous to synonymous evolutionary substitutions (dN/dS) ratio, the Tajima's D test, and Single-likelihood ancestor counting (SLAC) analysis suggest that all virulence-associated loci were under negative selection, even atlLR2 (AtlL protein) and SLUG_16930 (FbpA homologue), for which the dN/dS ratios were higher. In addition, this analysis of virulence-associated loci allowed us to propose a trilocus sequence typing scheme based on the intragenic regions of atlLR3, isdJ, and SLUG_16930, which is more discriminant than MLST for studying short-term epidemiology and further characterizing the lineages of the rare but highly pathogenic S. lugdunensis. PMID:25078912
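    The discriminatory power quoted above as Simpson's index (D) can be computed from the type assigned to each isolate. The sketch below assumes the Hunter-Gaston form of the index that is commonly used for typing schemes, D = 1 - sum(n_j(n_j - 1)) / (N(N - 1)); the abstract does not state which variant the authors used, and the labels in the example are hypothetical.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity (Hunter-Gaston form, an assumption here).

    type_labels: one typing result (e.g. a VT or ST label) per isolate.
    Returns the probability that two isolates drawn without replacement
    belong to different types.
    """
    n_total = len(type_labels)
    if n_total < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels).values()
    same_type_pairs = sum(n * (n - 1) for n in counts)
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical typing results for four isolates (not data from the study).
d = simpson_diversity(["VT1", "VT1", "VT2", "VT3"])
print(round(d, 3))
```

    A scheme that splits the same isolates into more, smaller groups yields a D closer to 1, which is the basis for comparing MVLST with MLST.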

  8. Wronski Brackets and the Ferris Wheel

    NASA Astrophysics Data System (ADS)

    Martin, Keye

    2005-11-01

    We connect the Bayesian order on classical states to a certain Lie algebra on C^∞[0,1]. This special Lie algebra structure, made precise by an idea we introduce called a Wronski bracket, suggests new phenomena the Bayesian order naturally models. We then study Wronski brackets on associative algebras, and in the commutative case, discover the beautiful result that they are equivalent to derivations.

  9. Inferring metabolic networks using the Bayesian adaptive graphical lasso with informative priors.

    PubMed

    Peterson, Christine; Vannucci, Marina; Karakas, Cemal; Choi, William; Ma, Lihua; Maletić-Savatić, Mirjana

    2013-10-01

    Metabolic processes are essential for cellular function and survival. We are interested in inferring a metabolic network in activated microglia, a major neuroimmune cell in the brain responsible for the neuroinflammation associated with neurological diseases, based on a set of quantified metabolites. To achieve this, we apply the Bayesian adaptive graphical lasso with informative priors that incorporate known relationships between covariates. To encourage sparsity, the Bayesian graphical lasso places double exponential priors on the off-diagonal entries of the precision matrix. The Bayesian adaptive graphical lasso allows each double exponential prior to have a unique shrinkage parameter. These shrinkage parameters share a common gamma hyperprior. We extend this model to create an informative prior structure by formulating tailored hyperpriors on the shrinkage parameters. By choosing parameter values for each hyperprior that shift probability mass toward zero for nodes that are close together in a reference network, we encourage edges between covariates with known relationships. This approach can improve the reliability of network inference when the sample size is small relative to the number of parameters to be estimated. When applied to the data on activated microglia, the inferred network includes both known relationships and associations of potential interest for further investigation.

  10. Inferring metabolic networks using the Bayesian adaptive graphical lasso with informative priors

    PubMed Central

    PETERSON, CHRISTINE; VANNUCCI, MARINA; KARAKAS, CEMAL; CHOI, WILLIAM; MA, LIHUA; MALETIĆ-SAVATIĆ, MIRJANA

    2014-01-01

    Metabolic processes are essential for cellular function and survival. We are interested in inferring a metabolic network in activated microglia, a major neuroimmune cell in the brain responsible for the neuroinflammation associated with neurological diseases, based on a set of quantified metabolites. To achieve this, we apply the Bayesian adaptive graphical lasso with informative priors that incorporate known relationships between covariates. To encourage sparsity, the Bayesian graphical lasso places double exponential priors on the off-diagonal entries of the precision matrix. The Bayesian adaptive graphical lasso allows each double exponential prior to have a unique shrinkage parameter. These shrinkage parameters share a common gamma hyperprior. We extend this model to create an informative prior structure by formulating tailored hyperpriors on the shrinkage parameters. By choosing parameter values for each hyperprior that shift probability mass toward zero for nodes that are close together in a reference network, we encourage edges between covariates with known relationships. This approach can improve the reliability of network inference when the sample size is small relative to the number of parameters to be estimated. When applied to the data on activated microglia, the inferred network includes both known relationships and associations of potential interest for further investigation. PMID:24533172

  11. STBase: one million species trees for comparative biology.

    PubMed

    McMahon, Michelle M; Deepak, Akshay; Fernández-Baca, David; Boss, Darren; Sanderson, Michael J

    2015-01-01

    Comprehensively sampled phylogenetic trees provide the most compelling foundations for strong inferences in comparative evolutionary biology. Mismatches are common, however, between the taxa for which comparative data are available and the taxa sampled by published phylogenetic analyses. Moreover, many published phylogenies are gene trees, which cannot always be adapted immediately for species level comparisons because of discordance, gene duplication, and other confounding biological processes. A new database, STBase, lets comparative biologists quickly retrieve species level phylogenetic hypotheses in response to a query list of species names. The database consists of 1 million single- and multi-locus data sets, each with a confidence set of 1000 putative species trees, computed from GenBank sequence data for 413,000 eukaryotic taxa. Two bodies of theoretical work are leveraged to aid in the assembly of multi-locus concatenated data sets for species tree construction. First, multiply labeled gene trees are pruned to conflict-free singly-labeled species-level trees that can be combined between loci. Second, impacts of missing data in multi-locus data sets are ameliorated by assembling only decisive data sets. Data sets overlapping with the user's query are ranked using a scheme that depends on user-provided weights for tree quality and for taxonomic overlap of the tree with the query. Retrieval times are independent of the size of the database, typically a few seconds. Tree quality is assessed by a real-time evaluation of bootstrap support on just the overlapping subtree. Associated sequence alignments, tree files and metadata can be downloaded for subsequent analysis. STBase provides a tool for comparative biologists interested in exploiting the most relevant sequence data available for the taxa of interest. It may also serve as a prototype for future species tree oriented databases and as a resource for assembly of larger species phylogenies from precomputed trees.

  12. Seroprevalence and first multilocus microsatellite genotyping of Neospora caninum in dairy cattle in Henan, central China.

    PubMed

    Qian, Weifeng; Wang, Tianqi; Yan, Wenchao; Zhang, Min; Han, Lifang; Xue, Rui; Song, Shaofu; Lv, Chaochao

    2017-09-15

    Neospora caninum is one of the important causes of abortion in cattle worldwide, and losses due to neosporosis to the cattle industry are considerable. However, the knowledge of genetic characterization of this parasite is limited. The aim of the present study is to determine the prevalence and genetic characterization of N. caninum from dairy cows in Henan Province, central China. A total of 510 blood samples and 7 aborted fetuses were collected from 8 dairy farms in Henan Province. Serum antibodies to N. caninum were examined by ELISA using a recombinant tNcSRS2 protein as the coating antigen. The overall seroprevalence of N. caninum in dairy cows was 41.2% (210/510). The seropositivity rate of N. caninum in aborting cows (49.3%) was significantly higher than that (29.3%) in non-aborting cows (p<0.05), with an odds ratio of 2.44 (95% CI, 1.61-3.41). A statistical association was also found between farm type and the seropositivity rate of N. caninum infection in cows (p<0.01). N. caninum DNA was detected in 6 of 396 blood samples (1.5%) and 4 of 7 aborted fetuses by nested PCR based on the NC5 gene, and the 10 N. caninum-positive DNA samples were further analyzed by multilocus microsatellite (MS) genotyping for MS4, MS5, MS6A, MS7, MS8, MS10, and MS12. Only 2 samples were successfully genotyped at all genetic loci, and two unique profiles including two novel allelic patterns were identified. To our knowledge, this study is the first report of genetic characterization of N. caninum isolates from naturally infected dairy cows based on multilocus microsatellites (more than 2 loci) in China. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Multi-locus sequence typing of Salmonella enterica serovar Typhimurium isolates from wild birds in northern England suggests host-adapted strain.

    PubMed

    Hughes, L A; Wigley, P; Bennett, M; Chantrey, J; Williams, N

    2010-10-01

    Recent studies have suggested that Salmonella Typhimurium strains associated with mortality in UK garden birds are significantly different from strains that cause disease in humans and livestock and that wild bird strains may be host adapted. However, without further genomic characterization of these strains, it is not possible to determine whether they are host adapted. The aim of this study was to characterize a representative sample of Salm. Typhimurium strains detected in wild garden birds using multi-locus sequence typing (MLST) to investigate evolutionary relationships between them. Multi-locus sequence typing was performed on nine Salm. Typhimurium strains isolated from wild garden birds. Two sequence types were identified, the most common of which was ST568. Examination of the public Salmonella enterica MLST database revealed that only three other ST568 isolates had been cultured from a human in Scotland. Two further isolates of Salm. Typhimurium were determined to be ST19. Results of MLST analysis suggest that there is a predominant strain of Salm. Typhimurium circulating among garden bird populations in the United Kingdom, which is rarely detected in other species, supporting the hypothesis that this strain is host adapted. Host-pathogen evolution is often assumed to lead to pathogens becoming less virulent to avoid the death of their host; however, infection with ST568 led to high mortality rates among the wild birds examined, which were all found dead at wild bird-feeding stations. We hypothesize that by attracting unnaturally high densities of birds, wild bird-feeding stations may facilitate the transmission of ST568 between wild birds, therefore reducing the evolutionary cost of this pathogen killing its host, resulting in a host-adapted strain with increased virulence.

  14. A Bayesian Approach to the Overlap Analysis of Epidemiologically Linked Traits.

    PubMed

    Asimit, Jennifer L; Panoutsopoulou, Kalliope; Wheeler, Eleanor; Berndt, Sonja I; Cordell, Heather J; Morris, Andrew P; Zeggini, Eleftheria; Barroso, Inês

    2015-12-01

    Diseases often co-occur in individuals more often than expected by chance, and may be explained by shared underlying genetic etiology. A common approach to genetic overlap analyses is to use summary genome-wide association study data to identify single-nucleotide polymorphisms (SNPs) that are associated with multiple traits at a selected P-value threshold. However, P-values do not account for differences in power, whereas Bayes' factors (BFs) do, and may be approximated using summary statistics. We use simulation studies to compare the power of frequentist and Bayesian approaches with overlap analyses, and to decide on appropriate thresholds for comparison between the two methods. It is empirically illustrated that BFs have the advantage over P-values of a decreasing type I error rate as study size increases for single-disease associations. Consequently, the overlap analysis of traits from different-sized studies encounters issues in fair P-value threshold selection, whereas BFs are adjusted automatically. Extensive simulations show that Bayesian overlap analyses tend to have higher power than those that assess association strength with P-values, particularly in low-power scenarios. Calibration tables between BFs and P-values are provided for a range of sample sizes, as well as an approximation approach for sample sizes that are not in the calibration table. Although P-values are sometimes thought more intuitive, these tables assist in removing the opaqueness of Bayesian thresholds and may also be used in the selection of a BF threshold to meet a certain type I error rate. An application of our methods is used to identify variants associated with both obesity and osteoarthritis. © 2015 The Authors. Genetic Epidemiology published by Wiley Periodicals, Inc.
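    One widely used way to approximate a Bayes factor from summary statistics is Wakefield's approximate Bayes factor, which needs only an effect estimate and its standard error. The sketch below uses that form; whether the authors use exactly this approximation is an assumption, and the prior variance of 0.04 and the input values are illustrative choices, not values from the paper.

```python
import math


def approx_bayes_factor(beta, se, prior_variance=0.04):
    """Approximate Bayes factor in favour of association (H1 over H0).

    Wakefield-style approximation from summary statistics:
    beta is the estimated effect (e.g. log odds ratio), se its standard
    error, and prior_variance the variance W of a N(0, W) prior on the
    true effect under H1 (an illustrative default).
    """
    v = se ** 2
    z = beta / se
    shrinkage = prior_variance / (v + prior_variance)
    return math.sqrt(v / (v + prior_variance)) * math.exp(0.5 * z ** 2 * shrinkage)


# Same z-score (z = 3) from a smaller study (se = 0.1) and a larger one (se = 0.01).
bf_small_study = approx_bayes_factor(beta=0.30, se=0.10)
bf_large_study = approx_bayes_factor(beta=0.03, se=0.01)
print(bf_small_study, bf_large_study)
```

    At a fixed z-score, and hence a fixed P-value, the evidence for association shrinks as the standard error falls, which is in line with the abstract's point that BFs adjust for study size where a fixed P-value threshold does not.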

  15. Systemic antibiotics in the treatment of aggressive periodontitis. A systematic review and a Bayesian Network meta-analysis.

    PubMed

    Rabelo, Cleverton Correa; Feres, Magda; Gonçalves, Cristiane; Figueiredo, Luciene C; Faveri, Marcelo; Tu, Yu-Kang; Chambrone, Leandro

    2015-07-01

    The aim of this study was to assess the effect of systemic antibiotic therapy on the treatment of aggressive periodontitis (AgP). This study was conducted and reported in accordance with the PRISMA statement. The MEDLINE, EMBASE and CENTRAL databases were searched up to June 2014 for randomized clinical trials comparing the treatment of subjects with AgP with either scaling and root planing (SRP) alone or associated with systemic antibiotics. Bayesian network meta-analysis was prepared using the Bayesian random-effects hierarchical models and the outcomes reported at 6-month post-treatment. Out of 350 papers identified, 14 studies were eligible. Greater gain in clinical attachment (CA) (mean difference [MD]: 1.08 mm; p < 0.0001) and reduction in probing depth (PD) (MD: 1.05 mm; p < 0.00001) were observed for SRP + metronidazole (Mtz), and for SRP + Mtz + amoxicillin (Amx) (MD: 0.45 mm, MD: 0.53 mm, respectively; p < 0.00001) than SRP alone/placebo. Bayesian network meta-analysis showed additional benefits in CA gain and PD reduction when SRP was associated with systemic antibiotics. SRP plus systemic antibiotics led to an additional clinical effect compared with SRP alone in the treatment of AgP. Of the antibiotic protocols available for inclusion into the Bayesian network meta-analysis, Mtz and Mtz/Amx provided the most beneficial outcomes. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  16. Development of dynamic Bayesian models for web application test management

    NASA Astrophysics Data System (ADS)

    Azarnova, T. V.; Polukhin, P. V.; Bondarenko, Yu V.; Kashirina, I. L.

    2018-03-01

    The mathematical apparatus of dynamic Bayesian networks is an effective and technically proven tool that can be used to model complex stochastic dynamic processes. According to the results of the research, mathematical models and methods of dynamic Bayesian networks provide a high coverage of stochastic tasks associated with error testing in multiuser software products operated in a dynamically changing environment. Formalized representation of the discrete test process as a dynamic Bayesian model allows us to organize the logical connection between individual test assets for multiple time slices. This approach gives an opportunity to present testing as a discrete process with set structural components responsible for the generation of test assets. Dynamic Bayesian network-based models allow us to combine in one management area individual units and testing components with different functionalities and a direct influence on each other in the process of comprehensive testing of various groups of computer bugs. The application of the proposed models provides an opportunity to use a consistent approach to formalize test principles and procedures, methods used to treat situational error signs, and methods used to produce analytical conclusions based on test results.

  17. Numerical study on the sequential Bayesian approach for radioactive materials detection

    NASA Astrophysics Data System (ADS)

    Qingpei, Xiang; Dongfeng, Tian; Jianyu, Zhu; Fanhua, Hao; Ge, Ding; Jun, Zeng

    2013-01-01

    A new detection method, based on the sequential Bayesian approach proposed by Candy et al., offers new horizons for the research of radioactive detection. Compared with the commonly adopted detection methods incorporated with statistical theory, the sequential Bayesian approach offers the advantages of shorter verification time during the analysis of spectra that contain low total counts, especially in complex radionuclide components. In this paper, a simulation experiment platform implementing the sequential Bayesian approach was developed. Event sequences of γ-rays associated with the true parameters of a LaBr3(Ce) detector were obtained from an event sequence generator using Monte Carlo sampling theory to study the performance of the sequential Bayesian approach. The numerical experimental results are in accordance with those of Candy. Moreover, the relationship between the detection model and the event generator, respectively represented by the expected detection rate (Am) and the tested detection rate (Gm) parameters, is investigated. To achieve an optimal performance for this processor, the interval of the tested detection rate as a function of the expected detection rate is also presented.

  18. Examining Gene-Environment Interactions in Comorbid Depressive and Disruptive Behavior Disorders using a Bayesian Approach

    PubMed Central

    Adrian, Molly; Kiff, Cara; Glazner, Chris; Kohen, Ruth; Tracy, Julia Helen; Zhou, Chuan; McCauley, Elizabeth; Stoep, Ann Vander

    2015-01-01

    Objective: The objective of this study was to apply a Bayesian statistical analytic approach that minimizes multiple testing problems to explore the combined effects of chronic low familial support and variants in 12 candidate genes on risk for a common and debilitating childhood mental health condition. Method: Bayesian mixture modeling was used to examine gene by environment interactions among genetic variants and environmental factors (family support) associated in previous studies with the occurrence of comorbid depression and disruptive behavior disorders in youth, using a sample of 255 children. Results: One main effect, a variant in the oxytocin receptor (OXTR, rs53576), was associated with increased risk for comorbid disorders. Two significant gene x environment interactions and one significant gene x gene interaction emerged. Variants in the nicotinic acetylcholine receptor α5 subunit (CHRNA5, rs16969968) and in the glucocorticoid receptor chaperone protein FK506 binding protein 5 (FKBP5, rs4713902) interacted with chronic low family support in association with child mental health status. One gene x gene interaction, the 5-HTTLPR variant of the serotonin transporter (SERT/SLC6A4) in combination with the μ opioid receptor (OPRM1, rs1799971), was associated with comorbid depression and conduct problems. Conclusions: Results indicate that Bayesian modeling is a feasible strategy for conducting behavioral genetics research. This approach, combined with an optimized genetic selection strategy (Vrieze, Iacono, & McGue, 2012), revealed genetic variants involved in stress regulation (FKBP5, SERTxOPRM1), social bonding (OXTR), and nicotine responsivity (CHRNA5) in predicting comorbid status. PMID:26228411

  19. Bayesian forecasting and uncertainty quantifying of stream flows using Metropolis–Hastings Markov Chain Monte Carlo algorithm

    DOE PAGES

    Wang, Hongrui; Wang, Cheng; Wang, Ying; ...

    2017-04-05

    This paper presents a Bayesian approach using the Metropolis-Hastings Markov Chain Monte Carlo algorithm and applies this method to daily river flow rate forecasting and uncertainty quantification for the Zhujiachuan River, using data collected from Qiaotoubao Gage Station and 13 other gage stations in the Zhujiachuan watershed in China. The proposed method is also compared with the conventional maximum likelihood estimation (MLE) for parameter estimation and quantification of associated uncertainties. While the Bayesian method performs similarly in estimating the mean value of daily flow rate, it outperforms the conventional MLE method on uncertainty quantification, providing a relatively narrower reliable interval than the MLE confidence interval and thus more precise estimation by using the related information from regional gage stations. As a result, the Bayesian MCMC method might be more favorable in the uncertainty analysis and risk management.

  20. A Bayesian hierarchical diffusion model decomposition of performance in Approach–Avoidance Tasks

    PubMed Central

    Krypotos, Angelos-Miltiadis; Beckers, Tom; Kindt, Merel; Wagenmakers, Eric-Jan

    2015-01-01

    Common methods for analysing response time (RT) tasks, frequently used across different disciplines of psychology, suffer from a number of limitations such as the failure to directly measure the underlying latent processes of interest and the inability to take into account the uncertainty associated with each individual's point estimate of performance. Here, we discuss a Bayesian hierarchical diffusion model and apply it to RT data. This model allows researchers to decompose performance into meaningful psychological processes and to account optimally for individual differences and commonalities, even with relatively sparse data. We highlight the advantages of the Bayesian hierarchical diffusion model decomposition by applying it to performance on Approach–Avoidance Tasks, widely used in the emotion and psychopathology literature. Model fits for two experimental data-sets demonstrate that the model performs well. The Bayesian hierarchical diffusion model overcomes important limitations of current analysis procedures and provides deeper insight in latent psychological processes of interest. PMID:25491372

  1. A Bayesian approach to tracking patients having changing pharmacokinetic parameters

    NASA Technical Reports Server (NTRS)

    Bayard, David S.; Jelliffe, Roger W.

    2004-01-01

    This paper considers the updating of Bayesian posterior densities for pharmacokinetic models associated with patients having changing parameter values. For estimation purposes it is proposed to use the Interacting Multiple Model (IMM) estimation algorithm, which is currently a popular algorithm in the aerospace community for tracking maneuvering targets. The IMM algorithm is described, and compared to the multiple model (MM) and Maximum A-Posteriori (MAP) Bayesian estimation methods, which are presently used for posterior updating when pharmacokinetic parameters do not change. Both the MM and MAP Bayesian estimation methods are used in their sequential forms, to facilitate tracking of changing parameters. Results indicate that the IMM algorithm is well suited for tracking time-varying pharmacokinetic parameters in acutely ill and unstable patients, incurring only about half of the integrated error compared to the sequential MM and MAP methods on the same example.

  2. Bayesian learning of visual chunks by human observers

    PubMed Central

    Orbán, Gergő; Fiser, József; Aslin, Richard N.; Lengyel, Máté

    2008-01-01

    Efficient and versatile processing of any hierarchically structured information requires a learning mechanism that combines lower-level features into higher-level chunks. We investigated this chunking mechanism in humans with a visual pattern-learning paradigm. We developed an ideal learner based on Bayesian model comparison that extracts and stores only those chunks of information that are minimally sufficient to encode a set of visual scenes. Our ideal Bayesian chunk learner not only reproduced the results of a large set of previous empirical findings in the domain of human pattern learning but also made a key prediction that we confirmed experimentally. In accordance with Bayesian learning but contrary to associative learning, human performance was well above chance when pair-wise statistics in the exemplars contained no relevant information. Thus, humans extract chunks from complex visual patterns by generating accurate yet economical representations and not by encoding the full correlational structure of the input. PMID:18268353

  3. Analyses of the population structure in a global collection of Phytophthora nicotianae isolates inferred from mitochondrial and nuclear DNA sequences.

    PubMed

    Mammella, Marco A; Martin, Frank N; Cacciola, Santa O; Coffey, Michael D; Faedda, Roberto; Schena, Leonardo

    2013-06-01

    Genetic variation within the heterothallic cosmopolitan plant pathogen Phytophthora nicotianae was determined in 96 isolates from a wide range of hosts and geographic locations by characterizing four mitochondrial (10% of the genome) and three nuclear loci. In all, 52 single-nucleotide polymorphisms (SNPs) (an average of 1 every 58 bp) and 313 sites with gaps representing 5,450 bases enabled the identification of 50 different multilocus mitochondrial haplotypes. Similarly, 24 SNPs (an average of 1 every 69 bp), with heterozygosity observed at each locus, were observed in three nuclear regions (hyp, scp, and β-tub) differentiating 40 multilocus nuclear genotypes. Both mitochondrial and nuclear markers revealed a high level of dispersal of isolates and an inconsistent geographic structuring of populations. However, a specific association was observed for host of origin and genetic grouping with both nuclear and mitochondrial sequences. In particular, the majority of citrus isolates from Italy, California, Florida, Syria, Albania, and the Philippines clustered in the same mitochondrial group and shared at least one nuclear allele. A similar association was also observed for isolates recovered from Nicotiana and Solanum spp. The present study suggests an important role of nursery populations in increasing genetic recombination within the species and the existence of extensive phenomena of migration of isolates that have been likely spread worldwide with infected plant material.

  4. Multilocus sequence analysis (MLSA) of Bradyrhizobium strains: revealing high diversity of tropical diazotrophic symbiotic bacteria.

    PubMed

    Delamuta, Jakeline Renata Marçon; Ribeiro, Renan Augusto; Menna, Pâmela; Bangel, Eliane Villamil; Hungria, Mariangela

    2012-04-01

    Symbiotic association of several genera of bacteria collectively called rhizobia and plants belonging to the family Leguminosae (=Fabaceae) results in the process of biological nitrogen fixation, playing a key role in global N cycling, and also bringing relevant contributions to agriculture. Bradyrhizobium is considered the ancestor of all nitrogen-fixing rhizobial species and probably originated in the tropics. The genus encompasses a variety of diverse bacteria, but the diversity captured in the analysis of the 16S rRNA is often low. In this study, we analyzed twelve Bradyrhizobium strains selected from previous studies performed by our group for showing high genetic diversity in relation to the described species. In addition to the 16S rRNA, five housekeeping genes (recA, atpD, glnII, gyrB and rpoB) were analyzed in the MLSA (multilocus sequence analysis) approach. Analysis of each gene and of the concatenated housekeeping genes captured a considerably higher level of genetic diversity, with indication of putative new species. The results highlight the high genetic variability associated with Bradyrhizobium microsymbionts of a variety of legumes. In addition, the MLSA approach has proved to represent a rapid and reliable method to be employed in phylogenetic and taxonomic studies, speeding the identification of the still poorly known diversity of nitrogen-fixing rhizobia in the tropics.

  5. Bayesian spatial prediction of the site index in the study of the Missouri Ozark Forest Ecosystem Project

    Treesearch

    Xiaoqian Sun; Zhuoqiong He; John Kabrick

    2008-01-01

    This paper presents a Bayesian spatial method for analysing the site index data from the Missouri Ozark Forest Ecosystem Project (MOFEP). Based on ecological background and availability, we select three variables, the aspect class, the soil depth and the land type association as covariates for analysis. To allow great flexibility of the smoothness of the random field,...

  6. Combination of multilocus sequence typing and pulsed-field gel electrophoresis reveals an association of molecular clonality with the emergence of extensive-drug resistance (XDR) in Salmonella.

    PubMed

    Cao, Yongzhong; Shen, Yongxiu; Cheng, Lingling; Zhang, Xiaorong; Wang, Chao; Wang, Yan; Zhou, Xiaohui; Chao, Guoxiang; Wu, Yantao

    2018-03-01

    Salmonella is one of the most important foodborne pathogens and has become resistant to multiple antibiotics, which represents a significant challenge to the food industry and public health. However, a molecular signature that can be used to distinguish antimicrobial resistance profiles, particularly multi-drug resistance or extensive-drug resistance (XDR), has not been established. In the current study, 168 isolates from the chicken and pork production chains and ill chickens were characterized by serotyping, antimicrobial susceptibility test, multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE). The results showed that these isolates belonged to 13 serotypes, 14 multilocus sequence types (STs), 94 PFGE genotypes, and 70 antimicrobial resistant profiles. S. Enteritidis, S. Indiana, and S. Derby were the predominant serotypes, corresponding to the ST11, ST17, and ST40 clones, respectively, and the PFGE Cluster A, Cluster E, and Cluster D, respectively. Among the ST11-S. Enteritidis (Cluster A) and the ST40-S. Derby (Cluster D) clones, the majority of isolates were resistant to 4-8 antimicrobial agents, whereas in the ST17-S. Indiana (Cluster E) clone, isolates showed extensive-drug resistance (XDR) to 9-16 antimicrobial agents. The bla TEM-1-like gene was prevalent in the ST11 and ST17 clones, corresponding to high ampicillin resistance. The bla TEM-1-like, bla CTX-M, bla OXA-1-like, sul1, aaC4, aac(6')-1b, dfrA17, and floR gene complex was highly prevalent among isolates of ST17, corresponding to an XDR phenotype. These results demonstrated the association of the resistant phenotypes and genotypes with ST clone and PFGE cluster. Our results also indicated that the newly identified gene complex comprising bla TEM-1-like, bla CTX-M, bla OXA-1-like, sul1, aaC4, aac(6')-1b, dfrA17, and floR was responsible for the emergence of the ST17-S. Indiana XDR clone. ST17 could potentially be used as a molecular signature to distinguish the S. Indiana XDR clone. Copyright © 2017 Elsevier GmbH. All rights reserved.

  7. Sinorhizobium meliloti strains TII7 and A5 by Multilocus Sequence Typing (MLST) have chromosomes identical with Rm1021 and form an effective and ineffective symbiosis with Medicago truncatula line Jemalong A17, respectively

    USDA-ARS's Scientific Manuscript database

    The strains TII7 and A5 formed an effective and ineffective symbiosis with Medicago truncatula Jemalong A17, respectively. Both were shown to have identical chromosomes with strains Rm1021 and RCR2011 using a Multilocus Sequence Typing method. The 2260 bp segments of DNA stretching from the 3’ end ...

  8. Latitudinal Clines of the Human Vitamin D Receptor and Skin Color Genes.

    PubMed

    Tiosano, Dov; Audi, Laura; Climer, Sharlee; Zhang, Weixiong; Templeton, Alan R; Fernández-Cancio, Monica; Gershoni-Baruch, Ruth; Sánchez-Muro, José Miguel; El Kholy, Mohamed; Hochberg, Zèev

    2016-05-03

    The well-documented latitudinal clines of genes affecting human skin color presumably arise from the need for protection from intense ultraviolet radiation (UVR) vs. the need to use UVR for vitamin D synthesis. Sampling 751 subjects from a broad range of latitudes and skin colors, we investigated possible multilocus correlated adaptation of skin color genes with the vitamin D receptor gene (VDR), using a vector correlation metric and network method called BlocBuster. We discovered two multilocus networks involving VDR promoter and skin color genes that display strong latitudinal clines as multilocus networks, even though many of their single gene components do not. Considered one by one, the VDR components of these networks show diverse patterns: no cline, a weak declining latitudinal cline outside of Africa, and a strong in- vs. out-of-Africa frequency pattern. We confirmed these results with independent data from HapMap. Standard linkage disequilibrium analyses did not detect these networks. We applied BlocBuster across the entire genome, showing that our networks are significant outliers for interchromosomal disequilibrium that overlap with environmental variation relevant to the genes' functions. These results suggest that these multilocus correlations most likely arose from a combination of parallel selective responses to a common environmental variable and coadaptation, given the known Mendelian epistasis among VDR and the skin color genes. Copyright © 2016 Tiosano et al.

  9. Latitudinal Clines of the Human Vitamin D Receptor and Skin Color Genes

    PubMed Central

    Tiosano, Dov; Audi, Laura; Climer, Sharlee; Zhang, Weixiong; Templeton, Alan R.; Fernández-Cancio, Monica; Gershoni-Baruch, Ruth; Sánchez-Muro, José Miguel; El Kholy, Mohamed; Hochberg, Zèev

    2016-01-01

    The well-documented latitudinal clines of genes affecting human skin color presumably arise from the need for protection from intense ultraviolet radiation (UVR) vs. the need to use UVR for vitamin D synthesis. Sampling 751 subjects from a broad range of latitudes and skin colors, we investigated possible multilocus correlated adaptation of skin color genes with the vitamin D receptor gene (VDR), using a vector correlation metric and network method called BlocBuster. We discovered two multilocus networks involving VDR promoter and skin color genes that display strong latitudinal clines as multilocus networks, even though many of their single gene components do not. Considered one by one, the VDR components of these networks show diverse patterns: no cline, a weak declining latitudinal cline outside of Africa, and a strong in- vs. out-of-Africa frequency pattern. We confirmed these results with independent data from HapMap. Standard linkage disequilibrium analyses did not detect these networks. We applied BlocBuster across the entire genome, showing that our networks are significant outliers for interchromosomal disequilibrium that overlap with environmental variation relevant to the genes’ functions. These results suggest that these multilocus correlations most likely arose from a combination of parallel selective responses to a common environmental variable and coadaptation, given the known Mendelian epistasis among VDR and the skin color genes. PMID:26921301

  10. Rigorous Approach in Investigation of Seismic Structure and Source Characteristics in Northeast Asia: Hierarchical and Trans-dimensional Bayesian Inversion

    NASA Astrophysics Data System (ADS)

    Mustac, M.; Kim, S.; Tkalcic, H.; Rhie, J.; Chen, Y.; Ford, S. R.; Sebastian, N.

    2015-12-01

    Conventional approaches to inverse problems suffer from non-linearity and non-uniqueness in estimations of seismic structures and source properties. Estimated results and associated uncertainties are often biased by applied regularizations and additional constraints, which are commonly introduced to solve such problems. Bayesian methods, however, provide statistically meaningful estimations of models and their uncertainties constrained by data information. In addition, hierarchical and trans-dimensional (trans-D) techniques are inherently implemented in the Bayesian framework to account for involved error statistics and model parameterizations, and, in turn, allow more rigorous estimations of the same. Here, we apply Bayesian methods throughout the entire inference process to estimate seismic structures and source properties in Northeast Asia including east China, the Korean peninsula, and the Japanese islands. Ambient noise analysis is first performed to obtain a base three-dimensional (3-D) heterogeneity model using continuous broadband waveforms from more than 300 stations. As for the tomography of surface wave group and phase velocities in the 5-70 s band, we adopt a hierarchical and trans-D Bayesian inversion method using Voronoi partition. The 3-D heterogeneity model is further improved by joint inversions of teleseismic receiver functions and dispersion data using a newly developed high-efficiency Bayesian technique. The obtained model is subsequently used to prepare 3-D structural Green's functions for the source characterization. A hierarchical Bayesian method for point source inversion using regional complete waveform data is applied to selected events from the region. The seismic structure and source characteristics with rigorously estimated uncertainties from the novel Bayesian methods provide enhanced monitoring and discrimination of seismic events in northeast Asia.

  11. Multilocus genetic diversity and historical biogeography of the endemic wall lizard from Ibiza and Formentera, Podarcis pityusensis (Squamata: Lacertidae).

    PubMed

    Rodríguez, V; Brown, R P; Terrasa, B; Pérez-Mellado, V; Castro, J A; Picornell, A; Ramon, M M

    2013-10-01

    Two monophyletic sister species of wall lizards inhabit the two main groups of Balearic Islands: Podarcis lilfordi from islets and small islands around Mallorca and Menorca and Podarcis pityusensis from Ibiza, Formentera and associated islets. Genetic diversity within the endangered P. lilfordi has been well characterized, but P. pityusensis has not been studied in depth. Here, 2430 bp of mtDNA and 15 microsatellite loci were analysed from P. pityusensis populations from across its natural range. Two main genetic groupings were identified, although geographical structuring differed slightly between the mtDNA and the nuclear loci. In general, individuals from islets/islands adjacent to the main island of Ibiza were genetically distinct from those from Formentera and the associated Freus islands for both mtDNA and the nuclear loci. However, most individuals from the island of Ibiza were grouped with neighbouring islets/islands for nuclear loci, but with Formentera and Freus islands for the mitochondrial locus. A time-calibrated Bayesian tree was constructed for the principal mitochondrial lineages within the Balearics, using the multispecies coalescent model, and provided statistical support for divergence of the two main P. pityusensis lineages 0.111-0.295 Ma. This suggests a mid-late Pleistocene intraspecific divergence, compared with an early Pleistocene divergence in P. lilfordi, and postdates some major increases in sea level between 0.4 and 0.6 Ma, which may have flooded Formentera. The program IMa2 provided a posterior divergence time of 0.089-0.221 Ma, which was similar to the multispecies coalescent tree estimate. More significantly, it indicated low but asymmetric effective gene copy migration rates, with higher migration from Formentera to Ibiza populations. Our findings suggest that much of the present-day diversity may have originated from a late Pleistocene colonization of one island group from the other, followed by allopatric divergence of these populations. Subsequent gene flow between these insular groups seems likely to be explained by recent human introductions. Two evolutionary significant units can be defined for P. pityusensis but these units would need to exclude the populations that have been the subjects of recent admixture. © 2013 John Wiley & Sons Ltd.

  12. Verifying the geographic origin of mahogany (Swietenia macrophylla King) with DNA-fingerprints.

    PubMed

    Degen, B; Ward, S E; Lemes, M R; Navarro, C; Cavers, S; Sebbenn, A M

    2013-01-01

    Illegal logging is one of the main causes of ongoing worldwide deforestation and needs to be eradicated. The trade in illegal timber and wood products creates market disadvantages for products from sustainable forestry. Although various measures have been established to counter illegal logging and the subsequent trade, there is a lack of practical mechanisms for identifying the origin of timber and wood products. In this study, six nuclear microsatellites were used to generate DNA fingerprints for a genetic reference database characterising the populations of origin of a large set of mahogany (Swietenia macrophylla King, Meliaceae) samples. For the database, leaves and/or cambium from 1971 mahogany trees sampled in 31 stands from Mexico to Bolivia were genotyped. A total of 145 different alleles were found, showing strong genetic differentiation (δ(Gregorius)=0.52, F(ST)=0.18, G(ST(Hedrick))=0.65) and clear correlation between genetic and spatial distances among stands (r=0.82, P<0.05). We used the genetic reference database and Bayesian assignment testing to determine the geographic origins of two sets of mahogany wood samples, based on their multilocus genotypes. In both cases the wood samples were assigned to the correct country of origin. We discuss the overall applicability of this methodology to tropical timber trading. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
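
    The assignment step described in this record can be illustrated with a deliberately simplified sketch. It is not the authors' procedure: it assumes Hardy-Weinberg proportions, independent loci, fixed reference allele frequencies, and a flat prior over stands, whereas published Bayesian assignment methods also model uncertainty in the reference frequencies. The stand names, locus names, allele labels, and frequencies are all hypothetical.

```python
import math


def genotype_log_likelihood(genotype, allele_freqs, floor=1e-3):
    """Log-likelihood of a diploid multilocus genotype in one population.

    Assumes Hardy-Weinberg proportions and independence between loci.
    Alleles missing from the reference get a small floor frequency
    (an ad hoc assumption) so the likelihood never collapses to zero.
    """
    total = 0.0
    for locus, (a, b) in genotype.items():
        p = max(allele_freqs[locus].get(a, 0.0), floor)
        q = max(allele_freqs[locus].get(b, 0.0), floor)
        total += math.log(p * p if a == b else 2.0 * p * q)
    return total


def assignment_posterior(genotype, populations):
    """Posterior probability of each candidate origin under a flat prior."""
    log_liks = {
        name: genotype_log_likelihood(genotype, freqs)
        for name, freqs in populations.items()
    }
    peak = max(log_liks.values())  # subtract the maximum for numerical stability
    weights = {name: math.exp(ll - peak) for name, ll in log_liks.items()}
    norm = sum(weights.values())
    return {name: w / norm for name, w in weights.items()}


# Hypothetical reference database: two stands, two microsatellite loci.
populations = {
    "stand_A": {"locus1": {"120": 0.7, "124": 0.3}, "locus2": {"200": 0.6, "204": 0.4}},
    "stand_B": {"locus1": {"120": 0.2, "124": 0.8}, "locus2": {"200": 0.1, "204": 0.9}},
}
# Hypothetical wood sample of unknown origin.
sample = {"locus1": ("120", "120"), "locus2": ("200", "204")}

posterior = assignment_posterior(sample, populations)
assigned_origin = max(posterior, key=posterior.get)
```

    With real data the reference frequencies would come from the genotyped stands, and more loci would sharpen the posterior.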

  13. Contrasting patterns of genetic diversity and population structure of Armillaria mellea sensu stricto in the eastern and western United States.

    PubMed

    Baumgartner, Kendra; Travadon, Renaud; Bruhn, Johann; Bergemann, Sarah E

    2010-07-01

    Armillaria mellea infects hundreds of plant species in natural and managed ecosystems throughout the Northern hemisphere. Previously reported nuclear genetic divergence between eastern and western U.S. isolates is consistent with the disjunct range of A. mellea in North America, which is restricted mainly to both coasts of the United States. We investigated patterns of population structure and genetic diversity of the eastern (northern and southern Appalachians, Ozarks, and western Great Lakes) and western (Berkeley, Los Angeles, St. Helena, and San Jose, CA) regions of the United States. In total, 156 diploid isolates were genotyped using 12 microsatellite loci. Absence of genetic differentiation within either eastern subpopulations (theta(ST) = -0.002, P = 0.5) or western subpopulations (theta(ST) = 0.004, P = 0.3) suggests that spore dispersal within each region is sufficient to prevent geographic differentiation. In contrast to the western United States, our finding of more than one genetic cluster of isolates within the eastern United States (K = 3), revealed by Bayesian assignment of multilocus genotypes in STRUCTURE and confirmed by genetic multivariate analyses, suggests that eastern subpopulations are derived from multiple founder sources. The existence of amplifiable and nonamplifiable loci and contrasting patterns of genetic diversity between the two regions demonstrate that there are two geographically isolated, divergent genetic pools of A. mellea in the United States.

  14. Multi-locus phylogenetic analysis of Old World chats and flycatchers reveals extensive paraphyly at family, subfamily and genus level (Aves: Muscicapidae).

    PubMed

    Sangster, George; Alström, Per; Forsmark, Emma; Olsson, Urban

    2010-10-01

    The chats and flycatchers (Muscicapidae) represent an assemblage of 275 species in 48 genera. Defining natural groups within this assemblage has been challenging because of its high diversity and a paucity of phylogenetically informative morphological characters. We assessed the phylogenetic relationships of 124 species and 34 genera of Muscicapidae, and 20 species of Turdidae, using molecular sequence data from one mitochondrial gene and three nuclear loci, in total 3240 bp. Bayesian and maximum likelihood analyses yielded a well-resolved tree in which nearly all basal nodes were strongly supported. The traditionally defined Muscicapidae, Muscicapinae and Saxicolinae were paraphyletic. Four major clades are recognized in Muscicapidae: Muscicapinae, Niltavinae (new family-group name), Erithacinae and Saxicolinae. Interesting relationships recovered by this analysis include: (i) a clade comprising the 'blue' flycatcher genera Niltava, Cyornis, Cyanoptila and Eumyias and some species of Rhinomyias; (ii) the position of Erithacus rubecula in a clade of otherwise exclusively African species; (iii) a close relationship between the shortwing Heinrichia calligyna and the flycatcher Rhinomyias insignis; (iv) a sister-relationship between forktails Enicurus and whistling thrushes Myophonus; and (v) a sister relationship of Ficedula and the 'chats' Monticola, Phoenicurus, Saxicola and Oenanthe. A high number of traditionally defined genera was found to be paraphyletic or polyphyletic. Copyright 2010 Elsevier Inc. All rights reserved.

  15. Investigating processes of neotropical rain forest tree diversification by examining the evolution and historical biogeography of the Protieae (Burseraceae).

    PubMed

    Fine, Paul V A; Zapata, Felipe; Daly, Douglas C

    2014-07-01

    Andean uplift and the collision of North and South America are thought to have major implications for the diversification of the Neotropical biota. However, few studies have investigated how these geological events may have influenced diversification. We present a multilocus phylogeny of 102 Protieae taxa (73% of published species), sampled pantropically, to test hypotheses about the relative importance of dispersal, vicariance, habitat specialization, and biotic factors in the diversification of this ecologically dominant tribe of Neotropical trees. Bayesian fossil-calibrated analyses date the Protieae stem at 55 Mya. Biogeographic analyses reconstruct an initial late Oligocene/early Miocene radiation in Amazonia for Neotropical Protieae, with several subsequent late Miocene dispersal events to Central America, the Caribbean, Brazil's Atlantic Forest, and the Chocó. Regional phylogenetic structure results indicate frequent dispersal among regions throughout the Miocene and many instances of more recent regional in situ speciation. Habitat specialization to white sand or flooded soils was common, especially in Amazonia. There was one significant increase in diversification rate coincident with colonization of the Neotropics, followed by a gradual decrease consistent with models of diversity-dependent cladogenesis. Dispersal, biotic interactions, and habitat specialization are thus hypothesized to be the most important processes underlying the diversification of the Protieae. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.

  16. Divergence times and the evolution of morphological complexity in an early land plant lineage (Marchantiopsida) with a slow molecular rate.

    PubMed

    Villarreal A, Juan Carlos; Crandall-Stotler, Barbara J; Hart, Michelle L; Long, David G; Forrest, Laura L

    2016-03-01

    We present a complete generic-level phylogeny of the complex thalloid liverworts, a lineage that includes the model system Marchantia polymorpha. The complex thalloids are remarkable for their slow rate of molecular evolution and for being the only extant plant lineage to differentiate gas exchange tissues in the gametophyte generation. We estimated the divergence times and analyzed the evolutionary trends of morphological traits, including air chambers, rhizoids and specialized reproductive structures. A multilocus dataset was analyzed using maximum likelihood and Bayesian approaches. Relative rates were estimated using local clocks. Our phylogeny cements the early branching in complex thalloids. Marchantia is supported in one of the earliest divergent lineages. The rate of evolution in organellar loci is slower than for other liverwort lineages, except for two annual lineages. Most genera diverged in the Cretaceous. Marchantia polymorpha diversified in the Late Miocene, giving a minimum age estimate for the evolution of its sex chromosomes. The complex thalloid ancestor, excluding Blasiales, is reconstructed as a plant with a carpocephalum, with filament-less air chambers opening via compound pores, and without pegged rhizoids. Our comprehensive study of the group provides a temporal framework for the analysis of the evolution of critical traits essential for plants during land colonization. © 2015 Royal Botanic Garden Edinburgh. New Phytologist © 2015 New Phytologist Trust.

  17. Multilocus patterns of nucleotide polymorphism and demographic change in Taxodium distichum (Cupressaceae) in the lower Mississippi River alluvial valley

    USGS Publications Warehouse

    Kusumi, J.; Zidong, L.; Kado, T.; Tsumura, Y.; Middleton, B.A.; Tachida, H.

    2010-01-01

    Premise of the Study: Studies of the geographic patterns of genetic variation can give important insights into the past population structure of species. Our study species, Taxodium distichum L. (bald-cypress), prefers riparian and wetland habitats and is widely distributed in southeastern North America and Mexico. We compared the genetic variation of T. distichum with that of its close relative, Cryptomeria japonica, which is endemic to Japan. Methods: Nucleotide polymorphisms of T. distichum in the lower Mississippi River alluvial valley, USA, were examined at 10 nuclear loci. Key Results: The average nucleotide diversity at silent sites, π_sil, across the 10 loci in T. distichum was higher than that of C. japonica (π_sil = 0.00732 and 0.00322, respectively). In T. distichum, Tajima's D values were each negative at 9 out of 10 loci, which suggests a recent population expansion. Maximum-likelihood and Bayesian estimations of the exponential population growth rate (g) of T. distichum populations indicated that this species had expanded approximately at the rate of 1.7-1.0 × 10⁻⁶ per year in the past. Conclusions: Taxodium distichum had significantly higher nucleotide variation than C. japonica, and its patterns of polymorphism contrasted strikingly with those of the latter, which previously has been inferred to have experienced a reduction in population size.
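
    For readers unfamiliar with the Tajima's D statistic mentioned in this record, the following is a minimal sketch of the standard formula (Tajima 1989). The function name and the input values in the usage lines are illustrative only and are not taken from the T. distichum data.

```python
import math


def tajimas_d(n_sequences: int, n_segregating: int, pi: float) -> float:
    """Tajima's D from sample size, segregating sites and mean pairwise differences.

    n_sequences   : number of sampled sequences (n)
    n_segregating : number of segregating sites (S)
    pi            : average number of pairwise differences per locus
    A negative value indicates an excess of rare variants, a pattern
    often read as a sign of population expansion.
    """
    n, s = n_sequences, n_segregating
    if n < 4 or s < 1:
        # For n < 4 the variance term is zero, so D is undefined.
        raise ValueError("need at least 4 sequences and 1 segregating site")

    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / (a1 * a1)
    e1 = c1 / a1
    e2 = c2 / (a1 * a1 + a2)

    # Difference between the two estimators of theta, scaled by its standard deviation.
    return (pi - s / a1) / math.sqrt(e1 * s + e2 * s * (s - 1))


# Illustrative inputs: 10 sequences, 16 segregating sites, pi of about 3.89.
example_d = tajimas_d(10, 16, 3.888889)
```

    Here `example_d` is negative because the observed pairwise diversity is lower than the value expected from the number of segregating sites under neutrality and constant population size.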

  18. A clinical measure of DNA methylation predicts outcome in de novo acute myeloid leukemia

    PubMed Central

    Luskin, Marlise R.; Gimotty, Phyllis A.; Smith, Catherine; Loren, Alison W.; Figueroa, Maria E.; Harrison, Jenna; Sun, Zhuoxin; Tallman, Martin S.; Paietta, Elisabeth M.; Litzow, Mark R.; Melnick, Ari M.; Levine, Ross L.; Fernandez, Hugo F.; Luger, Selina M.; Master, Stephen R.; Wertheim, Gerald B.W.

    2016-01-01

    BACKGROUND. Variable response to chemotherapy in acute myeloid leukemia (AML) represents a major treatment challenge. Clinical and genetic features incompletely predict outcome. The value of clinical epigenetic assays for risk classification has not been extensively explored. We assess the prognostic implications of a clinical assay for multilocus DNA methylation on adult patients with de novo AML. METHODS. We performed multilocus DNA methylation assessment using xMELP on samples and calculated a methylation statistic (M-score) for 166 patients from UPENN with de novo AML who received induction chemotherapy. The association of M-score with complete remission (CR) and overall survival (OS) was evaluated. The optimal M-score cut-point for identifying groups with differing survival was used to define a binary M-score classifier. This classifier was validated in an independent cohort of 383 patients from the Eastern Cooperative Oncology Group Trial 1900 (E1900; NCT00049517). RESULTS. A higher mean M-score was associated with death and failure to achieve CR. Multivariable analysis confirmed that a higher M-score was associated with death (P = 0.011) and failure to achieve CR (P = 0.034). Median survival was 26.6 months versus 10.6 months for low and high M-score groups. The ability of the M-score to perform as a classifier was confirmed in patients ≤ 60 years with intermediate cytogenetics and patients who achieved CR, as well as in the E1900 validation cohort. CONCLUSION. The M-score represents a valid binary prognostic classifier for patients with de novo AML. The xMELP assay and associated M-score can be used for prognosis and should be further investigated for clinical decision making in AML patients. PMID:27446991

  19. Epigenetic variation predicts regional and local intraspecific functional diversity in a perennial herb.

    PubMed

    Medrano, Mónica; Herrera, Carlos M; Bazaga, Pilar

    2014-10-01

    The ecological significance of epigenetic variation has been generally inferred from studies on model plants under artificial conditions, but the importance of epigenetic differences between individuals as a source of intraspecific diversity in natural plant populations remains essentially unknown. This study investigates the relationship between epigenetic variation and functional plant diversity by conducting epigenetic (methylation-sensitive amplified fragment length polymorphisms, MSAP) and genetic (amplified fragment length polymorphisms, AFLP) marker-trait association analyses for 20 whole-plant, leaf and regenerative functional traits in a large sample of wild-growing plants of the perennial herb Helleborus foetidus from ten sampling sites in south-eastern Spain. Plants differed widely in functional characteristics, and exhibited greater epigenetic than genetic diversity, as shown by per cent polymorphism of MSAP fragments (92%) or markers (69%) greatly exceeding that for AFLP ones (41%). After controlling for genetic structuring and possible cryptic relatedness, every functional trait considered exhibited a significant association with at least one AFLP or MSAP marker. A total of 27 MSAP (13.0% of total) and 12 AFLP (4.4%) markers were involved in significant associations, which explained on average 8.2% and 8.0% of trait variance, respectively. Individual MSAP markers were more likely to be associated with functional traits than AFLP markers. Between-site differences in multivariate functional diversity were directly related to variation in multilocus epigenetic diversity after multilocus genetic diversity was statistically accounted for. Results suggest that epigenetic variation can be an important source of intraspecific functional diversity in H. foetidus, possibly endowing this species with the capacity to exploit a broad range of ecological conditions despite its modest genetic diversity. © 2014 John Wiley & Sons Ltd.

  20. A Bayesian hierarchical approach to comparative audit for carotid surgery.

    PubMed

    Kuhan, G; Marshall, E C; Abidia, A F; Chetter, I C; McCollum, P T

    2002-12-01

    The aim of this study was to illustrate how a Bayesian hierarchical modelling approach can aid the reliable comparison of outcome rates between surgeons. Retrospective analysis of prospective and retrospective data. Binary outcome data (death/stroke within 30 days), together with information on 15 possible risk factors specific for CEA were available on 836 CEAs performed by four vascular surgeons from 1992-99. The median patient age was 68 (range 38-86) years and 60% were men. The model was developed using the WinBUGS software. After adjusting for patient-level risk factors, a cross-validatory approach was adopted to identify "divergent" performance. A ranking exercise was also carried out. The overall observed 30-day stroke/death rate was 3.9% (33/836). The model found diabetes, stroke and heart disease to be significant risk factors. There was no significant difference between the predicted and observed outcome rates for any surgeon (Bayesian p-value > 0.05). Each surgeon had a median rank of 3 with associated 95% CI 1.0-5.0, despite the variability of observed stroke/death rate from 2.9-4.4%. After risk adjustment, there was very little residual between-surgeon variability in outcome rate. Bayesian hierarchical models can help to accurately quantify the uncertainty associated with surgeons' performance and rank.

  1. Inference of Antibiotic Resistance and Virulence Among Diverse Group A Streptococcus Strains Using emm Sequencing and Multilocus Genotyping Methods

    DTIC Science & Technology

    2009-09-04

    apparent GAS-associated conditions were sampled by oropharyngeal swab. Swabs were streaked on blood agar plates using Table 3. Isolate properties by...testing, samples were re-streaked on blood agar plates (5% sheep blood in TSA base) (Hardy Diagnostics, Santa Maria, CA), and incubated at 35–37°C with 5–10...sensitivity (A-disk method, Hardy Diagnostics) and positive GAS latex agglutination reaction (Hardy Diagnostics). Confirmed GAS isolates were then

  2. Dispersion of Multidrug-Resistant Enterococcus faecium Isolates Belonging to Major Clonal Complexes in Different Portuguese Settings

    PubMed Central

    Freitas, Ana R.; Novais, Carla; Ruiz-Garbajosa, Patricia; Coque, Teresa M.; Peixe, Luísa

    2009-01-01

    The population structure of 56 Enterococcus faecium isolates selected from a collection of enterococci from humans, animals, and the environment in Portugal (1997 to 2007) was analyzed by multilocus sequence typing. We identified 41 sequence types clustering into CC17, CC5, CC9, CC22 and CC94, all clonal lineages comprising isolates from different hosts. Our findings highlight the role of community-associated hosts as reservoirs of enterococci able to cause human infections. PMID:19447948

  3. Population-level differences in disease transmission: A Bayesian analysis of multiple smallpox epidemics

    PubMed Central

    Elderd, Bret D.; Dwyer, Greg; Dukic, Vanja

    2013-01-01

    Estimates of a disease’s basic reproductive rate R0 play a central role in understanding outbreaks and planning intervention strategies. In many calculations of R0, a simplifying assumption is that different host populations have effectively identical transmission rates. This assumption can lead to an underestimate of the overall uncertainty associated with R0, which, due to the non-linearity of epidemic processes, may result in a mis-estimate of epidemic intensity and miscalculated expenditures associated with public-health interventions. In this paper, we utilize a Bayesian method for quantifying the overall uncertainty arising from differences in population-specific basic reproductive rates. Using this method, we fit spatial and non-spatial susceptible-exposed-infected-recovered (SEIR) models to a series of 13 smallpox outbreaks. Five outbreaks occurred in populations that had been previously exposed to smallpox, while the remaining eight occurred in Native-American populations that were naïve to the disease at the time. The Native-American outbreaks were close in a spatial and temporal sense. Using Bayesian Information Criterion (BIC), we show that the best model includes population-specific R0 values. These differences in R0 values may, in part, be due to differences in genetic background, social structure, or food and water availability. As a result of these inter-population differences, the overall uncertainty associated with the “population average” value of smallpox R0 is larger, a finding that can have important consequences for controlling epidemics. In general, Bayesian hierarchical models are able to properly account for the uncertainty associated with multiple epidemics, provide a clearer understanding of variability in epidemic dynamics, and yield a better assessment of the range of potential risks and consequences that decision makers face. PMID:24021521

  4. Multilocus Phylogeny of the Afrotropical Freshwater Crab Fauna Reveals Historical Drainage Connectivity and Transoceanic Dispersal Since the Eocene.

    PubMed

    Daniels, Savel R; Phiri, Ethel E; Klaus, Sebastian; Albrecht, Christian; Cumberlidge, Neil

    2015-07-01

    Phylogenetic reconstruction, divergence time estimations and ancestral range estimation were undertaken for 66% of the Afrotropical freshwater crab fauna (Potamonautidae) based on four partial DNA loci (12S rRNA, 16S rRNA, cytochrome oxidase one [COI], and histone 3). The present study represents the most comprehensive taxonomic sampling of any freshwater crab family globally, and explores the impact of paleodrainage interconnectivity on cladogenesis among freshwater crabs. Phylogenetic analyses of the total evidence data using maximum-likelihood (ML), maximum parsimony (MP), and Bayesian inference (BI) produced a robust statistically well-supported tree topology that reaffirmed the monophyly of the Afrotropical freshwater crab fauna. The estimated divergence times suggest that the Afrotropical Potamonautidae diverged during the Eocene. Cladogenesis within and among several genera occurred predominantly during the Miocene, which was associated with major tectonic and climatic ameliorations throughout the region. Paleodrainage connectivity was observed with specimens from the Nilo-Sudan and East African coast proving to be sister to specimens from the Upper Guinea Forests in West Africa. In addition, we observed strong sister taxon affinity between specimens from East Africa and the Congo basin, including specimens from Lake Tanganyika, while the southern African fauna was retrieved as sister to the Angolan taxa. Within the East African clade we observed two independent transoceanic dispersal events, one to the Seychelles Archipelago and a second to Madagascar, while we observe a single transoceanic dispersal event from West Africa to São Tomé. 
The ancestral area estimation suggested a West African/East African ancestral range for the family with multiple dispersal events between southern Africa and East Africa, and between East Africa and Central Africa. The taxonomic implications of our results are discussed in light of the widespread paraphyly evident among a number of genera. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. Evolutionary history of Lissotriton helveticus: multilocus assessment of ancestral vs. recent colonization of the Iberian Peninsula.

    PubMed

    Recuero, Ernesto; García-París, Mario

    2011-07-01

    The Pleistocene was characterized by climatic changes that greatly altered the distribution of organisms. Population extinctions, bottlenecks, isolation, range expansions and contractions were often associated with glaciations, leaving signatures in the spatial patterns of genetic diversity across species. Lissotriton helveticus belongs to a Pan-European lineage of newts that were strongly affected by glaciations and represent an excellent model to analyse the effect of generalized climatic changes in phylogeographic patterns. We studied the genetic diversity of the species using data from two mitochondrial and three nuclear genes analyzed in a Bayesian phylogenetic framework to investigate the historical processes shaping spatial patterns of genetic diversity. Mitochondrial haplotypes cluster in four different groups present in the Iberian Peninsula and of Pleistocene origin, probably by allopatric fragmentation. Nuclear genes present no obvious geographic structure patterns, suggesting gene flow and generalized incomplete lineage sorting. Populations north of the Pyrenees are closely related to those from northeastern Iberia, suggesting recent range expansion from this region. Historical demographic analyses indicate a demographic expansion starting about 100,000 years ago and more recent population declines. Compared to other Lissotriton species, L. helveticus includes only relatively young genetic lineages, suggesting a Central European pre-Pleistocene distribution followed by complete extirpation of the species during glaciations in that area. Historical demographic trends in the Iberian Peninsula are reversed with respect to the more Mediterranean species Lissotriton boscai, indicating different responses of both species to climate changes. Diversity patterns among Lissotriton species seem to be defined by four main factors: ancestral distributions, colonization capabilities, interactions with other species and effective population sizes. 
Differences in these factors define two types of species, referred to as "R" (refugia) and "S" (sanctuaries) that explain part of the diversity in patterns of genetic diversity created by glaciations in Western Europe. Copyright © 2011 Elsevier Inc. All rights reserved.

  6. Genetic and morphological contrasts between wild and anthropogenic populations of Agave parryi var. huachucensis in south-eastern Arizona.

    PubMed

    Parker, Kathleen C; Trapnell, Dorset W; Hamrick, J L; Hodgson, Wendy C

    2014-05-01

    At least seven species of Agave, including A. parryi, were cultivated prehistorically in Arizona, serving as important sources of food and fibre. Many relict populations from ancient cultivation remain in the modern landscape, offering a unique opportunity to study pre-Columbian plant manipulation practices. This study examined genetic and morphological variation in six A. p. var. huachucensis populations of unknown origin to compare them with previous work on A. parryi populations of known origin, to infer their cultivation history and to determine whether artificial selection is evident in populations potentially managed by early agriculturalists. Six A. p. var. huachucensis and 17 A. parryi populations were sampled, and morphometric, allozyme and microsatellite data were used to compare morphology and genetic structure in purportedly anthropogenic and wild populations, as well as in the two taxa. Analysis of molecular variance and Bayesian clustering were performed to partition variation associated with taxonomic identity and hypothesized evolutionary history, to highlight patterns of similarity among populations and to identify potential wild sources for the planting stock. A. p. var. huachucensis and A. parryi populations differed significantly both morphologically and genetically. Like A. parryi, wild A. p. var. huachucensis populations were more genetically diverse than the inferred anthropogenic populations, with greater expected heterozygosity, percentage of polymorphic loci and number of alleles. Inferred anthropogenic populations exhibited many traits indicative of past active cultivation: greater morphological uniformity, fixed heterozygosity for several loci (non-existent in wild populations), fewer multilocus genotypes and strong differentiation among populations. 
Where archaeological information is lacking, the genetic signature of many Agave populations in Arizona can be used to infer their evolutionary history and to identify potentially fruitful sites for archaeological investigation of ancient settlements and cultivation practices. The same approach can clearly be adopted for other species in similar situations.

  7. Genetic and morphological contrasts between wild and anthropogenic populations of Agave parryi var. huachucensis in south-eastern Arizona

    PubMed Central

    Parker, Kathleen C.; Trapnell, Dorset W.; Hamrick, J. L.; Hodgson, Wendy C.

    2014-01-01

    Background and Aims: At least seven species of Agave, including A. parryi, were cultivated prehistorically in Arizona, serving as important sources of food and fibre. Many relict populations from ancient cultivation remain in the modern landscape, offering a unique opportunity to study pre-Columbian plant manipulation practices. This study examined genetic and morphological variation in six A. p. var. huachucensis populations of unknown origin to compare them with previous work on A. parryi populations of known origin, to infer their cultivation history and to determine whether artificial selection is evident in populations potentially managed by early agriculturalists. Methods: Six A. p. var. huachucensis and 17 A. parryi populations were sampled, and morphometric, allozyme and microsatellite data were used to compare morphology and genetic structure in purportedly anthropogenic and wild populations, as well as in the two taxa. Analysis of molecular variance and Bayesian clustering were performed to partition variation associated with taxonomic identity and hypothesized evolutionary history, to highlight patterns of similarity among populations and to identify potential wild sources for the planting stock. Key Results: A. p. var. huachucensis and A. parryi populations differed significantly both morphologically and genetically. Like A. parryi, wild A. p. var. huachucensis populations were more genetically diverse than the inferred anthropogenic populations, with greater expected heterozygosity, percentage of polymorphic loci and number of alleles. Inferred anthropogenic populations exhibited many traits indicative of past active cultivation: greater morphological uniformity, fixed heterozygosity for several loci (non-existent in wild populations), fewer multilocus genotypes and strong differentiation among populations. 
Conclusions: Where archaeological information is lacking, the genetic signature of many Agave populations in Arizona can be used to infer their evolutionary history and to identify potentially fruitful sites for archaeological investigation of ancient settlements and cultivation practices. The same approach can clearly be adopted for other species in similar situations. PMID:24638822

  8. Bayesian networks in neuroscience: a survey.

    PubMed

    Bielza, Concha; Larrañaga, Pedro

    2014-01-01

    Bayesian networks are a type of probabilistic graphical model that lies at the intersection between statistics and machine learning. They have been shown to be powerful tools to encode dependence relationships among the variables of a domain under uncertainty. Thanks to their generality, Bayesian networks can accommodate continuous and discrete variables, as well as temporal processes. In this paper we review Bayesian networks and how they can be learned automatically from data by means of structure learning algorithms. Also, we examine how a user can take advantage of these networks for reasoning by exact or approximate inference algorithms that propagate the given evidence through the graphical structure. Despite their applicability in many fields, they have been little used in neuroscience, where they have focused on specific problems, like functional connectivity analysis from neuroimaging data. Here we survey key research in neuroscience where Bayesian networks have been used with different aims: discover associations between variables, perform probabilistic reasoning over the model, and classify new observations with and without supervision. The networks are learned from data of any kind (morphological, electrophysiological, -omics and neuroimaging), thereby broadening the scope (molecular, cellular, structural, functional, cognitive and medical) of the brain aspects to be studied.

  9. Rediscovery of Good-Turing estimators via Bayesian nonparametrics.

    PubMed

    Favaro, Stefano; Nipoti, Bernardo; Teh, Yee Whye

    2016-03-01

    The problem of estimating discovery probabilities originated in the context of statistical ecology, and in recent years it has become popular due to its frequent appearance in challenging applications arising in genetics, bioinformatics, linguistics, designs of experiments, machine learning, etc. A full range of statistical approaches, parametric and nonparametric as well as frequentist and Bayesian, has been proposed for estimating discovery probabilities. In this article, we investigate the relationships between the celebrated Good-Turing approach, which is a frequentist nonparametric approach developed in the 1940s, and a Bayesian nonparametric approach recently introduced in the literature. Specifically, under the assumption of a two parameter Poisson-Dirichlet prior, we show that Bayesian nonparametric estimators of discovery probabilities are asymptotically equivalent, for a large sample size, to suitably smoothed Good-Turing estimators. As a by-product of this result, we introduce and investigate a methodology for deriving exact and asymptotic credible intervals to be associated with the Bayesian nonparametric estimators of discovery probabilities. The proposed methodology is illustrated through a comprehensive simulation study and the analysis of Expressed Sequence Tags data generated by sequencing a benchmark complementary DNA library. © 2015, The International Biometric Society.
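
    As a minimal sketch of the frequentist side of this comparison, the classical (unsmoothed) Good-Turing estimate of the probability that the next observation belongs to a species seen exactly l times is (l + 1) * m_{l+1} / n, where m_{l+1} is the number of species seen l + 1 times in a sample of size n. The function name and the toy sample are illustrative assumptions; this is not the smoothed estimator or the Bayesian nonparametric estimator studied in the article.

```python
from collections import Counter


def good_turing_discovery(sample, l=0):
    """Unsmoothed Good-Turing estimate of the l-discovery probability.

    Returns (l + 1) * m_{l+1} / n, where n is the sample size and
    m_{l+1} is the number of distinct species observed exactly l + 1 times.
    With l = 0 this is the estimated probability of seeing a new species.
    """
    n = len(sample)
    if n == 0:
        raise ValueError("sample must not be empty")
    species_counts = Counter(sample)                    # species -> frequency
    freq_of_freq = Counter(species_counts.values())     # frequency -> number of species
    return (l + 1) * freq_of_freq.get(l + 1, 0) / n


# Toy sample (hypothetical): a seen twice, b three times, c, d, e once each.
toy_sample = list("aabbbcde")
p_new = good_turing_discovery(toy_sample, l=0)
```

    With no species observed exactly l + 1 times the raw estimate is zero, which is one reason smoothed versions are used in practice.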

  10. Bayesian networks in neuroscience: a survey

    PubMed Central

    Bielza, Concha; Larrañaga, Pedro

    2014-01-01

    Bayesian networks are a type of probabilistic graphical model that lies at the intersection between statistics and machine learning. They have been shown to be powerful tools to encode dependence relationships among the variables of a domain under uncertainty. Thanks to their generality, Bayesian networks can accommodate continuous and discrete variables, as well as temporal processes. In this paper we review Bayesian networks and how they can be learned automatically from data by means of structure learning algorithms. Also, we examine how a user can take advantage of these networks for reasoning by exact or approximate inference algorithms that propagate the given evidence through the graphical structure. Despite their applicability in many fields, they have been little used in neuroscience, where they have focused on specific problems, like functional connectivity analysis from neuroimaging data. Here we survey key research in neuroscience where Bayesian networks have been used with different aims: discover associations between variables, perform probabilistic reasoning over the model, and classify new observations with and without supervision. The networks are learned from data of any kind (morphological, electrophysiological, -omics and neuroimaging), thereby broadening the scope (molecular, cellular, structural, functional, cognitive and medical) of the brain aspects to be studied. PMID:25360109

  11. Multilocus sequence typing and virulence analysis of Haemophilus parasuis strains isolated in five provinces of China.

    PubMed

    Wang, Liyan; Ma, Lina; Liu, Yongan; Gao, Pengcheng; Li, Youquan; Li, Xuerui; Liu, Yongsheng

    2016-10-01

    Haemophilus parasuis is the etiological agent of Glässer's disease, which causes high morbidity and mortality in swine herds. Although H. parasuis strains can be classified into 15 serovars with the Kielstein-Rapp-Gabrielson serotyping scheme, a large number of isolates cannot be classified and have been designated 'nontypeable' strains. In this study, multilocus sequence typing (MLST) of H. parasuis was used to analyze 48 H. parasuis field strains isolated in China and two strains from Australia. Twenty-six new alleles and 29 new sequence types (STs) were detected, enriching the H. parasuis MLST databases. A BURST analysis indicated that H. parasuis lacks stable population structure and is highly heterogeneous, and that there is no association between STs and geographic area. When a UPGMA dendrogram was constructed, two major clades, clade A and clade B, were defined. Animal experiments, in which guinea pigs were challenged intraperitoneally with the bacterial isolates, supported the hypothesis that the H. parasuis STs in clade A are generally avirulent or weakly virulent, whereas the STs in clade B tend to be virulent. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. Genetic relationships within the genus Prevotella analyzed by multilocus enzyme electrophoresis and DNA-DNA hybridization.

    PubMed

    Combe, M L; Pons, J L

    1999-12-01

    The genetic diversity and relationships within the genus Prevotella were studied by analyzing twenty-five strains by multilocus enzyme electrophoresis (MLEE) at nine metabolic enzyme loci and DNA-DNA hybridization. MLEE revealed a high genetic diversity with 25 electrophoretic types (ETs) for the 25 strains studied, a mean number of alleles per enzyme locus of 6.8 and a mean genetic diversity per locus of 0.786. The index of association described by Maynard Smith et al. (1993) revealed a clonal structure within the genus Prevotella. A dendrogram generated by cluster analysis of a matrix of ETs showed that species like P. bivia, P. buccae, P. oris, P. oralis, P. nigrescens, and P. denticola form clusters that are consistent with DNA homologies. However, strains identified as P. melaninogenica or P. loescheii by DNA-DNA hybridization did not constitute distinct subpopulations in MLEE. MLEE analysis demonstrated its high power in differentiating closely related strains. It provides an alternative to 16S rRNA analysis for the study of phylogenetic relationships within the genus Prevotella, especially for differentiating strains with high DNA homology or high rRNA homology.
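
    For readers unfamiliar with the "genetic diversity per locus" statistic, the sketch below computes the commonly used unbiased estimator h = n / (n - 1) * (1 - sum of squared allele frequencies) for one enzyme locus. That the study used exactly this estimator is an assumption, since the abstract does not state the formula; the function name and toy data are illustrative only.

```python
from collections import Counter


def gene_diversity(alleles):
    """Unbiased gene diversity at one locus.

    `alleles` holds one allele label per electrophoretic type (ET).
    h = n / (n - 1) * (1 - sum(x_i ** 2)), where x_i is the frequency
    of allele i among the n ETs.
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two ETs to estimate diversity")
    freqs = [count / n for count in Counter(alleles).values()]
    return n / (n - 1) * (1.0 - sum(f * f for f in freqs))


# Toy locus (hypothetical): two alleles, each carried by two of four ETs.
h_toy = gene_diversity([1, 1, 2, 2])
```

    The value is 0 when every ET carries the same allele and 1 when every ET carries a different one; the mean over loci gives a per-locus average such as the one reported above.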

  13. Conditional adaptive Bayesian spectral analysis of nonstationary biomedical time series.

    PubMed

    Bruce, Scott A; Hall, Martica H; Buysse, Daniel J; Krafty, Robert T

    2018-03-01

    Many studies of biomedical time series signals aim to measure the association between frequency-domain properties of time series and clinical and behavioral covariates. However, the time-varying dynamics of these associations are largely ignored due to a lack of methods that can assess the changing nature of the relationship through time. This article introduces a method for the simultaneous and automatic analysis of the association between the time-varying power spectrum and covariates, which we refer to as conditional adaptive Bayesian spectrum analysis (CABS). The procedure adaptively partitions the grid of time and covariate values into an unknown number of approximately stationary blocks and nonparametrically estimates local spectra within blocks through penalized splines. CABS is formulated in a fully Bayesian framework, in which the number and locations of partition points are random, and fit using reversible jump Markov chain Monte Carlo techniques. Estimation and inference averaged over the distribution of partitions allows for the accurate analysis of spectra with both smooth and abrupt changes. The proposed methodology is used to analyze the association between the time-varying spectrum of heart rate variability and self-reported sleep quality in a study of older adults serving as the primary caregiver for their ill spouse. © 2017, The International Biometric Society.

  14. Localizing multiple X chromosome-linked retinitis pigmentosa loci using multilocus homogeneity tests

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ott, J.; Terwilliger, J.D.; Bhattacharya, S.

    1990-01-01

    Multilocus linkage analysis of 62 family pedigrees with X chromosome-linked retinitis pigmentosa (XLRP) was undertaken to determine the presence of possible multiple disease loci and to reliably estimate their map location. Multilocus homogeneity tests furnish convincing evidence for the presence of two XLRP loci, the likelihood ratio being 6.4 × 10^9:1 in favor of two versus a single XLRP locus, and gave accurate estimates for their map location. In 60-75% of the families, location of an XLRP gene was estimated at 1 centimorgan distal to OTC, and in 25-40% of the families, an XLRP locus was located halfway between DXS14 (p58-1) and DXZ1 (Xcen), with an estimated recombination fraction of 25% between the two XLRP loci. There is also good evidence for a third XLRP locus, midway between DXS28 (C7) and DXS164 (pERT87), supported by a likelihood ratio of 293:1 for three versus two XLRP loci.

  15. Evolution of recombination rates in a multi-locus, haploid-selection, symmetric-viability model.

    PubMed

    Chasnov, J R; Ye, Felix Xiaofeng

    2013-02-01

    A fast algorithm for computing multi-locus recombination is extended to include a recombination-modifier locus. This algorithm and a linear stability analysis are used to investigate the evolution of recombination rates in a multi-locus, haploid-selection, symmetric-viability model for which stable equilibria have recently been determined. When the starting equilibrium is symmetric with two selected loci, we show analytically that modifier alleles that reduce recombination always invade. When the starting equilibrium is monomorphic, and there is a fixed nonzero recombination rate between the modifier locus and the selected loci, we determine analytical conditions for which a modifier allele can invade. In particular, we show that a gap exists between the recombination rates of modifiers that can invade and the recombination rate that specifies the lower stability boundary of the monomorphic equilibrium. A numerical investigation shows that a similar gap exists in a weakened form when the starting equilibrium is fully polymorphic but asymmetric. Copyright © 2012 Elsevier Inc. All rights reserved.

  16. Multilocus inference of species trees and DNA barcoding.

    PubMed

    Mallo, Diego; Posada, David

    2016-09-05

    The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary processes such as incomplete lineage sorting, gene duplication and loss and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree-gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or nonexistent. This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.

  17. Additive Genetic Risk from Five Serotonin System Polymorphisms Interacts with Interpersonal Stress to Predict Depression

    PubMed Central

    Vrshek-Schallhorn, Suzanne; Stroud, Catherine B.; Mineka, Susan; Zinbarg, Richard E.; Adam, Emma K.; Redei, Eva E.; Hammen, Constance; Craske, Michelle G.

    2016-01-01

    Behavioral genetic research supports polygenic models of depression in which many genetic variations each contribute a small amount of risk, and prevailing diathesis-stress models suggest gene-environment interactions (GxE). Multilocus profile scores of additive risk offer an approach that is consistent with polygenic models of depression risk. In a first demonstration of this approach in a GxE predicting depression, we created an additive multilocus profile score from five serotonin system polymorphisms (one each in the genes HTR1A, HTR2A, HTR2C, and two in TPH2). Analyses focused on two forms of interpersonal stress as environmental risk factors. Using five years of longitudinal diagnostic and life stress interviews from 387 emerging young adults in the Youth Emotion Project, survival analyses show that this multilocus profile score interacts with major interpersonal stressful life events to predict major depressive episode onsets (HR = 1.815, p = .007). Simultaneously, there was a significant protective effect of the profile score without a recent event (HR = 0.83, p = .030). The GxE effect with interpersonal chronic stress was not significant (HR = 1.15, p = .165). Finally, effect sizes for genetic factors examined ignoring stress suggested such an approach could lead to overlooking or misinterpreting genetic effects. Both the GxE effect and the protective simple main effect were replicated in a sample of early adolescent girls (N = 105). We discuss potential benefits of the multilocus genetic profile score approach and caveats for future research. PMID:26595467
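
    The idea of an additive multilocus profile score can be sketched as a plain sum of per-locus risk-allele counts. The coding below (each polymorphism scored 0, 1 or 2) and the labels TPH2_a and TPH2_b are assumptions made for illustration; the abstract does not give the study's actual variant names or scoring rules.

```python
def multilocus_profile_score(risk_allele_counts):
    """Sum per-locus risk-allele counts into one additive score.

    `risk_allele_counts` maps a locus label to the number of putative
    risk alleles carried at that locus (assumed coding: 0, 1 or 2).
    """
    for locus, count in risk_allele_counts.items():
        if count not in (0, 1, 2):
            raise ValueError(f"unexpected allele count {count!r} at {locus}")
    return sum(risk_allele_counts.values())


# Hypothetical genotype for one participant across five polymorphisms.
participant = {"HTR1A": 1, "HTR2A": 2, "HTR2C": 0, "TPH2_a": 1, "TPH2_b": 0}
score = multilocus_profile_score(participant)
```

    A score built this way weights every polymorphism equally, which is the simplest reading of "additive"; in a GxE analysis it would then enter the survival model together with its interaction with the stress measure.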

  18. Technical note: Bayesian calibration of dynamic ruminant nutrition models.

    PubMed

    Reed, K F; Arhonditsis, G B; France, J; Kebreab, E

    2016-08-01

    Mechanistic models of ruminant digestion and metabolism have advanced our understanding of the processes underlying ruminant animal physiology. Deterministic modeling practices ignore the inherent variation within and among individual animals and thus have no way to assess how sources of error influence model outputs. We introduce Bayesian calibration of mathematical models to address the need for robust mechanistic modeling tools that can accommodate error analysis by remaining within the bounds of data-based parameter estimation. For the purpose of prediction, the Bayesian approach generates a posterior predictive distribution that represents the current estimate of the value of the response variable, taking into account both the uncertainty about the parameters and model residual variability. Predictions are expressed as probability distributions, thereby conveying significantly more information than point estimates in regard to uncertainty. Our study illustrates some of the technical advantages of Bayesian calibration and discusses the future perspectives in the context of animal nutrition modeling. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  19. Bayesian analyses of seasonal runoff forecasts

    NASA Astrophysics Data System (ADS)

    Krzysztofowicz, R.; Reese, S.

    1991-12-01

    Forecasts of seasonal snowmelt runoff volume provide indispensable information for rational decision making by water project operators, irrigation district managers, and farmers in the western United States. Bayesian statistical models and communication frames have been researched in order to enhance the forecast information disseminated to the users, and to characterize forecast skill from the decision maker's point of view. Four products are presented: (i) a Bayesian Processor of Forecasts, which provides a statistical filter for calibrating the forecasts, and a procedure for estimating the posterior probability distribution of the seasonal runoff; (ii) the Bayesian Correlation Score, a new measure of forecast skill, which is related monotonically to the ex ante economic value of forecasts for decision making; (iii) a statistical predictor of monthly cumulative runoffs within the snowmelt season, conditional on the total seasonal runoff forecast; and (iv) a framing of the forecast message that conveys the uncertainty associated with the forecast estimates to the users. All analyses are illustrated with numerical examples of forecasts for six gauging stations from the period 1971-1988.

  20. NET-VISA, a Bayesian method next-generation automatic association software. Latest developments and operational assessment.

    NASA Astrophysics Data System (ADS)

    Le Bras, Ronan; Kushida, Noriyuki; Mialle, Pierrick; Tomuta, Elena; Arora, Nimar

    2017-04-01

    The Preparatory Commission for the Comprehensive Nuclear-Test-Ban Treaty Organization (CTBTO) has been developing a Bayesian method and software to perform the key step of automatic association of seismological, hydroacoustic, and infrasound (SHI) parametric data. In our preliminary testing in the CTBTO, NET-VISA shows much better performance than its currently operating automatic association module, with a rate for automatic events matching the analyst-reviewed events increased by 10%, signifying that the percentage of missed events is lowered by 40%. Initial tests involving analysts also showed that the new software will complete the automatic bulletins of the CTBTO by adding previously missed events. Because products by the CTBTO are also widely distributed to its member States as well as throughout the seismological community, the introduction of a new technology must be carried out carefully, and the first step of operational integration is to first use NET-VISA results within the interactive analysts' software so that the analysts can check the robustness of the Bayesian approach. We report on the latest results both on the progress for automatic processing and for the initial introduction of NET-VISA results in the analyst review process.

  1. Bayesian Population Forecasting: Extending the Lee-Carter Method.

    PubMed

    Wiśniowski, Arkadiusz; Smith, Peter W F; Bijak, Jakub; Raymer, James; Forster, Jonathan J

    2015-06-01

    In this article, we develop a fully integrated and dynamic Bayesian approach to forecast populations by age and sex. The approach embeds the Lee-Carter type models for forecasting the age patterns, with associated measures of uncertainty, of fertility, mortality, immigration, and emigration within a cohort projection model. The methodology may be adapted to handle different data types and sources of information. To illustrate, we analyze time series data for the United Kingdom and forecast the components of population change to the year 2024. We also compare the results obtained from different forecast models for age-specific fertility, mortality, and migration. In doing so, we demonstrate the flexibility and advantages of adopting the Bayesian approach for population forecasting and highlight areas where this work could be extended.

  2. Social deprivation and population density are not associated with small area risk of amyotrophic lateral sclerosis.

    PubMed

    Rooney, James P K; Tobin, Katy; Crampsie, Arlene; Vajda, Alice; Heverin, Mark; McLaughlin, Russell; Staines, Anthony; Hardiman, Orla

    2015-10-01

    Evidence of an association between areal ALS risk and population density has been previously reported. We aim to examine ALS spatial incidence in Ireland using small areas, to compare this analysis with our previous analysis of larger areas and to examine the associations between population density, social deprivation and ALS incidence. Residential area social deprivation has not been previously investigated as a risk factor for ALS. Using the Irish ALS register, we included all cases of ALS diagnosed in Ireland from 1995-2013. 2006 census data was used to calculate age and sex standardised expected cases per small area. Social deprivation was assessed using the pobalHP deprivation index. Bayesian smoothing was used to calculate small area relative risk for ALS, whilst cluster analysis was performed using SaTScan. The effects of population density and social deprivation were tested in two ways: (1) as covariates in the Bayesian spatial model; (2) via post-Bayesian regression. 1701 cases were included. Bayesian smoothed maps of relative risk at small area resolution matched closely to our previous analysis at a larger area resolution. Cluster analysis identified two areas of significant low risk. These areas did not correlate with population density or social deprivation indices. Two areas showing low frequency of ALS have been identified in the Republic of Ireland. These areas do not correlate with population density or residential area social deprivation, indicating that other reasons, such as genetic admixture may account for the observed findings. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. Development of a multilocus sequence typing scheme for Ureaplasma.

    PubMed

    Zhang, J; Kong, Y; Feng, Y; Huang, J; Song, T; Ruan, Z; Song, J; Jiang, Y; Yu, Y; Xie, X

    2014-04-01

    Ureaplasma is a commensal of the human urogenital tract but is also associated with invasive diseases such as non-gonococcal urethritis, infertility, and adverse pregnancy outcomes. To better understand the molecular epidemiology and population structure of Ureaplasma, a multilocus sequence typing (MLST) scheme based on four housekeeping genes (ftsH, rpL22, valS, thrS) was developed and validated using 283 isolates, including 14 serovars of reference strains and 269 strains obtained from clinical patients. A total of 99 sequence types (STs) were revealed: the 14 type strains of the Ureaplasma serovars were assigned to 12 STs, and 87 novel and special STs appeared among the clinical isolates. ST1 and ST22 were the predominant STs, which contained 68 and 70 isolates, respectively. Two clonal lineages (CC1 and CC2) were shown by eBURST analysis, and linkage disequilibrium was revealed through a standardized index of association (I_A^S). The neighbor-joining tree results of 14 Ureaplasma serovars showed two genetically significantly distant clusters, which was highly congruent with the species taxonomy of ureaplasmas [Ureaplasma parvum (UPA) and Ureaplasma urealyticum (UUR)]. Analysis of the biotypes of 269 clinical isolates revealed that all the isolates of CC1 were UPA and those of CC2 were UUR. Additionally, CC2 was found more often in symptomatic patients with vaginitis, tubal obstruction, and cervicitis. In conclusion, this MLST scheme is adequate for investigations of molecular epidemiology and population structure with highly discriminating capacity.

  4. Multi-locus tree and species tree approaches toward resolving a complex clade of downy mildews (Straminipila, Oomycota), including pathogens of beet and spinach

    PubMed Central

    Choi, Young-Joon; Klosterman, Steven J.; Kummer, Volker; Voglmayr, Hermann; Shin, Hyeon-Dong; Thines, Marco

    2017-01-01

    Accurate species determination of plant pathogens is a prerequisite for their control and quarantine, and further for assessing their potential threat to crops. The family Peronosporaceae (Straminipila; Oomycota) consists of obligate biotrophic pathogens that cause downy mildew disease on angiosperms, including a large number of cultivated plants. In the largest downy mildew genus Peronospora, a phylogenetically complex clade includes the economically important downy mildew pathogens of spinach and beet, as well as the type species of the genus Peronospora. To resolve this complex clade at the species level and to infer evolutionary relationships among them, we used multi-locus phylogenetic analysis and species tree estimation. Both approaches discriminated all nine currently accepted species and revealed four previously unrecognized lineages, which are specific to a host genus or species. This is in line with a narrow species concept, i.e. that a downy mildew species is associated with only a particular host plant genus or species. Instead of applying the dubious name Peronospora farinosa, which has been proposed for formal rejection, our results provide strong evidence that Peronospora schachtii is an independent species from lineages on Atriplex and apparently occurs exclusively on Beta vulgaris. The members of the clade investigated, the Peronospora rumicis clade, associate with three different host plant families, Amaranthaceae, Caryophyllaceae, and Polygonaceae, suggesting that they may have speciated following at least two recent inter-family host shifts, rather than contemporary cospeciation with the host plants. PMID:25772799

  5. Novel type of VanB2 teicoplanin-resistant hospital-associated Enterococcus faecium.

    PubMed

    Santona, Antonella; Paglietti, Bianca; Al-Qahtani, Ahmed A; Bohol, Marie Fe F; Senok, Abiola; Deligios, Massimo; Rubino, Salvatore; Al-Ahdal, Mohammed N

    2014-08-01

    Seven high-risk clones of vancomycin-resistant Enterococcus faecium (VREF) belonging to clonal complex 17 were identified using multilocus sequence typing (MLST) among clinical isolates from Saudi Arabia. Among these isolates, a new hospital-associated sequence type (ST795), VanB(2)-type teicoplanin-resistant strain was detected. Its unusual phenotype resulted from a new combination of mutations in the ddl, vanS and vanW genes, which confirmed the trend of evolution in VanB-type resistance. Furthermore, characteristics of adaptation and persistence in the hospital environment of ST795 were emphasised by the presence of genes and clusters recognised to be specific for hospital-associated VREF. Copyright © 2014 Elsevier B.V. and the International Society of Chemotherapy. All rights reserved.

  6. Comparing methods of measuring geographic patterns in temporal trends: an application to county-level heart disease mortality in the United States, 1973 to 2010.

    PubMed

    Vaughan, Adam S; Kramer, Michael R; Waller, Lance A; Schieb, Linda J; Greer, Sophia; Casper, Michele

    2015-05-01

    To demonstrate the implications of choosing analytical methods for quantifying spatiotemporal trends, we compare the assumptions, implementation, and outcomes of popular methods using county-level heart disease mortality in the United States between 1973 and 2010. We applied four regression-based approaches (joinpoint regression, both aspatial and spatial generalized linear mixed models, and Bayesian space-time model) and compared resulting inferences for geographic patterns of local estimates of annual percent change and associated uncertainty. The average local percent change in heart disease mortality from each method was -4.5%, with the Bayesian model having the smallest range of values. The associated uncertainty in percent change differed markedly across the methods, with the Bayesian space-time model producing the narrowest range of variance (0.0-0.8). The geographic pattern of percent change was consistent across methods with smaller declines in the South Central United States and larger declines in the Northeast and Midwest. However, the geographic patterns of uncertainty differed markedly between methods. The similarity of results, including geographic patterns, for magnitude of percent change across these methods validates the underlying spatial pattern of declines in heart disease mortality. However, marked differences in degree of uncertainty indicate that Bayesian modeling offers substantially more precise estimates. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. [The use of multilocus sequence typing (MLST) and randomly amplified polymorphic DNA (RAPD) for the differentiation between strains of Burkholderia mallei].

    PubMed

    Antonov, V A; Altukhova, V V; Savchenko, S S; Zamaraev, V S; Iliukhin, V I; Alekseev, V V

    2007-01-01

    Burkholderia mallei is a highly pathogenic microorganism for both humans and animals. In this work, the possibility of using genotyping methods to differentiate between strains of B. mallei was studied. A collection of 14 isolates of B. mallei was characterized using randomly amplified polymorphic DNA (RAPD) and multilocus sequence typing (MLST). RAPD was the best method used for detecting strain differences of B. mallei. It was suggested that this method would be an increasingly useful molecular epidemiological tool.

  8. Efficient Probabilistic Diagnostics for Electrical Power Systems

    NASA Technical Reports Server (NTRS)

    Mengshoel, Ole J.; Chavira, Mark; Cascio, Keith; Poll, Scott; Darwiche, Adnan; Uckun, Serdar

    2008-01-01

    We consider in this work the probabilistic approach to model-based diagnosis when applied to electrical power systems (EPSs). Our probabilistic approach is formally well-founded, as it is based on Bayesian networks and arithmetic circuits. We investigate the diagnostic task known as fault isolation, and pay special attention to meeting two of the main challenges, model development and real-time reasoning, often associated with real-world application of model-based diagnosis technologies. To address the challenge of model development, we develop a systematic approach to representing electrical power systems as Bayesian networks, supported by an easy-to-use specification language. To address the real-time reasoning challenge, we compile Bayesian networks into arithmetic circuits. Arithmetic circuit evaluation supports real-time diagnosis by being predictable and fast. In essence, we introduce a high-level EPS specification language from which Bayesian networks that can diagnose multiple simultaneous failures are auto-generated, and we illustrate the feasibility of using arithmetic circuits, compiled from Bayesian networks, for real-time diagnosis on real-world EPSs of interest to NASA. The experimental system is a real-world EPS, namely the Advanced Diagnostic and Prognostic Testbed (ADAPT) located at the NASA Ames Research Center. In experiments with the ADAPT Bayesian network, which currently contains 503 discrete nodes and 579 edges, we find high diagnostic accuracy in scenarios where one to three faults, both in components and sensors, were inserted. The time taken to compute the most probable explanation using arithmetic circuits has a small mean of 0.2625 milliseconds and standard deviation of 0.2028 milliseconds. In experiments with data from ADAPT we also show that arithmetic circuit evaluation substantially outperforms joint tree propagation and variable elimination, two alternative algorithms for diagnosis using Bayesian network inference.
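
    The idea of evaluating an arithmetic circuit compiled from a Bayesian network can be illustrated with a minimal sketch. The two-node network below (Fault -> Sensor), its probabilities, and all function names are hypothetical and are not taken from the ADAPT model. The circuit is the factored network polynomial; evaluating it with sums gives the probability of the evidence, and replacing sums with maximizations gives the value of the most probable explanation.

```python
# Hypothetical two-node network: Fault -> Sensor alarm. All numbers are made up.
P_FAULT = {True: 0.01, False: 0.99}               # prior P(fault)
P_ALARM_GIVEN_FAULT = {True: 0.95, False: 0.02}   # P(alarm = True | fault)


def sensor_param(alarm, fault):
    """Return P(alarm | fault) for either alarm state."""
    p = P_ALARM_GIVEN_FAULT[fault]
    return p if alarm else 1.0 - p


def indicators(observed=None):
    """Evidence indicators: all ones if unobserved, else one-hot on the observed state."""
    if observed is None:
        return {True: 1.0, False: 1.0}
    return {True: float(observed), False: float(not observed)}


def evaluate_circuit(lam_fault, lam_alarm, combine=sum):
    """Evaluate combine_f [ lam_f * P(f) * combine_s ( lam_s * P(s | f) ) ].

    combine=sum yields the probability of the evidence;
    combine=max yields the probability of the most probable explanation (MPE).
    """
    return combine(
        lam_fault[f] * P_FAULT[f]
        * combine(lam_alarm[s] * sensor_param(s, f) for s in (True, False))
        for f in (True, False)
    )


# Evidence: the alarm is on, the fault state is unobserved.
p_alarm = evaluate_circuit(indicators(), indicators(True))
p_fault_and_alarm = evaluate_circuit(indicators(True), indicators(True))
p_fault_given_alarm = p_fault_and_alarm / p_alarm
mpe_value = evaluate_circuit(indicators(), indicators(True), combine=max)

print(f"P(alarm) = {p_alarm:.4f}")
print(f"P(fault | alarm) = {p_fault_given_alarm:.4f}")
print(f"MPE value = {mpe_value:.4f}")
```

    Each evaluation is a fixed sequence of multiplications and additions (or maximizations), which is why the running time of such a circuit is predictable once the circuit has been compiled.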

  9. Evaluation of Flagging Criteria of United States Kidney Transplant Center Performance: How to Best Define Outliers?

    PubMed

    Schold, Jesse D; Miller, Charles M; Henry, Mitchell L; Buccini, Laura D; Flechner, Stuart M; Goldfarb, David A; Poggio, Emilio D; Andreoni, Kenneth A

    2017-06-01

    Scientific Registry of Transplant Recipients report cards of US organ transplant center performance are publicly available and used for quality oversight. Low center performance (LP) evaluations are associated with changes in practice including reduced transplant rates and increased waitlist removals. In 2014, Scientific Registry of Transplant Recipients implemented new Bayesian methodology to evaluate performance which was not adopted by Center for Medicare and Medicaid Services (CMS). In May 2016, CMS altered their performance criteria, reducing the likelihood of LP evaluations. Our aims were to evaluate incidence, survival rates, and volume of LP centers with Bayesian, historical (old-CMS) and new-CMS criteria using 6 consecutive program-specific reports (PSR), January 2013 to July 2015 among adult kidney transplant centers. Bayesian, old-CMS and new-CMS criteria identified 13.4%, 8.3%, and 6.1% LP PSRs, respectively. Over the 3-year period, 31.9% (Bayesian), 23.4% (old-CMS), and 19.8% (new-CMS) of centers had 1 or more LP evaluation. For small centers (<83 transplants/PSR), there were 4-fold additional LP evaluations (52 vs 13 PSRs) for 1-year mortality with Bayesian versus new-CMS criteria. For large centers (>183 transplants/PSR), there were 3-fold additional LP evaluations for 1-year mortality with Bayesian versus new-CMS criteria with median differences in observed and expected patient survival of -1.6% and -2.2%, respectively. A significant proportion of kidney transplant centers are identified as low performing with relatively small survival differences compared with expected. Bayesian criteria have significantly higher flagging rates and new-CMS criteria modestly reduce flagging. Critical appraisal of performance criteria is needed to assess whether quality oversight is meeting intended goals and whether further modifications could reduce risk aversion, more efficiently allocate resources, and increase transplant opportunities.

  10. Hip fracture in the elderly: a re-analysis of the EPIDOS study with causal Bayesian networks.

    PubMed

    Caillet, Pascal; Klemm, Sarah; Ducher, Michel; Aussem, Alexandre; Schott, Anne-Marie

    2015-01-01

    Hip fractures commonly result in permanent disability, institutionalization or death in the elderly. Existing hip-fracture prediction tools are underused in clinical practice, partly due to their lack of intuitive interpretation. By use of a graphical layer, Bayesian network models could increase the attractiveness of fracture prediction tools. Our aim was to study the potential contribution of a causal Bayesian network in this clinical setting. A logistic regression was performed as a standard control approach to check the robustness of the causal Bayesian network approach. EPIDOS is a multicenter study, conducted in an ambulatory care setting in five French cities between 1992 and 1996 and updated in 2010. The study included 7598 women aged 75 years or older, in whom fractures were assessed quarterly for 4 years. A causal Bayesian network and a logistic regression were performed on EPIDOS data to describe the major variables involved in hip fracture occurrences. Both models had similar association estimations and predictive performances. They detected gait speed and bone mineral density as the variables most involved in the fracture process. The causal Bayesian network showed that gait speed and bone mineral density were directly connected to fracture and seem to mediate the influence of all the other variables included in our model. The logistic regression approach detected multiple interactions involving psychotropic drug use, age and bone mineral density. Both approaches retrieved similar variables as predictors of hip fractures. However, the Bayesian network highlighted the whole web of relations between the variables involved in the analysis, suggesting a possible mechanism leading to hip fracture. According to the latter results, intervention focusing concomitantly on gait speed and bone mineral density may be necessary for an optimal prevention of hip fracture occurrence in elderly people.

  11. Bayesian characterization of uncertainty in species interaction strengths.

    PubMed

    Wolf, Christopher; Novak, Mark; Gitelman, Alix I

    2017-06-01

    Considerable effort has been devoted to the estimation of species interaction strengths. This effort has focused primarily on statistical significance testing and obtaining point estimates of parameters that contribute to interaction strength magnitudes, leaving the characterization of uncertainty associated with those estimates unconsidered. We consider a means of characterizing the uncertainty of a generalist predator's interaction strengths by formulating an observational method for estimating a predator's prey-specific per capita attack rates as a Bayesian statistical model. This formulation permits the explicit incorporation of multiple sources of uncertainty. A key insight is the informative nature of several so-called non-informative priors that have been used in modeling the sparse data typical of predator feeding surveys. We introduce to ecology a new neutral prior and provide evidence for its superior performance. We use a case study to consider the attack rates in a New Zealand intertidal whelk predator, and we illustrate not only that Bayesian point estimates can be made to correspond with those obtained by frequentist approaches, but also that estimation uncertainty as described by 95% intervals is more useful and biologically realistic using the Bayesian method. In particular, unlike in bootstrap confidence intervals, the lower bounds of the Bayesian posterior intervals for attack rates do not include zero when a predator-prey interaction is in fact observed. We conclude that the Bayesian framework provides a straightforward, probabilistic characterization of interaction strength uncertainty, enabling future considerations of both the deterministic and stochastic drivers of interaction strength and their impact on food webs.
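
    The contrast between bootstrap and posterior interval lower bounds can be illustrated with a simplified sketch. It reduces the problem to a single binomial proportion, which is not the authors' attack-rate model; the survey counts are hypothetical, and the symmetric Beta(1/3, 1/3) prior is chosen here purely for illustration.

```python
import random

random.seed(0)  # fixed seed so the illustration is reproducible

# Hypothetical sparse survey: 1 of 20 sampled predators was feeding on the prey.
n_surveyed, n_feeding = 20, 1
data = [1] * n_feeding + [0] * (n_surveyed - n_feeding)


def percentile(values, q):
    """Simple empirical percentile (no interpolation)."""
    ordered = sorted(values)
    return ordered[int(q * (len(ordered) - 1))]


# Nonparametric bootstrap of the observed feeding proportion.
boot = [
    sum(random.choices(data, k=n_surveyed)) / n_surveyed
    for _ in range(5000)
]
boot_lo, boot_hi = percentile(boot, 0.025), percentile(boot, 0.975)

# Posterior draws under a Beta(a, a) prior (conjugate to the binomial likelihood).
a = 1.0 / 3.0  # assumed prior parameter, for illustration only
post = [
    random.betavariate(n_feeding + a, n_surveyed - n_feeding + a)
    for _ in range(5000)
]
post_lo, post_hi = percentile(post, 0.025), percentile(post, 0.975)

print(f"bootstrap 95% interval: [{boot_lo:.3f}, {boot_hi:.3f}]")
print(f"posterior 95% interval: [{post_lo:.4f}, {post_hi:.3f}]")
```

    With a single observed event, many bootstrap resamples contain no event at all, so the percentile interval reaches zero even though the interaction was observed, whereas the posterior interval stays strictly positive.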

  12. Bayesian analysis of time-series data under case-crossover designs: posterior equivalence and inference.

    PubMed

    Li, Shi; Mukherjee, Bhramar; Batterman, Stuart; Ghosh, Malay

    2013-12-01

    Case-crossover designs are widely used to study short-term exposure effects on the risk of acute adverse health events. While the frequentist literature on this topic is vast, there is no Bayesian work in this general area. The contribution of this paper is twofold. First, the paper establishes Bayesian equivalence results that require characterization of the set of priors under which the posterior distributions of the risk ratio parameters based on a case-crossover and time-series analysis are identical. Second, the paper studies inferential issues under case-crossover designs in a Bayesian framework. Traditionally, a conditional logistic regression is used for inference on risk-ratio parameters in case-crossover studies. We consider instead a more general full likelihood-based approach which makes less restrictive assumptions on the risk functions. Formulation of a full likelihood leads to growth in the number of parameters proportional to the sample size. We propose a semi-parametric Bayesian approach using a Dirichlet process prior to handle the random nuisance parameters that appear in a full likelihood formulation. We carry out a simulation study to compare the Bayesian methods based on full and conditional likelihood with the standard frequentist approaches for case-crossover and time-series analysis. The proposed methods are illustrated through the Detroit Asthma Morbidity, Air Quality and Traffic study, which examines the association between acute asthma risk and ambient air pollutant concentrations. © 2013, The International Biometric Society.

  13. Detangling complex relationships in forensic data: principles and use of causal networks and their application to clinical forensic science.

    PubMed

    Lefèvre, Thomas; Lepresle, Aude; Chariot, Patrick

    2015-09-01

    The search for complex, nonlinear relationships and causality in data is hindered by the limited availability of techniques in many domains, including forensic science. Linear multivariable techniques are useful but present some shortcomings. In the past decade, Bayesian approaches have been introduced in forensic science. To date, authors have mainly focused on providing an alternative to classical techniques for quantifying effects and dealing with uncertainty. Causal networks, including Bayesian networks, can help detangle complex relationships in data. A Bayesian network estimates the joint probability distribution of data and graphically displays dependencies between variables and the circulation of information between these variables. In this study, we illustrate the interest in utilizing Bayesian networks for dealing with complex data through an application in clinical forensic science. Evaluating the functional impairment of assault survivors is a complex task for which few determinants are known. As routinely estimated in France, the duration of this impairment can be quantified by days of 'Total Incapacity to Work' ('Incapacité totale de travail,' ITT). In this study, we used a Bayesian network approach to identify the injury type, victim category and time to evaluation as the main determinants of the 'Total Incapacity to Work' (TIW). We computed the conditional probabilities associated with the TIW node and its parents. We compared this approach with a multivariable analysis, and the results of both techniques converged. Thus, Bayesian networks should be considered a reliable means to detangle complex relationships in data.

  14. Multi-virulence-locus sequence typing of Staphylococcus lugdunensis generates results consistent with a clonal population structure and is reliable for epidemiological typing.

    PubMed

    Didi, Jennifer; Lemée, Ludovic; Gibert, Laure; Pons, Jean-Louis; Pestel-Caron, Martine

    2014-10-01

    Staphylococcus lugdunensis is an emergent virulent coagulase-negative staphylococcus responsible for severe infections similar to those caused by Staphylococcus aureus. To understand its potentially pathogenic capacity and have further detailed knowledge of the molecular traits of this organism, 93 isolates from various geographic origins were analyzed by multi-virulence-locus sequence typing (MVLST), targeting seven known or putative virulence-associated loci (atlLR2, atlLR3, hlb, isdJ, SLUG_09050, SLUG_16930, and vwbl). The polymorphisms of the putative virulence-associated loci were moderate and comparable to those of the housekeeping genes analyzed by multilocus sequence typing (MLST). However, the MVLST scheme generated 43 virulence types (VTs) compared to 20 sequence types (STs) based on MLST, indicating that MVLST was significantly more discriminating (Simpson's index [D], 0.943). No hypervirulent lineage or cluster specific to carriage strains was defined. The results of multilocus sequence analysis of known and putative virulence-associated loci are consistent with a clonal population structure for S. lugdunensis, suggesting a coevolution of these genes with housekeeping genes. Indeed, the nonsynonymous to synonymous evolutionary substitutions (dN/dS) ratio, the Tajima's D test, and Single-likelihood ancestor counting (SLAC) analysis suggest that all virulence-associated loci were under negative selection, even atlLR2 (AtlL protein) and SLUG_16930 (FbpA homologue), for which the dN/dS ratios were higher. In addition, this analysis of virulence-associated loci allowed us to propose a trilocus sequence typing scheme based on the intragenic regions of atlLR3, isdJ, and SLUG_16930, which is more discriminant than MLST for studying short-term epidemiology and further characterizing the lineages of the rare but highly pathogenic S. lugdunensis. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
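
    The discriminatory power quoted above as Simpson's index (D) can be computed from the number of isolates assigned to each type. The sketch below assumes the Hunter-Gaston form of the index that is commonly used for typing schemes; the abstract does not state the formula, and the isolate labels are hypothetical.

```python
from collections import Counter


def simpsons_index(type_labels):
    """Simpson's index of diversity, D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)).

    type_labels: one type assignment (e.g. a VT or ST label) per isolate.
    D is the probability that two isolates drawn without replacement
    belong to different types.
    """
    counts = Counter(type_labels).values()
    n_total = sum(counts)
    if n_total < 2:
        raise ValueError("at least two isolates are required")
    same_type_pairs = sum(c * (c - 1) for c in counts)
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical example: six isolates falling into three types.
example_labels = ["VT1", "VT1", "VT1", "VT2", "VT2", "VT3"]
example_d = simpsons_index(example_labels)
print(f"D = {example_d:.3f}")
```

    A scheme that splits the same isolates into more, smaller groups yields a D closer to 1, which is the sense in which a typing method is described as more discriminating.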

  15. Colletotrichum gloeosporioides s.l. associated with Theobroma cacao and other plants in Panama: multilocus phylogenies distinguish host-associated pathogens from asymptomatic endophytes.

    PubMed

    Rojas, Enith I; Rehner, Stephen A; Samuels, Gary J; Van Bael, Sunshine A; Herre, Edward A; Cannon, Paul; Chen, Rui; Pang, Junfeng; Wang, Ruiwu; Zhang, Yaping; Peng, Yan-Qiong; Sha, Tao

    2010-01-01

    Colletotrichum interacts with numerous plant species overtly as symptomatic pathogens and cryptically as asymptomatic endophytes. It is not known whether these contrasting ecological modes are optional strategies expressed by individual Colletotrichum species or whether a species' ecology is explicitly pathogenic or endophytic. We explored this question by inferring relationships among 77 C. gloeosporioides s.l. strains isolated from asymptomatic leaves and from anthracnose lesions on leaves and fruits of Theobroma cacao (cacao) and other plants from Panamá. ITS and 5'-tef1 were used to assess diversity and to delineate operational taxonomic units for multilocus phylogenetic analysis. The ITS and 5'-tef1 screens concordantly resolved four strongly supported lineages, clades A-D: Clade A includes the ex type of C. gloeosporioides, clade B includes the ex type ITS sequence of C. boninense, and clades C and D are unidentified. The ITS yielded limited resolution and support within all clades, in particular the C. gloeosporioides clade (A), the focal lineage dealt with in this study. In contrast the 5'-tef1 screen differentiated nine distinctive haplotype subgroups within the C. gloeosporioides clade that were concordant with phylogenetic terminals resolved in a five-locus nuclear phylogeny. Among these were two phylogenetic species associated with symptomatic infections specific to either cacao or mango and five phylogenetic species isolated principally as asymptomatic infections from cacao and other plant hosts. We formally describe two new species, C. tropicale and C. ignotum, that are frequent asymptomatic associates of cacao and other Neotropical plant species, and epitypify C. theobromicola, which is associated with foliar and fruit anthracnose lesions of cacao. Asymptomatic Colletotrichum strains isolated from cacao plants grown in China included six distinct C. gloeosporioides clade taxa, only one of which is known to occur in the Neotropics.

  16. A Bayesian Approach to Real-Time Earthquake Phase Association

    NASA Astrophysics Data System (ADS)

    Benz, H.; Johnson, C. E.; Earle, P. S.; Patton, J. M.

    2014-12-01

    Real-time location of seismic events requires a robust and extremely efficient means of associating and identifying seismic phases with hypothetical sources. An association algorithm converts a series of phase arrival times into a catalog of earthquake hypocenters. The classical approach based on time-space stacking of the locus of possible hypocenters for each phase arrival using the principle of acoustic reciprocity has been in use now for many years. One of the most significant problems that has emerged over time with this approach is related to the extreme variations in seismic station density throughout the global seismic network. To address this problem we have developed a novel, Bayesian association algorithm, which looks at the association problem as a dynamically evolving complex system of "many to many relationships". While the end result must be an array of one to many relations (one earthquake, many phases), during the association process the situation is quite different. Both the evolving possible hypocenters and the relationships between phases and all nascent hypocenters are many to many (many earthquakes, many phases). The computational framework we are using to address this is a responsive, NoSQL graph database where the earthquake-phase associations are represented as intersecting Bayesian Learning Networks. The approach directly addresses the network inhomogeneity issue while at the same time allowing the inclusion of other kinds of data (e.g., seismic beams, station noise characteristics, priors on estimated location of the seismic source) by representing the locus of intersecting hypothetical loci for a given datum as joint probability density functions.

  17. Environmental exposure to manganese in air: Associations with tremor and motor function.

    PubMed

    Bowler, Rosemarie M; Beseler, Cheryl L; Gocheva, Vihra V; Colledge, Michelle; Kornblith, Erica S; Julian, Jaime R; Kim, Yangho; Bollweg, George; Lobdell, Danelle T

    2016-01-15

    Manganese (Mn) inhalation has been associated with neuropsychological and neurological sequelae in exposed workers. Few environmental epidemiologic studies have examined the potentially neurotoxic effects of Mn exposure in ambient air on motor function and hand tremor in adult community residents. Mn exposed residents were recruited in two Ohio towns: Marietta, a town near a ferro-manganese smelter, and East Liverpool, a town adjacent to a facility processing, crushing, screening, and packaging Mn products. Chronic (≥ 10 years) exposure to ambient air Mn in adult residents and effects on neuropsychological and neurological outcomes were investigated. Participants from Marietta (n=100) and East Liverpool (n=86) were combined for analyses. AERMOD dispersion modeling of fixed-site outdoor air monitoring data estimated Mn inhalation over a ten year period. Adult Mn-exposed residents' psychomotor ability was assessed using Finger Tapping, Hand Dynamometer, Grooved Pegboard, and the Computerized Adaptive Testing System (CATSYS) Tremor system. Bayesian structural equation modeling was used to assess associations between air-Mn and motor function and tremor. Air-Mn exposure was significantly correlated in bivariate analyses with the tremor test (CATSYS) for intensity, center frequency and harmonic index. The Bayesian path analysis model showed associations of air-Mn with the CATSYS non-dominant center frequency and harmonic index; while the Bayesian structural equation model revealed associations between air-Mn and lower Finger Tapping scores. Household income was significantly associated with motor dysfunction but not with tremor. Tremor and motor function were associated with higher exposure to airborne Mn. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. STBase: One Million Species Trees for Comparative Biology

    PubMed Central

    McMahon, Michelle M.; Deepak, Akshay; Fernández-Baca, David; Boss, Darren; Sanderson, Michael J.

    2015-01-01

    Comprehensively sampled phylogenetic trees provide the most compelling foundations for strong inferences in comparative evolutionary biology. Mismatches are common, however, between the taxa for which comparative data are available and the taxa sampled by published phylogenetic analyses. Moreover, many published phylogenies are gene trees, which cannot always be adapted immediately for species level comparisons because of discordance, gene duplication, and other confounding biological processes. A new database, STBase, lets comparative biologists quickly retrieve species level phylogenetic hypotheses in response to a query list of species names. The database consists of 1 million single- and multi-locus data sets, each with a confidence set of 1000 putative species trees, computed from GenBank sequence data for 413,000 eukaryotic taxa. Two bodies of theoretical work are leveraged to aid in the assembly of multi-locus concatenated data sets for species tree construction. First, multiply labeled gene trees are pruned to conflict-free singly-labeled species-level trees that can be combined between loci. Second, impacts of missing data in multi-locus data sets are ameliorated by assembling only decisive data sets. Data sets overlapping with the user’s query are ranked using a scheme that depends on user-provided weights for tree quality and for taxonomic overlap of the tree with the query. Retrieval times are independent of the size of the database, typically a few seconds. Tree quality is assessed by a real-time evaluation of bootstrap support on just the overlapping subtree. Associated sequence alignments, tree files and metadata can be downloaded for subsequent analysis. STBase provides a tool for comparative biologists interested in exploiting the most relevant sequence data available for the taxa of interest. 
    It may also serve as a prototype for future species tree oriented databases and as a resource for assembly of larger species phylogenies from precomputed trees. PMID:25679219

  19. Multilocus genotyping of human Giardia isolates suggests limited zoonotic transmission and association between assemblage B and flatulence in children.

    PubMed

    Lebbad, Marianne; Petersson, Ingvor; Karlsson, Lillemor; Botero-Kleiven, Silvia; Andersson, Jan O; Svenungsson, Bo; Svärd, Staffan G

    2011-08-01

    Giardia intestinalis is one of the most common diarrhea-related parasites in humans, where infection ranges from asymptomatic to acute or chronic disease. G. intestinalis consists of eight genetically distinct genotypes or assemblages, designated A-H, and assemblages A and B can infect humans. Giardiasis has been classified as a possible zoonotic disease but the role of animals in human disease transmission still needs to be proven. We tried to link different assemblages and sub-assemblages of G. intestinalis isolates from Swedish human patients to clinical symptoms and zoonotic transmission. Multilocus sequence-based genotyping of 207 human Giardia isolates using three gene loci: ß-giardin, glutamate dehydrogenase (gdh), and triose phosphate isomerase (tpi) was combined with assemblage-specific tpi PCRs. This analysis identified 73 patients infected with assemblage A, 128 with assemblage B, and six with mixed assemblages A+B. Multilocus genotypes (MLGs) were easily determined for the assemblage A isolates, and most patients with this genotype had apparently been infected through anthroponotic transmission. However, we also found evidence of limited zoonotic transmission of Giardia in Sweden, since a few domestic human infections involved the same assemblage A MLGs previously reported in Swedish cats and ruminants. Assemblage B was detected more frequently than assemblage A and it was also more common in patients with suspected treatment failure. However, a large genetic variability made determination of assemblage B MLGs problematic. Correlation between symptoms and assemblages was found only for flatulence, which was significantly more common in children less than six years of age infected with assemblage B. This study shows that certain assemblage A subtypes are potentially zoonotic and that flatulence is connected to assemblage B infections in young children. 
    Determination of MLGs from assemblages A and B can be a valuable tool in outbreak situations and to help identify possible zoonotic transmission.

  20. Multilocus Genotyping of Human Giardia Isolates Suggests Limited Zoonotic Transmission and Association between Assemblage B and Flatulence in Children

    PubMed Central

    Lebbad, Marianne; Petersson, Ingvor; Karlsson, Lillemor; Botero-Kleiven, Silvia; Andersson, Jan O.; Svenungsson, Bo; Svärd, Staffan G.

    2011-01-01

    Background Giardia intestinalis is one of the most common diarrhea-related parasites in humans, where infection ranges from asymptomatic to acute or chronic disease. G. intestinalis consists of eight genetically distinct genotypes or assemblages, designated A–H, and assemblages A and B can infect humans. Giardiasis has been classified as a possible zoonotic disease but the role of animals in human disease transmission still needs to be proven. We tried to link different assemblages and sub-assemblages of G. intestinalis isolates from Swedish human patients to clinical symptoms and zoonotic transmission. Methodology/Principal Findings Multilocus sequence-based genotyping of 207 human Giardia isolates using three gene loci: ß-giardin, glutamate dehydrogenase (gdh), and triose phosphate isomerase (tpi) was combined with assemblage-specific tpi PCRs. This analysis identified 73 patients infected with assemblage A, 128 with assemblage B, and six with mixed assemblages A+B. Multilocus genotypes (MLGs) were easily determined for the assemblage A isolates, and most patients with this genotype had apparently been infected through anthroponotic transmission. However, we also found evidence of limited zoonotic transmission of Giardia in Sweden, since a few domestic human infections involved the same assemblage A MLGs previously reported in Swedish cats and ruminants. Assemblage B was detected more frequently than assemblage A and it was also more common in patients with suspected treatment failure. However, a large genetic variability made determination of assemblage B MLGs problematic. Correlation between symptoms and assemblages was found only for flatulence, which was significantly more common in children less than six years of age infected with assemblage B. Conclusions/Significance This study shows that certain assemblage A subtypes are potentially zoonotic and that flatulence is connected to assemblage B infections in young children. 
    Determination of MLGs from assemblages A and B can be a valuable tool in outbreak situations and to help identify possible zoonotic transmission. PMID:21829745

  1. Ruminant Rhombencephalitis-Associated Listeria monocytogenes Alleles Linked to a Multilocus Variable-Number Tandem-Repeat Analysis Complex

    PubMed Central

    Balandyté, Lina; Brodard, Isabelle; Frey, Joachim; Oevermann, Anna; Abril, Carlos

    2011-01-01

    Listeria monocytogenes is among the most important food-borne pathogens and is well adapted to persist in the environment. To gain insight into the genetic relatedness and potential virulence of L. monocytogenes strains causing central nervous system (CNS) infections, we used multilocus variable-number tandem-repeat analysis (MLVA) to subtype 183 L. monocytogenes isolates, most from ruminant rhombencephalitis and some from human patients, food, and the environment. Allelic-profile-based comparisons grouped L. monocytogenes strains mainly into three clonal complexes and linked single-locus variants (SLVs). Clonal complex A essentially consisted of isolates from human and ruminant brain samples. All but one rhombencephalitis isolate from cattle were located in clonal complex A. In contrast, food and environmental isolates mainly clustered into clonal complex C, and none was classified as clonal complex A. Isolates of the two main clonal complexes (A and C) obtained by MLVA were analyzed by PCR for the presence of 11 virulence-associated genes (prfA, actA, inlA, inlB, inlC, inlD, inlE, inlF, inlG, inlJ, and inlC2H). Virulence gene analysis revealed significant differences in the actA, inlF, inlG, and inlJ allelic profiles between clinical isolates (complex A) and nonclinical isolates (complex C). The association of particular alleles of actA, inlF, and newly described alleles of inlJ with isolates from CNS infections (particularly rhombencephalitis) suggests that these virulence genes participate in neurovirulence of L. monocytogenes. The overall absence of inlG in clinical complex A and its presence in complex C isolates suggests that the InlG protein is more relevant for the survival of L. monocytogenes in the environment. PMID:21984240

  2. Prevalence and characterisation of Staphylococcus aureus causing community-acquired skin and soft tissue infections on Java and Bali, Indonesia.

    PubMed

    Santosaningsih, Dewi; Santoso, Sanarto; Setijowati, Nanik; Rasyid, Harun A; Budayanti, Nyoman S; Suata, Ketut; Widhyatmoko, Dicky B; Purwono, Priyo B; Kuntaman, Kuntaman; Damayanti, Damayanti; Prakoeswa, Cita R S; Laurens, Mitchell; van Nierop, Josephine W I; Nanninga, Geraldine L; Oudenes, Neline; de Regt, Michelle; Snijders, Susan V; Verbrugh, Henri A; Severin, Juliëtte A

    2018-01-01

    To define the role of Staphylococcus aureus in community settings among patients with skin and soft tissue infections (SSTI) in Indonesia. Staphylococcus aureus were cultured from anterior nares, throat and wounds of 567 ambulatory patients presenting with SSTI. The mecA gene and genes encoding Panton-Valentine leukocidin (PVL; lukF-PV and lukS-PV) and exfoliative toxin (ET; eta and etb) were determined by PCR. Clonal relatedness among methicillin-resistant S. aureus (MRSA) and PVL-positive S. aureus was analysed using multilocus variable-number tandem-repeat analysis (MLVA) typing, and multilocus sequence typing (MLST) for a subset of isolates. Staphylococcal cassette chromosome mec (SCCmec) was determined for all MRSA isolates. Moreover, determinants for S. aureus SSTI, and PVL/ET-positive vs PVL/ET-negative S. aureus were assessed. Staphylococcus aureus were isolated from SSTI wounds of 257 (45.3%) patients, eight (3.1%) of these were MRSA. Genes encoding PVL and ETs were detected in 21.8% and 17.5% of methicillin-susceptible S. aureus (MSSA), respectively. PVL-positive MRSA was not detected. Nasopharyngeal S. aureus carriage was an independent determinant for S. aureus SSTI (odds ratio [OR] 1.8). Primary skin infection (OR 5.4) and previous antibiotic therapy (OR 3.5) were associated with PVL-positive MSSA. Primary skin infection (OR 2.2) was the only factor associated with ET-positive MSSA. MLVA typing revealed two more prevalent MSSA clusters. One ST1-MRSA-SCCmec type IV isolate and a cluster of ST239-MRSA-SCCmec type III were found. Community-acquired SSTI in Indonesia was frequently caused by PVL-positive MSSA, and the hospital-associated ST239-MRSA may have spread from the hospital into the community. © 2017 John Wiley & Sons Ltd.

  3. Statistical modeling for Bayesian extrapolation of adult clinical trial information in pediatric drug evaluation.

    PubMed

    Gamalo-Siebers, Margaret; Savic, Jasmina; Basu, Cynthia; Zhao, Xin; Gopalakrishnan, Mathangi; Gao, Aijun; Song, Guochen; Baygani, Simin; Thompson, Laura; Xia, H Amy; Price, Karen; Tiwari, Ram; Carlin, Bradley P

    2017-07-01

    Children represent a large underserved population of "therapeutic orphans," as an estimated 80% of children are treated off-label. However, pediatric drug development often faces substantial challenges, including economic, logistical, technical, and ethical barriers, among others. Among many efforts trying to remove these barriers, increased recent attention has been paid to extrapolation; that is, the leveraging of available data from adults or older age groups to draw conclusions for the pediatric population. The Bayesian statistical paradigm is natural in this setting, as it permits the combining (or "borrowing") of information across disparate sources, such as the adult and pediatric data. In this paper, authored by the pediatric subteam of the Drug Information Association Bayesian Scientific Working Group and Adaptive Design Working Group, we develop, illustrate, and provide suggestions on Bayesian statistical methods that could be used to design improved pediatric development programs that use all available information in the most efficient manner. A variety of relevant Bayesian approaches are described, several of which are illustrated through 2 case studies: extrapolating adult efficacy data to expand the labeling for Remicade to include pediatric ulcerative colitis and extrapolating adult exposure-response information for antiepileptic drugs to pediatrics. Copyright © 2017 John Wiley & Sons, Ltd.

  4. Approximate Bayesian evaluations of measurement uncertainty

    NASA Astrophysics Data System (ADS)

    Possolo, Antonio; Bodnar, Olha

    2018-04-01

    The Guide to the Expression of Uncertainty in Measurement (GUM) includes formulas that produce an estimate of a scalar output quantity that is a function of several input quantities, and an approximate evaluation of the associated standard uncertainty. This contribution presents approximate, Bayesian counterparts of those formulas for the case where the output quantity is a parameter of the joint probability distribution of the input quantities, also taking into account any information about the value of the output quantity available prior to measurement expressed in the form of a probability distribution on the set of possible values for the measurand. The approximate Bayesian estimates and uncertainty evaluations that we present have a long history and illustrious pedigree, and provide sufficiently accurate approximations in many applications, yet are very easy to implement in practice. Differently from exact Bayesian estimates, which involve either (analytical or numerical) integrations, or Markov Chain Monte Carlo sampling, the approximations that we describe involve only numerical optimization and simple algebra. Therefore, they make Bayesian methods widely accessible to metrologists. We illustrate the application of the proposed techniques in several instances of measurement: isotopic ratio of silver in a commercial silver nitrate; odds of cryptosporidiosis in AIDS patients; height of a manometer column; mass fraction of chromium in a reference material; and potential-difference in a Zener voltage standard.

  5. Radiation dose reduction in computed tomography perfusion using spatial-temporal Bayesian methods

    NASA Astrophysics Data System (ADS)

    Fang, Ruogu; Raj, Ashish; Chen, Tsuhan; Sanelli, Pina C.

    2012-03-01

    In current computed tomography (CT) examinations, the associated X-ray radiation dose is of significant concern to patients and operators, especially in CT perfusion (CTP) imaging, which has a higher radiation dose due to its cine scanning technique. A simple and cost-effective means to perform the examinations is to lower the milliampere-seconds (mAs) parameter as low as reasonably achievable in data acquisition. However, lowering the mAs parameter will unavoidably increase data noise and degrade CT perfusion maps greatly if no adequate noise control is applied during image reconstruction. To capture the essential dynamics of CT perfusion, a simple spatial-temporal Bayesian method that uses a piecewise parametric model of the residual function is used, and then the model parameters are estimated from a Bayesian formulation of prior smoothness constraints on perfusion parameters. From the fitted residual function, reliable CTP parameter maps are obtained from low dose CT data. The merit of this scheme lies in the combination of an analytical piecewise residual function with a Bayesian framework using a simpler prior spatial constraint for CT perfusion application. On a dataset of 22 patients, this dynamic spatial-temporal Bayesian model yielded an increase in signal-to-noise ratio (SNR) of 78% and a decrease in mean-square-error (MSE) of 40% at low dose radiation of 43 mA.

  6. A Multi-Locus Genetic Risk Score for Primary Open-Angle Glaucoma (POAG) Variants Is Associated with POAG Risk in a Mediterranean Population: Inverse Correlations with Plasma Vitamin C and E Concentrations

    PubMed Central

    Ortega-Azorin, Carolina; Asensio-Marquez, Eva M.; Garcia-Medina, Jose J.; Pinazo-Duran, Maria D.; Coltell, Oscar; Ordovas, Jose M.

    2017-01-01

    Primary open-angle glaucoma (POAG) is a leading cause of blindness worldwide. The genetics of POAG are complex, and population-specific effects have been reported. Although many polymorphisms associated with POAG risk have been reported, few studies have analyzed their additive effects. We investigated, in a southern European Mediterranean population, the association between relevant POAG polymorphisms, identified by initial genome-wide association studies (GWASs) and POAG risk, both separately and as an aggregated multi-locus genetic risk score (GRS). Also, bearing in mind that oxidative stress is a factor increasingly recognized in the pathogenesis of POAG, we analyzed the potential association of the GRS with plasma concentrations of antioxidant vitamins (C and E). We carried out a case–control study including 391 POAG cases and 383 healthy controls, and analyzed four genetic polymorphisms (rs4656461-TMCO1, rs4236601-CAV1/CAV2, rs2157719-CDKN2B-AS1 and rs3088440-CDKN2A). An unweighted GRS including the four non-linked polymorphisms was constructed. A strong association between the GRS and POAG risk was found. When three categories of the GRS were considered, subjects in the top category of the GRS were 2.92 (95% confidence interval (CI): 1.79–4.77) times more likely to have POAG compared with participants in the bottom category (p < 0.001). Moreover, the GRS was inversely correlated with plasma vitamin C (p = 0.002) and vitamin E (p = 0.001) concentrations, even after additional adjustment for POAG status. In conclusion, we have found a strong association between the GRS and POAG risk in this Mediterranean population. While the additional correlation found between GRS and low levels of vitamins C and E does not indicate a causal relationship, it does suggest the need for new and deeper research into the effects of oxidative stress as a potential mechanism for those associations. PMID:29104244

  7. Compromise decision support problems for hierarchical design involving uncertainty

    NASA Astrophysics Data System (ADS)

    Vadde, S.; Allen, J. K.; Mistree, F.

    1994-08-01

    In this paper an extension to the traditional compromise Decision Support Problem (DSP) formulation is presented. Bayesian statistics is used in the formulation to model uncertainties associated with the information being used. In an earlier paper a compromise DSP that accounts for uncertainty using fuzzy set theory was introduced. The Bayesian Decision Support Problem is described in this paper. The method for hierarchical design is demonstrated by using this formulation to design a portal frame. The results are discussed and comparisons are made with those obtained using the fuzzy DSP. Finally, the efficacy of incorporating Bayesian statistics into the traditional compromise DSP formulation is discussed and some pending research issues are described. Our emphasis in this paper is on the method rather than the results per se.

  8. Additive genetic risk from five serotonin system polymorphisms interacts with interpersonal stress to predict depression.

    PubMed

    Vrshek-Schallhorn, Suzanne; Stroud, Catherine B; Mineka, Susan; Zinbarg, Richard E; Adam, Emma K; Redei, Eva E; Hammen, Constance; Craske, Michelle G

    2015-11-01

    Behavioral genetic research supports polygenic models of depression in which many genetic variations each contribute a small amount of risk, and prevailing diathesis-stress models suggest gene-environment interactions (G×E). Multilocus profile scores of additive risk offer an approach that is consistent with polygenic models of depression risk. In a first demonstration of this approach in a G×E predicting depression, we created an additive multilocus profile score from 5 serotonin system polymorphisms (1 each in the genes HTR1A, HTR2A, HTR2C, and 2 in TPH2). Analyses focused on 2 forms of interpersonal stress as environmental risk factors. Using 5 years of longitudinal diagnostic and life stress interviews from 387 emerging young adults in the Youth Emotion Project, survival analyses show that this multilocus profile score interacts with major interpersonal stressful life events to predict major depressive episode onsets (hazard ratio [HR] = 1.815, p = .007). Simultaneously, there was a significant protective effect of the profile score without a recent event (HR = 0.83, p = .030). The G×E effect with interpersonal chronic stress was not significant (HR = 1.15, p = .165). Finally, effect sizes for genetic factors examined ignoring stress suggested such an approach could lead to overlooking or misinterpreting genetic effects. Both the G×E effect and the protective simple main effect were replicated in a sample of early adolescent girls (N = 105). We discuss potential benefits of the multilocus genetic profile score approach and caveats for future research. (c) 2015 APA, all rights reserved.

  9. Bayesian Assessment of the Uncertainties of Estimates of a Conceptual Rainfall-Runoff Model Parameters

    NASA Astrophysics Data System (ADS)

    Silva, F. E. O. E.; Naghettini, M. D. C.; Fernandes, W.

    2014-12-01

    This paper evaluated the uncertainties associated with the estimation of the parameters of a conceptual rainfall-runoff model, through the use of Bayesian inference techniques by Monte Carlo simulation. The Pará River sub-basin, located in the upper São Francisco river basin, in southeastern Brazil, was selected for developing the studies. In this paper, we used the Rio Grande conceptual hydrologic model (EHR/UFMG, 2001) and the Markov Chain Monte Carlo simulation method named DREAM (VRUGT, 2008a). Two probabilistic models for the residuals were analyzed: (i) the classic Normal likelihood [r ~ N(0, σ²)]; and (ii) a generalized likelihood (SCHOUPS & VRUGT, 2010), in which it is assumed that the differences between observed and simulated flows are correlated, non-stationary, and distributed as a Skew Exponential Power density. The assumptions made for both models were checked to ensure that the estimation of uncertainties in the parameters was not biased. The results showed that the Bayesian approach proved to be adequate to the proposed objectives, enabling and reinforcing the importance of assessing the uncertainties associated with hydrological modeling.

  10. Bayesian Scalar-on-Image Regression with Application to Association Between Intracranial DTI and Cognitive Outcomes

    PubMed Central

    Huang, Lei; Goldsmith, Jeff; Reiss, Philip T.; Reich, Daniel S.; Crainiceanu, Ciprian M.

    2013-01-01

    Diffusion tensor imaging (DTI) measures water diffusion within white matter, allowing for in vivo quantification of brain pathways. These pathways often subserve specific functions, and impairment of those functions is often associated with imaging abnormalities. As a method for predicting clinical disability from DTI images, we propose a hierarchical Bayesian “scalar-on-image” regression procedure. Our procedure introduces a latent binary map that estimates the locations of predictive voxels and penalizes the magnitude of effect sizes in these voxels, thereby resolving the ill-posed nature of the problem. By inducing a spatial prior structure, the procedure yields a sparse association map that also maintains spatial continuity of predictive regions. The method is demonstrated on a simulation study and on a study of association between fractional anisotropy and cognitive disability in a cross-sectional sample of 135 multiple sclerosis patients. PMID:23792220

  11. Taxonomic evaluation of putative Streptomyces scabiei strains held in the ARS Culture Collection (NRRL) using multi-locus sequence analysis.

    PubMed

    Labeda, David P

    2016-03-01

    Multi-locus sequence analysis has been demonstrated to be a useful tool for identification of Streptomyces species and was previously applied to phylogenetically differentiate the type strains of species pathogenic on potatoes (Solanum tuberosum L.). The ARS Culture Collection (NRRL) contains 43 strains identified as Streptomyces scabiei deposited at various times since the 1950s and these were subjected to multi-locus sequence analysis utilising partial sequences of the house-keeping genes atpD, gyrB, recA, rpoB and trpB. Phylogenetic analyses confirmed the identity of 17 of these strains as Streptomyces scabiei, 9 of the strains as the potato-pathogenic species Streptomyces europaeiscabiei and 6 strains as potentially new phytopathogenic species. Of the 16 other strains, 12 were identified as members of previously described non-pathogenic Streptomyces species while the remaining 4 strains may represent heretofore unrecognised non-pathogenic species. This study demonstrated the value of this technique for the relatively rapid, simple and sensitive molecular identification of Streptomyces strains held in culture collections.

  12. Comparing energy sources for surgical ablation of atrial fibrillation: a Bayesian network meta-analysis of randomized, controlled trials.

    PubMed

    Phan, Kevin; Xie, Ashleigh; Kumar, Narendra; Wong, Sophia; Medi, Caroline; La Meir, Mark; Yan, Tristan D

    2015-08-01

    Simplified maze procedures involving radiofrequency, cryoenergy and microwave energy sources have been increasingly utilized for surgical treatment of atrial fibrillation as an alternative to the traditional cut-and-sew approach. In the absence of direct comparisons, a Bayesian network meta-analysis is another alternative to assess the relative effect of different treatments, using indirect evidence. A Bayesian meta-analysis of indirect evidence was performed using 16 published randomized trials identified from 6 databases. Rank probability analysis was used to rank each intervention in terms of their probability of having the best outcome. Sinus rhythm prevalence beyond the 12-month follow-up was similar between the cut-and-sew, microwave and radiofrequency approaches, which were all ranked better than cryoablation (respectively, 39, 36, and 25 vs 1%). The cut-and-sew maze was ranked worst in terms of mortality outcomes compared with microwave, radiofrequency and cryoenergy (2 vs 19, 34, and 24%, respectively). The cut-and-sew maze procedure was associated with significantly lower stroke rates compared with microwave ablation [odds ratio <0.01; 95% confidence interval 0.00, 0.82], and ranked the best in terms of pacemaker requirements compared with microwave, radiofrequency and cryoenergy (81 vs 14, 1, and <0.01%, respectively). Bayesian rank probability analysis shows that the cut-and-sew approach is associated with the best outcomes in terms of sinus rhythm prevalence and stroke outcomes, and remains the gold standard approach for AF treatment. Given the limitations of indirect comparison analysis, these results should be viewed with caution and not over-interpreted. © The Author 2014. Published by Oxford University Press on behalf of the European Association for Cardio-Thoracic Surgery. All rights reserved.

  13. Multivariable and Bayesian Network Analysis of Outcome Predictors in Acute Aneurysmal Subarachnoid Hemorrhage: Review of a Pure Surgical Series in the Post-International Subarachnoid Aneurysm Trial Era.

    PubMed

    Zador, Zsolt; Huang, Wendy; Sperrin, Matthew; Lawton, Michael T

    2018-06-01

    Following the International Subarachnoid Aneurysm Trial (ISAT), evolving treatment modalities for acute aneurysmal subarachnoid hemorrhage (aSAH) have changed the case mix of patients undergoing urgent surgical clipping. To update our knowledge on outcome predictors by analyzing admission parameters in a pure surgical series using variable importance ranking and machine learning. We reviewed a single surgeon's case series of 226 patients suffering from aSAH treated with urgent surgical clipping. Predictions were made using logistic regression models, and predictive performance was assessed using areas under the receiver operating curve (AUC). We established variable importance ranking using partial Nagelkerke R2 scores. Probabilistic associations between variables were depicted using Bayesian networks, a method of machine learning. Importance ranking showed that World Federation of Neurosurgical Societies (WFNS) grade and age were the most influential outcome prognosticators. Inclusion of only these 2 predictors was sufficient to maintain model performance compared to when all variables were considered (AUC = 0.8222, 95% confidence interval (CI): 0.7646-0.88 vs 0.8218, 95% CI: 0.7616-0.8821, respectively, DeLong's P = .992). Bayesian networks showed that age and WFNS grade were associated with several variables such as laboratory results and cardiorespiratory parameters. Our study is the first to report early outcomes and formal predictor importance ranking following aSAH in a post-ISAT surgical case series. Models showed good predictive power with fewer relevant predictors than in similar size series. Bayesian networks proved to be a powerful tool in visualizing the widespread association of the 2 key predictors with admission variables, explaining their importance and demonstrating the potential for hypothesis generation.

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gagné, Jonathan; Lafrenière, David; Doyon, René

    We present Bayesian Analysis for Nearby Young AssociatioNs II (BANYAN II), a modified Bayesian analysis for assessing the membership of later-than-M5 objects to any of several Nearby Young Associations (NYAs). In addition to using kinematic information (from sky position and proper motion), this analysis exploits 2MASS-WISE color-magnitude diagrams in which old and young objects follow distinct sequences. As an improvement over our earlier work, the spatial and kinematic distributions for each association are now modeled as ellipsoids whose axes need not be aligned with the Galactic coordinate axes, and we use prior probabilities matching the expected populations of the NYAs considered versus field stars. We present an extensive contamination analysis to characterize the performance of our new method. We find that Bayesian probabilities are generally representative of contamination rates, except when a parallax measurement is considered. In this case contamination rates become significantly smaller and hence Bayesian probabilities for NYA memberships are pessimistic. We apply this new algorithm to a sample of 158 objects from the literature that are either known to display spectroscopic signs of youth or have unusually red near-infrared colors for their spectral type. Based on our analysis, we identify 25 objects as new highly probable candidates to NYAs, including a new M7.5 bona fide member to Tucana-Horologium, making it the latest-type member. In addition, we reveal that a known L2γ dwarf is co-moving with a bright M5 dwarf, and we show for the first time that two of the currently known ultra red L dwarfs are strong candidates to the AB Doradus moving group. Several objects identified here as highly probable members to NYAs could be free-floating planetary-mass objects if their membership is confirmed.

  15. Bayesian modelling of the emission spectrum of the Joint European Torus Lithium Beam Emission Spectroscopy system.

    PubMed

    Kwak, Sehyun; Svensson, J; Brix, M; Ghim, Y-C

    2016-02-01

    A Bayesian model of the emission spectrum of the JET lithium beam has been developed to infer the intensity of the Li I (2p-2s) line radiation and associated uncertainties. The detected spectrum for each channel of the lithium beam emission spectroscopy system is here modelled by a single Li line modified by an instrumental function, Bremsstrahlung background, instrumental offset, and interference filter curve. Both the instrumental function and the interference filter curve are modelled with non-parametric Gaussian processes. All free parameters of the model, the intensities of the Li line, Bremsstrahlung background, and instrumental offset, are inferred using Bayesian probability theory with a Gaussian likelihood for photon statistics and electronic background noise. The prior distributions of the free parameters are chosen as Gaussians. Given these assumptions, the intensity of the Li line and corresponding uncertainties are analytically available using a Bayesian linear inversion technique. The proposed approach makes it possible to extract the intensity of Li line without doing a separate background subtraction through modulation of the Li beam.

  16. Surviving in mountain climate refugia: new insights from the genetic diversity and structure of the relict shrub Myrtus nivellei (Myrtaceae) in the Sahara Desert.

    PubMed

    Migliore, Jérémy; Baumel, Alex; Juin, Marianick; Fady, Bruno; Roig, Anne; Duong, Nathalie; Médail, Frédéric

    2013-01-01

    The identification of past glacial refugia has become a key topic for conservation under environmental change, since they contribute importantly to shaping current patterns of biodiversity. However, little attention has been paid so far to interglacial refugia despite their key role for the survival of relict species currently occurring in climate refugia. Here, we focus on the genetic consequences of range contraction on the relict populations of the evergreen shrub Myrtus nivellei, endemic in the Saharan mountains since at least the end of the last Green Sahara period, around 5.5 ka B.P. Multilocus genotypes (nuclear microsatellites and AFLP) were obtained from 215 individuals collected from 23 wadis (temporary rivers) in the three main mountain ranges in southern Algeria (the Hoggar, Tassili n'Ajjer and Tassili n'Immidir ranges). Identical genotypes were found in several plants growing far apart within the same wadis, a pattern taken as evidence of clonality. Multivariate analyses and Bayesian clustering revealed that genetic diversity was mainly structured among the mountain ranges, while low isolation by distance was observed within each mountain range. The range contraction induced by the last episode of aridification has likely increased the genetic isolation of the populations of M. nivellei, without greatly affecting the genetic diversity of the species as a whole. The pattern of genetic diversity observed here suggests that high connectivity may have prevailed during humid periods, which is consistent with recent paleoenvironmental reconstructions.

  17. Molecular Systematics of the Cape Parrot (Poicephalus robustus): Implications for Taxonomy and Conservation

    PubMed Central

    Coetzer, Willem G.; Downs, Colleen T.; Perrin, Mike R.; Willows-Munro, Sandi

    2015-01-01

    The taxonomic position of the Cape Parrot (Poicephalus robustus robustus) has been the focus of much debate. A number of authors suggest that the Cape Parrot should be viewed as a distinct species separate from the other two P. robustus subspecies (P. r. fuscicollis and P. r. suahelicus). These recommendations were based on morphological, ecological, and behavioural assessments. In this study we investigated the validity of these recommendations using multilocus DNA analyses. We genotyped 138 specimens from five Poicephalus species (P. cryptoxanthus, P. gulielmi, P. meyeri, P. robustus, and P. rueppellii) using 11 microsatellite loci. Additionally, two mitochondrial (cytochrome oxidase I gene and 16S ribosomal RNA) and one nuclear intron (intron 7 of the β-fibrinogen gene) markers were amplified and sequenced. Bayesian clustering analysis and pairwise FST analysis of microsatellite data identified P. r. robustus as genetically distinct from the other P. robustus subspecies. Phylogenetic and molecular clock analyses on sequence data also supported the microsatellite analyses, placing P. r. robustus in a distinct clade separate from the other P. robustus subspecies. Molecular clock analysis places the most recent common ancestor between P. r. robustus and P. r. fuscicollis / P. r. suahelicus at 2.13 to 2.67 million years ago. Our results all support previous recommendations to elevate the Cape Parrot to species level. This will facilitate better planning and implementation of international and local conservation management strategies for the Cape Parrot. PMID:26267261

  18. An Approximate Markov Model for the Wright-Fisher Diffusion and Its Application to Time Series Data.

    PubMed

    Ferrer-Admetlla, Anna; Leuenberger, Christoph; Jensen, Jeffrey D; Wegmann, Daniel

    2016-06-01

    The joint and accurate inference of selection and demography from genetic data is considered a particularly challenging question in population genetics, since both processes may lead to very similar patterns of genetic diversity. However, additional information for disentangling these effects may be obtained by observing changes in allele frequencies over multiple time points. Such data are common in experimental evolution studies, as well as in the comparison of ancient and contemporary samples. Leveraging this information, however, has been computationally challenging, particularly when considering multilocus data sets. To overcome these issues, we introduce a novel, discrete approximation for diffusion processes, termed mean transition time approximation, which preserves the long-term behavior of the underlying continuous diffusion process. We then derive this approximation for the particular case of inferring selection and demography from time series data under the classic Wright-Fisher model and demonstrate that our approximation is well suited to describe allele trajectories through time, even when only a few states are used. We then develop a Bayesian inference approach to jointly infer the population size and locus-specific selection coefficients with high accuracy and further extend this model to also infer the rates of sequencing errors and mutations. We finally apply our approach to recent experimental data on the evolution of drug resistance in influenza virus, identifying likely targets of selection and finding evidence for much larger viral population sizes than previously reported. Copyright © 2016 by the Genetics Society of America.
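
    For orientation, the classic Wright-Fisher model mentioned here can be written as a discrete Markov chain on allele counts. The sketch below is the textbook haploid chain with selection, not the authors' mean transition time approximation; the function name `wf_transition_matrix` and the parameter values are assumptions chosen for illustration.

```python
from math import comb


def wf_transition_matrix(N, s=0.0):
    """Transition matrix of a haploid Wright-Fisher chain with selection.

    State i is the number of copies of the focal allele among N individuals.
    Selection shifts the frequency p to p(1+s)/(1+ps); the next generation
    is then a Binomial(N, p_sel) draw.
    """
    P = []
    for i in range(N + 1):
        p = i / N
        p_sel = p * (1 + s) / (1 + p * s)
        row = [comb(N, j) * p_sel ** j * (1 - p_sel) ** (N - j)
               for j in range(N + 1)]
        P.append(row)
    return P


# Illustrative values: 20 individuals, selection coefficient 0.1.
N, s = 20, 0.1
P = wf_transition_matrix(N, s)
# Expected allele count one generation after starting from 10 copies.
expected_next = sum(j * P[10][j] for j in range(N + 1))
```

    The matrix has N + 1 states per locus, which is why exact likelihoods become costly for large populations and many loci, and why a coarser state space is attractive.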

  19. Estimating phylogenetic relationships despite discordant gene trees across loci: the species tree of a diverse species group of feather mites (Acari: Proctophyllodidae).

    PubMed

    Knowles, Lacey L; Klimov, Pavel B

    2011-11-01

    With the increased availability of multilocus sequence data, the lack of concordance of gene trees estimated for independent loci has focused attention on both the biological processes producing the discord and the methodologies used to estimate phylogenetic relationships. What has emerged is a suite of new analytical tools for phylogenetic inference--species tree approaches. In contrast to traditional phylogenetic methods that are stymied by the idiosyncrasies of gene trees, approaches for estimating species trees explicitly take into account the cause of discord among loci and, in the process, provide a direct estimate of phylogenetic history (i.e. the history of species divergence, not divergence of specific loci). We illustrate the utility of species tree estimates with an analysis of a diverse group of feather mites, the pinnatus species group (genus Proctophyllodes). Discord among four sequenced nuclear loci is consistent with theoretical expectations, given the short time separating speciation events (as evident by short internodes relative to terminal branch lengths in the trees). Nevertheless, many of the relationships are well resolved in a Bayesian estimate of the species tree; the analysis also highlights ambiguous aspects of the phylogeny that require additional loci. The broad utility of species tree approaches is discussed, and specifically, their application to groups with high speciation rates--a history of diversification with particular prevalence in host/parasite systems where species interactions can drive rapid diversification.

  20. Statistical Hypothesis Testing in Intraspecific Phylogeography: NCPA versus ABC

    PubMed Central

    Templeton, Alan R.

    2009-01-01

    Nested clade phylogeographic analysis (NCPA) and approximate Bayesian computation (ABC) have been used to test phylogeographic hypotheses. Multilocus NCPA tests null hypotheses, whereas ABC discriminates among a finite set of alternatives. The interpretive criteria of NCPA are explicit and allow complex models to be built from simple components. The interpretive criteria of ABC are ad hoc and require the specification of a complete phylogeographic model. The conclusions from ABC are often influenced by implicit assumptions arising from the many parameters needed to specify a complex model. These complex models confound many assumptions so that biological interpretations are difficult. Sampling error is accounted for in NCPA, but ABC ignores important sources of sampling error, which creates pseudo-statistical power. NCPA generates the full sampling distribution of its statistics, but ABC only yields local probabilities, which in turn make it impossible to distinguish between a good fitting model, a non-informative model, and an over-determined model. Both NCPA and ABC use approximations, but convergences of the approximations used in NCPA are well defined whereas those in ABC are not. NCPA can analyze a large number of locations, but ABC cannot. Finally, the dimensionality of the tested hypothesis is known in NCPA, but not for ABC. As a consequence, the “probabilities” generated by ABC are not true probabilities and are statistically non-interpretable. Accordingly, ABC should not be used for hypothesis testing, but simulation approaches are valuable when used in conjunction with NCPA or other methods that do not rely on highly parameterized models. PMID:19192182

  1. Population structure and phylogeography of the Gentoo Penguin (Pygoscelis papua) across the Scotia Arc.

    PubMed

    Levy, Hila; Clucas, Gemma V; Rogers, Alex D; Leaché, Adam D; Ciborowski, Kate L; Polito, Michael J; Lynch, Heather J; Dunn, Michael J; Hart, Tom

    2016-03-01

    Climate change, fisheries' pressure on penguin prey, and direct human disturbance of wildlife have all been implicated in causing large shifts in the abundance and distribution of penguins in the Southern Ocean. Without mark-recapture studies, understanding how colonies form and, by extension, how ranges shift is challenging. Genetic studies, particularly focused on newly established colonies, provide a snapshot of colonization and can reveal the extent to which shifts in abundance and occupancy result from changes in demographic rates (e.g., reproduction and survival) or migration among suitable patches of habitat. Here, we describe the population structure of a colonial seabird breeding across a large latitudinal range in the Southern Ocean. Using multilocus microsatellite genotype data from 510 Gentoo penguin (Pygoscelis papua) individuals from 14 colonies along the Scotia Arc and Antarctic Peninsula, together with mitochondrial DNA data, we find strong genetic differentiation between colonies north and south of the Polar Front, that coincides geographically with the taxonomic boundary separating the subspecies P. p. papua and P. p. ellsworthii. Using a discrete Bayesian phylogeographic approach, we show that southern Gentoos expanded from a possible glacial refuge in the center of their current range, colonizing regions to the north and south through rare, long-distance dispersal. Our findings show that this dispersal is important for new colony foundation and range expansion in a seabird species that ordinarily exhibits high levels of natal philopatry, though persistent oceanographic features serve as barriers to movement.

  2. Phylogeographic structure in long-tailed voles (Rodentia: Arvicolinae) belies the complex Pleistocene history of isolation, divergence, and recolonization of Northwest North America's fauna.

    PubMed

    Sawyer, Yadéeh E; Cook, Joseph A

    2016-09-01

    Quaternary climate fluctuations restructured biodiversity across North American high latitudes through repeated episodes of range contraction, population isolation and divergence, and subsequent expansion. Identifying how species responded to changing environmental conditions not only allows us to explore the mode and tempo of evolution in northern taxa, but also provides a basis for forecasting future biotic response across the highly variable topography of western North America. Using a multilocus approach under a Bayesian coalescent framework, we investigated the phylogeography of a wide-ranging mammal, the long-tailed vole, Microtus longicaudus. We focused on populations along the North Pacific Coast to refine our understanding of diversification by exploring the potentially compounding roles of multiple glacial refugia and more recent fragmentation of an extensive coastal archipelago. Through a combination of genetic data and species distribution models (SDMs), we found that historical climate variability influenced contemporary genetic structure, with multiple isolated locations of persistence (refugia) producing multiple divergent lineages (Beringian or northern, southeast Alaska or coastal, and southern or continental) during glacial advances. These vole lineages all occur along the North Pacific Coast where the confluence of numerous independent lineages in other species has produced overlapping zones of secondary contact, collectively a suture zone. Finally, we detected high levels of neoendemism due to complex island geography that developed in the last 10,000 years with the rising sea levels of the Holocene.

  3. Forest corridors maintain historical gene flow in a tiger metapopulation in the highlands of central India.

    PubMed

    Sharma, Sandeep; Dutta, Trishna; Maldonado, Jesús E; Wood, Thomas C; Panwar, Hemendra Singh; Seidensticker, John

    2013-09-22

    Understanding the patterns of gene flow of an endangered species metapopulation occupying a fragmented habitat is crucial for landscape-level conservation planning and devising effective conservation strategies. Tigers (Panthera tigris) are globally endangered and their populations are highly fragmented and exist in a few isolated metapopulations across their range. We used multi-locus genotypic data from 273 individual tigers (Panthera tigris tigris) from four tiger populations of the Satpura-Maikal landscape of central India to determine whether the corridors in this landscape are functional. This 45 000 km² landscape contains 17% of India's tiger population and 12% of its tiger habitat. We applied Bayesian and coalescent-based analyses to estimate contemporary and historical gene flow among these populations and to infer their evolutionary history. We found that the tiger metapopulation in central India has high rates of historical and contemporary gene flow. The tests for population history reveal that tigers populated central India about 10 000 years ago. Their population subdivision began about 1000 years ago and accelerated about 200 years ago owing to habitat fragmentation, leading to four spatially separated populations. These four populations have been in migration-drift equilibrium maintained by high gene flow. We found the highest rates of contemporary gene flow in populations that are connected by forest corridors. This information is highly relevant to conservation practitioners and policy makers, because deforestation, road widening and mining are imminent threats to these corridors.

  4. Forest corridors maintain historical gene flow in a tiger metapopulation in the highlands of central India

    PubMed Central

    Sharma, Sandeep; Dutta, Trishna; Maldonado, Jesús E.; Wood, Thomas C.; Panwar, Hemendra Singh; Seidensticker, John

    2013-01-01

    Understanding the patterns of gene flow of an endangered species metapopulation occupying a fragmented habitat is crucial for landscape-level conservation planning and devising effective conservation strategies. Tigers (Panthera tigris) are globally endangered and their populations are highly fragmented and exist in a few isolated metapopulations across their range. We used multi-locus genotypic data from 273 individual tigers (Panthera tigris tigris) from four tiger populations of the Satpura–Maikal landscape of central India to determine whether the corridors in this landscape are functional. This 45 000 km² landscape contains 17% of India's tiger population and 12% of its tiger habitat. We applied Bayesian and coalescent-based analyses to estimate contemporary and historical gene flow among these populations and to infer their evolutionary history. We found that the tiger metapopulation in central India has high rates of historical and contemporary gene flow. The tests for population history reveal that tigers populated central India about 10 000 years ago. Their population subdivision began about 1000 years ago and accelerated about 200 years ago owing to habitat fragmentation, leading to four spatially separated populations. These four populations have been in migration–drift equilibrium maintained by high gene flow. We found the highest rates of contemporary gene flow in populations that are connected by forest corridors. This information is highly relevant to conservation practitioners and policy makers, because deforestation, road widening and mining are imminent threats to these corridors. PMID:23902910

  5. Insights into the origin and distribution of biodiversity in the Brazilian Atlantic forest hot spot: a statistical phylogeographic study using a low-dispersal organism.

    PubMed

    Álvarez-Presas, M; Sánchez-Gracia, A; Carbayo, F; Rozas, J; Riutort, M

    2014-06-01

    The relative importance of the processes that generate and maintain biodiversity is a major and controversial topic in evolutionary biology with large implications for conservation management. The Atlantic Forest of Brazil, one of the world's richest biodiversity hot spots, is severely damaged by human activities. To formulate an efficient conservation policy, a good understanding of spatial and temporal biodiversity patterns and their underlying evolutionary mechanisms is required. With this aim, we performed a comprehensive phylogeographic study using a low-dispersal organism, the land planarian species Cephaloflexa bergi (Platyhelminthes, Tricladida). Analysing multi-locus DNA sequence variation under the Approximate Bayesian Computation framework, we evaluated two scenarios proposed to explain the diversity of Southern Atlantic Forest (SAF) region. We found that most sampled localities harbour high levels of genetic diversity, with lineages sharing common ancestors that predate the Pleistocene. Remarkably, we detected the molecular hallmark of the isolation-by-distance effect and little evidence of a recent colonization of SAF localities; nevertheless, some populations might result from very recent secondary contacts. We conclude that extant SAF biodiversity originated and has been shaped by complex interactions between ancient geological events and more recent evolutionary processes, whereas Pleistocene climate changes had a minor influence in generating present-day diversity. We also demonstrate that land planarians are an advantageous biological model for making phylogeographic and, particularly, fine-scale evolutionary inferences, and propose appropriate conservation policies.

  6. Distribution and factors associated with Salmonella enterica genotypes in a diverse population of humans and animals in Qatar using multi-locus sequence typing (MLST).

    PubMed

    Chang, Yu C; Scaria, Joy; Ibraham, Mariamma; Doiphode, Sanjay; Chang, Yung-Fu; Sultan, Ali; Mohammed, Hussni O

    2016-01-01

    Salmonella enterica is one of the most commonly reported causes of bacterial foodborne illness around the world. Understanding the sources of this pathogen and the associated factors that exacerbate its risk to humans will help in developing risk mitigation strategies. The genetic relatedness among Salmonella isolates recovered from human gastroenteritis cases and food animals in Qatar was investigated in the hope of shedding light on these sources, their possible transmission routes, and any associated factors. A repeat cross-sectional study was conducted in which the samples and associated data were collected from both populations (gastroenteritis cases and animals). Salmonella isolates were initially analyzed using multi-locus sequence typing (MLST) to investigate the genetic diversity and clonality. The relatedness among the isolates was assessed using the minimum spanning tree (MST). Twenty-seven different sequence types (STs) were identified in this study; among them, seven were novel, including ST1695, ST1696, ST1697, ST1698, ST1699, ST1702, and ST1703. The pattern of overall ST distribution was diverse; in particular, it was revealed that ST11 and ST19 were the most common sequence types, representing 29.5% and 11.5% of the whole population, respectively. In addition, 20 eBurst Groups (eBGs) were identified in our data, which indicates that ST11 and ST19 belonged to eBG4 and eBG1, respectively. In addition, the potential association between the putative risk factors and eBGs was evaluated. There was no significant clustering of these eBGs by season; however, a significant association was identified in terms of nationality in that Qataris were six times more likely to present with eBG1 compared to non-Qataris. In the MST analysis, four major clusters were presented, namely, ST11, ST19, ST16, and ST31. The linkages between the clusters alluded to a possible transmission route. The results of the study have provided insight into the ST distributions of S. enterica and their possible zoonotic associations in Qatar. Published by Elsevier Ltd.
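
    The minimum spanning tree step can be sketched as follows, assuming relatedness is measured as the number of differing loci between seven-locus allelic profiles and the tree is grown with Prim's algorithm. The abstract does not say which software or distance was used, and the profile labels and allele numbers below are hypothetical, not real sequence types.

```python
def hamming(p, q):
    """Number of loci at which two allelic profiles differ."""
    return sum(a != b for a, b in zip(p, q))


def minimum_spanning_tree(profiles):
    """Prim's algorithm on the complete graph of profiles.

    profiles maps a label to a tuple of allele numbers. Returns a list of
    (from_label, to_label, distance) edges in the order they were added.
    Ties are broken by label order, so the result is deterministic.
    """
    names = sorted(profiles)
    in_tree = {names[0]}
    edges = []
    while len(in_tree) < len(names):
        dist, u, v = min(
            (hamming(profiles[u], profiles[v]), u, v)
            for u in in_tree for v in names if v not in in_tree
        )
        edges.append((u, v, dist))
        in_tree.add(v)
    return edges


# Hypothetical seven-locus profiles (labels and alleles are made up).
profiles = {
    "A": (1, 1, 1, 1, 1, 1, 1),
    "B": (1, 1, 1, 1, 1, 1, 2),
    "C": (1, 1, 1, 1, 1, 2, 2),
    "D": (3, 3, 3, 1, 1, 2, 2),
}
edges = minimum_spanning_tree(profiles)
```

    Short edges link profiles that differ at a single locus, while a long edge such as the one joining "D" marks a more distant cluster.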

  7. Multilocus Patterns of Nucleotide Diversity, Linkage Disequilibrium and Demographic History of Norway Spruce [Picea abies (L.) Karst.]

    PubMed Central

    Heuertz, Myriam; De Paoli, Emanuele; Källman, Thomas; Larsson, Hanna; Jurman, Irena; Morgante, Michele; Lascoux, Martin; Gyllenstrand, Niclas

    2006-01-01

    DNA polymorphism at 22 loci was studied in an average of 47 Norway spruce [Picea abies (L.) Karst.] haplotypes sampled in seven populations representative of the natural range. The overall nucleotide variation was limited, being lower than that observed in most plant species so far studied. Linkage disequilibrium was also restricted and did not extend beyond a few hundred base pairs. All populations, with the exception of the Romanian population, could be divided into two main domains, a Baltico–Nordic and an Alpine one. Mean Tajima's D and Fay and Wu's H across loci were both negative, indicating the presence of an excess of both rare and high-frequency-derived variants compared to the expected frequency spectrum in a standard neutral model. Multilocus neutrality tests based on D and H led to the rejection of the standard neutral model and exponential growth in the whole population as well as in the two main domains. On the other hand, in all three cases the data are compatible with a severe bottleneck occurring some hundreds of thousands of years ago. Hence, demographic departures from equilibrium expectations and population structure will have to be accounted for when detecting selection at candidate genes and in association mapping studies, respectively. PMID:17057229
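
    Tajima's D, one of the neutrality statistics reported here, compares the mean pairwise difference with the number of segregating sites. The sketch below uses the standard formula from Tajima (1989); the helper names and the toy haplotypes are assumptions for illustration and are unrelated to the spruce data.

```python
from itertools import combinations
from math import sqrt


def summary_stats(haplotypes):
    """Return (n, S, pi) for equal-length aligned haplotype strings.

    S is the number of segregating sites; pi is the mean number of
    pairwise differences between haplotypes.
    """
    n = len(haplotypes)
    length = len(haplotypes[0])
    S = sum(1 for pos in range(length)
            if len({h[pos] for h in haplotypes}) > 1)
    total = sum(sum(a != b for a, b in zip(h1, h2))
                for h1, h2 in combinations(haplotypes, 2))
    pi = total / (n * (n - 1) / 2)
    return n, S, pi


def tajimas_d(n, S, pi):
    """Tajima's D from sample size n, segregating sites S and diversity pi."""
    if S == 0:
        raise ValueError("Tajima's D is undefined without segregating sites")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    # Difference between the two estimators of theta, scaled by its SD.
    return (pi - S / a1) / sqrt(e1 * S + e2 * S * (S - 1))


# Toy alignment of four haplotypes (made-up data).
n, S, pi = summary_stats(["AAAA", "AAAT", "AATT", "ATTT"])
d_toy = tajimas_d(n, S, pi)
```

    A negative D means pi is smaller than S/a1, which is the excess of rare variants that the abstract describes.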

  8. Whole genome sequencing of Salmonella Typhimurium illuminates distinct outbreaks caused by an endemic multi-locus variable number tandem repeat analysis type in Australia, 2014.

    PubMed

    Phillips, Anastasia; Sotomayor, Cristina; Wang, Qinning; Holmes, Nadine; Furlong, Catriona; Ward, Kate; Howard, Peter; Octavia, Sophie; Lan, Ruiting; Sintchenko, Vitali

    2016-09-15

    Salmonella Typhimurium (STM) is an important cause of foodborne outbreaks worldwide. Subtyping of STM remains critical to outbreak investigation, yet current techniques (e.g. multilocus variable number tandem repeat analysis, MLVA) may provide insufficient discrimination. Whole genome sequencing (WGS) offers potentially greater discriminatory power to support infectious disease surveillance. We performed WGS on 62 STM isolates of a single, endemic MLVA type associated with two epidemiologically independent, food-borne outbreaks along with sporadic cases in New South Wales, Australia, during 2014. Genomes of case and environmental isolates were sequenced using HiSeq (Illumina) and the genetic distance between them was assessed by single nucleotide polymorphism (SNP) analysis. SNP analysis was compared to the epidemiological context. The WGS analysis supported epidemiological evidence and genomes of within-outbreak isolates were nearly identical. Sporadic cases differed from outbreak cases by a small number of SNPs, although their close relationship to outbreak cases may represent an unidentified common food source that may warrant further public health follow up. Previously unrecognised mini-clusters were detected. WGS of STM can discriminate foodborne community outbreaks within a single endemic MLVA clone. Our findings support the translation of WGS into public health laboratory surveillance of salmonellosis.

  9. Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015

    PubMed Central

    Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdősi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

    2017-01-01

    Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. PMID:28277220

  10. 16S rRNA Gene Sequencing, Multilocus Sequence Analysis, and Mass Spectrometry Identification of the Proposed New Species “Clostridium neonatale”

    PubMed Central

    Bouvet, Philippe; Ferraris, Laurent; Dauphin, Brunhilde; Popoff, Michel-Robert; Butel, Marie Jose

    2014-01-01

    In 2002, an outbreak of necrotizing enterocolitis in a Canadian neonatal intensive care unit was associated with a proposed novel species of Clostridium, “Clostridium neonatale.” To date, there are no data about the isolation, identification, or clinical significance of this species. Additionally, C. neonatale has not been formally classified as a new species, rendering its identification challenging. Indeed, the C. neonatale 16S rRNA gene sequence shows high similarity to another Clostridium species involved in neonatal necrotizing enterocolitis, Clostridium butyricum. By performing a polyphasic study combining phylogenetic analysis (16S rRNA gene sequencing and multilocus sequence analysis) and phenotypic characterization with mass spectrometry, we demonstrated that C. neonatale is a new species within the Clostridium genus sensu stricto, for which we propose the name Clostridium neonatale sp. nov. Now that the status of C. neonatale has been clarified, matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) can be used for better differential identification of C. neonatale and C. butyricum clinical isolates. This is necessary to precisely define the role and clinical significance of C. neonatale, a species that may have been misidentified and underrepresented during previous neonatal necrotizing enterocolitis studies. PMID:25232167

  11. Online Variational Bayesian Filtering-Based Mobile Target Tracking in Wireless Sensor Networks

    PubMed Central

    Zhou, Bingpeng; Chen, Qingchun; Li, Tiffany Jing; Xiao, Pei

    2014-01-01

    The received signal strength (RSS)-based online tracking for a mobile node in wireless sensor networks (WSNs) is investigated in this paper. Firstly, a multi-layer dynamic Bayesian network (MDBN) is introduced to characterize the target mobility with either directional or undirected movement. In particular, it is proposed to employ the Wishart distribution to approximate the time-varying RSS measurement precision's randomness due to the target movement. It is shown that the proposed MDBN offers a more general analysis model via incorporating the underlying statistical information of both the target movement and observations, which can be utilized to improve the online tracking capability by exploiting the Bayesian statistics. Secondly, based on the MDBN model, a mean-field variational Bayesian filtering (VBF) algorithm is developed to realize the online tracking of a mobile target in the presence of nonlinear observations and time-varying RSS precision, wherein the traditional Bayesian filtering scheme cannot be directly employed. Thirdly, a joint optimization between the real-time velocity and its prior expectation is proposed to enable online velocity tracking in the proposed online tracking scheme. Finally, the associated Bayesian Cramer–Rao Lower Bound (BCRLB) analysis and numerical simulations are conducted. Our analysis unveils that, by exploiting the potential state information via the general MDBN model, the proposed VBF algorithm provides a promising solution to the online tracking of a mobile node in WSNs. In addition, it is shown that the final tracking accuracy linearly scales with its expectation when the RSS measurement precision is time-varying. PMID:25393784

  12. Bayesian analysis of rare events

    NASA Astrophysics Data System (ADS)

    Straub, Daniel; Papaioannou, Iason; Betz, Wolfgang

    2016-06-01

    In many areas of engineering and science there is an interest in predicting the probability of rare events, in particular in applications related to safety and security. Increasingly, such predictions are made through computer models of physical systems in an uncertainty quantification framework. Additionally, with advances in IT, monitoring and sensor technology, an increasing amount of data on the performance of the systems is collected. This data can be used to reduce uncertainty, improve the probability estimates and consequently enhance the management of rare events and associated risks. Bayesian analysis is the ideal method to include the data into the probabilistic model. It ensures a consistent probabilistic treatment of uncertainty, which is central in the prediction of rare events, where extrapolation from the domain of observation is common. We present a framework for performing Bayesian updating of rare event probabilities, termed BUS. It is based on a reinterpretation of the classical rejection-sampling approach to Bayesian analysis, which enables the use of established methods for estimating probabilities of rare events. By drawing upon these methods, the framework makes use of their computational efficiency. These methods include the First-Order Reliability Method (FORM), tailored importance sampling (IS) methods and Subset Simulation (SuS). In this contribution, we briefly review these methods in the context of the BUS framework and investigate their applicability to Bayesian analysis of rare events in different settings. We find that, for some applications, FORM can be highly efficient and is surprisingly accurate, enabling Bayesian analysis of rare events with just a few model evaluations. In a general setting, BUS implemented through IS and SuS is more robust and flexible.
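
    The rejection-sampling view of Bayesian updating that underlies BUS can be illustrated with a toy sketch. Assumptions: a standard normal prior on a scalar parameter, one noisy observation, a likelihood scaled so that its maximum is 1, and crude Monte Carlo in place of the FORM, importance sampling, or Subset Simulation methods the abstract discusses. The function name `bus_rejection` and all numbers are hypothetical.

```python
import math
import random


def bus_rejection(n_samples, y_obs, noise_sd, threshold, seed=0):
    """Posterior mean and event probability by rejection sampling.

    Prior: theta ~ N(0, 1). Observation: y_obs = theta + N(0, noise_sd^2).
    A prior sample is accepted when u <= c * L(theta), with u uniform on
    [0, 1]; here c = 1 because the unnormalised likelihood peaks at 1.
    Accepted samples follow the posterior, so the event {theta > threshold}
    is estimated as a fraction of the accepted samples.
    """
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_samples):
        theta = rng.gauss(0.0, 1.0)
        u = rng.random()
        like = math.exp(-0.5 * ((y_obs - theta) / noise_sd) ** 2)
        if u <= like:
            accepted.append(theta)
    post_mean = sum(accepted) / len(accepted)
    p_event = sum(1 for t in accepted if t > threshold) / len(accepted)
    acceptance_rate = len(accepted) / n_samples
    return post_mean, p_event, acceptance_rate


# Illustrative run: observation 1.0 with unit noise, event theta > 2.
post_mean, p_event, acceptance_rate = bus_rejection(200_000, 1.0, 1.0, 2.0)
```

    In this conjugate toy case the exact posterior is N(0.5, 0.5), so the estimates can be checked by hand. The key point is that acceptance defines an event in the augmented (theta, u) space, which is what lets rare-event estimators be reused for the updating step.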

  13. Examination of tetrahydrobiopterin pathway genes in autism.

    PubMed

    Schnetz-Boutaud, N C; Anderson, B M; Brown, K D; Wright, H H; Abramson, R K; Cuccaro, M L; Gilbert, J R; Pericak-Vance, M A; Haines, J L

    2009-11-01

    Autism is a complex disorder with a high degree of heritability and significant phenotypic and genotypic heterogeneity. Although candidate gene studies and genome-wide screens have failed to identify major causal loci associated with autism, numerous studies have proposed association with several variations in genes in the dopaminergic and serotonergic pathways. Because tetrahydrobiopterin (BH4) is the essential cofactor in the synthesis of these two neurotransmitters, we genotyped 25 SNPs in nine genes of the BH4 pathway in a total of 403 families. Significant nominal association was detected in the gene for 6-pyruvoyl-tetrahydropterin synthase, PTS (chromosome 11), with P = 0.009; this result was not restricted to an affected male-only subset. Multilocus interaction was detected in the BH4 pathway alone, but not across the serotonin, dopamine and BH4 pathways.

  14. Bayesian structured additive regression modeling of epidemic data: application to cholera

    PubMed Central

    2012-01-01

    Background A significant interest in spatial epidemiology lies in identifying associated risk factors which enhance the risk of infection. Most studies, however, make no, or limited, use of the spatial structure of the data, as well as possible nonlinear effects of the risk factors. Methods We develop a Bayesian Structured Additive Regression model for cholera epidemic data. Model estimation and inference are based on a fully Bayesian approach via Markov Chain Monte Carlo (MCMC) simulations. The model is applied to cholera epidemic data in the Kumasi Metropolis, Ghana. Proximity to refuse dumps, density of refuse dumps, and proximity to potential cholera reservoirs were modeled as continuous functions; presence of slum settlers and population density were modeled as fixed effects, whereas spatial references to the communities were modeled as structured and unstructured spatial effects. Results We observe that the risk of cholera is associated with slum settlements and high population density. The risk of cholera is equal and lower for communities with fewer refuse dumps, but variable and higher for communities with more refuse dumps. The risk is also lower for communities distant from refuse dumps and potential cholera reservoirs. The results also indicate distinct spatial variation in the risk of cholera infection. Conclusion The study highlights the usefulness of the Bayesian semi-parametric regression model in analyzing public health data. These findings could serve as novel information to help health planners and policy makers in making effective decisions to control or prevent cholera epidemics. PMID:22866662

  15. A Bayesian Framework for Generalized Linear Mixed Modeling Identifies New Candidate Loci for Late-Onset Alzheimer’s Disease

    PubMed Central

    Wang, Xulong; Philip, Vivek M.; Ananda, Guruprasad; White, Charles C.; Malhotra, Ankit; Michalski, Paul J.; Karuturi, Krishna R. Murthy; Chintalapudi, Sumana R.; Acklin, Casey; Sasner, Michael; Bennett, David A.; De Jager, Philip L.; Howell, Gareth R.; Carter, Gregory W.

    2018-01-01

    Recent technical and methodological advances have greatly enhanced genome-wide association studies (GWAS). The advent of low-cost, whole-genome sequencing facilitates high-resolution variant identification, and the development of linear mixed models (LMM) allows improved identification of putatively causal variants. While essential for correcting false positive associations due to sample relatedness and population stratification, LMMs have commonly been restricted to quantitative variables. However, phenotypic traits in association studies are often categorical, coded as binary case-control or ordered variables describing disease stages. To address these issues, we have devised a method for genomic association studies that implements a generalized LMM (GLMM) in a Bayesian framework, called Bayes-GLMM. Bayes-GLMM has four major features: (1) support of categorical, binary, and quantitative variables; (2) cohesive integration of previous GWAS results for related traits; (3) correction for sample relatedness by mixed modeling; and (4) model estimation by both Markov chain Monte Carlo sampling and maximal likelihood estimation. We applied Bayes-GLMM to the whole-genome sequencing cohort of the Alzheimer’s Disease Sequencing Project. This study contains 570 individuals from 111 families, each with Alzheimer’s disease diagnosed at one of four confidence levels. Using Bayes-GLMM we identified four variants in three loci significantly associated with Alzheimer’s disease. Two variants, rs140233081 and rs149372995, lie between PRKAR1B and PDGFA. The coded proteins are localized to the glial-vascular unit, and PDGFA transcript levels are associated with Alzheimer’s disease-related neuropathology. In summary, this work provides implementation of a flexible, generalized mixed-model approach in a Bayesian framework for association studies. PMID:29507048

  16. Retrodiction for Bayesian multiple-hypothesis/multiple-target tracking in densely cluttered environment

    NASA Astrophysics Data System (ADS)

    Koch, Wolfgang

    1996-05-01

    Sensor data processing in a dense target/dense clutter environment is inevitably confronted with data association conflicts which correspond with the multiple hypothesis character of many modern approaches (MHT: multiple hypothesis tracking). In this paper we analyze the efficiency of retrodictive techniques that generalize standard fixed interval smoothing to MHT applications. 'Delayed estimation' based on retrodiction provides uniquely interpretable and accurate trajectories from ambiguous MHT output if a certain time delay is tolerated. In a Bayesian framework the theoretical background of retrodiction and its intimate relation to Bayesian MHT is sketched. By a simulated example with two closely-spaced targets, relatively low detection probabilities, and rather high false return densities, we demonstrate the benefits of retrodiction and quantitatively discuss the achievable track accuracies and the time delays involved for typical radar parameters.

  17. Modeling Soot Oxidation and Gasification with Bayesian Statistics

    DOE PAGES

    Josephson, Alexander J.; Gaffin, Neal D.; Smith, Sean T.; ...

    2017-08-22

    This paper presents a statistical method for model calibration using data collected from literature. The method is used to calibrate parameters for global models of soot consumption in combustion systems. This consumption is broken into two different submodels: first for oxidation where soot particles are attacked by certain oxidizing agents; second for gasification where soot particles are attacked by H2O or CO2 molecules. Rate data were collected from 19 studies in the literature and evaluated using Bayesian statistics to calibrate the model parameters. Bayesian statistics are valued in their ability to quantify uncertainty in modeling. The calibrated consumption model with quantified uncertainty is presented here along with a discussion of associated implications. The oxidation results are found to be consistent with previous studies. Significant variation is found in the CO2 gasification rates.

  18. Modeling Soot Oxidation and Gasification with Bayesian Statistics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Josephson, Alexander J.; Gaffin, Neal D.; Smith, Sean T.

    This paper presents a statistical method for model calibration using data collected from literature. The method is used to calibrate parameters for global models of soot consumption in combustion systems. This consumption is broken into two different submodels: first for oxidation where soot particles are attacked by certain oxidizing agents; second for gasification where soot particles are attacked by H2O or CO2 molecules. Rate data were collected from 19 studies in the literature and evaluated using Bayesian statistics to calibrate the model parameters. Bayesian statistics are valued in their ability to quantify uncertainty in modeling. The calibrated consumption model with quantified uncertainty is presented here along with a discussion of associated implications. The oxidation results are found to be consistent with previous studies. Significant variation is found in the CO2 gasification rates.

  19. Forensic timber identification: a case study of a CITES listed species, Gonystylus bancanus (Thymelaeaceae).

    PubMed

    Ng, Kevin Kit Siong; Lee, Soon Leong; Tnah, Lee Hong; Nurul-Farhanah, Zakaria; Ng, Chin Hong; Lee, Chai Ting; Tani, Naoki; Diway, Bibian; Lai, Pei Sing; Khoo, Eyen

    2016-07-01

    Illegal logging and smuggling of Gonystylus bancanus (Thymelaeaceae) pose a serious threat to this fragile, valuable peat swamp timber species. Using G. bancanus as a case study, DNA markers were used to develop identification databases at the species, population and individual level. The species level database for Gonystylus comprised an rDNA (ITS2) and two cpDNA (trnH-psbA and trnL) markers based on a database of 20 Gonystylus species. When concatenated, taxonomic species recognition was achieved with a resolution of 90% (18 out of the 20 species). In addition, based on 17 natural populations of G. bancanus throughout West (Peninsular Malaysia) and East (Sabah and Sarawak) Malaysia, population and individual identification databases were developed using cpDNA and STR markers respectively. A haplotype distribution map for Malaysia was generated using six cpDNA markers, resulting in 12 unique multilocus haplotypes, from 24 informative intraspecific variable sites. These unique haplotypes suggest a clear genetic structuring of West and East regions. A simulation procedure based on the composition of the samples was used to test whether a suspected sample conformed to a given regional origin. Overall, the observed type I and II errors of the databases showed good concordance with the predicted 5% threshold, which indicates that the databases were useful in revealing provenance and establishing conformity of samples from West and East Malaysia. Sixteen STRs were used to develop the DNA profiling databases for individual identification. Bayesian clustering analyses divided the 17 populations into two main genetic clusters, corresponding to the regions of West and East Malaysia. Population substructuring (K=2) was observed within each region. After removal of bias resulting from sampling effects and population subdivision, conservativeness tests showed that the West and East Malaysia databases were conservative. This suggests that both databases can be used independently for random match probability estimation within respective regions. The reliability of the databases was further determined by independent self-assignment tests based on the likelihood of each individual's multilocus genotype occurring in each identified population, genetic cluster and region with an average percentage of correctly assigned individuals of 54.80%, 99.60% and 100% respectively. Thus, after appropriate validation, the genetic identification databases developed for G. bancanus in this study could support forensic applications and help safeguard this valuable species into the future. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
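
    The self-assignment test described above can be illustrated with a minimal sketch. It assumes Hardy-Weinberg proportions within each reference group and independence across loci, which the abstract does not state; the allele labels, frequencies, group names and the frequency floor are all hypothetical and are not taken from the study.

```python
import math
from typing import Dict, List, Tuple

AlleleFreqs = Dict[str, float]      # allele label -> frequency at one locus
Genotype = List[Tuple[str, str]]    # one (allele, allele) pair per locus

MIN_FREQ = 1e-3  # assumed floor for alleles absent from a reference sample


def log_genotype_likelihood(genotype: Genotype,
                            locus_freqs: List[AlleleFreqs]) -> float:
    """Log-likelihood of a multilocus genotype in one reference group,
    assuming Hardy-Weinberg proportions and independent loci."""
    if len(genotype) != len(locus_freqs):
        raise ValueError("genotype and reference must cover the same loci")
    total = 0.0
    for (a, b), freqs in zip(genotype, locus_freqs):
        p = max(freqs.get(a, 0.0), MIN_FREQ)
        q = max(freqs.get(b, 0.0), MIN_FREQ)
        # Homozygote: p^2; heterozygote: 2pq.
        total += math.log(p * p if a == b else 2.0 * p * q)
    return total


def assign(genotype: Genotype,
           reference: Dict[str, List[AlleleFreqs]]) -> str:
    """Return the reference group in which the genotype is most likely."""
    return max(reference,
               key=lambda group: log_genotype_likelihood(genotype, reference[group]))


# Hypothetical two-locus reference frequencies for two regions.
reference = {
    "West": [{"A": 0.8, "B": 0.2}, {"C": 0.7, "D": 0.3}],
    "East": [{"A": 0.2, "B": 0.8}, {"C": 0.4, "D": 0.6}],
}
sample = [("A", "A"), ("C", "D")]
print(assign(sample, reference))
```

    Under the same assumptions, exponentiating the log-likelihood gives the expected frequency of that genotype in the group, which is the quantity a random match probability is built from.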

  20. Preliminary evidence that negative symptom severity relates to multilocus genetic profile for dopamine signaling capacity and D2 receptor binding in healthy controls and in schizophrenia.

    PubMed

    Eisenstein, Sarah A; Bogdan, Ryan; Chen, Ling; Moerlein, Stephen M; Black, Kevin J; Perlmutter, Joel S; Hershey, Tamara; Barch, Deanna M

    2017-03-01

    Deficits in central, subcortical dopamine (DA) signaling may underlie negative symptom severity, particularly anhedonia, in healthy individuals and in schizophrenia. To investigate these relationships, we assessed negative symptoms with the Schedule for the Assessment of Negative Symptoms and the Brief Negative Symptom Scale (BNSS) and self-reported anhedonia with the Scales for Physical and Social Anhedonia (SPSA), Temporal Experience of Pleasure Scale, and Snaith-Hamilton Pleasure Scale in 36 healthy controls (HC), 27 siblings (SIB) of individuals with schizophrenia, and 66 individuals with schizophrenia or schizoaffective disorder (SCZ). A subset of participants (N = 124) were genotyped for DA-related polymorphisms in genes for DRD4, DRD2/ANKK1, DAT1, and COMT, which were used to construct biologically-informed multi-locus genetic profile (MGP) scores reflective of subcortical dopaminergic signaling. DA receptor type 2 (D2R) binding was assessed among a second subset of participants (N = 23) using PET scans with the D2R-selective, non-displaceable radioligand (N-[11C]methyl)benperidol. Higher MGP scores, reflecting elevated subcortical dopaminergic signaling capacity, were associated with less negative symptom severity, as measured by the BNSS, across all participants. In addition, higher striatal D2R binding was associated with less physical and social anhedonia, as measured by the SPSA, across HC, SIB, and SCZ. The current preliminary findings support the hypothesis that subcortical DA function may contribute to negative symptom severity and self-reported anhedonia, independent of diagnostic status. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Preliminary Evidence that Negative Symptom Severity Relates to Multilocus Genetic Profile for Dopamine Signaling Capacity and D2 Receptor Binding in Healthy Controls and in Schizophrenia

    PubMed Central

    Eisenstein, Sarah A.; Bogdan, Ryan; Chen, Ling; Moerlein, Stephen M.; Black, Kevin J.; Perlmutter, Joel S.; Hershey, Tamara; Barch, Deanna M.

    2017-01-01

    Deficits in central, subcortical dopamine (DA) signaling may underlie negative symptom severity, particularly anhedonia, in healthy individuals and in schizophrenia. To investigate these relationships, we assessed negative symptoms with the Schedule for the Assessment of Negative Symptoms and the Brief Negative Symptom Scale (BNSS) and self-reported anhedonia with the Scales for Physical and Social Anhedonia (SPSA), Temporal Experience of Pleasure Scale, and Snaith-Hamilton Pleasure Scale in 36 healthy controls (HC), 27 siblings (SIB) of individuals with schizophrenia, and 66 individuals with schizophrenia or schizoaffective disorder (SCZ). A subset of participants (N = 124) were genotyped for DA-related polymorphisms in genes for DRD4, DRD2/ANKK1, DAT1, and COMT, which were used to construct biologically-informed multi-locus genetic profile (MGP) scores reflective of subcortical dopaminergic signaling. DA receptor type 2 (D2R) binding was assessed among a second subset of participants (N = 23) using PET scans with the D2R-selective, non-displaceable radioligand (N-[11C]methyl)benperidol. Higher MGP scores, reflecting elevated subcortical dopaminergic signaling capacity, were associated with less negative symptom severity, as measured by the BNSS, across all participants. In addition, higher striatal D2R binding was associated with less physical and social anhedonia, as measured by the SPSA, across HC, SIB, and SCZ. The current preliminary findings support the hypothesis that subcortical DA function may contribute to negative symptom severity and self-reported anhedonia, independent of diagnostic status. PMID:27886638

  2. Integration of Multilocus Genetic Risk into the Default Mode Network Longitudinal Trajectory during the Alzheimer's Disease Process.

    PubMed

    Su, Fan; Shu, Hao; Ye, Qing; Xie, Chunming; Yuan, Baoyu; Zhang, Zhijun; Bai, Feng

    2017-01-01

    The aim of the study was to investigate the cognitive significance of the changes in default mode network (DMN) during the process of Alzheimer's disease (AD) and the genetic basis that drives the alteration. Eighty-seven subjects with mild cognitive impairment (MCI) and 131 healthy controls (HC) were employed at baseline, and they had the genetic risk scores (GRS) based on the GWAS-validated AD-related top loci. Eleven MCIs who converted to AD (c-MCIs), 32 subjects who remained stable (nc-MCIs), and 56 HCs participated in the follow-up analyses after an average of 35 months. Decreased functional connectivity (FC) within temporal cortex was identified for MCIs at baseline, which was partially determined by the GRS; moreover, compensations may occur within the frontal-parietal brain to maintain relatively intact cognition. During the follow-ups, c-MCIs exhibited more FC declines within the prefrontal-parietal lobes and parahippocampal gyrus/hippocampus than the HCs and nc-MCIs. The GRS did not significantly vary among the three groups, whereas associations were identified at risky alleles and FC declines in all AD spectra. Interestingly, the influence of APOEɛ4 varied as the disease progressed; APOEɛ4 was associated with longitudinal FC decreases only for HCs in the single variance-based analyses and deteriorated DMN integration in nc-MCIs by combining the effects of other loci. However, the GRS without APOEɛ4 predicted FC decline for converters. It is suggested that the integration of multilocus genetic risk predicted the longitudinal trajectory of DMN and may be used as a clinical strategy to track AD progression.
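
    As a rough illustration of how a multilocus genetic risk score can be formed, the sketch below sums risk-allele counts weighted by the log odds ratio of each locus. This weighting is a common convention and an assumption here; the abstract does not say how its GRS was computed. The locus names and odds ratios are hypothetical placeholders, not the GWAS loci used in the study.

```python
import math
from typing import Mapping


def genetic_risk_score(risk_allele_counts: Mapping[str, int],
                       odds_ratios: Mapping[str, float]) -> float:
    """Weighted GRS: sum over loci of (risk-allele count) * ln(odds ratio)."""
    score = 0.0
    for locus, count in risk_allele_counts.items():
        if count not in (0, 1, 2):
            raise ValueError(f"{locus}: a diploid risk-allele count must be 0, 1 or 2")
        score += count * math.log(odds_ratios[locus])
    return score


# Hypothetical loci and per-allele odds ratios.
odds_ratios = {"locus_a": 2.0, "locus_b": 1.5, "locus_c": 1.0}
subject = {"locus_a": 1, "locus_b": 2, "locus_c": 2}
score = genetic_risk_score(subject, odds_ratios)
print(round(score, 4))
```

    A locus with an odds ratio of 1.0 contributes nothing to the score, however many copies of the allele a subject carries.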

  3. Veterinary Fusarioses within the United States

    PubMed Central

    Sutton, Deanna A.; Wiederhold, Nathan; Robert, Vincent A. R. G.; Crous, Pedro W.; Geiser, David M.

    2016-01-01

    Multilocus DNA sequence data were used to assess the genetic diversity and evolutionary relationships of 67 Fusarium strains from veterinary sources, most of which were from the United States. Molecular phylogenetic analyses revealed that the strains comprised 23 phylogenetically distinct species, all but two of which were previously known to infect humans, distributed among eight species complexes. The majority of the veterinary isolates (47/67 = 70.1%) were nested within the Fusarium solani species complex (FSSC), and these included 8 phylospecies and 33 unique 3-locus sequence types (STs). Three of the FSSC species (Fusarium falciforme, Fusarium keratoplasticum, and Fusarium sp. FSSC 12) accounted for four-fifths of the veterinary strains (38/47) and STs (27/33) within this clade. Most of the F. falciforme strains (12/15) were recovered from equine keratitis infections; however, strains of F. keratoplasticum and Fusarium sp. FSSC 12 were mostly (25/27) isolated from marine vertebrates and invertebrates. Our sampling suggests that the Fusarium incarnatum-equiseti species complex (FIESC), with eight mycoses-associated species, may represent the second most important clade of veterinary relevance within Fusarium. Six of the multilocus STs within the FSSC (3+4-eee, 1-b, 12-a, 12-b, 12-f, and 12-h) and one each within the FIESC (1-a) and the Fusarium oxysporum species complex (ST-33) were widespread geographically, including three STs with transoceanic disjunctions. In conclusion, fusaria associated with veterinary mycoses are phylogenetically diverse and typically can only be identified to the species level using DNA sequence data from portions of one or more informative genes. PMID:27605713

  4. The Effect and Relative Importance of Neutral Genetic Diversity for Predicting Parasitism Varies across Parasite Taxa

    PubMed Central

    Ruiz-López, María José; Monello, Ryan J.; Gompper, Matthew E.; Eggert, Lori S.

    2012-01-01

    Understanding factors that determine heterogeneity in levels of parasitism across individuals is a major challenge in disease ecology. It is known that genetic makeup plays an important role in infection likelihood, but the mechanism remains unclear as does its relative importance when compared to other factors. We analyzed relationships between genetic diversity and macroparasites in outbred, free-ranging populations of raccoons (Procyon lotor). We measured heterozygosity at 14 microsatellite loci and modeled the effects of both multi-locus and single-locus heterozygosity on parasitism using an information theoretic approach and including non-genetic factors that are known to influence the likelihood of parasitism. The association of genetic diversity and parasitism, as well as the relative importance of genetic diversity, differed by parasitic group. Endoparasite species richness was better predicted by a model that included genetic diversity, with the more heterozygous hosts harboring fewer endoparasite species. Genetic diversity was also important in predicting abundance of replete ticks (Dermacentor variabilis). This association fit a curvilinear trend, with hosts that had either high or low levels of heterozygosity harboring fewer parasites than those with intermediate levels. In contrast, genetic diversity was not important in predicting abundance of non-replete ticks and lice (Trichodectes octomaculatus). No strong single-locus effects were observed for either endoparasites or replete ticks. Our results suggest that in outbred populations multi-locus diversity might be important for coping with parasitism. The differences in the relationships between heterozygosity and parasitism for the different parasites suggest that the role of genetic diversity varies with parasite-mediated selective pressures. PMID:23049796
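
    Multi-locus heterozygosity, as used in the abstract above, can be sketched as the proportion of typed loci at which an individual carries two different alleles. This is one simple estimator among several, and the abstract does not specify which one the authors used; the allele sizes below are hypothetical.

```python
from typing import Optional, Sequence, Tuple

# Allele sizes at one microsatellite locus, or None if the locus was not typed.
Genotype = Optional[Tuple[int, int]]


def multilocus_heterozygosity(genotypes: Sequence[Genotype]) -> float:
    """Proportion of typed loci at which the two alleles differ."""
    typed = [g for g in genotypes if g is not None]
    if not typed:
        raise ValueError("no typed loci")
    heterozygous = sum(1 for a, b in typed if a != b)
    return heterozygous / len(typed)


# Hypothetical individual typed at 4 of 5 loci and heterozygous at 3 of them.
example = [(120, 124), (98, 98), None, (201, 205), (150, 152)]
print(multilocus_heterozygosity(example))  # 0.75
```

    Single-locus heterozygosity is the same comparison applied to one locus at a time, which is why the two can be entered as competing predictors in the same model set.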

  5. Veterinary Fusarioses within the United States.

    PubMed

    O'Donnell, Kerry; Sutton, Deanna A; Wiederhold, Nathan; Robert, Vincent A R G; Crous, Pedro W; Geiser, David M

    2016-11-01

    Multilocus DNA sequence data were used to assess the genetic diversity and evolutionary relationships of 67 Fusarium strains from veterinary sources, most of which were from the United States. Molecular phylogenetic analyses revealed that the strains comprised 23 phylogenetically distinct species, all but two of which were previously known to infect humans, distributed among eight species complexes. The majority of the veterinary isolates (47/67 = 70.1%) were nested within the Fusarium solani species complex (FSSC), and these included 8 phylospecies and 33 unique 3-locus sequence types (STs). Three of the FSSC species (Fusarium falciforme, Fusarium keratoplasticum, and Fusarium sp. FSSC 12) accounted for four-fifths of the veterinary strains (38/47) and STs (27/33) within this clade. Most of the F. falciforme strains (12/15) were recovered from equine keratitis infections; however, strains of F. keratoplasticum and Fusarium sp. FSSC 12 were mostly (25/27) isolated from marine vertebrates and invertebrates. Our sampling suggests that the Fusarium incarnatum-equiseti species complex (FIESC), with eight mycoses-associated species, may represent the second most important clade of veterinary relevance within Fusarium. Six of the multilocus STs within the FSSC (3+4-eee, 1-b, 12-a, 12-b, 12-f, and 12-h) and one each within the FIESC (1-a) and the Fusarium oxysporum species complex (ST-33) were widespread geographically, including three STs with transoceanic disjunctions. In conclusion, fusaria associated with veterinary mycoses are phylogenetically diverse and typically can only be identified to the species level using DNA sequence data from portions of one or more informative genes. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  6. Multilocus sequence typing (MLST) analysis of Propionibacterium acnes isolates from radical prostatectomy specimens.

    PubMed

    Mak, Tim N; Yu, Shu-Han; De Marzo, Angelo M; Brüggemann, Holger; Sfanos, Karen S

    2013-05-01

    Inflammation is commonly observed in radical prostatectomy specimens, and evidence suggests that inflammation may contribute to prostate carcinogenesis. Multiple microorganisms have been implicated in serving as a stimulus for prostatic inflammation. The pro-inflammatory anaerobe, Propionibacterium acnes, is ubiquitously found on human skin and is associated with the skin disease acne vulgaris. Recent studies have shown that P. acnes can be detected in prostatectomy specimens by bacterial culture or by culture-independent molecular techniques. Radical prostatectomy tissue samples were obtained from 30 prostate cancer patients and subject to both aerobic and anaerobic culture. Cultured species were identified by 16S rDNA gene sequencing. Propionibacterium acnes isolates were typed using multilocus sequence typing (MLST). Our study confirmed that P. acnes can be readily cultured from prostatectomy tissues (7 of 30 cases, 23%). In some cases, multiple isolates of P. acnes were cultured as well as other Propionibacterium species, such as P. granulosum and P. avidum. Overall, 9 of 30 cases (30%) were positive for Propionibacterium spp. MLST analyses identified eight different sequence types (STs) among prostate-derived P. acnes isolates. These STs belong to two clonal complexes, namely CC36 (type I-2) and CC53/60 (type II), or are CC53/60-related singletons. MLST typing results indicated that prostate-derived P. acnes isolates do not fall within the typical skin/acne STs, but rather are characteristic of STs associated with opportunistic infections and/or urethral flora. The MLST typing results argue against the likelihood that prostatectomy-derived P. acnes isolates represent contamination from skin flora. Copyright © 2012 Wiley Periodicals, Inc.
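
    For readers unfamiliar with MLST, the sketch below shows the basic lookup: each isolate is reduced to an allelic profile (one allele number per locus), and the profile is matched against a table of known sequence types. The locus names, allele numbers and ST numbers are hypothetical and do not reproduce the P. acnes scheme used in the study.

```python
from typing import Dict, Optional, Tuple

# Hypothetical three-locus scheme; real MLST schemes usually type more loci.
LOCI: Tuple[str, ...] = ("locus1", "locus2", "locus3")

# Hypothetical table mapping allelic profiles to sequence types (STs).
ST_TABLE: Dict[Tuple[int, ...], int] = {
    (1, 1, 1): 1,
    (1, 2, 1): 2,
    (3, 2, 4): 3,
}


def sequence_type(alleles: Dict[str, int]) -> Optional[int]:
    """Return the ST for an isolate's allelic profile, or None if the
    combination of alleles is not in the table (a candidate novel ST)."""
    profile = tuple(alleles[locus] for locus in LOCI)
    return ST_TABLE.get(profile)


print(sequence_type({"locus1": 1, "locus2": 2, "locus3": 1}))  # 2
print(sequence_type({"locus1": 3, "locus2": 1, "locus3": 1}))  # None
```

    Grouping STs into clonal complexes, as the abstract does, is a further step that compares profiles with one another and is not shown here.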

  7. Multi-locus tree and species tree approaches toward resolving a complex clade of downy mildews (Straminipila, Oomycota), including pathogens of beet and spinach.

    PubMed

    Choi, Young-Joon; Klosterman, Steven J; Kummer, Volker; Voglmayr, Hermann; Shin, Hyeon-Dong; Thines, Marco

    2015-05-01

    Accurate species determination of plant pathogens is a prerequisite for their control and quarantine, and further for assessing their potential threat to crops. The family Peronosporaceae (Straminipila; Oomycota) consists of obligate biotrophic pathogens that cause downy mildew disease on angiosperms, including a large number of cultivated plants. In the largest downy mildew genus Peronospora, a phylogenetically complex clade includes the economically important downy mildew pathogens of spinach and beet, as well as the type species of the genus Peronospora. To resolve this complex clade at the species level and to infer evolutionary relationships among them, we used multi-locus phylogenetic analysis and species tree estimation. Both approaches discriminated all nine currently accepted species and revealed four previously unrecognized lineages, which are specific to a host genus or species. This is in line with a narrow species concept, i.e. that a downy mildew species is associated with only a particular host plant genus or species. Instead of applying the dubious name Peronospora farinosa, which has been proposed for formal rejection, our results provide strong evidence that Peronospora schachtii is an independent species from lineages on Atriplex and apparently occurs exclusively on Beta vulgaris. The members of the clade investigated, the Peronospora rumicis clade, associate with three different host plant families, Amaranthaceae, Caryophyllaceae, and Polygonaceae, suggesting that they may have speciated following at least two recent inter-family host shifts, rather than contemporary cospeciation with the host plants. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. Finite‐fault Bayesian inversion of teleseismic body waves

    USGS Publications Warehouse

    Clayton, Brandon; Hartzell, Stephen; Moschetti, Morgan P.; Minson, Sarah E.

    2017-01-01

    Inverting geophysical data has provided fundamental information about the behavior of earthquake rupture. However, inferring kinematic source model parameters for finite‐fault ruptures is an intrinsically underdetermined problem (the problem of nonuniqueness), because we are restricted to finite noisy observations. Although many studies use least‐squares techniques to make the finite‐fault problem tractable, these methods generally lack the ability to apply non‐Gaussian error analysis and the imposition of nonlinear constraints. However, the Bayesian approach can be employed to find a Gaussian or non‐Gaussian distribution of all probable model parameters, while utilizing nonlinear constraints. We present case studies to quantify the resolving power and associated uncertainties using only teleseismic body waves in a Bayesian framework to infer the slip history for a synthetic case and two earthquakes: the 2011 Mw 7.1 Van, east Turkey, earthquake and the 2010 Mw 7.2 El Mayor–Cucapah, Baja California, earthquake. In implementing the Bayesian method, we further present two distinct solutions to investigate the uncertainties by performing the inversion with and without velocity structure perturbations. We find that the posterior ensemble becomes broader when including velocity structure variability and introduces a spatial smearing of slip. Using the Bayesian framework solely on teleseismic body waves, we find rake is poorly constrained by the observations and rise time is poorly resolved when slip amplitude is low.

  9. The long-term evolution of multilocus traits under frequency-dependent disruptive selection.

    PubMed

    van Doorn, G Sander; Dieckmann, Ulf

    2006-11-01

    Frequency-dependent disruptive selection is widely recognized as an important source of genetic variation. Its evolutionary consequences have been extensively studied using phenotypic evolutionary models, based on quantitative genetics, game theory, or adaptive dynamics. However, the genetic assumptions underlying these approaches are highly idealized and, even worse, predict different consequences of frequency-dependent disruptive selection. Population genetic models, by contrast, enable genotypic evolutionary models, but traditionally assume constant fitness values. Only a minority of these models thus addresses frequency-dependent selection, and only a few of these do so in a multilocus context. An inherent limitation of these remaining studies is that they only investigate the short-term maintenance of genetic variation. Consequently, the long-term evolution of multilocus characters under frequency-dependent disruptive selection remains poorly understood. We aim to bridge this gap between phenotypic and genotypic models by studying a multilocus version of Levene's soft-selection model. Individual-based simulations and deterministic approximations based on adaptive dynamics theory provide insights into the underlying evolutionary dynamics. Our analysis uncovers a general pattern of polymorphism formation and collapse, likely to apply to a wide variety of genetic systems: after convergence to a fitness minimum and the subsequent establishment of genetic polymorphism at multiple loci, genetic variation becomes increasingly concentrated on a few loci, until eventually only a single polymorphic locus remains. This evolutionary process combines features observed in quantitative genetics and adaptive dynamics models, and it can be explained as a consequence of changes in the selection regime that are inherent to frequency-dependent disruptive selection. Our findings demonstrate that the potential of frequency-dependent disruptive selection to maintain polygenic variation is considerably smaller than previously expected.

  10. [Standard algorithm of molecular typing of Yersinia pestis strains].

    PubMed

    Eroshenko, G A; Odinokov, G N; Kukleva, L M; Pavlova, A I; Krasnov, Ia M; Shavina, N Iu; Guseva, N P; Vinogradova, N A; Kutyrev, V V

    2012-01-01

    Aim: to develop a standard algorithm for molecular typing of Yersinia pestis that establishes the subspecies, biovar and focus membership of a studied isolate, and to determine the characteristic genotypes of strains of the plague agent of the main and non-main subspecies from various natural plague foci of the Russian Federation and neighboring countries. Genotyping of 192 natural Y. pestis strains of the main and non-main subspecies was performed using PCR methods, multilocus sequencing and multilocus analysis of variable tandem repeat number. A standard algorithm for molecular typing of the plague agent was developed; it includes several stages of Y. pestis differentiation by membership in the main or non-main subspecies, in the various biovars of the main subspecies, in specific subspecies, and in natural foci and geographic territories. The algorithm is based on 3 typing methods (PCR, multilocus sequence typing and multilocus analysis of variable tandem repeat number) using standard DNA targets: housekeeping genes (terC, ilvN, inv, glpD, napA, rhaS and araC) and 7 loci of variable tandem repeats (ms01, ms04, ms06, ms07, ms46, ms62, ms70). The effectiveness of the developed algorithm was shown on a large number of natural Y. pestis strains. Characteristic sequence types of Y. pestis strains of various subspecies and biovars, as well as MLVA7 genotypes of strains from natural plague foci of the Russian Federation and neighboring countries, were established. Application of the developed algorithm will increase the effectiveness of epidemiologic monitoring of the plague agent and of the analysis of plague epidemics and outbreaks, including establishing the source of origin of a strain and the routes of introduction of the infection.

  11. Maximum a posteriori Bayesian estimation of mycophenolic Acid area under the concentration-time curve: is this clinically useful for dosage prediction yet?

    PubMed

    Staatz, Christine E; Tett, Susan E

    2011-12-01

    This review seeks to summarize the available data about Bayesian estimation of area under the plasma concentration-time curve (AUC) and dosage prediction for mycophenolic acid (MPA) and evaluate whether sufficient evidence is available for routine use of Bayesian dosage prediction in clinical practice. A literature search identified 14 studies that assessed the predictive performance of maximum a posteriori Bayesian estimation of MPA AUC and one report that retrospectively evaluated how closely dosage recommendations based on Bayesian forecasting achieved targeted MPA exposure. Studies to date have mostly been undertaken in renal transplant recipients, with limited investigation in patients treated with MPA for autoimmune disease or haematopoietic stem cell transplantation. All of these studies have involved use of the mycophenolate mofetil (MMF) formulation of MPA, rather than the enteric-coated mycophenolate sodium (EC-MPS) formulation. Bias associated with estimation of MPA AUC using Bayesian forecasting was generally less than 10%. However, some difficulties with imprecision were evident, with values ranging from 4% to 34% (based on estimation involving two or more concentration measurements). Evaluation of whether MPA dosing decisions based on Bayesian forecasting (by the free website service https://pharmaco.chu-limoges.fr) achieved target drug exposure has only been undertaken once. When MMF dosage recommendations were applied by clinicians, a higher proportion (72-80%) of subsequent estimated MPA AUC values were within the 30-60 mg · h/L target range, compared with when dosage recommendations were not followed (only 39-57% within target range). Such findings provide evidence that Bayesian dosage prediction is clinically useful for achieving target MPA AUC. This study, however, was retrospective and focussed only on adult renal transplant recipients. Furthermore, in this study, Bayesian-generated AUC estimations and dosage predictions were not compared with a later full measured AUC but rather with a further AUC estimate based on a second Bayesian analysis. This study also provided some evidence that a useful monitoring schedule for MPA AUC following adult renal transplant would be every 2 weeks during the first month post-transplant, every 1-3 months between months 1 and 12, and each year thereafter. It will be interesting to see further validations in different patient groups using the free website service. In summary, the predictive performance of Bayesian estimation of MPA, comparing estimated with measured AUC values, has been reported in several studies. However, the next step of predicting dosages based on these Bayesian-estimated AUCs, and prospectively determining how closely these predicted dosages give drug exposure matching targeted AUCs, remains largely unaddressed. Further prospective studies are required, particularly in non-renal transplant patients and with the EC-MPS formulation. Other important questions remain to be answered, such as: do Bayesian forecasting methods devised to date use the best population pharmacokinetic models or most accurate algorithms; are the methods simple to use for routine clinical practice; do the algorithms actually improve dosage estimations beyond empirical recommendations in all groups that receive MPA therapy; and, importantly, do the dosage predictions, when followed, improve patient health outcomes?

  12. [Evaluation of estimation of prevalence ratio using bayesian log-binomial regression model].

    PubMed

    Gao, W L; Lin, H; Liu, X N; Ren, X W; Li, J S; Shen, X P; Zhu, S L

    2017-03-10

    To evaluate the estimation of the prevalence ratio (PR) by a Bayesian log-binomial regression model and its application, we estimated the PR of medical care-seeking prevalence associated with caregivers' recognition of risk signs of diarrhea in their infants, using a Bayesian log-binomial regression model in OpenBUGS. The results showed that caregivers' recognition of their infants' risk signs of diarrhea was significantly associated with a 13% increase in medical care-seeking. We also compared the point and interval estimates of this PR, and model convergence, between the Bayesian and conventional log-binomial regression models for three models (model 1: not adjusting for covariates; model 2: adjusting for duration of caregivers' education; model 3: additionally adjusting for distance between village and township and child age in months). All three Bayesian log-binomial regression models converged, and the estimated PRs were 1.130 (95% CI: 1.005-1.265), 1.128 (95% CI: 1.001-1.264) and 1.132 (95% CI: 1.004-1.267), respectively. Conventional log-binomial regression models 1 and 2 converged, with PRs of 1.130 (95% CI: 1.055-1.206) and 1.126 (95% CI: 1.051-1.203), respectively, but model 3 failed to converge, so the COPY method was used to estimate the PR, which was 1.125 (95% CI: 1.051-1.200). The point and interval estimates of the PRs from the three Bayesian log-binomial regression models differed slightly from those of the conventional log-binomial regression models, but the two approaches were consistent in estimating the PR. Therefore, the Bayesian log-binomial regression model can effectively estimate the PR with fewer convergence failures and has advantages in application over the conventional log-binomial regression model.
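
    As a rough illustration of the quantity being estimated in this record, the sketch below computes an unadjusted prevalence ratio and a Wald interval on the log scale from a 2x2 table. For a single binary covariate with no adjustment, the maximum-likelihood log-binomial estimate of the PR coincides with this simple ratio of prevalences. The function name and the counts are hypothetical and are not taken from the study; the Bayesian OpenBUGS model described in the abstract is not reproduced here.

```python
import math


def prevalence_ratio(exposed_cases, exposed_total,
                     unexposed_cases, unexposed_total, z=1.96):
    """Return (PR, lower, upper) using a Wald interval on the log scale."""
    p_exposed = exposed_cases / exposed_total
    p_unexposed = unexposed_cases / unexposed_total
    pr = p_exposed / p_unexposed
    # Standard error of log(PR) for two independent binomial samples.
    se_log = math.sqrt(1 / exposed_cases - 1 / exposed_total
                       + 1 / unexposed_cases - 1 / unexposed_total)
    lower = math.exp(math.log(pr) - z * se_log)
    upper = math.exp(math.log(pr) + z * se_log)
    return pr, lower, upper


# Hypothetical counts for illustration only (not the study's data).
pr, lower, upper = prevalence_ratio(30, 100, 20, 100)
print(f"PR = {pr:.3f}, 95% CI: {lower:.3f}-{upper:.3f}")
```

    Adjusted models such as models 2 and 3 in the abstract need a regression fit; the closed form above applies only to the unadjusted case.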

  13. Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases.

    PubMed

    Mezlini, Aziz M; Goldenberg, Anna

    2017-10-01

    Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease-associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic model for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios.

  14. BAYESIAN LARGE-SCALE MULTIPLE REGRESSION WITH SUMMARY STATISTICS FROM GENOME-WIDE ASSOCIATION STUDIES

    PubMed Central

    Zhu, Xiang; Stephens, Matthew

    2017-01-01

    Bayesian methods for large-scale multiple regression provide attractive approaches to the analysis of genome-wide association studies (GWAS). For example, they can estimate heritability of complex traits, allowing for both polygenic and sparse models; and by incorporating external genomic data into the priors, they can increase power and yield new biological insights. However, these methods require access to individual genotypes and phenotypes, which are often not easily available. Here we provide a framework for performing these analyses without individual-level data. Specifically, we introduce a “Regression with Summary Statistics” (RSS) likelihood, which relates the multiple regression coefficients to univariate regression results that are often easily available. The RSS likelihood requires estimates of correlations among covariates (SNPs), which also can be obtained from public databases. We perform Bayesian multiple regression analysis by combining the RSS likelihood with previously proposed prior distributions, sampling posteriors by Markov chain Monte Carlo. In a wide range of simulations RSS performs similarly to analyses using the individual data, both for estimating heritability and detecting associations. We apply RSS to a GWAS of human height that contains 253,288 individuals typed at 1.06 million SNPs, for which analyses of individual-level data are practically impossible. Estimates of heritability (52%) are consistent with, but more precise than, previous results using subsets of these data. We also identify many previously unreported loci that show evidence for association with height in our analyses. Software is available at https://github.com/stephenslab/rss. PMID:29399241
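
    The abstract does not write out the RSS likelihood. As a sketch only, the form usually quoted for it is given below, with notation that is an assumption of this note rather than something stated in the record: \hat{\beta} is the vector of single-SNP (univariate) effect estimates, \hat{S} is the diagonal matrix of their standard errors, \hat{R} is the estimated SNP correlation (LD) matrix, and \beta is the vector of multiple regression coefficients. Readers should check the paper for the exact definitions and conditions.

```latex
\hat{\beta} \mid \beta \;\sim\;
\mathcal{N}\!\left( \hat{S}\,\hat{R}\,\hat{S}^{-1}\beta,\;\; \hat{S}\,\hat{R}\,\hat{S} \right),
\qquad
\hat{S} = \operatorname{diag}(\hat{s}_1, \dots, \hat{s}_p)
```

    Under this form, only the summary statistics (\hat{\beta}, \hat{S}) and an external estimate of \hat{R} are needed, which matches the abstract's point that individual-level genotypes and phenotypes are not required.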

  15. GLASS 2.0: An Operational, Multimodal, Bayesian Earthquake Data Association Engine

    NASA Astrophysics Data System (ADS)

    Benz, H.; Johnson, C. E.; Patton, J. M.; McMahon, N. D.; Earle, P. S.

    2015-12-01

    The legacy approach to automated detection and determination of hypocenters is arrival time stacking algorithms. Examples of such algorithms are the associator, Binder, which has been in continuous use in many USGS-supported regional seismic networks since the 1980s and the spherical earth successor, GLASS 1.0, currently in service at the USGS National Earthquake Information Center for over 10 years. The principal shortcomings of the legacy approach are 1) it can only use phase arrival times, 2) it does not adequately address the problems of extreme variations in station density worldwide, 3) it cannot incorporate multiple phase models or statistical attributes of phases with distance, and 4) it cannot incorporate noise model attributes of individual stations. Previously we introduced a theoretical framework of a new associator using a Bayesian kernel stacking approach to approximate a joint probability density function for hypocenter localization. More recently we added station- and phase-specific Bayesian constraints to the association process. GLASS 2.0 incorporates a multiplicity of earthquake-related data including phase arrival times, back-azimuth and slowness information from array beamforming, arrival times from waveform cross correlation processing, and geographic constraints from real-time social media reports of ground shaking. We demonstrate its application by modeling an aftershock sequence using dozens of stations that recorded tens of thousands of earthquakes over a period of one month. We also demonstrate GLASS 2.0 performance regionally and teleseismically using the globally distributed real-time monitoring system at NEIC.

  16. Genomic insights from whole genome sequencing of four clonal outbreak Campylobacter jejuni assessed within the global C. jejuni population.

    PubMed

    Clark, Clifford G; Berry, Chrystal; Walker, Matthew; Petkau, Aaron; Barker, Dillon O R; Guan, Cai; Reimer, Aleisha; Taboada, Eduardo N

    2016-12-03

    Whole genome sequencing (WGS) is useful for determining clusters of human cases, investigating outbreaks, and defining the population genetics of bacteria. It also provides information about other aspects of bacterial biology, including classical typing results, virulence, and adaptive strategies of the organism. Cell culture invasion and protein expression patterns of four related multilocus sequence type 21 (ST21) C. jejuni isolates from a significant Canadian water-borne outbreak were previously associated with the presence of a CJIE1 prophage. Whole genome sequencing was used to examine the genetic diversity among these isolates and confirm that previous observations could be attributed to differential prophage carriage. Moreover, we sought to determine the presence of genome sequences that could be used as surrogate markers to delineate outbreak-associated isolates. Differential carriage of the CJIE1 prophage was identified as the major genetic difference among the four outbreak isolates. High quality single-nucleotide variant (hqSNV) and core genome multilocus sequence typing (cgMLST) clustered these isolates within expanded datasets consisting of additional C. jejuni strains. The number and location of homopolymeric tract regions was identical in all four outbreak isolates but differed from all other C. jejuni examined. Comparative genomics and PCR amplification enabled the identification of large chromosomal inversions of approximately 93 kb and 388 kb within the outbreak isolates associated with transducer-like proteins containing long nucleotide repeat sequences. The 93-kb inversion was characteristic of the outbreak-associated isolates, and the gene content of this inverted region displayed high synteny with the reference strain. 
The four outbreak isolates were clonally derived and differed mainly in the presence of the CJIE1 prophage, validating earlier findings linking the prophage to phenotypic differences in virulence assays and protein expression. The identification of large, genetically syntenous chromosomal inversions in the genomes of outbreak-associated isolates provided a unique method for discriminating outbreak isolates from the background population. Transducer-like proteins appear to be associated with the chromosomal inversions. CgMLST and hqSNV analysis also effectively delineated the outbreak isolates within the larger C. jejuni population structure.

  17. A Defence of the AR4’s Bayesian Approach to Quantifying Uncertainty

    NASA Astrophysics Data System (ADS)

    Vezer, M. A.

    2009-12-01

    The field of climate change research is a kimberlite pipe filled with philosophic diamonds waiting to be mined and analyzed by philosophers. Within the scientific literature on climate change, there is much philosophical dialogue regarding the methods and implications of climate studies. To this date, however, discourse regarding the philosophy of climate science has been confined predominately to scientific - rather than philosophical - investigations. In this paper, I hope to bring one such issue to the surface for explicit philosophical analysis: The purpose of this paper is to address a philosophical debate pertaining to the expressions of uncertainty in the Intergovernmental Panel on Climate Change (IPCC) Fourth Assessment Report (AR4), which, as will be noted, has received significant attention in scientific journals and books, as well as sporadic glances from the popular press. My thesis is that the AR4’s Bayesian method of uncertainty analysis and uncertainty expression is justifiable on pragmatic grounds: it overcomes problems associated with vagueness, thereby facilitating communication between scientists and policy makers such that the latter can formulate decision analyses in response to the views of the former. Further, I argue that the most pronounced criticisms against the AR4’s Bayesian approach, which are outlined below, are misguided. §1 Introduction Central to AR4 is a list of terms related to uncertainty that in colloquial conversations would be considered vague. The IPCC attempts to reduce the vagueness of its expressions of uncertainty by calibrating uncertainty terms with numerical probability values derived from a subjective Bayesian methodology. This style of analysis and expression has stimulated some controversy, as critics reject as inappropriate and even misleading the association of uncertainty terms with Bayesian probabilities. [...] The format of the paper is as follows. 
The investigation begins (§2) with an explanation of background considerations relevant to the IPCC and its use of uncertainty expressions. It then (§3) outlines some general philosophical worries regarding vague expressions and (§4) relates those worries to the AR4 and its method of dealing with them, which is a subjective Bayesian probability analysis. The next phase of the paper (§5) examines the notions of ‘objective’ and ‘subjective’ probability interpretations and compares the IPCC’s subjective Bayesian strategy with a frequentist approach. It then (§6) addresses objections to that methodology, and concludes (§7) that those objections are wrongheaded.

  18. Estimating the extent and distribution of new-onset adult asthma in British Columbia using frequentist and Bayesian approaches.

    PubMed

    Beach, Jeremy; Burstyn, Igor; Cherry, Nicola

    2012-07-01

    We previously described a method to identify the incidence of new-onset adult asthma (NOAA) in Alberta by industry and occupation, utilizing Workers' Compensation Board (WCB) and physician billing data. The aim of this study was to extend this method to data from British Columbia (BC) so as to compare the two provinces and to incorporate Bayesian methodology into estimates of risk. WCB claims for any reason 1995-2004 were linked to physician billing data. NOAA was defined as a billing for asthma (ICD-9 493) in the 12 months before a WCB claim without asthma in the previous 3 years. Incidence was calculated by occupation and industry. In a matched case-referent analysis, associations with exposures were examined using an asthma-specific job exposure matrix (JEM). Posterior distributions from the Alberta analysis and estimated misclassification parameters were used as priors in the Bayesian analysis of the BC data. Among 1 118 239 eligible WCB claims the incidence of NOAA was 1.4%. Sixteen occupations and 44 industries had a significantly increased risk; six industries had a decreased risk. The JEM identified wood dust [odds ratio (OR) 1.55, 95% confidence interval (CI) 1.08-2.24] and animal antigens (OR 1.66, 95% CI 1.17-2.36) as related to an increased risk of NOAA. Exposure to isocyanates was associated with decreased risk (OR 0.57, 95% CI 0.39-0.85). Bayesian analyses taking account of exposure misclassification and informative priors resulted in posterior distributions of ORs with lower boundary of 95% credible intervals >1.00 for almost all exposures. The distribution of NOAA in BC appeared somewhat similar to that in Alberta, except for isocyanates. Bayesian analyses allowed incorporation of prior evidence into risk estimates, permitting reconsideration of the apparently protective effect of isocyanate exposure.

  19. The BANYAN-Sigma Bayesian classifier and the search for isolated planetary-mass objects

    NASA Astrophysics Data System (ADS)

    Gagné, Jonathan

    2018-01-01

    I will present new developments in the construction of a Bayesian classification tool to identify members of 22 young associations within 150 pc from partially complete kinematic data sets such as Gaia-DR1 and DR2. The new BANYAN-Sigma tool makes it possible to quickly analyze massive data sets and yields a better classification performance than all its predecessors. It will open the door to large-scale surveys to complete the stellar and substellar populations of nearby associations, which will provide deep insights in the low-mass end of the initial mass function and valuable age-calibrated targets for exoplanet surveys. I will also present preliminary results of a search for T-type isolated planetary-mass objects in these young associations, based on BANYAN-Sigma and a cross-match between the AllWISE and 2MASS-Reject catalogs.

  20. Exploring the inequality-mortality relationship in the US with Bayesian spatial modeling

    PubMed Central

    Yang, Tse-Chuan; Jensen, Leif

    2014-01-01

    While there is evidence to suggest that socioeconomic inequality within places is associated with mortality rates among people living within them, the empirical connection between the two remains unsettled as potential confounders associated with racial and social structure are overlooked. This study seeks to test this relationship, to determine whether it is due to differential levels of deprivation and social capital, and does so with intrinsically conditional autoregressive Bayesian spatial modeling that effectively addresses the bias introduced by spatial dependence. We find that deprivation and social capital partly but not completely account for why inequality is positively associated with mortality and that spatial modeling generates more accurate predictions than does the traditional approach. We advance the literature by unveiling the intervening roles of social capital and deprivation in the inequality-mortality relationship and offering new evidence that inequality matters in US county mortality rates. PMID:26166920

  1. Bayesian analysis of physiologically based toxicokinetic and toxicodynamic models.

    PubMed

    Hack, C Eric

    2006-04-17

    Physiologically based toxicokinetic (PBTK) and toxicodynamic (TD) models of bromate in animals and humans would improve our ability to accurately estimate the toxic doses in humans based on available animal studies. These mathematical models are often highly parameterized and must be calibrated in order for the model predictions of internal dose to adequately fit the experimentally measured doses. Highly parameterized models are difficult to calibrate and it is difficult to obtain accurate estimates of uncertainty or variability in model parameters with commonly used frequentist calibration methods, such as maximum likelihood estimation (MLE) or least squared error approaches. The Bayesian approach called Markov chain Monte Carlo (MCMC) analysis can be used to successfully calibrate these complex models. Prior knowledge about the biological system and associated model parameters is easily incorporated in this approach in the form of prior parameter distributions, and the distributions are refined or updated using experimental data to generate posterior distributions of parameter estimates. The goal of this paper is to give the non-mathematician a brief description of the Bayesian approach and Markov chain Monte Carlo analysis, how this technique is used in risk assessment, and the issues associated with this approach.
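
    To make the prior-to-posterior updating described in this record concrete, here is a minimal random-walk Metropolis sketch, one of the simplest MCMC algorithms. It is not the PBTK/TD calibration from the paper: the one-parameter Gaussian model, the data values, the prior width, and the function names are all assumptions chosen so the result can be checked against the known analytic posterior mean.

```python
import math
import random


def log_posterior(theta, data, sigma=1.0, prior_sd=10.0):
    """Unnormalised log posterior: Normal(0, prior_sd) prior, Normal(theta, sigma) likelihood."""
    log_prior = -0.5 * (theta / prior_sd) ** 2
    log_lik = sum(-0.5 * ((y - theta) / sigma) ** 2 for y in data)
    return log_prior + log_lik


def metropolis(data, n_iter=20000, step=0.8, seed=1):
    """Random-walk Metropolis sampler for a single parameter."""
    rng = random.Random(seed)
    theta = 0.0
    current = log_posterior(theta, data)
    samples = []
    for _ in range(n_iter):
        proposal = theta + rng.gauss(0.0, step)
        candidate = log_posterior(proposal, data)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, candidate - current)):
            theta, current = proposal, candidate
        samples.append(theta)
    return samples


# Hypothetical measurements for illustration only.
data = [1.2, 0.8, 1.5, 0.9, 1.1]
samples = metropolis(data)[2000:]  # discard burn-in
posterior_mean = sum(samples) / len(samples)
analytic_mean = sum(data) / (len(data) + 1.0 / 10.0 ** 2)
print(f"MCMC mean {posterior_mean:.3f}, analytic mean {analytic_mean:.3f}")
```

    In a real calibration the log-likelihood would compare model-predicted internal doses with measured ones, and there would be many parameters; the accept/reject step stays the same.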

  2. Bayesian analysis of rare events

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Straub, Daniel, E-mail: straub@tum.de; Papaioannou, Iason; Betz, Wolfgang

    2016-06-01

    In many areas of engineering and science there is an interest in predicting the probability of rare events, in particular in applications related to safety and security. Increasingly, such predictions are made through computer models of physical systems in an uncertainty quantification framework. Additionally, with advances in IT, monitoring and sensor technology, an increasing amount of data on the performance of the systems is collected. This data can be used to reduce uncertainty, improve the probability estimates and consequently enhance the management of rare events and associated risks. Bayesian analysis is the ideal method to include the data into the probabilistic model. It ensures a consistent probabilistic treatment of uncertainty, which is central in the prediction of rare events, where extrapolation from the domain of observation is common. We present a framework for performing Bayesian updating of rare event probabilities, termed BUS. It is based on a reinterpretation of the classical rejection-sampling approach to Bayesian analysis, which enables the use of established methods for estimating probabilities of rare events. By drawing upon these methods, the framework makes use of their computational efficiency. These methods include the First-Order Reliability Method (FORM), tailored importance sampling (IS) methods and Subset Simulation (SuS). In this contribution, we briefly review these methods in the context of the BUS framework and investigate their applicability to Bayesian analysis of rare events in different settings. We find that, for some applications, FORM can be highly efficient and is surprisingly accurate, enabling Bayesian analysis of rare events with just a few model evaluations. In a general setting, BUS implemented through IS and SuS is more robust and flexible.
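
    The classical rejection-sampling view of Bayesian updating that this record builds on can be sketched as follows: draw a parameter from the prior, draw a uniform number, and keep the parameter when the uniform number does not exceed the scaled likelihood. The accepted draws then follow the posterior. The toy model below (standard normal prior, one observation with unit noise, an event defined as the parameter exceeding 2) is an assumption made for illustration; it uses plain Monte Carlo, not FORM, IS or SuS, and is not the authors' implementation.

```python
import math
import random


def rejection_update(n_draws=20000, y_obs=1.0, noise_sd=1.0, seed=0):
    """Return prior draws accepted under the scaled likelihood (posterior samples)."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_draws):
        theta = rng.gauss(0.0, 1.0)  # draw from the prior
        # Gaussian likelihood scaled so its maximum is 1 (scaling constant c = 1).
        scaled_likelihood = math.exp(-0.5 * ((y_obs - theta) / noise_sd) ** 2)
        if rng.random() <= scaled_likelihood:
            accepted.append(theta)
    return accepted


accepted = rejection_update()
posterior_mean = sum(accepted) / len(accepted)
# Updated probability of a hypothetical event {theta > 2} given the observation.
p_event = sum(t > 2.0 for t in accepted) / len(accepted)
print(f"accepted {len(accepted)} draws, mean {posterior_mean:.3f}, P(event) {p_event:.4f}")
```

    For this conjugate toy case the exact posterior is normal with mean 0.5 and variance 0.5, so the output can be sanity-checked. When the acceptance event or the failure event is genuinely rare, plain sampling like this becomes inefficient, which is where the rare-event methods named in the abstract come in.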

  3. Bayesian state space models for dynamic genetic network construction across multiple tissues.

    PubMed

    Liang, Yulan; Kelemen, Arpad

    2016-08-01

    Construction of gene-gene interaction networks and potential pathways is a challenging and important problem in genomic research for complex diseases while estimating the dynamic changes of the temporal correlations and non-stationarity are the keys in this process. In this paper, we develop dynamic state space models with hierarchical Bayesian settings to tackle this challenge for inferring the dynamic profiles and genetic networks associated with disease treatments. We treat both the stochastic transition matrix and the observation matrix time-variant and include temporal correlation structures in the covariance matrix estimations in the multivariate Bayesian state space models. The unevenly spaced short time courses with unseen time points are treated as hidden state variables. Hierarchical Bayesian approaches with various prior and hyper-prior models with Monte Carlo Markov Chain and Gibbs sampling algorithms are used to estimate the model parameters and the hidden state variables. We apply the proposed Hierarchical Bayesian state space models to multiple tissues (liver, skeletal muscle, and kidney) Affymetrix time course data sets following corticosteroid (CS) drug administration. Both simulation and real data analysis results show that the genomic changes over time and gene-gene interaction in response to CS treatment can be well captured by the proposed models. The proposed dynamic Hierarchical Bayesian state space modeling approaches could be expanded and applied to other large scale genomic data, such as next generation sequence (NGS) combined with real time and time varying electronic health record (EHR) for more comprehensive and robust systematic and network based analysis in order to transform big biomedical data into predictions and diagnostics for precision medicine and personalized healthcare with better decision making and patient outcomes.

  4. Bayesian Model Comparison for the Order Restricted RC Association Model

    ERIC Educational Resources Information Center

    Iliopoulos, G.; Kateri, M.; Ntzoufras, I.

    2009-01-01

    Association models constitute an attractive alternative to the usual log-linear models for modeling the dependence between classification variables. They impose special structure on the underlying association by assigning scores on the levels of each classification variable, which can be fixed or parametric. Under the general row-column (RC)…

  5. spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.

    PubMed

    Dellicour, Simon; Mardulyn, Patrick

    2014-05-01

    SPADS 1.0 (for 'Spatial and Population Analysis of DNA Sequences') is a population genetic toolbox for characterizing genetic variability within and among populations from DNA sequences. In view of the drastic increase in genetic information available through sequencing methods, spads was specifically designed to deal with multilocus data sets of DNA sequences. It computes several summary statistics from populations or groups of populations, performs input file conversions for other population genetic programs and implements locus-by-locus and multilocus versions of two clustering algorithms to study the genetic structure of populations. The toolbox also includes two MATLAB and R functions, GDISPAL and GDIVPAL, to display differentiation and diversity patterns across landscapes. These functions aim to generate interpolating surfaces based on multilocus distance and diversity indices. In the case of multiple loci, such surfaces can represent a useful alternative to multiple pie charts maps traditionally used in phylogeography to represent the spatial distribution of genetic diversity. These coloured surfaces can also be used to compare different data sets or different diversity and/or distance measures estimated on the same data set. © 2013 John Wiley & Sons Ltd.

  6. Genetic diversity analysis of Leuconostoc mesenteroides from Korean vegetables and food products by multilocus sequence typing.

    PubMed

    Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo

    2018-06-01

    In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragments of the seven amplified housekeeping genes ranged in length from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs), with 25 of them being represented by a single strain, indicating high genetic diversity, whereas the remaining 2 were represented by five strains each. In total, 220 polymorphic nucleotide sites were detected among the seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study shows that multilocus sequence typing is a valuable tool to assess the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor evolutionary changes.

  7. A bayesian translational framework for knowledge propagation, discovery, and integration under specific contexts.

    PubMed

    Deng, Michelle; Zollanvari, Amin; Alterovitz, Gil

    2012-01-01

    The immense corpus of biomedical literature existing today poses challenges in information search and integration. Many links between pieces of knowledge occur or are significant only under certain contexts-rather than under the entire corpus. This study proposes using networks of ontology concepts, linked based on their co-occurrences in annotations of abstracts of biomedical literature and descriptions of experiments, to draw conclusions based on context-specific queries and to better integrate existing knowledge. In particular, a Bayesian network framework is constructed to allow for the linking of related terms from two biomedical ontologies under the queried context concept. Edges in such a Bayesian network allow associations between biomedical concepts to be quantified and inference to be made about the existence of some concepts given prior information about others. This approach could potentially be a powerful inferential tool for context-specific queries, applicable to ontologies in other fields as well.

  8. A Bayesian Translational Framework for Knowledge Propagation, Discovery, and Integration Under Specific Contexts

    PubMed Central

    Deng, Michelle; Zollanvari, Amin; Alterovitz, Gil

    2012-01-01

    The immense corpus of biomedical literature existing today poses challenges in information search and integration. Many links between pieces of knowledge occur or are significant only under certain contexts—rather than under the entire corpus. This study proposes using networks of ontology concepts, linked based on their co-occurrences in annotations of abstracts of biomedical literature and descriptions of experiments, to draw conclusions based on context-specific queries and to better integrate existing knowledge. In particular, a Bayesian network framework is constructed to allow for the linking of related terms from two biomedical ontologies under the queried context concept. Edges in such a Bayesian network allow associations between biomedical concepts to be quantified and inference to be made about the existence of some concepts given prior information about others. This approach could potentially be a powerful inferential tool for context-specific queries, applicable to ontologies in other fields as well. PMID:22779044

  9. Application of Bayesian model averaging to measurements of the primordial power spectrum

    NASA Astrophysics Data System (ADS)

    Parkinson, David; Liddle, Andrew R.

    2010-11-01

    Cosmological parameter uncertainties are often stated assuming a particular model, neglecting the model uncertainty, even when Bayesian model selection is unable to identify a conclusive best model. Bayesian model averaging is a method for assessing parameter uncertainties in situations where there is also uncertainty in the underlying model. We apply model averaging to the estimation of the parameters associated with the primordial power spectra of curvature and tensor perturbations. We use CosmoNest and MultiNest to compute the model evidences and posteriors, using cosmic microwave data from WMAP, ACBAR, BOOMERanG, and CBI, plus large-scale structure data from the SDSS DR7. We find that the model-averaged 95% credible interval for the spectral index using all of the data is 0.940

  10. A multilocus approach to assessing co-evolutionary relationships between Steinernema spp. (Nematoda: Steinernematidae) and their bacterial symbionts Xenorhabdus spp. (gamma-Proteobacteria: Enterobacteriaceae).

    PubMed

    Lee, Ming-Min; Stock, S Patricia

    2010-09-01

    Nematodes of the genus Steinernema Travassos, 1927 (Nematoda: Steinernematidae) and their associated bacteria, Xenorhabdus spp. (gamma-Proteobacteria), are an emergent model of terrestrial animal-microbe symbiosis. Interest in this association initially arose out of their potential as biocontrol agents against insect pests, but, despite advances in their field application and the growing popularity of this model system, relatively little has been published to uncover the evolutionary facets of this beneficial partnership. This study adds to the body of knowledge regarding nematode-bacteria symbiosis by proposing a possible scenario for their historical association in the form of a cophylogenetic hypothesis. Topological and likelihood based testing methods were employed to reconstruct a history of association between 30 host-symbiont pairs and to gauge the level of similarity between their inferred phylogenetic patterns.

  11. Bayesian aggregation versus majority vote in the characterization of non-specific arm pain based on quantitative needle electromyography

    PubMed Central

    2010-01-01

    Background: Methods for the calculation and application of quantitative electromyographic (EMG) statistics for the characterization of EMG data detected from forearm muscles of individuals with and without pain associated with repetitive strain injury are presented. Methods: A classification procedure using a multi-stage application of Bayesian inference is presented that characterizes a set of motor unit potentials acquired using needle electromyography. The utility of this technique in characterizing EMG data obtained from both normal individuals and those presenting with symptoms of "non-specific arm pain" is explored and validated. The efficacy of the Bayesian technique is compared with simple voting methods. Results: The aggregate Bayesian classifier presented is found to perform with accuracy equivalent to that of majority voting on the test data, with an overall accuracy greater than 0.85. Theoretical foundations of the technique are discussed, and are related to the observations found. Conclusions: Aggregation of motor unit potential conditional probability distributions estimated using quantitative electromyographic analysis, may be successfully used to perform electrodiagnostic characterization of "non-specific arm pain." It is expected that these techniques will also be able to be applied to other types of electrodiagnostic data. PMID:20156353
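
    The contrast between Bayesian aggregation and majority voting described in this record can be illustrated with a minimal sketch. It is not the authors' classifier: it assumes each motor unit potential contributes an independent vector of class likelihoods, and the function names and the toy numbers are hypothetical.

```python
import math


def majority_vote(likelihoods):
    """Each item votes for its most likely class; ties go to the lowest class index."""
    n_classes = len(likelihoods[0])
    votes = [0] * n_classes
    for item in likelihoods:
        winner = max(range(n_classes), key=lambda k: item[k])
        votes[winner] += 1
    return max(range(n_classes), key=lambda k: votes[k])


def bayesian_aggregate(likelihoods, prior=None):
    """Posterior over classes: prior times the product of per-item likelihoods.

    Assumes the items are conditionally independent given the class
    (an illustrative assumption, not a claim about the published method).
    """
    n_classes = len(likelihoods[0])
    if prior is None:
        prior = [1.0 / n_classes] * n_classes
    # Work in log space so that long products do not underflow.
    log_post = [math.log(p) for p in prior]
    for item in likelihoods:
        for k in range(n_classes):
            log_post[k] += math.log(max(item[k], 1e-12))
    peak = max(log_post)
    weights = [math.exp(v - peak) for v in log_post]
    total = sum(weights)
    return [w / total for w in weights]


# Toy data: two weak votes for class 0, one strong vote for class 1.
toy = [[0.55, 0.45], [0.55, 0.45], [0.05, 0.95]]
vote_winner = majority_vote(toy)
posterior = bayesian_aggregate(toy)
```

    In this toy case the vote count favours class 0, while the aggregated posterior favours class 1 because the third item carries much stronger evidence. The two rules can also agree, as the record reports for its test data.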

  12. Bayesian data analysis for newcomers.

    PubMed

    Kruschke, John K; Liddell, Torrin M

    2018-02-01

    This article explains the foundational concepts of Bayesian data analysis using virtually no mathematical notation. Bayesian ideas already match your intuitions from everyday reasoning and from traditional data analysis. Simple examples of Bayesian data analysis are presented that illustrate how the information delivered by a Bayesian analysis can be directly interpreted. Bayesian approaches to null-value assessment are discussed. The article clarifies misconceptions about Bayesian methods that newcomers might have acquired elsewhere. We discuss prior distributions and explain how they are not a liability but an important asset. We discuss the relation of Bayesian data analysis to Bayesian models of mind, and we briefly discuss what methodological problems Bayesian data analysis is not meant to solve. After you have read this article, you should have a clear sense of how Bayesian data analysis works and the sort of information it delivers, and why that information is so intuitive and useful for drawing conclusions from data.

  13. Bayesian algorithm implementation in a real time exposure assessment model on benzene with calculation of associated cancer risks.

    PubMed

    Sarigiannis, Dimosthenis A; Karakitsios, Spyros P; Gotti, Alberto; Papaloukas, Costas L; Kassomenos, Pavlos A; Pilidis, Georgios A

    2009-01-01

    The objective of the current study was the development of a reliable modeling platform to calculate in real time the personal exposure and the associated health risk for filling station employees evaluating current environmental parameters (traffic, meteorological and amount of fuel traded) determined by the appropriate sensor network. A set of Artificial Neural Networks (ANNs) was developed to predict benzene exposure pattern for the filling station employees. Furthermore, a Physiology Based Pharmaco-Kinetic (PBPK) risk assessment model was developed in order to calculate the lifetime probability distribution of leukemia to the employees, fed by data obtained by the ANN model. Bayesian algorithm was involved in crucial points of both model sub compartments. The application was evaluated in two filling stations (one urban and one rural). Among several algorithms available for the development of the ANN exposure model, Bayesian regularization provided the best results and seemed to be a promising technique for prediction of the exposure pattern of that occupational population group. On assessing the estimated leukemia risk under the scope of providing a distribution curve based on the exposure levels and the different susceptibility of the population, the Bayesian algorithm was a prerequisite of the Monte Carlo approach, which is integrated in the PBPK-based risk model. In conclusion, the modeling system described herein is capable of exploiting the information collected by the environmental sensors in order to estimate in real time the personal exposure and the resulting health risk for employees of gasoline filling stations.

  14. Bayesian Algorithm Implementation in a Real Time Exposure Assessment Model on Benzene with Calculation of Associated Cancer Risks

    PubMed Central

    Sarigiannis, Dimosthenis A.; Karakitsios, Spyros P.; Gotti, Alberto; Papaloukas, Costas L.; Kassomenos, Pavlos A.; Pilidis, Georgios A.

    2009-01-01

    The objective of the current study was the development of a reliable modeling platform to calculate in real time the personal exposure and the associated health risk for filling station employees evaluating current environmental parameters (traffic, meteorological and amount of fuel traded) determined by the appropriate sensor network. A set of Artificial Neural Networks (ANNs) was developed to predict benzene exposure pattern for the filling station employees. Furthermore, a Physiology Based Pharmaco-Kinetic (PBPK) risk assessment model was developed in order to calculate the lifetime probability distribution of leukemia to the employees, fed by data obtained by the ANN model. Bayesian algorithm was involved in crucial points of both model sub compartments. The application was evaluated in two filling stations (one urban and one rural). Among several algorithms available for the development of the ANN exposure model, Bayesian regularization provided the best results and seemed to be a promising technique for prediction of the exposure pattern of that occupational population group. On assessing the estimated leukemia risk under the scope of providing a distribution curve based on the exposure levels and the different susceptibility of the population, the Bayesian algorithm was a prerequisite of the Monte Carlo approach, which is integrated in the PBPK-based risk model. In conclusion, the modeling system described herein is capable of exploiting the information collected by the environmental sensors in order to estimate in real time the personal exposure and the resulting health risk for employees of gasoline filling stations. PMID:22399936

  15. Isolation and characterization of atypical Listeria monocytogenes associated with a canine urinary tract infection.

    PubMed

    Palerme, Jean-Sébastien; Pan, Po Ching; Parsons, Cameron T; Kathariou, Sophia; Ward, Todd J; Jacob, Megan E

    2016-09-01

    Listeria monocytogenes, a well-described cause of encephalitis and abortion in ruminants and of food-borne illness in humans, is rarely associated with disease in companion animals. A case of urinary tract infection associated with an atypical, weakly hemolytic L. monocytogenes strain is described in a diabetic dog. The serotype of the L. monocytogenes isolate was determined to be 1/2a (3a), with the multilocus genotyping pattern 2.72_1/2a. A nucleotide substitution (Gly145Asp) was detected at residue 145 in the promoter prfA region. This residue is within the critical helix-turn-helix motif of PrfA. The source of the L. monocytogenes strain remains unknown, and the dog recovered after a 4-week course of cephalexin (30 mg/kg orally twice daily). © 2016 The Author(s).

  16. Adult myeloid leukaemia and radon exposure: a Bayesian model for a case-control study with error in covariates.

    PubMed

    Toti, Simona; Biggeri, Annibale; Forastiere, Francesco

    2005-06-30

    The possible association between radon exposure in dwellings and adult myeloid leukaemia had been explored in an Italian province by a case-control study. A total of 44 cases and 211 controls were selected from the death certificates file. No association had been found in the original study (OR = 0.58 for > 185 vs ≤ 80 Bq/cm). Here we reanalyse the data taking into account the measurement error of radon concentration and the presence of missing data. A Bayesian hierarchical model with error in covariates is proposed which allows appropriate imputation of missing values. The general conclusion of no evidence of association with radon does not change, but a negative association is not observed anymore (OR = 0.99 for > 185 vs ≤ 80 Bq/cm). After adjusting for residential house radon and gamma radiation, and for the multilevel data structure, geological features of the soil are associated with adult myeloid leukaemia risk (OR = 2.14, 95 per cent Cr.I. 1.0-5.5). Copyright 2005 John Wiley & Sons, Ltd.

  17. Evolution of Associative Learning in Chemical Networks

    PubMed Central

    McGregor, Simon; Vasas, Vera; Husbands, Phil; Fernando, Chrisantha

    2012-01-01

    Organisms that can learn about their environment and modify their behaviour appropriately during their lifetime are more likely to survive and reproduce than organisms that do not. While associative learning – the ability to detect correlated features of the environment – has been studied extensively in nervous systems, where the underlying mechanisms are reasonably well understood, mechanisms within single cells that could allow associative learning have received little attention. Here, using in silico evolution of chemical networks, we show that there exists a diversity of remarkably simple and plausible chemical solutions to the associative learning problem, the simplest of which uses only one core chemical reaction. We then asked to what extent a linear combination of chemical concentrations in the network could approximate the ideal Bayesian posterior of an environment given the stimulus history so far. This Bayesian analysis revealed the ‘memory traces’ of the chemical network. The implication of this paper is that there is little reason to believe that a lack of suitable phenotypic variation would prevent associative learning from evolving in cell signalling, metabolic, gene regulatory, or a mixture of these networks in cells. PMID:23133353

  18. Molecular Epidemiologic Analysis of Enterococcus faecalis Isolates in Cuba by Multilocus Sequence Typing

    PubMed Central

    Kobayashi, Nobumichi; Nagashima, Shigeo

    2009-01-01

    We carried out the first study of Enterococcus faecalis clinical isolates in Cuba by multilocus sequence typing linking the molecular typing data with the presence of virulence determinants and the antibiotic resistance genes. A total of 23 E. faecalis isolates recovered from several clinic sources and geographic areas of Cuba during a period between 2000 and 2005 were typed by multilocus sequence typing. Thirteen sequence types (STs) including five novel STs were identified, and the ST 64 (clonal complex [CC] 8), ST 6 (CC2), ST 21(CC21), and ST 16 (CC58) were found in more than one strain. Sixty-seven percent of STs corresponded to STs reported previously in Spain, Poland, and The Netherlands, and other STs (ST115, ST64, ST6, and ST40) were genetically close to those detected in the United States. Prevalence of both antimicrobial resistance genes [aac(6′)-aph(2″), aph(3′), ant(6), ant(3″)(9), aph(2″)-Id, aph(2″)-Ic, erm(B), erm(A), erm(C), mef(A), tet(M), and tet(L)] and virulence genes (agg, gelE, cylA, esp, ccf, and efaAfs) were examined by polymerase chain reaction. Aminoglycoside resistance genes aac(6′)-Ie-aph(2″)-Ia, aph(3′), ant(6), ant(3″)(9) were more frequently detected in ST6, ST16, ST23, ST64, and ST115. The multidrug resistance was distributed to all STs detected, except for ST117 and singleton ST225. The presence of cyl gene was specifically linked to the ST64 and ST16. Presence of the esp, gel, and agg genes was not specific to any particular ST. This research provided the first insight into the population structure of E. faecalis in Cuba, that is, most Cuban strains were related to European strains, whereas others to U.S. strains. The CC2, CC21, and CC8, three of the biggest CCs in the world, were evidently circulating in Cuba, associated with multidrug resistance and virulence traits. PMID:19857135

  19. Molecular phylogenetic diversity, multilocus haplotype nomenclature, and in vitro antifungal resistance within the Fusarium solani species complex.

    PubMed

    O'Donnell, Kerry; Sutton, Deanna A; Fothergill, Annette; McCarthy, Dora; Rinaldi, Michael G; Brandt, Mary E; Zhang, Ning; Geiser, David M

    2008-08-01

    Members of the species-rich Fusarium solani species complex (FSSC) are responsible for approximately two-thirds of all fusarioses of humans and other animals. In addition, many economically important phytopathogenic species are nested within this complex. Due to their increasing clinical relevance and because most of the human pathogenic and plant pathogenic FSSC lack Latin binomials, we have extended the multilocus haplotype nomenclatural system introduced in a previous study (D. C. Chang, G. B. Grant, K. O'Donnell, K. A. Wannemuehler, J. Noble-Wang, C. Y. Rao, L. M. Jacobson, C. S. Crowell, R. S. Sneed, F. M. T. Lewis, J. K. Schaffzin, M. A. Kainer, C. A. Genese, E. C. Alfonso, D. B. Jones, A. Srinivasan, S. K. Fridkin, and B. J. Park, JAMA 296:953-963, 2006) to all 34 species within the medically important FSSC clade 3 to facilitate global epidemiological studies. The typing scheme is based on polymorphisms in portions of the following three genes: the internal transcribed spacer region and domains D1 plus D2 of the nuclear large-subunit rRNA, the translation elongation factor 1 alpha gene (EF-1alpha), and the second largest subunit of RNA polymerase II gene (RPB2). Of the 251 isolates subjected to multilocus DNA sequence typing, 191 sequence types were differentiated, and these were distributed among three strongly supported clades designated 1, 2, and 3. All of the mycosis-associated isolates were restricted to FSSC clade 3, as previously reported (N. Zhang, K. O'Donnell, D. A. Sutton, F. A Nalim, R. C. Summerbell, A. A. Padhye, and D. M. Geiser, J. Clin. Microbiol. 44:2186-2190, 2006), and these represent at least 20 phylogenetically distinct species. Analyses of the combined DNA sequence data by use of two separate phylogenetic methods yielded the most robust hypothesis of evolutionary relationships and genetic diversity within the FSSC to date. The in vitro activities of 10 antifungals tested against 19 isolates representing 18 species that span the breadth of the FSSC phylogeny show that members of this complex are broadly resistant to these drugs.

  20. Multilocus PCR-RFLP profiling in Trypanosoma cruzi I highlights an intraspecific genetic variation pattern.

    PubMed

    Ramírez, Juan David; Duque, María Clara; Montilla, Marleny; Cucunubá, Zulma M; Guhl, Felipe

    2012-12-01

    Chagas disease represents a serious problem in public health. This zoonotic pathology is caused by the kinetoplastid Trypanosoma cruzi which displays a high genetic diversity falling into six Discrete Typing Units (TcI-TcVI). In Colombia, the prevalent DTU is TcI with findings of TcII, TcIII and TcIV in low proportions. The aim of this work was to observe the genetic variability within TcI using a multilocus PCR-RFLP strategy. We analyzed 70 single-celled clones from triatomines, reservoirs and humans that were amplified and restricted via ten PCR-RFLPs targets across TcI genome, the restriction fragments were used to construct phylograms according to calculated genetic distances. We obtained five polymorphic targets (1f8, HSP60, HSP70, SAPA and H1) and the consensus tree constructed according to these regions allowed us to observe two well-defined groups with close association to the transmission cycles (domestic/peridomestic and sylvatic) of Chagas disease in Colombia. Our findings allowed us to corroborate the previous reported genotypes based on the intergenic region of mini-exon gene. More studies examining the genetic diversity among T. cruzi I populations must be conducted in order to obtain a better understanding in regions where this DTU is endemic. Copyright © 2012 Elsevier B.V. All rights reserved.

  1. Molecular typing of methicillin-resistant Staphylococcus aureus: Comparison of PCR-based open reading frame typing, multilocus sequence typing, and Staphylococcus protein A gene typing.

    PubMed

    Ogihara, Shinji; Saito, Ryoichi; Sawabe, Etsuko; Kozakai, Takahiro; Shima, Mari; Aiso, Yoshibumi; Fujie, Toshihide; Nukui, Yoko; Koike, Ryuji; Hagihara, Michio; Tohda, Shuji

    2018-04-01

    The recently developed PCR-based open reading frame typing (POT) method is a useful molecular typing tool. Here, we evaluated the performance of POT for molecular typing of methicillin-resistant Staphylococcus aureus (MRSA) isolates and compared its performance to those of multilocus sequence typing (MLST) and Staphylococcus protein A gene typing (spa typing). Thirty-seven MRSA isolates were collected between July 2012 and May 2015. MLST, spa typing, and POT were performed, and their discriminatory powers were evaluated using Simpson's index analysis. The MRSA isolates were classified into 11, 18, and 33 types by MLST, spa typing, and POT, respectively. The predominant strains identified by MLST, spa typing, and POT were ST8 and ST764, t002, and 93-191-127, respectively. The discriminatory power of MLST, spa typing, and POT was 0.853, 0.875, and 0.992, respectively, indicating that POT had the highest discriminatory power. Moreover, the results of MLST and spa were available after 2 days, whereas that of POT was available in 5 h. Furthermore, POT is rapid and easy to perform and interpret. Therefore, POT is a superior molecular typing tool for monitoring nosocomial transmission of MRSA. Copyright © 2017 Japanese Society of Chemotherapy and The Japanese Association for Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
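
    The discriminatory power quoted in this record is a Simpson-type index. The sketch below assumes the Hunter-Gaston formulation commonly used for typing methods, D = 1 - [sum of n_j(n_j - 1)] / [N(N - 1)], where n_j is the number of isolates of type j and N is the total number of isolates. The record does not state the exact formula used, and the function name and toy labels here are hypothetical.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Probability that two isolates drawn without replacement have different types."""
    n_total = len(type_labels)
    if n_total < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Toy example: four isolates, two of which share a type.
d_toy = discriminatory_index(["A", "A", "B", "C"])
```

    A value of 1.0 means every isolate has its own type and 0.0 means all isolates share one type, so a method that splits the same isolates into more, smaller groups scores higher.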

  2. Multilocus Sequence Analysis of Nectar Pseudomonads Reveals High Genetic Diversity and Contrasting Recombination Patterns

    PubMed Central

    Álvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M.

    2013-01-01

    The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas ‘sensu stricto’ isolates associated to South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria. PMID:24116076

  3. The epidemiology of listeriosis in the United States--1986. Listeriosis Study Group.

    PubMed

    Gellin, B G; Broome, C V; Bibb, W F; Weaver, R E; Gaventa, S; Mascola, L

    1991-02-15

    To determine the morbidity and mortality due to listeriosis in the United States, the authors undertook an active surveillance project in 1986 to identify all cases in which Listeria monocytogenes was isolated from cultures of ordinarily sterile sites in a population of 34 million persons. The authors estimated that at least 1,700 cases of listeriosis and 450 deaths occurred in the United States in 1986; 27% of these cases occurred in pregnant women, with 22% of perinatal cases resulting in stillbirths or neonatal deaths. The risk of listeriosis in adults (0.5 per 100,000 population) was similar in all regions studied; the incidence of perinatal listeriosis was three times higher in Los Angeles County, California, than in the other areas (24.3/100,000 live births vs. 7.8/100,000 live births). Geographic variation may have resulted from underdiagnosis of perinatal listeriosis in five of the study areas. Multilocus electrophoretic enzyme typing was useful for elucidating the molecular epidemiology of L. monocytogenes; perinatal listeriosis was significantly associated with one group of related strains. Multilocus electrophoretic enzyme typing also identified three clusters representing possible common-source outbreaks. These findings document the substantial morbidity due to listeriosis in the United States; to the extent that sporadic listeriosis is foodborne, this morbidity could be reduced by appropriate preventive measures, particularly in persons known to be at increased risk of infection.

  4. The evolution and population structure of Lactobacillus fermentum from different naturally fermented products as determined by multilocus sequence typing (MLST).

    PubMed

    Dan, Tong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Menghe, Bilige; Zhang, Heping; Sun, Zhihong

    2015-05-20

    Lactobacillus fermentum is economically important in the production and preservation of fermented foods. A repeatable and discriminative typing method was devised to characterize L. fermentum at the molecular level. The multilocus sequence typing (MLST) scheme developed was based on analysis of the internal sequence of 11 housekeeping gene fragments (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). MLST analysis of 203 isolates of L. fermentum from Mongolia and seven provinces/autonomous regions in China identified 57 sequence types (ST), 27 of which were represented by only a single isolate, indicating high genetic diversity. Phylogenetic analyses based on the sequence of the 11 housekeeping gene fragments indicated that the L. fermentum isolates analyzed belonged to two major groups. A standardized index of association (I_A^S) indicated a weak clonal population structure in L. fermentum. Split decomposition analysis indicated that recombination played an important role in generating the genetic diversity observed in L. fermentum. The results from the minimum spanning tree strongly suggested that evolution of L. fermentum STs was not correlated with geography or food-type. The MLST scheme developed will be valuable for further studies on the evolution and population structure of L. fermentum isolates used in food products.

  5. Multilocus sequence typing analyses of Clostridium perfringens type A strains harboring tpeL and netB genes.

    PubMed

    Nakano, V; Ignacio, A; Llanco, L; Bueris, V; Sircili, M P; Avila-Campos, M J

    2017-04-01

    Clostridium perfringens is an anaerobic bacterium ubiquitous in various environments, especially in soil and the gastrointestinal tract of healthy humans and animals. In this study, multilocus sequence typing protocol was used to investigate genotypic relationships among 40 C. perfringens strains isolated from humans and broiler chickens with necrotic enteritis [NE]. The results indicated a few clonal populations, mainly observed in human strains, with 32.5% of all strains associated with one of three clonal complexes and 30 sequence types. The CC-1 cluster showed an interesting and unexpected result because it contained seven strains [six from animals and one of human origin]. Detection assays for toxin genes tpeL and netB were also performed. The netB gene was only observed in 7.5% of the strains from healthy humans. The toxin gene tpeL was detected in 22.5% of the C. perfringens strains isolated from three individuals and in six broilers with NE. Our study describes the role of some C. perfringens strains of human origin acting as reservoirs of virulence genes and sources of infection. In addition, the strains of human and animal origin were found to be genetically distinct but phylogenetically close, and the human strains showed more diversity than the animal strains. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Methicillin-resistant Staphylococcus aureus from dental school clinic surfaces and students.

    PubMed

    Roberts, Marilyn C; Soge, Olusegun O; Horst, Jeremy A; Ly, Kiet A; Milgrom, Peter

    2011-10-01

    Methicillin-resistant Staphylococcus aureus (MRSA) isolated from frequently touched dental school clinic surfaces were compared with MRSA isolated from nasal cultures of dental students. Sixty-one dental students and 95 environmental surfaces from 7 clinics were sampled using SANICULT (Starplex Scientific Inc, Etobicoke, Ontario, Canada) swabs. Antimicrobial susceptibility testing was performed, and pulsed-field gel electrophoresis analysis, the mecA gene, multilocus sequence type, and SCCmec type were determined by polymerase chain reaction and sequencing. Thirteen (21%) dental students and 8 (8.4%) surfaces were MRSA positive. Three MRSA strains were SCCmec type IV, whereas 3 were nontypeable isolates and Panton-Valentine leukocidin positive (PVL+), and none were USA300. One surface and 1 student isolate shared the same multilocus sequence type ST 8 and were 75% related. Two groups of students carried the same MRSA strains. The MRSA-positive samples were from 4 of 7 dental clinics. In addition, 21% of the dental students carried MRSA, which is > 10 times higher than the general public and twice as frequent as in other university students. This is the first study to characterize MRSA from dental clinic surfaces and dental students and suggests that both may be reservoirs for MRSA. Further studies are needed to verify this premise. Copyright © 2011 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Mosby, Inc. All rights reserved.

  7. Multi-laboratory validation study of multilocus variable-number tandem repeat analysis (MLVA) for Salmonella enterica serovar Enteritidis, 2015.

    PubMed

    Peters, Tansy; Bertrand, Sophie; Björkman, Jonas T; Brandal, Lin T; Brown, Derek J; Erdõsi, Tímea; Heck, Max; Ibrahem, Salha; Johansson, Karin; Kornschober, Christian; Kotila, Saara M; Le Hello, Simon; Lienemann, Taru; Mattheus, Wesley; Nielsen, Eva Møller; Ragimbeau, Catherine; Rumore, Jillian; Sabol, Ashley; Torpdahl, Mia; Trees, Eija; Tuohy, Alma; de Pinna, Elizabeth

    2017-03-02

    Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data. This article is copyright of The Authors, 2017.

  8. Identifying causal networks linking cancer processes and anti-tumor immunity using Bayesian network inference and metagene constructs.

    PubMed

    Kaiser, Jacob L; Bland, Cassidy L; Klinke, David J

    2016-03-01

    Cancer arises from a deregulation of both intracellular and intercellular networks that maintain system homeostasis. Identifying the architecture of these networks and how they are changed in cancer is a pre-requisite for designing drugs to restore homeostasis. Since intercellular networks only appear in intact systems, it is difficult to identify how these networks become altered in human cancer using many of the common experimental models. To overcome this, we used the diversity in normal and malignant human tissue samples from the Cancer Genome Atlas (TCGA) database of human breast cancer to identify the topology associated with intercellular networks in vivo. To improve the underlying biological signals, we constructed Bayesian networks using metagene constructs, which represented groups of genes that are concomitantly associated with different immune and cancer states. We also used bootstrap resampling to establish the significance associated with the inferred networks. In short, we found opposing relationships between cell proliferation and epithelial-to-mesenchymal transformation (EMT) with regards to macrophage polarization. These results were consistent across multiple carcinomas in that proliferation was associated with a type 1 cell-mediated anti-tumor immune response and EMT was associated with a pro-tumor anti-inflammatory response. To address the identifiability of these networks from other datasets, we could identify the relationship between EMT and macrophage polarization with fewer samples when the Bayesian network was generated from malignant samples alone. However, the relationship between proliferation and macrophage polarization was identified with fewer samples when the samples were taken from a combination of the normal and malignant samples. © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:470-479, 2016. © 2016 American Institute of Chemical Engineers.

  9. Social deprivation, inequality, and the neighborhood-level incidence of psychotic syndromes in East London.

    PubMed

    Kirkbride, James B; Jones, Peter B; Ullrich, Simone; Coid, Jeremy W

    2014-01-01

    Although urban birth, upbringing, and living are associated with increased risk of nonaffective psychotic disorders, few studies have used appropriate multilevel techniques accounting for spatial dependency in risk to investigate social, economic, or physical determinants of psychosis incidence. We adopted Bayesian hierarchical modeling to investigate the sociospatial distribution of psychosis risk in East London for DSM-IV nonaffective and affective psychotic disorders, ascertained over a 2-year period in the East London first-episode psychosis study. We included individual and environmental data on 427 subjects experiencing first-episode psychosis to estimate the incidence of disorder across 56 neighborhoods, having standardized for age, sex, ethnicity, and socioeconomic status. A Bayesian model that included spatially structured neighborhood-level random effects identified substantial unexplained variation in nonaffective psychosis risk after controlling for individual-level factors. This variation was independently associated with greater levels of neighborhood income inequality (SD increase in inequality: Bayesian relative risks [RR]: 1.25; 95% CI: 1.04-1.49), absolute deprivation (RR: 1.28; 95% CI: 1.08-1.51) and population density (RR: 1.18; 95% CI: 1.00-1.41). Neighborhood ethnic composition effects were associated with incidence of nonaffective psychosis for people of black Caribbean and black African origin. No variation in the spatial distribution of the affective psychoses was identified, consistent with the possibility of differing etiological origins of affective and nonaffective psychoses. Our data suggest that both absolute and relative measures of neighborhood social composition are associated with the incidence of nonaffective psychosis. We suggest these associations are consistent with a role for social stressors in psychosis risk, particularly when people live in more unequal communities.

  10. Discussion of “Bayesian design of experiments for industrial and scientific applications via gaussian processes”

    DOE PAGES

    Anderson-Cook, Christine M.; Burke, Sarah E.

    2016-10-18

    First, we would like to commend Dr. Woods on his thought-provoking paper and insightful presentation at the 4th Annual Stu Hunter conference. We think that the material presented highlights some important needs in the area of design of experiments for generalized linear models (GLMs). In addition, we agree with Dr. Woods that design of experiments for GLMs does implicitly require expert judgement about model parameters, and hence using a Bayesian approach to capture this knowledge is a natural strategy to summarize what is known with the opportunity to incorporate associated uncertainty about that information.

  11. Discussion of “Bayesian design of experiments for industrial and scientific applications via gaussian processes”

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson-Cook, Christine M.; Burke, Sarah E.

    First, we would like to commend Dr. Woods on his thought-provoking paper and insightful presentation at the 4th Annual Stu Hunter conference. We think that the material presented highlights some important needs in the area of design of experiments for generalized linear models (GLMs). In addition, we agree with Dr. Woods that design of experiments for GLMs does implicitly require expert judgement about model parameters, and hence using a Bayesian approach to capture this knowledge is a natural strategy to summarize what is known with the opportunity to incorporate associated uncertainty about that information.

  12. Bayes factors and multimodel inference

    USGS Publications Warehouse

    Link, W.A.; Barker, R.J.; Thomson, David L.; Cooch, Evan G.; Conroy, Michael J.

    2009-01-01

    Multimodel inference has two main themes: model selection, and model averaging. Model averaging is a means of making inference conditional on a model set, rather than on a selected model, allowing formal recognition of the uncertainty associated with model choice. The Bayesian paradigm provides a natural framework for model averaging, and provides a context for evaluation of the commonly used AIC weights. We review Bayesian multimodel inference, noting the importance of Bayes factors. Noting the sensitivity of Bayes factors to the choice of priors on parameters, we define and propose nonpreferential priors as offering a reasonable standard for objective multimodel inference.
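
    The two weighting schemes this abstract contrasts can be sketched as follows. This is a minimal illustration and not code from the paper: the function names are hypothetical, and the inputs (AIC values, log marginal likelihoods, per-model estimates) are assumed to have been computed elsewhere. AIC weights are proportional to exp(-ΔAIC/2); posterior model probabilities are proportional to prior probability times marginal likelihood, so the ratio of two posterior probabilities under equal priors is the Bayes factor.

```python
import math


def aic_weights(aic_values):
    """Akaike weights: w_i proportional to exp(-0.5 * (AIC_i - AIC_min))."""
    best = min(aic_values)
    rel = [math.exp(-0.5 * (a - best)) for a in aic_values]
    total = sum(rel)
    return [r / total for r in rel]


def posterior_model_probs(log_marginal_likelihoods, prior_probs=None):
    """Posterior model probabilities from log marginal likelihoods.

    Assumes equal prior model probabilities unless prior_probs is given.
    """
    k = len(log_marginal_likelihoods)
    if prior_probs is None:
        prior_probs = [1.0 / k] * k
    # Subtract the maximum before exponentiating to avoid underflow.
    top = max(log_marginal_likelihoods)
    unnorm = [p * math.exp(l - top)
              for l, p in zip(log_marginal_likelihoods, prior_probs)]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def model_average(estimates, weights):
    """Model-averaged estimate: weighted sum of per-model estimates."""
    return sum(e * w for e, w in zip(estimates, weights))
```

    Either set of weights can be passed to model_average; which set is appropriate depends on the priors, which is the sensitivity the abstract draws attention to.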

  13. The Development of Bayesian Theory and Its Applications in Business and Bioinformatics

    NASA Astrophysics Data System (ADS)

    Zhang, Yifei

    2018-03-01

    Bayesian Theory originated from an essay by the British mathematician Thomas Bayes published in 1763, and after its development in the 20th century, Bayesian Statistics has played a significant part in statistical study across all fields. Owing to recent breakthroughs in high-dimensional integration, Bayesian Statistics has been improved and refined, and it can now be used to solve problems that Classical Statistics failed to solve. This paper summarizes the history, concepts and applications of Bayesian Statistics, which are presented in five parts: the history of Bayesian Statistics, the weakness of Classical Statistics, and Bayesian Theory and its development and applications. The first two parts compare Bayesian Statistics and Classical Statistics at a macroscopic level. The last three parts focus on Bayesian Theory specifically -- from introducing some particular concepts of Bayesian Statistics to listing their development and finally their applications.

  14. Bayesian demography 250 years after Bayes

    PubMed Central

    Bijak, Jakub; Bryant, John

    2016-01-01

    Bayesian statistics offers an alternative to classical (frequentist) statistics. It is distinguished by its use of probability distributions to describe uncertain quantities, which leads to elegant solutions to many difficult statistical problems. Although Bayesian demography, like Bayesian statistics more generally, is around 250 years old, only recently has it begun to flourish. The aim of this paper is to review the achievements of Bayesian demography, address some misconceptions, and make the case for wider use of Bayesian methods in population studies. We focus on three applications: demographic forecasts, limited data, and highly structured or complex models. The key advantages of Bayesian methods are the ability to integrate information from multiple sources and to describe uncertainty coherently. Bayesian methods also allow for including additional (prior) information next to the data sample. As such, Bayesian approaches are complementary to many traditional methods, which can be productively re-expressed in Bayesian terms. PMID:26902889

  15. Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.

    PubMed

    Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun

    2016-06-01

    The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene-steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (P < 0.05); especially, SNP rs6532496 revealed the strongest association with cancer (P = 6.84 × 10⁻³); while the next best signal was rs951613 (P = 7.46 × 10⁻³). Classic logistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P < 0.05); whereas 13 SNPs showed gene-steroid interaction effects without main effect on cancer. SNP rs4634230 revealed the strongest gene-steroid interaction effect (OR=2.49, 95% CI=1.5-4.13 with P = 4.0 × 10⁻⁴ based on the classic logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PDLIM5 gene polymorphisms and steroid use influencing cancer.
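
    The odds ratios and intervals reported for the classic logistic regression can be related to each other with a short sketch. It assumes the classic intervals are Wald intervals on the log-odds scale, which the abstract does not state; the function names are hypothetical. The Bayesian intervals are presumably credible intervals from a posterior distribution and are not reproduced by this calculation.

```python
import math

Z_95 = 1.959963984540054  # 97.5th percentile of the standard normal


def se_from_ci(lower, upper, z=Z_95):
    """Back out the standard error of log(OR) from a Wald interval."""
    return (math.log(upper) - math.log(lower)) / (2.0 * z)


def wald_ci(odds_ratio, se_log_or, z=Z_95):
    """Wald interval for an odds ratio: exp(log(OR) +/- z * SE)."""
    log_or = math.log(odds_ratio)
    return (math.exp(log_or - z * se_log_or),
            math.exp(log_or + z * se_log_or))


# Figures quoted in the abstract for rs6532496 (classic logistic regression).
se = se_from_ci(1.31, 3.63)
lo, hi = wald_ci(2.18, se)
print(f"SE(log OR) = {se:.3f}, 95% CI = {lo:.2f}-{hi:.2f}")
```

    If the Wald assumption holds, the recomputed interval should agree with the reported 1.31-3.63 up to rounding of the published figures.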

  16. A Single Multilocus Sequence Typing (MLST) Scheme for Seven Pathogenic Leptospira Species

    PubMed Central

    Amornchai, Premjit; Wuthiekanun, Vanaporn; Bailey, Mark S.; Holden, Matthew T. G.; Zhang, Cuicai; Jiang, Xiugao; Koizumi, Nobuo; Taylor, Kyle; Galloway, Renee; Hoffmaster, Alex R.; Craig, Scott; Smythe, Lee D.; Hartskeerl, Rudy A.; Day, Nicholas P.; Chantratita, Narisara; Feil, Edward J.; Aanensen, David M.; Spratt, Brian G.; Peacock, Sharon J.

    2013-01-01

    Background: The available Leptospira multilocus sequence typing (MLST) scheme supported by a MLST website is limited to L. interrogans and L. kirschneri. Our aim was to broaden the utility of this scheme to incorporate a total of seven pathogenic species. Methodology and Findings: We modified the existing scheme by replacing one of the seven MLST loci (fadD was changed to caiB), as the former gene did not appear to be present in some pathogenic species. Comparison of the original and modified schemes using data for L. interrogans and L. kirschneri demonstrated that the discriminatory power of the two schemes was not significantly different. The modified scheme was used to further characterize 325 isolates (L. alexanderi [n = 5], L. borgpetersenii [n = 34], L. interrogans [n = 222], L. kirschneri [n = 29], L. noguchii [n = 9], L. santarosai [n = 10], and L. weilii [n = 16]). Phylogenetic analysis using concatenated sequences of the 7 loci demonstrated that each species corresponded to a discrete clade, and that no strains were misclassified at the species level. Comparison between genotype and serovar was possible for 254 isolates. Of the 31 sequence types (STs) represented by at least two isolates, 18 STs included isolates assigned to two or three different serovars. Conversely, 14 serovars were identified that contained between 2 and 10 different STs. New observations were made on the global phylogeography of Leptospira spp., and the utility of MLST in making associations between human disease and specific maintenance hosts was demonstrated. Conclusion: The new MLST scheme, supported by an updated MLST website, allows the characterization and species assignment of isolates of the seven major pathogenic species associated with leptospirosis. PMID:23359622

  17. The Impact of Multilocus Variable-Number Tandem-Repeat Analysis on PulseNet Canada Escherichia coli O157:H7 Laboratory Surveillance and Outbreak Support, 2008-2012.

    PubMed

    Rumore, Jillian Leigh; Tschetter, Lorelee; Nadon, Celine

    2016-05-01

    The lack of pattern diversity among pulsed-field gel electrophoresis (PFGE) profiles for Escherichia coli O157:H7 in Canada does not consistently provide optimal discrimination, and therefore, differentiating temporally and/or geographically associated sporadic cases from potential outbreak cases can at times impede investigations. To address this limitation, DNA sequence-based methods such as multilocus variable-number tandem-repeat analysis (MLVA) have been explored. To assess the performance of MLVA as a supplemental method to PFGE from the Canadian perspective, a retrospective analysis of all E. coli O157:H7 isolated in Canada from January 2008 to December 2012 (inclusive) was conducted. A total of 2285 E. coli O157:H7 isolates and 63 clusters of cases (by PFGE) were selected for the study. Based on the qualitative analysis, the addition of MLVA improved the categorization of cases for 60% of clusters and no change was observed for ∼40% of clusters investigated. In such situations, MLVA serves to confirm PFGE results, but may not add further information per se. The findings of this study demonstrate that MLVA data, when used in combination with PFGE-based analyses, provide additional resolution to the detection of clusters lacking PFGE diversity as well as demonstrate good epidemiological concordance. In addition, MLVA is able to identify cluster-associated isolates with variant PFGE pattern combinations that may have been previously missed by PFGE alone. Optimal laboratory surveillance in Canada is achieved with the application of PFGE and MLVA in tandem for routine surveillance, cluster detection, and outbreak response.

  18. Significant spread of extensively drug-resistant Acinetobacter baumannii genotypes of clonal complex 92 among intensive care unit patients in a university hospital in southern Iran.

    PubMed

    Saffari, Fereshteh; Monsen, Tor; Karmostaji, Afsaneh; Azimabad, Fahimeh Bahadori; Widerström, Micael

    2017-11-01

    Infections associated with Acinetobacter baumannii represent an increasing threat in healthcare settings. Therefore, we investigated the epidemiological relationship between clinical isolates of A. baumannii obtained from patients in a university hospital in Bandar Abbas in southern Iran. Sixty-four consecutive non-duplicate clinical isolates collected during 2014-2015 were subjected to susceptibility testing, clonal relationship analysis using PFGE, multilocus variable-number tandem-repeat analysis (MLVA) and multilocus sequence typing (MLST), and examined for the presence of carbapenemases and integrons. Almost all A. baumannii isolates were extensively drug-resistant (XDR; 98 %) and carried an OXA carbapenemase gene (blaOXA-23-like; 98 %) and class 1 integrons (48 %). PFGE and MLST analysis identified three major genotypes, all belonging to clonal complex 92 (CC92): sequence type 848 (ST848) (n=23), ST451 (n=16) and ST195 (n=8). CC92 has previously been documented in the hospital setting in northern Iran, and ST195 has been reported in Arab States of the Persian Gulf. These data suggest national and global transmission of A. baumannii CC92. This report demonstrates the occurrence and potential spread of closely related XDR genotypes of A. baumannii CC92 within a university hospital in southern Iran. These genotypes were found in the majority of the investigated isolates, showed high prevalence of blaOXA-23 and integron class 1, and were associated with stay in the intensive care unit. Very few treatment options remain for healthcare-adapted XDR A. baumannii, and hence effective measures are desperately needed to reduce the spread of these strains and resultant infections in the healthcare setting.

  19. Phenotypic Diversification Is Associated with Host-Induced Transposon Derepression in the Sudden Oak Death Pathogen Phytophthora ramorum

    PubMed Central

    Kasuga, Takao; Kozanitas, Melina; Bui, Mai; Hüberli, Daniel; Rizzo, David M.; Garbelotto, Matteo

    2012-01-01

    The oomycete pathogen Phytophthora ramorum is responsible for sudden oak death (SOD) in California coastal forests. P. ramorum is a generalist pathogen with over 100 known host species. Three or four closely related genotypes of P. ramorum (from a single lineage) were originally introduced in California forests and the pathogen reproduces clonally. Because of this the genetic diversity of P. ramorum is extremely low in Californian forests. However, P. ramorum shows diverse phenotypic variation in colony morphology, colony senescence, and virulence. In this study, we show that phenotypic variation among isolates is associated with the host species from which the microbe was originally cultured. Microarray global mRNA profiling detected derepression of transposable elements (TEs) and down-regulation of crinkler effector homologs (CRNs) in the majority of isolates originating from coast live oak (Quercus agrifolia), but this expression pattern was not observed in isolates from California bay laurel (Umbellularia californica). In some instances, oak and bay laurel isolates originating from the same geographic location had identical genotypes based on multilocus simple sequence repeat (SSR) marker analysis but had different phenotypes. Expression levels of the two marker genes analyzed by quantitative reverse transcription PCR were correlated with originating host species, but not with multilocus genotypes. Because oak is a nontransmissive dead-end host for P. ramorum, our observations are congruent with an epi-transposon hypothesis; that is, physiological stress is triggered on P. ramorum while colonizing oak stems and disrupts epigenetic silencing of TEs. This then results in TE reactivation and possibly genome diversification without significant epidemiological consequences. We propose the P. ramorum-oak host system in California forests as an ad hoc model for epi-transposon mediated diversification. PMID:22529930

  20. Investigation of genetic diversity and epidemiological characteristics of Pasteurella multocida isolates from poultry in southwest China by population structure, multi-locus sequence typing and virulence-associated gene profile analysis.

    PubMed

    Li, Zhangcheng; Cheng, Fangjun; Lan, Shimei; Guo, Jianhua; Liu, Wei; Li, Xiaoyan; Luo, Zeli; Zhang, Manli; Wu, Juan; Shi, Yang

    2018-04-25

    Fowl cholera caused by Pasteurella multocida has always been a disease of global importance for poultry production. The aim of this study was to obtain more information about the epidemiology of avian P. multocida infection in southwest China and the genetic characteristics of clinical isolates. P. multocida isolates were characterized by biochemical and molecular-biological methods. The distributions of the capsular serogroups, the phenotypic antimicrobial resistance profiles, lipopolysaccharide (LPS) genotyping and the presence of 19 virulence genes were investigated in 45 isolates of P. multocida that were associated with clinical disease in poultry. The genetic diversity of P. multocida strains was assessed by 16S rRNA and rpoB gene sequence analysis as well as multilocus sequence typing (MLST). The results showed that most (80.0%) of the P. multocida isolates in this study represented special P. multocida subspecies, and 71.1% of the isolates showed multiple-drug resistance. All 45 isolates belonged to capsular type A (100%) and to two LPS genotypes: L1 (95.6%) and L3 (4.4%). MLST revealed two new alleles (pmi77 and gdh57) and one new sequence type (ST342). ST129 dominated among the 45 P. multocida isolates. Isolates belonging to ST129 carried the genes ompH+plpB+ptfA+tonB, whereas ST342 included isolates with fur+hgbA+tonB genes. Population genetic analysis and the MLST results revealed that at least one new ST genotype was present in the avian P. multocida in China. These findings provide novel insights into the epidemiological characteristics of avian P. multocida isolates in southwest China.

  1. Human Infections Attributable to the d-Tartrate-Fermenting Variant of Salmonella enterica Serovar Paratyphi B in Germany Originate in Reptiles and, on Rare Occasions, Poultry

    PubMed Central

    Toboldt, Anne; Tietze, Erhard; Helmuth, Reiner; Fruth, Angelika; Junker, Ernst

    2012-01-01

    In this study, the population structure, incidence, and potential sources of human infection caused by the d-tartrate-fermenting variant of Salmonella enterica serovar Paratyphi B [S. Paratyphi B (dT+)] was investigated. In Germany, the serovar is frequently isolated from broilers. Therefore, a selection of 108 epidemiologically unrelated S. enterica serovar Paratyphi B (dT+) strains isolated in Germany between 2002 and 2010 especially from humans, poultry/poultry meat, and reptiles was investigated by phenotypic and genotypic methods. Strains isolated from poultry and products thereof were strongly associated with multilocus sequence type ST28 and showed antimicrobial multiresistance profiles. Pulsed-field gel electrophoresis XbaI profiles were highly homogeneous, with only a few minor XbaI profile variants. All strains isolated from reptiles, except one, were strongly associated with ST88, another distantly related type. Most of the strains were susceptible to antimicrobial agents, and XbaI profiles were heterogeneous. Strains isolated from humans yielded seven sequence types (STs) clustering in three distantly related lineages. The first lineage, comprising five STs, represented mainly strains belonging to ST43 and ST149. The other two lineages were represented only by one ST each, ST28 and ST88. The relatedness of strains based on the pathogenicity gene repertoire (102 markers tested) was mostly in agreement with the multilocus sequence type. Because ST28 was frequently isolated from poultry but rarely in humans over the 9-year period investigated, overall, this study indicates that in Germany S. enterica serovar Paratyphi B (dT+) poses a health risk preferentially by contact with reptiles and, to a less extent, by exposure to poultry or poultry meat. PMID:22885742

  2. Genetic Dissection of Maize Embryonic Callus Regenerative Capacity Using Multi-Locus Genome-Wide Association Studies

    PubMed Central

    Ma, Langlang; Liu, Min; Yan, Yuanyuan; Qing, Chunyan; Zhang, Xiaoling; Zhang, Yanling; Long, Yun; Wang, Lei; Pan, Lang; Zou, Chaoying; Li, Zhaoling; Wang, Yanli; Peng, Huanwei; Pan, Guangtang; Jiang, Zhou; Shen, Yaou

    2018-01-01

    The regenerative capacity of the embryonic callus, a complex quantitative trait, is one of the main limiting factors for maize transformation. This trait was decomposed into five traits, namely, green callus rate (GCR), callus differentiating rate (CDR), callus plantlet number (CPN), callus rooting rate (CRR), and callus browning rate (CBR). To dissect the genetic foundation of maize transformation, in this study multi-locus genome-wide association studies (GWAS) for the five traits were performed in a population of 144 inbred lines genotyped with 43,427 SNPs. Using the phenotypic values in three environments and best linear unbiased prediction (BLUP) values, as a result, a total of 127, 56, 160, and 130 significant quantitative trait nucleotides (QTNs) were identified by mrMLM, FASTmrEMMA, ISIS EM-BLASSO, and pLARmEB, respectively. Of these QTNs, 63 QTNs were commonly detected, including 15 across multiple environments and 58 across multiple methods. Allele distribution analysis showed that the proportion of superior alleles for 36 QTNs was <50% in 31 elite inbred lines. Meanwhile, these superior alleles had obviously additive effect on the regenerative capacity. This indicates that the regenerative capacity-related traits can be improved by proper integration of the superior alleles using marker-assisted selection. Moreover, a total of 40 candidate genes were found based on these common QTNs. Some annotated genes were previously reported to relate with auxin transport, cell fate, seed germination, or embryo development, especially, GRMZM2G108933 (WOX2) was found to promote maize transgenic embryonic callus regeneration. These identified candidate genes will contribute to a further understanding of the genetic foundation of maize embryonic callus regeneration. PMID:29755499

  3. Infection Density Dynamics and Phylogeny of Wolbachia Associated with Coconut Hispine Beetle, Brontispa longissima (Gestro) (Coleoptera: Chrysomelidae), by Multilocus Sequence Type (MLST) Genotyping.

    PubMed

    Ali, Habib; Muhammad, Abrar; Hou, Youming

    2018-05-28

    The intracellular bacterium Wolbachia pipientis is widespread in arthropods. Recently, possibilities of novel Wolbachia-mediated hosts, their distribution, and natural rate have been anticipated, and the coconut leaf beetle Brontispa longissima (Gestro) (Coleoptera: Chrysomelidae), which has garnered attention as a serious pest of palms, was subjected to this interrogation. By adopting Wolbachia surface protein (wsp) and multilocus sequence type (MLST) genotypic systems, we determined the Wolbachia infection density within host developmental stages, body parts, and tissues, and the results revealed that all the tested samples of B. longissima were infected with the same Wolbachia strain (wLog), suggesting complete vertical transmission. The MLST profile elucidated two new alleles (ftsZ-234 and coxA-266) that define a new sequence type (ST-483), which indicates the particular genotypic association of B. longissima and Wolbachia. The quantitative real-time PCR analysis revealed a higher infection density in the eggs and adult stage, followed by the abdomen and reproductive tissues, respectively. However, no significant differences were observed in the infection density between sexes. Moreover, the wsp and concatenated MLST alignment analysis of this study with other known Wolbachia-mediated arthropods revealed similar clustering with distinct monophyletic supergroup B. This is the first comprehensive report on the prevalence, infection dynamics, and phylogeny of the Wolbachia endosymbiont in B. longissima, which demonstrated that Wolbachia is ubiquitous across all developmental stages and distributed in the entire body of B. longissima. Understanding the Wolbachia infection dynamics would provide useful insight to build a framework for future investigations, understand its impacts on host physiology, and exploit it as a potential biocontrol agent.

  4. Genome-wide association study of swine farrowing traits. Part II: Bayesian analysis of marker data

    USDA-ARS's Scientific Manuscript database

    Reproductive efficiency has a great impact on the economic success of pork production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome wide association study (GWAS) w...

  5. Population structure of an endemic vulnerable species, the Jamaican boa (Epicrates subflavus).

    PubMed

    Tzika, Athanasia C; Koenig, Susan; Miller, Ricardo; Garcia, Gerardo; Remy, Christophe; Milinkovitch, Michel C

    2008-01-01

    The Jamaican boa (Epicrates subflavus; also called Yellow boa) is an endemic species whose natural populations greatly and constantly declined since the late 19th century, mainly because of predation by introduced species, human persecution, and habitat destruction. In-situ conservation of the Jamaican boa is seriously hindered by the lack of information on demographic and ecological parameters as well as by a poor understanding of the population structure and species distribution in the wild. Here, using nine nuclear microsatellite loci and a fragment of the mitochondrial cytochrome b gene from 87 wild-born individuals, we present the first molecular genetic analyses focusing on the diversity and structure of the natural populations of the Jamaican boa. A model-based clustering analysis of multilocus microsatellite genotypes identifies three groups that are also significantly differentiated on the basis of F-statistics. Similarly, haplotypic network reconstruction methods applied on the cytochrome b haplotypes isolated here identify two well-differentiated haplogroups separated by four to six fixed mutations. Bayesian and metaGA analyses of the mitochondrial data set combined with sequences from other Boidae species indicate that rooting of the haplotypic network occurs most likely between the two defined haplogroups. Both analyses (based on nuclear and mitochondrial markers) underline an Eastern vs. (Western + Central) pattern of differentiation in agreement with geological data and patterns of differentiation uncovered in other vertebrate and invertebrate Jamaican species. Our results provide important insights for improving management of ex-situ captive populations and for guiding the development of proper in-situ species survival and habitat management plans for this spectacular, yet poorly known and vulnerable, snake.

  6. Divergence history of the Carpathian and smooth newts modelled in space and time.

    PubMed

    Zieliński, P; Nadachowska-Brzyska, K; Dudek, K; Babik, W

    2016-08-01

    Information about demographic history is essential for the understanding of the processes of divergence and speciation. Patterns of genetic variation within and between closely related species provide insights into the history of their interactions. Here, we investigated historical demography and genetic exchange between the Carpathian (Lissotriton montandoni, Lm) and smooth (L. vulgaris, Lv) newts. We combine an extensive geographical sampling and multilocus nuclear sequence data with the approximate Bayesian computation framework to test alternative scenarios of divergence and reconstruct the temporal and spatial pattern of gene flow between species. A model of recent (last glacial period) interspecific gene flow was favoured over alternative models. Thus, despite the relatively old divergence (4-6 mya) and presumably long periods of isolation, the species have retained the ability to exchange genes. Nevertheless, the low migration rates (ca. 10⁻⁶ per gene copy per generation) are consistent with strong reproductive isolation between the species. Models allowing demographic changes were favoured, suggesting that the effective population sizes of both species at least doubled since divergence, reaching the current ca. 0.2 million in Lm and 1 million in Lv. We found asymmetry in rates of interspecific gene flow between Lm and one evolutionary lineage of Lv. We suggest that intraspecific polymorphism for hybrid incompatibilities segregating within Lv could explain this pattern and propose further tests to distinguish between alternative explanations. Our study highlights the importance of incorporating intraspecific genetic structure into the models investigating the history of divergence. © 2016 John Wiley & Sons Ltd.

  7. Patterns of population structure and dispersal in the long-lived "redwood" of the coral reef, the giant barrel sponge (Xestospongia muta)

    NASA Astrophysics Data System (ADS)

    Richards, Vincent P.; Bernard, Andrea M.; Feldheim, Kevin A.; Shivji, Mahmood S.

    2016-09-01

    Sponges are one of the dominant fauna on Florida and Caribbean coral reefs, with species diversity often exceeding that of scleractinian corals. Despite the key role of sponges as structural components, habitat providers, and nutrient recyclers in reef ecosystems, their dispersal dynamics are little understood. We used ten microsatellite markers to study the population structure and dispersal patterns of a prominent reef species, the giant barrel sponge (Xestospongia muta), the long-lived "redwood" of the reef, throughout Florida and the Caribbean. F-statistics, exact tests of population differentiation, and Bayesian multi-locus genotype analyses revealed high levels of overall genetic partitioning (F_ST = 0.12, P = 0.001) and grouped 363 individuals collected from the Bahamas, Honduras, US Virgin Islands, Key Largo (Florida), and the remainder of the Florida reef tract into at minimum five genetic clusters (K = 5). Exact tests, however, revealed further differentiation, grouping sponges sampled from five locations across the Florida reef tract (~250 km) into three populations, suggesting a total of six genetic populations across the eight locations sampled. Assignment tests showed dispersal over ecological timescales to be limited to relatively short distances, as the only migration detected among populations was within the Florida reef tract. Consequently, populations of this major coral reef benthic constituent appear largely self-recruiting. A combination of levels of genetic differentiation, genetic distance, and assignment tests support the important role of the Caribbean and Florida currents in shaping patterns of contemporary and historical gene flow in this widespread coral reef species.
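
    For readers unfamiliar with the F_ST statistic quoted in this abstract, the underlying idea can be sketched for the simplest case. This is an illustration only: it uses Wright's definition F_ST = (H_T - H_S) / H_T at a single biallelic locus with known allele frequencies, whereas the study used ten multi-allelic microsatellite markers and an estimator the abstract does not name. The function name and inputs are hypothetical.

```python
def fst_biallelic(freqs, sizes=None):
    """Wright's F_ST = (H_T - H_S) / H_T for one biallelic locus.

    freqs: frequency of one allele in each subpopulation.
    sizes: optional subpopulation sizes used as weights (equal if omitted).
    """
    if sizes is None:
        sizes = [1.0] * len(freqs)
    total = float(sum(sizes))
    weights = [s / total for s in sizes]
    # Expected heterozygosity of the pooled population.
    p_bar = sum(w * p for w, p in zip(weights, freqs))
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    # Mean expected heterozygosity within subpopulations.
    h_s = sum(w * 2.0 * p * (1.0 - p) for w, p in zip(weights, freqs))
    if h_t == 0.0:
        return 0.0  # locus is monomorphic overall; no differentiation
    return (h_t - h_s) / h_t
```

    Identical frequencies in every subpopulation give 0, and subpopulations fixed for different alleles give 1; intermediate values reflect partial genetic partitioning.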

  8. Analyses of mitochondrial amino acid sequence datasets support the proposal that specimens of Hypodontus macropi from three species of macropodid hosts represent distinct species

    PubMed Central

    2013-01-01

    Background: Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be a complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results: The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions: The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823

  9. Population structure and reticulate evolution of Saccharomyces eubayanus and its lager-brewing hybrids.

    PubMed

    Peris, David; Sylvester, Kayla; Libkind, Diego; Gonçalves, Paula; Sampaio, José Paulo; Alexander, William G; Hittinger, Chris Todd

    2014-04-01

    Reticulate evolution can be a major driver of diversification into new niches, especially in disturbed habitats and at the edges of ranges. Industrial fermentation strains of yeast provide a window into these processes, but progress has been hampered by a limited understanding of the natural diversity and distribution of Saccharomyces species and populations. For example, lager beer is brewed with Saccharomyces pastorianus, an alloploid hybrid of S. cerevisiae and S. eubayanus, a species only recently discovered in Patagonia, Argentina. Here, we report that genetically diverse strains of S. eubayanus are readily isolated from Patagonia, demonstrating that the species is well established there. Analyses of multilocus sequence data strongly suggest that there are two diverse and highly differentiated Patagonian populations. The low nucleotide diversity found in the S. eubayanus moiety of hybrid European brewing strains suggests that their alleles were drawn from a small subpopulation that is closely related to one of the Patagonian populations. For the first time, we also report the rare isolation of S. eubayanus outside Patagonia, in Wisconsin, USA. In contrast to the clear population differentiation in Patagonia, the North American strains represent a recent and possibly transient admixture of the two Patagonian populations. These complex and varied reticulation events are not adequately captured by conventional phylogenetic methods and required analyses of Bayesian concordance factors and phylogenetic networks to accurately summarize and interpret. These findings show how genetically diverse eukaryotic microbes can produce rare but economically important hybrids with low genetic diversity when they migrate from their natural ecological context. © 2014 John Wiley & Sons Ltd.

  10. Genetic identification of Theobroma cacao L. trees with high Criollo ancestry in Soconusco, Chiapas, Mexico.

    PubMed

    Vázquez-Ovando, J A; Molina-Freaner, F; Nuñez-Farfán, J; Ovando-Medina, I; Salvador-Figueroa, M

    2014-12-12

    Criollo-type cacao trees are an important pool of genes with potential to be used in cacao breeding and selection programs. For that reason, we assessed the diversity and population structure of Criollo-type trees (108 cultivars with Criollo phenotypic characteristics and 10 Criollo references) using 12 simple sequence repeat (SSR) markers. Cultivars were selected from 7 demes in the Soconusco region of southern Mexico. SSRs amplified 74 alleles with an average of 3.6 alleles per population. The overall populations showed an average observed heterozygosity of 0.28, indicating heterozygote deficiency (average fixation index F = 0.50). However, moderate allelic diversity was found within populations (Shannon index for all populations I = 0.97). Bayesian method analysis determined 2 genetic clusters (K = 2) within individuals. In concordance, an assignment test grouped 37 multilocus genotypes (including 10 references) into a first cluster (Criollo), 54 into a second (presumably Amelonado), and 27 admixed individuals unassigned at the 90% threshold likely corresponding to the Trinitario genotype. This classification was supported by the principal coordinate analysis and analysis of molecular variance, which showed 12% of variation among populations (FST = 0.123, P < 0.0001). Sampled deme sites (1-7) in the Soconusco region did not show any evidence of clustering by geographic location, and this was supported by the Mantel test (Rxy = 0.54, P = 0.120). Individuals with high Criollo lineage planted in Soconusco farms could be an important reservoir of genes for future breeding programs searching for fine taste, flavor, and aroma cocoa.
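
    The fixation index quoted in this abstract relates observed to expected heterozygosity as F = 1 - Ho/He. The sketch below illustrates that relationship only. The function names are hypothetical, the allele frequencies are made up, and the expected heterozygosity of 0.56 is an assumed value chosen so that Ho = 0.28 gives F = 0.50; the abstract does not report He.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity under Hardy-Weinberg: He = 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in allele_freqs)


def fixation_index(h_obs, h_exp):
    """Fixation index F = 1 - Ho/He; F > 0 indicates heterozygote deficiency."""
    if h_exp <= 0:
        raise ValueError("expected heterozygosity must be positive")
    return 1.0 - h_obs / h_exp


# Illustrative (made-up) allele frequencies at a single locus.
example_freqs = [0.6, 0.3, 0.1]
he_example = expected_heterozygosity(example_freqs)  # approximately 0.54

# Ho = 0.28 is from the abstract; He = 0.56 is an assumption (see lead-in).
f_example = fixation_index(0.28, 0.56)
```

    In practice F is usually estimated per locus and then averaged, so an average F need not equal the value computed from average heterozygosities.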

  11. Individual-based assessment of population structure and admixture in Austrian, Croatian and German draught horses.

    PubMed

    Druml, T; Curik, I; Baumung, R; Aberle, K; Distl, O; Sölkner, J

    2007-02-01

    All over Europe, the number of draught horses has decreased drastically during the last 50 years. As a prerequisite for efficient management decisions, we analysed the conservation status in Austrian (Noriker Carinthia - NC, Noriker Salzburg - NS), Croatian (Croatian Coldblood - C, Posavina horse - P) and German (Altmaerkisch Coldblood - A, Black Forest horse - BF, Mecklenburg Coldblood - M, Rhenish German Draught horse - R, Saxon Thuringa Coldblood - ST, Schleswig Draught horse - Sch, South German Coldblood - SG) draught horses (434) using multilocus genotypic information from 30 (effectively 27) microsatellite loci. Populations located in areas with less intensive agricultural production (C, NC, NS, P and SG) had greater diversity within the population and estimated effective population size than A, BF, Sch, M, R and ST populations. The PCA plots revealed that populations form five separate groups. The 'Noriker' group (NC, NS and SG) and the 'Rhenish' group (A, M, R and ST) were the most distinctive (pairwise F(ST) values ranged from 0.078 to 0.094). The 'Croatian' group (C and P) was in the centre, while the BF and Sch populations formed two out-groups. A posterior Bayesian analysis detected further differentiation, mainly caused by political and geographical factors. Thus, it was possible to separate the South German Coldblood from the Austrian Noriker population where no subpopulation structure was detected. The admixture analysis revealed imprecise classification between C and P populations. A small but notable separation of R from A, M and ST populations was detected, while Sch and BF populations remained as out-groups. The information obtained should aid in making efficient conservation decisions.

  12. A Spatial Statistical Model for Landscape Genetics

    PubMed Central

    Guillot, Gilles; Estoup, Arnaud; Mortier, Frédéric; Cosson, Jean François

    2005-01-01

    Landscape genetics is a new discipline that aims to provide information on how landscape and environmental features influence population genetic structure. The first key step of landscape genetics is the spatial detection and location of genetic discontinuities between populations. However, efficient methods for achieving this task are lacking. In this article, we first clarify what is conceptually involved in the spatial modeling of genetic data. Then we describe a Bayesian model implemented in a Markov chain Monte Carlo scheme that allows inference of the location of such genetic discontinuities from individual geo-referenced multilocus genotypes, without a priori knowledge on populational units and limits. In this method, the global set of sampled individuals is modeled as a spatial mixture of panmictic populations, and the spatial organization of populations is modeled through the colored Voronoi tessellation. In addition to spatially locating genetic discontinuities, the method quantifies the amount of spatial dependence in the data set, estimates the number of populations in the studied area, assigns individuals to their population of origin, and detects individual migrants between populations, while taking into account uncertainty on the location of sampled individuals. The performance of the method is evaluated through the analysis of simulated data sets. Results show good performances for standard data sets (e.g., 100 individuals genotyped at 10 loci with 10 alleles per locus), with high but also low levels of population differentiation (e.g., FST < 0.05). The method is then applied to a set of 88 individuals of wolverines (Gulo gulo) sampled in the northwestern United States and genotyped at 10 microsatellites. PMID:15520263
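
    The colored Voronoi tessellation mentioned in this abstract can be pictured as a set of nuclei, each carrying a population label ("color"); every sampled individual takes the label of its nearest nucleus. The sketch below shows only that assignment step for fixed nuclei. It is not the authors' MCMC scheme, which also infers the nuclei, their colors, and the allele frequencies. All names and coordinates are hypothetical.

```python
import math


def assign_by_colored_voronoi(coords, nuclei, colors):
    """Label each individual with the color of its nearest Voronoi nucleus.

    coords: list of (x, y) positions of sampled individuals
    nuclei: list of (x, y) positions of tessellation nuclei
    colors: list of population labels, one per nucleus
    """
    labels = []
    for x, y in coords:
        nearest = min(
            range(len(nuclei)),
            key=lambda k: math.hypot(x - nuclei[k][0], y - nuclei[k][1]),
        )
        labels.append(colors[nearest])
    return labels


# Hypothetical example: three nuclei, two of which share population 0.
nuclei = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
colors = [0, 1, 0]
individuals = [(0.1, 0.1), (0.9, 0.2), (0.2, 0.8)]
labels = assign_by_colored_voronoi(individuals, nuclei, colors)
```

    Because several nuclei may share a color, a single population can occupy an irregular, even non-convex, region of the map.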

  13. Analysis of Multilocus Sequence Typing and Virulence Characterization of Listeria monocytogenes Isolates from Chinese Retail Ready-to-Eat Food

    PubMed Central

    Wu, Shi; Wu, Qingping; Zhang, Jumei; Chen, Moutong; Guo, Weipeng

    2016-01-01

    Eighty Listeria monocytogenes isolates were obtained from Chinese retail ready-to-eat (RTE) food and were previously characterized with serotyping and antibiotic susceptibility tests. The aim of this study was to characterize the subtype and virulence potential of these L. monocytogenes isolates by multilocus sequence typing (MLST), virulence-associated genes, epidemic clones (ECs), and sequence analysis of the important virulence factor internalin A (inlA). The result of MLST revealed that these L. monocytogenes isolates belonged to 14 different sequence types (STs). With the exception of four new STs (ST804, ST805, ST806, and ST807), all other STs observed in this study have been associated with human listeriosis and outbreaks to varying extents. Six virulence-associated genes (inlA, inlB, inlC, inlJ, hly, and llsX) were selected and their presence was investigated using PCR. All strains carried inlA, inlB, inlC, inlJ, and hly, whereas 38.8% (31/80) of strains harbored the listeriolysin S genes (llsX). A multiplex PCR assay was used to evaluate the presence of markers specific to epidemic clones of L. monocytogenes and identified 26.3% (21/80) of ECI in the 4b-4d-4e strains. Further study of inlA sequencing revealed that most strains contained the full-length InlA required for host cell invasion, whereas three mutations led to premature stop codons (PMSCs), including a novel PMSC at position 326 (GAA → TAA). MLST and inlA sequence analysis results were concordant, and different virulence potentials within isolates were observed. These findings suggest that L. monocytogenes isolates from RTE food in China could be virulent and be capable of causing human illness. Furthermore, the STs and virulence profiles of L. monocytogenes isolates have significant implications for epidemiological and public health studies of this pathogen. PMID:26909076

  14. Multilocus Sequence Typing and Virulence-Associated Gene Profile Analysis of Staphylococcus aureus Isolates From Retail Ready-to-Eat Food in China.

    PubMed

    Yang, Xiaojuan; Yu, Shubo; Wu, Qingping; Zhang, Jumei; Wu, Shi; Rong, Dongli

    2018-01-01

    The aim of this study was to characterize the subtypes and virulence profiles of 69 Staphylococcus aureus isolates obtained from retail ready-to-eat food in China. The isolates were analyzed using multilocus sequence typing (MLST) and polymerase chain reaction (PCR) analysis of important virulence factor genes, including the staphylococcal enterotoxin (SE) genes (sea, seb, sec, sed, see, seg, seh, sei, sej), the exfoliative toxin genes (eta and etb), the toxic shock syndrome toxin-1 gene (tst), and the Panton-Valentine leucocidin-encoding gene (pvl). The isolates encompassed 26 different sequence types (STs), including four new STs (ST3482, ST3484, ST3485, ST3504), clustered in three clonal complexes and 17 singletons. The most prevalent STs were ST1, ST6, and ST15, constituting 34.8% of all isolates. Most STs (15/26, 57.7%) detected have previously been associated with human infections. All 13 toxin genes examined were detected in the S. aureus isolates, with 84.1% of isolates containing toxin genes. The three most prevalent toxin genes were seb (36.2%), sea (33.3%), and seg (33.3%). The classical SE genes (sea-see), which contribute significantly to staphylococcal food poisoning (SFP), were detected in 72.5% of the S. aureus isolates. In addition, pvl, eta, etb, and tst were found in 11.6, 10.1, 10.1, and 7.2% of the S. aureus isolates, respectively. Strains ST6 carrying sea and ST1 harboring sec-seh enterotoxin profile, which are the two most common clones associated with SFP, were also frequently detected in the food samples in this study. This study indicates that these S. aureus isolates present in Chinese ready-to-eat food represent a potential public health risk. These data are valuable for epidemiological studies, risk management, and public health strategies.

  15. Analysis of Multilocus Sequence Typing and Virulence Characterization of Listeria monocytogenes Isolates from Chinese Retail Ready-to-Eat Food.

    PubMed

    Wu, Shi; Wu, Qingping; Zhang, Jumei; Chen, Moutong; Guo, Weipeng

    2016-01-01

    Eighty Listeria monocytogenes isolates were obtained from Chinese retail ready-to-eat (RTE) food and were previously characterized with serotyping and antibiotic susceptibility tests. The aim of this study was to characterize the subtype and virulence potential of these L. monocytogenes isolates by multilocus sequence typing (MLST), virulence-associated genes, epidemic clones (ECs), and sequence analysis of the important virulence factor internalin A (inlA). The result of MLST revealed that these L. monocytogenes isolates belonged to 14 different sequence types (STs). With the exception of four new STs (ST804, ST805, ST806, and ST807), all other STs observed in this study have been associated with human listeriosis and outbreaks to varying extents. Six virulence-associated genes (inlA, inlB, inlC, inlJ, hly, and llsX) were selected and their presence was investigated using PCR. All strains carried inlA, inlB, inlC, inlJ, and hly, whereas 38.8% (31/80) of strains harbored the listeriolysin S genes (llsX). A multiplex PCR assay was used to evaluate the presence of markers specific to epidemic clones of L. monocytogenes and identified 26.3% (21/80) of ECI in the 4b-4d-4e strains. Further study of inlA sequencing revealed that most strains contained the full-length InlA required for host cell invasion, whereas three mutations led to premature stop codons (PMSCs), including a novel PMSC at position 326 (GAA → TAA). MLST and inlA sequence analysis results were concordant, and different virulence potentials within isolates were observed. These findings suggest that L. monocytogenes isolates from RTE food in China could be virulent and be capable of causing human illness. Furthermore, the STs and virulence profiles of L. monocytogenes isolates have significant implications for epidemiological and public health studies of this pathogen.

  16. Multilocus Sequence Typing and Virulence-Associated Gene Profile Analysis of Staphylococcus aureus Isolates From Retail Ready-to-Eat Food in China

    PubMed Central

    Yang, Xiaojuan; Yu, Shubo; Wu, Qingping; Zhang, Jumei; Wu, Shi; Rong, Dongli

    2018-01-01

    The aim of this study was to characterize the subtypes and virulence profiles of 69 Staphylococcus aureus isolates obtained from retail ready-to-eat food in China. The isolates were analyzed using multilocus sequence typing (MLST) and polymerase chain reaction (PCR) analysis of important virulence factor genes, including the staphylococcal enterotoxin (SE) genes (sea, seb, sec, sed, see, seg, seh, sei, sej), the exfoliative toxin genes (eta and etb), the toxic shock syndrome toxin-1 gene (tst), and the Panton-Valentine leucocidin-encoding gene (pvl). The isolates encompassed 26 different sequence types (STs), including four new STs (ST3482, ST3484, ST3485, ST3504), clustered in three clonal complexes and 17 singletons. The most prevalent STs were ST1, ST6, and ST15, constituting 34.8% of all isolates. Most STs (15/26, 57.7%) detected have previously been associated with human infections. All 13 toxin genes examined were detected in the S. aureus isolates, with 84.1% of isolates containing toxin genes. The three most prevalent toxin genes were seb (36.2%), sea (33.3%), and seg (33.3%). The classical SE genes (sea–see), which contribute significantly to staphylococcal food poisoning (SFP), were detected in 72.5% of the S. aureus isolates. In addition, pvl, eta, etb, and tst were found in 11.6, 10.1, 10.1, and 7.2% of the S. aureus isolates, respectively. Strains ST6 carrying sea and ST1 harboring sec-seh enterotoxin profile, which are the two most common clones associated with SFP, were also frequently detected in the food samples in this study. This study indicates that these S. aureus isolates present in Chinese ready-to-eat food represent a potential public health risk. These data are valuable for epidemiological studies, risk management, and public health strategies. PMID:29662467

  17. How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?

    PubMed

    Veturi, Yogasudha; Ritchie, Marylyn D

    2018-01-01

    Transcriptome-wide association studies (TWAS) have recently been employed as an approach that can draw upon the advantages of genome-wide association studies (GWAS) and gene expression studies to identify genes associated with complex traits. Unlike standard GWAS, summary level data suffices for TWAS and offers improved statistical power. Two popular TWAS methods include either (a) imputing the cis genetic component of gene expression from smaller sized studies (using multi-SNP prediction or MP) into much larger effective sample sizes afforded by GWAS - TWAS-MP or (b) using summary-based Mendelian randomization - TWAS-SMR. Although these methods have been effective at detecting functional variants, it remains unclear how extensive variability in the genetic architecture of complex traits and diseases impacts TWAS results. Our goal was to investigate the different scenarios under which these methods yielded enough power to detect significant expression-trait associations. In this study, we conducted extensive simulations based on 6000 randomly chosen, unrelated Caucasian males from Geisinger's MyCode population to compare the power to detect cis expression-trait associations (within 500 kb of a gene) using the above-described approaches. To test TWAS across varying genetic backgrounds we simulated gene expression and phenotype using different quantitative trait loci per gene and cis-expression /trait heritability under genetic models that differentiate the effect of causality from that of pleiotropy. For each gene, on a training set ranging from 100 to 1000 individuals, we either (a) estimated regression coefficients with gene expression as the response using five different methods: LASSO, elastic net, Bayesian LASSO, Bayesian spike-slab, and Bayesian ridge regression or (b) performed eQTL analysis. 
    We then sampled with replacement 50,000, 150,000, and 300,000 individuals respectively from the testing set of the remaining 5000 individuals and conducted GWAS on each set. Subsequently, we integrated the GWAS summary statistics derived from the testing set with the weights (or eQTLs) derived from the training set to identify expression-trait associations using (a) TWAS-MP (b) TWAS-SMR (c) eQTL-based GWAS, or (d) standalone GWAS. Finally, we examined the power to detect functionally relevant genes using the different approaches under the considered simulation scenarios. In general, we observed great similarities among TWAS-MP methods although the Bayesian methods resulted in improved power in comparison to LASSO and elastic net as the trait architecture grew more complex while training sample sizes and expression heritability remained small. Finally, we observed high power under causality but very low to moderate power under pleiotropy.

  18. BANYAN. XI. The BANYAN Σ Multivariate Bayesian Algorithm to Identify Members of Young Associations within 150 pc

    NASA Astrophysics Data System (ADS)

    Gagné, Jonathan; Mamajek, Eric E.; Malo, Lison; Riedel, Adric; Rodriguez, David; Lafrenière, David; Faherty, Jacqueline K.; Roy-Loubier, Olivier; Pueyo, Laurent; Robin, Annie C.; Doyon, René

    2018-03-01

    BANYAN Σ is a new Bayesian algorithm to identify members of young stellar associations within 150 pc of the Sun. It includes 27 young associations with ages in the range ∼1–800 Myr, modeled with multivariate Gaussians in six-dimensional (6D) XYZUVW space. It is the first such multi-association classification tool to include the nearest sub-groups of the Sco-Cen OB star-forming region, the IC 2602, IC 2391, Pleiades and Platais 8 clusters, and the ρ Ophiuchi, Corona Australis, and Taurus star formation regions. A model of field stars is built from a mixture of multivariate Gaussians based on the Besançon Galactic model. The algorithm can derive membership probabilities for objects with only sky coordinates and proper motion, but can also include parallax and radial velocity measurements, as well as spectrophotometric distance constraints from sequences in color–magnitude or spectral type–magnitude diagrams. BANYAN Σ benefits from an analytical solution to the Bayesian marginalization integrals over unknown radial velocities and distances that makes it more accurate and significantly faster than its predecessor BANYAN II. A contamination versus hit rate analysis is presented and demonstrates that BANYAN Σ achieves a better classification performance than other moving group tools available in the literature, especially in terms of cross-contamination between young associations. An updated list of bona fide members in the 27 young associations, augmented by the Gaia-DR1 release, as well as all parameters for the 6D multivariate Gaussian models for each association and the Galactic field neighborhood within 300 pc are presented. This new tool will make it possible to analyze large data sets such as the upcoming Gaia-DR2 to identify new young stars. IDL and Python versions of BANYAN Σ are made available with this publication, and a more limited online web tool is available at http://www.exoplanetes.umontreal.ca/banyan/banyansigma.php.
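
    The core idea of deriving membership probabilities from competing Gaussian models can be illustrated with Bayes' rule. The sketch below is a deliberate simplification and not the BANYAN Σ implementation: it uses Gaussians with independent axes (diagonal covariance) in two dimensions rather than full 6D covariance, and it performs no marginalization over missing radial velocity or distance. The model names, parameters, and priors are all hypothetical.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with independent axes (diagonal covariance)."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def membership_probabilities(x, models, priors):
    """Posterior probability of each hypothesis for observation x.

    models: dict mapping name -> (mean vector, per-axis variances)
    priors: dict mapping name -> prior probability
    """
    log_post = {
        name: math.log(priors[name]) + log_gaussian_diag(x, mean, var)
        for name, (mean, var) in models.items()
    }
    # Normalize in log space (log-sum-exp) to avoid numerical underflow.
    peak = max(log_post.values())
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in log_post.values()))
    return {name: math.exp(v - log_norm) for name, v in log_post.items()}


# Hypothetical models: one compact association and a broad field population.
models = {
    "association_A": ([0.0, 0.0], [1.0, 1.0]),
    "field": ([0.0, 0.0], [100.0, 100.0]),
}
priors = {"association_A": 0.01, "field": 0.99}
probs_near = membership_probabilities([0.5, -0.5], models, priors)
probs_far = membership_probabilities([20.0, 20.0], models, priors)
```

    An object near the association centroid receives a much higher association probability than a distant one, even though the prior strongly favors the field.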

  19. A hierarchical bayesian analysis of parasite prevalence and sociocultural outcomes: The role of structural racism and sanitation infrastructure.

    PubMed

    Ross, Cody T; Winterhalder, Bruce

    2016-01-01

    We conduct a revaluation of the Thornhill and Fincher research project on parasites using finely-resolved geographic data on parasite prevalence, individual-level sociocultural data, and multilevel Bayesian modeling. In contrast to the evolutionary psychological mechanisms linking parasites to human behavior and cultural characteristics proposed by Thornhill and Fincher, we offer an alternative hypothesis that structural racism and differential access to sanitation systems drive both variation in parasite prevalence and differential behaviors and cultural characteristics. We adopt a Bayesian framework to estimate parasite prevalence rates in 51 districts in eight Latin American countries using the disease status of 170,220 individuals tested for infection with the intestinal roundworm Ascaris lumbricoides (Hürlimann et al., []: PLoS Negl Trop Dis 5:e1404). We then use district-level estimates of parasite prevalence and individual-level social data from 5,558 individuals in the same 51 districts (Latinobarómetro, 2008) to assess claims of causal associations between parasite prevalence and sociocultural characteristics. We find, contrary to Thornhill and Fincher, that parasite prevalence is positively associated with preferences for democracy, negatively associated with preferences for collectivism, and not associated with violent crime rates or gender inequality. A positive association between parasite prevalence and religiosity, as in Fincher and Thornhill (: Behav Brain Sci 35:61-79), and a negative association between parasite prevalence and achieved education, as predicted by Eppig et al. (: Proc R S B: Biol Sci 277:3801-3808), become negative and unreliable when reasonable controls are included in the model. We find support for all predictions derived from our hypothesis linking structural racism to both parasite prevalence and cultural outcomes. 
    We conclude that best practices in biocultural modeling require examining more than one hypothesis, retaining individual-level data and its associated variance whenever possible, and adopting multilevel techniques suited to the structuring of the data. © 2015 Wiley Periodicals, Inc.

  20. Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.

    PubMed

    Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier

    2014-01-01

    Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, the sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) are the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene that has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationship using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.

  1. Inference on the Strength of Balancing Selection for Epistatically Interacting Loci

    PubMed Central

    Buzbas, Erkan Ozge; Joyce, Paul; Rosenberg, Noah A.

    2011-01-01

    Existing inference methods for estimating the strength of balancing selection in multi-locus genotypes rely on the assumption that there are no epistatic interactions between loci. Complex systems in which balancing selection is prevalent, such as sets of human immune system genes, are known to contain components that interact epistatically. Therefore, current methods may not produce reliable inference on the strength of selection at these loci. In this paper, we address this problem by presenting statistical methods that can account for epistatic interactions in making inference about balancing selection. A theoretical result due to Fearnhead (2006) is used to build a multi-locus Wright-Fisher model of balancing selection, allowing for epistatic interactions among loci. Antagonistic and synergistic types of interactions are examined. The joint posterior distribution of the selection and mutation parameters is sampled by Markov chain Monte Carlo methods, and the plausibility of models is assessed via Bayes factors. As a component of the inference process, an algorithm to generate multi-locus allele frequencies under balancing selection models with epistasis is also presented. Recent evidence on interactions among a set of human immune system genes is introduced as a motivating biological system for the epistatic model, and data on these genes are used to demonstrate the methods. PMID:21277883

  2. Development of a multilocus-based approach for sponge (phylum Porifera) identification: refinement and limitations.

    PubMed

    Yang, Qi; Franco, Christopher M M; Sorokin, Shirley J; Zhang, Wei

    2017-02-02

    For sponges (phylum Porifera), there is no reliable molecular protocol available for species identification. To address this gap, we developed a multilocus-based Sponge Identification Protocol (SIP) validated by a sample of 37 sponge species belonging to 10 orders from South Australia. The universal barcode COI mtDNA, 28S rRNA gene (D3-D5), and the nuclear ITS1-5.8S-ITS2 region were evaluated for their suitability and capacity for sponge identification. The highest Bit Score was applied to infer the identity. The reliability of SIP was validated by phylogenetic analysis. The 28S rRNA gene and COI mtDNA performed better than the ITS region in classifying sponges at various taxonomic levels. A major limitation is that the databases are not well populated and possess low diversity, making it difficult to conduct the molecular identification protocol. The identification is also impacted by the accuracy of the morphological classification of the sponges whose sequences have been submitted to the database. Re-examination of the morphological identification further demonstrated and improved the reliability of sponge identification by SIP. Integrated with morphological identification, the multilocus-based SIP offers an improved protocol for more reliable and effective sponge identification, by coupling the accuracy of different DNA markers.

  3. Development of a multilocus-based approach for sponge (phylum Porifera) identification: refinement and limitations

    PubMed Central

    Yang, Qi; Franco, Christopher M. M.; Sorokin, Shirley J.; Zhang, Wei

    2017-01-01

    For sponges (phylum Porifera), there is no reliable molecular protocol available for species identification. To address this gap, we developed a multilocus-based Sponge Identification Protocol (SIP) validated by a sample of 37 sponge species belonging to 10 orders from South Australia. The universal barcode COI mtDNA, 28S rRNA gene (D3–D5), and the nuclear ITS1-5.8S-ITS2 region were evaluated for their suitability and capacity for sponge identification. The highest Bit Score was applied to infer the identity. The reliability of SIP was validated by phylogenetic analysis. The 28S rRNA gene and COI mtDNA performed better than the ITS region in classifying sponges at various taxonomic levels. A major limitation is that the databases are not well populated and possess low diversity, making it difficult to conduct the molecular identification protocol. The identification is also impacted by the accuracy of the morphological classification of the sponges whose sequences have been submitted to the database. Re-examination of the morphological identification further demonstrated and improved the reliability of sponge identification by SIP. Integrated with morphological identification, the multilocus-based SIP offers an improved protocol for more reliable and effective sponge identification, by coupling the accuracy of different DNA markers. PMID:28150727

  4. Prenatal maternal depression and child serotonin transporter linked polymorphic region (5-HTTLPR) and dopamine receptor D4 (DRD4) genotype predict negative emotionality from 3 to 36 months.

    PubMed

    Green, Cathryn Gordon; Babineau, Vanessa; Jolicoeur-Martineau, Alexia; Bouvette-Turcot, Andrée-Anne; Minde, Klaus; Sassi, Roberto; St-André, Martin; Carrey, Normand; Atkinson, Leslie; Kennedy, James L; Steiner, Meir; Lydon, John; Gaudreau, Helene; Burack, Jacob A; Levitan, Robert; Meaney, Michael J; Wazana, Ashley

    2017-08-01

    Prenatal maternal depression and a multilocus genetic profile of two susceptibility genes implicated in the stress response were examined in an interaction model predicting negative emotionality in the first 3 years. In 179 mother-infant dyads from the Maternal Adversity, Vulnerability, and Neurodevelopment cohort, prenatal depression (Center for Epidemiologic Studies Depression Scale) was assessed at 24 to 36 weeks. The multilocus genetic profile score consisted of the number of susceptibility alleles from the serotonin transporter linked polymorphic region gene (5-HTTLPR): no long-rs25531(A) (LA: short/short, short/long-rs25531(G) [LG], or LG/LG) vs. any LA, and the dopamine receptor D4 gene (six to eight repeats vs. two to five repeats). Negative emotionality was extracted from the Infant Behaviour Questionnaire-Revised at 3 and 6 months and the Early Child Behavior Questionnaire at 18 and 36 months. Mixed and confirmatory regression analyses indicated that prenatal depression and the multilocus genetic profile interacted to predict negative emotionality from 3 to 36 months. The results were characterized by a differential susceptibility model at 3 and 6 months and by a diathesis-stress model at 36 months.

  5. Pestalotiopsis and allied genera from Camellia, with description of 11 new species from China.

    PubMed

    Liu, Fang; Hou, Lingwei; Raza, Mubashar; Cai, Lei

    2017-04-13

    A total of 124 Pestalotiopsis-like isolates associated with symptomatic and asymptomatic tissues of Camellia sinensis and other Camellia spp. from eight provinces in China were investigated. Based on single- and multi-locus (ITS, TEF, TUB2) phylogenies, as well as morphological characters, host associations and geographical distributions, they were classified into at least 19 species in three genera, i.e. Neopestalotiopsis, Pestalotiopsis and Pseudopestalotiopsis. Eight novel species in Pestalotiopsis and three novel species in Pseudopestalotiopsis were described. Our data suggested that the currently widely used loci in Pestalotiopsis-like genera do not consistently provide stable and sufficient resolution tree topologies, especially for Neopestalotiopsis. Moreover, the number, branch pattern and length of the conidial basal appendages were revealed to be phylogenetically informative characters in Pestalotiopsis.

  6. Cyber-T web server: differential analysis of high-throughput data.

    PubMed

    Kayala, Matthew A; Baldi, Pierre

    2012-07-01

    The Bayesian regularization method for high-throughput differential analysis, described in Baldi and Long (A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes. Bioinformatics 2001: 17: 509-519) and implemented in the Cyber-T web server, is one of the most widely validated. Cyber-T implements a t-test using a Bayesian framework to compute a regularized variance of the measurements associated with each probe under each condition. This regularized estimate is derived by flexibly combining the empirical measurements with a prior, or background, derived from pooling measurements associated with probes in the same neighborhood. This approach flexibly addresses problems associated with low replication levels and technology biases, not only for DNA microarrays, but also for other technologies, such as protein arrays, quantitative mass spectrometry and next-generation sequencing (RNA-seq). Here we present an update to the Cyber-T web server, incorporating several useful new additions and improvements. Several data-normalization options for preprocessing, including logarithmic and Variance Stabilizing Normalization (VSN) transforms, are included. To augment two-sample t-tests, a one-way analysis of variance is implemented. Several methods for multiple-testing correction, including standard frequentist methods and a probabilistic mixture model treatment, are available. Diagnostic plots allow visual assessment of the results. The web server provides comprehensive documentation and example data sets. The Cyber-T web server, with R source code and data sets, is publicly available at http://cybert.ics.uci.edu/.
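    The idea of combining each probe's empirical variance with a background variance can be sketched as follows. This is an illustration under stated assumptions, not the Cyber-T code: it assumes the regularized variance is the weighted combination (nu0 * background + (n - 1) * s^2) / (nu0 + n - 2), with nu0 acting as a pseudo-count of prior observations, and it takes the background variances as given rather than pooling neighboring probes. The function names and the default nu0 are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def regularized_variance(sample_var, n, background_var, nu0):
    """Shrink a sample variance toward a background variance.

    Assumed form: (nu0 * background + (n - 1) * s^2) / (nu0 + n - 2),
    which requires nu0 + n > 2.
    """
    return (nu0 * background_var + (n - 1) * sample_var) / (nu0 + n - 2)


def regularized_t(x, y, background_var_x, background_var_y, nu0=10):
    """Two-sample t-like statistic using regularized variances (illustrative)."""
    var_x = regularized_variance(variance(x), len(x), background_var_x, nu0)
    var_y = regularized_variance(variance(y), len(y), background_var_y, nu0)
    return (mean(x) - mean(y)) / sqrt(var_x / len(x) + var_y / len(y))


if __name__ == "__main__":
    # Hypothetical log-expression values with only three replicates per condition.
    control = [7.9, 8.1, 8.0]
    treated = [9.0, 9.4, 8.9]
    print(regularized_t(control, treated, background_var_x=0.05, background_var_y=0.05))
```

    With few replicates the background term dominates, which is what stabilizes the statistic at low replication levels; as n grows, the empirical variance takes over.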

  7. A Gaussian random field model for similarity-based smoothing in Bayesian disease mapping.

    PubMed

    Baptista, Helena; Mendes, Jorge M; MacNab, Ying C; Xavier, Miguel; Caldas-de-Almeida, José

    2016-08-01

    Conditionally specified Gaussian Markov random field (GMRF) models with adjacency-based neighbourhood weight matrix, commonly known as neighbourhood-based GMRF models, have been the mainstream approach to spatial smoothing in Bayesian disease mapping. In the present paper, we propose a conditionally specified Gaussian random field (GRF) model with a similarity-based non-spatial weight matrix to facilitate non-spatial smoothing in Bayesian disease mapping. The model, named similarity-based GRF, is motivated for modelling disease mapping data in situations where the underlying small area relative risks and the associated determinant factors do not vary systematically in space, and the similarity is defined by "similarity" with respect to the associated disease determinant factors. The neighbourhood-based GMRF and the similarity-based GRF are compared and assessed via a simulation study and by two case studies, using new data on alcohol abuse in Portugal collected by the World Mental Health Survey Initiative and the well-known lip cancer data in Scotland. In the presence of disease data with no evidence of positive spatial correlation, the simulation study showed a consistent gain in efficiency from the similarity-based GRF, compared with the adjacency-based GMRF with the determinant risk factors as covariates. This new approach broadens the scope of the existing conditional autocorrelation models. © The Author(s) 2016.

  8. Bayesian depth estimation from monocular natural images.

    PubMed

    Su, Che-Chun; Cormack, Lawrence K; Bovik, Alan C

    2017-05-01

    Estimating an accurate and naturalistic dense depth map from a single monocular photographic image is a difficult problem. Nevertheless, human observers have little difficulty understanding the depth structure implied by photographs. Two-dimensional (2D) images of the real-world environment contain significant statistical information regarding the three-dimensional (3D) structure of the world that the vision system likely exploits to compute perceived depth, monocularly as well as binocularly. Toward understanding how this might be accomplished, we propose a Bayesian model of monocular depth computation that recovers detailed 3D scene structures by extracting reliable, robust, depth-sensitive statistical features from single natural images. These features are derived using well-accepted univariate natural scene statistics (NSS) models and recent bivariate/correlation NSS models that describe the relationships between 2D photographic images and their associated depth maps. This is accomplished by building a dictionary of canonical local depth patterns from which NSS features are extracted as prior information. The dictionary is used to create a multivariate Gaussian mixture (MGM) likelihood model that associates local image features with depth patterns. A simple Bayesian predictor is then used to form spatial depth estimates. The depth results produced by the model, despite its simplicity, correlate well with ground-truth depths measured by a current-generation terrestrial light detection and ranging (LIDAR) scanner. Such a strong form of statistical depth information could be used by the visual system when creating overall estimated depth maps incorporating stereopsis, accommodation, and other conditions. Indeed, even in isolation, the Bayesian predictor delivers depth estimates that are competitive with state-of-the-art "computer vision" methods that utilize highly engineered image features and sophisticated machine learning algorithms.

  9. Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region

    PubMed Central

    Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J.

    2013-01-01

    Background Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Methodology/Principal Findings Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. Conclusions/Significance To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide. PMID:23805195

  10. Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region.

    PubMed

    Alfonso-Morales, Abdulahi; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J

    2013-01-01

    Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide.

  11. Genomic selection and complex trait prediction using a fast EM algorithm applied to genome-wide markers

    PubMed Central

    2010-01-01

    Background The information provided by dense genome-wide markers using high throughput technology is of considerable potential in human disease studies and livestock breeding programs. Genome-wide association studies relate individual single nucleotide polymorphisms (SNP) from dense SNP panels to individual measurements of complex traits, with the underlying assumption being that any association is caused by linkage disequilibrium (LD) between SNP and quantitative trait loci (QTL) affecting the trait. Often SNP are in genomic regions of no trait variation. Whole genome Bayesian models are an effective way of incorporating this and other important prior information into modelling. However a full Bayesian analysis is often not feasible due to the large computational time involved. Results This article proposes an expectation-maximization (EM) algorithm called emBayesB which allows only a proportion of SNP to be in LD with QTL and incorporates prior information about the distribution of SNP effects. The posterior probability of being in LD with at least one QTL is calculated for each SNP along with estimates of the hyperparameters for the mixture prior. A simulated example of genomic selection from an international workshop is used to demonstrate the features of the EM algorithm. The accuracy of prediction is comparable to a full Bayesian analysis but the EM algorithm is considerably faster. The EM algorithm was accurate in locating QTL which explained more than 1% of the total genetic variation. A computational algorithm for very large SNP panels is described. Conclusions emBayesB is a fast and accurate EM algorithm for implementing genomic selection and predicting complex traits by mapping QTL in genome-wide dense SNP marker data. Its accuracy is similar to Bayesian methods but it takes only a fraction of the time. PMID:20969788

  12. Uncertainty analysis for effluent trading planning using a Bayesian estimation-based simulation-optimization modeling approach.

    PubMed

    Zhang, J L; Li, Y P; Huang, G H; Baetz, B W; Liu, J

    2017-06-01

    In this study, a Bayesian estimation-based simulation-optimization modeling approach (BESMA) is developed for identifying effluent trading strategies. BESMA incorporates nutrient fate modeling with the Soil and Water Assessment Tool (SWAT), Bayesian estimation, and probabilistic-possibilistic interval programming with fuzzy random coefficients (PPI-FRC) within a general framework. Based on the water quality protocols provided by SWAT, posterior distributions of parameters can be analyzed through Bayesian estimation; the stochastic characteristics of nutrient loading can then be investigated, providing the inputs for decision making. PPI-FRC can address multiple uncertainties in the form of intervals with fuzzy random boundaries and the associated system risk through incorporating the concept of possibility and necessity measures. The possibility and necessity measures are suitable for optimistic and pessimistic decision making, respectively. BESMA is applied to a real case of effluent trading planning in the Xiangxihe watershed, China. A number of decision alternatives can be obtained under different trading ratios and treatment rates. The results not only facilitate identification of optimal effluent-trading schemes, but also provide insight into the effects of trading ratio and treatment rate on decision making. The results also reveal that the decision maker's risk preference affects the decision alternatives for the trading scheme as well as the system benefit.
    Compared with the conventional optimization methods, it is proved that BESMA is advantageous in (i) dealing with multiple uncertainties associated with randomness and fuzziness in effluent-trading planning within a multi-source, multi-reach and multi-period context; (ii) reflecting uncertainties existing in nutrient transport behaviors to improve the accuracy in water quality prediction; and (iii) supporting pessimistic and optimistic decision making for effluent trading as well as promoting diversity of decision alternatives. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Uncertainty estimation of Intensity-Duration-Frequency relationships: A regional analysis

    NASA Astrophysics Data System (ADS)

    Mélèse, Victor; Blanchet, Juliette; Molinié, Gilles

    2018-03-01

    We propose in this article a regional study of uncertainties in IDF curves derived from point-rainfall maxima. We develop two generalized extreme value models based on the simple scaling assumption, first in the frequentist framework and second in the Bayesian framework. Within the frequentist framework, uncertainties are obtained i) from the Gaussian density stemming from the asymptotic normality of the maximum likelihood estimator and ii) with a bootstrap procedure. Within the Bayesian framework, uncertainties are obtained from the posterior densities. We compare these two frameworks on the same database covering a large region of 100,000 km² in southern France with contrasting rainfall regimes, in order to draw conclusions that are not specific to the data. The two frameworks are applied to 405 hourly stations with data back to the 1980s, accumulated in the range 3 h-120 h. We show that i) the Bayesian framework is more robust than the frequentist one to the starting point of the estimation procedure, ii) the posterior and the bootstrap densities are able to better adjust uncertainty estimation to the data than the Gaussian density, and iii) the bootstrap density gives unreasonable confidence intervals, in particular for return levels associated with large return periods. We therefore recommend using the Bayesian framework to compute uncertainty.

  14. Spatial distribution of psychotic disorders in an urban area of France: an ecological study.

    PubMed

    Pignon, Baptiste; Schürhoff, Franck; Baudin, Grégoire; Ferchiou, Aziz; Richard, Jean-Romain; Saba, Ghassen; Leboyer, Marion; Kirkbride, James B; Szöke, Andrei

    2016-05-18

    Previous analyses of neighbourhood variations of non-affective psychotic disorders (NAPD) have focused mainly on incidence. However, prevalence studies provide important insights into factors associated with disease evolution as well as for healthcare resource allocation. This study aimed to investigate the distribution of prevalent NAPD cases in an urban area in France. The number of cases in each neighbourhood was modelled as a function of potential confounders and ecological variables, namely: migrant density, economic deprivation and social fragmentation. This was modelled using statistical models of increasing complexity: frequentist models (using Poisson and negative binomial regressions), and several Bayesian models. For each model, the validity of its assumptions was checked and its fit to the data was compared, in order to test for possible spatial variation in prevalence. Data showed significant overdispersion (invalidating the Poisson regression model) and residual autocorrelation (suggesting the need to use Bayesian models). The best Bayesian model was Leroux's model (i.e. a model with both strong correlation between neighbouring areas and weaker correlation between areas further apart), with economic deprivation as an explanatory variable (OR = 1.13, 95% CI [1.02-1.25]). In comparison with frequentist methods, the Bayesian model showed a better fit. The number of cases showed non-random spatial distribution and was linked to economic deprivation.
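    Why overdispersion invalidates a Poisson model can be shown with a small sketch. A Poisson variable has variance equal to its mean, so a variance-to-mean ratio well above 1 signals overdispersion. The check below is the simplest, covariate-free version and is only an assumption about how such a diagnostic might look; the study's own check would have been made conditional on its regression covariates. The counts are invented.

```python
from statistics import mean, variance


def dispersion_ratio(counts):
    """Sample variance divided by sample mean; about 1 is expected under a Poisson model."""
    return variance(counts) / mean(counts)


if __name__ == "__main__":
    # Hypothetical case counts per neighbourhood (not data from the study).
    even_counts = [3, 4, 2, 3, 4, 3, 2, 3]
    clustered_counts = [0, 0, 1, 0, 12, 0, 9, 2]
    print(dispersion_ratio(even_counts))       # well below 1
    print(dispersion_ratio(clustered_counts))  # far above 1: overdispersed
```

    A ratio far above 1 is the usual motivation for moving to a negative binomial model or to a model with area-level random effects, as the abstract describes.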

  15. New Insights on Taxonomy, Phylogeny and Population Genetics of Leishmania (Viannia) Parasites Based on Multilocus Sequence Analysis

    PubMed Central

    Boité, Mariana C.; Mauricio, Isabel L.; Miles, Michael A.; Cupolillo, Elisa

    2012-01-01

    The Leishmania genus comprises up to 35 species, some with status still under discussion. The multilocus sequence typing (MLST)—extensively used for bacteria—has been proposed for pathogenic trypanosomatids. For Leishmania, however, a detailed analysis and revision on the taxonomy is still required. We have partially sequenced four housekeeping genes—glucose-6-phosphate dehydrogenase (G6PD), 6-phosphogluconate dehydrogenase (6PGD), mannose phosphate isomerase (MPI) and isocitrate dehydrogenase (ICD)—from 96 Leishmania (Viannia) strains and assessed their discriminatory typing capacity. The fragments had different degrees of diversity, and are thus suitable to be used in combination for intra- and inter-specific inferences. Species-specific single nucleotide polymorphisms were detected, but not for all species; ambiguous sites indicating heterozygosis were observed, as well as the putative homozygous donor. A large number of haplotypes were detected for each marker; for 6PGD a possible ancestral allele for L. (Viannia) was found. Maximum parsimony-based haplotype networks were built. Strains of different species, as identified by multilocus enzyme electrophoresis (MLEE), formed separated clusters in each network, with exceptions. NeighborNet of concatenated sequences confirmed species-specific clusters, suggesting recombination occurring in L. braziliensis and L. guyanensis. Phylogenetic analysis indicates L. lainsoni and L. naiffi as the most divergent species and does not support L. shawi as a distinct species, placing it in the L. guyanensis cluster. BURST analysis resulted in six clonal complexes (CC), corresponding to distinct species. The L. braziliensis strains evaluated correspond to one widely geographically distributed CC and another restricted to one endemic area. 
    This study demonstrates the value of systematic multilocus sequence analysis (MLSA) for determining intra- and inter-species relationships and presents an approach to validate the species status of some entities. Furthermore, it contributes to the phylogeny of L. (Viannia) and might be helpful for epidemiological and population genetics analysis based on haplotype/diplotype determinations and inferences. PMID:23133690

  16. Kakusan4 and Aminosan: two programs for comparing nonpartitioned, proportional and separate models for combined molecular phylogenetic analyses of multilocus sequence data.

    PubMed

    Tanabe, Akifumi S

    2011-09-01

    Proportional and separate models, which can apply a different combination of substitution rate matrix (SRM) and among-site rate variation model (ASRVM) to each locus, are frequently used in phylogenetic studies of multilocus data. A proportional model assumes that branch lengths are proportional among partitions and a separate model assumes that each partition has an independent set of branch lengths. However, the selection from among nonpartitioned (i.e., a common combination of models is applied to all-loci concatenated sequences), proportional and separate models is usually based on the researcher's preference rather than on any information criteria. This study describes two programs, 'Kakusan4' (for DNA sequences) and 'Aminosan' (for amino-acid sequences), which allow the selection of evolutionary models based on several types of information criteria. The programs can handle both multilocus and single-locus data, in addition to providing an easy-to-use wizard interface and a noninteractive command line interface. In the case of multilocus data, SRMs and ASRVMs are compared at each locus and at all-loci concatenated sequences, after which nonpartitioned, proportional and separate models are compared based on information criteria. The programs also provide model configuration files for MrBayes, PAUP*, PhyML, RAxML and Treefinder to support further phylogenetic analysis using a selected model. When likelihoods were optimized by Treefinder, the best-fit models were found to differ depending on the data set. Furthermore, differences in the information criteria among nonpartitioned, proportional and separate models were much larger than those among the nonpartitioned models. These findings suggest that selecting from nonpartitioned, proportional and separate models results in a better phylogenetic tree. Kakusan4 and Aminosan are available at http://www.fifthdimension.jp/. They are licensed under the GNU GPL Ver. 2, and are able to run on Windows, Mac OS X and Linux.
    © 2011 Blackwell Publishing Ltd.
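    Comparing nonpartitioned, proportional and separate models by information criteria can be sketched as below. This is not Kakusan4 or Aminosan code: the abstract does not say which criteria the programs use, so AIC, AICc and BIC are shown as common examples, the log-likelihoods and parameter counts are invented, and taking the number of alignment sites as the sample size is itself an assumption.

```python
from math import log


def aic(log_likelihood, k):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2 * k - 2 * log_likelihood


def aicc(log_likelihood, k, n):
    """AIC with the small-sample correction (requires n > k + 1)."""
    return aic(log_likelihood, k) + 2 * k * (k + 1) / (n - k - 1)


def bic(log_likelihood, k, n):
    """Bayesian information criterion: k ln n - 2 ln L."""
    return k * log(n) - 2 * log_likelihood


# Hypothetical fits: (maximized log-likelihood, number of free parameters).
MODELS = {
    "nonpartitioned": (-5230.4, 10),
    "proportional": (-5180.9, 22),
    "separate": (-5171.3, 40),
}
N_SITES = 1500  # assumed sample size: number of aligned sites


def best_model(models, score):
    """Return the model name with the lowest score; score takes (lnL, k)."""
    return min(models, key=lambda name: score(*models[name]))


if __name__ == "__main__":
    print(best_model(MODELS, aic))
    print(best_model(MODELS, lambda lnl, k: bic(lnl, k, N_SITES)))
```

    In this made-up example the separate model has the highest likelihood but loses to the proportional model once its extra branch-length parameters are penalized, which is the kind of trade-off the criteria are meant to expose.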

  17. Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis

    PubMed Central

    Sun, Jiufeng; Huang, Yan; Huang, Huaiqiu; Liang, Pei; Wang, Xiaoyun; Mao, Qiang; Men, Jingtao; Chen, Wenjun; Deng, Chuanhuan; Zhou, Chenhui; Lv, Xiaoli; Zhou, Juanjuan; Zhang, Fan; Li, Ran; Tian, Yanli; Lei, Huali; Liang, Chi; Hu, Xuchu; Xu, Jin; Li, Xuerong; Yu, Xinbing

    2013-01-01

    Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. 
    Thus, the present study improves our understanding of the global epidemiology and evolution of C. sinensis. PMID:23825605

  18. Comparison of seven techniques for typing international epidemic strains of Clostridium difficile: restriction endonuclease analysis, pulsed-field gel electrophoresis, PCR-ribotyping, multilocus sequence typing, multilocus variable-number tandem-repeat analysis, amplified fragment length polymorphism, and surface layer protein A gene sequence typing.

    PubMed

    Killgore, George; Thompson, Angela; Johnson, Stuart; Brazier, Jon; Kuijper, Ed; Pepin, Jacques; Frost, Eric H; Savelkoul, Paul; Nicholson, Brad; van den Berg, Renate J; Kato, Haru; Sambol, Susan P; Zukowski, Walter; Woods, Christopher; Limbago, Brandi; Gerding, Dale N; McDonald, L Clifford

    2008-02-01

    Using 42 isolates contributed by laboratories in Canada, The Netherlands, the United Kingdom, and the United States, we compared the results of analyses done with seven Clostridium difficile typing techniques: multilocus variable-number tandem-repeat analysis (MLVA), amplified fragment length polymorphism (AFLP), surface layer protein A gene sequence typing (slpAST), PCR-ribotyping, restriction endonuclease analysis (REA), multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). We assessed the discriminating ability and typeability of each technique as well as the agreement among techniques in grouping isolates by allele profile A (AP-A) through AP-F, which are defined by toxinotype, the presence of the binary toxin gene, and deletion in the tcdC gene. We found that all isolates were typeable by all techniques and that discrimination index scores for the techniques tested ranged from 0.964 to 0.631 in the following order: MLVA, REA, PFGE, slpAST, PCR-ribotyping, MLST, and AFLP. All the techniques were able to distinguish the current epidemic strain of C. difficile (BI/027/NAP1) from other strains. All of the techniques showed multiple types for AP-A (toxinotype 0, binary toxin negative, and no tcdC gene deletion). REA, slpAST, MLST, and PCR-ribotyping all included AP-B (toxinotype III, binary toxin positive, and an 18-bp deletion in tcdC) in a single group that excluded other APs. PFGE, AFLP, and MLVA grouped two, one, and two different non-AP-B isolates, respectively, with their AP-B isolates. All techniques appear to be capable of detecting outbreak strains, but only REA and MLVA showed sufficient discrimination to distinguish strains from different outbreaks.
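    A discrimination index of the kind reported above can be computed as in the following sketch. The abstract does not say which index was used; the code assumes the Hunter-Gaston form of Simpson's index of diversity, D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where n_j is the number of isolates assigned to type j and N is the total number of isolates. The type labels are invented.

```python
from collections import Counter


def discrimination_index(type_labels):
    """Probability that two isolates drawn without replacement have different types."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(type_labels).values()
    return 1.0 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


if __name__ == "__main__":
    # Hypothetical typing results for the same six isolates under two methods.
    fine_method = ["t1", "t2", "t3", "t4", "t5", "t5"]
    coarse_method = ["g1", "g1", "g1", "g1", "g2", "g2"]
    print(discrimination_index(fine_method))
    print(discrimination_index(coarse_method))
```

    A method that gives nearly every isolate its own type scores close to 1, while one that lumps most isolates into a few groups scores much lower, which matches the ordering logic of the comparison in the abstract.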

  19. Genetic Diversity of Giardia duodenalis: Multilocus Genotyping Reveals Zoonotic Potential between Clinical and Environmental Sources in a Metropolitan Region of Brazil

    PubMed Central

    Durigan, Mauricio; Abreu, Aluana Gonçalves; Zucchi, Maria Imaculada; Franco, Regina Maura Bueno; de Souza, Anete Pereira

    2014-01-01

    Background Giardia duodenalis is a flagellate protozoan that parasitizes humans and several other mammals. Protozoan contamination has been regularly documented at important environmental sites, although most of these studies were performed at the species level. There is a lack of studies that correlate environmental contamination and clinical infections in the same region. The aim of this study is to evaluate the genetic diversity of a set of clinical and environmental samples and to use the obtained data to characterize the genetic profile of the distribution of G. duodenalis and the potential for zoonotic transmission in a metropolitan region of Brazil. Methodology/Principal Findings The genetic assemblages and subtypes of G. duodenalis isolates obtained from hospitals, a veterinary clinic, a day-care center and important environmental sites were determined via multilocus sequence-based genotyping using three unlinked gene loci. Cysts of Giardia were detected at all of the environmental sites. Mixed assemblages were detected in 25% of the total samples, and an elevated number of haplotypes was identified. The main haplotypes were shared among the groups, and new subtypes were identified at all loci. Ten multilocus genotypes were identified: 7 for assemblage A and 3 for assemblage B. Conclusions/Significance There is persistent G. duodenalis contamination at important environmental sites in the city. The identified mixed assemblages likely represent mixed infections, suggesting high endemicity of Giardia in these hosts. Most Giardia isolates obtained in this study displayed zoonotic potential. The high degree of genetic diversity in the isolates obtained from both clinical and environmental samples suggests that multiple sources of infection are likely responsible for the detected contamination events. 
    The finding that many multilocus genotypes (MLGs) and haplotypes are shared by different groups suggests that these sources of infection may be related and indicates that there is a notable risk of human infection caused by Giardia in this region. PMID:25536055

  20. Multi-locus analysis of Giardia duodenalis from nonhuman primates kept in zoos in China: geographical segregation and host-adaptation of assemblage B isolates.

    PubMed

    Karim, Md Robiul; Wang, Rongjun; Yu, Fuchang; Li, Tongyi; Dong, Haiju; Li, Dezhong; Zhang, Longxian; Li, Junqiang; Jian, Fuchun; Zhang, Sumei; Rume, Farzana Islam; Ning, Changshen; Xiao, Lihua

    2015-03-01

    Only a few studies based on single locus characterization have been conducted on the molecular epidemiology of Giardia duodenalis in nonhuman primates (NHPs). The present study was conducted to examine the occurrence and genotype identity of G. duodenalis in NHPs based on multi-locus analysis of the small-subunit ribosomal RNA (SSU rRNA), triose phosphate isomerase (tpi), glutamate dehydrogenase (gdh), and beta-giardin (bg) genes. Fecal specimens were collected from 496 animals of 36 NHP species kept in seven zoos in China and screened for G. duodenalis by tpi-based PCR. G. duodenalis was detected in 92 (18.6%) specimens from 18 NHP species, belonging to assemblage A (n=4) and B (n=88). In positive NHP species, the infection rates ranged from 4.8% to 100%. In tpi sequence analysis, the assemblage A included subtypes A1, A2 and one novel subtype. Multi-locus analysis of the tpi, gdh, and bg genes detected 11 (8 known and 3 new), 6 (3 known and 3 new) and 9 (2 known and 7 new) subtypes in 88, 47 and 35 isolates in assemblage B, respectively. Thirty-two assemblage B isolates with data at all three loci yielded 15 multi-locus genotypes (MLGs), including 2 known and 13 new MLGs. Phylogenetic analysis of concatenated sequences of assemblage B showed that MLGs found here were genetically different from those of humans, NHPs, rabbit and guinea pig in Italy and Sweden. It further indicated that assemblage B isolates in ring-tailed lemurs and squirrel monkeys might be genetically different from those in other NHPs. These data suggest that NHPs are mainly infected with G. duodenalis assemblage B and there might be geographical segregation and host-adaptation in assemblage B in NHPs. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Identifying the seasonal origins of human campylobacteriosis

    PubMed Central

    STRACHAN, N. J. C.; ROTARIU, O.; SMITH-PALMER, A.; COWDEN, J.; SHEPPARD, S. K.; O’BRIEN, S. J.; MAIDEN, M. C. J.; MACRAE, M.; BESSELL, P. R.; MATTHEWS, L.; REID, S. W. J.; INNOCENT, G. T.; OGDEN, I. D.; FORBES, K. J.

    2014-01-01

    SUMMARY Human campylobacteriosis exhibits a distinctive seasonality in temperate regions. This paper aims to identify the origins of this seasonality. Clinical isolates [typed by multi-locus sequence typing (MLST)] and epidemiological data were collected from Scotland. Young rural children were found to have an increased burden of disease in the late spring due to strains of non-chicken origin (e.g. ruminant and wild bird strains from environmental sources). In contrast the adult population had an extended summer peak associated with chicken strains. Travel abroad and UK mainland travel were associated with up to 17% and 18% of cases, respectively. International strains were associated with chicken, had a higher diversity than indigenous strains and a different spectrum of MLST types representative of these countries. Integrating empirical epidemiology and molecular subtyping can successfully elucidate the seasonal components of human campylobacteriosis. The findings will enable public health officials to focus strategies to reduce the disease burden. PMID:22989449

  2. Population Structure of Xylella fastidiosa Associated with Almond Leaf Scorch Disease in the San Joaquin Valley of California.

    PubMed

    Lin, Hong; Islam, Md Sajedul; Cabrera-La Rosa, Juan C; Civerolo, Edwin L; Groves, Russell L

    2015-06-01

    Xylella fastidiosa causes disease in many commercial crops, including almond leaf scorch (ALS) disease in susceptible almond (Prunus dulcis). In this study, genetic diversity and population structure of X. fastidiosa associated with ALS disease were evaluated. Isolates obtained from two almond orchards in Fresno and Kern County in the San Joaquin Valley of California were analyzed for two successive years. Multilocus simple-sequence repeat (SSR) analysis revealed two major genetic clusters that were associated with two host cultivars, 'Sonora' and 'Nonpareil', respectively, regardless of the year of study or location of the orchard. These relationships suggest that host cultivar selection and adaptation are major driving forces shaping ALS X. fastidiosa population structure in the San Joaquin Valley. This finding will provide insight into understanding pathogen adaptation and host selection in the context of ALS disease dynamics.

  3. Bayesian Inference on Malignant Breast Cancer in Nigeria: A Diagnosis of MCMC Convergence

    PubMed Central

    Ogunsakin, Ropo Ebenezer; Siaka, Lougue

    2017-01-01

    Background: No previous study has classified malignant breast tumors in detail based on Markov Chain Monte Carlo (MCMC) convergence in Western Nigeria. This study therefore aims to profile patients living with benign and malignant breast tumors in two different hospitals among women of Western Nigeria, with a focus on prognostic factors and MCMC convergence. Materials and Methods: A hospital-based record was used to identify prognostic factors for malignant breast cancer among women of Western Nigeria. This paper describes Bayesian inference and demonstrates its use in estimating the parameters of a logistic regression via the Markov Chain Monte Carlo (MCMC) algorithm. The result of the Bayesian approach is compared with that of classical statistics. Results: The mean age of the respondents was 42.2 ± 16.6 years, with 52% of the women aged 35-49 years. The results of both techniques suggest that age and women with at least high school education have a significantly higher risk of being diagnosed with malignant breast tumors than benign breast tumors. The results also indicate that a reduction of standard errors is associated with the coefficients obtained from the Bayesian approach. In addition, simulation results reveal that women with at least high school education are 1.3 times more at risk of having a malignant breast lesion in Western Nigeria than a benign breast lesion. Conclusion: We concluded that more efforts are required towards creating awareness and advocacy campaigns on how the prevalence of malignant breast lesions can be reduced, especially among women. The application of the Bayesian approach produces precise estimates for modeling malignant breast cancer. PMID:29072396
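
    As a rough illustration of estimating logistic regression parameters by MCMC, the sketch below runs a random-walk Metropolis sampler on a tiny toy data set. Everything in it is an assumption made for illustration: the data are invented, the Normal(0, 10^2) priors, step size, and burn-in length are arbitrary choices, and the record does not say which sampler or software the authors used. A real analysis would also need the convergence diagnostics that the paper focuses on.

```python
import math
import random


def log_posterior(beta, xs, ys, prior_sd=10.0):
    """Unnormalized log posterior for a logistic regression with
    independent Normal(0, prior_sd^2) priors on intercept and slope."""
    b0, b1 = beta
    lp = -(b0 ** 2 + b1 ** 2) / (2.0 * prior_sd ** 2)
    for x, y in zip(xs, ys):
        eta = b0 + b1 * x
        # Bernoulli log-likelihood y*eta - log(1 + exp(eta)), stable form
        lp += y * eta - (max(eta, 0.0) + math.log1p(math.exp(-abs(eta))))
    return lp


def metropolis(xs, ys, n_iter=5000, step=0.3, seed=1):
    """Random-walk Metropolis; returns all draws and the acceptance rate."""
    rng = random.Random(seed)
    beta = (0.0, 0.0)
    lp = log_posterior(beta, xs, ys)
    draws, accepted = [], 0
    for _ in range(n_iter):
        prop = (beta[0] + rng.gauss(0.0, step), beta[1] + rng.gauss(0.0, step))
        lp_prop = log_posterior(prop, xs, ys)
        # Accept with probability min(1, posterior ratio)
        if rng.random() < math.exp(min(0.0, lp_prop - lp)):
            beta, lp = prop, lp_prop
            accepted += 1
        draws.append(beta)
    return draws, accepted / n_iter


# Hypothetical toy data: a standardized covariate and a binary outcome.
xs = [-2.0, -1.5, -1.0, -0.5, 0.0, 0.5, 1.0, 1.5, 2.0, -1.0, 1.0, 0.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1, 1, 0, 1, 1]

draws, rate = metropolis(xs, ys)
kept = draws[1000:]  # discard an arbitrary burn-in
slope_mean = sum(b1 for _, b1 in kept) / len(kept)
print("posterior mean slope:", round(slope_mean, 2), "acceptance rate:", round(rate, 2))
```

    The posterior mean of the slope plays the role of the coefficient estimate, and the spread of the retained draws plays the role of its standard error.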

  4. Quantifying Uncertainty in Near Surface Electromagnetic Imaging Using Bayesian Methods

    NASA Astrophysics Data System (ADS)

    Blatter, D. B.; Ray, A.; Key, K.

    2017-12-01

    Geoscientists commonly use electromagnetic methods to image the Earth's near surface. Field measurements of EM fields are made (often with the aid of an artificial EM source) and then used to infer near surface electrical conductivity via a process known as inversion. In geophysics, the standard inversion tool kit is robust and can provide an estimate of the Earth's near surface conductivity that is both geologically reasonable and compatible with the measured field data. However, standard inverse methods struggle to provide a sense of the uncertainty in the estimate they provide. This is because the task of finding an Earth model that explains the data to within measurement error is non-unique - that is, there are many, many such models; but the standard methods provide only one "answer." An alternative method, known as Bayesian inversion, seeks to explore the full range of Earth model parameters that can adequately explain the measured data, rather than attempting to find a single, "ideal" model. Bayesian inverse methods can therefore provide a quantitative assessment of the uncertainty inherent in trying to infer near surface conductivity from noisy, measured field data. This study applies a Bayesian inverse method (called trans-dimensional Markov chain Monte Carlo) to transient airborne EM data previously collected over Taylor Valley - one of the McMurdo Dry Valleys in Antarctica. Our results confirm the reasonableness of previous estimates (made using standard methods) of near surface conductivity beneath Taylor Valley. In addition, we demonstrate quantitatively the uncertainty associated with those estimates. We demonstrate that Bayesian inverse methods can provide quantitative uncertainty to estimates of near surface conductivity.

  5. Bayesian Estimation of Small Effects in Exercise and Sports Science.

    PubMed

    Mengersen, Kerrie L; Drovandi, Christopher C; Robert, Christian P; Pyne, David B; Gore, Christopher J

    2016-01-01

    The aim of this paper is to provide a Bayesian formulation of the so-called magnitude-based inference approach to quantifying and interpreting effects, and in a case study example provide accurate probabilistic statements that correspond to the intended magnitude-based inferences. The model is described in the context of a published small-scale athlete study which employed a magnitude-based inference approach to compare the effect of two altitude training regimens (live high-train low (LHTL), and intermittent hypoxic exposure (IHE)) on running performance and blood measurements of elite triathletes. The posterior distributions, and corresponding point and interval estimates, for the parameters and associated effects and comparisons of interest, were estimated using Markov chain Monte Carlo simulations. The Bayesian analysis was shown to provide more direct probabilistic comparisons of treatments and to be able to identify small effects of interest. The approach avoided asymptotic assumptions and overcame issues such as multiple testing. Bayesian analysis of unscaled effects showed a probability of 0.96 that LHTL yields a substantially greater increase in hemoglobin mass than IHE, a 0.93 probability of a substantially greater improvement in running economy and a greater than 0.96 probability that both IHE and LHTL yield a substantially greater improvement in maximum blood lactate concentration compared to a Placebo. The conclusions are consistent with those obtained using a 'magnitude-based inference' approach that has been promoted in the field. The paper demonstrates that a fully Bayesian analysis is a simple and effective way of analysing small effects, providing a rich set of results that are straightforward to interpret in terms of probabilistic statements.

  6. Comparing models for quantitative risk assessment: an application to the European Registry of foreign body injuries in children.

    PubMed

    Berchialla, Paola; Scarinzi, Cecilia; Snidero, Silvia; Gregori, Dario

    2016-08-01

    Risk Assessment is the systematic study of decisions subject to uncertain consequences. Increasing interest has been focused on modeling techniques like Bayesian Networks because of their capability of (1) combining in the probabilistic framework different types of evidence, including both expert judgments and objective data; (2) overturning previous beliefs in the light of new information being received; and (3) making predictions even with incomplete data. In this work, we proposed a comparison among Bayesian Networks and other classical Quantitative Risk Assessment techniques such as Neural Networks, Classification Trees, Random Forests and Logistic Regression models. Hybrid approaches, combining both Classification Trees and Bayesian Networks, were also considered. Among Bayesian Networks, a clear distinction between a purely data-driven approach and a combination of expert knowledge with objective data is made. The aim of this paper is to evaluate which of these models can best be applied, in the framework of Quantitative Risk Assessment, to assess the safety of children who are exposed to the risk of inhalation/insertion/aspiration of consumer products. The issue of preventing injuries in children is of paramount importance, in particular where product design is involved: quantifying the risk associated with product characteristics can be of great usefulness in addressing the product safety design regulation. Data of the European Registry of Foreign Bodies Injuries formed the starting evidence for risk assessment. Results showed that Bayesian Networks appeared to have both the ease of interpretability and accuracy in making prediction, even if simpler models like logistic regression still performed well. © The Author(s) 2013.

  7. Model Diagnostics for Bayesian Networks

    ERIC Educational Resources Information Center

    Sinharay, Sandip

    2006-01-01

    Bayesian networks are frequently used in educational assessments primarily for learning about students' knowledge and skills. There is little work on assessing the fit of Bayesian networks. This article employs the posterior predictive model checking method, a popular Bayesian model checking tool, to assess the fit of simple Bayesian networks. A…

  8. A Gentle Introduction to Bayesian Analysis: Applications to Developmental Research

    PubMed Central

    van de Schoot, Rens; Kaplan, David; Denissen, Jaap; Asendorpf, Jens B; Neyer, Franz J; van Aken, Marcel AG

    2014-01-01

    Bayesian statistical methods are becoming ever more popular in applied and fundamental research. In this study a gentle introduction to Bayesian analysis is provided. It is shown under what circumstances it is attractive to use Bayesian estimation, and how to interpret properly the results. First, the ingredients underlying Bayesian methods are introduced using a simplified example. Thereafter, the advantages and pitfalls of the specification of prior knowledge are discussed. To illustrate Bayesian methods explained in this study, in a second example a series of studies that examine the theoretical framework of dynamic interactionism are considered. In the Discussion the advantages and disadvantages of using Bayesian statistics are reviewed, and guidelines on how to report on Bayesian statistics are provided. PMID:24116396

  9. Efficient Bayesian mixed model analysis increases association power in large cohorts

    PubMed Central

    Loh, Po-Ru; Tucker, George; Bulik-Sullivan, Brendan K; Vilhjálmsson, Bjarni J; Finucane, Hilary K; Salem, Rany M; Chasman, Daniel I; Ridker, Paul M; Neale, Benjamin M; Berger, Bonnie; Patterson, Nick; Price, Alkes L

    2014-01-01

    Linear mixed models are a powerful statistical tool for identifying genetic associations and avoiding confounding. However, existing methods are computationally intractable in large cohorts, and may not optimize power. All existing methods require time cost O(MN^2) (where N = #samples and M = #SNPs) and implicitly assume an infinitesimal genetic architecture in which effect sizes are normally distributed, which can limit power. Here, we present a far more efficient mixed model association method, BOLT-LMM, which requires only a small number of O(MN)-time iterations and increases power by modeling more realistic, non-infinitesimal genetic architectures via a Bayesian mixture prior on marker effect sizes. We applied BOLT-LMM to nine quantitative traits in 23,294 samples from the Women’s Genome Health Study (WGHS) and observed significant increases in power, consistent with simulations. Theory and simulations show that the boost in power increases with cohort size, making BOLT-LMM appealing for GWAS in large cohorts. PMID:25642633
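
    The gap between a quadratic-in-N cost and an O(MN) iteration can be illustrated with a generic linear-algebra fact, offered here as a sketch and not as a description of how BOLT-LMM is implemented. If X is an N x M genotype matrix, forming the N x N matrix X X^T takes on the order of M N^2 operations, whereas the product (X X^T) v can be computed as X (X^T v) with two passes over X, which is on the order of M N operations. Iterative solvers need only such matrix-vector products. The function names and toy genotypes below are hypothetical.

```python
def gram_matvec_explicit(X, v):
    """Form K = X X^T (N x N, about M*N^2 multiplications), then return K v."""
    n = len(X)
    K = [[sum(a * b for a, b in zip(X[i], X[j])) for j in range(n)]
         for i in range(n)]
    return [sum(K[i][j] * v[j] for j in range(n)) for i in range(n)]


def gram_matvec_implicit(X, v):
    """Return X (X^T v) without forming X X^T (about 2*M*N multiplications)."""
    n, m = len(X), len(X[0])
    # w = X^T v has length M
    w = [sum(X[i][k] * v[i] for i in range(n)) for k in range(m)]
    # X w has length N
    return [sum(row[k] * w[k] for k in range(m)) for row in X]


# Toy genotype counts: N = 3 samples (rows), M = 4 SNPs (columns).
X = [[0, 1, 2, 1],
     [1, 1, 0, 2],
     [2, 0, 1, 0]]
v = [1.0, -2.0, 0.5]

explicit = gram_matvec_explicit(X, v)
implicit = gram_matvec_implicit(X, v)
print(explicit, implicit)
```

    Both functions return the same vector; only the amount of work differs, and the difference grows with the number of samples N.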

  10. Probabilistic evaluation of n traces with no putative source: A likelihood ratio based approach in an investigative framework.

    PubMed

    De March, I; Sironi, E; Taroni, F

    2016-09-01

    Analysis of marks recovered from different crime scenes can be useful to detect a linkage between criminal cases, even though a putative source for the recovered traces is not available. This particular circumstance is often encountered in the early stage of investigations and thus, the evaluation of evidence association may provide useful information for the investigators. This association is evaluated here from a probabilistic point of view: a likelihood ratio based approach is suggested in order to quantify the strength of the evidence of trace association in the light of two mutually exclusive propositions, namely that the n traces come from a common source or from an unspecified number of sources. To deal with this kind of problem, probabilistic graphical models are used, in the form of Bayesian networks and object-oriented Bayesian networks, allowing users to intuitively handle the uncertainty related to the inferential problem. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
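
    In generic notation, the likelihood ratio for two competing propositions can be written as below. The symbols are assumptions introduced for illustration and are not taken from the paper: E denotes the observations made on the n traces, H_1 the proposition that the traces come from a common source, H_2 the proposition that they come from an unspecified number of sources, and I the background information.

```latex
\[
  \mathrm{LR} \;=\; \frac{\Pr(E \mid H_1, I)}{\Pr(E \mid H_2, I)}
\]
```

    A value above 1 means the observations are more probable under H_1 than under H_2, and a value below 1 means the reverse.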

  11. Sex differences in the development of neuroanatomical functional connectivity underlying intelligence found using Bayesian connectivity analysis.

    PubMed

    Schmithorst, Vincent J; Holland, Scott K

    2007-03-01

    A Bayesian method for functional connectivity analysis was adapted to investigate between-group differences. This method was applied in a large cohort of almost 300 children to investigate differences in boys and girls in the relationship between intelligence and functional connectivity for the task of narrative comprehension. For boys, a greater association was shown between intelligence and the functional connectivity linking Broca's area to auditory processing areas, including Wernicke's areas and the right posterior superior temporal gyrus. For girls, a greater association was shown between intelligence and the functional connectivity linking the left posterior superior temporal gyrus to Wernicke's areas bilaterally. A developmental effect was also seen, with girls displaying a positive correlation with age in the association between intelligence and the functional connectivity linking the right posterior superior temporal gyrus to Wernicke's areas bilaterally. Our results demonstrate a sexual dimorphism in the relationship of functional connectivity to intelligence in children and an increasing reliance on inter-hemispheric connectivity in girls with age.

  12. Multivariate Bayesian analysis of Gaussian, right censored Gaussian, ordered categorical and binary traits using Gibbs sampling

    PubMed Central

    Korsgaard, Inge Riis; Lund, Mogens Sandø; Sorensen, Daniel; Gianola, Daniel; Madsen, Per; Jensen, Just

    2003-01-01

    A fully Bayesian analysis using Gibbs sampling and data augmentation in a multivariate model of Gaussian, right censored, and grouped Gaussian traits is described. The grouped Gaussian traits are either ordered categorical traits (with more than two categories) or binary traits, where the grouping is determined via thresholds on the underlying Gaussian scale, the liability scale. Allowances are made for unequal models, unknown covariance matrices and missing data. Having outlined the theory, strategies for implementation are reviewed. These include joint sampling of location parameters; efficient sampling from the fully conditional posterior distribution of augmented data, a multivariate truncated normal distribution; and sampling from the conditional inverse Wishart distribution, the fully conditional posterior distribution of the residual covariance matrix. Finally, a simulated dataset was analysed to illustrate the methodology. This paper concentrates on a model where residuals associated with liabilities of the binary traits are assumed to be independent. A Bayesian analysis using Gibbs sampling is outlined for the model where this assumption is relaxed. PMID:12633531

  13. Autistic traits, but not schizotypy, predict increased weighting of sensory information in Bayesian visual integration.

    PubMed

    Karvelis, Povilas; Seitz, Aaron R; Lawrie, Stephen M; Seriès, Peggy

    2018-05-14

    Recent theories propose that schizophrenia/schizotypy and autistic spectrum disorder are related to impairments in Bayesian inference, that is, how the brain integrates sensory information (likelihoods) with prior knowledge. However, existing accounts fail to clarify: (i) how proposed theories differ in accounts of ASD vs. schizophrenia and (ii) whether the impairments result from weaker priors or enhanced likelihoods. Here, we directly address these issues by characterizing how 91 healthy participants, scored for autistic and schizotypal traits, implicitly learned and combined priors with sensory information. This was accomplished through a visual statistical learning paradigm designed to quantitatively assess variations in individuals' likelihoods and priors. The acquisition of the priors was found to be intact along both trait spectra. However, autistic traits were associated with more veridical perception and weaker influence of expectations. Bayesian modeling revealed that this was due, not to weaker prior expectations, but to more precise sensory representations. © 2018, Karvelis et al.

  14. Π4U: A high performance computing framework for Bayesian uncertainty quantification of complex models

    NASA Astrophysics Data System (ADS)

    Hadjidoukas, P. E.; Angelikopoulos, P.; Papadimitriou, C.; Koumoutsakos, P.

    2015-03-01

    We present Π4U, an extensible framework, for non-intrusive Bayesian Uncertainty Quantification and Propagation (UQ+P) of complex and computationally demanding physical models, that can exploit massively parallel computer architectures. The framework incorporates Laplace asymptotic approximations as well as stochastic algorithms, along with distributed numerical differentiation and task-based parallelism for heterogeneous clusters. Sampling is based on the Transitional Markov Chain Monte Carlo (TMCMC) algorithm and its variants. The optimization tasks associated with the asymptotic approximations are treated via the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). A modified subset simulation method is used for posterior reliability measurements of rare events. The framework accommodates scheduling of multiple physical model evaluations based on an adaptive load balancing library and shows excellent scalability. In addition to the software framework, we also provide guidelines as to the applicability and efficiency of Bayesian tools when applied to computationally demanding physical models. Theoretical and computational developments are demonstrated with applications drawn from molecular dynamics, structural dynamics and granular flow.

  15. A Bayesian network approach to the database search problem in criminal proceedings

    PubMed Central

    2012-01-01

    Background The ‘database search problem’, that is, the strengthening of a case - in terms of probative value - against an individual who is found as a result of a database search, has been approached during the last two decades with substantial mathematical analyses, accompanied by lively debate and centrally opposing conclusions. This represents a challenging obstacle in teaching but also hinders a balanced and coherent discussion of the topic within the wider scientific and legal community. This paper revisits and tracks the associated mathematical analyses in terms of Bayesian networks. Their derivation and discussion for capturing probabilistic arguments that explain the database search problem are outlined in detail. The resulting Bayesian networks offer a distinct view on the main debated issues, along with further clarity. Methods As a general framework for representing and analyzing formal arguments in probabilistic reasoning about uncertain target propositions (that is, whether or not a given individual is the source of a crime stain), this paper relies on graphical probability models, in particular, Bayesian networks. This graphical probability modeling approach is used to capture, within a single model, a series of key variables, such as the number of individuals in a database, the size of the population of potential crime stain sources, and the rarity of the corresponding analytical characteristics in a relevant population. Results This paper demonstrates the feasibility of deriving Bayesian network structures for analyzing, representing, and tracking the database search problem. The output of the proposed models can be shown to agree with existing but exclusively formulaic approaches. Conclusions The proposed Bayesian networks allow one to capture and analyze the currently most well-supported but reputedly counter-intuitive and difficult solution to the database search problem in a way that goes beyond the traditional, purely formulaic expressions. 
    The method’s graphical environment, along with its computational and probabilistic architectures, represents a rich package that offers analysts and discussants additional modes of interaction, concise representation, and coherent communication. PMID:22849390

  16. A Gentle Introduction to Bayesian Analysis: Applications to Developmental Research

    ERIC Educational Resources Information Center

    van de Schoot, Rens; Kaplan, David; Denissen, Jaap; Asendorpf, Jens B.; Neyer, Franz J.; van Aken, Marcel A. G.

    2014-01-01

    Bayesian statistical methods are becoming ever more popular in applied and fundamental research. In this study a gentle introduction to Bayesian analysis is provided. It is shown under what circumstances it is attractive to use Bayesian estimation, and how to interpret properly the results. First, the ingredients underlying Bayesian methods are…

  17. Bayesian correction for covariate measurement error: A frequentist evaluation and comparison with regression calibration.

    PubMed

    Bartlett, Jonathan W; Keogh, Ruth H

    2018-06-01

    Bayesian approaches for handling covariate measurement error are well established and yet arguably are still relatively little used by researchers. For some this is likely due to unfamiliarity or disagreement with the Bayesian inferential paradigm. For others a contributory factor is the inability of standard statistical packages to perform such Bayesian analyses. In this paper, we first give an overview of the Bayesian approach to handling covariate measurement error, and contrast it with regression calibration, arguably the most commonly adopted approach. We then argue why the Bayesian approach has a number of statistical advantages compared to regression calibration and demonstrate that implementing the Bayesian approach is usually quite feasible for the analyst. Next, we describe the closely related maximum likelihood and multiple imputation approaches and explain why we believe the Bayesian approach to generally be preferable. We then empirically compare the frequentist properties of regression calibration and the Bayesian approach through simulation studies. The flexibility of the Bayesian approach to handle both measurement error and missing data is then illustrated through an analysis of data from the Third National Health and Nutrition Examination Survey.

  18. Description of a new species of crested newt, previously subsumed in Triturus ivanbureschi (Amphibia: Caudata: Salamandridae).

    PubMed

    Wielstra, B; Arntzen, J W

    2016-05-05

    Multilocus molecular data play a pivotal role in diagnosing cryptic species (i.e. genetically distinct but morphologically similar species). A multilocus phylogeographic survey has provided compelling evidence that Triturus ivanbureschi sensu lato comprises two distinct gene pools with restricted gene flow. We conclude that this taxon had better be treated as two distinct (albeit morphologically cryptic) species. The name T. ivanbureschi should be restricted to the western species, which is distributed in western Asiatic Turkey plus the south-eastern Balkan Peninsula. No name is as yet available for the eastern species, which is distributed in northern Asiatic Turkey. We propose the name T. anatolicus sp. nov. for the eastern species and provide a formal species description.

  19. Predictors of Outcome in Traumatic Brain Injury: New Insight Using Receiver Operating Curve Indices and Bayesian Network Analysis.

    PubMed

    Zador, Zsolt; Sperrin, Matthew; King, Andrew T

    2016-01-01

    Traumatic brain injury remains a global health problem. Understanding the relative importance of outcome predictors helps optimize our treatment strategies by informing assessment protocols, clinical decisions and trial designs. In this study we establish importance ranking for outcome predictors based on receiver operating indices to identify key predictors of outcome and create simple predictive models. We then explore the associations between key outcome predictors using Bayesian networks to gain further insight into predictor importance. We analyzed the corticosteroid randomization after significant head injury (CRASH) trial database of 10008 patients and included patients for whom demographics, injury characteristics, computer tomography (CT) findings and Glasgow Outcome Scale (GCS) were recorded (total of 13 predictors, which would be available to clinicians within a few hours following the injury in 6945 patients). Predictions of clinical outcome (death or severe disability at 6 months) were performed using logistic regression models with 5-fold cross validation. Predictive performance was measured using standardized partial area (pAUC) under the receiver operating curve (ROC) and we used Delong test for comparisons. Variable importance ranking was based on pAUC targeted at specificity (pAUCSP) and sensitivity (pAUCSE) intervals of 90-100%. Probabilistic associations were depicted using Bayesian networks. Complete AUC analysis showed very good predictive power (AUC = 0.8237, 95% CI: 0.8138-0.8336) for the complete model. Specificity focused importance ranking highlighted age, pupillary, motor responses, obliteration of basal cisterns/3rd ventricle and midline shift. Interestingly when targeting model sensitivity, the highest-ranking variables were age, severe extracranial injury, verbal response, hematoma on CT and motor response. 
    Simplified models, which included only these key predictors, had similar performance (pAUCSP = 0.6523, 95% CI: 0.6402-0.6641 and pAUCSE = 0.6332, 95% CI: 0.62-0.6477) compared to the complete models (pAUCSP = 0.6664, 95% CI: 0.6543-0.679, pAUCSE = 0.6436, 95% CI: 0.6289-0.6585, de Long p value 0.1165 and 0.3448 respectively). Bayesian networks showed the predictors that did not feature in the simplified models were associated with those that did. We demonstrate that importance based variable selection allows simplified predictive models to be created while maintaining prediction accuracy. Variable selection targeting specificity confirmed key components of clinical assessment in TBI whereas sensitivity based ranking suggested extracranial injury as one of the important predictors. These results help refine our approach to head injury assessment, decision-making and outcome prediction targeted at model sensitivity and specificity. Bayesian networks proved to be a comprehensive tool for depicting probabilistic associations for key predictors giving insight into why the simplified model has maintained accuracy.

  20. Population Genetic Structure of Listeria monocytogenes Strains as Determined by Pulsed-Field Gel Electrophoresis and Multilocus Sequence Typing

    PubMed Central

    Henri, Clémentine; Félix, Benjamin; Guillier, Laurent; Leekitcharoenphon, Pimlapas; Michelon, Damien; Mariet, Jean-François; Aarestrup, Frank M.; Mistou, Michel-Yves; Hendriksen, René S.

    2016-01-01

    ABSTRACT Listeria monocytogenes is a ubiquitous bacterium that may cause the foodborne illness listeriosis. Only a small amount of data about the population genetic structure of strains isolated from food is available. This study aimed to provide an accurate view of the L. monocytogenes food strain population in France. From 1999 to 2014, 1,894 L. monocytogenes strains were isolated from food at the French National Reference Laboratory for L. monocytogenes and classified according to the five risk food matrices defined by the European Food Safety Authority (EFSA). A total of 396 strains were selected on the basis of different pulsed-field gel electrophoresis (PFGE) clusters, serotypes, and strain origins and typed by multilocus sequence typing (MLST), and the MLST results were supplemented with MLST data available from Institut Pasteur, representing human and additional food strains from France. The distribution of sequence types (STs) was compared between food and clinical strains on a panel of 675 strains. High congruence between PFGE and MLST was found. Out of 73 PFGE clusters, the two most prevalent corresponded to ST9 and ST121. Using original statistical analysis, we demonstrated that (i) there was not a clear association between ST9 and ST121 and the food matrices, (ii) serotype IIc, ST8, and ST4 were associated with meat products, and (iii) ST13 was associated with dairy products. Of the two major STs, ST121 was the ST that included the fewest clinical strains, which might indicate lower virulence. This observation may be directly relevant for refining risk analysis models for the better management of food safety. IMPORTANCE This study showed a very useful backward compatibility between PFGE and MLST for surveillance. The results enabled better understanding of the population structure of L. monocytogenes strains isolated from food and management of the health risks associated with L. monocytogenes food strains. 
    Moreover, this work provided an accurate view of L. monocytogenes strain populations associated with specific food matrices. We clearly showed that some STs were associated with food matrices, such as meat, meat products, and dairy products. We opened the way to source attribution modeling in order to quantify the relative importance of the main food matrices. PMID:27235443

  1. The current state of Bayesian methods in medical product development: survey results and recommendations from the DIA Bayesian Scientific Working Group.

    PubMed

    Natanegara, Fanni; Neuenschwander, Beat; Seaman, John W; Kinnersley, Nelson; Heilmann, Cory R; Ohlssen, David; Rochester, George

    2014-01-01

    Bayesian applications in medical product development have recently gained popularity. Despite many advances in Bayesian methodology and computations, increase in application across the various areas of medical product development has been modest. The DIA Bayesian Scientific Working Group (BSWG), which includes representatives from industry, regulatory agencies, and academia, has adopted the vision to ensure Bayesian methods are well understood, accepted more broadly, and appropriately utilized to improve decision making and enhance patient outcomes. As Bayesian applications in medical product development are wide ranging, several sub-teams were formed to focus on various topics such as patient safety, non-inferiority, prior specification, comparative effectiveness, joint modeling, program-wide decision making, analytical tools, and education. The focus of this paper is on the recent effort of the BSWG Education sub-team to administer a Bayesian survey to statisticians across 17 organizations involved in medical product development. We summarize results of this survey, from which we provide recommendations on how to accelerate progress in Bayesian applications throughout medical product development. The survey results support findings from the literature and provide additional insight on regulatory acceptance of Bayesian methods and information on the need for a Bayesian infrastructure within an organization. The survey findings support the claim that only modest progress in areas of education and implementation has been made recently, despite substantial progress in Bayesian statistical research and software availability. Copyright © 2013 John Wiley & Sons, Ltd.

  2. Sensitivity analyses for sparse-data problems-using weakly informative bayesian priors.

    PubMed

    Hamra, Ghassan B; MacLehose, Richard F; Cole, Stephen R

    2013-03-01

    Sparse-data problems are common, and approaches are needed to evaluate the sensitivity of parameter estimates based on sparse data. We propose a Bayesian approach that uses weakly informative priors to quantify sensitivity of parameters to sparse data. The weakly informative prior is based on accumulated evidence regarding the expected magnitude of relationships using relative measures of disease association. We illustrate the use of weakly informative priors with an example of the association of lifetime alcohol consumption and head and neck cancer. When data are sparse and the observed information is weak, a weakly informative prior will shrink parameter estimates toward the prior mean. Additionally, the example shows that when data are not sparse and the observed information is not weak, a weakly informative prior is not influential. Advancements in implementation of Markov Chain Monte Carlo simulation make this sensitivity analysis easily accessible to the practicing epidemiologist.
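    The shrinkage behavior described in this abstract can be illustrated with a normal approximation on the log odds ratio scale. The following is a minimal sketch under stated assumptions, not the authors' MCMC analysis: the function name `posterior_log_or`, the prior (mean 0, standard deviation 1), and both example estimates are hypothetical values chosen for illustration only.

```python
import math

def posterior_log_or(est_log_or, est_se, prior_mean, prior_sd):
    """Combine a normal prior with a normal likelihood by precision weighting.

    Returns the posterior mean and standard deviation of the log odds ratio.
    """
    w_data = 1.0 / est_se ** 2     # precision of the data estimate
    w_prior = 1.0 / prior_sd ** 2  # precision of the prior
    post_var = 1.0 / (w_data + w_prior)
    post_mean = post_var * (w_data * est_log_or + w_prior * prior_mean)
    return post_mean, math.sqrt(post_var)

# Hypothetical sparse-data estimate: OR = 8 with a large standard error.
sparse = posterior_log_or(math.log(8.0), 1.5, prior_mean=0.0, prior_sd=1.0)

# Hypothetical well-supported estimate: same OR, small standard error.
dense = posterior_log_or(math.log(8.0), 0.1, prior_mean=0.0, prior_sd=1.0)

print("sparse data: posterior OR = %.2f" % math.exp(sparse[0]))
print("dense data:  posterior OR = %.2f" % math.exp(dense[0]))
```

    With the sparse estimate the posterior odds ratio is pulled strongly toward the prior mean, whereas with the precise estimate the prior has almost no influence, which mirrors the two situations contrasted in the abstract.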

  3. Sensitivity Analyses for Sparse-Data Problems—Using Weakly Informative Bayesian Priors

    PubMed Central

    Hamra, Ghassan B.; MacLehose, Richard F.; Cole, Stephen R.

    2013-01-01

    Sparse-data problems are common, and approaches are needed to evaluate the sensitivity of parameter estimates based on sparse data. We propose a Bayesian approach that uses weakly informative priors to quantify sensitivity of parameters to sparse data. The weakly informative prior is based on accumulated evidence regarding the expected magnitude of relationships using relative measures of disease association. We illustrate the use of weakly informative priors with an example of the association of lifetime alcohol consumption and head and neck cancer. When data are sparse and the observed information is weak, a weakly informative prior will shrink parameter estimates toward the prior mean. Additionally, the example shows that when data are not sparse and the observed information is not weak, a weakly informative prior is not influential. Advancements in implementation of Markov Chain Monte Carlo simulation make this sensitivity analysis easily accessible to the practicing epidemiologist. PMID:23337241

  4. Discovering complex interrelationships between socioeconomic status and health in Europe: A case study applying Bayesian Networks.

    PubMed

    Alvarez-Galvez, Javier

    2016-03-01

    Studies assume that socioeconomic status determines individuals' states of health, but how does health determine socioeconomic status? And how does this association vary depending on contextual differences? To answer this question, our study uses an additive Bayesian Networks model to explain the interrelationships between health and socioeconomic determinants using complex and messy data. This model has been used to find the most probable structure in a network to describe the interdependence of these factors in five European welfare state regimes. The advantage of this study is that it offers a specific picture to describe the complex interrelationship between socioeconomic determinants and health, producing a network that is controlled by socio-demographic factors such as gender and age. The present work provides a general framework to describe and understand the complex association between socioeconomic determinants and health. Copyright © 2016 Elsevier Inc. All rights reserved.

  5. On measures of association among genetic variables

    PubMed Central

    Gianola, Daniel; Manfredi, Eduardo; Simianer, Henner

    2012-01-01

    Summary Systems involving many variables are important in population and quantitative genetics, for example, in multi-trait prediction of breeding values and in exploration of multi-locus associations. We studied departures of the joint distribution of sets of genetic variables from independence. New measures of association based on notions of statistical distance between distributions are presented. These are more general than correlations, which are pairwise measures, and lack a clear interpretation beyond the bivariate normal distribution. Our measures are based on logarithmic (Kullback-Leibler) and on relative ‘distances’ between distributions. Indexes of association are developed and illustrated for quantitative genetics settings in which the joint distribution of the variables is either multivariate normal or multivariate-t, and we show how the indexes can be used to study linkage disequilibrium in a two-locus system with multiple alleles and present applications to systems of correlated beta distributions. Two multivariate beta and multivariate beta-binomial processes are examined, and new distributions are introduced: the GMS-Sarmanov multivariate beta and its beta-binomial counterpart. PMID:22742500
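    For the multivariate normal case mentioned in this abstract, one standard Kullback-Leibler quantity has a closed form: the distance from the joint distribution to the product of its marginals equals -0.5 ln det(R), where R is the correlation matrix. The sketch below computes that quantity in pure Python. It is an illustration of the general idea only and is not claimed to reproduce the specific indexes developed in the paper; the helper names `determinant` and `kl_from_independence` and the example matrices are assumptions.

```python
import math

def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]  # work on a copy
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0  # singular matrix
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det

def kl_from_independence(corr):
    """KL distance (nats) from a joint multivariate normal with correlation
    matrix `corr` to the product of its marginal distributions."""
    return -0.5 * math.log(determinant(corr))

# Hypothetical correlation matrices for three genetic variables.
independent = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
equicorrelated = [[1.0, 0.5, 0.5], [0.5, 1.0, 0.5], [0.5, 0.5, 1.0]]

print(kl_from_independence(independent))
print(kl_from_independence(equicorrelated))
```

    Unlike a pairwise correlation, this single number summarizes the departure from independence of the whole set of variables at once; it is zero for independent variables and grows as the correlation matrix approaches singularity.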

  6. Listeria monocytogenes sequence type 1 is predominant in ruminant rhombencephalitis

    PubMed Central

    Dreyer, Margaux; Aguilar-Bultet, Lisandra; Rupp, Sebastian; Guldimann, Claudia; Stephan, Roger; Schock, Alexandra; Otter, Arthur; Schüpbach, Gertraud; Brisse, Sylvain; Lecuit, Marc; Frey, Joachim; Oevermann, Anna

    2016-01-01

    Listeria (L.) monocytogenes is an opportunistic pathogen causing life-threatening infections in diverse mammalian species including humans and ruminants. As little is known on the link between strains and clinicopathological phenotypes, we studied potential strain-associated virulence and organ tropism in L. monocytogenes isolates from well-defined ruminant cases of clinical infections and the farm environment. The phylogeny of isolates and their virulence-associated genes were analyzed by multilocus sequence typing (MLST) and sequence analysis of virulence-associated genes. Additionally, a panel of representative isolates was subjected to in vitro infection assays. Our data suggest the environmental exposure of ruminants to a broad range of strains and yet the strong association of sequence type (ST) 1 from clonal complex (CC) 1 with rhombencephalitis, suggesting increased neurotropism of ST1 in ruminants, which is possibly related to its hypervirulence. This study emphasizes the importance of considering clonal background of L. monocytogenes isolates in surveillance, epidemiological investigation and disease control. PMID:27848981

  7. On the Adequacy of Bayesian Evaluations of Categorization Models: Reply to Vanpaemel and Lee (2012)

    ERIC Educational Resources Information Center

    Wills, Andy J.; Pothos, Emmanuel M.

    2012-01-01

    Vanpaemel and Lee (2012) argued, and we agree, that the comparison of formal models can be facilitated by Bayesian methods. However, Bayesian methods neither precede nor supplant our proposals (Wills & Pothos, 2012), as Bayesian methods can be applied both to our proposals and to their polar opposites. Furthermore, the use of Bayesian methods to…

  8. Moving beyond qualitative evaluations of Bayesian models of cognition.

    PubMed

    Hemmer, Pernille; Tauber, Sean; Steyvers, Mark

    2015-06-01

    Bayesian models of cognition provide a powerful way to understand the behavior and goals of individuals from a computational point of view. Much of the focus in the Bayesian cognitive modeling approach has been on qualitative model evaluations, where predictions from the models are compared to data that is often averaged over individuals. In many cognitive tasks, however, there are pervasive individual differences. We introduce an approach to directly infer individual differences related to subjective mental representations within the framework of Bayesian models of cognition. In this approach, Bayesian data analysis methods are used to estimate cognitive parameters and motivate the inference process within a Bayesian cognitive model. We illustrate this integrative Bayesian approach on a model of memory. We apply the model to behavioral data from a memory experiment involving the recall of heights of people. A cross-validation analysis shows that the Bayesian memory model with inferred subjective priors predicts withheld data better than a Bayesian model where the priors are based on environmental statistics. In addition, the model with inferred priors at the individual subject level led to the best overall generalization performance, suggesting that individual differences are important to consider in Bayesian models of cognition.

  9. Comprehensive Phylogenetic Analysis of Bovine Non-aureus Staphylococci Species Based on Whole-Genome Sequencing

    PubMed Central

    Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen

    2016-01-01

    Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335

  10. Analysis of clinical and food-borne isolates of Listeria monocytogenes in the United States by multilocus enzyme electrophoresis and application of the method to epidemiologic investigations.

    PubMed Central

    Bibb, W F; Gellin, B G; Weaver, R; Schwartz, B; Plikaytis, B D; Reeves, M W; Pinner, R W; Broome, C V

    1990-01-01

    To investigate the microbiology and epidemiology of the 1,700 sporadic cases of listeriosis that occur annually in the United States, we developed a multilocus enzyme electrophoresis (MEE) typing system for Listeria monocytogenes. We studied 390 isolates by MEE. Eighty-two electrophoretic types (ETs) were defined. Two distinct clusters of ETs, ET group A (ETGA) and ET group B (ETGB), separated at a genetic distance of 0.440, were identified. Strains of ETGB were associated with perinatal listeriosis (P = 0.03). All strains of H antigen type a were in ETGA, while all strains of H antigen type b were in ETGB. Among 328 clinical isolates from cases of listeriosis, 55 ETs of L. monocytogenes were defined. Thirty-four ETs were identified among 62 isolates from food products. The mean number of strains per ET (5.2) was significantly higher among clinical isolates than among food-borne isolates. Examination of isolates from outbreaks further documented the link between cases and contaminated food products. In one investigation, we found 11 different ETs, ruling out a single common source as a cause of that outbreak. By examining a large number of isolates collected over a specified time in diverse geographic locations in the United States, we have begun to establish a baseline for the study of the epidemiology of listeriosis by MEE. PMID:2117880

  11. Antimicrobial susceptibility, tetracycline and erythromycin resistance genes, and multilocus sequence typing of Streptococcus suis isolates from diseased pigs in China.

    PubMed

    Chen, Lei; Song, Yajing; Wei, Zigong; He, Hongkui; Zhang, Anding; Jin, Meilin

    2013-01-01

    Streptococcus suis (S. suis) is an emerging zoonotic pathogen causing significant economic losses in the swine industry. Here, we investigated the antimicrobial susceptibility, associated antibiotic-resistant determinants and sequence type (ST) of S. suis isolates from diseased pigs in China from 2008 to 2010. Serotype 2 was the most frequently observed strain (n=95) among the 106 S. suis strains collected, followed by serotypes 3 (n=3), 5 (n=3), 4 (n=2), 7 (n=1), 11 (n=1) and 28 (n=1). Multilocus sequence typing analysis revealed that ST1 (n=21) and ST7 (n=74) were the predominant STs, and serotype 2 was found to be significantly correlated with ST7 (P=0.017, Fisher's exact test) and CC1 (P=0.024, Fisher's exact test). The antimicrobial susceptibility results indicated that the antibiotic resistance rate was highest for tetracycline (99.1%), followed by azithromycin (68.9%), erythromycin (67.9%), clindamycin (67.9%), trimethoprim/sulfamethoxazole (16%), levofloxacin (2.8%), chloramphenicol (1.9%), cefaclor (0.9%) and ceftriaxone (0.9%). Antibiotic-resistant genes tet(M), tet(O), tet(O/W/32/O), tet(O/32/O), tet(S), tet(W), tet(L), tet(40), erm(B), mef(A/E) and msr(D) could be detected, and several tandem organizations of antibiotic resistance genes were also found in this study. In conclusion, S. suis strains isolated from diseased pigs in China were less diverse and multi-drug resistant.

  12. Class 1 integrons characterization and multilocus sequence typing of Salmonella spp. from swine production chains in Chiang Mai and Lamphun provinces, Thailand.

    PubMed

    Boonkhot, Phacharaporn; Tadee, Pakpoom; Yamsakul, Panuwat; Pocharoen, Chairoj; Chokesajjawatee, Nipa; Patchanee, Prapas

    2015-05-01

    Pigs and pork products are well known as an important source of Salmonella, one of the major zoonotic foodborne pathogens. The emergence and spread of antimicrobial resistance is becoming a major public health concern worldwide. Integrons are genetic elements known to have a role in the acquisition and expression of genes conferring antibiotic resistance. This study focuses on the prevalence of class 1 integron-carrying Salmonella and on the genetic diversity of strains obtained from swine production chains in Chiang Mai and Lamphun provinces, Thailand, using multilocus sequence typing (MLST), and compares the genetic diversity of the Salmonella sequence types from this study with pulsotypes identified in a previous study. In 175 Salmonella strains, the overall prevalence of class 1 integron-carrying Salmonella was 14%. The gene cassette array pattern "dfrA12-orfF-aadA2" was the most frequently observed. Most of the antimicrobial resistance identified was not associated with related gene cassettes harbored by Salmonella. Six sequence types were generated from 30 randomly selected strains typed by MLST. The presence of Salmonella at the human-animal-environment interface was confirmed. Linkages both in the farm to slaughterhouse contamination route and the horizontal transmission of resistance genes were demonstrated. To reduce this problem, the use of antimicrobials in livestock should be controlled by veterinarians. Education and training of food handlers as well as promotion of safe methods of food consumption are important avenues for helping prevent foodborne illness.

  13. Insights into the Emergent Bacterial Pathogen Cronobacter spp., Generated by Multilocus Sequence Typing and Analysis

    PubMed Central

    Joseph, Susan; Forsythe, Stephen J.

    2012-01-01

    Cronobacter spp. (previously known as Enterobacter sakazakii) is a bacterial pathogen affecting all age groups, with particularly severe clinical complications in neonates and infants. One recognized route of infection is the consumption of contaminated infant formula. As a recently recognized bacterial pathogen of considerable importance and regulatory control, appropriate detection and identification schemes are required. The application of multilocus sequence typing (MLST) and analysis (MLSA) of the seven alleles atpD, fusA, glnS, gltB, gyrB, infB, and ppsA (concatenated length 3036 base pairs) has led to considerable advances in our understanding of the genus. This approach is supported by both the reliability of DNA sequencing over subjective phenotyping and the establishment of an open-access, curated MLST database (http://www.pubMLST.org/cronobacter). MLST has been used to describe the diversity of the newly recognized genus, has been instrumental in the formal recognition of new Cronobacter species (C. universalis and C. condimenti), and has revealed the high clonality of strains and the association of clonal complex 4 with neonatal meningitis cases. Clearly the MLST approach has considerable benefits over the use of non-DNA sequence based methods of analysis for newly emergent bacterial pathogens. The application of MLST and MLSA has dramatically enabled us to better understand this opportunistic bacterium which can cause irreparable damage to a newborn baby’s brain, and has contributed to improved control measures to protect neonatal health. PMID:23189075

  14. Methicillin-resistant Staphylococcus aureus nasal colonization in a level III neonatal intensive care unit: Incidence and risk factors.

    PubMed

    Giuffrè, Mario; Amodio, Emanuele; Bonura, Celestino; Geraci, Daniela M; Saporito, Laura; Ortolano, Rita; Corsello, Giovanni; Mammina, Caterina

    2015-05-01

    To describe epidemiologic features and identify risk factors for methicillin-resistant Staphylococcus aureus (MRSA) acquisition in a level III neonatal intensive care unit (NICU). A prospective, cohort study in a university-affiliated NICU with an infection control program including weekly nasal cultures of all neonates. Demographic, clinical, and microbiologic data were prospectively collected between June 2009 and June 2013. Molecular characterization of MRSA isolates was done by multilocus variable number tandem repeat fingerprinting, staphylococcal cassette chromosome mec typing, and on representative isolates by multilocus sequence typing and spa typing. Of 949 neonates, 217 (22.87%) had a culture growing MRSA, including 117 neonates testing positive at their first sampling. Of these latter infants, 96 (82.05%) were inborn and 59 (50.43%) had been transferred from the nursery. Length of stay and colonization pressure were strong independent predictors of MRSA acquisition. Among MRSA isolates, 7 sequence types were identified, with ST22-IVa, spa type t223, being the predominant strain. In an endemic area, early MRSA acquisition and high colonization pressure, likely related to an influx of colonized infants from a well-infant nursery, can support persistence of MRSA in NICUs. Surveillance, molecular tracking of strains, and reinforcement of infection control practices, involving well-infant nurseries in a comprehensive infection control program, could be helpful in containing MRSA transmission. Copyright © 2015 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.

  15. Multilocus Sequence Typing of Serially Collected Isolates of Cryptococcus from HIV-Infected Patients in South Africa

    PubMed Central

    Van Wyk, Marelize; Govender, Nelesh P.; Litvintseva, Anastasia P.

    2014-01-01

    Patients with cryptococcal meningitis in sub-Saharan Africa frequently relapse following treatment. The natural history and etiology of these recurrent episodes warrant investigation. Here, we used multilocus sequence typing (MLST) to compare the molecular genotypes of strains of Cryptococcus neoformans and Cryptococcus gattii isolated from serial episodes of cryptococcal meningitis that were separated by at least 110 days. The most common MLST genotypes among the isolates were the dominant global clinical genotypes (M5 and M4) of molecular type VNI, as well as the VNI genotypes apparently restricted to southern Africa. In addition, there was considerable genetic diversity among these South African isolates, as 15% of the patients had unique genotypes. Eleven percent of the patients were reinfected with a genetically different strain following their initial diagnosis and treatment. However, the majority of serial episodes (89%) were caused by strains with the same genotype as the original strain. These results indicate that serial episodes of cryptococcosis in South Africa are frequently associated with persistence or relapse of the original infection. Using a reference broth microdilution method, we found that the serial isolates of 11% of the patients infected with strains of C. neoformans var. grubii with identical genotypes exhibited ≥4-fold increases in the MICs to fluconazole. Therefore, these recurrent episodes may have been precipitated by inadequate induction or consolidation of antifungal treatment and occasionally may have been due to increased resistance to fluconazole, which may have developed during the chronic infection. PMID:24648562

  16. Molecular typing of avian pathogenic Escherichia coli colonies originating from outbreaks of E. coli peritonitis syndrome in chicken flocks.

    PubMed

    Landman, W J M; Buter, G J; Dijkman, R; van Eck, J H H

    2014-01-01

    Escherichia coli colonies isolated from the bone marrow of freshly dead hens of laying flocks with the E. coli peritonitis syndrome (EPS) were genotyped using pulsed-field gel electrophoresis (PFGE). Typing is important from an epidemiological point of view and also if the use of autogenous (auto)vaccines is considered. Birds with EPS originated from one house of each of three layer farms and one broiler breeder farm. Farms were considered as separate epidemiological units. In total, six flocks were examined including two successive flocks of one layer farm and the broiler breeder farm. E. coli colonies (one per bird) from nine to 16 hens of each flock were genotyped. The clonality of E. coli within birds was studied using five colonies of each of nine to 14 birds per flock. E. coli genotypes, which totalled 15, differed between farms and flocks except for two successive layer flocks that shared three genotypes. One to five genotypes were found per flock with one or two genotypes dominating each outbreak. Within hens, E. coli bacteria were always clonal. Colonies of the same PFGE type always had the same multilocus sequence type. However, four PFGE types shared sequence type 95. Neither PFGE types nor multilocus sequence types were unambiguously related to avian pathogenic E. coli from EPS. In cases where persistence of E. coli strains associated with EPS is found to occur frequently, routine genotyping to select strains for autovaccines should be considered.

  17. A Wide Variety of Clostridium perfringens Type A Food-Borne Isolates That Carry a Chromosomal cpe Gene Belong to One Multilocus Sequence Typing Cluster

    PubMed Central

    Xiao, Yinghua; Wagendorp, Arjen; Moezelaar, Roy; Abee, Tjakko

    2012-01-01

    Of 98 suspected food-borne Clostridium perfringens isolates obtained from a nationwide survey by the Food and Consumer Product Safety Authority in The Netherlands, 59 strains were identified as C. perfringens type A. Using PCR-based techniques, the cpe gene encoding enterotoxin was detected in eight isolates, showing a chromosomal location for seven isolates and a plasmid location for one isolate. Further characterization of these strains by using (GTG)5 fingerprint repetitive sequence-based PCR analysis distinguished C. perfringens from other sulfite-reducing clostridia but did not allow for differentiation between various types of C. perfringens strains. To characterize the C. perfringens strains further, multilocus sequence typing (MLST) analysis was performed on eight housekeeping genes of both enterotoxic and non-cpe isolates, and the data were combined with a previous global survey covering strains associated with food poisoning, gas gangrene, and isolates from food or healthy individuals. This revealed that the chromosomal cpe strains (food strains and isolates from food poisoning cases) belong to a distinct cluster that is significantly distant from all the other cpe plasmid-carrying and cpe-negative strains. These results suggest that different groups of C. perfringens have undergone niche specialization and that a distinct group of food isolates has specific core genome sequences. Such findings have epidemiological and evolutionary significance. Better understanding of the origin and reservoir of enterotoxic C. perfringens may allow for improved control of this organism in foods. PMID:22865060

  18. Inferring a Population Structure for Staphylococcus epidermidis from Multilocus Sequence Typing Data

    PubMed Central

    Miragaia, M.; Thomas, J. C.; Couto, I.; Enright, M. C.; de Lencastre, H.

    2007-01-01

    Despite its importance as a human pathogen, information on population structure and global epidemiology of Staphylococcus epidermidis is scarce and the relative importance of the mechanisms contributing to clonal diversification is unknown. In this study, we addressed these issues by analyzing a representative collection of S. epidermidis isolates from diverse geographic and clinical origins using multilocus sequence typing (MLST). Additionally, we characterized the mobile element (SCCmec) carrying the genetic determinant of methicillin resistance. The 217 S. epidermidis isolates from our collection were split by MLST into 74 types, suggesting a high level of genetic diversity. Analysis of MLST data using the eBURST algorithm revealed the existence of nine epidemic clonal lineages that were disseminated worldwide. One single clonal lineage (clonal complex 2) comprised 74% of the isolates, whereas the remaining isolates were clustered into 8 minor clonal lineages and 13 singletons. According to our evolutionary model, SCCmec was acquired at least 56 times by S. epidermidis. Although geographic dissemination of S. epidermidis strains and the value of the index of association between the alleles, 0.2898 (P < 0.05), support the clonality of S. epidermidis species, examination of the sequence changes at MLST loci during clonal diversification showed that recombination gives rise to new alleles approximately twice as frequently as point mutations. We suggest that S. epidermidis has a population with an epidemic structure, in which nine clones have emerged upon a recombining background and evolved quickly through frequent transfer of genetic mobile elements, including SCCmec. PMID:17220222
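    The index of association quoted in this abstract measures linkage between alleles at different loci. A commonly used definition is I_A = V_O / V_E - 1, where V_O is the observed variance of the number of loci at which pairs of isolates differ and V_E is the variance expected under linkage equilibrium, the sum over loci of h_j(1 - h_j). The following is a minimal sketch under that definition, not the procedure used by the authors: the function name `index_of_association`, the choice to estimate h_j as the proportion of isolate pairs differing at locus j, and the example profiles are all assumptions, and published software may use different variance estimators.

```python
from itertools import combinations

def index_of_association(profiles):
    """I_A = V_O / V_E - 1 for allelic profiles (one tuple of alleles per isolate).

    Assumes at least one locus is polymorphic, so that V_E > 0.
    """
    n_loci = len(profiles[0])
    pairs = list(combinations(profiles, 2))
    # K: number of loci at which each pair of isolates differs.
    distances = [sum(x != y for x, y in zip(p, q)) for p, q in pairs]
    mean_k = sum(distances) / len(pairs)
    v_obs = sum((k - mean_k) ** 2 for k in distances) / len(pairs)
    # h_j: proportion of isolate pairs that differ at locus j.
    h = [sum(p[j] != q[j] for p, q in pairs) / len(pairs) for j in range(n_loci)]
    v_exp = sum(hj * (1.0 - hj) for hj in h)
    return v_obs / v_exp - 1.0

# Hypothetical, fully clonal sample: two lineages, three loci, no recombinants.
clonal = [(1, 1, 1)] * 3 + [(2, 2, 2)] * 3

# Hypothetical sample in which alleles at the two loci are freely reassorted.
mixed = [(1, 1), (1, 2), (2, 1), (2, 2)]

print(index_of_association(clonal))
print(index_of_association(mixed))
```

    Values well above zero, as for the clonal example, indicate that alleles at different loci tend to be inherited together; in practice significance is usually assessed by permutation, which this sketch omits.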

  19. Population genetic analysis of Enterocytozoon bieneusi in humans.

    PubMed

    Li, Wei; Cama, Vitaliano; Feng, Yaoyu; Gilman, Robert H; Bern, Caryn; Zhang, Xichen; Xiao, Lihua

    2012-01-01

    Genotyping based on sequence analysis of the ribosomal internal transcribed spacer has revealed significant genetic diversity in Enterocytozoon bieneusi. Thus far, the population genetics of E. bieneusi and its significance in the epidemiology of microsporidiosis have not been examined. In this study, multilocus sequence typing of E. bieneusi in AIDS patients in Lima, Peru was conducted, using 72 specimens previously genotyped as A, D, IV, EbpC, WL11, Peru7, Peru8, Peru10 and Peru11 at the internal transcribed spacer locus. Altogether, 39 multilocus genotypes were identified among the 72 specimens. The strong intragenic linkage disequilibria and limited genetic recombination observed among markers were indicative of an overall clonal population structure of E. bieneusi. Measures of pair-wise intergenic linkage disequilibria and a standardised index of association (IAS) based on allelic profile data further supported this conclusion. Both sequence-based and allelic profile-based phylogenetic analyses showed the presence of two genetically isolated groups in the study population, one (group 1) containing isolates of the anthroponotic internal transcribed spacer genotype A, and the other (group 2) containing isolates of multiple internal transcribed spacer genotypes (mainly genotypes D and IV) with zoonotic potential. The measurement of linkage disequilibria and recombination indicated group 2 had a clonal population structure, whereas group 1 had an epidemic population structure. The formation of the two sub-populations was confirmed by STRUCTURE and Wright's fixation index (FST) analyses. The data highlight the power of MLST in understanding the epidemiology of E. bieneusi. Published by Elsevier Ltd.

  20. Multilocus adaptation associated with heat resistance in reef-building corals.

    PubMed

    Bay, Rachael A; Palumbi, Stephen R

    2014-12-15

    The evolution of tolerance to future climate change depends on the standing stock of genetic variation for resistance to climate-related impacts, but genes contributing to climate tolerance in wild populations are poorly described in number and effect. Physiology and gene expression patterns have shown that corals living in naturally high-temperature microclimates are more resistant to bleaching because of both acclimation and fixed effects, including adaptation. To search for potential genetic correlates of these fixed effects, we genotyped 15,399 single nucleotide polymorphisms (SNPs) in 23 individual tabletop corals, Acropora hyacinthus, within a natural temperature mosaic in backreef lagoons on Ofu Island, American Samoa. Despite overall lack of population substructure, we identified 114 highly divergent SNPs as candidates for environmental selection, via multiple stringent outlier tests, and correlations with temperature. Corals from the warmest reef location had higher minor allele frequencies across these candidate SNPs, a pattern not seen for noncandidate loci. Furthermore, within backreef pools, colonies in the warmest microclimates had a higher number and frequency of alternative alleles at candidate loci. These data suggest mild selection for alternate alleles at many loci in these corals during high heat episodes and possible maintenance of extensive polymorphism through multilocus balancing selection in a heterogeneous environment. In this case, a natural population harbors a reservoir of alleles preadapted to high temperatures, suggesting potential for future evolutionary response to climate change. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. New Multilocus Variable-Number Tandem-Repeat Analysis Tool for Surveillance and Local Epidemiology of Bacterial Leaf Blight and Bacterial Leaf Streak of Rice Caused by Xanthomonas oryzae

    PubMed Central

    Poulin, L.; Grygiel, P.; Magne, M.; Rodriguez-R, L. M.; Forero Serna, N.; Zhao, S.; El Rafii, M.; Dao, S.; Tekete, C.; Wonni, I.; Koita, O.; Pruvost, O.; Verdier, V.; Vernière, C.

    2014-01-01

    Multilocus variable-number tandem-repeat analysis (MLVA) is efficient for routine typing and for investigating the genetic structures of natural microbial populations. Two distinct pathovars of Xanthomonas oryzae can cause significant crop losses in tropical and temperate rice-growing countries. Bacterial leaf streak is caused by X. oryzae pv. oryzicola, and bacterial leaf blight is caused by X. oryzae pv. oryzae. For the latter, two genetic lineages have been described in the literature. We developed a universal MLVA typing tool both for the identification of the three X. oryzae genetic lineages and for epidemiological analyses. Sixteen candidate variable-number tandem-repeat (VNTR) loci were selected according to their presence and polymorphism in 10 draft or complete genome sequences of the three X. oryzae lineages and by VNTR sequencing of a subset of loci of interest in 20 strains per lineage. The MLVA-16 scheme was then applied to 338 strains of X. oryzae representing different pathovars and geographical locations. Linkage disequilibrium between MLVA loci was calculated by index association on different scales, and the 16 loci showed linear Mantel correlation with MLSA data on 56 X. oryzae strains, suggesting that they provide a good phylogenetic signal. Furthermore, analyses of sets of strains for different lineages indicated the possibility of using the scheme for deeper epidemiological investigation on small spatial scales. PMID:25398857

  2. Prevalence of three Campylobacter species, C. jejuni, C. coli, and C. lari, using multilocus sequence typing in wild birds of the Mid-Atlantic region, USA.

    PubMed

    Keller, Judith I; Shriver, W Gregory

    2014-01-01

    Campylobacter jejuni is responsible for the majority of bacterial foodborne gastroenteritis in the US, usually due to the consumption of undercooked poultry. Research on which avian species transmit the bacterium is limited, especially in the US. We sampled wild birds in three families (Anatidae, Scolopacidae, and Laridae) in eastern North America to determine the prevalence and specific strains of Campylobacter. The overall prevalence of Campylobacter spp. was 9.2% for all wild birds sampled (n = 781). Campylobacter jejuni was the most prevalent species (8.1%), while Campylobacter coli and Campylobacter lari prevalence estimates were low (1.4% and 0.3%, respectively). We used multilocus sequence typing PCR specific to C. jejuni to characterize clonal complexes and sequence types isolated from wild bird samples and detected 13 novel sequence types, along with a clonal complex previously only associated with human disease (ST-658). Wild birds share an increasing amount of habitat with humans as more landscapes become fragmented and developed for human needs. Wild birds are and will remain an important aspect of public health due to their ability to carry and disperse emerging zoonotic pathogens or their arthropod vectors. As basic information such as prevalence is limited or lacking from a majority of wild birds in the US, this study provides further insight into Campylobacter epidemiology, host preference, and strain characterization of C. jejuni.

  3. Bayesian structural equation modeling in sport and exercise psychology.

    PubMed

    Stenling, Andreas; Ivarsson, Andreas; Johnson, Urban; Lindwall, Magnus

    2015-08-01

    Bayesian statistics is on the rise in mainstream psychology, but applications in sport and exercise psychology research are scarce. In this article, the foundations of Bayesian analysis are introduced, and we will illustrate how to apply Bayesian structural equation modeling in a sport and exercise psychology setting. More specifically, we contrasted a confirmatory factor analysis on the Sport Motivation Scale II estimated with the most commonly used estimator, maximum likelihood, and a Bayesian approach with weakly informative priors for cross-loadings and correlated residuals. The results indicated that the model with Bayesian estimation and weakly informative priors provided a good fit to the data, whereas the model estimated with a maximum likelihood estimator did not produce a well-fitting model. The reasons for this discrepancy between maximum likelihood and Bayesian estimation are discussed as well as potential advantages and caveats with the Bayesian approach.

  4. Molecular phylogenetics and species delimitation of leaf-toed geckos (Phyllodactylidae: Phyllodactylus) throughout the Mexican tropical dry forest.

    PubMed

    Blair, Christopher; Méndez de la Cruz, Fausto R; Law, Christopher; Murphy, Robert W

    2015-03-01

    Methods and approaches for accurate species delimitation continue to be a highly controversial subject in the systematics community. Inaccurate assessment of species' limits precludes accurate inference of historical evolutionary processes. Recent evidence suggests that multilocus coalescent methods show promise in delimiting species in cryptic clades. We combine multilocus sequence data with coalescence-based phylogenetics in a hypothesis-testing framework to assess species limits and elucidate the timing of diversification in leaf-toed geckos (Phyllodactylus) of Mexico's dry forests. Tropical deciduous forests (TDF) of the Neotropics are among the planet's most diverse ecosystems. However, in comparison to moist tropical forests, little is known about the mode and tempo of biotic evolution throughout this threatened biome. We find increased speciation and substantial, cryptic molecular diversity originating following the formation of Mexican TDF 30-20 million years ago due to orogenesis of the Sierra Madre Occidental and Mexican Volcanic Belt. Phylogenetic results suggest that the Mexican Volcanic Belt, the Rio Fuerte, and Isthmus of Tehuantepec may be important biogeographic barriers. Single- and multilocus coalescent analyses suggest that nearly every sampling locality may be a distinct species. These results suggest unprecedented levels of diversity, a complex evolutionary history, and that the formation and expansion of TDF vegetation in the Miocene may have influenced subsequent cladogenesis of leaf-toed geckos throughout western Mexico. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. A novel Bayesian approach to quantify clinical variables and to determine their spectroscopic counterparts in 1H NMR metabonomic data

    PubMed Central

    Vehtari, Aki; Mäkinen, Ville-Petteri; Soininen, Pasi; Ingman, Petri; Mäkelä, Sanna M; Savolainen, Markku J; Hannuksela, Minna L; Kaski, Kimmo; Ala-Korpela, Mika

    2007-01-01

    Background A key challenge in metabonomics is to uncover quantitative associations between multidimensional spectroscopic data and biochemical measures used for disease risk assessment and diagnostics. Here we focus on clinically relevant estimation of lipoprotein lipids by 1H NMR spectroscopy of serum. Results A Bayesian methodology, with a biochemical motivation, is presented for a real 1H NMR metabonomics data set of 75 serum samples. Lipoprotein lipid concentrations were independently obtained for these samples via ultracentrifugation and specific biochemical assays. The Bayesian models were constructed by Markov chain Monte Carlo (MCMC) and they showed remarkably good quantitative performance, the predictive R-values being 0.985 for the very low density lipoprotein triglycerides (VLDL-TG), 0.787 for the intermediate, 0.943 for the low, and 0.933 for the high density lipoprotein cholesterol (IDL-C, LDL-C and HDL-C, respectively). The modelling produced a kernel-based reformulation of the data, the parameters of which coincided with the well-known biochemical characteristics of the 1H NMR spectra; particularly for VLDL-TG and HDL-C the Bayesian methodology was able to clearly identify the most characteristic resonances within the heavily overlapping information in the spectra. For IDL-C and LDL-C the resulting model kernels were more complex than those for VLDL-TG and HDL-C, probably reflecting the severe overlap of the IDL and LDL resonances in the 1H NMR spectra. Conclusion The systematic use of Bayesian MCMC analysis is computationally demanding. Nevertheless, the combination of high-quality quantification and the biochemical rationale of the resulting models is expected to be useful in the field of metabonomics. PMID:17493257

  6. Substantial advantage of a combined Bayesian and genotyping approach in testosterone doping tests.

    PubMed

    Schulze, Jenny Jakobsson; Lundmark, Jonas; Garle, Mats; Ekström, Lena; Sottas, Pierre-Edouard; Rane, Anders

    2009-03-01

    Testosterone abuse is conventionally assessed by the urinary testosterone/epitestosterone (T/E) ratio, levels above 4.0 being considered suspicious. A deletion polymorphism in the gene coding for UGT2B17 is strongly associated with reduced testosterone glucuronide (TG) levels in urine. Many of the individuals devoid of the gene would not reach a T/E ratio of 4.0 after testosterone intake. Future test programs will most likely shift from population-based to individual-based T/E cut-off ratios using Bayesian inference. A longitudinal analysis is dependent on an individual's true negative baseline T/E ratio. The aim was to investigate whether it is possible to increase the sensitivity and specificity of the T/E test by addition of UGT2B17 genotype information in a Bayesian framework. A single intramuscular dose of 500 mg testosterone enanthate was given to 55 healthy male volunteers with either two, one or no allele (ins/ins, ins/del or del/del) of the UGT2B17 gene. Urinary excretion of TG and the T/E ratio were measured over 15 days. The Bayesian analysis was conducted to calculate the individual T/E cut-off ratio. When adding the genotype information, the program returned lower individual cut-off ratios in all del/del subjects, increasing the sensitivity of the test considerably. It will be difficult, if not impossible, to discriminate between a true negative baseline T/E value and a false negative one without knowledge of the UGT2B17 genotype. UGT2B17 genotype information is crucial, both to decide which initial cut-off ratio to use for an individual, and for increasing the sensitivity of the Bayesian analysis.

  7. Bayesian inference of the number of factors in gene-expression analysis: application to human virus challenge studies.

    PubMed

    Chen, Bo; Chen, Minhua; Paisley, John; Zaas, Aimee; Woods, Christopher; Ginsburg, Geoffrey S; Hero, Alfred; Lucas, Joseph; Dunson, David; Carin, Lawrence

    2010-11-09

    Nonparametric Bayesian techniques have been developed recently to extend the sophistication of factor models, allowing one to infer the number of appropriate factors from the observed data. We consider such techniques for sparse factor analysis, with application to gene-expression data from three virus challenge studies. Particular attention is placed on employing the Beta Process (BP), the Indian Buffet Process (IBP), and related sparseness-promoting techniques to infer a proper number of factors. The posterior density function on the model parameters is computed using Gibbs sampling and variational Bayesian (VB) analysis. Time-evolving gene-expression data are considered for respiratory syncytial virus (RSV), Rhino virus, and influenza, using blood samples from healthy human subjects. These data were acquired in three challenge studies, each executed after receiving institutional review board (IRB) approval from Duke University. Comparisons are made between several alternative means of performing nonparametric factor analysis on these data, with comparisons as well to sparse-PCA and Penalized Matrix Decomposition (PMD), closely related non-Bayesian approaches. Applying the Beta Process to the factor scores, or to the singular values of a pseudo-SVD construction, the proposed algorithms infer the number of factors in gene-expression data. For real data the "true" number of factors is unknown; in our simulations we consider a range of noise variances, and the proposed Bayesian models inferred the number of factors accurately relative to other methods in the literature, such as sparse-PCA and PMD. We have also identified a "pan-viral" factor of importance for each of the three viruses considered in this study. We have identified a set of genes associated with this pan-viral factor, of interest for early detection of such viruses based upon the host response, as quantified via gene-expression data.

  8. Bias in logistic regression due to imperfect diagnostic test results and practical correction approaches.

    PubMed

    Valle, Denis; Lima, Joanna M Tucker; Millar, Justin; Amratia, Punam; Haque, Ubydul

    2015-11-04

    Logistic regression is a statistical model widely used in cross-sectional and cohort studies to identify and quantify the effects of potential disease risk factors. However, the impact of imperfect tests on adjusted odds ratios (and thus on the identification of risk factors) is under-appreciated. The purpose of this article is to draw attention to the problem associated with modelling imperfect diagnostic tests, and propose simple Bayesian models to adequately address this issue. A systematic literature review was conducted to determine the proportion of malaria studies that appropriately accounted for false-negatives/false-positives in a logistic regression setting. Inference from the standard logistic regression was also compared with that from three proposed Bayesian models using simulations and malaria data from the western Brazilian Amazon. A systematic literature review suggests that malaria epidemiologists are largely unaware of the problem of using logistic regression to model imperfect diagnostic test results. Simulation results reveal that statistical inference can be substantially improved when using the proposed Bayesian models versus the standard logistic regression. Finally, analysis of original malaria data with one of the proposed Bayesian models reveals that microscopy sensitivity is strongly influenced by how long people have lived in the study region, and an important risk factor (i.e., participation in forest extractivism) is identified that would have been missed by standard logistic regression. Given the numerous diagnostic methods employed by malaria researchers and the ubiquitous use of logistic regression to model the results of these diagnostic tests, this paper provides critical guidelines to improve data analysis practice in the presence of misclassification error. Easy-to-use code that can be readily adapted to WinBUGS is provided, enabling straightforward implementation of the proposed Bayesian models.
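
    The effect of test misclassification described in this abstract can be illustrated with a minimal sketch. It is not the paper's Bayesian model: it only assumes a test with fixed sensitivity and specificity, and every numeric value and function name below is hypothetical. It shows how an odds ratio computed from observed test results is pulled toward 1 relative to the odds ratio for true disease status.

```python
def observed_positive_prob(p_true, sensitivity, specificity):
    """P(test positive) when the true disease probability is p_true."""
    # True positives plus false positives.
    return sensitivity * p_true + (1.0 - specificity) * (1.0 - p_true)


def odds(p):
    """Convert a probability into odds."""
    return p / (1.0 - p)


def odds_ratio(p_exposed, p_unexposed):
    """Odds ratio comparing an exposed group with an unexposed group."""
    return odds(p_exposed) / odds(p_unexposed)


# Hypothetical true disease probabilities and test characteristics.
p_unexposed, p_exposed = 0.10, 0.30
sensitivity, specificity = 0.80, 0.95

true_or = odds_ratio(p_exposed, p_unexposed)
observed_or = odds_ratio(
    observed_positive_prob(p_exposed, sensitivity, specificity),
    observed_positive_prob(p_unexposed, sensitivity, specificity),
)

print(f"odds ratio for true status:    {true_or:.3f}")
print(f"odds ratio from test results:  {observed_or:.3f}")
```

    With these assumed values the odds ratio based on test results is smaller than the one based on true status, which is the kind of bias a model that includes sensitivity and specificity explicitly is meant to correct.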

  9. Prediction and assimilation of surf-zone processes using a Bayesian network: Part II: Inverse models

    USGS Publications Warehouse

    Plant, Nathaniel G.; Holland, K. Todd

    2011-01-01

    A Bayesian network model has been developed to simulate a relatively simple problem of wave propagation in the surf zone (detailed in Part I). Here, we demonstrate that this Bayesian model can provide both inverse modeling and data-assimilation solutions for predicting offshore wave heights and depth estimates given limited wave-height and depth information from an onshore location. The inverse method is extended to allow data assimilation using observational inputs that are not compatible with deterministic solutions of the problem. These inputs include sand bar positions (instead of bathymetry) and estimates of the intensity of wave breaking (instead of wave-height observations). Our results indicate that wave breaking information is essential to reduce prediction errors. In many practical situations, this information could be provided from a shore-based observer or from remote-sensing systems. We show that various combinations of the assimilated inputs significantly reduce the uncertainty in the estimates of water depths and wave heights in the model domain. Application of the Bayesian network model to new field data demonstrated significant predictive skill (R2 = 0.7) for the inverse estimate of a month-long time series of offshore wave heights. The Bayesian inverse results include uncertainty estimates that were shown to be most accurate when given uncertainty in the inputs (e.g., depth and tuning parameters). Furthermore, the inverse modeling was extended to directly estimate tuning parameters associated with the underlying wave-process model. The inverse estimates of the model parameters not only showed an offshore wave height dependence consistent with results of previous studies but the uncertainty estimates of the tuning parameters also explain previously reported variations in the model parameters.

  10. Bayesian Inference for Functional Dynamics Exploring in fMRI Data.

    PubMed

    Guo, Xuan; Liu, Bing; Chen, Le; Chen, Guantao; Pan, Yi; Zhang, Jing

    2016-01-01

    This paper aims to review state-of-the-art Bayesian-inference-based methods applied to functional magnetic resonance imaging (fMRI) data. Particularly, we focus on one specific long-standing challenge in the computational modeling of fMRI datasets: how to effectively explore typical functional interactions from fMRI time series and the corresponding boundaries of temporal segments. Bayesian inference is a method of statistical inference which has been shown to be a powerful tool to encode dependence relationships among the variables with uncertainty. Here we provide an introduction to a group of Bayesian-inference-based methods for fMRI data analysis, which were designed to detect magnitude or functional connectivity change points and to infer their functional interaction patterns based on corresponding temporal boundaries. We also provide a comparison of three popular Bayesian models, that is, Bayesian Magnitude Change Point Model (BMCPM), Bayesian Connectivity Change Point Model (BCCPM), and Dynamic Bayesian Variable Partition Model (DBVPM), and give a summary of their applications. We envision that more delicate Bayesian inference models will be emerging and play increasingly important roles in modeling brain functions in the years to come.

  11. Bayesian network interface for assisting radiology interpretation and education

    NASA Astrophysics Data System (ADS)

    Duda, Jeffrey; Botzolakis, Emmanuel; Chen, Po-Hao; Mohan, Suyash; Nasrallah, Ilya; Rauschecker, Andreas; Rudie, Jeffrey; Bryan, R. Nick; Gee, James; Cook, Tessa

    2018-03-01

    In this work, we present the use of Bayesian networks for radiologist decision support during clinical interpretation. This computational approach has the advantage of avoiding incorrect diagnoses that result from known human cognitive biases such as anchoring bias, framing effect, availability bias, and premature closure. To integrate Bayesian networks into clinical practice, we developed an open-source web application that provides diagnostic support for a variety of radiology disease entities (e.g., basal ganglia diseases, bone lesions). The Clinical tool presents the user with a set of buttons representing clinical and imaging features of interest. These buttons are used to set the value for each observed feature. As features are identified, the conditional probabilities for each possible diagnosis are updated in real time. Additionally, using sensitivity analysis, the interface may be set to inform the user which remaining imaging features provide maximum discriminatory information to choose the most likely diagnosis. The Case Submission tools allow the user to submit a validated case and the associated imaging features to a database, which can then be used for future tuning/testing of the Bayesian networks. These submitted cases are then reviewed by an assigned expert using the provided QC tool. The Research tool presents users with cases with previously labeled features and a chosen diagnosis, for the purpose of performance evaluation. Similarly, the Education page presents cases with known features, but provides real time feedback on feature selection.
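
    The real-time probability update mentioned in this abstract can be sketched under a strong simplifying assumption: features are treated as conditionally independent given the diagnosis (a naive Bayes structure), which is only a special case of a Bayesian network and not necessarily the structure used by the authors. The diagnoses, features, and probabilities below are hypothetical placeholders.

```python
def update_posterior(prior, likelihoods, observed):
    """Return P(diagnosis | observed features) under a naive Bayes assumption.

    prior:       dict diagnosis -> P(diagnosis)
    likelihoods: dict diagnosis -> dict feature -> P(feature present | diagnosis)
    observed:    dict feature -> bool (True if present, False if absent)
    """
    unnormalized = {}
    for diagnosis, p in prior.items():
        for feature, present in observed.items():
            p_feature = likelihoods[diagnosis][feature]
            p *= p_feature if present else (1.0 - p_feature)
        unnormalized[diagnosis] = p
    total = sum(unnormalized.values())
    return {d: v / total for d, v in unnormalized.items()}


# Hypothetical two-diagnosis example with two imaging features.
prior = {"diagnosis_A": 0.5, "diagnosis_B": 0.5}
likelihoods = {
    "diagnosis_A": {"feature_1": 0.9, "feature_2": 0.2},
    "diagnosis_B": {"feature_1": 0.3, "feature_2": 0.6},
}

# Setting one feature at a time mimics a user pressing feature buttons.
after_first = update_posterior(prior, likelihoods, {"feature_1": True})
after_second = update_posterior(
    prior, likelihoods, {"feature_1": True, "feature_2": False}
)
print(after_first)
print(after_second)
```

    Each additional observed feature multiplies in one more likelihood term before renormalizing, so the ranking of diagnoses can be refreshed after every input.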

  12. Bayesian Scoring Systems for Military Pelvic and Perineal Blast Injuries: Is it Time to Take a New Approach?

    PubMed

    Mossadegh, Somayyeh; He, Shan; Parker, Paul

    2016-05-01

    Various injury severity scores exist for trauma; it is known that they do not correlate accurately to military injuries. A promising anatomical scoring system for blast pelvic and perineal injury led to the development of an improved scoring system using machine-learning techniques. An unbiased genetic algorithm selected optimal anatomical and physiological parameters from 118 military cases. A Naïve Bayesian model was built using the proposed parameters to predict the probability of survival. Ten-fold cross validation was employed to evaluate its performance. Our model significantly out-performed Injury Severity Score (ISS), Trauma ISS, New ISS, and the Revised Trauma Score in virtually all areas; positive predictive value 0.8941, specificity 0.9027, accuracy 0.9056, and area under curve 0.9059. A two-sample t test showed that the predictive performance of the proposed scoring system was significantly better than the other systems (p < 0.001). With limited resources and the simplest of Bayesian methodologies, we have demonstrated that the Naïve Bayesian model performed significantly better in virtually all areas assessed by current scoring systems used for trauma. This is encouraging and highlights that more can be done to improve trauma systems not only for our military injured, but also for civilian trauma victims. Reprint & Copyright © 2016 Association of Military Surgeons of the U.S.

  13. Fine Mapping Causal Variants with an Approximate Bayesian Method Using Marginal Test Statistics

    PubMed Central

    Chen, Wenan; Larrabee, Beth R.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Haralambieva, Iana H.; Poland, Gregory A.; Schaid, Daniel J.

    2015-01-01

    Two recently developed fine-mapping methods, CAVIAR and PAINTOR, demonstrate better performance over other fine-mapping methods. They also have the advantage of using only the marginal test statistics and the correlation among SNPs. Both methods leverage the fact that the marginal test statistics asymptotically follow a multivariate normal distribution and are likelihood based. However, their relationship with Bayesian fine mapping, such as BIMBAM, is not clear. In this study, we first show that CAVIAR and BIMBAM are actually approximately equivalent to each other. This leads to a fine-mapping method using marginal test statistics in the Bayesian framework, which we call CAVIAR Bayes factor (CAVIARBF). Another advantage of the Bayesian framework is that it can answer both association and fine-mapping questions. We also used simulations to compare CAVIARBF with other methods under different numbers of causal variants. The results showed that both CAVIARBF and BIMBAM have better performance than PAINTOR and other methods. Compared to BIMBAM, CAVIARBF has the advantage of using only marginal test statistics and takes about one-quarter to one-fifth of the running time. We applied different methods on two independent cohorts of the same phenotype. Results showed that CAVIARBF, BIMBAM, and PAINTOR selected the same top 3 SNPs; however, CAVIARBF and BIMBAM had better consistency in selecting the top 10 ranked SNPs between the two cohorts. Software is available at https://bitbucket.org/Wenan/caviarbf. PMID:25948564
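
    The idea of fine mapping from marginal test statistics can be sketched for the simplest case. The code below is not CAVIARBF: it assumes exactly one causal variant and equal prior probability for every SNP, so the SNP correlation matrix is not needed, and it uses a normal prior with variance `w` on the expected z-score under the alternative (the value of `w` and all function names are assumptions). Under the null a z-score is N(0, 1) and under the alternative it is N(0, 1 + w), which gives the Bayes factor used here.

```python
import math


def log_bayes_factor(z, w):
    """Log Bayes factor for one SNP: N(0, 1 + w) against N(0, 1) at z."""
    return -0.5 * math.log1p(w) + 0.5 * z * z * w / (1.0 + w)


def single_causal_posteriors(z_scores, w=10.0):
    """Posterior probability that each SNP is the single causal variant."""
    logs = [log_bayes_factor(z, w) for z in z_scores]
    largest = max(logs)  # subtract the maximum for numerical stability
    weights = [math.exp(value - largest) for value in logs]
    total = sum(weights)
    return [value / total for value in weights]


# Hypothetical marginal z-scores for three SNPs in one region.
z_scores = [0.5, 4.2, 1.1]
posteriors = single_causal_posteriors(z_scores)
for z, p in zip(z_scores, posteriors):
    print(f"z = {z:4.1f}  posterior = {p:.4f}")
```

    Handling several causal variants, as the methods compared in the abstract do, requires the joint multivariate normal distribution of the z-scores and therefore the correlation among SNPs, which this sketch leaves out.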

  14. An ecosystem service approach to support integrated pond management: a case study using Bayesian belief networks--highlighting opportunities and risks.

    PubMed

    Landuyt, Dries; Lemmens, Pieter; D'hondt, Rob; Broekx, Steven; Liekens, Inge; De Bie, Tom; Declerck, Steven A J; De Meester, Luc; Goethals, Peter L M

    2014-12-01

    Freshwater ponds deliver a broad range of ecosystem services (ESS). Taking into account this broad range of services to attain cost-effective ESS delivery is an important challenge facing integrated pond management. To assess the strengths and weaknesses of an ESS approach to support decisions in integrated pond management, we applied it on a small case study in Flanders, Belgium. A Bayesian belief network model was developed to assess ESS delivery under three alternative pond management scenarios: intensive fish farming (IFF), extensive fish farming (EFF) and nature conservation management (NCM). A probabilistic cost-benefit analysis was performed that includes both costs associated with pond management practices and benefits associated with ESS delivery. Whether or not a particular ESS is included in the analysis affects the identification of the most preferable management scenario by the model. Assessing the delivery of a more complete set of ecosystem services tends to shift the results away from intensive management to more biodiversity-oriented management scenarios. The proposed methodology illustrates the potential of Bayesian belief networks. BBNs facilitate knowledge integration and their modular nature encourages future model expansion to more encompassing sets of services. Yet, we also illustrate the key weaknesses of such exercises, being that the choice whether or not to include a particular ecosystem service may determine the suggested optimal management practice. Copyright © 2014 Elsevier Ltd. All rights reserved.

  15. Combining cow and bull reference populations to increase accuracy of genomic prediction and genome-wide association studies.

    PubMed

    Calus, M P L; de Haas, Y; Veerkamp, R F

    2013-10-01

    Genomic selection holds the promise to be particularly beneficial for traits that are difficult or expensive to measure, such that access to phenotypes on large daughter groups of bulls is limited. Instead, cow reference populations can be generated, potentially supplemented with existing information from the same or (highly) correlated traits available on bull reference populations. The objective of this study, therefore, was to develop a model to perform genomic predictions and genome-wide association studies based on a combined cow and bull reference data set, with the accuracy of the phenotypes differing between the cow and bull genomic selection reference populations. The developed bivariate Bayesian stochastic search variable selection model allowed for an unbalanced design by imputing residuals in the residual updating scheme for all missing records. The performance of this model is demonstrated on a real data example, where the analyzed trait, being milk fat or protein yield, was either measured only on a cow or a bull reference population, or recorded on both. Our results were that the developed bivariate Bayesian stochastic search variable selection model was able to analyze 2 traits, even though animals had measurements on only 1 of 2 traits. The Bayesian stochastic search variable selection model yielded consistently higher accuracy for fat yield compared with a model without variable selection, both for the univariate and bivariate analyses, whereas the accuracy of both models was very similar for protein yield. The bivariate model identified several additional quantitative trait loci peaks compared with the single-trait models on either trait. In addition, the bivariate models showed a marginal increase in accuracy of genomic predictions for the cow traits (0.01-0.05), although a greater increase in accuracy is expected as the size of the bull population increases. 
Our results emphasize that the chosen values of priors in Bayesian genomic prediction models are especially important in small data sets. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  16. How Socio-Environmental Factors Are Associated with Japanese Encephalitis in Shaanxi, China—A Bayesian Spatial Analysis

    PubMed Central

    Zhang, Shaobai; Hu, Wenbiao; Zhuang, Guihua

    2018-01-01

    Evidence indicated that socio-environmental factors were associated with occurrence of Japanese encephalitis (JE). This study explored the association of climate and socioeconomic factors with JE (2006–2014) in Shaanxi, China. JE data at the county level in Shaanxi were supplied by Shaanxi Center for Disease Control and Prevention. Population and socioeconomic data were obtained from the China Population Census in 2010 and statistical yearbooks. Meteorological data were acquired from the China Meteorological Administration. A Bayesian conditional autoregressive model was used to examine the association of meteorological and socioeconomic factors with JE. A total of 1197 JE cases were included in this study. Urbanization rate was inversely associated with JE incidence during the whole study period. Meteorological variables were significantly associated with JE incidence between 2012 and 2014. The excessive precipitation at lag of 1–2 months in the north of Shaanxi in June 2013 had an impact on the increase of local JE incidence. The spatial residual variations indicated that the whole study area had more stable risk (0.80–1.19 across all the counties) between 2012 and 2014 than earlier years. Public health interventions need to be implemented to reduce JE incidence, especially in rural areas and after extreme weather. PMID:29584661

  17. Bayesian networks in overlay recipe optimization

    NASA Astrophysics Data System (ADS)

    Binns, Lewis A.; Reynolds, Greg; Rigden, Timothy C.; Watkins, Stephen; Soroka, Andrew

    2005-05-01

    Currently, overlay measurements are characterized by "recipe", which defines both physical parameters such as focus, illumination et cetera, and also the software parameters such as algorithm to be used and regions of interest. Setting up these recipes requires both engineering time and wafer availability on an overlay tool, so reducing these requirements will result in higher tool productivity. One of the significant challenges to automating this process is that the parameters are highly and complexly correlated. At the same time, a high level of traceability and transparency is required in the recipe creation process, so a technique that maintains its decisions in terms of well defined physical parameters is desirable. Running time should be short, given the system (automatic recipe creation) is being implemented to reduce overheads. Finally, a failure of the system to determine acceptable parameters should be obvious, so a certainty metric is also desirable. The complex, nonlinear interactions make solution by an expert system difficult at best, especially in the verification of the resulting decision network. The transparency requirements tend to preclude classical neural networks and similar techniques. Genetic algorithms and other "global minimization" techniques require too much computational power (given system footprint and cost requirements). A Bayesian network, however, provides a solution to these requirements. Such a network, with appropriate priors, can be used during recipe creation / optimization not just to select a good set of parameters, but also to guide the direction of search, by evaluating the network state while only incomplete information is available. As a Bayesian network maintains an estimate of the probability distribution of nodal values, a maximum-entropy approach can be utilized to obtain a working recipe in a minimum or near-minimum number of steps. 
In this paper we discuss the potential use of a Bayesian network in such a capacity, reducing the amount of engineering intervention. We discuss the benefits of this approach, especially improved repeatability and traceability of the learning process, and quantification of uncertainty in decisions made. We also consider the problems associated with this approach, especially in detailed construction of network topology, validation of the Bayesian network and the recipes it generates, and issues arising from the integration of a Bayesian network with a complex multithreaded application; these primarily relate to maintaining Bayesian network and system architecture integrity.

  18. MCMC multilocus lod scores: application of a new approach.

    PubMed

    George, Andrew W; Wijsman, Ellen M; Thompson, Elizabeth A

    2005-01-01

    On extended pedigrees with extensive missing data, the calculation of multilocus likelihoods for linkage analysis is often beyond the computational bounds of exact methods. Growing interest therefore surrounds the implementation of Monte Carlo estimation methods. In this paper, we demonstrate the speed and accuracy of a new Markov chain Monte Carlo method for the estimation of linkage likelihoods through an analysis of real data from a study of early-onset Alzheimer's disease. For those data sets where comparison with exact analysis is possible, we achieved up to a 100-fold increase in speed. Our approach is implemented in the program lm_bayes within the framework of the freely available MORGAN 2.6 package for Monte Carlo genetic analysis (http://www.stat.washington.edu/thompson/Genepi/MORGAN/Morgan.shtml).

  19. Bayesian regression models outperform partial least squares methods for predicting milk components and technological properties using infrared spectral data.

    PubMed

    Ferragina, A; de los Campos, G; Vazquez, A I; Cecchinato, A; Bittante, G

    2015-11-01

    The aim of this study was to assess the performance of Bayesian models commonly used for genomic selection to predict "difficult-to-predict" dairy traits, such as milk fatty acid (FA) expressed as percentage of total fatty acids, and technological properties, such as fresh cheese yield and protein recovery, using Fourier-transform infrared (FTIR) spectral data. Our main hypothesis was that Bayesian models that can estimate shrinkage and perform variable selection may improve our ability to predict FA traits and technological traits above and beyond what can be achieved using the current calibration models (e.g., partial least squares, PLS). To this end, we assessed a series of Bayesian methods and compared their prediction performance with that of PLS. The comparison between models was done using the same sets of data (i.e., same samples, same variability, same spectral treatment) for each trait. Data consisted of 1,264 individual milk samples collected from Brown Swiss cows for which gas chromatographic FA composition, milk coagulation properties, and cheese-yield traits were available. For each sample, 2 spectra in the infrared region from 5,011 to 925 cm(-1) were available and averaged before data analysis. Three Bayesian models: Bayesian ridge regression (Bayes RR), Bayes A, and Bayes B, and 2 reference models: PLS and modified PLS (MPLS) procedures, were used to calibrate equations for each of the traits. The Bayesian models used were implemented in the R package BGLR (http://cran.r-project.org/web/packages/BGLR/index.html), whereas the PLS and MPLS were those implemented in the WinISI II software (Infrasoft International LLC, State College, PA). Prediction accuracy was estimated for each trait and model using 25 replicates of a training-testing validation procedure. Compared with PLS, which is currently the most widely used calibration method, MPLS and the 3 Bayesian methods showed significantly greater prediction accuracy. 
Accuracy increased in moving from calibration to external validation methods, and in moving from PLS and MPLS to Bayesian methods, particularly Bayes A and Bayes B. The maximum R(2) value of validation was obtained with Bayes B and Bayes A. For the FA, C10:0 (% of each FA on total FA basis) had the highest R(2) (0.75, achieved with Bayes A and Bayes B), and among the technological traits, fresh cheese yield had an R(2) of 0.82 (achieved with Bayes B). These 2 methods have proven to be useful instruments in shrinking and selecting very informative wavelengths and inferring the structure and functions of the analyzed traits. We conclude that Bayesian models are powerful tools for deriving calibration equations, and, importantly, these equations can be easily developed using existing open-source software. As part of our study, we provide scripts based on the open source R software BGLR, which can be used to train customized prediction equations for other traits or populations. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  20. Development and comparison of Bayesian modularization method in uncertainty assessment of hydrological models

    NASA Astrophysics Data System (ADS)

    Li, L.; Xu, C.-Y.; Engeland, K.

    2012-04-01

    Different approaches have been used for model calibration, parameter estimation and analysis of uncertainty sources in hydrological models. The Bayesian method, which incorporates different sources of information into a single analysis through Bayes' theorem, is one of the most widely used methods for uncertainty assessment of hydrological models. However, none of these applications treats the uncertainty in extreme flows of hydrological model simulations well. This study proposes a Bayesian modularization approach to uncertainty assessment of conceptual hydrological models that considers the extreme flows. It includes a comprehensive comparison and evaluation of uncertainty assessments by the new Bayesian modularization approach and by traditional Bayesian models using the Metropolis-Hastings (MH) algorithm with the daily hydrological model WASMOD. Three likelihood functions are used in combination with the traditional Bayesian approach: the AR(1) plus Normal and time period independent model (Model 1), the AR(1) plus Normal and time period dependent model (Model 2) and the AR(1) plus multi-normal model (Model 3). The results reveal that (1) the simulations derived from the Bayesian modularization method are more accurate, with the highest Nash-Sutcliffe efficiency value, and (2) the Bayesian modularization method performs best in uncertainty estimates of entire flows and in terms of application and computational efficiency. The study thus introduces a new Bayesian approach for reducing the effect of extreme flows on the discharge uncertainty assessment of hydrological models. Keywords: extreme flow, uncertainty assessment, Bayesian modularization, hydrological model, WASMOD
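
    The Nash-Sutcliffe efficiency named in this abstract is a standard goodness-of-fit score for simulated discharge: one minus the ratio of the squared simulation error to the variance of the observations about their mean. The sketch below is only an illustration of that general formula; the function name and the sample values are hypothetical and are not taken from the study.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - SSE / sum of squares about the observed mean.

    A value of 1.0 means a perfect fit; 0.0 means the simulation is no better
    than always predicting the mean of the observations.
    """
    if len(observed) != len(simulated) or not observed:
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    sst = sum((o - mean_obs) ** 2 for o in observed)
    if sst == 0:
        raise ValueError("observed series has zero variance")
    return 1.0 - sse / sst


if __name__ == "__main__":
    # Hypothetical daily discharge values, for illustration only.
    obs = [1.0, 2.0, 3.0]
    print(nash_sutcliffe(obs, obs))              # perfect simulation
    print(nash_sutcliffe(obs, [2.0, 2.0, 2.0]))  # mean-only simulation
```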

  1. Additional molecular findings in 11p15-associated imprinting disorders: an urgent need for multi-locus testing.

    PubMed

    Eggermann, Thomas; Heilsberg, Ann-Kathrin; Bens, Susanne; Siebert, Reiner; Beygo, Jasmin; Buiting, Karin; Begemann, Matthias; Soellner, Lukas

    2014-07-01

    The chromosomal region 11p15 contains two imprinting control regions (ICRs) and is a key player in molecular processes regulated by genomic imprinting. Genomic as well as epigenetic changes affecting 11p15 are associated either with Silver-Russell syndrome (SRS) or Beckwith-Wiedemann syndrome (BWS). In recent years, a growing number of patients affected by imprinting disorders (IDs) have been reported to carry the disease-specific 11p15 hypomethylation patterns as well as methylation changes at imprinted loci at other chromosomal sites (multi-locus methylation defects, MLMD). Furthermore, in several patients, molecular alterations (e.g., uniparental disomies, UPDs) additional to the primary epimutations have been reported. To determine the frequency and distribution of mutations and epimutations in patients referred as SRS or BWS for genetic testing, we retrospectively ascertained our routine patient cohort consisting of 711 patients (SRS, n = 571; BWS, n = 140). As this cohort represents the typical cohort in a routine diagnostic lab without clinical preselection, the detection rates were much lower than those reported from clinically characterized cohorts in the literature (SRS, 19.9%; BWS, 28.6%). Among the molecular subgroups known to be predisposed to MLMD, the frequencies corresponded to those in the literature (SRS, 7.1% in ICR1 hypomethylation carriers; BWS, 20.8% in ICR2 hypomethylation patients). In several patients, more than one epigenetic or genetic disturbance could be identified. Our study illustrates that the complex molecular alterations as well as the overlapping and sometimes unusual clinical findings in patients with imprinting disorders (IDs) often make the decision for a specific imprinting disorder test difficult. 
We therefore suggest implementing molecular assays in routine ID diagnostics that allow the detection of a broad range of (epi)mutation types (epimutations, UPDs, chromosomal imbalances) and cover the clinically most relevant known ID loci, for the following reasons: (a) Multi-locus tests increase the detection rates as they cover numerous loci. (b) Patients with unexpected molecular alterations are detected. (c) The testing of rare imprinting disorders becomes more efficient and the quality of molecular diagnosis increases. (d) The tests identify MLMDs. In the future, the detailed characterization of clinical and molecular findings in ID patients will help us to decipher the complex regulation of imprinting and thereby provide the basis for more directed genetic counseling and therapeutic management in IDs. Molecular disturbances in patients with imprinting disorders are often not restricted to the disease-specific locus but also affect other chromosomal regions. These additional disturbances include methylation defects, uniparental disomies as well as chromosomal imbalances. The identification of these additional alterations is mandatory for well-directed genetic counseling. Furthermore, these findings help to decipher the complex regulation of imprinting.

  2. Mining pharmacovigilance data using Bayesian logistic regression with James-Stein type shrinkage estimation.

    PubMed

    An, Lihua; Fung, Karen Y; Krewski, Daniel

    2010-09-01

    Spontaneous adverse event reporting systems are widely used to identify adverse reactions to drugs following their introduction into the marketplace. In this article, a James-Stein type shrinkage estimation strategy was developed in a Bayesian logistic regression model to analyze pharmacovigilance data. This method is effective in detecting signals as it combines information and borrows strength across medically related adverse events. Computer simulation demonstrated that the shrinkage estimator is uniformly better than the maximum likelihood estimator in terms of mean squared error. This method was used to investigate the possible association of a series of diabetic drugs and the risk of cardiovascular events using data from the Canada Vigilance Online Database.
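
    The idea of "borrowing strength" through shrinkage can be illustrated with the textbook positive-part James-Stein estimator, which pulls a set of noisy estimates toward their common mean. This is a minimal sketch under stated assumptions, not the authors' Bayesian logistic regression: the function name is hypothetical, the inputs are imagined to be log odds ratios for related adverse events, and a single known sampling variance is assumed for all of them.

```python
def james_stein_shrink(estimates, variance):
    """Positive-part James-Stein shrinkage of estimates toward their grand mean.

    Assumes every estimate has the same known sampling variance and that at
    least four estimates are supplied (needed when shrinking toward the mean).
    """
    k = len(estimates)
    if k < 4:
        raise ValueError("need at least 4 estimates to shrink toward the mean")
    grand_mean = sum(estimates) / k
    spread = sum((x - grand_mean) ** 2 for x in estimates)
    if spread == 0:
        return list(estimates)  # all estimates identical: nothing to shrink
    # Shrinkage factor in [0, 1]; the positive part prevents over-shrinking.
    factor = max(0.0, 1.0 - (k - 3) * variance / spread)
    return [grand_mean + factor * (x - grand_mean) for x in estimates]


if __name__ == "__main__":
    # Hypothetical log odds ratios for five related adverse events.
    raw = [0.0, 1.0, 2.0, 3.0, 4.0]
    print(james_stein_shrink(raw, variance=1.0))
```

    With noisier estimates (a larger variance relative to their spread) the factor falls toward zero and every estimate collapses onto the grand mean, which is the sense in which information is pooled across events.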

  3. New insights into faster computation of uncertainties

    NASA Astrophysics Data System (ADS)

    Bhattacharya, Atreyee

    2012-11-01

    Heavy computation power, lengthy simulations, and an exhaustive number of model runs—often these seem like the only statistical tools that scientists have at their disposal when computing uncertainties associated with predictions, particularly in cases of environmental processes such as groundwater movement. However, calculation of uncertainties need not be as lengthy, a new study shows. Comparing two approaches—the classical Bayesian “credible interval” and a less commonly used regression-based “confidence interval” method—Lu et al. show that for many practical purposes both methods provide similar estimates of uncertainties. The advantage of the regression method is that it demands 10-1000 model runs, whereas the classical Bayesian approach requires 10,000 to millions of model runs.

  4. Bayesian Community Detection in the Space of Group-Level Functional Differences

    PubMed Central

    Venkataraman, Archana; Yang, Daniel Y.-J.; Pelphrey, Kevin A.; Duncan, James S.

    2017-01-01

    We propose a unified Bayesian framework to detect both hyper- and hypo-active communities within whole-brain fMRI data. Specifically, our model identifies dense subgraphs that exhibit population-level differences in functional synchrony between a control and clinical group. We derive a variational EM algorithm to solve for the latent posterior distributions and parameter estimates, which subsequently inform us about the afflicted network topology. We demonstrate that our method provides valuable insights into the neural mechanisms underlying social dysfunction in autism, as verified by the Neurosynth meta-analytic database. In contrast, both univariate testing and community detection via recursive edge elimination fail to identify stable functional communities associated with the disorder. PMID:26955022

  5. Bayesian Community Detection in the Space of Group-Level Functional Differences.

    PubMed

    Venkataraman, Archana; Yang, Daniel Y-J; Pelphrey, Kevin A; Duncan, James S

    2016-08-01

    We propose a unified Bayesian framework to detect both hyper- and hypo-active communities within whole-brain fMRI data. Specifically, our model identifies dense subgraphs that exhibit population-level differences in functional synchrony between a control and clinical group. We derive a variational EM algorithm to solve for the latent posterior distributions and parameter estimates, which subsequently inform us about the afflicted network topology. We demonstrate that our method provides valuable insights into the neural mechanisms underlying social dysfunction in autism, as verified by the Neurosynth meta-analytic database. In contrast, both univariate testing and community detection via recursive edge elimination fail to identify stable functional communities associated with the disorder.

  6. Bayesian analysis of factors associated with fibromyalgia syndrome subjects

    NASA Astrophysics Data System (ADS)

    Jayawardana, Veroni; Mondal, Sumona; Russek, Leslie

    2015-01-01

    Factors contributing to movement-related fear were assessed by Russek et al. (2014) for subjects with fibromyalgia (FM), based on data collected by a national internet survey of community-based individuals. The study focused on the following variables: the Activities-Specific Balance Confidence scale (ABC), the Primary Care Post-Traumatic Stress Disorder screen (PC-PTSD), the Tampa Scale of Kinesiophobia (TSK), a Joint Hypermobility Syndrome screen (JHS), the Vertigo Symptom Scale (VSS-SF), Obsessive-Compulsive Personality Disorder (OCPD), pain, work status and physical activity dependent from the "Revised Fibromyalgia Impact Questionnaire" (FIQR). The study presented in this paper revisits the same data with a Bayesian analysis in which appropriate priors were introduced for the variables selected in the paper by Russek et al.

  7. Bayesian data analysis in population ecology: motivations, methods, and benefits

    USGS Publications Warehouse

    Dorazio, Robert

    2016-01-01

    During the 20th century ecologists largely relied on the frequentist system of inference for the analysis of their data. However, in the past few decades ecologists have become increasingly interested in the use of Bayesian methods of data analysis. In this article I provide guidance to ecologists who would like to decide whether Bayesian methods can be used to improve their conclusions and predictions. I begin by providing a concise summary of Bayesian methods of analysis, including a comparison of differences between Bayesian and frequentist approaches to inference when using hierarchical models. Next I provide a list of problems where Bayesian methods of analysis may arguably be preferred over frequentist methods. These problems are usually encountered in analyses based on hierarchical models of data. I describe the essentials required for applying modern methods of Bayesian computation, and I use real-world examples to illustrate these methods. I conclude by summarizing what I perceive to be the main strengths and weaknesses of using Bayesian methods to solve ecological inference problems.

  8. A Bayesian-frequentist two-stage single-arm phase II clinical trial design.

    PubMed

    Dong, Gaohong; Shih, Weichung Joe; Moore, Dirk; Quan, Hui; Marcella, Stephen

    2012-08-30

    It is well-known that both frequentist and Bayesian clinical trial designs have their own advantages and disadvantages. To have better properties inherited from these two types of designs, we developed a Bayesian-frequentist two-stage single-arm phase II clinical trial design. This design allows both early acceptance and rejection of the null hypothesis ( H(0) ). The measures (for example probability of trial early termination, expected sample size, etc.) of the design properties under both frequentist and Bayesian settings are derived. Moreover, under the Bayesian setting, the upper and lower boundaries are determined with predictive probability of trial success outcome. Given a beta prior and a sample size for stage I, based on the marginal distribution of the responses at stage I, we derived Bayesian Type I and Type II error rates. By controlling both frequentist and Bayesian error rates, the Bayesian-frequentist two-stage design has special features compared with other two-stage designs. Copyright © 2012 John Wiley & Sons, Ltd.
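
    The predictive-probability calculation mentioned in this abstract can be sketched with a beta-binomial model: given a Beta(a, b) prior and the stage I responses, the number of stage II responses follows a beta-binomial distribution. The sketch below is an illustration only, not the authors' design; the function names are hypothetical, and "trial success" is assumed here to mean that the total number of responses across both stages reaches a threshold r_total.

```python
from math import comb, exp, lgamma


def log_beta(a, b):
    """Natural log of the beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def beta_binomial_pmf(y, n, a, b):
    """P(Y = y) when Y | p ~ Binomial(n, p) and p ~ Beta(a, b)."""
    return comb(n, y) * exp(log_beta(a + y, b + n - y) - log_beta(a, b))


def predictive_prob_success(x1, n1, n2, r_total, a=1.0, b=1.0):
    """Predictive probability that total responses reach r_total.

    x1 responses were seen among n1 stage I patients; n2 patients remain.
    The prior Beta(a, b) is updated to Beta(a + x1, b + n1 - x1).
    """
    a_post, b_post = a + x1, b + n1 - x1
    still_needed = max(0, r_total - x1)
    return sum(
        beta_binomial_pmf(y, n2, a_post, b_post)
        for y in range(still_needed, n2 + 1)
    )


if __name__ == "__main__":
    # Hypothetical interim look: 3 responses in 10 patients, 15 still to enroll.
    print(predictive_prob_success(x1=3, n1=10, n2=15, r_total=9))
```

    In a design of this kind, a very low predictive probability at the interim look would argue for stopping early for futility and a very high one for early acceptance; the specific boundaries are design choices not given in the abstract.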

  9. Computational analysis of gene-gene interactions using multifactor dimensionality reduction.

    PubMed

    Moore, Jason H

    2004-11-01

    Understanding the relationship between DNA sequence variations and biologic traits is expected to improve the diagnosis, prevention and treatment of common human diseases. Success in characterizing genetic architecture will depend on our ability to address nonlinearities in the genotype-to-phenotype mapping relationship as a result of gene-gene interactions, or epistasis. This review addresses the challenges associated with the detection and characterization of epistasis. A novel strategy known as multifactor dimensionality reduction that was specifically designed for the identification of multilocus genetic effects is presented. Several case studies that demonstrate the detection of gene-gene interactions in common diseases such as atrial fibrillation, Type II diabetes and essential hypertension are also discussed.

  10. Eight new Arthrinium species from China

    PubMed Central

    Wang, Mei; Tan, Xiao-Ming; Liu, Fang; Cai, Lei

    2018-01-01

    The genus Arthrinium includes important plant pathogens, endophytes and saprobes with a wide host range and geographic distribution. In this paper, 74 Arthrinium strains isolated from various substrates such as bamboo leaves, tea plants, soil and air from karst caves in China were examined using a multi-locus phylogeny based on a combined dataset of ITS rDNA, TEF1 and TUB2, in conjunction with morphological characters, host association and ecological distribution. Eight new species were described based on their distinct phylogenetic relationships and morphological characters. Our results indicated a high species diversity of Arthrinium with wide host ranges, amongst which, Poaceae and Cyperaceae were the major host plant families of Arthrinium species. PMID:29755262

  11. Eight new Arthrinium species from China.

    PubMed

    Wang, Mei; Tan, Xiao-Ming; Liu, Fang; Cai, Lei

    2018-01-01

    The genus Arthrinium includes important plant pathogens, endophytes and saprobes with a wide host range and geographic distribution. In this paper, 74 Arthrinium strains isolated from various substrates such as bamboo leaves, tea plants, soil and air from karst caves in China were examined using a multi-locus phylogeny based on a combined dataset of ITS rDNA, TEF1 and TUB2, in conjunction with morphological characters, host association and ecological distribution. Eight new species were described based on their distinct phylogenetic relationships and morphological characters. Our results indicated a high species diversity of Arthrinium with wide host ranges, amongst which, Poaceae and Cyperaceae were the major host plant families of Arthrinium species.

  12. Pseudo-outbreak of septicemia due to rapidly growing mycobacteria associated with extrinsic contamination of culture supplement.

    PubMed Central

    Ashford, D A; Kellerman, S; Yakrus, M; Brim, S; Good, R C; Finelli, L; Jarvis, W R; McNeil, M M

    1997-01-01

    Between April and December 1994, 23 blood cultures from human immunodeficiency virus-infected patients grew rapidly growing mycobacteria suspected to be Mycobacterium chelonae at a hospital in New Jersey. The isolates were later identified as M. abscessus. Several bacterial species, including M. abscessus, were cultured from an opened multidose supplement vial (BBL Septi-Chek AFB Supplement) that had been used for mycobacterial blood cultures. The M. abscessus isolates from case patients and the supplement vial had identical multilocus enzyme electrophoresis and antimicrobial susceptibility patterns. Finding a contaminated vial of supplement, together with the lack of a distinct syndrome in case patients, was consistent with a pseudo-outbreak. PMID:9230377

  13. Using SPM 12’s Second-Level Bayesian Inference Procedure for fMRI Analysis: Practical Guidelines for End Users

    PubMed Central

    Han, Hyemin; Park, Joonsuk

    2018-01-01

    Recent debates about the conventional threshold used in the fields of neuroscience and psychology, namely P < 0.05, have spurred researchers to consider alternative ways to analyze fMRI data. A group of methodologists and statisticians have considered Bayesian inference as a candidate methodology. However, few previous studies have attempted to provide end users of fMRI analysis tools, such as SPM 12, with practical guidelines about how to conduct Bayesian inference. In the present study, we aim to demonstrate how to utilize Bayesian inference, Bayesian second-level inference in particular, implemented in SPM 12 by analyzing fMRI data available to the public via NeuroVault. In addition, to help end users understand how Bayesian inference actually works in SPM 12, we examine outcomes from Bayesian second-level inference implemented in SPM 12 by comparing them with those from classical second-level inference. Finally, we provide practical guidelines about how to set the parameters for Bayesian inference and how to interpret the results, such as Bayes factors, from the inference. We also discuss the practical and philosophical benefits of Bayesian inference and directions for future research. PMID:29456498

  14. An introduction to Bayesian statistics in health psychology.

    PubMed

    Depaoli, Sarah; Rus, Holly M; Clifton, James P; van de Schoot, Rens; Tiemensma, Jitske

    2017-09-01

    The aim of the current article is to provide a brief introduction to Bayesian statistics within the field of health psychology. Bayesian methods are increasing in prevalence in applied fields, and they have been shown in simulation research to improve the estimation accuracy of structural equation models, latent growth curve (and mixture) models, and hierarchical linear models. Likewise, Bayesian methods can be used with small sample sizes since they do not rely on large sample theory. In this article, we discuss several important components of Bayesian statistics as they relate to health-based inquiries. We discuss the incorporation and impact of prior knowledge into the estimation process and the different components of the analysis that should be reported in an article. We present an example implementing Bayesian estimation in the context of blood pressure changes after participants experienced an acute stressor. We conclude with final thoughts on the implementation of Bayesian statistics in health psychology, including suggestions for reviewing Bayesian manuscripts and grant proposals. We have also included an extensive amount of online supplementary material to complement the content presented here, including Bayesian examples using many different software programmes and an extensive sensitivity analysis examining the impact of priors.

  15. A multi-locus molecular timescale for the origin and diversification of eels (Order: Anguilliformes).

    PubMed

    Santini, Francesco; Kong, Xianghui; Sorenson, Laurie; Carnevale, Giorgio; Mehta, Rita S; Alfaro, Michael E

    2013-12-01

    Anguilliformes are an ecologically diverse group of predominantly marine fishes whose members are easily recognized by their extremely elongate bodies, and universal lack of pelvic fins. Recent studies based on mitochondrial loci, including full mitogenomes, have called into question the monophyly of both the Anguilliformes, which appear to be paraphyletic without the inclusion of the Saccopharyngiformes (gulper eels and allies), as well as other more commonly known eel families (e.g., Congridae, Serrivomeridae). However, no study to date has investigated anguilliform interrelationships using nuclear loci. Here we present a new phylogenetic hypothesis for the Anguilliformes based on five markers (the nuclear loci Early Growth Hormone 3, Myosin Heavy Polypeptide 6 and Recombinase Activating Gene 1, as well as the mitochondrial genes Cytochrome b and Cytochrome Oxidase I). Our sampling spans 148 species and includes 19 of the 20 extant families of anguilliforms and saccopharyngiforms. Maximum likelihood analysis reveals that saccopharyngiform eels are deeply nested within the anguilliforms, and supports the non-monophyly of Congridae and Nettastomatidae, as well as that of Derichthyidae and Chlopsidae. Our analyses suggest that Protanguilla may be the sister group of the Synaphobranchidae, though the recent hypothesis that this species is the sister group to all other anguilliforms cannot be rejected. The molecular phylogeny, time-calibrated using a Bayesian relaxed clock approach and seven fossil calibration points, reveals a Late Cretaceous origin of this expanded anguilliform clade (stem age ~116 Ma, crown age ~99 Ma). Most major (family level) lineages originated between the end of the Cretaceous and Early Eocene, suggesting that anguilliform radiation may have been facilitated by the recovery of marine ecosystems following the KP extinction. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. A novel multilocus phylogenetic estimation reveals unrecognized diversity in Asian horned toads, genus Megophrys sensu lato (Anura: Megophryidae).

    PubMed

    Chen, Jin-Min; Zhou, Wei-Wei; Poyarkov, Nikolay A; Stuart, Bryan L; Brown, Rafe M; Lathrop, Amy; Wang, Ying-Yong; Yuan, Zhi-Yong; Jiang, Ke; Hou, Mian; Chen, Hong-Man; Suwannapoom, Chatmongkon; Nguyen, Sang Ngoc; Duong, Tang Van; Papenfuss, Theodore J; Murphy, Robert W; Zhang, Ya-Ping; Che, Jing

    2017-01-01

    The horned toad assemblage, genus Megophrys sensu lato, currently includes three groups previously recognized as the genera Atympanophrys, Xenophrys and Megophrys sensu stricto. The taxonomic status and species composition of the three groups remain controversial due to conflicting phenotypic analyses and insufficient phylogenetic reconstruction; likewise, the position of the monotypic Borneophrys remains uncertain with respect to the horned toads. Further, the diversity of the horned toads remains poorly understood, especially for widespread species. Herein, we evaluate species-level diversity based on 45 of the 57 described species from throughout southern China, Southeast Asia and the Himalayas using Bayesian inference trees and the Generalized Mixed Yule Coalescent (GMYC) approach. We estimate the phylogeny using both mitochondrial and nuclear DNA data. Analyses reveal statistically significant mito-nuclear discordance. All analyses resolve paraphyly for horned toads involving multiple strongly supported clades. These clades correspond with geography. We resurrect the genera Atympanophrys and Xenophrys from the synonymy of Megophrys to eliminate paraphyly of Megophrys s.l. and to account for the morphological, molecular and biogeographic differences among these groups, but we also provide an alternative option. Our study suggests that Borneophrys is a junior synonym of Megophrys sensu stricto. We provide an estimate of the timeframe for the horned toads. The mitochondrial and nuclear trees indicate the presence of many putative undescribed species. Widespread species, such as Xenophrys major and X. minor, likely have dramatically underestimated diversity. The integration of morphological and molecular evidence can validate this discovery. Montane forest dynamics appear to play a significant role in driving diversification of horned toads. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Fuzzy boundaries: color and gene flow patterns among parapatric lineages of the western shovel-nosed snake and taxonomic implication

    USGS Publications Warehouse

    Wood, Dustin A.; Fisher, Robert N.; Vandergast, Amy G.

    2014-01-01

    Accurate delineation of lineage diversity is increasingly important, as species distributions are becoming more reduced and threatened. During the last century, the subspecies category was often used to denote phenotypic variation within a species range and to provide a framework for understanding lineage differentiation, often considered incipient speciation. While this category has largely fallen into disuse, previously recognized subspecies often serve as important units for conservation policy and management when other information is lacking. In this study, we evaluated phenotypic subspecies hypotheses within shovel-nosed snakes on the basis of genetic data and considered how evolutionary processes such as gene flow influenced possible incongruence between phenotypic and genetic patterns. We used both traditional phylogenetic and Bayesian clustering analyses to infer range-wide genetic structure and spatially explicit analyses to detect possible boundary locations of lineage contact. Multilocus analyses supported three historically isolated groups with low to moderate levels of contemporary gene exchange. Genetic data did not support phenotypic subspecies as exclusive groups, and we detected patterns of discordance in areas where three subspecies are presumed to be in contact. Based on genetic and phenotypic evidence, we suggested that species-level diversity is underestimated in this group and we proposed that two species be recognized, Chionactis occipitalis and C. annulata. In addition, we recommend retention of two subspecific designations within C. annulata (C. a. annulata and C. a. klauberi) that reflect regional shifts in both genetic and phenotypic variation within the species. Our results highlight the difficulty in validating taxonomic boundaries within lineages that are evolving under a time-dependent, continuous process.

  18. Revisiting the phylogeny of Bombacoideae (Malvaceae): Novel relationships, morphologically cohesive clades, and a new tribal classification based on multilocus phylogenetic analyses.

    PubMed

    Carvalho-Sobrinho, Jefferson G; Alverson, William S; Alcantara, Suzana; Queiroz, Luciano P; Mota, Aline C; Baum, David A

    2016-08-01

    Bombacoideae (Malvaceae) is a clade of deciduous trees with a marked dominance in many forests, especially in the Neotropics. The historical lack of a well-resolved phylogenetic framework for Bombacoideae hinders studies in this ecologically important group. We reexamined phylogenetic relationships in this clade based on a matrix of 6465 nuclear (ETS, ITS) and plastid (matK, trnL-trnF, trnS-trnG) DNA characters. We used maximum parsimony, maximum likelihood, and Bayesian inference to infer relationships among 108 species (∼70% of the total number of known species). We analyzed the evolution of selected morphological traits: trunk or branch prickles, calyx shape, endocarp type, seed shape, and seed number per fruit, using ML reconstructions of their ancestral states to identify possible synapomorphies for major clades. Novel phylogenetic relationships emerged from our analyses, including three major lineages marked by fruit or seed traits: the winged-seed clade (Bernoullia, Gyranthera, and Huberodendron), the spongy endocarp clade (Adansonia, Aguiaria, Catostemma, Cavanillesia, and Scleronema), and the Kapok clade (Bombax, Ceiba, Eriotheca, Neobuchia, Pachira, Pseudobombax, Rhodognaphalon, and Spirotheca). The Kapok clade, the most diverse lineage of the subfamily, includes sister relationships (i) between Pseudobombax and "Pochota fendleri", a historically incertae sedis taxon, and (ii) between the Paleotropical genera Bombax and Rhodognaphalon, implying just two bombacoid dispersals to the Old World, the other one involving Adansonia. This new phylogenetic framework offers new insights and a promising avenue for further evolutionary studies. In view of this information, we present a new tribal classification of the subfamily, accompanied by an identification key. Copyright © 2016 Elsevier Inc. All rights reserved.

  19. Toward a DNA Taxonomy of Alpine Rhithrogena (Ephemeroptera: Heptageniidae) Using a Mixed Yule-Coalescent Analysis of Mitochondrial and Nuclear DNA

    PubMed Central

    Vuataz, Laurent; Sartori, Michel; Wagner, André; Monaghan, Michael T.

    2011-01-01

    Aquatic larvae of many Rhithrogena mayflies (Ephemeroptera) inhabit sensitive Alpine environments. A number of species are on the IUCN Red List and many recognized species have restricted distributions and are of conservation interest. Despite their ecological and conservation importance, ambiguous morphological differences among closely related species suggest that the current taxonomy may not accurately reflect the evolutionary diversity of the group. Here we examined the species status of nearly 50% of European Rhithrogena diversity using a widespread sampling scheme of Alpine species that included 22 type localities, general mixed Yule-coalescent (GMYC) model analysis of one standard mtDNA marker and one newly developed nDNA marker, and morphological identification where possible. Using sequences from 533 individuals from 144 sampling localities, we observed significant clustering of the mitochondrial (cox1) marker into 31 GMYC species. Twenty-one of these could be identified based on the presence of topotypes (expertly identified specimens from the species' type locality) or unambiguous morphology. These results strongly suggest the presence of both cryptic diversity and taxonomic oversplitting in Rhithrogena. Significant clustering was not detected with protein-coding nuclear PEPCK, although nine GMYC species were congruent with well supported terminal clusters of nDNA. Lack of greater congruence in the two data sets may be the result of incomplete sorting of ancestral polymorphism. Bayesian phylogenetic analyses of both gene regions recovered four of the six recognized Rhithrogena species groups in our samples as monophyletic. Future development of more nuclear markers would facilitate multi-locus analysis of unresolved, closely related species pairs. The DNA taxonomy developed here lays the groundwork for a future revision of the important but cryptic Rhithrogena genus in Europe. PMID:21611178

  20. Multi-locus phylogeny using topotype specimens sheds light on the systematics of Niviventer (Rodentia, Muridae) in China.

    PubMed

    Zhang, Bin; He, Kai; Wan, Tao; Chen, Peng; Sun, Guozheng; Liu, Shaoying; Nguyen, Truong Son; Lin, Liangkong; Jiang, Xuelong

    2016-12-01

    Niviventer is a genus of white-bellied rats that are among the most common rodents in the Indo-Sundaic region. The taxonomy of the genus has undergone extensive revisions and remains controversial. The current phylogeny is unresolved and was developed primarily on the basis of mitochondrial genes. Identification is extremely difficult, and a large number of GenBank sequences seem to be problematic. We extensively sampled specimens of Niviventer in China and neighboring northern Vietnam, including topotypes of the most reported species (n = 6), subspecies (n = 8), and synonyms (n = 4). We estimated phylogenetic relationships on the basis of one mitochondrial and three nuclear genes, using concatenation and coalescent-based approaches. We also employed molecular species delimitation approaches to test the existence of cryptic and putative new species. Our phylogeny was finely resolved, especially for the N. confucianus-like species. Our data provided the first support for N. brahma and N. eha as sister species, an assignment that is congruent with their morphological similarities. Species delimitation analyses provided new insight into species diversity and systematics. Three geographic populations of N. confucianus and one of N. fulvescens were supported as genetically distinct in our species delimitation analyses, while three recognized species (N. coninga, N. huang, and N. lotipes) were not strongly supported as distinct. Our results suggested that several genetically distinct species may be contained within the species currently known as N. confucianus and N. fulvescens. In addition, the results of Bayesian Phylogenetics and Phylogeography (BPP) for N. coninga, N. huang, and N. lotipes indicated that either inter-specific gene flow had occurred or imperfect taxonomy was present. Morphological examinations and morphometric analyses are warranted to examine the molecular results.
