Posterior Predictive Bayesian Phylogenetic Model Selection
Lewis, Paul O.; Xie, Wangang; Chen, Ming-Hui; Fan, Yu; Kuo, Lynn
2014-01-01
We present two distinctly different posterior predictive approaches to Bayesian phylogenetic model selection and illustrate these methods using examples from green algal protein-coding cpDNA sequences and flowering plant rDNA sequences. The Gelfand–Ghosh (GG) approach allows dissection of an overall measure of model fit into components due to posterior predictive variance (GGp) and goodness-of-fit (GGg), which distinguishes this method from the posterior predictive P-value approach. The conditional predictive ordinate (CPO) method provides a site-specific measure of model fit useful for exploratory analyses and can be combined over sites yielding the log pseudomarginal likelihood (LPML) which is useful as an overall measure of model fit. CPO provides a useful cross-validation approach that is computationally efficient, requiring only a sample from the posterior distribution (no additional simulation is required). Both GG and CPO add new perspectives to Bayesian phylogenetic model selection based on the predictive abilities of models and complement the perspective provided by the marginal likelihood (including Bayes Factor comparisons) based solely on the fit of competing models to observed data. [Bayesian; conditional predictive ordinate; CPO; L-measure; LPML; model selection; phylogenetics; posterior predictive.] PMID:24193892
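To make the CPO and LPML quantities above concrete, here is a minimal sketch under stated assumptions. It uses the standard Monte Carlo estimator in which the CPO for site i is the harmonic mean, over posterior samples, of that site's likelihood, and LPML is the sum of log CPO over sites. The function names and the layout of the input matrix are hypothetical and are not taken from the authors' software.

```python
import math

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) for a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def log_cpo(site_log_likes):
    """Log CPO for one site.

    site_log_likes[t] is the log-likelihood of the site under posterior
    sample t. CPO is the harmonic mean of the site likelihoods:
    CPO = n / sum_t (1 / L_t).
    """
    n = len(site_log_likes)
    return math.log(n) - log_sum_exp([-ll for ll in site_log_likes])

def lpml(log_like_matrix):
    """LPML = sum over sites of log CPO.

    log_like_matrix[t][i] is the log-likelihood of site i under
    posterior sample t (assumed layout, for illustration only).
    """
    n_sites = len(log_like_matrix[0])
    return sum(
        log_cpo([row[i] for row in log_like_matrix]) for i in range(n_sites)
    )
```

Only the per-site log-likelihoods recorded during the original MCMC run are needed, which is why no additional simulation is required.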
Bayesian models for comparative analysis integrating phylogenetic uncertainty
2012-01-01
Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. 
We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language. PMID:22741602
Inferring Phylogenetic Networks Using PhyloNet.
Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay
2018-07-01
PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyses and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.
Phylogenetic Analyses: A Toolbox Expanding towards Bayesian Methods
Aris-Brosou, Stéphane; Xia, Xuhua
2008-01-01
The reconstruction of phylogenies is becoming an increasingly simple activity. This is mainly due to two reasons: the democratization of computing power and the increased availability of sophisticated yet user-friendly software. This review describes some of the latest additions to the phylogenetic toolbox, along with some of their theoretical and practical limitations. It is shown that Bayesian methods are under heavy development, as they offer the possibility to solve a number of long-standing issues and to integrate several steps of the phylogenetic analyses into a single framework. Specific topics include not only phylogenetic reconstruction, but also the comparison of phylogenies, the detection of adaptive evolution, and the estimation of divergence times between species. PMID:18483574
Cross-validation to select Bayesian hierarchical models in phylogenetics.
Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C
2016-05-26
Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.
Bayesian models for comparative analysis integrating phylogenetic uncertainty.
de Villemereuil, Pierre; Wells, Jessie A; Edwards, Robert D; Blomberg, Simon P
2012-06-28
Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. 
We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language.
Yang, Ziheng; Zhu, Tianqi
2018-02-20
The Bayesian method is noted to produce spuriously high posterior probabilities for phylogenetic trees in analysis of large datasets, but the precise reasons for this overconfidence are unknown. In general, the performance of Bayesian selection of misspecified models is poorly understood, even though this is of great scientific interest since models are never true in real data analysis. Here we characterize the asymptotic behavior of Bayesian model selection and show that when the competing models are equally wrong, Bayesian model selection exhibits surprising and polarized behaviors in large datasets, supporting one model with full force while rejecting the others. If one model is slightly less wrong than the other, the less wrong model will eventually win when the amount of data increases, but the method may become overconfident before it becomes reliable. We suggest that this extreme behavior may be a major factor for the spuriously high posterior probabilities for evolutionary trees. The philosophical implications of our results to the application of Bayesian model selection to evaluate opposing scientific hypotheses are yet to be explored, as are the behaviors of non-Bayesian methods in similar situations.
Matthews, Luke J.; Tehrani, Jamie J.; Jordan, Fiona M.; Collard, Mark; Nunn, Charles L.
2011-01-01
Background Archaeologists and anthropologists have long recognized that different cultural complexes may have distinct descent histories, but they have lacked analytical techniques capable of easily identifying such incongruence. Here, we show how Bayesian phylogenetic analysis can be used to identify incongruent cultural histories. We employ the approach to investigate Iranian tribal textile traditions. Methods We used Bayes factor comparisons in a phylogenetic framework to test two models of cultural evolution: the hierarchically integrated system hypothesis and the multiple coherent units hypothesis. In the hierarchically integrated system hypothesis, a core tradition of characters evolves through descent with modification and characters peripheral to the core are exchanged among contemporaneous populations. In the multiple coherent units hypothesis, a core tradition does not exist. Rather, there are several cultural units consisting of sets of characters that have different histories of descent. Results For the Iranian textiles, the Bayesian phylogenetic analyses supported the multiple coherent units hypothesis over the hierarchically integrated system hypothesis. Our analyses suggest that pile-weave designs represent a distinct cultural unit that has a different phylogenetic history compared to other textile characters. Conclusions The results from the Iranian textiles are consistent with the available ethnographic evidence, which suggests that the commercial rug market has influenced pile-rug designs but not the techniques or designs incorporated in the other textiles produced by the tribes. We anticipate that Bayesian phylogenetic tests for inferring cultural units will be of great value for researchers interested in studying the evolution of cultural traits including language, behavior, and material culture. PMID:21559083
Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses
Lanfear, Robert; Hua, Xia; Warren, Dan L.
2016-01-01
Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC because sequential samples from the chain can be non-independent due to autocorrelation. Typically, phylogeneticists use a rule of thumb that the ESS of all parameters should be greater than 200. However, we have no method to calculate an ESS of tree topology samples, despite the fact that the tree topology is often the parameter of primary interest and is almost always central to the estimation of other parameters. That is, we lack a method to determine whether we have adequately sampled one of the most important parameters in our analyses. In this study, we address this problem by developing methods to estimate the ESS for tree topologies. We combine these methods with two new diagnostic plots for assessing posterior samples of tree topologies, and compare their performance on simulated and empirical data sets. Combined, the methods we present provide new ways to assess the mixing and convergence of phylogenetic tree topologies in Bayesian MCMC analyses. PMID:27435794
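For orientation, the following is a minimal sketch of the conventional ESS calculation for a scalar parameter trace, ESS = N / (1 + 2 Σ ρ_k), with the sum truncated at the first non-positive autocorrelation. It is not the topological ESS method developed in the study, which requires a different treatment because tree topologies are not numeric; the function names and the truncation rule are assumptions chosen for illustration.

```python
def autocorrelation(x, lag):
    """Sample autocorrelation of the sequence x at the given lag."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x) / n
    if var == 0:
        return 0.0  # constant trace: treat as uncorrelated
    cov = sum((x[i] - mean) * (x[i + lag] - mean) for i in range(n - lag)) / n
    return cov / var

def effective_sample_size(x):
    """ESS = N / (1 + 2 * sum of positive-lag autocorrelations).

    The sum stops at the first lag whose autocorrelation is <= 0,
    a simple truncation rule assumed here for illustration.
    """
    n = len(x)
    rho_sum = 0.0
    for lag in range(1, n):
        rho = autocorrelation(x, lag)
        if rho <= 0:
            break
        rho_sum += rho
    return n / (1 + 2 * rho_sum)
```

A trace that stays in one state for long stretches yields an ESS far below the number of samples, which is the situation the 200-sample rule of thumb is meant to detect.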
Assessment of phylogenetic sensitivity for reconstructing HIV-1 epidemiological relationships.
Beloukas, Apostolos; Magiorkinis, Emmanouil; Magiorkinis, Gkikas; Zavitsanou, Asimina; Karamitros, Timokratis; Hatzakis, Angelos; Paraskevis, Dimitrios
2012-06-01
Phylogenetic analysis has been extensively used as a tool for the reconstruction of epidemiological relations for research or for forensic purposes. It was our objective to assess the sensitivity of different phylogenetic methods and various phylogenetic programs to reconstruct epidemiological links among HIV-1 infected patients, that is, the probability of revealing a true transmission relationship. Multiple datasets (90) were prepared consisting of HIV-1 sequences in protease (PR) and partial reverse transcriptase (RT) sampled from patients with a documented epidemiological relationship (target population), and from unrelated individuals (control population) belonging to the same HIV-1 subtype as the target population. Each dataset varied regarding the number, the geographic origin and the transmission risk groups of the sequences among the control population. Phylogenetic trees were inferred by neighbor-joining (NJ), maximum likelihood heuristics (hML) and Bayesian methods. All clusters of sequences belonging to the target population were correctly reconstructed by NJ and Bayesian methods receiving high bootstrap and posterior probability (PP) support, respectively. On the other hand, TreePuzzle failed to reconstruct or provide significant support for several clusters; high puzzling step support was associated with the inclusion of control sequences from the same geographic area as the target population. In contrast, all clusters were correctly reconstructed by hML as implemented in PhyML 3.0 receiving high bootstrap support. We report that under the conditions of our study, hML using PhyML, NJ and Bayesian methods were the most sensitive for the reconstruction of epidemiological links mostly from sexually infected individuals. Copyright © 2012 Elsevier B.V. All rights reserved.
Cau, Andrea
2017-01-01
Bayesian phylogenetic methods integrating simultaneously morphological and stratigraphic information have been applied increasingly among paleontologists. Most of these studies have used Bayesian methods as an alternative to the widely-used parsimony analysis, to infer macroevolutionary patterns and relationships among species-level or higher taxa. Among recently introduced Bayesian methodologies, the Fossilized Birth-Death (FBD) model allows incorporation of hypotheses on ancestor-descendant relationships in phylogenetic analyses including fossil taxa. Here, the FBD model is used to infer the relationships among an ingroup formed exclusively by fossil individuals, i.e., dipnoan tooth plates from four localities in the Ain el Guettar Formation of Tunisia. Previous analyses of this sample compared the results of phylogenetic analysis using parsimony with stratigraphic methods, inferred a high diversity (five or more genera) in the Ain el Guettar Formation, and interpreted it as an artifact inflated by depositional factors. In the analysis performed here, the uncertainty on the chronostratigraphic relationships among the specimens was included among the prior settings. The results of the analysis confirm the referral of most of the specimens to the taxa Asiatoceratodus, Equinoxiodus, Lavocatodus and Neoceratodus, but reject those to Ceratodus and Ferganoceratodus. The resulting phylogeny constrained the evolution of the Tunisian sample exclusively in the Early Cretaceous, contrasting with the previous scenario inferred by the stratigraphically-calibrated topology resulting from parsimony analysis. The phylogenetic framework also suggests that (1) the sampled localities are laterally equivalent, but (2) three localities are restricted to the youngest part of the section; both results are in agreement with previous stratigraphic analyses of these localities.
The FBD model of specimen-level units provides a novel tool for phylogenetic inference among fossils but also for independent tests of stratigraphic scenarios.
Bayesian nonparametric clustering in phylogenetics: modeling antigenic evolution in influenza.
Cybis, Gabriela B; Sinsheimer, Janet S; Bedford, Trevor; Rambaut, Andrew; Lemey, Philippe; Suchard, Marc A
2018-01-30
Influenza is responsible for up to 500,000 deaths every year, and antigenic variability represents much of its epidemiological burden. To visualize antigenic differences across many viral strains, antigenic cartography methods use multidimensional scaling on binding assay data to map influenza antigenicity onto a low-dimensional space. Analysis of such assay data ideally leads to natural clustering of influenza strains of similar antigenicity that correlate with sequence evolution. To understand the dynamics of these antigenic groups, we present a framework that jointly models genetic and antigenic evolution by combining multidimensional scaling of binding assay data, Bayesian phylogenetic machinery and nonparametric clustering methods. We propose a phylogenetic Chinese restaurant process that extends the current process to incorporate the phylogenetic dependency structure between strains in the modeling of antigenic clusters. With this method, we are able to use the genetic information to better understand the evolution of antigenicity throughout epidemics, as shown in applications of this model to H1N1 influenza. Copyright © 2017 John Wiley & Sons, Ltd.
Phylogenetic inference under varying proportions of indel-induced alignment gaps
Dwivedi, Bhakti; Gadagkar, Sudhindra R
2009-01-01
Background The effect of alignment gaps on phylogenetic accuracy has been the subject of numerous studies. In this study, we investigated the relationship between the total number of gapped sites and phylogenetic accuracy, when the gaps were introduced (by means of computer simulation) to reflect indel (insertion/deletion) events during the evolution of DNA sequences. The resulting (true) alignments were subjected to commonly used gap treatment and phylogenetic inference methods. Results (1) In general, there was a strong – almost deterministic – relationship between the amount of gap in the data and the level of phylogenetic accuracy when the alignments were very "gappy", (2) gaps resulting from deletions (as opposed to insertions) contributed more to the inaccuracy of phylogenetic inference, (3) the probabilistic methods (Bayesian, PhyML & "MLε," a method implemented in DNAML in PHYLIP) performed better at most levels of gap percentage when compared to parsimony (MP) and distance (NJ) methods, with Bayesian analysis being clearly the best, (4) methods that treat gapped sites as missing data yielded less accurate trees when compared to those that attribute phylogenetic signal to the gapped sites (by coding them as binary character data – presence/absence, or as in the MLε method), and (5) in general, the accuracy of phylogenetic inference depended upon the amount of available data when the gaps resulted from mainly deletion events, and the amount of missing data when insertion events were equally likely to have caused the alignment gaps.
Conclusion When gaps in an alignment are a consequence of indel events in the evolution of the sequences, the accuracy of phylogenetic analysis is likely to improve if: (1) alignment gaps are categorized as arising from insertion events or deletion events and then treated separately in the analysis, (2) the evolutionary signal provided by indels is harnessed in the phylogenetic analysis, and (3) methods that utilize the phylogenetic signal in indels are developed for distance methods too. When the true homology is known and the amount of gaps is 20 percent of the alignment length or less, the methods used in this study are likely to yield trees with 90–100 percent accuracy. PMID:19698168
Vrancken, Bram; Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Longdon, Ben; Günthard, Huldrych F.; Suchard, Marc A.
2014-01-01
Phylogenetic signal quantifies the degree to which resemblance in continuously-valued traits reflects phylogenetic relatedness. Measures of phylogenetic signal are widely used in ecological and evolutionary research, and are recently gaining traction in viral evolutionary studies. Standard estimators of phylogenetic signal frequently condition on data summary statistics of the repeated trait observations and fixed phylogenetic trees, resulting in information loss and potential bias. To incorporate the observation process and phylogenetic uncertainty in a model-based approach, we develop a novel Bayesian inference method to simultaneously estimate the evolutionary history and phylogenetic signal from molecular sequence data and repeated multivariate traits. Our approach builds upon a phylogenetic diffusion framework that models continuous trait evolution as a Brownian motion process and incorporates Pagel’s λ transformation parameter to estimate dependence among traits. We provide a computationally efficient inference implementation in the BEAST software package. We evaluate the synthetic performance of the Bayesian estimator of phylogenetic signal against standard estimators, and demonstrate the use of our coherent framework to address several virus-host evolutionary questions, including virulence heritability for HIV, antigenic evolution in influenza and HIV, and Drosophila sensitivity to sigma virus infection. Finally, we discuss model extensions that will make useful contributions to our flexible framework for simultaneously studying sequence and trait evolution. PMID:25780554
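As a rough illustration of the λ transformation mentioned above, the sketch below assumes the common formulation in which λ rescales the off-diagonal entries of the tip covariance matrix implied by the tree under Brownian motion, leaving the diagonal (tip variances) unchanged. The function name and the toy matrix are hypothetical, and the BEAST implementation described in the abstract may parameterize the transformation differently.

```python
def pagel_lambda_transform(vcv, lam):
    """Scale off-diagonal covariances by lam, keeping the diagonal.

    vcv is a square list-of-lists covariance matrix whose entry (i, j)
    is proportional to the shared branch length of tips i and j.
    lam = 1 leaves the matrix unchanged (full phylogenetic signal);
    lam = 0 makes the tips independent (no phylogenetic signal).
    """
    n = len(vcv)
    return [
        [vcv[i][j] if i == j else lam * vcv[i][j] for j in range(n)]
        for i in range(n)
    ]

# Toy three-tip example: tips 0 and 1 share 0.6 units of history.
toy_vcv = [
    [1.0, 0.6, 0.0],
    [0.6, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
half_signal = pagel_lambda_transform(toy_vcv, 0.5)
```

Intermediate values of λ shrink the covariance between related tips toward zero, which is how the parameter expresses the strength of phylogenetic signal.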
Posada, David; Buckley, Thomas R
2004-10-01
Model selection is a topic of special relevance in molecular phylogenetics that affects many, if not all, stages of phylogenetic inference. Here we discuss some fundamental concepts and techniques of model selection in the context of phylogenetics. We start by reviewing different aspects of the selection of substitution models in phylogenetics from a theoretical, philosophical and practical point of view, and summarize this comparison in table format. We argue that the most commonly implemented model selection approach, the hierarchical likelihood ratio test, is not the optimal strategy for model selection in phylogenetics, and that approaches like the Akaike Information Criterion (AIC) and Bayesian methods offer important advantages. In particular, the latter two methods are able to simultaneously compare multiple nested or nonnested models, assess model selection uncertainty, and allow for the estimation of phylogenies and model parameters using all available models (model-averaged inference or multimodel inference). We also describe how the relative importance of the different parameters included in substitution models can be depicted. To illustrate some of these points, we have applied AIC-based model averaging to 37 mitochondrial DNA sequences from the subgenus Ohomopterus (genus Carabus) ground beetles described by Sota and Vogler (2001).
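The AIC-based comparison and model averaging described above rest on two standard formulas: AIC = 2k - 2 ln L for a model with k free parameters and maximized likelihood L, and Akaike weights w_i proportional to exp(-Δ_i / 2), where Δ_i is the difference from the best AIC. A minimal sketch follows; the function names are hypothetical and any input values a reader supplies are their own, not results from the paper.

```python
import math

def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 ln L."""
    return 2 * n_params - 2 * log_likelihood

def akaike_weights(aic_scores):
    """Map {model name: AIC} to {model name: Akaike weight}.

    Weights sum to 1 and can serve as model-averaging weights or as a
    measure of model selection uncertainty.
    """
    best = min(aic_scores.values())
    rel = {
        name: math.exp(-0.5 * (score - best))
        for name, score in aic_scores.items()
    }
    total = sum(rel.values())
    return {name: r / total for name, r in rel.items()}
```

Because the weights are computed for all candidate models at once, nested and nonnested models can be compared on the same footing, unlike a sequence of pairwise likelihood ratio tests.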
Wang, Wei; Xia, Minxuan; Chen, Jie; Deng, Fenni; Yuan, Rui; Zhang, Xiaopei; Shen, Fafu
2016-12-01
The data presented in this paper support the research article "Genome-Wide Analysis of Superoxide Dismutase Gene Family in Gossypium raimondii and G. arboreum" [1]. In this data article, we present a phylogenetic tree showing a dichotomy with two different clusters of SODs inferred by the Bayesian method of MrBayes (version 3.2.4), "Bayesian phylogenetic inference under mixed models" [2], Ramachandran plots of G. raimondii and G. arboreum SODs, the protein sequences used to generate the 3D structures of the proteins and the template accessions via the SWISS-MODEL server, "SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information." [3] and motif sequences of SODs identified by InterProScan (version 4.8) with the Pfam database, "Pfam: the protein families database" [4].
Estimating Bayesian Phylogenetic Information Content
Lewis, Paul O.; Chen, Ming-Hui; Kuo, Lynn; Lewis, Louise A.; Fučíková, Karolina; Neupane, Suman; Wang, Yu-Bo; Shi, Daoyuan
2016-01-01
Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.] PMID:27155008
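One way to read the entropy comparison above is as the difference H(prior) - H(posterior) over tree topologies. The sketch below assumes a discrete uniform prior over unrooted binary topologies, of which there are (2n-5)!! for n taxa, and takes a list of estimated posterior topology probabilities as input. The function names are hypothetical, and the sketch does not implement the conditional clade distribution estimator that the study recommends.

```python
import math

def entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute 0."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def num_unrooted_topologies(n_taxa):
    """Number of unrooted binary topologies, (2n-5)!!, for n_taxa >= 3."""
    count = 1
    for k in range(3, 2 * n_taxa - 4, 2):
        count *= k
    return count

def information_content(posterior_probs, n_taxa):
    """H(prior) - H(posterior), assuming a uniform topology prior."""
    prior_entropy = math.log(num_unrooted_topologies(n_taxa))
    return prior_entropy - entropy(posterior_probs)
```

When the posterior equals the uniform prior the result is zero, and when the posterior places all its mass on one topology the result reaches its maximum, the prior entropy.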
Xiang, Kun-Li; Wu, Sheng-Dan; Yu, Sheng-Xian; Liu, Yang; Jabbour, Florian; Erst, Andrey S.; Zhao, Liang; Wang, Wei; Chen, Zhi-Duan
2016-01-01
Coptis (Ranunculaceae) contains 15 species and is one of the pharmaceutically most important plant genera in eastern Asia. Understanding of the evolution of morphological characters and phylogenetic relationships within the genus is very limited. Here, we present the first comprehensive phylogenetic analysis of the genus based on two plastid and one nuclear markers. The phylogeny was reconstructed using Bayesian inference, as well as maximum parsimony and maximum likelihood methods. The Swofford-Olsen-Waddell-Hillis and Bayesian tests were used to assess the strength of the conflicts between traditional taxonomic units and those suggested by the phylogenetic inferences. Evolution of morphological characters was inferred using a Bayesian method to identify synapomorphies for the infrageneric lineages. Our data recognize two strongly supported clades within Coptis. The first clade contains subgenus Coptis and section Japonocoptis of subgenus Metacoptis, supported by morphological characters, such as traits of the central leaflet base, petal color, and petal shape. The second clade consists of section Metacoptis of subgenus Metacoptis. Coptis morii is not united with C. quinquefolia, in contrast with the view that C. morii is a synonym of C. quinquefolia. Two varieties of C. chinensis do not cluster together. Coptis groenlandica and C. lutescens are reduced to C. trifolia and C. japonica, respectively. Central leaflet base, sepal shape, and petal blade carry a strong phylogenetic signal in Coptis, while leaf type, sepal and petal color, and petal shape exhibit relatively higher levels of evolutionary flexibility. PMID:27044035
The phylogenetic relationships of known mosquito (Diptera: Culicidae) mitogenomes.
Chu, Hongliang; Li, Chunxiao; Guo, Xiaoxia; Zhang, Hengduan; Luo, Peng; Wu, Zhonghua; Wang, Gang; Zhao, Tongyan
2018-01-01
The known mosquito mitogenomes, representing a total of 34 species in five genera, were collected from GenBank, and the practicality and effectiveness of the variation in the complete mitochondrial DNA genome and portions of the mitochondrial COI gene were assessed to reconstruct the phylogeny of mosquitoes. Phylogenetic trees were reconstructed on the basis of parsimony, maximum likelihood, and Bayesian (BI) methods. It is concluded that: (1) Both the mitogenomes and the COI gene support the monophyly of the following taxa: Subgenus Nyssorhynchus, Subgenus Cellia, Anopheles albitarsis complex, Anopheles gambiae complex, and Anopheles punctulatus group; (2) Genus Aedes is not monophyletic relative to Ochlerotatus vigilax; (3) The mitogenome results indicate a close relationship between Anopheles epiroticus and Anopheles gambiae complex, Anopheles dirus complex and Anopheles punctulatus group, respectively; (4) The Bayesian posterior probability (BPP) within the phylogenetic tree reconstructed from mitogenomes is higher than in the COI tree. The results show that phylogenetic relationships reconstructed using the mitogenomes were more similar to those based on morphological data.
Torres-Carvajal, Omar; Schulte, James A; Cadle, John E
2006-04-01
The South American iguanian lizard genus Stenocercus includes 54 species occurring mostly in the Andes and adjacent lowland areas from northern Venezuela and Colombia to central Argentina at elevations of 0-4000m. Small taxon or character sampling has characterized all phylogenetic analyses of Stenocercus, which has long been recognized as sister taxon to the Tropidurus Group. In this study, we use mtDNA sequence data to perform phylogenetic analyses that include 32 species of Stenocercus and 12 outgroup taxa. Monophyly of this genus is strongly supported by maximum parsimony and Bayesian analyses. Evolutionary relationships within Stenocercus are further analyzed with a Bayesian implementation of a general mixture model, which accommodates variability in the pattern of evolution across sites. These analyses indicate a basal split of Stenocercus into two clades, one of which receives very strong statistical support. In addition, we test previous hypotheses using non-parametric and parametric statistical methods, and provide a phylogenetic classification for Stenocercus.
Liang, Li-Jung; Weiss, Robert E; Redelings, Benjamin; Suchard, Marc A
2009-10-01
Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest. We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.
Garamszegi, László Zsolt
2011-02-01
Plasmodium parasites, the causative agents of malaria, are generally considered as harmful parasites, but many of them cause mild symptoms. Little is known about the evolutionary history and phylogenetic constraints that generate this interspecific variation in virulence due to uncertainties about the phylogenetic associations of parasites. Here, to account for such phylogenetic uncertainty, phylogenetic methods based on Bayesian statistics were followed in combination with sequence data from five genes to estimate the ancestral state of virulence in primate Plasmodium parasites. When recent parasites were categorised according to the damage caused to the host, Bayesian estimates of ancestral states indicated that the acquisition of a harmful host exploitation strategy is more likely to be a recent evolutionary event than a result of an ancient change in a character state altering virulence. On the contrary, there was more evidence for moderate host exploitation having a deep origin along the phylogenetic tree. Moreover, the evolution of host severity is determined by the phylogenetic relationships of parasites, as severity gains did not appear randomly on the evolutionary tree. Such phylogenetic constraints can be mediated by the acquisition of virulence genes. As the impact of a parasite on a host is the result of both the parasite's investment in reproduction and host sensitivity, virulence was also estimated by calculating peak parasitemia after eliminating host effects. A directional random-walk evolutionary model showed that the ancestral primate malarias reproduced at very low parasitemia in their hosts. Consequently, the extreme variation in the outcome of malaria infection in different host species can be better understood in light of the phylogeny of parasites. Copyright © 2010 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Kassian, Alexei
2015-01-01
A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456
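The phonetic-similarity matrix described above rests on Levenshtein distances between word forms. The following is a minimal sketch of the plain Levenshtein (edit) distance only; it does not reproduce the consonant-class preprocessing used in the study, and the function names and the example strings are hypothetical illustrations.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def normalized_distance(a: str, b: str) -> float:
    """Edit distance scaled to [0, 1] by the longer string's length
    (one common normalization; an assumption, not the study's choice)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0
```

A pairwise matrix of such normalized distances between wordlists is the kind of input that distance-based methods such as NJ or UPGMA take.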
Taming the BEAST—A Community Teaching Material Resource for BEAST 2
Barido-Sottani, Joëlle; Bošková, Veronika; du Plessis, Louis; Kühnert, Denise; Magnus, Carsten; Mitov, Venelin; Müller, Nicola F.; Pečerska, Jūlija; Rasmussen, David A.; Zhang, Chi; Drummond, Alexei J.; Heath, Tracy A.; Pybus, Oliver G.; Vaughan, Timothy G.; Stadler, Tanja
2018-01-01
Phylogenetics and phylodynamics are central topics in modern evolutionary biology. Phylogenetic methods reconstruct the evolutionary relationships among organisms, whereas phylodynamic approaches reveal the underlying diversification processes that lead to the observed relationships. These two fields have many practical applications in disciplines as diverse as epidemiology, developmental biology, palaeontology, ecology, and linguistics. The combination of increasingly large genetic data sets and increases in computing power is facilitating the development of more sophisticated phylogenetic and phylodynamic methods. Big data sets allow us to answer complex questions. However, since the required analyses are highly specific to the particular data set and question, a black-box method is not sufficient anymore. Instead, biologists are required to be actively involved with modeling decisions during data analysis. The modular design of the Bayesian phylogenetic software package BEAST 2 enables, and in fact enforces, this involvement. At the same time, the modular design enables computational biology groups to develop new methods at a rapid rate. A thorough understanding of the models and algorithms used by inference software is a critical prerequisite for successful hypothesis formulation and assessment. In particular, there is a need for more readily available resources aimed at helping interested scientists equip themselves with the skills to confidently use cutting-edge phylogenetic analysis software. These resources will also benefit researchers who do not have access to similar courses or training at their home institutions. Here, we introduce the “Taming the Beast” (https://taming-the-beast.github.io/) resource, which was developed as part of a workshop series bearing the same name, to facilitate the usage of the Bayesian phylogenetic software package BEAST 2. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. PMID:28673048
Fundamentals and Recent Developments in Approximate Bayesian Computation
Lintusaari, Jarno; Gutmann, Michael U.; Dutta, Ritabrata; Kaski, Samuel; Corander, Jukka
2017-01-01
Bayesian inference plays an important role in phylogenetics, evolutionary biology, and in many other branches of science. It provides a principled framework for dealing with uncertainty and quantifying how it changes in the light of new evidence. For many complex models and inference problems, however, only approximate quantitative answers are obtainable. Approximate Bayesian computation (ABC) refers to a family of algorithms for approximate inference that makes a minimal set of assumptions by only requiring that sampling from a model is possible. We explain here the fundamentals of ABC, review the classical algorithms, and highlight recent developments. [ABC; approximate Bayesian computation; Bayesian inference; likelihood-free inference; phylogenetics; simulator-based models; stochastic simulation models; tree-based models.] PMID:28175922
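The central idea of ABC, that inference needs only the ability to simulate from the model, can be illustrated with the classical rejection sampler. The sketch below is a toy, not code from the review: the simulator, the uniform prior, the tolerance `eps`, and all names are assumptions chosen for illustration.

```python
import random


def abc_rejection(observed, simulate, prior_sample, distance, eps, n_draws, seed=0):
    """Rejection ABC: keep prior draws whose simulated data lie within
    eps of the observed data. No likelihood is ever evaluated."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_draws):
        theta = prior_sample(rng)         # draw a parameter from the prior
        synthetic = simulate(theta, rng)  # forward-simulate a data set
        if distance(synthetic, observed) <= eps:
            accepted.append(theta)
    return accepted


# Hypothetical toy problem: each of N_SITES sites differs between two
# sequences with unknown probability p; we observe the number of differences.
N_SITES = 200
observed_diffs = 30


def prior_sample(rng):
    return rng.random()  # Uniform(0, 1) prior on p


def simulate(p, rng):
    return sum(rng.random() < p for _ in range(N_SITES))


def distance(a, b):
    return abs(a - b)


posterior = abc_rejection(observed_diffs, simulate, prior_sample,
                          distance, eps=2, n_draws=5000)
posterior_mean = sum(posterior) / len(posterior)
```

Shrinking `eps` makes the accepted sample closer to the exact posterior at the cost of a lower acceptance rate, which is the basic trade-off that more refined ABC algorithms try to improve on.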
Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty
Baele, Guy; Lemey, Philippe; Suchard, Marc A.
2016-01-01
Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse, but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of “working distributions” to facilitate—or shorten—the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this purpose, we introduce a “working” distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different “working” distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses. PMID:26526428
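For orientation, the identity underlying stepping-stone estimators can be written compactly. The notation below is an assumption introduced for illustration (data $D$, parameters $\theta$, normalized working distribution $\pi_0$, and powers $0=\beta_0<\beta_1<\dots<\beta_K=1$); it sketches the standard formulation, not the genealogical working distributions proposed in the article.

```latex
% Path of unnormalized densities between the working distribution and the posterior
q_{\beta}(\theta) = \bigl[p(D \mid \theta)\, p(\theta)\bigr]^{\beta}\,
                    \bigl[\pi_0(\theta)\bigr]^{1-\beta},
\qquad z_{\beta} = \int q_{\beta}(\theta)\, d\theta,
\qquad z_0 = 1,\quad z_1 = p(D).

% Marginal likelihood as a telescoping product of ratios
p(D) = \prod_{k=1}^{K} \frac{z_{\beta_k}}{z_{\beta_{k-1}}},
\qquad
\frac{z_{\beta_k}}{z_{\beta_{k-1}}}
  = \mathbb{E}_{\theta \sim q_{\beta_{k-1}}/z_{\beta_{k-1}}}
    \left[ \left( \frac{p(D \mid \theta)\, p(\theta)}{\pi_0(\theta)} \right)^{\beta_k - \beta_{k-1}} \right].
```

Each expectation is estimated by a Monte Carlo average over MCMC samples drawn at power $\beta_{k-1}$. Ordinary SS corresponds to choosing $\pi_0(\theta) = p(\theta)$, in which case the $\beta = 0$ end of the path is the prior itself, which is where very diffuse priors cause the numerical difficulties described above.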
Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
Fourment, Mathieu; Claywell, Brian C; Dinh, Vu; McCoy, Connor; Matsen IV, Frederick A; Darling, Aaron E
2018-01-01
Modern infectious disease outbreak surveillance produces continuous streams of sequence data which require phylogenetic analysis as data arrives. Current software packages for Bayesian phylogenetic inference are unable to quickly incorporate new sequences as they become available, making them less useful for dynamically unfolding evolutionary stories. This limitation can be addressed by applying a class of Bayesian statistical inference algorithms called sequential Monte Carlo (SMC) to conduct online inference, wherein new data can be continuously incorporated to update the estimate of the posterior probability distribution. In this article, we describe and evaluate several different online phylogenetic sequential Monte Carlo (OPSMC) algorithms. We show that proposing new phylogenies with a density similar to the Bayesian prior suffers from poor performance, and we develop “guided” proposals that better match the proposal density to the posterior. Furthermore, we show that the simplest guided proposals can exhibit pathological behavior in some situations, leading to poor results, and that the situation can be resolved by heating the proposal density. The results demonstrate that relative to the widely used MCMC-based algorithm implemented in MrBayes, the total time required to compute a series of phylogenetic posteriors as sequences arrive can be significantly reduced by the use of OPSMC, without incurring a significant loss in accuracy. PMID:29186587
Accurate Phylogenetic Tree Reconstruction from Quartets: A Heuristic Approach
Reaz, Rezwana; Bayzid, Md. Shamsuzzoha; Rahman, M. Sohel
2014-01-01
Supertree methods construct trees on a set of taxa (species) by combining many smaller trees on overlapping subsets of the entire set of taxa. A ‘quartet’ is an unrooted tree over 4 taxa; hence quartet-based supertree methods combine many 4-taxon unrooted trees into a single, coherent tree over the complete set of taxa. Quartet-based phylogeny reconstruction methods have received considerable attention in recent years. An accurate and efficient quartet-based method might be competitive with the current best phylogenetic tree reconstruction methods (such as maximum likelihood or Bayesian MCMC analyses), without being as computationally intensive. In this paper, we present a novel and highly accurate quartet-based phylogenetic tree reconstruction method. We performed an extensive experimental study to evaluate the accuracy and scalability of our approach on both simulated and biological datasets. PMID:25117474
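To make the notion of a quartet concrete, the sketch below enumerates the 4-taxon subsets of a taxon set and reads off the quartet topology that a tree induces on each. It is an illustration only and not the reconstruction method of the paper; representing a tree by its non-trivial splits, the function name, and the example tree are all assumptions.

```python
from itertools import combinations
from math import comb


def induced_quartet(splits, quartet):
    """Return the topology ab|cd that a tree induces on four taxa,
    or None if the tree leaves them unresolved.

    splits: iterable of sets, each holding the taxa on one side of an
            internal edge of the unrooted tree.
    """
    q = set(quartet)
    for side in splits:
        inside = q & set(side)
        if len(inside) == 2:  # this edge separates the four taxa two versus two
            outside = q - inside
            return tuple(sorted([tuple(sorted(inside)), tuple(sorted(outside))]))
    return None


# Hypothetical unrooted tree ((A,B),C,(D,E)) described by its two internal edges.
taxa = ["A", "B", "C", "D", "E"]
splits = [{"A", "B"}, {"D", "E"}]

quartets = {q: induced_quartet(splits, q) for q in combinations(taxa, 4)}
n_quartets = comb(len(taxa), 4)  # number of 4-taxon subsets
```

A tree on n taxa induces one topology for each of the C(n, 4) subsets, so the number of input quartets grows as n^4, which is one reason heuristic combination strategies are of interest.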
Wade, E J; Hertach, T; Gogala, M; Trilar, T; Simon, C
2015-12-01
Molecular species delimitation is increasingly being used to discover and illuminate species level diversity, and a number of methods have been developed. Here, we compare the ability of two molecular species delimitation methods to recover song-delimited species in the Cicadetta montana cryptic species complex throughout Europe. Recent bioacoustics studies of male calling songs (premating reproductive barriers) have revealed cryptic species diversity in this complex. Maximum likelihood and Bayesian phylogenetic analyses were used to analyse the mitochondrial genes COI and COII and the nuclear genes EF1α and period for thirteen European Cicadetta species as well as the closely related monotypic genus Euboeana. Two molecular species delimitation methods, general mixed Yule-coalescent (GMYC) and Bayesian phylogenetics and phylogeography, identified the majority of song-delimited species and were largely congruent with each other. Neither molecular delimitation method was able to fully recover a recent radiation of four Greek species. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.
Arai, Satoru; Gu, Se Hun; Baek, Luck Ju; Tabara, Kenji; Bennett, Shannon; Oh, Hong-Shik; Takada, Nobuhiro; Kang, Hae Ji; Tanaka-Taya, Keiko; Morikawa, Shigeru; Okabe, Nobuhiko; Yanagihara, Richard; Song, Jin-Won
2012-01-01
Spurred by the recent isolation of a novel hantavirus, named Imjin virus (MJNV), from the Ussuri white-toothed shrew (Crocidura lasiura), targeted trapping was conducted for the phylogenetically related Asian lesser white-toothed shrew (Crocidura shantungensis). Pair-wise alignment and comparison of the S, M and L segments of a newfound hantavirus, designated Jeju virus (JJUV), indicated remarkably low nucleotide and amino acid sequence similarity with MJNV. Phylogenetic analyses, using maximum likelihood and Bayesian methods, showed divergent ancestral lineages for JJUV and MJNV, despite the close phylogenetic relationship of their reservoir soricid hosts. Also, no evidence of host switching was apparent in tanglegrams, generated by TreeMap 2.0β. PMID:22230701
HIV Migration Between Blood and Cerebrospinal Fluid or Semen Over Time
Chaillon, Antoine; Gianella, Sara; Wertheim, Joel O.; Richman, Douglas D.; Mehta, Sanjay R.; Smith, David M.
2014-01-01
Previous studies reported associations between neuropathogenesis and human immunodeficiency virus (HIV) compartmentalization in cerebrospinal fluid (CSF) and between sexual transmission and human immunodeficiency virus type 1 (HIV) compartmentalization in semen. It remains unclear, however, how compartmentalization dynamics change over time. To address this, we used statistical methods and Bayesian phylogenetic approaches to reconstruct temporal dynamics of HIV migration between blood and CSF and between blood and the male genital tract. We investigated 11 HIV-infected individuals with paired semen and blood samples and 4 individuals with paired CSF and blood samples. Aligned partial HIV env sequences were analyzed by (1) phylogenetic reconstruction, using a Bayesian Markov-chain Monte Carlo approach; (2) evaluation of viral compartmentalization, using tree-based and distance-based methods; and (3) analysis of migration events, using a discrete Bayesian asymmetric phylogeographic approach of diffusion with Markov jump counts estimation. Finally, we evaluated potential correlates of viral gene flow across anatomical compartments. We observed bidirectional replenishment of viral compartments and asynchronous peaks of viral migration from and to blood over time, suggesting that disruption of viral compartment is transient and directionally selected. These findings imply that viral subpopulations in anatomical sites are an active part of the whole viral population and that compartmental reservoirs could have implications in future eradication studies. PMID:24302756
McGowen, Michael R
2011-09-01
Oceanic dolphins (Delphinidae) are the product of a rapid radiation that yielded ∼36 extant species of small to medium-sized cetaceans that first emerged in the Late Miocene. Although they are a charismatic group of organisms that have become poster children for marine conservation, many phylogenetic relationships within Delphinidae remain elusive due to the slow molecular evolution of the group and the difficulty of resolving short branches from successive cladogenic events. Here I combine existing and newly generated sequences from four mitochondrial (mt) genes and 20 nuclear (nu) genes to reconstruct a well-supported phylogenetic hypothesis for Delphinidae. This study compares maximum-likelihood and Bayesian inference methods of several data sets including mtDNA, combined nuDNA, gene trees of individual nuDNA loci, and concatenated mtDNA+nuDNA. In addition, I contrast these standard phylogenetic analyses with the species tree reconstruction method of Bayesian concordance analysis (BCA). Despite finding discordance between mtDNA and individual nuDNA loci, the concatenated matrix recovers a completely resolved and robustly supported phylogeny that is also broadly congruent with BCA trees. This study strongly supports groupings such as Delphininae, Lissodelphininae, Globicephalinae, Sotalia+Delphininae, Steno+Orcaella+Globicephalinae, and Leucopleurus acutus, Lagenorhynchus albirostris, and Orcinus orca as basal delphinid taxa. Copyright © 2011 Elsevier Inc. All rights reserved.
Treetrimmer: a method for phylogenetic dataset size reduction.
Maruyama, Shinichiro; Eveleigh, Robert J M; Archibald, John M
2013-04-12
With rapid advances in genome sequencing and bioinformatics, it is now possible to generate phylogenetic trees containing thousands of operational taxonomic units (OTUs) from a wide range of organisms. However, use of rigorous tree-building methods on such large datasets is prohibitive and manual 'pruning' of sequence alignments is time consuming and raises concerns over reproducibility. There is a need for bioinformatic tools with which to objectively carry out such pruning procedures. Here we present 'TreeTrimmer', a bioinformatics procedure that removes unnecessary redundancy in large phylogenetic datasets, alleviating the size effect on more rigorous downstream analyses. The method identifies and removes user-defined 'redundant' sequences, e.g., orthologous sequences from closely related organisms and 'recently' evolved lineage-specific paralogs. Representative OTUs are retained for more rigorous re-analysis. TreeTrimmer reduces the OTU density of phylogenetic trees without sacrificing taxonomic diversity while retaining the original tree topology, thereby speeding up downstream computer-intensive analyses, e.g., Bayesian and maximum likelihood tree reconstructions, in a reproducible fashion.
Choosing and Using Introns in Molecular Phylogenetics
Creer, Simon
2007-01-01
Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984
Höhna, Sebastian; Landis, Michael J; Heath, Tracy A; Boussau, Bastien; Lartillot, Nicolas; Moore, Brian R; Huelsenbeck, John P; Ronquist, Fredrik
2016-07-01
Programs for Bayesian inference of phylogeny currently implement a unique and fixed suite of models. Consequently, users of these software packages are simultaneously forced to use a number of programs for a given study, while also lacking the freedom to explore models that have not been implemented by the developers of those programs. We developed a new open-source software package, RevBayes, to address these problems. RevBayes is entirely based on probabilistic graphical models, a powerful generic framework for specifying and analyzing statistical models. Phylogenetic-graphical models can be specified interactively in RevBayes, piece by piece, using a new succinct and intuitive language called Rev. Rev is similar to the R language and the BUGS model-specification language, and should be easy to learn for most users. The strength of RevBayes is the simplicity with which one can design, specify, and implement new and complex models. Fortunately, this tremendous flexibility does not come at the cost of slower computation; as we demonstrate, RevBayes outperforms competing software for several standard analyses. Compared with other programs, RevBayes has fewer black-box elements. Users need to explicitly specify each part of the model and analysis. Although this explicitness may initially be unfamiliar, we are convinced that this transparency will improve understanding of phylogenetic models in our field. Moreover, it will motivate the search for improvements to existing methods by brazenly exposing the model choices that we make to critical scrutiny. RevBayes is freely available at http://www.RevBayes.com. [Bayesian inference; Graphical models; MCMC; statistical phylogenetics.] © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. PMID:27235697
Makowsky, Robert; Cox, Christian L; Roelke, Corey; Chippindale, Paul T
2010-11-01
Determining the appropriate gene for phylogeny reconstruction can be a difficult process. Rapidly evolving genes tend to resolve recent relationships, but suffer from alignment issues and increased homoplasy among distantly related species. Conversely, slowly evolving genes generally perform best for deeper relationships, but lack sufficient variation to resolve recent relationships. We determine the relationship between sequence divergence and Bayesian phylogenetic reconstruction ability using both natural and simulated datasets. The natural data are based on 28 well-supported relationships within the subphylum Vertebrata. Sequences of 12 genes were acquired and Bayesian analyses were used to determine phylogenetic support for correct relationships. Simulated datasets were designed to determine whether an optimal range of sequence divergence exists across extreme phylogenetic conditions. Across all genes we found that an optimal range of divergence for resolving the correct relationships does exist, although, as expected, this level of divergence depends on the distance metric. Simulated datasets show that an optimal range of sequence divergence exists across diverse topologies and models of evolution. We determine that a simple-to-measure property of genetic sequences (genetic distance) is related to phylogenetic reconstruction ability in Bayesian analyses. This information should be useful for selecting the most informative gene to resolve any relationships, especially those that are difficult to resolve, as well as minimizing both cost and confounding information during project design. Copyright © 2010. Published by Elsevier Inc.
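Because the optimal divergence range depends on the distance metric, it may help to see two common metrics side by side. The sketch below computes the uncorrected p-distance and the Jukes-Cantor (JC69) corrected distance for a pair of aligned sequences; the abstract does not say which metrics the study used, so the choice of these two, the function names, and the example sequences are assumptions.

```python
import math


def p_distance(seq1: str, seq2: str) -> float:
    """Proportion of differing sites between two aligned sequences,
    ignoring any column where either sequence is not A, C, G, or T."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(seq1.upper(), seq2.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def jc69_distance(p: float) -> float:
    """Jukes-Cantor correction: expected substitutions per site implied by
    an observed proportion p of differing sites."""
    if p >= 0.75:
        return math.inf  # saturation: the correction is undefined
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


# Hypothetical aligned fragments, differing at one of eight sites.
p = p_distance("ACGTACGT", "ACGTACGA")
d = jc69_distance(p)
```

The corrected distance always exceeds the raw proportion for p > 0, and the gap widens as sequences approach saturation, so a divergence threshold stated in one metric does not transfer directly to the other.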
The riddle of Tasmanian languages
Bowern, Claire
2012-01-01
Recent work which combines methods from linguistics and evolutionary biology has been fruitful in discovering the history of major language families because of similarities in evolutionary processes. Such work opens up new possibilities for language research on previously unsolvable problems, especially in areas where information from other sources may be lacking. I use phylogenetic methods to investigate Tasmanian languages. Existing materials are so fragmentary that scholars have been unable to discover how many languages are represented in the sources. Using a clustering algorithm which identifies admixture, source materials representing more than one language are identified. Using the Neighbor-Net algorithm, 12 languages are identified in five clusters. Bayesian phylogenetic methods reveal that the families are not demonstrably related; an important result, given the importance of Tasmanian Aborigines for information about how societies have responded to population collapse in prehistory. This work provides insight into the societies of prehistoric Tasmania and illustrates a new utility of phylogenetics in reconstructing linguistic history. PMID:23015621
Fast algorithms for computing phylogenetic divergence time.
Crosby, Ralph W; Williams, Tiffani L
2017-12-06
The inference of species divergence time is a key step in most phylogenetic studies. Methods have been available for the last ten years to perform the inference, but the performance of the methods does not yet scale well to studies with hundreds of taxa and thousands of DNA base pairs. For example, a study of 349 primate taxa was estimated to require over 9 months of processing time. In this work, we present a new algorithm, AncestralAge, that significantly improves the performance of the divergence time process. As part of AncestralAge, we demonstrate a new method for the computation of phylogenetic likelihood, and our experiments show a 90% improvement in likelihood computation time on the aforementioned dataset of 349 primate taxa with over 60,000 DNA base pairs. Additionally, we show that our new method for the computation of the Bayesian prior on node ages reduces the running time for this computation on the 349-taxon dataset by 99%. Through the use of these new algorithms we open up the ability to perform divergence time inference on large phylogenetic studies.
A Bayesian framework to estimate diversification rates and their variation through time and space
2011-01-01
Background Patterns of species diversity are the result of speciation and extinction processes, and molecular phylogenetic data can provide valuable information to derive their variability through time and across clades. Bayesian Markov chain Monte Carlo methods offer a promising framework to incorporate phylogenetic uncertainty when estimating rates of diversification. Results We introduce a new approach to estimate diversification rates in a Bayesian framework over a distribution of trees under various constant and variable rate birth-death and pure-birth models, and test it on simulated phylogenies. Furthermore, speciation and extinction rates and their posterior credibility intervals can be estimated while accounting for non-random taxon sampling. The framework is particularly suitable for hypothesis testing using Bayes factors, as we demonstrate analyzing dated phylogenies of Chondrostoma (Cyprinidae) and Lupinus (Fabaceae). In addition, we develop a model that extends the rate estimation to a meta-analysis framework in which different data sets are combined in a single analysis to detect general temporal and spatial trends in diversification. Conclusions Our approach provides a flexible framework for the estimation of diversification parameters and hypothesis testing while simultaneously accounting for uncertainties in the divergence times and incomplete taxon sampling. PMID:22013891
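As a brief illustration of the Bayes factor comparison mentioned in the abstract above, the sketch below computes a log Bayes factor as the difference between two log marginal likelihoods. The function name and the numerical values are hypothetical and are not taken from the study; how the marginal likelihoods themselves are estimated is not shown.

```python
def log_bayes_factor(log_ml_model1, log_ml_model2):
    """Return ln BF in favour of model 1 over model 2.

    Both arguments are natural-log marginal likelihoods, so the
    Bayes factor is a simple difference on the log scale.
    """
    return log_ml_model1 - log_ml_model2


# Hypothetical log marginal likelihoods (illustrative values only).
log_ml_constant_rate = -1520.4
log_ml_variable_rate = -1515.1

log_bf = log_bayes_factor(log_ml_variable_rate, log_ml_constant_rate)
two_ln_bf = 2.0 * log_bf  # the form in which Bayes factors are often reported

print(f"ln BF = {log_bf:.1f}, 2 ln BF = {two_ln_bf:.1f}")
```

Working on the log scale avoids numerical underflow, since marginal likelihoods of phylogenetic models are typically far too small to represent directly.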
Abdul-Latiff, Muhammad Abu Bakar; Ruslin, Farhani; Fui, Vun Vui; Abu, Mohd-Hashim; Rovie-Ryan, Jeffrine Japning; Abdul-Patah, Pazil; Lakim, Maklarin; Roos, Christian; Yaakop, Salmah; Md-Zain, Badrul Munir
2014-01-01
Phylogenetic relationships among Malaysia’s long-tailed macaques have yet to be established, despite abundant genetic studies of the species worldwide. The aims of this study are to examine the phylogenetic relationships of Macaca fascicularis in Malaysia and to test its classification as a morphological subspecies. A total of 25 genetic samples of M. fascicularis yielding 383 bp of Cytochrome b (Cyt b) sequences were used in phylogenetic analysis along with one sample each of M. nemestrina and M. arctoides used as outgroups. Sequence character analysis reveals that the Cyt b locus is a highly conserved region with only 23% parsimony-informative characters detected among ingroups. Further analysis indicates a clear separation between populations originating from different regions: the Malay Peninsula versus Borneo Insular, the East Coast versus West Coast of the Malay Peninsula, and the island versus mainland Malay Peninsula populations. Phylogenetic trees (NJ, MP and Bayesian) portray a consistent clustering paradigm as Borneo’s population was distinguished from Peninsula’s population (99% and 100% bootstrap value in NJ and MP respectively and 1.00 posterior probability in Bayesian trees). The East coast population was separated from other Peninsula populations (64% in NJ, 66% in MP and 0.53 posterior probability in Bayesian). West coast populations were divided into 2 clades: the North-South (47%/54% in NJ, 26%/26% in MP and 1.00/0.80 posterior probability in Bayesian) and Island-Mainland (93% in NJ, 90% in MP and 1.00 posterior probability in Bayesian). The results confirm the previous morphological assignment of 2 subspecies, M. f. fascicularis and M. f. argentimembris, in the Malay Peninsula. These populations should be treated as separate genetic entities in order to conserve the genetic diversity of Malaysia’s M. fascicularis. These findings are crucial in aiding the conservation management and translocation process of M. fascicularis populations in Malaysia. PMID:24899832
Dutra Vieira, Thainá; Pegoraro de Macedo, Marcia Raquel; Fedatto Bernardon, Fabiana; Müller, Gertrud
2017-10-01
The nematode Diplotriaena bargusinica is a bird air sac parasite, and its taxonomy is based mainly on morphological and morphometric characteristics. Increasing knowledge of genetic information variability has spurred the use of DNA markers in conjunction with morphological data for inferring phylogenetic relationships in different taxa. Considering the potential of molecular biology in taxonomy, this study presents the morphological and molecular characterization of D. bargusinica, and establishes the phylogenetic position of the nematode in Spirurina. Twenty partial sequences of the 18S region of D. bargusinica rDNA were generated. Phylogenetic trees were obtained through the Maximum Likelihood and Bayesian Inference methods where both had similar topology. The group Diplotriaenoidea is monophyletic and the topologies generated corroborate the phylogenetic studies based on traditional and previously performed molecular taxonomy. This study is the first to generate molecular data associated with the morphology of the species. Copyright © 2017 Elsevier B.V. All rights reserved.
Romero-Severson, Ethan O.; Bulla, Ingo; Hengartner, Nick; Bártolo, Inês; Abecasis, Ana; Azevedo-Pereira, José M.; Taveira, Nuno; Leitner, Thomas
2017-01-01
Diversity of the founding population of Human Immunodeficiency Virus Type 1 (HIV-1) transmissions raises many important biological, clinical, and epidemiological issues. In up to 40% of sexual infections, there is clear evidence for multiple founding variants, which can influence the efficacy of putative prevention methods, and the reconstruction of epidemiologic histories. To infer who-infected-whom, and to compute the probability of alternative transmission scenarios while explicitly taking phylogenetic uncertainty into account, we created an approximate Bayesian computation (ABC) method based on a set of statistics measuring phylogenetic topology, branch lengths, and genetic diversity. We applied our method to a suspected heterosexual transmission case involving three individuals, showing a complex monophyletic-paraphyletic-polyphyletic phylogenetic topology. We detected that seven phylogenetic lineages had been transmitted between two of the individuals based on the available samples, implying that many more unsampled lineages had also been transmitted. Testing whether the lineages had been transmitted at one time or over some length of time suggested that an ongoing superinfection process over several years was most likely. While one individual was found unlinked to the other two, surprisingly, when evaluating two competing epidemiological priors, the donor of the two that did infect each other was not identified by the host root-label, and was also not the primary suspect in that transmission. This highlights that it is important to take epidemiological information into account when analyzing support for one transmission hypothesis over another, as results may be nonintuitive and sensitive to details about sampling dates relative to possible infection dates. 
Our study provides a formal inference framework to include information on infection and sampling times, and to investigate ancestral node-label states, transmission direction, transmitted genetic diversity, and frequency of transmission. PMID:28912340
The space of ultrametric phylogenetic trees.
Gavryushkin, Alex; Drummond, Alexei J
2016-08-21
The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. In particular, the summary tree minimising the squared distance to the trees in the sample may differ between parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
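The two parameterisations named in the abstract above can be illustrated with a toy calculation. The sketch below is not the authors' metric space construction (which also has to handle changes in tree topology); it assumes two time-trees sharing the same ranked topology, with tips at time 0, and uses made-up divergence times. It only shows that a plain Euclidean distance between the same pair of trees can differ depending on whether the trees are written as absolute divergence times or as inter-coalescent interval lengths.

```python
import math


def intervals_from_times(times):
    """Convert absolute divergence times (tips at time 0) into
    inter-coalescent interval lengths, ordered from the tips upward."""
    intervals = []
    previous = 0.0
    for t in sorted(times):
        intervals.append(t - previous)
        previous = t
    return intervals


def euclidean(a, b):
    """Euclidean distance between two equal-length coordinate vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical divergence times for two trees with the same ranked topology.
tree_a_times = [1.0, 2.0, 4.0]
tree_b_times = [1.0, 3.0, 4.0]

tree_a_intervals = intervals_from_times(tree_a_times)  # [1.0, 1.0, 2.0]
tree_b_intervals = intervals_from_times(tree_b_times)  # [1.0, 2.0, 1.0]

dist_times = euclidean(tree_a_times, tree_b_times)
dist_intervals = euclidean(tree_a_intervals, tree_b_intervals)

print(dist_times, dist_intervals)
```

Moving a single node changes one coordinate in the divergence-time parameterisation but two adjacent coordinates in the interval parameterisation, which is why the two distances disagree here.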
The second molecular epidemiological study of HIV infection in Mongolia between 2010 and 2016.
Jagdagsuren, Davaalkham; Hayashida, Tsunefusa; Takano, Misao; Gombo, Erdenetuya; Zayasaikhan, Setsen; Kanayama, Naomi; Tsuchiya, Kiyoto; Oka, Shinichi
2017-01-01
Our previous 2005-2009 molecular epidemiological study in Mongolia identified a hot spot of HIV-1 transmission in men who have sex with men (MSM). To control the infection, we collaborated with NGOs to promote safer sex and HIV testing since mid-2010. In this study, we carried out the second molecular epidemiological survey between 2010 and 2016 to determine the status of HIV-1 infection in Mongolia. The study included 143 new cases of HIV-1 infection. Viral RNA was extracted from stocked plasma samples and sequenced for the pol and the env regions using the Sanger method. Near-full length sequencing using MiSeq was performed in 3 patients who were suspected to be infected with recombinant HIV-1. Phylogenetic analysis was performed using the neighbor-joining method and Bayesian Markov chain Monte Carlo method. MSM was the main transmission route in the previous and current studies. However, heterosexual route showed a significant increase in recent years. Phylogenetic analysis documented three taxa; Mongolian B, Korean B, and CRF51_01B, though the former two were also observed in the previous study. CRF51_01B, which originated from Singapore and Malaysia, was confirmed by near-full length sequencing. Although these strains were mainly detected in MSM, they were also found in increasing numbers of heterosexual males and females. Bayesian phylogenetic analysis estimated transmission of CRF51_01B into Mongolia around early 2000s. An extended Bayesian skyline plot showed a rapid increase in the effective population size of Mongolian B cluster around 2004 and that of CRF51_01B cluster around 2011. HIV-1 infection might expand to the general population in Mongolia. Our study documented a new cluster of HIV-1 transmission, enhancing our understanding of the epidemiological status of HIV-1 in Mongolia.
Wilcox, Thomas P; Zwickl, Derrick J; Heath, Tracy A; Hillis, David M
2002-11-01
Four New World genera of dwarf boas (Exiliboa, Trachyboa, Tropidophis, and Ungaliophis) have been placed by many systematists in a single group (traditionally called Tropidophiidae). However, the monophyly of this group has been questioned in several studies. Moreover, the overall relationships among basal snake lineages, including the placement of the dwarf boas, are poorly understood. We obtained mtDNA sequence data for 12S, 16S, and intervening tRNA-val genes from 23 species of snakes representing most major snake lineages, including all four genera of New World dwarf boas. We then examined the phylogenetic position of these species by estimating the phylogeny of the basal snakes. Our phylogenetic analysis suggests that New World dwarf boas are not monophyletic. Instead, we find Exiliboa and Ungaliophis to be most closely related to sand boas (Erycinae), boas (Boinae), and advanced snakes (Caenophidea), whereas Tropidophis and Trachyboa form an independent clade that separated relatively early in snake radiation. Our estimate of snake phylogeny differs significantly in other ways from some previous estimates of snake phylogeny. For instance, pythons do not cluster with boas and sand boas, but instead show a strong relationship with Loxocemus and Xenopeltis. Additionally, uropeltids cluster strongly with Cylindrophis, and together are embedded in what has previously been considered the macrostomatan radiation. These relationships are supported by both bootstrapping (parametric and nonparametric approaches) and Bayesian analysis, although Bayesian support values are consistently higher than those obtained from nonparametric bootstrapping. Simulations show that Bayesian support values represent much better estimates of phylogenetic accuracy than do nonparametric bootstrap support values, at least under the conditions of our study. Copyright 2002 Elsevier Science (USA)
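For readers unfamiliar with the nonparametric bootstrap referred to in the abstract above, the following minimal sketch shows its resampling step: alignment columns (sites) are drawn with replacement to build a pseudo-replicate alignment of the same length. The toy sequences, taxon labels, and function name are hypothetical; in a real analysis a tree would then be estimated from each replicate, which is not shown here.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Resample alignment columns with replacement.

    alignment: dict mapping taxon name -> sequence string (equal lengths).
    rng: a random.Random instance, passed in so results are reproducible.
    """
    n_sites = len(next(iter(alignment.values())))
    # Draw n_sites column indices with replacement.
    columns = [rng.randrange(n_sites) for _ in range(n_sites)]
    # Apply the same column draw to every taxon so that sites stay intact.
    return {
        taxon: "".join(seq[i] for i in columns)
        for taxon, seq in alignment.items()
    }


# Toy alignment (made-up sequences, not data from the study).
toy_alignment = {
    "taxon1": "ACGTAC",
    "taxon2": "ACGTTC",
    "taxon3": "ACCTTG",
}

replicate = bootstrap_alignment(toy_alignment, random.Random(1))
for taxon, seq in replicate.items():
    print(taxon, seq)
```

The same column indices are applied to every taxon so that each resampled site keeps its original pattern of character states across taxa.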
Bayesian molecular dating: opening up the black box.
Bromham, Lindell; Duchêne, Sebastián; Hua, Xia; Ritchie, Andrew M; Duchêne, David A; Ho, Simon Y W
2018-05-01
Molecular dating analyses allow evolutionary timescales to be estimated from genetic data, offering an unprecedented capacity for investigating the evolutionary past of all species. These methods require us to make assumptions about the relationship between genetic change and evolutionary time, often referred to as a 'molecular clock'. Although initially regarded with scepticism, molecular dating has now been adopted in many areas of biology. This broad uptake has been due partly to the development of Bayesian methods that allow complex aspects of molecular evolution, such as variation in rates of change across lineages, to be taken into account. But in order to do this, Bayesian dating methods rely on a range of assumptions about the evolutionary process, which vary in their degree of biological realism and empirical support. These assumptions can have substantial impacts on the estimates produced by molecular dating analyses. The aim of this review is to open the 'black box' of Bayesian molecular dating and have a look at the machinery inside. We explain the components of these dating methods, the important decisions that researchers must make in their analyses, and the factors that need to be considered when interpreting results. We illustrate the effects that the choices of different models and priors can have on the outcome of the analysis, and suggest ways to explore these impacts. We describe some major research directions that may improve the reliability of Bayesian dating. The goal of our review is to help researchers to make informed choices when using Bayesian phylogenetic methods to estimate evolutionary rates and timescales. © 2017 Cambridge Philosophical Society.
Hurtado, Luis A; Santamaria, Carlos A; Fitzgerald, Lee A
2014-05-06
The phylogenetic position of the critically endangered Saint Croix ground lizard Ameiva polops is presently unknown and several hypotheses have been proposed. We investigated the phylogenetic position of this species using molecular phylogenetic methods. We obtained sequences of DNA fragments of the mitochondrial ribosomal genes 12S rDNA and 16S rDNA for this species. We aligned these sequences with published sequences of other Ameiva species, which include most of the Ameiva species from the West Indies, three Ameiva species from Central America and South America, and one from the teiid lizard Tupinambis teguixin, which was used as outgroup. We conducted Maximum Likelihood and Bayesian phylogenetic analyses. The phylogenetic reconstructions among the different methods were very similar, supporting the monophyly of West Indian Ameiva and showing within this lineage, a basal polytomy of four clades that are separated geographically. Ameiva polops grouped in a cluster that included the other two Ameiva species found in the Puerto Rican Bank: A. wetmorei and A. exsul. A sister relationship between A. polops and A. wetmorei is suggested by our analyses. We compare our results with a previous study on molecular systematics of West Indian Ameiva.
Khan, Haseeb A; Arif, Ibrahim A; Bahkali, Ali H; Al Farhan, Ahmad H; Al Homaidan, Ali A
2008-10-06
This investigation aimed to compare the inference of antelope phylogenies resulting from the 16S rRNA, cytochrome-b (cyt-b) and d-loop segments of mitochondrial DNA using three different computational models including Bayesian (BA), maximum parsimony (MP) and unweighted pair group method with arithmetic mean (UPGMA). The respective nucleotide sequences of three Oryx species (Oryx leucoryx, Oryx dammah and Oryx gazella) and an out-group (Addax nasomaculatus) were aligned and subjected to BA, MP and UPGMA models for comparing the topologies of respective phylogenetic trees. The 16S rRNA region possessed the highest frequency of conserved sequences (97.65%) followed by cyt-b (94.22%) and d-loop (87.29%). There were few transitions (2.35%) and no transversions in 16S rRNA as compared to cyt-b (5.61% transitions and 0.17% transversions) and d-loop (11.57% transitions and 1.14% transversions) while comparing the four taxa. All three mitochondrial segments clearly differentiated the genus Addax from Oryx using the BA or UPGMA models. The topologies of all the gamma-corrected Bayesian trees were identical irrespective of the marker type. The UPGMA trees resulting from 16S rRNA and d-loop sequences were also identical (Oryx dammah grouped with Oryx leucoryx) to Bayesian trees except that the UPGMA tree based on cyt-b showed a slightly different phylogeny (Oryx dammah grouped with Oryx gazella) with a low bootstrap support. However, the MP model failed to differentiate the genus Addax from Oryx. These findings demonstrate the efficiency and robustness of BA and UPGMA methods for phylogenetic analysis of antelopes using mitochondrial markers. PMID:19204824
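The UPGMA method named in the abstract above can be sketched in a few lines. This is a minimal illustration that returns only the tree topology as nested tuples (branch lengths are omitted); the taxon labels A to D and the pairwise distances are invented for the example and do not correspond to the Oryx or Addax data.

```python
from itertools import combinations


def upgma(names, distances):
    """Cluster taxa with UPGMA and return the topology as nested tuples.

    names: list of taxon labels.
    distances: dict mapping (name1, name2) -> pairwise distance.
    """
    sizes = {name: 1 for name in names}  # cluster -> number of taxa in it
    d = {frozenset(pair): value for pair, value in distances.items()}
    while len(sizes) > 1:
        # Join the two clusters separated by the smallest distance.
        a, b = min(combinations(sizes, 2), key=lambda p: d[frozenset(p)])
        merged = (a, b)
        size_a, size_b = sizes.pop(a), sizes.pop(b)
        # New distances are size-weighted means, i.e. the arithmetic mean
        # over all original taxon pairs spanning the two clusters.
        for other in sizes:
            d[frozenset((merged, other))] = (
                size_a * d[frozenset((a, other))]
                + size_b * d[frozenset((b, other))]
            ) / (size_a + size_b)
        sizes[merged] = size_a + size_b
    (tree,) = sizes
    return tree


# Hypothetical distance matrix for four placeholder taxa.
toy_distances = {
    ("A", "B"): 2.0,
    ("A", "C"): 4.0,
    ("B", "C"): 4.0,
    ("A", "D"): 8.0,
    ("B", "D"): 8.0,
    ("C", "D"): 8.0,
}

tree = upgma(["A", "B", "C", "D"], toy_distances)
print(tree)
```

With these distances, A and B are joined first, C joins that pair next, and D attaches last, so D plays the role an outgroup would in a real data set.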
Ritchie, Andrew M; Lo, Nathan; Ho, Simon Y W
2017-05-01
In Bayesian phylogenetic analyses of genetic data, prior probability distributions need to be specified for the model parameters, including the tree. When Bayesian methods are used for molecular dating, available tree priors include those designed for species-level data, such as the pure-birth and birth-death priors, and coalescent-based priors designed for population-level data. However, molecular dating methods are frequently applied to data sets that include multiple individuals across multiple species. Such data sets violate the assumptions of both the speciation and coalescent-based tree priors, making it unclear which should be chosen and whether this choice can affect the estimation of node times. To investigate this problem, we used a simulation approach to produce data sets with different proportions of within- and between-species sampling under the multispecies coalescent model. These data sets were then analyzed under pure-birth, birth-death, constant-size coalescent, and skyline coalescent tree priors. We also explored the ability of Bayesian model testing to select the best-performing priors. We confirmed the applicability of our results to empirical data sets from cetaceans, phocids, and coregonid whitefish. Estimates of node times were generally robust to the choice of tree prior, but some combinations of tree priors and sampling schemes led to large differences in the age estimates. In particular, the pure-birth tree prior frequently led to inaccurate estimates for data sets containing a mixture of inter- and intraspecific sampling, whereas the birth-death and skyline coalescent priors produced stable results across all scenarios. Model testing provided an adequate means of rejecting inappropriate tree priors. Our results suggest that tree priors do not strongly affect Bayesian molecular dating results in most cases, even when severely misspecified. 
However, the choice of tree prior can be significant for the accuracy of dating results in the case of data sets with mixed inter- and intraspecies sampling. [Bayesian phylogenetic methods; model testing; molecular dating; node time; tree prior.]. © The authors 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please e-mail: journals.permission@oup.com.
Borths, Matthew R; Holroyd, Patricia A; Seiffert, Erik R
2016-01-01
Hyaenodonta is a diverse, extinct group of carnivorous mammals that included weasel- to rhinoceros-sized species. The oldest-known hyaenodont fossils are from the middle Paleocene of North Africa and the antiquity of the group in Afro-Arabia led to the hypothesis that it originated there and dispersed to Asia, Europe, and North America. Here we describe two new hyaenodont species based on the oldest hyaenodont cranial specimens known from Afro-Arabia. The material was collected from the latest Eocene Locality 41 (L-41, ∼34 Ma) in the Fayum Depression, Egypt. Akhnatenavus nefertiticyon sp. nov. has specialized, hypercarnivorous molars and an elongate cranial vault. In A. nefertiticyon the tallest, piercing cusp on M1–M2 is the paracone. Brychotherium ephalmos gen. et sp. nov. has more generalized molars that retain the metacone and complex talonids. In B. ephalmos the tallest, piercing cusp on M1–M2 is the metacone. We incorporate this new material into a series of phylogenetic analyses using a character-taxon matrix that includes novel dental, cranial, and postcranial characters, and samples extensively from the global record of the group. The phylogenetic analysis includes the first application of Bayesian methods to hyaenodont relationships. B. ephalmos is consistently placed within Teratodontinae, an Afro-Arabian clade with several generalist and hypercarnivorous forms, and Akhnatenavus is consistently recovered in Hyainailourinae as part of an Afro-Arabian radiation. The phylogenetic results suggest that hypercarnivory evolved independently three times within Hyaenodonta: in Teratodontinae, in Hyainailourinae, and in Hyaenodontinae. Teratodontines are consistently placed in a close relationship with Hyainailouridae (Hyainailourinae + Apterodontinae) to the exclusion of "proviverrines," hyaenodontines, and several North American clades, and we propose that the superfamily Hyainailouroidea be used to describe this relationship. Using the topologies recovered from each phylogenetic method, we reconstructed the biogeographic history of Hyaenodonta using parsimony optimization (PO), likelihood optimization (LO), and Bayesian Binary Markov chain Monte Carlo (MCMC) to examine support for the Afro-Arabian origin of Hyaenodonta. Across all analyses, we found that Hyaenodonta most likely originated in Europe, rather than Afro-Arabia. The clade is estimated by tip-dating analysis to have undergone a rapid radiation in the Late Cretaceous and Paleocene, a radiation currently not documented by fossil evidence. During the Paleocene, lineages are reconstructed as dispersing to Asia, Afro-Arabia, and North America. The place of origin of Hyainailouroidea is likely Afro-Arabia according to the Bayesian topologies but it is ambiguous using parsimony. All topologies support the constituent clades (Hyainailourinae, Apterodontinae, and Teratodontinae) as Afro-Arabian, and tip-dating estimates that each clade was established in Afro-Arabia by the middle Eocene. PMID:27867761
Dediu, Dan
2011-02-07
Language is a hallmark of our species and understanding linguistic diversity is an area of major interest. Genetic factors influencing the cultural transmission of language provide a powerful and elegant explanation for aspects of present-day linguistic diversity and a window into the emergence and evolution of language. In particular, it has recently been proposed that linguistic tone (the use of voice pitch to convey lexical and grammatical meaning) is biased by two genes involved in brain growth and development, ASPM and Microcephalin. This hypothesis predicts that tone is a stable characteristic of language because of its 'genetic anchoring'. The present paper tests this prediction using a Bayesian phylogenetic framework applied to a large set of linguistic features and language families, using multiple software implementations, data codings, stability estimations, linguistic classifications and outgroup choices. The results of these different methods and datasets largely agree, suggesting that this approach produces reliable estimates of the stability of linguistic data. Moreover, linguistic tone is found to be stable across methods and datasets, providing suggestive support for the hypothesis of genetic influences on its distribution.
Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees.
Martínez-Aquino, Andrés
2016-08-01
Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host-parasite. In the past 2 decades, algorithms have been developed for cophylogenetic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetic inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, advances in phylogenetic biogeography that apply methods based on parametric process models and combined Bayesian approaches can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a "compass" when "walking" through jungles of tangled phylogenetic trees.
Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees
2016-01-01
Abstract Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host–parasite. In the past 2 decades, algorithms have been developed for cophylogenetic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetic inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, advances in phylogenetic biogeography that apply methods based on parametric process models and combined Bayesian approaches can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a “compass” when “walking” through jungles of tangled phylogenetic trees. PMID:29491928
Slater, Graham J; Harmon, Luke J; Wegmann, Daniel; Joyce, Paul; Revell, Liam J; Alfaro, Michael E
2012-03-01
In recent years, a suite of methods has been developed to fit multiple rate models to phylogenetic comparative data. However, most methods have limited utility at broad phylogenetic scales because they typically require complete sampling of both the tree and the associated phenotypic data. Here, we develop and implement a new, tree-based method called MECCA (Modeling Evolution of Continuous Characters using ABC) that uses a hybrid likelihood/approximate Bayesian computation (ABC)-Markov-Chain Monte Carlo approach to simultaneously infer rates of diversification and trait evolution from incompletely sampled phylogenies and trait data. We demonstrate via simulation that MECCA has considerable power to choose among single versus multiple evolutionary rate models, and thus can be used to test hypotheses about changes in the rate of trait evolution across an incomplete tree of life. We finally apply MECCA to an empirical example of body size evolution in carnivores, and show that there is no evidence for an elevated rate of body size evolution in the pinnipeds relative to terrestrial carnivores. ABC approaches can provide a useful alternative set of tools for future macroevolutionary studies where likelihood-dependent approaches are lacking. © 2011 The Author(s). Evolution © 2011 The Society for the Study of Evolution.
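The approximate Bayesian computation idea named in the abstract above can be illustrated with a minimal rejection-ABC sketch. This is not MECCA and not its hybrid ABC-MCMC scheme. It assumes a star phylogeny (independent tips), a Brownian-motion rate `sigma2`, a uniform prior, and the sample variance as the only summary statistic. All function names, the tolerance, and the simulated "observed" data are hypothetical choices made for illustration.

```python
import random
import statistics


def simulate_tips(sigma2, n_tips, depth, rng):
    """Simulate tip values under Brownian motion on a star tree.

    Assumption: tips are independent, so each tip is Normal(0, sigma2 * depth).
    """
    sd = (sigma2 * depth) ** 0.5
    return [rng.gauss(0.0, sd) for _ in range(n_tips)]


def abc_rejection(observed, depth, n_draws, tolerance, prior_max, rng):
    """Keep prior draws of sigma2 whose simulated variance is close to the observed one."""
    obs_var = statistics.variance(observed)
    accepted = []
    for _ in range(n_draws):
        sigma2 = rng.uniform(0.0, prior_max)          # draw from the uniform prior
        sim = simulate_tips(sigma2, len(observed), depth, rng)
        if abs(statistics.variance(sim) - obs_var) <= tolerance:
            accepted.append(sigma2)                   # approximate posterior sample
    return accepted


rng = random.Random(1)
# Hypothetical "observed" data, simulated here with a known rate of 2.0.
observed = simulate_tips(sigma2=2.0, n_tips=50, depth=1.0, rng=rng)
posterior = abc_rejection(observed, depth=1.0, n_draws=5000,
                          tolerance=0.2, prior_max=10.0, rng=rng)
print(len(posterior), "accepted draws; posterior mean:", statistics.mean(posterior))
```

The accepted draws approximate the posterior of `sigma2` without evaluating a likelihood, which is the property that makes ABC attractive when only simulation is feasible. A smaller tolerance gives a better approximation at the cost of fewer accepted draws.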
Harlin-Cognato, April D; Honeycutt, Rodney L
2006-01-01
Background Dolphins of the genus Lagenorhynchus are anti-tropically distributed in temperate to cool waters. Phylogenetic analyses of cytochrome b sequences have suggested that the genus is polyphyletic; however, many relationships were poorly resolved. In this study, we present a combined-analysis phylogenetic hypothesis for Lagenorhynchus and members of the subfamily Lissodelphininae, which is derived from two nuclear and two mitochondrial data sets and the addition of 34 individuals representing 9 species. In addition, we characterize with parsimony and Bayesian analyses the phylogenetic utility and interaction of characters with statistical measures, including the utility of highly consistent (non-homoplasious) characters as a conservative measure of phylogenetic robustness. We also explore the effects of removing sources of character conflict on phylogenetic resolution. Results Overall, our study provides strong support for the monophyly of the subfamily Lissodelphininae and the polyphyly of the genus Lagenorhynchus. In addition, the simultaneous parsimony analysis resolved and/or improved resolution for 12 nodes including: (1) L. albirostris, L. acutus; (2) L. obscurus and L. obliquidens; and (3) L. cruciger and L. australis. In addition, the Bayesian analysis supported the monophyly of the Cephalorhynchus, and resolved ambiguities regarding the relationship of L. australis/L. cruciger to other members of the genus Lagenorhynchus. The frequency of highly consistent characters varied among data partitions, but the rate of evolution was consistent within data partitions. Although the control region was the greatest source of character conflict, removal of this data partition impeded phylogenetic resolution. Conclusion The simultaneous analysis approach produced a more robust phylogenetic hypothesis for Lagenorhynchus than previous studies, thus supporting a phylogenetic approach employing multiple data partitions that vary in overall rate of evolution. 
Even in cases where there was apparent conflict among characters, our data suggest a synergistic interaction in the simultaneous analysis, and speak against a priori exclusion of data because of potential conflicts, primarily because phylogenetic results can be less robust. For example, the removal of the control region, the putative source of character conflict, produced spurious results with inconsistencies among and within topologies from parsimony and Bayesian analyses. PMID:17078887
Post-Boreotropical dispersals explain the pantropical disjunction in Paederia (Rubiaceae)
Nie, Ze-Long; Deng, Tao; Meng, Ying; Sun, Hang; Wen, Jun
2013-01-01
Background and Aims Pantropical intercontinental disjunction is a common biogeographical pattern in flowering plants exhibiting a discontinuous distribution primarily in tropical Asia, Africa and the Americas. Only a few plant groups with this pattern have been investigated at the generic level with molecular phylogenetic and biogeographical methods. Paederia (Rubiaceae) is a pantropical genus of 31 species of woody lianas, with the greatest species diversity in continental Asia and Madagascar and only two species from tropical America. The aim of this study was to reconstruct the biogeographical history of Paederia based on phylogenetic analyses to explore how the genus attained its pantropical distribution. Methods Maximum parsimony and Bayesian inference were used for phylogenetic analyses using sequences of five plastid markers (the rbcL gene, rps16 intron, trnT-F region, atpB-rbcL spacer and psbA-trnH spacer). Biogeographical inferences were based on a Bayesian uncorrelated lognormal relaxed molecular clock together with both Bayesian and likelihood ancestral area reconstructions. Key Results The data suggest an early diverged Asian lineage sister to the clade of the remaining species consisting of a predominantly Asian sub-clade and a primarily Malagasy sub-clade. Paederia is inferred to have originated in the Oligocene in tropical continental Asia. It then reached Africa in the early to middle Miocene, most probably via long-distance dispersal across the Indian Ocean. The two Neotropical species are inferred to have derived independently in the late Miocene from ancestors of Asia and East Africa, respectively. Conclusions The results demonstrate the importance of post-Boreotropical long-distance dispersals (across three major oceans) in shaping the global pantropical disjunction in some plants, such as Paederia, with small, winged diaspores adapted to long-distance dispersal by various agents including wind, ocean currents or birds. 
Overland migration is less likely to explain its palaeotropical disjunction between Asia and Africa. PMID:23478944
Coalescent methods for estimating phylogenetic trees.
Liu, Liang; Yu, Lili; Kubatko, Laura; Pearl, Dennis K; Edwards, Scott V
2009-10-01
We review recent models to estimate phylogenetic trees under the multispecies coalescent. Although the distinction between gene trees and species trees has come to the fore of phylogenetics, only recently have methods been developed that explicitly estimate species trees. Of the several factors that can cause gene tree heterogeneity and discordance with the species tree, deep coalescence due to random genetic drift in branches of the species tree has been modeled most thoroughly. Bayesian approaches to estimating species trees utilize two likelihood functions, one of which has been widely used in traditional phylogenetics and involves the model of nucleotide substitution, and the second of which is less familiar to phylogeneticists and involves the probability distribution of gene trees given a species tree. Other recent parametric and nonparametric methods for estimating species trees involve parsimony criteria, summary statistics, supertree and consensus methods. Species tree approaches are an appropriate goal for systematics, appear to work well in some cases where concatenation can be misleading, and suggest that sampling many independent loci will be paramount. Such methods can also be challenging to implement because of the complexity of the models and computational time. In addition, further elaboration of the simplest of coalescent models will be required to incorporate commonly known issues such as deviation from the molecular clock, gene flow and other genetic forces.
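The "probability distribution of gene trees given a species tree" mentioned in the abstract above has a simple closed form in the smallest case, which may help make the idea concrete. The sketch below assumes a three-taxon species tree ((A,B),C), one sampled lineage per species, and an internal branch of length `t` in coalescent units. Under the standard multispecies coalescent, each discordant topology then has probability exp(-t)/3. The function name is a hypothetical choice for illustration.

```python
import math


def gene_tree_probs(t):
    """Gene tree topology probabilities for species tree ((A,B),C).

    t is the internal branch length in coalescent units. The A and B lineages
    fail to coalesce on that branch with probability exp(-t); if so, the three
    possible topologies are equally likely in the ancestral population.
    """
    if t < 0:
        raise ValueError("branch length must be non-negative")
    p_discordant = math.exp(-t) / 3.0
    return {
        "((A,B),C)": 1.0 - 2.0 * p_discordant,  # matches the species tree
        "((A,C),B)": p_discordant,
        "((B,C),A)": p_discordant,
    }


for t in (0.0, 0.5, 2.0):
    print(t, gene_tree_probs(t))
```

Short internal branches give probabilities close to one third for every topology, which is the deep-coalescence discordance the review describes. Long branches make the concordant gene tree nearly certain.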
Inda, Luis A.; Pimentel, Manuel; Chase, Mark W.
2012-01-01
Background and aims Tribe Orchideae (Orchidaceae: Orchidoideae) comprises around 62 mostly terrestrial genera, which are well represented in the Northern Temperate Zone and less frequently in tropical areas of both the Old and New Worlds. Phylogenetic relationships within this tribe have been studied previously using only nuclear ribosomal DNA (nuclear ribosomal internal transcribed spacer, nrITS). However, different parts of the phylogenetic tree in these analyses were weakly supported, and integrating information from different plant genomes is clearly necessary in orchids, where reticulate evolution events are putatively common. The aims of this study were to: (1) obtain a well-supported and dated phylogenetic hypothesis for tribe Orchideae, (2) assess the appropriateness of nomenclatural changes made in this tribe in the last decade, (3) detect possible examples of reticulate evolution and (4) analyse in a temporal context evolutionary trends for subtribe Orchidinae with special emphasis on pollination systems. Methods The analyses included 118 samples, belonging to 103 species and 25 genera, for three DNA regions (nrITS, mitochondrial cox1 intron and plastid rpl16 intron). Bayesian and maximum-parsimony methods were used to construct a well-supported and dated tree. Evolutionary trends in the subtribe were analysed using Bayesian and maximum-likelihood methods of character evolution. Key Results The dated phylogenetic tree strongly supported the recently recircumscribed generic concepts of Bateman and collaborators. Moreover, it was found that Orchidinae have diversified in the Mediterranean basin during the last 15 million years, and one potential example of reticulate evolution in the subtribe was identified. In Orchidinae, pollination systems have shifted on numerous occasions during the last 23 million years. 
Conclusions The results indicate that ancestral Orchidinae were hymenopteran-pollinated, food-deceptive plants and that these traits have been dominant throughout the evolutionary history of the subtribe in the Mediterranean. Evidence was also obtained that the onset of sexual deception might be linked to an increase in labellum size, and the possibility is discussed that diversification in Orchidinae developed in parallel with diversification of bees and wasps from the Miocene onwards. PMID:22539542
Subbotin, Sergei A; Ragsdale, Erik J; Mullens, Teresa; Roberts, Philip A; Mundo-Ocampo, Manuel; Baldwin, James G
2008-08-01
The root lesion nematodes of the genus Pratylenchus Filipjev, 1936 are migratory endoparasites of plant roots, considered among the most widespread and important nematode parasites in a variety of crops. We obtained gene sequences from the D2 and D3 expansion segments of 28S rRNA partial and 18S rRNA from 31 populations belonging to 11 valid and two unidentified species of root lesion nematodes and five outgroup taxa. These datasets were analyzed using maximum parsimony and Bayesian inference. The alignments were generated using the secondary structure models for these molecules and analyzed with Bayesian inference under the standard models and the complex model, considering helices under the doublet model and loops and bulges under the general time reversible model. The phylogenetic informativeness of morphological characters is tested by reconstruction of their histories on rRNA-based trees using parallel parsimony and Bayesian approaches. Phylogenetic and sequence analyses of the 28S D2-D3 dataset with 145 accessions for 28 species and 18S dataset with 68 accessions for 15 species confirmed among large numbers of geographically diverse isolates that most classical morphospecies are monophyletic. Phylogenetic analyses revealed at least six distinct major clades of examined Pratylenchus species and these clades are generally congruent with those defined by characters derived from lip patterns, numbers of lip annules, and spermatheca shape. Morphological results suggest the need for sophisticated character discovery and analysis for morphology-based phylogenetics in nematodes.
A phylogenetic study of Laeliinae (Orchidaceae) based on combined nuclear and plastid DNA sequences
van den Berg, Cássio; Higgins, Wesley E.; Dressler, Robert L.; Whitten, W. Mark; Soto-Arenas, Miguel A.; Chase, Mark W.
2009-01-01
Background and Aims Laeliinae are a neotropical orchid subtribe with approx. 1500 species in 50 genera. In this study, an attempt is made to assess generic alliances based on molecular phylogenetic analysis of DNA sequence data. Methods Six DNA datasets were gathered: plastid trnL intron, trnL-F spacer, matK gene and trnK introns upstream and downstream from matK and nuclear ITS rDNA. Data were analysed with maximum parsimony (MP) and Bayesian analysis with mixed models (BA). Key Results Although relationships between Laeliinae and outgroups are well supported, within the subtribe sequence variation is low considering the broad taxonomic range covered. Localized incongruence between the ITS and plastid trees was found. A combined tree followed the ITS trees more closely, but the levels of support obtained with MP were low. The Bayesian analysis recovered more well-supported nodes. The trees from combined MP and BA allowed eight generic alliances to be recognized within Laeliinae, all of which show trends in morphological characters but lack unambiguous synapomorphies. Conclusions By using combined plastid and nuclear DNA data in conjunction with mixed-models Bayesian inference, it is possible to delimit smaller groups within Laeliinae and discuss general patterns of pollination and hybridization compatibility. Furthermore, these small groups can now be used for further detailed studies to explain morphological evolution and diversification patterns within the subtribe. PMID:19423551
Takamiya, Tomoko; Wongsawad, Pheravut; Sathapattayanon, Apirada; Tajima, Natsuko; Suzuki, Shunichiro; Kitamura, Saki; Shioda, Nao; Handa, Takashi; Kitanaka, Susumu; Iijima, Hiroshi; Yukawa, Tomohisa
2014-01-01
It is always difficult to construct coherent classification systems for plant lineages having diverse morphological characters. The genus Dendrobium, one of the largest genera in the Orchidaceae, includes ∼1100 species, and enormous morphological diversification has hindered the establishment of consistent classification systems covering all major groups of this genus. Given the particular importance of species in Dendrobium section Dendrobium and allied groups as floriculture and crude drug genetic resources, there is an urgent need to establish a stable classification system. To clarify phylogenetic relationships in Dendrobium section Dendrobium and allied groups, we analysed the macromolecular characters of the group. Phylogenetic analyses of 210 taxa of Dendrobium were conducted on DNA sequences of internal transcribed spacer (ITS) regions of 18S–26S nuclear ribosomal DNA and the maturase-coding gene (matK) located in an intron of the plastid gene trnK using maximum parsimony and Bayesian methods. The parsimony and Bayesian analyses revealed 13 distinct clades in the group comprising section Dendrobium and its allied groups. Results also showed paraphyly or polyphyly of sections Amblyanthus, Aporum, Breviflores, Calcarifera, Crumenata, Dendrobium, Densiflora, Distichophyllae, Dolichocentrum, Holochrysa, Oxyglossum and Pedilonum. On the other hand, the monophyly of section Stachyobium was well supported. It was found that many of the morphological characters that have been believed to reflect phylogenetic relationships are, in fact, the result of convergence. As such, many of the sections that have been recognized up to this point were found to not be monophyletic, so recircumscription of sections is required. PMID:25107672
Callejón, Rocío; Robles, María Del Rosario; Panei, Carlos Javier; Cutillas, Cristina
2016-08-01
A molecular phylogenetic hypothesis is presented for the genus Trichuris based on sequence data from mitochondrial cytochrome c oxidase 1 (cox1) and cytochrome b (cob). The taxa consisted of nine populations of whipworm from five species of Sigmodontinae rodents from Argentina. Bayesian Inference, Maximum Parsimony, and Maximum Likelihood methods were used to infer phylogenies for each gene separately but also for the combined mitochondrial data and the combined mitochondrial and nuclear dataset. Phylogenetic results based on cox1 and cob mitochondrial DNA (mtDNA) revealed three strongly resolved clades corresponding to three different species (Trichuris navonae, Trichuris bainae, and Trichuris pardinasi) showing phylogeographic variation, but relationships among Trichuris species were poorly resolved. Phylogenetic reconstruction based on concatenated sequences had greater phylogenetic resolution for delimiting species and intraspecific populations of Trichuris than those based on partitioned genes. Thus, populations of T. bainae and T. pardinasi could be affected by geographical factors and host-parasite co-divergence.
Feldman, Sanford H; Ntenda, Abraham M
2011-01-01
We used high-fidelity PCR to amplify 2 overlapping regions of the ribosomal gene complex from the rodent fur mite Myobia musculi. The amplicons encompassed a large portion of the mite's ribosomal gene complex spanning 3128 nucleotides containing the entire 18S rRNA, internal transcribed spacer (ITS) 1, 5.8S rRNA, ITS2, and a portion of the 5′-end of the 28S rRNA. M. musculi’s 179-nucleotide 5.8S rRNA nucleotide sequence was not conserved, so this region was identified by conservation of rRNA secondary structure. Maximum likelihood and Bayesian inference phylogenetic analyses were performed by using multiple sequence alignment consisting of 1524 nucleotides of M. musculi 18S rRNA and homologous sequences from 42 prostigmatid mites and the tick Dermacentor andersoni. The phylograms produced by both methods were in agreement regarding terminal, secondary, and some tertiary phylogenetic relationships among mites. Bayesian inference discriminated most infraordinal relationships between Eleutherengona and Parasitengona mites in the suborder Anystina. Basal relationships between suborders Anystina and Eupodina historically determined by comparing differences in anatomic characteristics were less well-supported by our molecular analysis. Our results recapitulated similar 18S rRNA sequence analyses recently reported. Our study supports M. musculi as belonging to the suborder Anystina, infraorder Eleutherengona, and superfamily Cheyletoidea. PMID:22330574
Morcillo, Felipe; Ornelas-García, Claudia Patricia; Alcaraz, Lourdes; Matamoros, Wilfredo A; Doadrio, Ignacio
2016-01-01
Freshwater fishes of Profundulidae, which until now was composed of two subgenera, represent one of the few extant fish families endemic to Mesoamerica. In this study we investigated the phylogenetic relationships and evolutionary history of the eight recognized extant species (from 37 populations) of Profundulidae using three mitochondrial and one nuclear gene markers (∼2.9 Kbp). We applied a Bayesian species delimitation method as a first approach to resolving speciation patterns within Profundulidae considering two different scenarios, eight-species and twelve-species models, obtained in a previous phylogenetic analysis. Based on our results, each of the two subgenera was resolved as monophyletic, with a remarkable molecular divergence of 24.5% for mtDNA and 7.8% for nDNA uncorrected p distances, and thus we propose that they correspond to separate genera. Moreover, we propose a conservative taxonomic hypothesis with five species within Profundulus and three within Tlaloc, although both eight-species and twelve-species models were highly supported by the Bayesian species delimitation analysis, providing additional evidence of higher taxonomic diversity than currently recognized in this family. According to our divergence time estimates, the family originated during the Upper Oligocene 26 Mya, and Profundulus and Tlaloc diverged in the Upper Oligocene or Lower Miocene about 20 Mya. Copyright © 2015 Elsevier Inc. All rights reserved.
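The "uncorrected p distance" reported in the abstract above is the proportion of aligned sites at which two sequences differ. A minimal sketch follows, assuming pre-aligned sequences of equal length and pairwise deletion of any site where either sequence has a gap or ambiguity code. That handling of gaps is an assumption made here, not something the abstract specifies. The function name and the toy sequences are hypothetical.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected p distance between two aligned nucleotide sequences.

    Sites where either sequence is not A, C, G or T (gaps, N, other
    ambiguity codes) are skipped; this is an assumed convention.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


print(p_distance("ACGTACGT", "ACGTACGA"))  # one difference in eight sites: 0.125
```

Because no substitution model is applied, the p distance underestimates the true number of changes when divergence is high and multiple substitutions have hit the same site.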
High endemism at cave entrances: a case study of spiders of the genus Uthina
Yao, Zhiyuan; Dong, Tingting; Zheng, Guo; Fu, Jinzhong; Li, Shuqiang
2016-01-01
Endemism, which is typically high on islands and in caves, has rarely been studied in the cave entrance ecotone. We investigated the endemism of the spider genus Uthina at cave entrances. In total, 212 spiders were sampled from 46 localities, from Seychelles across Southeast Asia to Fiji. They mostly occur at cave entrances but occasionally appear in various epigean environments. Phylogenetic analysis of DNA sequence data from COI and 28S genes suggested that Uthina was grouped into 13 well-supported clades. We used three methods, the Bayesian Poisson Tree Processes (bPTP) model, the Bayesian Phylogenetics and Phylogeography (BPP) method, and the general mixed Yule coalescent (GMYC) model, to investigate species boundaries. Both bPTP and BPP identified the 13 clades as 13 separate species, while GMYC identified 19 species. Furthermore, our results revealed high endemism at cave entrances. Of the 13 provisional species, twelve (one known and eleven new) are endemic to one or a cluster of caves, and all of them occurred only at cave entrances except for one population of one species. The only widely distributed species, U. luzonica, mostly occurred in epigean environments while three populations were found at cave entrances. Additionally, eleven new species of the genus are described. PMID:27775081
Does History Repeat Itself? Wavelets and the Phylodynamics of Influenza A
Tom, Jennifer A.; Sinsheimer, Janet S.; Suchard, Marc A.
2012-01-01
Unprecedented global surveillance of viruses will result in massive sequence data sets that require new statistical methods. These data sets press the limits of Bayesian phylogenetics as the high-dimensional parameters that comprise a phylogenetic tree increase the already sizable computational burden of these techniques. This burden often results in partitioning the data set, for example, by gene, and inferring the evolutionary dynamics of each partition independently, a compromise that results in stratified analyses that depend only on data within a given partition. However, parameter estimates inferred from these stratified models are likely strongly correlated, considering they rely on data from a single data set. To overcome this shortfall, we exploit the existing Monte Carlo realizations from stratified Bayesian analyses to efficiently estimate a nonparametric hierarchical wavelet-based model and learn about the time-varying parameters of effective population size that reflect levels of genetic diversity across all partitions simultaneously. Our methods are applied to complete genome influenza A sequences that span 13 years. We find that broad peaks and trends, as opposed to seasonal spikes, in the effective population size history distinguish individual segments from the complete genome. We also address hypotheses regarding intersegment dynamics within a formal statistical framework that accounts for correlation between segment-specific parameters. PMID:22160768
Dembo, Mana; Radovčić, Davorka; Garvin, Heather M; Laird, Myra F; Schroeder, Lauren; Scott, Jill E; Brophy, Juliet; Ackermann, Rebecca R; Musiba, Chares M; de Ruiter, Darryl J; Mooers, Arne Ø; Collard, Mark
2016-08-01
Homo naledi is a recently discovered species of fossil hominin from South Africa. A considerable amount is already known about H. naledi but some important questions remain unanswered. Here we report a study that addressed two of them: "Where does H. naledi fit in the hominin evolutionary tree?" and "How old is it?" We used a large supermatrix of craniodental characters for both early and late hominin species and Bayesian phylogenetic techniques to carry out three analyses. First, we performed a dated Bayesian analysis to generate estimates of the evolutionary relationships of fossil hominins including H. naledi. Then we employed Bayes factor tests to compare the strength of support for hypotheses about the relationships of H. naledi suggested by the best-estimate trees. Lastly, we carried out a resampling analysis to assess the accuracy of the age estimate for H. naledi yielded by the dated Bayesian analysis. The analyses strongly supported the hypothesis that H. naledi forms a clade with the other Homo species and Australopithecus sediba. The analyses were more ambiguous regarding the position of H. naledi within the (Homo, Au. sediba) clade. A number of hypotheses were rejected, but several others were not. Based on the available craniodental data, Homo antecessor, Asian Homo erectus, Homo habilis, Homo floresiensis, Homo sapiens, and Au. sediba could all be the sister taxon of H. naledi. According to the dated Bayesian analysis, the most likely age for H. naledi is 912 ka. This age estimate was supported by the resampling analysis. Our findings have a number of implications. Most notably, they support the assignment of the new specimens to Homo, cast doubt on the claim that H. naledi is simply a variant of H. erectus, and suggest H. naledi is younger than has been previously proposed. Copyright © 2016 Elsevier Ltd. All rights reserved.
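The Bayes factor tests mentioned in the abstract above compare two hypotheses through their marginal likelihoods. The sketch below shows the arithmetic only, assuming log marginal likelihoods have already been estimated by some other means (for example, stepping-stone sampling in a phylogenetics package). The numeric values are hypothetical and are not taken from the study. The interpretation bands follow the commonly used Kass and Raftery (1995) scale.

```python
def two_ln_bayes_factor(log_ml_1, log_ml_0):
    """Return 2 ln(BF_10) from the log marginal likelihoods of models 1 and 0."""
    return 2.0 * (log_ml_1 - log_ml_0)


def kass_raftery_label(two_ln_bf):
    """Verbal strength of evidence for model 1 over model 0 (Kass and Raftery scale)."""
    if two_ln_bf < 0:
        return "evidence favours model 0"
    if two_ln_bf <= 2:
        return "not worth more than a bare mention"
    if two_ln_bf <= 6:
        return "positive"
    if two_ln_bf <= 10:
        return "strong"
    return "very strong"


# Hypothetical log marginal likelihoods for two competing topological constraints.
stat = two_ln_bayes_factor(log_ml_1=-1234.5, log_ml_0=-1240.0)
print(stat, kass_raftery_label(stat))
```

Working on the log scale avoids numerical underflow, since marginal likelihoods for real character matrices are far too small to represent directly.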
Huang, Yi-Fei; Golding, G Brian
2015-02-15
A number of statistical phylogenetic methods have been developed to infer conserved functional sites or regions in proteins. Many methods, e.g. Rate4Site, apply the standard phylogenetic models to infer site-specific substitution rates and totally ignore the spatial correlation of substitution rates in protein tertiary structures, which may reduce their power to identify conserved functional patches in protein tertiary structures when the sequences used in the analysis are highly similar. The 3D sliding window method has been proposed to infer conserved functional patches in protein tertiary structures, but the window size, which reflects the strength of the spatial correlation, must be predefined and is not inferred from data. We recently developed GP4Rate to solve these problems under the Bayesian framework. Unfortunately, GP4Rate is computationally slow. Here, we present an intuitive web server, FuncPatch, to perform a fast approximate Bayesian inference of conserved functional patches in protein tertiary structures. Both simulations and four case studies based on empirical data suggest that FuncPatch is a good approximation to GP4Rate. However, FuncPatch is orders of magnitude faster than GP4Rate. In addition, simulations suggest that FuncPatch is potentially a useful tool complementary to Rate4Site, but the 3D sliding window method is less powerful than FuncPatch and Rate4Site. The functional patches predicted by FuncPatch in the four case studies are supported by experimental evidence, which corroborates the usefulness of FuncPatch. The software FuncPatch is freely available at http://info.mcmaster.ca/yifei/FuncPatch. Contact: golding@mcmaster.ca. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Duchêne, Sebastián; Geoghegan, Jemma L; Holmes, Edward C; Ho, Simon Y W
2016-11-15
In rapidly evolving pathogens, including viruses and some bacteria, genetic change can accumulate over short time-frames. Accordingly, their sampling times can be used to calibrate molecular clocks, allowing estimation of evolutionary rates. Methods for estimating rates from time-structured data vary in how they treat phylogenetic uncertainty and rate variation among lineages. We compiled 81 virus data sets and estimated nucleotide substitution rates using root-to-tip regression, least-squares dating and Bayesian inference. Although estimates from these three methods were often congruent, this largely relied on the choice of clock model. In particular, relaxed-clock models tended to produce higher rate estimates than methods that assume constant rates. Discrepancies in rate estimates were also associated with high among-lineage rate variation, and phylogenetic and temporal clustering. These results provide insights into the factors that affect the reliability of rate estimates from time-structured sequence data, emphasizing the importance of clock-model testing. Contact: sduchene@unimelb.edu.au or garzonsebastian@hotmail.com. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
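Root-to-tip regression, the simplest of the three rate-estimation methods compared in this abstract, can be sketched as follows. This is a minimal illustration and not the authors' pipeline: it assumes a rooted tree has already been inferred, that each tip's distance from the root (substitutions per site) and sampling date (decimal years) are known, and that an ordinary least-squares line is fitted. The slope is read as the substitution rate and the x-intercept as the date of the root. The function name and toy data are hypothetical.

```python
def root_to_tip_regression(dates, distances):
    """Fit distance = rate * date + intercept by ordinary least squares.

    Returns (rate, root_date), where root_date is the x-intercept,
    i.e. the date at which the fitted root-to-tip distance is zero.
    """
    n = len(dates)
    if n < 2 or n != len(distances):
        raise ValueError("need at least two paired observations")
    mean_t = sum(dates) / n
    mean_d = sum(distances) / n
    sxx = sum((t - mean_t) ** 2 for t in dates)
    if sxx == 0:
        raise ValueError("all sampling dates are identical")
    sxy = sum((t - mean_t) * (d - mean_d) for t, d in zip(dates, distances))
    rate = sxy / sxx
    intercept = mean_d - rate * mean_t
    root_date = -intercept / rate if rate != 0 else float("nan")
    return rate, root_date


# Toy, perfectly clock-like data (hypothetical values).
dates = [2000.0, 2005.0, 2010.0, 2015.0]
distances = [0.010, 0.015, 0.020, 0.025]
rate, root_date = root_to_tip_regression(dates, distances)
print(rate, root_date)
```

Because the regression treats tips as independent and assumes a single rate, it does not account for the phylogenetic uncertainty or among-lineage rate variation that the abstract identifies as sources of discrepancy.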
Gao, Xiang; Lin, Huaiying; Revanna, Kashi; Dong, Qunfeng
2017-05-10
Species-level classification for 16S rRNA gene sequences remains a serious challenge for microbiome researchers, because existing taxonomic classification tools for 16S rRNA gene sequences either do not provide species-level classification, or their classification results are unreliable. The unreliable results are due to the limitations in the existing methods which either lack solid probabilistic-based criteria to evaluate the confidence of their taxonomic assignments, or use nucleotide k-mer frequency as the proxy for sequence similarity measurement. We have developed a method that shows significantly improved species-level classification results over existing methods. Our method calculates true sequence similarity between query sequences and database hits using pairwise sequence alignment. Taxonomic classifications are assigned from the species to the phylum levels based on the lowest common ancestors of multiple database hits for each query sequence, and further classification reliabilities are evaluated by bootstrap confidence scores. The novelty of our method is that the contribution of each database hit to the taxonomic assignment of the query sequence is weighted by a Bayesian posterior probability based upon the degree of sequence similarity of the database hit to the query sequence. Our method does not need any training datasets specific for different taxonomic groups. Instead only a reference database is required for aligning to the query sequences, making our method easily applicable for different regions of the 16S rRNA gene or other phylogenetic marker genes. Reliable species-level classification for 16S rRNA or other phylogenetic marker genes is critical for microbiome research. Our software shows significantly higher classification accuracy than the existing tools and we provide probabilistic-based confidence scores to evaluate the reliability of our taxonomic classification assignments based on multiple database matches to query sequences. 
Despite its higher computational costs, our method is still suitable for analyzing large-scale microbiome datasets for practical purposes. Furthermore, our method can be applied for taxonomic classification of any phylogenetic marker gene sequences. Our software, called BLCA, is freely available at https://github.com/qunfengdong/BLCA .
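The idea of weighting each database hit's contribution to a taxonomic assignment can be illustrated with a deliberately simplified sketch. It is not the BLCA implementation: it assumes a uniform prior over hits and uses each hit's pairwise similarity score directly as an unnormalised likelihood, so the normalised weights act as a stand-in for posterior probabilities. The function name, the scoring rule and the example hits are all hypothetical.

```python
from collections import defaultdict


def weighted_assignment(hits):
    """Pick the taxon with the largest summed normalised weight.

    hits: list of (taxon_name, similarity) pairs, similarity > 0.
    Weights are similarities normalised to sum to one (an assumption
    standing in for a posterior probability per hit).
    """
    total = sum(sim for _, sim in hits)
    if total <= 0:
        raise ValueError("similarities must sum to a positive value")
    support = defaultdict(float)
    for taxon, sim in hits:
        support[taxon] += sim / total
    best = max(support, key=support.get)
    return best, support[best]


# Hypothetical hits for one query sequence at the species level.
hits = [
    ("Escherichia coli", 0.99),
    ("Escherichia coli", 0.98),
    ("Shigella flexneri", 0.97),
]
best, score = weighted_assignment(hits)
print(best, round(score, 3))
```

The same tally could be repeated at each rank from species up to phylum; the bootstrap confidence scores described in the abstract are not reproduced here.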
Phylogenetic relationships, diversification and expansion of chili peppers (Capsicum, Solanaceae)
Carrizo García, Carolina; Barfuss, Michael H. J.; Sehr, Eva M.; Barboza, Gloria E.; Samuel, Rosabelle; Moscone, Eduardo A.; Ehrendorfer, Friedrich
2016-01-01
Background and Aims Capsicum (Solanaceae), native to the tropical and temperate Americas, comprises the well-known sweet and hot chili peppers and several wild species. So far, only partial taxonomic and phylogenetic analyses have been done for the genus. Here, the phylogenetic relationships between nearly all taxa of Capsicum were explored to test the monophyly of the genus and to obtain a better knowledge of species relationships, diversification and expansion. Methods Thirty-four of approximately 35 Capsicum species were sampled. Maximum parsimony and Bayesian inference analyses were performed using two plastid markers (matK and psbA-trnH) and one single-copy nuclear gene (waxy). The evolutionary changes of nine key features were reconstructed following the parsimony ancestral states method. Ancestral areas were reconstructed through a Bayesian Markov chain Monte Carlo analysis. Key Results Capsicum forms a monophyletic clade, with Lycianthes as a sister group, following both phylogenetic approaches. Eleven well-supported clades (four of them monotypic) can be recognized within Capsicum, although some interspecific relationships need further analysis. A few features are useful to characterize different clades (e.g. fruit anatomy, chromosome base number), whereas some others are highly homoplastic (e.g. seed colour). The origin of Capsicum is postulated in an area along the Andes of western to north-western South America. The expansion of the genus has followed a clockwise direction around the Amazon basin, towards central and south-eastern Brazil, then back to western South America, and finally northwards to Central America. Conclusions New insights are provided regarding interspecific relationships, character evolution, and geographical origin and expansion of Capsicum. A clearly distinct early-diverging clade can be distinguished, centred in western–north-western South America. Subsequent rapid speciation has led to the origin of the remaining clades. 
The diversification of Capsicum has culminated in the origin of the main cultivated species in several regions of South to Central America. PMID:27245634
2010-01-01
Background Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. Results This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Conclusions Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service. PMID:21034504
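The two uncertainty measures described in this abstract, the posterior probability of a placement on an edge and the expected distance between placement locations, can be sketched as below. This is an illustration under stated assumptions, not pplacer's code: it assumes a log marginal likelihood is already available for each candidate edge, a uniform prior over edges, and a precomputed matrix of tree distances between candidate placement locations. All names and numbers are hypothetical.

```python
import math


def edge_posteriors(log_marginal_likelihoods):
    """Normalise per-edge log marginal likelihoods into posteriors.

    Assumes a uniform prior over edges; subtracting the maximum keeps
    the exponentials numerically stable.
    """
    m = max(log_marginal_likelihoods)
    weights = [math.exp(x - m) for x in log_marginal_likelihoods]
    total = sum(weights)
    return [w / total for w in weights]


def expected_distance(posteriors, dist):
    """Sum over all pairs (i, j) of p_i * p_j * d(i, j).

    dist[i][j] is the tree distance between placement locations i and j.
    """
    n = len(posteriors)
    return sum(
        posteriors[i] * posteriors[j] * dist[i][j]
        for i in range(n)
        for j in range(n)
    )


# Hypothetical query with three candidate edges.
post = edge_posteriors([-100.0, -100.0, -110.0])
dist = [
    [0.0, 0.1, 0.5],
    [0.1, 0.0, 0.4],
    [0.5, 0.4, 0.0],
]
print(post, expected_distance(post, dist))
```

A query split between two neighbouring edges gives a small expected distance even when no single edge has high posterior probability, which is why the second measure is informative on well-sampled reference trees.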
The relationships of the Euparkeriidae and the rise of Archosauria
NASA Astrophysics Data System (ADS)
Sookias, Roland B.
2016-03-01
For the first time, a phylogenetic analysis including all putative euparkeriid taxa is conducted, using a large data matrix analysed with maximum parsimony and Bayesian analysis. Using parsimony, the putative euparkeriid Dorosuchus neoetus from Russia is the sister taxon to Archosauria + Phytosauria. Euparkeria capensis is placed one node further from the crown, and forms a euparkeriid clade with the Chinese taxa Halazhaisuchus qiaoensis and ‘Turfanosuchus shageduensis’ and the Polish taxon Osmolskina czatkowicensis. Using Bayesian methods, Osmolskina and Halazhaisuchus are sister taxa within Euparkeriidae, in turn sister to ‘Turfanosuchus shageduensis’ and then Euparkeria capensis. Dorosuchus is placed in a polytomy with Euparkeriidae and Archosauria + Phytosauria. Although conclusions remain tentative owing to low node support and incompleteness, a broad phylogenetic position close to the base of Archosauria is confirmed for all putative euparkeriids, and the ancestor of Archosauria + Phytosauria is optimized as similar to euparkeriids in its morphology. Ecomorphological characters and traits are optimized onto the maximum parsimony strict consensus phylogeny presented using squared change parsimony. This optimization indicates that the ancestral archosaur was probably similar in many respects to euparkeriids, being relatively small, terrestrial, carnivorous and showing relatively cursorial limb morphology; this Bauplan may have underlain the exceptional radiation and success of crown Archosauria.
Rothfels, Carl J; Larsson, Anders; Kuo, Li-Yaung; Korall, Petra; Chiou, Wen-Liang; Pryer, Kathleen M
2012-05-01
Backbone relationships within the large eupolypod II clade, which includes nearly a third of extant fern species, have resisted elucidation by both molecular and morphological data. Earlier studies suggest that much of the phylogenetic intractability of this group is due to three factors: (i) a long root that reduces apparent levels of support in the ingroup; (ii) long ingroup branches subtended by a series of very short backbone internodes (the "ancient rapid radiation" model); and (iii) significantly heterogeneous lineage-specific rates of substitution. To resolve the eupolypod II phylogeny, with a particular emphasis on the backbone internodes, we assembled a data set of five plastid loci (atpA, atpB, matK, rbcL, and trnG-R) from a sample of 81 accessions selected to capture the deepest divergences in the clade. We then evaluated our phylogenetic hypothesis against potential confounding factors, including those induced by rooting, ancient rapid radiation, rate heterogeneity, and the Bayesian star-tree paradox artifact. While the strong support we inferred for the backbone relationships proved robust to these potential problems, their investigation revealed unexpected model-mediated impacts of outgroup composition, divergent effects of methods for countering the star-tree paradox artifact, and gave no support to concerns about the applicability of the unrooted model to data sets with heterogeneous lineage-specific rates of substitution. This study is among few to investigate these factors with empirical data, and the first to compare the performance of the two primary methods for overcoming the Bayesian star-tree paradox artifact. Among the significant phylogenetic results is the near-complete support along the eupolypod II backbone, the demonstrated paraphyly of Woodsiaceae as currently circumscribed, and the well-supported placement of the enigmatic genera Homalosorus, Diplaziopsis, and Woodsia.
BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC
Satija, Rahul; Novák, Ádám; Miklós, István; Lyngsø, Rune; Hein, Jotun
2009-01-01
Background We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden Markov model algorithms to analyze up to four sequences. Results We present a novel approach, implemented in the software package BigFoot, for performing phylogenetic footprinting on greater numbers of sequences. We have developed a Markov chain Monte Carlo (MCMC) approach which samples both sequence alignments and locations of slowly evolving regions. We implement our method as an extension of the existing StatAlign software package and test it on well-annotated regions controlling the expression of the even-skipped gene in Drosophila and the α-globin gene in vertebrates. The results exhibit how adding additional sequences to the analysis has the potential to improve the accuracy of functional predictions, and demonstrate how BigFoot outperforms existing alignment-based phylogenetic footprinting techniques. Conclusion BigFoot extends a combined alignment and phylogenetic footprinting approach to analyze larger amounts of sequence data using MCMC. Our approach is robust to alignment error and uncertainty and can be applied to a variety of biological datasets. The source code and documentation are publicly available for download from PMID:19715598
BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC.
Satija, Rahul; Novák, Adám; Miklós, István; Lyngsø, Rune; Hein, Jotun
2009-08-28
We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden Markov model algorithms to analyze up to four sequences. We present a novel approach, implemented in the software package BigFoot, for performing phylogenetic footprinting on greater numbers of sequences. We have developed a Markov chain Monte Carlo (MCMC) approach which samples both sequence alignments and locations of slowly evolving regions. We implement our method as an extension of the existing StatAlign software package and test it on well-annotated regions controlling the expression of the even-skipped gene in Drosophila and the alpha-globin gene in vertebrates. The results exhibit how adding additional sequences to the analysis has the potential to improve the accuracy of functional predictions, and demonstrate how BigFoot outperforms existing alignment-based phylogenetic footprinting techniques. BigFoot extends a combined alignment and phylogenetic footprinting approach to analyze larger amounts of sequence data using MCMC. Our approach is robust to alignment error and uncertainty and can be applied to a variety of biological datasets. The source code and documentation are publicly available for download from http://www.stats.ox.ac.uk/~satija/BigFoot/
Reconstructing the origin and elaboration of insect-trapping inflorescences in the Araceae
Bröderbauer, David; Diaz, Anita; Weber, Anton
2016-01-01
Premise of the study Floral traps are among the most sophisticated devices that have evolved in angiosperms in the context of pollination, but the evolution of trap pollination has not yet been studied in a phylogenetic context. We aim to determine the evolutionary history of morphological traits that facilitate trap pollination and to elucidate the impact of pollinators on the evolution of inflorescence traps in the family Araceae. Methods Inflorescence morphology was investigated to determine the presence of trapping devices and to classify functional types of traps. We inferred phylogenetic relationships in the family using maximum likelihood and Bayesian methods. Character evolution of trapping devices, trap types, and pollinator types was then assessed with maximum parsimony and Bayesian methods. We also tested for an association of trap pollination with specific pollinator types. Key results Inflorescence traps have evolved independently at least 10 times within the Araceae. Trapping devices were found in 27 genera. On the basis of different combinations of trapping devices, six functional types of traps were identified. Trap pollination in Araceae is correlated with pollination by flies. Conclusions Trap pollination in the Araceae is more common than was previously thought. Preadaptations such as papillate cells or elongated sterile flowers facilitated the evolution of inflorescence traps. In some clades, imperfect traps served as a precursor for the evolution of more elaborate traps. Traps that evolved in association with fly pollination were most probably derived from mutualistic ancestors, offering a brood-site to their pollinators. PMID:22965851
Chen, Zhi-Teng; Zhao, Meng-Yuan; Xu, Cheng; Du, Yu-Zhou
2018-05-01
The infraorder Systellognatha is the most species-rich clade in the insect order Plecoptera and includes six families in two superfamilies: Pteronarcyoidea (Pteronarcyidae, Peltoperlidae, and Styloperlidae) and Perloidea (Perlidae, Perlodidae, and Chloroperlidae). To resolve the debatable phylogeny of Systellognatha, we carried out the first mitochondrial phylogenetic analysis covering all the six families, including three newly sequenced mitogenomes from two families (Perlodidae and Peltoperlidae) and 15 published mitogenomes. The three newly reported mitogenomes share conserved mitogenomic features with other sequenced stoneflies. For phylogenetic analyses, we assembled five datasets with two inference methods to assess their influence on topology and nodal support within Systellognatha. The results indicated that inclusion of the third codon positions of PCGs, exclusion of rRNA genes, the use of nucleotide datasets and Bayesian inference could improve the phylogenetic reconstruction of Systellognatha. The monophyly of Perloidea was supported in the mitochondrial phylogeny, but Pteronarcyoidea was recovered as paraphyletic and remained controversial. In this mitochondrial phylogenetic study, the relationships within Systellognatha were recovered as (((Perlidae + (Perlodidae + Chloroperlidae)) + (Pteronarcyidae + Styloperlidae)) + Peltoperlidae). Copyright © 2018 Elsevier B.V. All rights reserved.
Ned B. Klopfenstein; Jane E. Stewart; Yuko Ota; John W. Hanna; Bryce A. Richardson; Amy L. Ross-Davis; Ruben D. Elias-Roman; Kari Korhonen; Nenad Keca; Eugenia Iturritxa; Dionicio Alvarado-Rosales; Halvor Solheim; Nicholas J. Brazee; Piotr Lakomy; Michelle R. Cleary; Eri Hasegawa; Taisei Kikuchi; Fortunato Garza-Ocanas; Panaghiotis Tsopelas; Daniel Rigling; Simone Prospero; Tetyana Tsykun; Jean A. Berube; Franck O. P. Stefani; Saeideh Jafarpour; Vladimir Antonin; Michal Tomsovsky; Geral I. McDonald; Stephen Woodward; Mee-Sook Kim
2017-01-01
Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence-based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation...
Schmidt-Lebuhn, Alexander N; Aitken, Nicola C; Chuah, Aaron
2017-11-01
Datasets of hundreds or thousands of SNPs (Single Nucleotide Polymorphisms) from multiple individuals per species are increasingly used to study population structure, species delimitation and shallow phylogenetics. The principal software tool to infer species or population trees from SNP data is currently the BEAST template SNAPP which uses a Bayesian coalescent analysis. However, it is computationally extremely demanding and tolerates only small amounts of missing data. We used simulated and empirical SNPs from plants (Australian Craspedia, Asteraceae, and Pelargonium, Geraniaceae) to compare species trees produced (1) by SNAPP, (2) using SVD quartets, and (3) using Bayesian and parsimony analysis with several different approaches to summarising data from multiple samples into one set of traits per species. Our aims were to explore the impact of tree topology and missing data on the results, and to test which data summarising and analyses approaches would best approximate the results obtained from SNAPP for empirical data. SVD quartets retrieved the correct topology from simulated data, as did SNAPP except in the case of a very unbalanced phylogeny. Both methods failed to retrieve the correct topology when large amounts of data were missing. Bayesian analysis of species level summary data scoring the two alleles of each SNP as independent characters and parsimony analysis of data scoring each SNP as one character produced trees with branch length distributions closest to the true trees on which SNPs were simulated. For empirical data, Bayesian inference and Dollo parsimony analysis of data scored allele-wise produced phylogenies most congruent with the results of SNAPP. 
In the case of study groups divergent enough for missing data to be phylogenetically informative (because of additional mutations preventing amplification of genomic fragments or bioinformatic establishment of homology), scoring of SNP data as a presence/absence matrix irrespective of allele content might be an additional option. As this depends on sampling across species being reasonably even and a random distribution of non-informative instances of missing data, however, further exploration of this approach is needed. Properly chosen data summary approaches to inferring species trees from SNP data may represent a potential alternative to currently available individual-level coalescent analyses especially for quick data exploration and when dealing with computationally demanding or patchy datasets. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Bayesian phylogenetic estimation of fossil ages.
Drummond, Alexei J; Stadler, Tanja
2016-07-19
Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. 
We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Authors.
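The accuracy and precision summaries quoted in this abstract can be made concrete with a small sketch. The definitions used here are assumptions, since the abstract does not spell them out: relative error is taken as the absolute difference between the posterior median age and the reference ("true") age, divided by the reference age, and RSD is taken as the posterior standard deviation divided by the posterior mean. The function names and sample values are hypothetical.

```python
from statistics import mean, median, stdev


def relative_error(posterior_ages, true_age):
    """|posterior median - true age| / true age (assumed definition)."""
    return abs(median(posterior_ages) - true_age) / true_age


def relative_sd(posterior_ages):
    """Sample standard deviation / mean of the posterior ages."""
    return stdev(posterior_ages) / mean(posterior_ages)


# Hypothetical posterior sample of one fossil's age (Myr) and a
# reference age taken from the mid-point of its stratigraphic range.
ages = [9.0, 10.0, 11.0]
err = relative_error(ages, 10.5)
rsd = relative_sd(ages)
print(round(err, 4), round(rsd, 4))
```

Repeating the calculation for each fossil in turn and taking the median across fossils gives dataset-level summaries of the kind reported above.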
Bayesian phylogenetic estimation of fossil ages
Drummond, Alexei J.; Stadler, Tanja
2016-01-01
Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth–death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the ‘morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. 
This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325827
Hsieh, Chia-Hung; Ko, Chiun-Cheng; Chung, Cheng-Han; Wang, Hurng-Yi
2014-07-01
The sweet potato whitefly, Bemisia tabaci, is a highly differentiated species complex. Despite consisting of several morphologically indistinguishable entities and frequent invasions on all continents with important associated economic losses, the phylogenetic relationships, species status, and evolutionary history of this species complex are still debated. We sequenced and analyzed one mitochondrial and three single-copy nuclear genes from 9 of the 12 genetic groups of B. tabaci and 5 closely related species. Bayesian species delimitation was applied to investigate the speciation events of B. tabaci. The species statuses of the different genetic groups were strongly supported under different prior settings and phylogenetic scenarios. Divergence histories were estimated by a multispecies coalescence approach implemented in *BEAST. Based on the mitochondrial locus, B. tabaci originated 6.47 million years ago (MYA); based on nuclear loci, however, the estimate was 1.25 MYA. According to the method of approximate Bayesian computation, this difference is probably due to different degrees of migration among loci; i.e., although the mitochondrial locus had differentiated, gene flow at nuclear loci was still possible, a scenario similar to a parapatric mode of speciation. This is the first study in whiteflies using multilocus data and incorporating Bayesian coalescence approaches, both of which provide a more biologically realistic framework for delimiting species status and delineating the divergence history of B. tabaci. Our study illustrates that gene flow during species divergence should not be overlooked and has a great impact on divergence time estimation. Copyright © 2014 Elsevier Inc. All rights reserved.
Lagos, Doris M; Voegtlin, David J; Coeur d'acier, Armelle; Giordano, Rosanna
2014-06-01
A phylogeny of the genus Aphis Linnaeus, 1758 was built primarily from specimens collected in the Midwest of the United States. A data matrix was constructed with 68 species and 41 morphological characters with respective character states of alate and apterous viviparous females. Dendrogram topologies of analyses performed using UPGMA (Unweighted Pair Group Method with Arithmetic Mean), Maximum Parsimony and Bayesian analysis of Cytochrome Oxidase I, Elongation Factor 1-α and primary endosymbiont Buchnera aphidicola 16S sequences were not congruent. Bayesian analysis strongly supported most terminal nodes of the phylogenetic trees. The phylogeny was strongly supported by EF1-α, and analysis of COI and EF1-α molecular data combined with morphological characters. It was not supported by single analysis of COI or Buchnera aphidicola 16S. Results from the Bayesian phylogeny show 4 main species groups: asclepiadis, fabae, gossypii, and middletonii. Results place Aphis and species of the genera Protaphis Börner, 1952, Toxoptera Koch, and Xerobion Nevsky, 1928 in a monophyletic clade. Morphological characters support this monophyly as well. The phylogeny shows that the monophyletic clade of the North American middletonii species group belong to the genus Protaphis: P. debilicornis (Gillette & Palmer, 1929), comb. nov., P. echinaceae (Lagos and Voegtlin, 2009), comb. nov., and P. middletonii (Thomas, 1879). The genus Toxoptera should be considered a subgenus of Aphis (stat. nov.). The analysis also indicates that the current genus Iowana Frison, 1954 should be considered a subgenus of Aphis (stat. nov.). © 2013 Institute of Zoology, Chinese Academy of Sciences.
Klopfenstein, Ned B; Stewart, Jane E; Ota, Yuko; Hanna, John W; Richardson, Bryce A; Ross-Davis, Amy L; Elías-Román, Rubén D; Korhonen, Kari; Keča, Nenad; Iturritxa, Eugenia; Alvarado-Rosales, Dionicio; Solheim, Halvor; Brazee, Nicholas J; Łakomy, Piotr; Cleary, Michelle R; Hasegawa, Eri; Kikuchi, Taisei; Garza-Ocañas, Fortunato; Tsopelas, Panaghiotis; Rigling, Daniel; Prospero, Simone; Tsykun, Tetyana; Bérubé, Jean A; Stefani, Franck O P; Jafarpour, Saeideh; Antonín, Vladimír; Tomšovský, Michal; McDonald, Geral I; Woodward, Stephen; Kim, Mee-Sook
2017-01-01
Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence-based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation elongation factor 1-α (tef1) sequences are highly informative for phylogenetic analysis of Armillaria species within diverse global regions. This study used Neighbor-net and coalescence-based Bayesian analyses to examine phylogenetic relationships of newly determined and existing tef1 sequences derived from diverse Armillaria species from across the Northern Hemisphere, with Southern Hemisphere Armillaria species included for reference. Based on the Bayesian analysis of tef1 sequences, Armillaria species from the Northern Hemisphere are generally contained within the following four superclades, which are named according to the specific epithet of the most frequently cited species within the superclade: (i) Socialis/Tabescens (exannulate) superclade including Eurasian A. ectypa, North American A. socialis (A. tabescens), and Eurasian A. socialis (A. tabescens) clades; (ii) Mellea superclade including undescribed annulate North American Armillaria sp. (Mexico) and four separate clades of A. mellea (Europe and Iran, eastern Asia, and two groups from North America); (iii) Gallica superclade including Armillaria Nag E (Japan), multiple clades of A. gallica (Asia and Europe), A. calvescens (eastern North America), A. cepistipes (North America), A. altimontana (western USA), A. nabsnona (North America and Japan), and at least two A. gallica clades (North America); and (iv) Solidipes/Ostoyae superclade including two A. solidipes/ostoyae clades (North America), A. gemina (eastern USA), A. solidipes/ostoyae (Eurasia), A. cepistipes (Europe and Japan), A. sinapina (North America and Japan), and A. borealis (Eurasia) clade 2. Of note is that A. borealis (Eurasia) clade 1 appears basal to the Solidipes/Ostoyae and Gallica superclades. The Neighbor-net analysis showed similar phylogenetic relationships. This study further demonstrates the utility of tef1 for global phylogenetic studies of Armillaria species and provides critical insights into multiple taxonomic issues that warrant further study.
The complete mitochondrial genome of Papilio glaucus and its phylogenetic implications.
Shen, Jinhui; Cong, Qian; Grishin, Nick V
2015-09-01
Due to the intriguing morphology, lifecycle, and diversity of butterflies and moths, Lepidoptera are emerging as model organisms for the study of genetics, evolution and speciation. The progress of these studies relies on decoding Lepidoptera genomes, both nuclear and mitochondrial. Here we describe a protocol to obtain mitogenomes from Next Generation Sequencing reads performed for whole-genome sequencing and report the complete mitogenome of Papilio (Pterourus) glaucus. The circular mitogenome is 15,306 bp in length and rich in A and T. It contains 13 protein-coding genes (PCGs), 22 transfer-RNA-coding genes (tRNA), and 2 ribosomal-RNA-coding genes (rRNA), with a gene order typical for mitogenomes of Lepidoptera. We performed phylogenetic analyses based on PCG and RNA-coding genes or protein sequences using Bayesian Inference and Maximum Likelihood methods. The phylogenetic trees consistently show that among species with available mitogenomes Papilio glaucus is the closest to Papilio (Agehana) maraho from Asia.
Dornburg, Alex; Friedman, Matt; Near, Thomas J
2015-08-01
Elopomorpha is one of the three main clades of living teleost fishes and includes a range of disparate lineages including eels, tarpons, bonefishes, and halosaurs. Elopomorphs were among the first groups of fishes investigated using Hennigian phylogenetic methods and continue to be the object of intense phylogenetic scrutiny due to their economic significance, diversity, and crucial evolutionary status as the sister group of all other teleosts. While portions of the phylogenetic backbone for Elopomorpha are consistent between studies, the relationships among Albula, Pterothrissus, Notacanthiformes, and Anguilliformes remain contentious and difficult to evaluate. This lack of phylogenetic resolution is problematic as fossil lineages are often described and placed taxonomically based on an assumed sister group relationship between Albula and Pterothrissus. In addition, phylogenetic studies using morphological data that sample elopomorph fossil lineages often do not include notacanthiform or anguilliform lineages, potentially introducing a bias toward interpreting fossils as members of the common stem of Pterothrissus and Albula. Here we provide a phylogenetic analysis of DNA sequences sampled from multiple nuclear genes that include representative taxa from Albula, Pterothrissus, Notacanthiformes and Anguilliformes. We integrate our molecular dataset with a morphological character matrix that spans both living and fossil elopomorph lineages. Our results reveal substantial uncertainty in the placement of Pterothrissus as well as all sampled fossil lineages, questioning the stability of the taxonomy of fossil Elopomorpha. However, despite topological uncertainty, our integration of fossil lineages into a Bayesian time calibrated framework provides divergence time estimates for the clade that are consistent with previously published age estimates based on the elopomorph fossil record and molecular estimates resulting from traditional node-dating methods. 
Copyright © 2015 Elsevier Inc. All rights reserved.
Molecular phylogenetics reveals convergent evolution in lower Congo River spiny eels.
Alter, S Elizabeth; Brown, Bianca; Stiassny, Melanie L J
2015-10-15
The lower Congo River (LCR) is a region of exceptional species diversity and endemism in the Congo basin, including numerous species of spiny eels (genus Mastacembelus). Four of these exhibit distinctive phenotypes characterized by greatly reduced optic globes deeply embedded into the head (cryptophthalmia) and reduced (or absent) melanin pigmentation, among other characteristics. A strikingly similar cryptophthalmic phenotype is also found in members of a number of unrelated fish families, strongly suggesting the possibility of convergent evolution. However, little is known about the evolutionary processes that shaped diversification in LCR Mastacembelus, their biogeographic origins, or when colonization of the LCR occurred. We sequenced mitochondrial and nuclear genes from Mastacembelus species collected in the lower Congo River, and compared them with other African species and Asian representatives as outgroups. We analyzed the sequence data using Maximum Likelihood and Bayesian phylogenetic inference. Bayesian and Maximum Likelihood phylogenetic analyses, and Bayesian coalescent methods for species tree reconstruction, reveal that endemic LCR spiny eels derive from two independent origins, clearly demonstrating convergent evolution of the cryptophthalmic phenotype. Mastacembelus crassus, M. aviceps, and M. simbi form a clade, allied to species found in southern, eastern and central Africa. Unexpectedly, M. brichardi and M. brachyrhinus fall within a clade otherwise endemic to Lake Tanganyika (LT) ca. 1500 km east of the LCR. Divergence dating suggests the ages of these two clades of LCR endemics differ markedly. The age of the crassus group is estimated at ~4 Myr while colonization of the LCR by the brichardi-brachyrhinus progenitor was considerably more recent, dated at ~0.5 Myr. 
The phylogenetic framework of spiny eels presented here, the first to include LCR species, demonstrates that cryptophthalmia and associated traits evolved at least twice in Mastacembelus: once in M. brichardi and at least once in the M. crassus clade. Timing of diversification is broadly consistent with the onset of modern high-energy flow conditions in the LCR and with previous studies of endemic cichlids. The close genetic relationship between M. brichardi and M. brachyrhinus is particularly notable given the extreme difference in phenotype between these species, and additional work is needed to better understand the evolutionary history of diversification in this clade. The findings presented here demonstrate strong, multi-trait convergence in LCR spiny eels, suggesting that extreme selective pressures have shaped numerous phenotypic attributes of the endemic species of this region.
Flatfish monophyly refereed by the relationship of Psettodes in Carangimorphariae.
Shi, Wei; Chen, Shixi; Kong, Xiaoyu; Si, Lizhen; Gong, Li; Zhang, Yanchun; Yu, Hui
2018-05-25
The monophyly of flatfishes has not been supported in many molecular phylogenetic studies. The monophyly of Pleuronectoidei, which comprises all but one family of flatfishes, is broadly supported. However, the Psettodoidei, comprising the single family Psettodidae, is often found to be most closely related to other carangimorphs based on substantial sequencing efforts and diverse analytical methods. In this study, we examined why this particular result is often obtained. The mitogenomes of five flatfishes were determined. Select mitogenomes of representative carangimorph species were further employed for phylogenetic and molecular clock analyses. Our phylogenetic results do not fully support Psettodes as a sister group to pleuronectoids or other carangimorphs. The results also support the evidence of long-branch attraction between Psettodes and the adjacent clades. Two chronograms, derived from Bayesian relaxed-clock methods, suggest that over a short period in the early Paleocene, a series of important evolutionary events occurred in carangimorphs. Based on insights provided by the molecular clock, we propose the following evolutionary explanation for the difficulty in determining the phylogenetic position of Psettodes: The initial diversification of Psettodes was very close in time to the initial diversification of carangimorphs, and the primary diversification time of pleuronectoids, the other suborder of flatfishes, occurred later than that of some percomorph taxa. Additionally, the clade of Psettodes is a long and naked branch, which supports the uncertainty of its phylogenetic placement. Finally, we confirmed the monophyly of flatfishes, which was accepted by most ichthyologists.
Deep phylogeographic divergence and cytonuclear discordance in the grasshopper Oedaleus decorus.
Kindler, Eveline; Arlettaz, Raphaël; Heckel, Gerald
2012-11-01
The grasshopper Oedaleus decorus is a thermophilic insect with a large, mostly south-Palaearctic distribution range, stretching from the Mediterranean regions in Europe to Central Asia and China. In this study, we analyzed the extent of phylogenetic divergence and the recent evolutionary history of the species based on 274 specimens from 26 localities across the distribution range in Europe. Phylogenetic relationships were determined using sequences of two mitochondrial loci (ctr, ND2) with neighbour-joining and Bayesian methods. Additionally, genetic differentiation was analyzed based on mitochondrial DNA and 11 microsatellite markers using F-statistics, model-free multivariate and model-based Bayesian clustering approaches. Phylogenetic analyses consistently detected two highly divergent, allopatrically distributed lineages within O. decorus. The divergence among these Western and Eastern lineages meeting in the region of the Alps was similar to the divergence of each lineage to the sister species O. asiaticus. Genetic differentiation for ctr was extremely high between Western and Eastern grasshopper populations (F(ct)=0.95). Microsatellite markers detected much lower but nevertheless very significant genetic structure among population samples. The nuclear data also demonstrated a case of cytonuclear discordance because the affiliation with mitochondrial lineages was incongruent in Northern Italy. Taken together these results provide evidence of an ancient separation within Oedaleus and either historical introgression of mtDNA among lineages and/or ongoing sex-specific gene flow in this grasshopper. Our study stresses the importance of multilocus approaches for unravelling the history and status of taxa of uncertain evolutionary divergence. Copyright © 2012 Elsevier Inc. All rights reserved.
Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics
Kolaczkowski, Bryan; Thornton, Joseph W.
2009-01-01
Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias—which is apparent under both controlled simulation conditions and in analyses of empirical sequence data—also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages—that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis. PMID:20011052
Long-branch attraction bias and inconsistency in Bayesian phylogenetics.
Kolaczkowski, Bryan; Thornton, Joseph W
2009-12-09
Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias--which is apparent under both controlled simulation conditions and in analyses of empirical sequence data--also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages--that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis.
Kim, Jiyeon; Kern, Elizabeth; Kim, Taeho; Sim, Mikang; Kim, Jaebum; Kim, Yuseob; Park, Chungoo; Nadler, Steven A; Park, Joong-Ki
2017-02-01
Plectida is an important nematode order with species that occupy many different biological niches. The order includes free-living aquatic and soil-dwelling species, but its phylogenetic position has remained uncertain. We sequenced the complete mitochondrial genomes of two members of this order, Plectus acuminatus and Plectus aquatilis and compared them with those of other major nematode clades. The genome size and base composition of these species are similar to other nematodes; 14,831 and 14,372bp, respectively, with AT contents of 71.0% and 70.1%. Gene content was also similar to other nematodes, but gene order and coding direction of Plectus mtDNAs were dissimilar from other chromadorean species. P. acuminatus and P. aquatilis are the first chromadorean species found to contain a gene inversion. We reconstructed mitochondrial genome phylogenetic trees using nucleotide and amino acid datasets from 87 nematodes that represent major nematode clades, including the Plectus sequences. Trees from phylogenetic analyses using maximum likelihood and Bayesian methods depicted Plectida as the sister group to other sequenced chromadorean nematodes. This finding is consistent with several phylogenetic results based on SSU rDNA, but disagrees with a classification based on morphology. Mitogenomes representing other basal chromadorean groups (Araeolaimida, Monhysterida, Desmodorida, Chromadorida) are needed to confirm their phylogenetic relationships. Copyright © 2016 Elsevier Inc. All rights reserved.
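As an illustration of the base-composition figures quoted in mitogenome reports such as the one above, the following is a minimal sketch of how an AT content percentage can be computed from a nucleotide string. The function name `at_content` and the choice to ignore ambiguous symbols such as N are assumptions made for this example; they are not taken from the study.

```python
def at_content(seq):
    """Return the percentage of A and T among the unambiguous bases of seq.

    Assumption for this sketch: any symbol other than A, C, G or T
    (for example N or a gap) is ignored rather than counted.
    """
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    at_bases = sum(1 for base in counted if base in "AT")
    return 100.0 * at_bases / len(counted)


if __name__ == "__main__":
    # Toy sequence, not real mitogenome data.
    print(round(at_content("AATTGCNN"), 1))
```

Applied to a complete mitogenome sequence, a function of this kind would yield a single percentage comparable to the AT contents reported in the abstract.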
Multiple optimality criteria support Ornithoscelida
NASA Astrophysics Data System (ADS)
Parry, Luke A.; Baron, Matthew G.; Vinther, Jakob
2017-10-01
A recent study of early dinosaur evolution using equal-weights parsimony recovered a scheme of dinosaur interrelationships and classification that differed from historical consensus in a single, but significant, respect; Ornithischia and Saurischia were not recovered as monophyletic sister-taxa, but rather Ornithischia and Theropoda formed a novel clade named Ornithoscelida. However, these analyses only used maximum parsimony, and numerous recent simulation studies have questioned the accuracy of parsimony under equal weights. Here, we provide additional support for this alternative hypothesis using a Bayesian implementation of the Mkv model, as well as through a number of additional parsimony analyses, including implied weighting. Using Bayesian inference and implied weighting, we recover the same fundamental topology for Dinosauria as the original study, with a monophyletic Ornithoscelida, demonstrating that the main suite of methods used in morphological phylogenetics recovers this novel hypothesis. This result was further scrutinized through the systematic exclusion of different character sets. Novel characters from the original study (those not taken or adapted from previous phylogenetic studies) were found to be more important for resolving the relationships within Dinosauromorpha than the relationships within Dinosauria. Reanalysis of a modified version of the character matrix that supports the Ornithischia-Saurischia dichotomy under maximum parsimony also supports this hypothesis under implied weighting, but not under the Mkv model, with both Theropoda and Sauropodomorpha becoming paraphyletic with respect to Ornithischia.
Devitt, Thomas J
2006-12-01
The Western Lyresnake (Trimorphodon biscutatus) is a widespread, polytypic taxon inhabiting arid regions from the warm deserts of the southwestern United States southward along the Pacific versant of Mexico to the tropical deciduous forests of Mesoamerica. This broadly distributed species provides a unique opportunity to evaluate a priori biogeographical hypotheses spanning two major distinct biogeographical realms (the Nearctic and Neotropical) that are usually treated separately in phylogeographical analyses. I investigated the phylogeography of T. biscutatus using maximum likelihood and Bayesian phylogenetic analysis of mitochondrial DNA (mtDNA) from across this species' range. Phylogenetic analyses recovered five well-supported clades whose boundaries are concordant with existing geographical barriers, a pattern consistent with a model of vicariant allopatric divergence. Assuming a vicariance model, divergence times between mitochondrial lineages were estimated using Bayesian relaxed molecular clock methods calibrated using geological information from putative vicariant events. Divergence time point estimates were bounded by broad confidence intervals, and thus these highly conservative estimates should be considered tentative hypotheses at best. Comparison of mtDNA lineages and taxa traditionally recognized as subspecies based on morphology suggests this taxon is comprised of multiple independent lineages at various stages of divergence, ranging from putative secondary contact and hybridization to sympatry of 'subspecies'.
Dool, Serena E; Puechmaille, Sebastien J; Foley, Nicole M; Allegrini, Benjamin; Bastian, Anna; Mutumi, Gregory L; Maluleke, Tinyiko G; Odendaal, Lizelle J; Teeling, Emma C; Jacobs, David S
2016-04-01
Despite many studies illustrating the perils of utilising mitochondrial DNA in phylogenetic studies, it remains one of the most widely used genetic markers for this purpose. Over the last decade, nuclear introns have been proposed as alternative markers for phylogenetic reconstruction. However, the resolution capabilities of mtDNA and nuclear introns have rarely been quantified and compared. In the current study we generated a novel ∼5kb dataset comprising six nuclear introns and a mtDNA fragment. We assessed the relative resolution capabilities of the six intronic fragments with respect to each other, when used in various combinations together, and when compared to the traditionally used mtDNA. We focused on a major clade in the horseshoe bat family (Afro-Palaearctic clade; Rhinolophidae) as our case study. This old, widely distributed and speciose group contains a high level of conserved morphology. This morphological stasis renders the reconstruction of the phylogeny of this group with traditional morphological characters complex. We sampled multiple individuals per species to represent their geographic distributions as best as possible (122 individuals, 24 species, 68 localities). We reconstructed the species phylogeny using several complementary methods (partitioned Maximum Likelihood and Bayesian and Bayesian multispecies-coalescent) and made inferences based on consensus across these methods. We computed pairwise comparisons based on Robinson-Foulds tree distance metric between all Bayesian topologies generated (27,000) for every gene(s) and visualised the tree space using multidimensional scaling (MDS) plots. Using our supported species phylogeny we estimated the ancestral state of key traits of interest within this group, e.g. echolocation peak frequency which has been implicated in speciation. 
Our results revealed many potential cryptic species within this group, even in taxa where this was not suspected a priori and also found evidence for mtDNA introgression. We demonstrated that by using just two introns one can recover a better supported species tree than when using the mtDNA alone, despite the shorter overall length of the combined introns. Additionally, when combining any single intron with mtDNA, we showed that the result is highly similar to the mtDNA gene tree and far from the true species tree and therefore this approach should be avoided. We caution against the indiscriminate use of mtDNA in phylogenetic studies and advocate for pilot studies to select nuclear introns. The selection of marker type and number is a crucial step that is best based on critical examination of preliminary or previously published data. Based on our findings and previous publications, we recommend the following markers to recover phylogenetic relationships between recently diverged taxa (<20 My) in bats and other mammals: ACOX2, COPS7A, BGN, ROGDI and STAT5A. Copyright © 2016 Elsevier Inc. All rights reserved.
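The pairwise tree comparisons mentioned in the abstract above rely on the Robinson-Foulds distance, which counts the groupings found in one tree but not in the other. The following is a minimal sketch of a rooted variant of that metric, under stated assumptions: trees are written as nested Python tuples of leaf names, and both trees carry the same set of leaves. The function names `clades` and `rf_distance` are hypothetical and do not correspond to the software used in the study, which may have used the unrooted (bipartition-based) form.

```python
def clades(tree):
    """Collect the non-trivial clades of a rooted tree.

    A tree is a nested tuple whose leaves are strings (an assumption of
    this sketch). Each clade is returned as a frozenset of leaf names.
    """
    found = set()

    def leaves(node):
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    all_leaves = leaves(tree)
    # The clade containing every leaf is shared by all trees on these taxa.
    found.discard(all_leaves)
    return found


def rf_distance(tree_a, tree_b):
    """Rooted Robinson-Foulds distance: size of the symmetric difference
    between the clade sets of two trees on the same leaves."""
    return len(clades(tree_a) ^ clades(tree_b))


if __name__ == "__main__":
    balanced = (("A", "B"), ("C", "D"))
    ladder = ((("A", "B"), "C"), "D")
    print(rf_distance(balanced, ladder))
```

A matrix of such distances over all sampled topologies is the kind of input that a multidimensional scaling plot of tree space is built from.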
Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano
2007-01-01
Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612
Phylogeny of Neotropical Cercosaura (Squamata: Gymnophthalmidae) lizards.
Torres-Carvajal, Omar; Lobos, Simón E; Venegas, Pablo J
2015-12-01
Among Neotropical lizards, the geographically widespread gymnophthalmid Cercosaura as currently defined includes lowland and highland taxa from Panama to Argentina, with some species occurring in the northern Andes. In this study we analyze three mitochondrial (12S, 16S, ND4) and one nuclear (c-mos) gene using Bayesian methods to clarify the phylogenetic relationships among most species of Cercosaura based on a well-supported phylogenetic hypothesis that also includes a large sample of other taxa within Cercosaurini. The phylogenetic tree obtained in this paper shows that Cercosaura as currently defined is not monophyletic. Two species from the northern Andes (C. dicra and C. vertebralis) are nested within Pholidobolus, which has been formerly recognized as a major radiation along the Andes of Ecuador and Colombia. Therefore, Cercosaura has probably not diversified in the northern Andes, although the phylogenetic position of C. hypnoides from the Andes of Colombia remains unknown. Tree topology and genetic distances support both recognition of C. ocellata bassleri as a distinct species, C. bassleri, and recognition of C. argula and C. oshaughnessyi as two different species. In the interest of promoting clarity and precision regarding the names of clades of gymnophthalmid lizards, we propose a phylogenetic definition of Cercosaura. Copyright © 2015 Elsevier Inc. All rights reserved.
Is multiple-sequence alignment required for accurate inference of phylogeny?
Höhl, Michael; Ragan, Mark A
2007-04-01
The process of inferring phylogenetic trees from molecular sequences almost always starts with a multiple alignment of these sequences but can also be based on methods that do not involve multiple sequence alignment. Very little is known about the accuracy with which such alignment-free methods recover the correct phylogeny or about the potential for increasing their accuracy. We conducted a large-scale comparison of ten alignment-free methods, among them one new approach that does not calculate distances and a faster variant of our pattern-based approach; all distance-based alignment-free methods are freely available from http://www.bioinformatics.org.au (as Python package decaf+py). We show that most methods exhibit a higher overall reconstruction accuracy in the presence of high among-site rate variation. Under all conditions that we considered, variants of the pattern-based approach were significantly better than the other alignment-free methods. The new pattern-based variant achieved a speed-up of an order of magnitude in the distance calculation step, accompanied by a small loss of tree reconstruction accuracy. A method of Bayesian inference from k-mers did not improve on classical alignment-free (and distance-based) methods but may still offer other advantages due to its Bayesian nature. We found the optimal word length k of word-based methods to be stable across various data sets, and we provide parameter ranges for two different alphabets. The influence of these alphabets was analyzed to reveal a trade-off in reconstruction accuracy between long and short branches. We have mapped the phylogenetic accuracy for many alignment-free methods, among them several recently introduced ones, and increased our understanding of their behavior in response to biologically important parameters. In all experiments, the pattern-based approach emerged as superior, at the expense of higher resource consumption. 
Nonetheless, no alignment-free method that we examined recovers the correct phylogeny as accurately as does an approach based on maximum-likelihood distance estimates of multiply aligned sequences.
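To make the idea of a word-based alignment-free method concrete, the following is a minimal sketch that compares sequences by their k-mer (word) frequency profiles and builds a pairwise distance table without any alignment. The Euclidean distance, the default k = 3, and the function names are illustrative assumptions; this is not the pattern-based approach or any of the ten methods evaluated in the study, and it does not use the decaf+py package.

```python
from collections import Counter
from itertools import combinations
import math


def kmer_profile(seq, k):
    """Relative frequencies of the overlapping words of length k in seq."""
    if len(seq) < k:
        raise ValueError("sequence is shorter than the word length k")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def kmer_distance(seq_a, seq_b, k=3):
    """Euclidean distance between two k-mer frequency profiles.

    The choice of Euclidean distance is an assumption of this sketch.
    """
    pa, pb = kmer_profile(seq_a, k), kmer_profile(seq_b, k)
    words = set(pa) | set(pb)
    return math.sqrt(sum((pa.get(w, 0.0) - pb.get(w, 0.0)) ** 2 for w in words))


def distance_matrix(seqs, k=3):
    """Pairwise distances for a dict mapping taxon name to sequence."""
    names = sorted(seqs)
    dist = {(name, name): 0.0 for name in names}
    for a, b in combinations(names, 2):
        dist[(a, b)] = dist[(b, a)] = kmer_distance(seqs[a], seqs[b], k)
    return dist


if __name__ == "__main__":
    # Toy sequences, not data from the study.
    toy = {"t1": "ACGTACGTAC", "t2": "ACGTACGAAC", "t3": "TTTTGGGGCC"}
    for pair, d in sorted(distance_matrix(toy).items()):
        print(pair, round(d, 3))
```

A distance table of this kind could then be passed to a distance-based tree-building method such as neighbour-joining; the word length k is the parameter whose optimal range the abstract reports to be stable across data sets.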
Fortunato, Laura; Jordan, Fiona
2010-01-01
Accurate reconstruction of prehistoric social organization is important if we are to put together satisfactory multidisciplinary scenarios about, for example, the dispersal of human groups. Such considerations apply in the case of Indo-European and Austronesian, two large-scale language families that are thought to represent Neolithic expansions. Ancestral kinship patterns have mostly been inferred through reconstruction of kin terminologies in ancestral proto-languages using the linguistic comparative method, and through geographical or distributional arguments based on the comparative patterns of kin terms and ethnographic kinship ‘facts’. While these approaches are detailed and valuable, the processes through which conclusions have been drawn from the data fail to provide explicit criteria for systematic testing of alternative hypotheses. Here, we use language trees derived using phylogenetic tree-building techniques on Indo-European and Austronesian vocabulary data. With these trees, ethnographic data and Bayesian phylogenetic comparative methods, we statistically reconstruct past marital residence and infer rates of cultural change between different residence forms, showing Proto-Indo-European to be virilocal and Proto-Malayo-Polynesian uxorilocal. The instability of uxorilocality and the rare loss of virilocality once gained emerge as common features of both families. PMID:21041215
Demaio, Pablo H; Barfuss, Michael H J; Kiesling, Roberto; Till, Walter; Chiapella, Jorge O
2011-11-01
The South American genus Gymnocalycium (Cactoideae-Trichocereae) demonstrates how the sole use of morphological data in Cactaceae results in conflicts in assessing phylogeny, constructing a taxonomic system, and analyzing trends in the evolution of the genus. Molecular phylogenetic analysis was performed using parsimony and Bayesian methods on a 6195-bp data matrix of plastid DNA sequences (atpI-atpH, petL-psbE, trnK-matK, trnT-trnL-trnF) of 78 samples, including 52 species and infraspecific taxa representing all the subgenera of Gymnocalycium. We assessed morphological character evolution using likelihood methods to optimize characters on a Bayesian tree and to reconstruct possible ancestral states. The results of the phylogenetic study confirm the monophyly of the genus, while supporting overall the available infrageneric classification based on seed morphology. Analysis showed the subgenera Microsemineum and Macrosemineum to be polyphyletic and paraphyletic. Analysis of morphological characters showed a tendency toward reduction of stem size, reduction in quantity and hardiness of spines, increment of seed size, development of napiform roots, and change from juicy and colorful fruits to dry and green fruits. Gymnocalycium saglionis is the only species of Microsemineum and a new name is required to identify the clade including the remaining species of Microsemineum; we propose the name Scabrosemineum in agreement with seed morphology. Identifying morphological trends and environmental features allows for a better understanding of the events that might have influenced the diversification of the genus.
Evidence of transoceanic dispersion of the genus Vanilla based on plastid DNA phylogenetic analysis.
Bouetard, Anthony; Lefeuvre, Pierre; Gigant, Rodolphe; Bory, Séverine; Pignal, Marc; Besse, Pascale; Grisoni, Michel
2010-05-01
The phylogeny and the biogeographical history of the genus Vanilla were investigated using four chloroplastic genes (psbB, psbC, psaB and rbcL), on 47 accessions of Vanilla chosen from the ex situ CIRAD collection maintained in Reunion Island and additional sequences from GenBank. Bayesian methods provided a fairly well supported reconstruction of the phylogeny of the Vanilloideae sub-family and more particularly of the genus Vanilla. Three major phylogenetic groups in the genus Vanilla were differentiated, which is in disagreement with the current classification into two sections (Foliosae and Aphyllae) based on morphological traits. Recent Bayesian relaxed molecular clock methods allowed us to test the two main hypotheses of the phylogeography of the genus Vanilla. Early radiation of the Vanilla genus and diversification by vicariance consecutive to the break-up of Gondwana, 95 million years ago (Mya), was incompatible with the accepted age of origin of angiosperms. Based on the Vanilloideae age recently estimated at 71 million years ago (Mya), we conclude that the genus Vanilla would have appeared approximately 34 Mya in South America, when continents were already separated. Nevertheless, under either of the two extreme scenarios tested, at least three long distance migration events are needed to explain the present distribution of Vanilla species in tropical areas. These transoceanic dispersions could have occurred via transoceanic passageways such as the Rio Grande Ridge and the involvement of floating vegetation mats and migratory birds. Copyright 2010 Elsevier Inc. All rights reserved.
McCann, Jamie; Schneeweiss, Gerald M.; Stuessy, Tod F.; Villaseñor, Jose L.; Weiss-Schneeweiss, Hanna
2016-01-01
Chromosome number change (polyploidy and dysploidy) plays an important role in plant diversification and speciation. Investigating chromosome number evolution commonly entails ancestral state reconstruction performed within a phylogenetic framework, which is, however, prone to uncertainty, whose effects on evolutionary inferences are insufficiently understood. Using the chromosomally diverse plant genus Melampodium (Asteraceae) as a model group, we assess the impact of reconstruction method (maximum parsimony, maximum likelihood, Bayesian methods), branch length model (phylograms versus chronograms) and phylogenetic uncertainty (topological and branch length uncertainty) on the inference of chromosome number evolution. We also address the suitability of the maximum clade credibility (MCC) tree as a single representative topology for chromosome number reconstruction. Each of the listed factors causes considerable incongruence among chromosome number reconstructions. Discrepancies between inferences on the MCC tree and those made by integrating over a set of trees are moderate for ancestral chromosome numbers, but severe for the difference of chromosome gains and losses, a measure of the directionality of dysploidy. Therefore, reliance on single trees, such as the MCC tree, is strongly discouraged and model averaging, taking both phylogenetic and model uncertainty into account, is recommended. For studying chromosome number evolution, dedicated models implemented in the program ChromEvol and ordered maximum parsimony may be most appropriate. Chromosome number evolution in Melampodium follows a pattern of bidirectional dysploidy (starting from x = 11 to x = 9 and x = 14, respectively) with no prevailing direction. PMID:27611687
Salas-Leiva, Dayana E; Meerow, Alan W; Calonje, Michael; Griffith, M Patrick; Francisco-Ortega, Javier; Nakamura, Kyoko; Stevenson, Dennis W; Lewis, Carl E; Namoff, Sandra
2013-11-01
Despite a recent new classification, a stable phylogeny for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study, five single-copy nuclear genes (SCNGs) are applied to the phylogeny of the order Cycadales. The specific aim is to evaluate several gene tree-species tree reconciliation approaches for developing an accurate phylogeny of the order, to contrast them with concatenated parsimony analysis and to resolve the erstwhile problematic phylogenetic position of these three genera. DNA sequences of five SCNGs were obtained for 20 cycad species representing all ten genera of Cycadales. These were analysed with parsimony, maximum likelihood (ML) and three Bayesian methods of gene tree-species tree reconciliation, using Cycas as the outgroup. A calibrated date estimation was developed with Bayesian methods, and biogeographic analysis was also conducted. Concatenated parsimony, ML and three species tree inference methods resolve exactly the same tree topology with high support at most nodes. Dioon and Bowenia are the first and second branches of Cycadales after Cycas, respectively, followed by an encephalartoid clade (Macrozamia-Lepidozamia-Encephalartos), which is sister to a zamioid clade, of which Ceratozamia is the first branch, and in which Stangeria is sister to Microcycas and Zamia. A single, well-supported phylogenetic hypothesis of the generic relationships of the Cycadales is presented. However, massive extinction events inferred from the fossil record that eliminated broader ancestral distributions within Zamiaceae compromise accurate optimization of ancestral biogeographical areas for that hypothesis. While major lineages of Cycadales are ancient, crown ages of all modern genera are no older than 12 million years, supporting a recent hypothesis of mostly Miocene radiations. This phylogeny can contribute to an accurate infrafamilial classification of Zamiaceae.
Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A
2018-05-01
Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.
Calibrated birth-death phylogenetic time-tree priors for Bayesian inference.
Heled, Joseph; Drummond, Alexei J
2015-05-01
Here we introduce a general class of multiple calibration birth-death tree priors for use in Bayesian phylogenetic inference. All tree priors in this class separate ancestral node heights into a set of "calibrated nodes" and "uncalibrated nodes" such that the marginal distribution of the calibrated nodes is user-specified whereas the density ratio of the birth-death prior is retained for trees with equal values for the calibrated nodes. We describe two formulations, one in which the calibration information informs the prior on ranked tree topologies, through the (conditional) prior, and the other which factorizes the prior on divergence times and ranked topologies, thus allowing uniform, or any arbitrary prior distribution on ranked topologies. Although the first of these formulations has some attractive properties, the algorithm we present for computing its prior density is computationally intensive. However, the second formulation is always faster and computationally efficient for up to six calibrations. We demonstrate the utility of the new class of multiple-calibration tree priors using both small simulations and a real-world analysis and compare the results to existing schemes. The two new calibrated tree priors described in this article offer greater flexibility and control of prior specification in calibrated time-tree inference and divergence time dating, and will remove the need for indirect approaches to the assessment of the combined effect of calibration densities and tree priors in Bayesian phylogenetic inference. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Efficiency of nuclear and mitochondrial markers recovering and supporting known amniote groups.
Lambret-Frotté, Julia; Perini, Fernando Araújo; de Moraes Russo, Claudia Augusta
2012-01-01
We have analysed the efficiency of all mitochondrial protein-coding genes and six nuclear markers (Adora3, Adrb2, Bdnf, Irbp, Rag2 and Vwf) in reconstructing and statistically supporting known amniote groups (murines, rodents, primates, eutherians, metatherians, therians). The efficiencies of maximum likelihood, Bayesian inference, maximum parsimony, neighbor-joining and UPGMA were also evaluated by assessing the number of correctly and incorrectly recovered groupings. In addition, we compared support values using the conservative bootstrap test and Bayesian posterior probabilities. First, no correlation was observed between gene size and marker efficiency in recovering or supporting correct nodes. As expected, tree-building methods performed similarly, even UPGMA, which in some cases outperformed other, more extensively used methods. Bayesian posterior probabilities tend to show much higher support values than the conservative bootstrap test, for both correct and incorrect nodes. Our results also suggest that nuclear markers do not necessarily perform better than mitochondrial genes. The so-called dependency among mitochondrial markers was not observed when comparing genome performances. Finally, the amniote groups with the lowest recovery rates were therians and rodents, despite the morphological support for their monophyletic status. We suggest that, regardless of the tree-building method, a few carefully selected genes are able to unfold a detailed and robust scenario of phylogenetic hypotheses, particularly if taxon sampling is increased.
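The scoring idea in this abstract, counting how many known groups a tree recovers, can be illustrated with a minimal Python sketch. It assumes rooted trees written as nested tuples of taxon names; the function names, the toy trees, and the group definitions are hypothetical and are not taken from the study.

```python
from typing import Dict, FrozenSet, Set, Tuple, Union

# A tree is either a leaf name or a tuple of subtrees (illustrative encoding).
Tree = Union[str, Tuple["Tree", ...]]


def clades(tree: Tree) -> Set[FrozenSet[str]]:
    """Collect the leaf set under every internal node of a rooted tree."""
    found: Set[FrozenSet[str]] = set()

    def leaves(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    leaves(tree)
    return found


def recovery_counts(tree: Tree, known_groups: Dict[str, Set[str]]) -> Tuple[int, int]:
    """Return (recovered, not recovered) counts of known groups for one tree."""
    recovered = clades(tree)
    hits = sum(1 for taxa in known_groups.values() if frozenset(taxa) in recovered)
    return hits, len(known_groups) - hits


# Hypothetical toy data: taxon sampling far smaller than in the study.
groups = {
    "murines": {"mouse", "rat"},
    "rodents": {"mouse", "rat", "squirrel"},
    "primates": {"human", "chimp"},
}
good_tree = ((("mouse", "rat"), "squirrel"), ("human", "chimp"))
bad_tree = (("mouse", "rat"), ("squirrel", ("human", "chimp")))
```

Here `recovery_counts(good_tree, groups)` counts all three groups as recovered, while `bad_tree` fails to recover rodents because squirrel groups with the primates.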
Peña, Carlos; Espeland, Marianne
2015-01-01
The species rich butterfly family Nymphalidae has been used to study evolutionary interactions between plants and insects. Theories of insect-hostplant dynamics predict accelerated diversification due to key innovations. In evolutionary biology, analysis of maximum credibility trees in the software MEDUSA (modelling evolutionary diversity using stepwise AIC) is a popular method for estimation of shifts in diversification rates. We investigated whether phylogenetic uncertainty can produce different results by extending the method across a random sample of trees from the posterior distribution of a Bayesian run. Using the MultiMEDUSA approach, we found that phylogenetic uncertainty greatly affects diversification rate estimates. Different trees produced diversification rates ranging from high values to almost zero for the same clade, and both significant rate increase and decrease in some clades. Only four out of 18 significant shifts found on the maximum clade credibility tree were consistent across most of the sampled trees. Among these, we found accelerated diversification for Ithomiini butterflies. We used the binary speciation and extinction model (BiSSE) and found that a hostplant shift to Solanaceae is correlated with increased net diversification rates in Ithomiini, congruent with the diffuse cospeciation hypothesis. Our results show that taking phylogenetic uncertainty into account when estimating net diversification rate shifts is of great importance, as very different results can be obtained when using the maximum clade credibility tree and other trees from the posterior distribution. PMID:25830910
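As a rough illustration of checking whether a rate shift is consistent across posterior trees, the following Python sketch tallies, for each clade, the fraction of sampled trees in which a shift was detected. It assumes the per-tree shift detections are already available as sets of clade names; it does not call MEDUSA or MultiMEDUSA, and the clade labels other than Ithomiini and the 0.5 cutoff for "most trees" are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List, Set


def shift_support(shifts_per_tree: List[Set[str]]) -> Dict[str, float]:
    """Fraction of sampled trees in which each clade shows a detected shift."""
    n_trees = len(shifts_per_tree)
    counts = Counter(clade for shifts in shifts_per_tree for clade in shifts)
    return {clade: counts[clade] / n_trees for clade in counts}


def consistent_shifts(shifts_per_tree: List[Set[str]], cutoff: float = 0.5) -> List[str]:
    """Clades whose shift appears in more than `cutoff` of the trees (assumed cutoff)."""
    support = shift_support(shifts_per_tree)
    return sorted(clade for clade, frac in support.items() if frac > cutoff)


# Hypothetical detections on four posterior trees.
detections = [
    {"Ithomiini", "cladeA"},
    {"Ithomiini"},
    {"Ithomiini", "cladeB"},
    {"cladeA"},
]
print(shift_support(detections)["Ithomiini"])  # 0.75
print(consistent_shifts(detections))           # ['Ithomiini']
```

A shift that appears only on a single summary tree would receive low support under this tally, which is the kind of discrepancy the abstract reports.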
Detecting Network Communities: An Application to Phylogenetic Analysis
Andrade, Roberto F. S.; Rocha-Neto, Ivan C.; Santos, Leonardo B. L.; de Santana, Charles N.; Diniz, Marcelo V. C.; Lobão, Thierry Petit; Goés-Neto, Aristóteles; Pinho, Suani T. R.; El-Hani, Charbel N.
2011-01-01
This paper proposes a new method to identify communities in generally weighted complex networks and applies it to phylogenetic analysis. In this case, weights correspond to the similarity indexes among protein sequences, which can be used for network construction so that the network structure can be analyzed to recover phylogenetically useful information from its properties. The analyses discussed here are mainly based on the modular character of protein similarity networks, explored through the Newman-Girvan algorithm, with the help of the neighborhood matrix. The most relevant networks are found when the network topology changes abruptly, revealing distinct modules related to the sets of organisms to which the proteins belong. Sound biological information can be retrieved by the computational routines used in the network approach, without using biological assumptions other than those incorporated by BLAST. Usually, all the main bacterial phyla and, in some cases, also some bacterial classes corresponded totally (100%) or to a great extent (>70%) to the modules. We checked for internal consistency in the obtained results, and we scored close to 84% of matches for community pertinence when comparisons between the results were performed. To illustrate how to use the network-based method, we employed data for enzymes involved in the chitin metabolic pathway that are present in more than 100 organisms from an original data set containing 1,695 organisms, downloaded from GenBank on May 19, 2007. A preliminary comparison between the outcomes of the network-based method and the results of methods based on Bayesian, distance, likelihood, and parsimony criteria suggests that the former is as reliable as these commonly used methods. We conclude that the network-based method can be used as a powerful tool for retrieving modularity information from weighted networks, which is useful for phylogenetic analysis. PMID:21573202
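To make the idea of a similarity-thresholded network concrete, here is a small Python sketch that links two sequences when their similarity reaches a threshold and then lists the connected components. This is a deliberate simplification: it is not the Newman-Girvan algorithm and does not use the neighborhood matrix described in the abstract, and the similarity values are hypothetical.

```python
from typing import List, Sequence


def components(similarity: Sequence[Sequence[float]], threshold: float) -> List[List[int]]:
    """Connected components of the graph with an edge i-j when similarity >= threshold."""
    n = len(similarity)
    seen = set()
    result = []
    for start in range(n):
        if start in seen:
            continue
        seen.add(start)
        stack, group = [start], []
        while stack:  # depth-first traversal from `start`
            i = stack.pop()
            group.append(i)
            for j in range(n):
                if j != i and j not in seen and similarity[i][j] >= threshold:
                    seen.add(j)
                    stack.append(j)
        result.append(sorted(group))
    return result


# Hypothetical symmetric similarity matrix for four sequences.
sim = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.3, 0.2],
    [0.2, 0.3, 1.0, 0.8],
    [0.1, 0.2, 0.8, 1.0],
]
print(components(sim, 0.5))   # [[0, 1], [2, 3]]
print(components(sim, 0.25))  # [[0, 1, 2, 3]]
```

Scanning the threshold and watching where the number of components jumps gives a crude picture of the abrupt topology changes the abstract refers to.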
Staggemeier, Vanessa Graziele; Diniz-Filho, José Alexandre Felizola; Forest, Félix; Lucas, Eve
2015-01-01
Background and Aims Myrcia section Aulomyrcia includes ∼120 species that are endemic to the Neotropics and disjunctly distributed in the moist Amazon and Atlantic coastal forests of Brazil. This paper presents the first comprehensive phylogenetic study of this group and this phylogeny is used as a basis to evaluate recent classification systems and to test alternative hypotheses associated with the history of this clade. Methods Fifty-three taxa were sampled out of the 120 species currently recognized, plus 40 outgroup taxa, for one nuclear marker (ribosomal internal transcribed spacer) and four plastid markers (psbA-trnH, trnL-trnF, trnQ-rpS16 and ndhF). The relationships were reconstructed based on Bayesian and maximum likelihood analyses. Additionally, a likelihood approach, ‘geographic state speciation and extinction’, was used to estimate region-dependent rates of speciation, extinction and dispersal, comparing historically climatically stable areas (refugia) and unstable areas. Key Results Maximum likelihood and Bayesian inferences indicate that Myrcia and Marlierea are polyphyletic, and the internal groupings recovered are characterized by combinations of morphological characters. Phylogenetic relationships support a link between Amazonian and north-eastern species and between north-eastern and south-eastern species. Lower extinction rates within glacial refugia suggest that these areas were important in maintaining diversity in the Atlantic forest biodiversity hotspot. Conclusions This study provides a robust phylogenetic framework to address important ecological questions for Myrcia s.l. within an evolutionary context, and supports the need to unite taxonomically the two traditional genera Myrcia and Marlierea in an expanded Myrcia s.l. Furthermore, this study offers valuable insights into the diversification of plant species in the highly impacted Atlantic forest of South America; evidence is presented that the lowest extinction rates are found inside refugia and that range expansion from unstable areas contributes to the highest levels of plant diversity in the Bahian refugium. PMID:25757471
Davis, Brian W; Li, Gang; Murphy, William J
2010-07-01
The pantherine lineage of cats diverged from the remainder of modern Felidae less than 11 million years ago and consists of the five big cats of the genus Panthera, the lion, tiger, jaguar, leopard, and snow leopard, as well as the closely related clouded leopard. A significant problem exists with respect to the precise phylogeny of these highly threatened great cats. Despite multiple publications on the subject, no two molecular studies have reconstructed Panthera with the same topology. These evolutionary relationships remain unresolved partially due to the recent and rapid radiation of pantherines in the Pliocene, individual speciation events occurring within less than 1 million years, and probable introgression between lineages following their divergence. We provide an alternative, highly supported interpretation of the evolutionary history of the pantherine lineage using novel and published DNA sequence data from the autosomes, both sex chromosomes and the mitochondrial genome. New sequences were generated for 39 single-copy regions of the felid Y chromosome, as well as four mitochondrial and four autosomal gene segments, totaling 28.7 kb. Phylogenetic analysis of these new data, combined with all published data in GenBank, highlighted the prevalence of phylogenetic disparities stemming either from the amplification of a mitochondrial to nuclear translocation event (numt), or errors in species identification. Our 47.6 kb combined dataset was analyzed as a supermatrix and with respect to individual partitions using maximum likelihood and Bayesian phylogenetic inference, in conjunction with Bayesian Estimation of Species Trees (BEST) which accounts for heterogeneous gene histories. Our results yield a robust consensus topology supporting the monophyly of lion and leopard, with jaguar sister to these species, as well as a sister species relationship of tiger and snow leopard. 
These results highlight new avenues for the study of speciation genomics and understanding the historical events surrounding the origin of the members of this lineage. Copyright 2010 Elsevier Inc. All rights reserved.
Baca, Stephen M; Toussaint, Emmanuel F A; Miller, Kelly B; Short, Andrew E Z
2017-02-01
The first molecular phylogenetic hypothesis for the aquatic beetle family Noteridae is inferred using DNA sequence data from five gene fragments (mitochondrial and nuclear): COI, H3, 16S, 18S, and 28S. Our analysis is the most comprehensive phylogenetic reconstruction of Noteridae to date, and includes 53 species representing all subfamilies, tribes and 16 of the 17 genera within the family. We examine the impact of data partitioning on phylogenetic inference by comparing two different algorithm-based partitioning strategies: one using predefined subsets of the dataset, and another recently introduced method, which uses the k-means algorithm to iteratively divide the dataset into clusters of sites evolving at similar rates across sampled loci. We conducted both maximum likelihood and Bayesian inference analyses using these different partitioning schemes. Resulting trees are strongly incongruent with prior classifications of Noteridae. We recover variant tree topologies and support values among the implemented partitioning schemes. Bayes factors calculated with marginal likelihoods of Bayesian analyses support a priori partitioning over k-means and unpartitioned data strategies. Our study substantiates the importance of data partitioning in phylogenetic inference, and underscores the use of comparative analyses to determine optimal analytical strategies. Our analyses recover Noterini Thomson to be paraphyletic with respect to three other tribes. The genera Suphisellus Crotch and Hydrocanthus Say are also recovered as paraphyletic. Following the results of the preferred partitioning scheme, we here propose a revised classification of Noteridae, comprising two subfamilies, three tribes and 18 genera. The following taxonomic changes are made: Notomicrinae sensu n. (= Phreatodytinae syn. n.) is expanded to include the tribe Phreatodytini; Noterini sensu n. (= Neohydrocoptini syn. n., Pronoterini syn. n., Tonerini syn. n.) is expanded to include all genera of the Noterinae; the genus Suphisellus Crotch is expanded to include species of Pronoterus Sharp syn. n.; and the former subgenus Sternocanthus Guignot stat. rev. is resurrected from synonymy and elevated to genus rank. Copyright © 2016 Elsevier Inc. All rights reserved.
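For readers unfamiliar with how marginal likelihoods translate into Bayes factors, the comparison of partitioning schemes can be sketched in Python as follows. The scheme names echo the abstract, but the log marginal likelihood values are hypothetical placeholders, not results from the study, and the 2 ln BF scale is simply one common way to report the comparison.

```python
from typing import Dict


def two_ln_bayes_factor(log_ml_a: float, log_ml_b: float) -> float:
    """2 * ln(BF) in favour of model A over model B, from log marginal likelihoods."""
    return 2.0 * (log_ml_a - log_ml_b)


def best_scheme(log_mls: Dict[str, float]) -> str:
    """Scheme with the highest log marginal likelihood."""
    return max(log_mls, key=log_mls.get)


# Hypothetical log marginal likelihoods for three partitioning strategies.
log_mls = {
    "a_priori": -25000.0,
    "k_means": -25040.0,
    "unpartitioned": -25300.0,
}
print(best_scheme(log_mls))                                              # a_priori
print(two_ln_bayes_factor(log_mls["a_priori"], log_mls["k_means"]))      # 80.0
```

Positive values favour the first scheme; how large a value counts as decisive depends on the interpretive scale the analyst adopts.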
Shiino, Teiichiro; Hattori, Junko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru
2014-01-01
Background One major circulating HIV-1 subtype in Southeast Asian countries is CRF01_AE, but little is known about its epidemiology in Japan. We conducted a molecular phylodynamic study of patients newly diagnosed with CRF01_AE from 2003 to 2010. Methods Plasma samples from patients registered in the Japanese Drug Resistance HIV-1 Surveillance Network were analyzed for protease-reverse transcriptase sequences; all sequences underwent subtyping and phylogenetic analysis using distance-matrix-based, maximum likelihood and Bayesian coalescent Markov Chain Monte Carlo (MCMC) phylogenetic inferences. Transmission clusters were identified using the interior branch test and depth-first searches for sub-tree partitions. Times of most recent common ancestor (tMRCAs) of significant clusters were estimated using Bayesian MCMC analysis. Results Among 3618 patients registered in our network, 243 were infected with CRF01_AE. The majority of individuals with CRF01_AE were Japanese, predominantly male, and reported heterosexual contact as their risk factor. We found 5 large clusters with ≥5 members and 25 small clusters consisting of pairs of individuals with highly related CRF01_AE strains. The earliest cluster showed a tMRCA of 1996, and consisted of individuals whose reported risk factor was heterosexual contact. The other four large clusters showed later tMRCAs between 2000 and 2002, with members including intravenous drug users (IVDU) and non-Japanese, but not men who have sex with men (MSM). In contrast, small clusters included a high frequency of individuals reporting MSM risk factors. Phylogenetic analysis also showed that some individuals were infected with HIV strains circulating in East and South-eastern Asian countries. Conclusions Introduction of CRF01_AE viruses into Japan is estimated to have occurred in the 1990s. CRF01_AE spread via heterosexual behavior, then among persons connected with non-Japanese, IVDU, and MSM. Phylogenetic analysis demonstrated that some viral variants are largely restricted to Japan, while others have a broad geographic distribution. PMID:25025900
Salazar, Gerardo A.; Cabrera, Lidia I.; Madriñán, Santiago; Chase, Mark W.
2009-01-01
Background and Aims Phylogenetic relationships of subtribes Cranichidinae and Prescottiinae, two diverse groups of neotropical terrestrial orchids, are not satisfactorily understood. A previous molecular phylogenetic study supported monophyly for Cranichidinae, but Prescottiinae consisted of two clades not sister to one another. However, that analysis included only 11 species and eight genera of these subtribes. Here, plastid and nuclear DNA sequences are analysed for an enlarged sample of genera and species of Cranichidinae and Prescottiinae with the aim of clarifying their relationships, evaluating the phylogenetic position of the monospecific genera Exalaria, Ocampoa and Pseudocranichis and examining the value of various structural traits as taxonomic markers. Methods Approx. 6000 bp of nucleotide sequences from nuclear ribosomal (ITS) and plastid DNA (rbcL, matK-trnK and trnL-trnF) were analysed with cladistic parsimony and Bayesian inference for 45 species/14 genera of Cranichidinae and Prescottiinae (plus suitable outgroups). The utility of flower orientation, thickenings of velamen cell walls, hamular viscidium and pseudolabellum to mark clades recovered by the molecular analysis was assessed by tracing these characters on the molecular trees. Key Results Spiranthinae, Cranichidinae, paraphyletic Prescottia (with Pseudocranichis embedded), and a group of mainly Andean ‘prescottioid’ genera (the ‘Stenoptera clade’) were strongly supported. Relationships among these clades were unresolved by parsimony but the Bayesian tree provided moderately strong support for the resolution (Spiranthinae–(Stenoptera clade–(Prescottia/Pseudocranichis–Cranichidinae))). Three of the four structural characters mark clades on the molecular trees, but the possession of a pseudolabellum is variable in the polyphyletic Ponthieva. Conclusions No evidence was found for monophyly of Prescottiinae and the reinstatement of Cranichidinae s.l. (including the genera of ‘Prescottiinae’) is favoured. Cranichidinae s.l. are diagnosed by non-resupinate flowers. Lack of support from parsimony for relationships among the major clades of core spiranthids is suggestive of a rapid morphological radiation or a slow rate of molecular evolution. PMID:19136493
treespace: Statistical exploration of landscapes of phylogenetic trees.
Jombart, Thibaut; Kendall, Michelle; Almagro-Garcia, Jacob; Colijn, Caroline
2017-11-01
The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
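The first step of the approach described here, turning a set of trees into a matrix of pairwise tree distances, can be sketched in Python. This is only an illustration under stated assumptions: trees are rooted and written as nested tuples, the metric is a clade-based Robinson-Foulds count chosen for simplicity, and none of this reproduces the API of the R package treespace or its choice of metrics.

```python
from typing import FrozenSet, List, Set, Tuple, Union

Tree = Union[str, Tuple["Tree", ...]]  # leaf name or tuple of subtrees


def clade_set(tree: Tree) -> Set[FrozenSet[str]]:
    """Leaf sets defined by every internal node of a rooted tree."""
    found: Set[FrozenSet[str]] = set()

    def leaves(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    leaves(tree)
    return found


def rf_distance(a: Tree, b: Tree) -> int:
    """Number of clades present in exactly one of the two trees."""
    return len(clade_set(a) ^ clade_set(b))


def distance_matrix(trees: List[Tree]) -> List[List[int]]:
    """Symmetric matrix of pairwise distances, ready for ordination or clustering."""
    return [[rf_distance(s, t) for t in trees] for s in trees]


# Three hypothetical four-taxon trees.
t1 = ((("A", "B"), "C"), "D")
t2 = ((("A", "C"), "B"), "D")
t3 = (("A", "B"), ("C", "D"))
print(distance_matrix([t1, t2, t3]))  # [[0, 2, 2], [2, 0, 4], [2, 4, 0]]
```

A matrix like this is the kind of input that a multivariate method such as multidimensional scaling would then project into a few dimensions to reveal clusters of similar trees.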
Li, Min; Tian, Ying; Zhao, Ying; Bu, Wenjun
2012-01-01
Heteroptera, or true bugs, are the largest, most morphologically diverse and economically important group of insects with incomplete metamorphosis. However, the phylogenetic relationships within Heteroptera are still in dispute, and most previous studies were based on morphological characters or on a single gene (partial or whole 18S rDNA). Moreover, divergence time estimates for Heteroptera have so far relied entirely on the fossil record, and no studies have been performed on molecular divergence rates. Here, for the first time, we used maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) with multiple genes (18S rDNA, 28S rDNA, 16S rDNA and COI) to estimate phylogenetic relationships among the infraorders, and the Penalized Likelihood (r8s) and Bayesian (BEAST) molecular dating methods were employed to estimate divergence times of higher taxa of this suborder. Major results of the present study included: Nepomorpha was placed as the most basal clade in all six trees (MP trees, ML trees and Bayesian trees of nuclear gene data and four-gene combined data, respectively) with full support values. The sister-group relationship of Cimicomorpha and Pentatomomorpha was also strongly supported. Nepomorpha originated in the early Triassic and the other six infraorders originated in a very short period of time in the middle Triassic. Cimicomorpha and Pentatomomorpha underwent a radiation at family level in the Cretaceous, paralleling the proliferation of the flowering plants. Our results indicated that the higher-group radiations within hemimetabolous Heteroptera were simultaneous with those of holometabolous Coleoptera and Diptera, which took place in the Triassic. While the aquatic habitat was colonized by Nepomorpha already in the Triassic, the Gerromorpha independently adapted to the semi-aquatic habitat in the Early Jurassic. PMID:22384163
de Oliveira Bünger, Mariana; Fernanda Mazine, Fiorella; Forest, Félix; Leandro Bueno, Marcelo; Renato Stehmann, João; Lucas, Eve J.
2016-01-01
Background and Aims Eugenia sect. Phyllocalyx Nied. includes 14 species endemic to the Neotropics, mostly distributed in the Atlantic coastal forests of Brazil. Here the first comprehensive phylogenetic study of this group is presented, and this phylogeny is used as the basis to evaluate the recent infrageneric classification in Eugenia sensu lato (s.l.) to test the history of the evolution of traits in the group and test hypotheses associated with the history of this clade. Methods A total of 42 taxa were sampled, of which 14 were Eugenia sect. Phyllocalyx for one nuclear (ribosomal internal transcribed spacer) and four plastid markers (psbA-trnH, rpl16, trnL-rpl32 and trnQ-rps16). The relationships were reconstructed based on Bayesian analysis and maximum likelihood. Additionally, ancestral area analysis and modelling methods were used to estimate species dispersal, comparing historically climatic stable (refuges) and unstable areas. Key Results Maximum likelihood and Bayesian inferences indicate that Eugenia sect. Phyllocalyx is paraphyletic and the two clades recovered are characterized by combinations of morphological characters. Phylogenetic relationships support a link between Cerrado and south-eastern species and a difference in the composition of species from north-eastern and south-eastern Atlantic forest. Refugia and stable areas identified within unstable areas suggest that these areas were important to maintain diversity in the Atlantic forest biodiversity hotspot. Conclusion This study provides a robust phylogenetic framework to address important historical questions for Eugenia s.l. within an evolutionary context, supporting the need for better taxonomic study of one of the largest genera in the Neotropics. Furthermore, valuable insight is offered into diversification and biome shifts of plant species in the highly environmentally impacted Atlantic forest of South America. 
Evidence is presented that climate stability in the south-eastern Atlantic forest during the Quaternary contributed to the highest levels of plant diversity in this region that acted as a refugium. PMID:27974324
Pan-genome and phylogeny of Bacillus cereus sensu lato.
Bazinet, Adam L
2017-08-02
Bacillus cereus sensu lato (s. l.) is an ecologically diverse bacterial group of medical and agricultural significance. In this study, I use publicly available genomes and novel bioinformatic workflows to characterize the B. cereus s. l. pan-genome and perform the largest phylogenetic and population genetic analyses of this group to date in terms of the number of genes and taxa included. With these fundamental data in hand, I identify genes associated with particular phenotypic traits (i.e., "pan-GWAS" analysis), and quantify the degree to which taxa sharing common attributes are phylogenetically clustered. A rapid k-mer based approach (Mash) was used to create reduced representations of selected Bacillus genomes, and a fast distance-based phylogenetic analysis of this data (FastME) was performed to determine which species should be included in B. cereus s. l. The complete genomes of eight B. cereus s. l. species were annotated de novo with Prokka, and these annotations were used by Roary to produce the B. cereus s. l. pan-genome. Scoary was used to associate gene presence and absence patterns with various phenotypes. The orthologous protein sequence clusters produced by Roary were filtered and used to build HaMStR databases of gene models that were used in turn to construct phylogenetic data matrices. Phylogenetic analyses used RAxML, DendroPy, ClonalFrameML, PAUP*, and SplitsTree. Bayesian model-based population genetic analysis assigned taxa to clusters using hierBAPS. The genealogical sorting index was used to quantify the phylogenetic clustering of taxa sharing common attributes. The B. cereus s. l. pan-genome currently consists of ≈60,000 genes, ≈600 of which are "core" (common to at least 99% of taxa sampled). Pan-GWAS analysis revealed genes associated with phenotypes such as isolation source, oxygen requirement, and ability to cause diseases such as anthrax or food poisoning. 
Extensive phylogenetic analyses using an unprecedented amount of data produced phylogenies that were largely concordant with each other and with previous studies. Phylogenetic support as measured by bootstrap probabilities increased markedly when all suitable pan-genome data was included in phylogenetic analyses, as opposed to when only core genes were used. Bayesian population genetic analysis recommended subdividing the three major clades of B. cereus s. l. into nine clusters. Taxa sharing common traits and species designations exhibited varying degrees of phylogenetic clustering. All phylogenetic analyses recapitulated two previously used classification systems, and taxa were consistently assigned to the same major clade and group. By including accessory genes from the pan-genome in the phylogenetic analyses, I produced an exceptionally well-supported phylogeny of 114 complete B. cereus s. l. genomes. The best-performing methods were used to produce a phylogeny of all 498 publicly available B. cereus s. l. genomes, which was in turn used to compare three different classification systems and to test the monophyly status of various B. cereus s. l. species. The majority of the methodology used in this study is generic and could be leveraged to produce pan-genome estimates and similarly robust phylogenetic hypotheses for other bacterial groups.
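The "core" definition quoted in this abstract (genes common to at least 99% of taxa sampled) can be illustrated with a minimal sketch. This is not Roary's implementation; the function name `core_genes`, the toy genome labels and the gene names are all hypothetical, and the input is assumed to be a simple mapping from each gene to the set of genomes that carry it.

```python
from typing import Dict, Set


def core_genes(presence: Dict[str, Set[str]], n_taxa: int,
               threshold: float = 0.99) -> Set[str]:
    """Return the genes found in at least `threshold` of the sampled taxa.

    `presence` maps a gene name to the set of taxa in which it occurs
    (a hypothetical presence/absence table, not a real Roary output file).
    """
    return {
        gene
        for gene, taxa in presence.items()
        if len(taxa) / n_taxa >= threshold
    }


# Toy data: 100 hypothetical genomes and three hypothetical genes.
genomes = [f"genome_{i}" for i in range(100)]
presence = {
    "geneA": set(genomes),       # present in 100/100 genomes
    "geneB": set(genomes[:99]),  # present in 99/100 genomes
    "geneC": set(genomes[:50]),  # present in 50/100 genomes (accessory)
}

core = core_genes(presence, n_taxa=len(genomes))
accessory = set(presence) - core
print(sorted(core), sorted(accessory))
```

Under this threshold a gene missing from a single genome out of 100 still counts as core, which is the kind of tolerance that accommodates occasional assembly or annotation gaps.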
Bayesian Total-Evidence Dating Reveals the Recent Crown Radiation of Penguins
Heath, Tracy A.; Ksepka, Daniel T.; Stadler, Tanja; Welch, David; Drummond, Alexei J.
2017-01-01
The total-evidence approach to divergence time dating uses molecular and morphological data from extant and fossil species to infer phylogenetic relationships, species divergence times, and macroevolutionary parameters in a single coherent framework. Current model-based implementations of this approach lack an appropriate model for the tree describing the diversification and fossilization process and can produce estimates that lead to erroneous conclusions. We address this shortcoming by providing a total-evidence method implemented in a Bayesian framework. This approach uses a mechanistic tree prior to describe the underlying diversification process that generated the tree of extant and fossil taxa. Previous attempts to apply the total-evidence approach have used tree priors that do not account for the possibility that fossil samples may be direct ancestors of other samples, that is, ancestors of fossil or extant species or of clades. The fossilized birth–death (FBD) process explicitly models the diversification, fossilization, and sampling processes and naturally allows for sampled ancestors. This model was recently applied to estimate divergence times based on molecular data and fossil occurrence dates. We incorporate the FBD model and a model of morphological trait evolution into a Bayesian total-evidence approach to dating species phylogenies. We apply this method to extant and fossil penguins and show that the modern penguins radiated much more recently than has been previously estimated, with the basal divergence in the crown clade occurring at ~12.7 Ma and most splits leading to extant species occurring in the last 2 myr. 
Our results demonstrate that including stem-fossil diversity can greatly improve the estimates of the divergence times of crown taxa. The method is available in BEAST2 (version 2.4) software www.beast2.org with packages SA (version at least 1.1.4) and morph-models (version at least 1.0.4) installed. [Birth–death process; calibration; divergence times; MCMC; phylogenetics.] PMID:28173531
Phylotranscriptomic analysis of the origin and early diversification of land plants
Wickett, Norman J.; Mirarab, Siavash; Nguyen, Nam; Warnow, Tandy; Carpenter, Eric; Matasci, Naim; Ayyampalayam, Saravanaraj; Barker, Michael S.; Burleigh, J. Gordon; Gitzendanner, Matthew A.; Ruhfel, Brad R.; Wafula, Eric; Graham, Sean W.; Mathews, Sarah; Melkonian, Michael; Soltis, Douglas E.; Soltis, Pamela S.; Miles, Nicholas W.; Rothfels, Carl J.; Pokorny, Lisa; Shaw, A. Jonathan; DeGironimo, Lisa; Stevenson, Dennis W.; Surek, Barbara; Villarreal, Juan Carlos; Roure, Béatrice; Philippe, Hervé; dePamphilis, Claude W.; Chen, Tao; Deyholos, Michael K.; Baucom, Regina S.; Kutchan, Toni M.; Augustin, Megan M.; Wang, Jun; Zhang, Yong; Tian, Zhijian; Yan, Zhixiang; Wu, Xiaolei; Sun, Xiao; Wong, Gane Ka-Shu; Leebens-Mack, James
2014-01-01
Reconstructing the origin and evolution of land plants and their algal relatives is a fundamental problem in plant phylogenetics, and is essential for understanding how critical adaptations arose, including the embryo, vascular tissue, seeds, and flowers. Despite advances in molecular systematics, some hypotheses of relationships remain weakly resolved. Inferring deep phylogenies with bouts of rapid diversification can be problematic; however, genome-scale data should significantly increase the number of informative characters for analyses. Recent phylogenomic reconstructions focused on the major divergences of plants have resulted in promising but inconsistent results. One limitation is sparse taxon sampling, likely resulting from the difficulty and cost of data generation. To address this limitation, transcriptome data for 92 streptophyte taxa were generated and analyzed along with 11 published plant genome sequences. Phylogenetic reconstructions were conducted using up to 852 nuclear genes and 1,701,170 aligned sites. Sixty-nine analyses were performed to test the robustness of phylogenetic inferences to permutations of the data matrix or to phylogenetic method, including supermatrix, supertree, and coalescent-based approaches, maximum-likelihood and Bayesian methods, partitioned and unpartitioned analyses, and amino acid versus DNA alignments. Among other results, we find robust support for a sister-group relationship between land plants and one group of streptophyte green algae, the Zygnematophyceae. Strong and robust support for a clade comprising liverworts and mosses is inconsistent with a widely accepted view of early land plant evolution, and suggests that phylogenetic hypotheses used to understand the evolution of fundamental plant traits should be reevaluated. PMID:25355905
Baumsteiger, Jason; Kinziger, Andrew P; Aguilar, Andres
2012-12-01
The west coast of North America contains a number of biogeographic freshwater provinces which reflect an ever-changing aquatic landscape. Clues to understanding this complex structure are often encapsulated genetically in the ichthyofauna, though frequently as unresolved evolutionary relationships and putative cryptic species. Advances in molecular phylogenetics through species tree analyses now allow for improved exploration of these relationships. Using a comprehensive approach, we analyzed two mitochondrial and nine nuclear loci for a group of endemic freshwater fish (sculpin-Cottus) known for a wide ranging distribution and complex species structure in this region. Species delimitation techniques identified three novel cryptic lineages, all well supported by phylogenetic analyses. Comparative phylogenetic analyses consistently found five distinct clades reflecting a number of unique biogeographic provinces. Some internal node relationships varied by species tree reconstruction method, and were associated with either Bayesian or maximum likelihood statistical approaches or between mitochondrial, nuclear, and combined datasets. Limited cases of mitochondrial capture were also evident, suggestive of putative ancestral hybridization between species. Biogeographic diversification was associated with four major regions and revealed historical faunal exchanges across regions. Mapping of an important life-history character (amphidromy) revealed two separate instances of trait evolution, a transition that has occurred repeatedly in Cottus. This study demonstrates the power of current phylogenetic methods, the need for a comprehensive phylogenetic approach, and the potential for sculpin to serve as an indicator of biogeographic history for native ichthyofauna in the region. Copyright © 2012 Elsevier Inc. All rights reserved.
Domingos, Fabricius M C B; Colli, Guarino R; Lemmon, Alan; Lemmon, Emily Moriarty; Beheregaray, Luciano B
2017-02-01
The recognition of cryptic diversity within geographically widespread species is gradually becoming a trend in the highly speciose Neotropical biomes. The statistical methods to recognise such cryptic lineages are rapidly advancing, but have rarely been applied to genomic-scale datasets. Herein, we used phylogenomic data to investigate phylogenetic history and cryptic diversity within Tropidurus itambere, a lizard endemic to the Cerrado biodiversity hotspot. We applied a series of phylogenetic methods to reconstruct evolutionary relationships and a coalescent Bayesian species delimitation approach (BPP) to clarify species limits. The BPP results suggest that the widespread nominal taxon comprises a complex of 5 highly supported and geographically structured cryptic species. We highlight and discuss the different topological patterns recovered by concatenated and coalescent species tree methods for these closely related lineages. Finally, we suggest that the existence of cryptic lineages in the Cerrado is much more common than traditionally thought, highlighting the value of using NGS data and coalescent techniques to investigate patterns of species diversity. Copyright © 2016 Elsevier Inc. All rights reserved.
Dowie, Nicholas J; Grubisha, Lisa C; Burton, Brent A; Klooster, Matthew R; Miller, Steven L
2017-01-01
Rhizopogon species are ecologically significant ectomycorrhizal fungi in conifer ecosystems. The importance of this system merits the development and utilization of a more robust set of molecular markers specifically designed to evaluate their evolutionary ecology. Anonymous nuclear loci (ANL) were developed for R. subgenus Amylopogon. Members of this subgenus occur throughout the United States and are exclusive fungal symbionts associated with Pterospora andromedea, a threatened mycoheterotrophic plant endemic to disjunct eastern and western regions of North America. Candidate ANL were developed from 454 shotgun pyrosequencing and assessed for positive amplification across targeted species, sequencing success, and recovery of phylogenetically informative sites. Ten ANL were successfully developed and were subsequently used to sequence representative taxa, herbaria holotype and paratype specimens in R. subgenus Amylopogon. Phylogenetic reconstructions were performed on individual and concatenated data sets by Bayesian inference and maximum likelihood methods. Phylogenetic analyses of these 10 ANL were compared with a phylogeny traditionally constructed using the universal fungal barcode nuc rDNA ITS1-5.8S-ITS2 region (ITS). The resulting ANL phylogeny was consistent with most of the species designations delineated by ITS. However, the ANL phylogeny provided much greater phylogenetic resolution, yielding new evidence for cryptic species within previously defined species of R. subgenus Amylopogon. Additionally, the rooted ANL phylogeny provided an alternate topology to the ITS phylogeny, which inferred a novel set of evolutionary relationships not identified in prior phylogenetic studies.
Phylogenetic evidence for cladogenetic polyploidization in land plants.
Zhan, Shing H; Drori, Michal; Goldberg, Emma E; Otto, Sarah P; Mayrose, Itay
2016-07-01
Polyploidization is a common and recurring phenomenon in plants and is often thought to be a mechanism of "instant speciation". Whether polyploidization is associated with the formation of new species (cladogenesis) or simply occurs over time within a lineage (anagenesis), however, has never been assessed systematically. We tested this hypothesis using phylogenetic and karyotypic information from 235 plant genera (mostly angiosperms). We first constructed a large database of combined sequence and chromosome number data sets using an automated procedure. We then applied likelihood models (ClaSSE) that estimate the degree of synchronization between polyploidization and speciation events in maximum likelihood and Bayesian frameworks. Our maximum likelihood analysis indicated that 35 genera supported a model that includes cladogenetic transitions over a model with only anagenetic transitions, whereas three genera supported a model that incorporates anagenetic transitions over one with only cladogenetic transitions. Furthermore, the Bayesian analysis supported a preponderance of cladogenetic change in four genera but did not support a preponderance of anagenetic change in any genus. Overall, these phylogenetic analyses provide the first broad confirmation that polyploidization is temporally associated with speciation events, suggesting that it is indeed a major speciation mechanism in plants, at least in some genera. © 2016 Botanical Society of America.
Salvi, Daniele; Macali, Armando; Mariottini, Paolo
2014-01-01
The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aided sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oysters and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663
Origin of microbial biomineralization and magnetotaxis during the Archean.
Lin, Wei; Paterson, Greig A; Zhu, Qiyun; Wang, Yinzhao; Kopylova, Evguenia; Li, Ying; Knight, Rob; Bazylinski, Dennis A; Zhu, Rixiang; Kirschvink, Joseph L; Pan, Yongxin
2017-02-28
Microbes that synthesize minerals, a process known as microbial biomineralization, contributed substantially to the evolution of current planetary environments through numerous important geochemical processes. Despite its geological significance, the origin and evolution of microbial biomineralization remain poorly understood. Through combined metagenomic and phylogenetic analyses of deep-branching magnetotactic bacteria from the Nitrospirae phylum, and using a Bayesian molecular clock-dating method, we show here that the gene cluster responsible for biomineralization of magnetosomes, and the arrangement of magnetosome chain(s) within cells, both originated before or near the Archean divergence between the Nitrospirae and Proteobacteria. This phylogenetic divergence occurred well before the Great Oxygenation Event. Magnetotaxis likely evolved due to environmental pressures conferring an evolutionary advantage to navigation via the geomagnetic field. Earth's dynamo must therefore have been sufficiently strong to sustain microbial magnetotaxis in the Archean, suggesting that magnetotaxis coevolved with the geodynamo over geological time.
Seifali, Mahvash; Arshad, Aziz; Moghaddam, Faezeh Yazdani; Esmaeili, Hamid Reza; Kiabi, Bahram H.; Daud, Siti Khalijah; Aliabadian, Mansour
2012-01-01
Background Knowledge about Alburnoides remains lacking relative to many other species, resulting in a lack of a systematic position and taxonomic diagnosis. Basic biological information for Alburnoides has been constructed, and it is necessary to understand further and obtain more information about this species. Its phylogenetic relationships are still debated and no molecular data have been used to study this taxon in Iran. A holistic approach for genetic methods was adopted to analyze possible spirlin population differences at selected centers in the south Caspian Sea basin of Iran. Methods The phylogenetic relationships were determined based on 774 base pairs of the mitochondrial cytochrome b gene of 32 specimens of spirlin from nine locations in the south Caspian Sea drainage basin of Iran. The nucleotide sequences were subjected to phylogenetic analysis using the neighbor-joining, maximum parsimony, maximum likelihood, and Bayesian methods. Results The mitochondrial gene tree largely supports the existence of three major clades. The western populations (clade I) may be considered as Alburnoides eichwaldii, whereas the Talar river populations (clade II) are represented as Alburnoides sp.1 and the eastern populations (clade III) may be distinct taxa of Alburnoides sp.2. Conclusion This molecular evidence supports the hypothesis that A. bipunctatus does not exist in the south Caspian Sea basin of Iran, and that the western and eastern populations are distinct taxa. PMID:22654487
Haklová, B; Majláthová, V; Majláth, I; Harris, D J; Petrilla, V; Litschka-Koen, T; Oros, M; Peťko, B
2014-03-01
The blood parasites from the genus Hepatozoon Miller, 1908 (Apicomplexa: Adeleida: Hepatozoidae) represent the most common intracellular protozoan parasites found in snakes. In the present study, we examined 209 individual snakes from different zoogeographical regions (Africa, America, Asia and Europe) for the occurrence of blood parasites using both molecular and microscopic examination methods, and assessed phylogenetic relationships of all Hepatozoon parasites from snakes for the first time. In total, 178 blood smears obtained from 209 individuals, representing 40 species, were examined, from which Hepatozoon unicellular parasites were found in 26 samples (14·6% prevalence). Out of 180 samples tested by polymerase chain reaction (PCR), the presence of parasites was observed in 21 individuals (prevalence 11·6%): 14 snakes from Africa belonging to six genera (Dendroaspis, Dispholidus, Mehelya, Naja, Philothamnus and Python), five snakes from Asia from the genus Morelia and two snakes from America, from two genera (Coluber and Corallus). The intensity of infection varied from one to 1433 infected cells per 10 000 erythrocytes. Results of phylogenetic analyses (Bayesian and Maximum Likelihood) revealed the existence of five haplotypes divided into four main lineages. The present data also indicate neither a geographical pattern among the studied Hepatozoon sp. nor congruence in host association.
Moody, Michael L; Rieseberg, Loren H
2012-07-01
The annual sunflowers (Helianthus sect. Helianthus) present a formidable challenge for phylogenetic inference because of ancient hybrid speciation, recent introgression, and suspected issues with deep coalescence. Here we analyze sequence data from 11 nuclear DNA (nDNA) genes for multiple genotypes of species within the section to (1) reconstruct the phylogeny of this group, (2) explore the utility of nDNA gene trees for detecting hybrid speciation and introgression; and (3) test an empirical method of hybrid identification based on the phylogenetic congruence of nDNA gene trees from tightly linked genes. We uncovered considerable topological heterogeneity among gene trees with or without three previously identified hybrid species included in the analyses, as well as a general lack of reciprocal monophyly of species. Nonetheless, partitioned Bayesian analyses provided strong support for the reciprocal monophyly of all species except H. annuus (0.89 PP), the most widespread and abundant annual sunflower. Previous hypotheses of relationships among taxa were generally strongly supported (1.0 PP), except among taxa typically associated with H. annuus, apparently due to the paraphyly of the latter in all gene trees. While the individual nDNA gene trees provided a useful means for detecting recent hybridization, identification of ancient hybridization was problematic for all ancient hybrid species, even when linkage was considered. We discuss biological factors that affect the efficacy of phylogenetic methods for hybrid identification.
Phylogenetic Analysis of Dengue Virus in Bangkalan, Madura Island, East Java Province, Indonesia.
Sucipto, Teguh Hari; Kotaki, Tomohiro; Mulyatno, Kris Cahyo; Churrotin, Siti; Labiqah, Amaliah; Soegijanto, Soegeng; Kameoka, Masanori
2018-01-01
Dengue virus (DENV) infection is a major health issue in tropical and subtropical areas. Indonesia is one of the biggest dengue endemic countries in the world. In the present study, the phylogenetic analysis of DENV in Bangkalan, Madura Island, Indonesia, was performed in order to obtain a clearer understanding of its dynamics in this country. A total of 359 blood samples from dengue-suspected patients were collected between 2012 and 2014. Serotyping was conducted using a multiplex Reverse Transcriptase-Polymerase Chain Reaction and a phylogenetic analysis of E gene sequences was performed using the Bayesian Markov chain Monte Carlo (MCMC) method. Seventeen of the 359 blood samples (4.7%) were positive for the isolation of DENV. Serotyping and the phylogenetic analysis revealed the predominance of DENV-1 genotype I (9/17, 52.9%), followed by DENV-2 Cosmopolitan type (7/17, 41.2%) and DENV-3 genotype I (1/17, 5.9%). DENV-4 was not isolated. The Madura Island isolates showed high nucleotide similarity to other Indonesian isolates, indicating frequent virus circulation in Indonesia. The results of the present study highlight the importance of continuous viral surveillance in dengue endemic areas in order to obtain a clearer understanding of the dynamics of DENV in Indonesia.
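The percentages reported in this abstract follow directly from the stated counts. The short check below is only a sketch of that arithmetic; the variable names and the `percent` helper are illustrative, and the counts are taken from the abstract itself.

```python
# Counts as stated in the abstract.
samples = 359      # blood samples from dengue-suspected patients
positives = 17     # samples positive for DENV isolation
serotype_counts = {
    "DENV-1 genotype I": 9,
    "DENV-2 Cosmopolitan": 7,
    "DENV-3 genotype I": 1,
}


def percent(part: int, whole: int) -> float:
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Share of all samples that yielded a DENV isolate.
isolation_rate = percent(positives, samples)

# Share of each serotype/genotype among the positive isolates.
shares = {name: percent(n, positives) for name, n in serotype_counts.items()}

print(isolation_rate, shares)
```

The serotype counts sum to the 17 positive isolates, so the three shares cover all isolates.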
Johnson, Leigh A; Chan, Lauren M; Weese, Terri L; Busby, Lisa D; McMurry, Samuel
2008-09-01
Members of the phlox family (Polemoniaceae) serve as useful models for studying various evolutionary and biological processes. Despite its biological importance, no family-wide phylogenetic estimate based on multiple DNA regions with complete generic sampling is available. Here, we analyze one nuclear and five chloroplast DNA sequence regions (nuclear ITS, chloroplast matK, trnL intron plus trnL-trnF intergenic spacer, and the trnS-trnG, trnD-trnT, and psbM-trnD intergenic spacers) using parsimony and Bayesian methods, as well as assessments of congruence and long branch attraction, to explore phylogenetic relationships among 84 ingroup species representing all currently recognized Polemoniaceae genera. Relationships inferred from the ITS and concatenated chloroplast regions are similar overall. A combined analysis provides strong support for the monophyly of Polemoniaceae and subfamilies Acanthogilioideae, Cobaeoideae, and Polemonioideae. Relationships among subfamilies, and thus the precise root of Polemoniaceae, remain poorly supported. Within the largest subfamily, Polemonioideae, four clades corresponding to tribes Polemonieae, Phlocideae, Gilieae, and Loeselieae receive strong support. The monogeneric Polemonieae appears sister to Phlocideae. Relationships within Polemonieae, Phlocideae, and Gilieae are mostly consistent between analyses and data permutations. Many relationships within Loeselieae remain uncertain. Overall, inferred phylogenetic relationships support a higher-level classification for Polemoniaceae proposed in 2000.
Yuan, Le-Yang; Liu, Xiao-Xiang; Zhang, E
2015-12-21
Sequences from the mitochondrial control region of 14 putative species of Acrossocheilus (Cyprinidae) were examined to elucidate phylogenetic relationships within species of the barred group in that genus. Phylogenetic reconstructions were generated using three tree-building methods: maximum parsimony, maximum likelihood, and Bayesian inference. The resultant phylogenies were consistent with monophyly of the majority of the morphologically recognized species. However, mitochondrial DNA sequence evidence is incongruent with monophyly of A. fasciatus, as currently conceived. This species occurs only in the upper Qiantang-Jiang basin in Zhejiang and Anhui provinces, and coastal rivers in the Zhejiang Province. The species formerly recognized as A. paradoxus from Zhejiang Province is A. fasciatus. The specimens previously reported as A. fasciatus from river basins in Fujian Province are misidentified A. wuyiensis. The barred group of Acrossocheilus is shown to be polyphyletic. Acrossocheilus is restricted to the barred species here placed in "Clade II," containing A. paradoxus and relatives. Separate generic status is recommended for A. monticola and for A. longipinnis and their closest relatives, although more information on phylogenetic relationships based on multiple genes is required to develop robust phylogenetic hypotheses and diagnoses. Masticbarbus Tang, 1942 is available for A. longipinnis and three allied species (A. iridescens, A. microstomus and A. lamus).
Li, Xinnian; Duke, Norman C.; Yang, Yuchen; Huang, Lishi; Zhu, Yuxiang; Zhang, Zhang; Zhou, Renchao; Zhong, Cairong; Huang, Yelin; Shi, Suhua
2016-01-01
Avicennia L. (Avicenniaceae), one of the most diverse mangrove genera, is distributed widely in tropical and subtropical intertidal zones worldwide. Five species of Avicennia in the Indo-West Pacific region have been previously described. However, their phylogenetic relationships were determined based on morphological and allozyme data. To enhance our understanding of evolutionary patterns in the clade, we carried out a molecular phylogenetic study using wide sampling and multiple loci. Our results support two monophyletic clades across all species worldwide in Avicennia: an Atlantic-East Pacific (AEP) lineage and an Indo-West Pacific (IWP) lineage. This split is in line with the biogeographic distribution of the clade. Focusing on the IWP branch, we reconstructed a detailed phylogenetic tree based on sequences from 25 nuclear genes. The results identified three distinct subclades, (1) A. rumphiana and A. alba, (2) A. officinalis and A. integra, and (3) the A. marina complex, with high bootstrap support. The results strongly corresponded to two morphological traits in floral structure: stigma position in relation to the anthers and style length. Using Bayesian dating methods, we estimated that diversification of the IWP lineage dates to the late Miocene (c. 6.0 million years ago) and may have been driven largely by fluctuating sea levels since that time. PMID:27716800
Zeng, Xu; Yuan, Zhengrong; Tong, Xin; Li, Qiushi; Gao, Weiwei; Qin, Minjian; Liu, Zhihua
2012-05-01
Oryzoideae (Poaceae) plants have economic and ecological value. However, the phylogenetic position of some taxa, such as Hygroryza aristata (Retz.) Nees and Porteresia coarctata (Roxb.) Tateoka (syn. Oryza coarctata), is not clear. Comprehensive molecular phylogenetic studies have been carried out on many genera in the Poaceae, and nuclear and chloroplast DNA sequences have been employed extensively to determine relationships at both higher and lower taxonomic levels. Here, the chloroplast ndhF gene and atpB-rbcL spacer were used to construct phylogenetic trees and estimate the divergence times of Oryzoideae, Bambusoideae, Panicoideae, Pooideae and other subfamilies. Complete sequences of atpB-rbcL and ndhF were generated for 17 species representing six species of the Oryzoideae and related subfamilies. Nicotiana tabacum L. was the outgroup species. The two DNA datasets were analyzed using maximum parsimony and Bayesian methods. The molecular phylogeny revealed that H. aristata was sister to Chikusichloa aquatica Koidz. and that P. coarctata fell within the genus Oryza. Furthermore, a divergence-time analysis based on the ndhF marker indicated that Oryzoideae may have originated about 31 million years ago.
Independent contrasts and PGLS regression estimators are equivalent.
Blomberg, Simon P; Lefevre, James G; Wells, Jessie A; Waterhouse, Mary
2012-05-01
We prove that the slope parameter of the ordinary least squares regression of phylogenetically independent contrasts (PICs) conducted through the origin is identical to the slope parameter of generalized least squares (GLS) regression under a Brownian motion model of evolution. This equivalence has several implications: 1. Understanding the structure of the linear model for GLS regression provides insight into when and why phylogeny is important in comparative studies. 2. The limitations of the PIC regression analysis are the same as the limitations of the GLS model. In particular, phylogenetic covariance applies only to the response variable in the regression and the explanatory variable should be regarded as fixed. Calculation of PICs for explanatory variables should be treated as a mathematical idiosyncrasy of the PIC regression algorithm. 3. Since the GLS estimator is the best linear unbiased estimator (BLUE), the slope parameter estimated using PICs is also BLUE. 4. If the slope is estimated using different branch lengths for the explanatory and response variables in the PIC algorithm, the estimator is no longer the BLUE, so this is not recommended. Finally, we discuss whether and how to accommodate phylogenetic covariance in regression analyses, particularly in relation to the problem of phylogenetic uncertainty. This discussion is from both frequentist and Bayesian perspectives.
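The equivalence stated in this abstract can be checked numerically on a toy case. The sketch below is not the authors' code: the four-taxon tree, its branch lengths, and the trait values are made up for illustration, and the function names (`pic`, `fill_cov`, `pic_slope`, `gls_slope`) are hypothetical. It computes Felsenstein-style standardized contrasts, regresses them through the origin, and compares that slope with the GLS slope obtained from the Brownian-motion covariance matrix of the same tree.

```python
import numpy as np

# Hypothetical rooted tree ((A:1,B:2):1.5,(C:0.5,D:1):3). A node is a pair
# (leaf_name or [left_child, right_child], branch_length_to_parent).
TREE = ([([("A", 1.0), ("B", 2.0)], 1.5),
         ([("C", 0.5), ("D", 1.0)], 3.0)], 0.0)
NAMES = ["A", "B", "C", "D"]
# Made-up trait values, for illustration only.
X = {"A": 1.0, "B": 2.5, "C": 4.0, "D": 3.0}
Y = {"A": 0.5, "B": 2.0, "C": 5.5, "D": 3.5}


def pic(node, trait, out):
    """Post-order pass: append standardized contrasts to `out` and return
    (estimated node value, branch length adjusted for estimation error)."""
    content, length = node
    if isinstance(content, str):          # leaf
        return trait[content], length
    (x1, v1), (x2, v2) = (pic(child, trait, out) for child in content)
    out.append((x1 - x2) / np.sqrt(v1 + v2))
    value = (x1 / v1 + x2 / v2) / (1 / v1 + 1 / v2)
    return value, length + v1 * v2 / (v1 + v2)


def pic_slope(tree, x, y):
    """OLS slope of y-contrasts on x-contrasts, forced through the origin."""
    cx, cy = [], []
    pic(tree, x, cx)
    pic(tree, y, cy)
    cx, cy = np.array(cx), np.array(cy)
    return float(cx @ cy / (cx @ cx))


def fill_cov(node, index, cov, depth=0.0):
    """Fill the Brownian-motion covariance: shared root-to-ancestor path length."""
    content, length = node
    depth += length
    if isinstance(content, str):
        i = index[content]
        cov[i, i] = depth
        return [i]
    left = fill_cov(content[0], index, cov, depth)
    right = fill_cov(content[1], index, cov, depth)
    for i in left:
        for j in right:
            cov[i, j] = cov[j, i] = depth
    return left + right


def gls_slope(tree, names, x, y):
    """GLS slope (with intercept) under the tree's Brownian-motion covariance."""
    cov = np.zeros((len(names), len(names)))
    fill_cov(tree, {name: i for i, name in enumerate(names)}, cov)
    design = np.column_stack([np.ones(len(names)), [x[n] for n in names]])
    response = np.array([y[n] for n in names])
    cov_inv = np.linalg.inv(cov)
    beta = np.linalg.solve(design.T @ cov_inv @ design,
                           design.T @ cov_inv @ response)
    return float(beta[1])


if __name__ == "__main__":
    print("PIC slope:", pic_slope(TREE, X, Y))
    print("GLS slope:", gls_slope(TREE, NAMES, X, Y))
```

Both calls use the same branch lengths for the explanatory and response variables, which is the condition under which the abstract says the PIC slope remains the best linear unbiased estimator.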
Salas-Leiva, Dayana E.; Meerow, Alan W.; Calonje, Michael; Griffith, M. Patrick; Francisco-Ortega, Javier; Nakamura, Kyoko; Stevenson, Dennis W.; Lewis, Carl E.; Namoff, Sandra
2013-01-01
Background and aims Despite a recent new classification, a stable phylogeny for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study, five single-copy nuclear genes (SCNGs) are applied to the phylogeny of the order Cycadales. The specific aim is to evaluate several gene tree–species tree reconciliation approaches for developing an accurate phylogeny of the order, to contrast them with concatenated parsimony analysis and to resolve the erstwhile problematic phylogenetic position of these three genera. Methods DNA sequences of five SCNGs were obtained for 20 cycad species representing all ten genera of Cycadales. These were analysed with parsimony, maximum likelihood (ML) and three Bayesian methods of gene tree–species tree reconciliation, using Cycas as the outgroup. A calibrated date estimation was developed with Bayesian methods, and biogeographic analysis was also conducted. Key Results Concatenated parsimony, ML and three species tree inference methods resolve exactly the same tree topology with high support at most nodes. Dioon and Bowenia are the first and second branches of Cycadales after Cycas, respectively, followed by an encephalartoid clade (Macrozamia–Lepidozamia–Encephalartos), which is sister to a zamioid clade, of which Ceratozamia is the first branch, and in which Stangeria is sister to Microcycas and Zamia. Conclusions A single, well-supported phylogenetic hypothesis of the generic relationships of the Cycadales is presented. However, massive extinction events inferred from the fossil record that eliminated broader ancestral distributions within Zamiaceae compromise accurate optimization of ancestral biogeographical areas for that hypothesis. While major lineages of Cycadales are ancient, crown ages of all modern genera are no older than 12 million years, supporting a recent hypothesis of mostly Miocene radiations. 
This phylogeny can contribute to an accurate infrafamilial classification of Zamiaceae. PMID:23997230
Winterton, Shaun L; Wiegmann, Brian M; Schlinger, Evert I
2007-06-01
The first formal analysis of phylogenetic relationships among small-headed flies (Acroceridae) is presented based on DNA sequence data from two ribosomal (16S and 28S) and two protein-encoding genes: the carbamoylphosphate synthase (CPS) domain of CAD (i.e., rudimentary locus) and cytochrome oxidase I (COI). DNA sequences from 40 species in 22 genera of Acroceridae (representing all three subfamilies) were compared with outgroup exemplars from Nemestrinidae, Stratiomyidae, Tabanidae, and Xylophagidae. Parsimony and Bayesian simultaneous analyses of the full data set recover a well-resolved and strongly supported hypothesis of phylogenetic relationships for major lineages within the family. Molecular evidence supports the monophyly of traditionally recognised subfamilies Philopotinae and Panopinae, but Acrocerinae are polyphyletic. Panopinae, sometimes considered "primitive" based on morphology and host-use, are always placed in a more derived position in the current study. Furthermore, these data support emerging morphological evidence that the type genus Acrocera Meigen, and its sister genus Sphaerops, are atypical acrocerids, comprising a sister lineage to all other Acroceridae. Based on the phylogeny generated in the simultaneous analysis, historical divergence times were estimated using Bayesian methodology constrained with fossil data. These estimates indicate Acroceridae likely evolved during the late Triassic but did not diversify greatly until the Cretaceous.
Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.
2011-01-01
The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275
Seven new dolphin mitochondrial genomes and a time-calibrated phylogeny of whales
Xiong, Ye; Brandley, Matthew C; Xu, Shixia; Zhou, Kaiya; Yang, Guang
2009-01-01
Background The phylogeny of Cetacea (whales) is not fully resolved with substantial support. The ambiguous and conflicting results of multiple phylogenetic studies may be the result of the use of too little data, phylogenetic methods that do not adequately capture the complex nature of DNA evolution, or both. In addition, there is also evidence that the generic taxonomy of Delphinidae (dolphins) underestimates its diversity. To remedy these problems, we sequenced the complete mitochondrial genomes of seven dolphins and analyzed these data with partitioned Bayesian analyses. Moreover, we incorporate a newly-developed "relaxed" molecular clock to model heterogeneous rates of evolution among cetacean lineages. Results The "deep" phylogenetic relationships are well supported including the monophyly of Cetacea and Odontoceti. However, there is ambiguity in the phylogenetic affinities of two of the river dolphin clades Platanistidae (Indian River dolphins) and Lipotidae (Yangtze River dolphins). The phylogenetic analyses support a sister relationship between Delphinidae and Monodontidae + Phocoenidae. Additionally, there is statistically significant support for the paraphyly of Tursiops (bottlenose dolphins) and Stenella (spotted dolphins). Conclusion Our phylogenetic analysis of complete mitochondrial genomes using recently developed models of rate autocorrelation resolved the phylogenetic relationships of the major Cetacean lineages with a high degree of confidence. Our results indicate that a rapid radiation of lineages explains the lack of support for the placement of Platanistidae and Lipotidae. Moreover, our estimation of molecular divergence dates indicates that these radiations occurred in the Middle to Late Oligocene and Middle Miocene, respectively. 
Furthermore, by collecting and analyzing seven new mitochondrial genomes, we provide strong evidence that the delphinid genera Tursiops and Stenella are not monophyletic, and the current taxonomy masks potentially interesting patterns of morphological, physiological, behavioral, and ecological evolution. PMID:19166626
Knowles, Lacey L; Klimov, Pavel B
2011-11-01
With the increased availability of multilocus sequence data, the lack of concordance of gene trees estimated for independent loci has focused attention on both the biological processes producing the discord and the methodologies used to estimate phylogenetic relationships. What has emerged is a suite of new analytical tools for phylogenetic inference: species tree approaches. In contrast to traditional phylogenetic methods that are stymied by the idiosyncrasies of gene trees, approaches for estimating species trees explicitly take into account the cause of discord among loci and, in the process, provide a direct estimate of phylogenetic history (i.e. the history of species divergence, not divergence of specific loci). We illustrate the utility of species tree estimates with an analysis of a diverse group of feather mites, the pinnatus species group (genus Proctophyllodes). Discord among four sequenced nuclear loci is consistent with theoretical expectations, given the short time separating speciation events (as evidenced by short internodes relative to terminal branch lengths in the trees). Nevertheless, many of the relationships are well resolved in a Bayesian estimate of the species tree; the analysis also highlights ambiguous aspects of the phylogeny that require additional loci. The broad utility of species tree approaches is discussed, and specifically, their application to groups with high speciation rates, a history of diversification with particular prevalence in host/parasite systems where species interactions can drive rapid diversification.
Phylogenetically marking the limits of the genus Fusarium for post-Article 59 usage
USDA-ARS's Scientific Manuscript database
Fusarium (Hypocreales, Nectriaceae) is one of the most important and systematically challenging groups of mycotoxigenic, plant pathogenic, and human pathogenic fungi. We conducted maximum likelihood (ML), maximum parsimony (MP) and Bayesian (B) analyses on partial nucleotide sequences of genes encod...
The phylogeny and evolutionary history of tyrannosauroid dinosaurs
Brusatte, Stephen L.; Carr, Thomas D.
2016-01-01
Tyrannosauroids, the group of carnivores including Tyrannosaurus rex, are some of the most familiar dinosaurs of all. A surge of recent discoveries has helped clarify some aspects of their evolution, but competing phylogenetic hypotheses raise questions about their relationships, biogeography, and fossil record quality. We present a new phylogenetic dataset, which merges published datasets and incorporates recently discovered taxa. We analyze it with parsimony and, for the first time for a tyrannosauroid dataset, Bayesian techniques. The parsimony and Bayesian results are highly congruent, and provide a framework for interpreting the biogeography and evolutionary history of tyrannosauroids. Our phylogenies illustrate that the body plan of the colossal species evolved piecemeal, imply no clear division between northern and southern species in western North America as had been argued, and suggest that T. rex may have been an Asian migrant to North America. Over-reliance on cranial shape characters may explain why published parsimony studies have diverged, and filling three major gaps in the fossil record holds the most promise for future work. PMID:26830019
BEASTling: A software tool for linguistic phylogenetics using BEAST 2
Maurits, Luke; Forkel, Robert; Kaiping, Gereon A.; Atkinson, Quentin D.
2017-01-01
We present a new open source software tool called BEASTling, designed to simplify the preparation of Bayesian phylogenetic analyses of linguistic data using the BEAST 2 platform. BEASTling transforms comparatively short and human-readable configuration files into the XML files used by BEAST to specify analyses. By taking advantage of Creative Commons-licensed data from the Glottolog language catalog, BEASTling allows the user to conveniently filter datasets using names for recognised language families, to impose monophyly constraints so that inferred language trees are backward compatible with Glottolog classifications, or to assign geographic location data to languages for phylogeographic analyses. Support for the emerging cross-linguistic linked data format (CLDF) permits easy incorporation of data published in cross-linguistic linked databases into analyses. BEASTling is intended to make the power of Bayesian analysis more accessible to historical linguists without strong programming backgrounds, in the hopes of encouraging communication and collaboration between those developing computational models of language evolution (who are typically not linguists) and relevant domain experts. PMID:28796784
Mahardika, G N K; Dibia, N; Budayanti, N S; Susilawathi, N M; Subrata, K; Darwinata, A E; Wignall, F S; Richt, J A; Valdivia-Granda, W A; Sudewi, A A R
2014-06-01
The emergence of human and animal rabies in Bali since November 2008 has attracted local, national and international interest. The potential origin and time of introduction of rabies virus to Bali is described. The nucleoprotein (N) gene of rabies virus from dog brain and human clinical specimens was sequenced using an automated DNA sequencer. Phylogenetic inference with Bayesian Markov Chain Monte Carlo (MCMC) analysis using the Bayesian Evolutionary Analysis by Sampling Trees (BEAST) v. 1.7.5 software confirmed that the outbreak of rabies in Bali was caused by an Indonesian lineage virus following a single introduction. The ancestor of Bali viruses was the descendant of a virus from Kalimantan. Contact tracing showed that the event most likely occurred in early 2008. The introduction of rabies into a large unvaccinated dog population in Bali clearly demonstrates the risk of disease transmission for government agencies and should lead to an increased preparedness and efforts for sustained risk reduction to prevent such events from occurring in future.
Phylogenetic relationships, diversification and expansion of chili peppers (Capsicum, Solanaceae).
Carrizo García, Carolina; Barfuss, Michael H J; Sehr, Eva M; Barboza, Gloria E; Samuel, Rosabelle; Moscone, Eduardo A; Ehrendorfer, Friedrich
2016-07-01
Capsicum (Solanaceae), native to the tropical and temperate Americas, comprises the well-known sweet and hot chili peppers and several wild species. So far, only partial taxonomic and phylogenetic analyses have been done for the genus. Here, the phylogenetic relationships between nearly all taxa of Capsicum were explored to test the monophyly of the genus and to obtain a better knowledge of species relationships, diversification and expansion. Thirty-four of approximately 35 Capsicum species were sampled. Maximum parsimony and Bayesian inference analyses were performed using two plastid markers (matK and psbA-trnH) and one single-copy nuclear gene (waxy). The evolutionary changes of nine key features were reconstructed following the parsimony ancestral states method. Ancestral areas were reconstructed through a Bayesian Markov chain Monte Carlo analysis. Capsicum forms a monophyletic clade, with Lycianthes as a sister group, following both phylogenetic approaches. Eleven well-supported clades (four of them monotypic) can be recognized within Capsicum, although some interspecific relationships need further analysis. A few features are useful to characterize different clades (e.g. fruit anatomy, chromosome base number), whereas some others are highly homoplastic (e.g. seed colour). The origin of Capsicum is postulated in an area along the Andes of western to north-western South America. The expansion of the genus has followed a clockwise direction around the Amazon basin, towards central and south-eastern Brazil, then back to western South America, and finally northwards to Central America. New insights are provided regarding interspecific relationships, character evolution, and geographical origin and expansion of Capsicum. A clearly distinct early-diverging clade can be distinguished, centred in western-north-western South America. Subsequent rapid speciation has led to the origin of the remaining clades. 
The diversification of Capsicum has culminated in the origin of the main cultivated species in several regions of South to Central America.
Ling, Cheng; Hamada, Tsuyoshi; Gao, Jingyang; Zhao, Guoguang; Sun, Donghong; Shi, Weifeng
2016-01-01
MrBayes is a widely used phylogenetic inference tool harnessing empirical evolutionary models and Bayesian statistics. However, likelihood estimation is computationally very expensive, resulting in undesirably long execution times. Although a number of multi-threaded optimizations have been proposed to speed up MrBayes, there are bottlenecks that severely limit the GPU thread-level parallelism of likelihood estimations. This study proposes a high performance and resource-efficient method for GPU-oriented parallelization of likelihood estimations. Instead of having to rely on empirical programming, the proposed novel decomposition storage model implements high performance data transfers implicitly. In terms of performance improvement, a speedup factor of up to 178 can be achieved on the analysis of simulated datasets by four Tesla K40 cards. In comparison to the other publicly available GPU-oriented MrBayes implementations, the tgMC³++ method (proposed herein) outperforms the tgMC³ (v1.0), nMC³ (v2.1.1) and oMC³ (v1.00) methods by speedup factors of up to 1.6, 1.9 and 2.9, respectively. Moreover, tgMC³++ supports more evolutionary models and gamma categories than previous GPU-oriented methods can handle.
2012-01-01
Background The majority of Haemosporida species infect birds or reptiles, but many important genera, including Plasmodium, infect mammals. Dipteran vectors shared by avian, reptilian and mammalian Haemosporida, suggest multiple invasions of Mammalia during haemosporidian evolution; yet, phylogenetic analyses have detected only a single invasion event. Until now, several important mammal-infecting genera have been absent in these analyses. This study focuses on the evolutionary origin of Polychromophilus, a unique malaria genus that only infects bats (Microchiroptera) and is transmitted by bat flies (Nycteribiidae). Methods Two species of Polychromophilus were obtained from wild bats caught in Switzerland. These were molecularly characterized using four genes (asl, clpc, coI, cytb) from the three different genomes (nucleus, apicoplast, mitochondrion). These data were then combined with data of 60 taxa of Haemosporida available in GenBank. Bayesian inference, maximum likelihood and a range of rooting methods were used to test specific hypotheses concerning the phylogenetic relationships between Polychromophilus and the other haemosporidian genera. Results The Polychromophilus melanipherus and Polychromophilus murinus samples show genetically distinct patterns and group according to species. The Bayesian tree topology suggests that the monophyletic clade of Polychromophilus falls within the avian/saurian clade of Plasmodium and directed hypothesis testing confirms the Plasmodium origin. Conclusion Polychromophilus' ancestor was most likely a bird- or reptile-infecting Plasmodium before it switched to bats. The invasion of mammals as hosts has, therefore, not been a unique event in the evolutionary history of Haemosporida, despite the suspected costs of adapting to a new host. This was, moreover, accompanied by a switch in dipteran host. PMID:22356874
Xie, Lei; Yang, Zhi-Yun; Wen, Jun; Li, De-Zhu; Yi, Ting-Shuang
2014-08-01
Pistacia L. exhibits a disjunct distribution in Mediterranean Eurasia and adjacent North Africa, eastern Asia, and North to Central America. The spatio-temporal diversification history of Pistacia was assessed to test hypotheses on the Madrean-Tethyan and the Eurasian Tethyan disjunctions through phylogenetic and biogeographic analyses. Maximum parsimony and Bayesian methods were employed to analyze sequences of multiple nuclear and plastid loci of Pistacia species. Bayesian dating analysis was conducted to estimate the divergence times of clades. The likelihood method LAGRANGE was used to infer ancestral areas. The New World species of Pistacia formed a clade sister to the Old World clade in all phylogenetic analyses. The eastern Asian Pistacia weinmannifolia-P. cucphuongensis clade was sister to a clade of the remaining Old World species, which were further resolved into three subclades. Pistacia was estimated to have originated at 37.60 mya (with 95% highest posterior density interval (HPD): 25.42-48.51 mya). A vicariance event in the early Miocene (19.79 mya with 95% HPD: 10.88-30.36 mya) was inferred to account for the intercontinental disjunction between the New World and the Old World species, which is consistent with the Madrean-Tethyan hypothesis. The two Old World eastern Asian-Tethyan disjunctions are best explained by one vicariance event in the early Miocene (15.87 mya with 95% HPD: 8.36-24.36 mya) and one dispersal event in late Miocene (5.89 mya with 95% HPD: 2.68-9.16 mya). The diversification of the Old World Pistacia species was significantly affected by extensive geological and climatic changes in the Qinghai-Tibetan plateau (QTP) and in the Mediterranean region.
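The divergence times in this abstract are reported with 95% highest posterior density (HPD) intervals. As a minimal sketch of what such an interval is, and not the procedure of any particular dating program, the hypothetical function `hpd_interval` below takes a one-dimensional sample of posterior draws (for example, node ages from an MCMC run) and returns the narrowest window that contains the requested fraction of the draws. It assumes a unimodal posterior; the sample values used here are placeholders, not data from the study.

```python
import numpy as np


def hpd_interval(samples, mass=0.95):
    """Narrowest interval containing at least `mass` of the sampled values.

    Assumes a unimodal posterior; `samples` is any 1-D sequence of draws.
    """
    s = np.sort(np.asarray(samples, dtype=float))
    n = len(s)
    k = int(np.ceil(mass * n))            # draws the interval must cover
    # Width of every window of k consecutive sorted draws.
    widths = s[k - 1:] - s[: n - k + 1]
    start = int(np.argmin(widths))        # first narrowest window wins ties
    return s[start], s[start + k - 1]


if __name__ == "__main__":
    # Placeholder draws from a right-skewed distribution.
    rng = np.random.default_rng(0)
    ages = rng.gamma(shape=9.0, scale=2.0, size=5000)
    low, high = hpd_interval(ages)
    print(f"95% HPD: {low:.2f}-{high:.2f}")
```

For a skewed posterior the HPD interval is generally not symmetric about the mean, which is consistent with intervals such as those quoted above sitting unevenly around the point estimate.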
Licona-Vera, Yuyini; Ornelas, Juan Francisco
2017-06-05
Geographical and temporal patterns of diversification in bee hummingbirds (Mellisugini) were assessed with respect to the evolution of migration, critical for colonization of North America. We generated a dated multilocus phylogeny of the Mellisugini based on a dense sampling using Bayesian inference, maximum-likelihood and maximum parsimony methods, and reconstructed the ancestral states of distributional areas in a Bayesian framework and migratory behavior using maximum parsimony, maximum-likelihood and re-rooting methods. All phylogenetic analyses confirmed monophyly of the Mellisugini and the inclusion of Atthis, Calothorax, Doricha, Eulidia, Mellisuga, Microstilbon, Myrmia, Tilmatura, and Thaumastura. Mellisugini consists of two clades: (1) South American species (including Tilmatura dupontii), and (2) species distributed in North and Central America and the Caribbean islands. The second clade consists of four subclades: Mexican (Calothorax, Doricha) and Caribbean (Archilochus, Calliphlox, Mellisuga) sheartails, Calypte, and Selasphorus (incl. Atthis). Coalescent-based dating places the origin of the Mellisugini in the mid-to-late Miocene, with crown ages of most subclades in the early Pliocene, and subsequent species splits in the Pleistocene. Bee hummingbirds reached western North America by the end of the Miocene and the ancestral mellisuginid (bee hummingbirds) was reconstructed as sedentary, with four independent gains of migratory behavior during the evolution of the Mellisugini. Early colonization of North America and subsequent evolution of migration best explained biogeographic and diversification patterns within the Mellisugini. The repeated evolution of long-distance migration by different lineages was critical for the colonization of North America, contributing to the radiation of bee hummingbirds. 
Comparative phylogeography is needed to test whether the repeated evolution of migration resulted from northward expansion of southern sedentary populations.
Bayesian phylogenetic analysis supports an agricultural origin of Japonic languages
Lee, Sean; Hasegawa, Toshikazu
2011-01-01
Languages, like genes, evolve by a process of descent with modification. This striking similarity between biological and linguistic evolution allows us to apply phylogenetic methods to explore how languages, as well as the people who speak them, are related to one another through evolutionary history. Language phylogenies constructed with lexical data have so far revealed population expansions of Austronesian, Indo-European and Bantu speakers. However, how robustly a phylogenetic approach can chart the history of language evolution and what language phylogenies reveal about human prehistory must be investigated more thoroughly on a global scale. Here we report a phylogeny of 59 Japonic languages and dialects. We used this phylogeny to estimate time depth of its root and compared it with the time suggested by an agricultural expansion scenario for Japanese origin. In agreement with the scenario, our results indicate that Japonic languages descended from a common ancestor approximately 2182 years ago. Together with archaeological and biological evidence, our results suggest that the first farmers of Japan had a profound impact on the origins of both people and languages. On a broader level, our results are consistent with a theory that agricultural expansion is the principal factor for shaping global linguistic diversity. PMID:21543358
Chen, Weicai; Zhang, Wei; Zhou, Shichu; Li, Ning; Huang, Yong; Mo, Yunming
2013-01-01
Leptobrachium guangxiense Fei, Mo, Ye and Jiang, 2009 (Anura: Megophryidae) is presently thought to be endemic to Shangsi, Guangxi Province, China. Molecular phylogenetic and morphological analyses were performed to gain insight into the phylogenetic position of this species. Maximum parsimony, maximum likelihood, and Bayesian inference methods were employed to reconstruct phylogenetic relationships, using 1914 bp of sequence from the mtDNA genes 12S rRNA, tRNAVal and 16S rRNA. Topologies revealed that L. guangxiense and the Tam Dao (Vietnam) L. chapaense lineage (3A) formed a well-supported monophyletic group. The uncorrected p-distance between the Tam Dao L. chapaense lineage (3A) and L. guangxiense in the ~1.4 kb 16S rRNA data set is only 0.1%. Morphologically, L. guangxiense and the Tam Dao L. chapaense lineage (3A) share the same characters and are distinguishable from "true" L. chapaense from the type locality in Sa Pa, Vietnam. Based on morphological characters and mitochondrial DNA, we suggest that the Tam Dao lineages of L. chapaense are conspecific with L. guangxiense. This represents a range extension for L. guangxiense and a new country record for Vietnam.
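The uncorrected p-distance mentioned above is simply the proportion of compared alignment sites at which two sequences differ. A minimal sketch, assuming two pre-aligned sequences of equal length and skipping any site with a gap or ambiguity code; the function name `p_distance` and the toy sequences are hypothetical and do not come from the study.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected p-distance: differing sites / sites compared."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        # Ignore gaps and ambiguity codes in either sequence.
        if a not in valid or b not in valid:
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Toy alignment: one difference in ten comparable sites.
print(p_distance("ACGTACGTAC", "ACGTACGTAT"))
```

On this scale, a distance of 0.1% over roughly 1,400 sites corresponds to only one or two differing positions.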
López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel
2017-02-01
We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropodidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Mitochondrial DNA phylogeny of camel spiders (Arachnida: Solifugae) from Iran.
Maddahi, Hassan; Khazanehdari, Mahsa; Aliabadian, Mansour; Kami, Haji Gholi; Mirshamsi, Amin; Mirshamsi, Omid
2017-11-01
In the present study, the mitochondrial DNA phylogeny of five solifuge families of Iran is presented using phylogenetic analysis of mitochondrial cytochrome c oxidase, subunit 1 (COI) sequence data. Moreover, we included available representatives from seven families from GenBank to examine the genetic distance between Old and New World taxa and test the phylogenetic relationships among more solifuge families. Phylogenetic relationships were reconstructed using two probabilistic methods, Maximum Likelihood (ML) and Bayesian inference (BI). The resulting topologies demonstrated the monophyly of the families Daesiidae, Eremobatidae, Galeodidae, Karschiidae and Rhagodidae, whereas the monophyly of the families Ammotrechidae and Gylippidae was not supported. Also, within the family Eremobatidae, the subfamilies Eremobatinae and Therobatinae and the genus Hemerotrecha were paraphyletic or polyphyletic. According to the resulting topologies, the taxonomic placements of Trichotoma michaelseni (Gylippidae) and Nothopuga sp. 1 (Ammotrechidae) remain in question and their revision might be appropriate. According to the results of this study, within the family Galeodidae, the validity of the genus Galeodopsis is supported, while the validity of the genus Paragaleodes remains uncertain. Moreover, our results revealed that the species Galeodes bacillatus and Rhagodes melanochaetus are junior synonyms of G. caspius and R. eylandti, respectively.
From sea to land and beyond – New insights into the evolution of euthyneuran Gastropoda (Mollusca)
2008-01-01
Background The Euthyneura are considered to be the most successful and diverse group of Gastropoda. Phylogenetically, they are riven with controversy. Previous morphology-based phylogenetic studies have been greatly hampered by rampant parallelism in morphological characters or by incomplete taxon sampling. Based on sequences of nuclear 18S rRNA and 28S rRNA as well as mitochondrial 16S rRNA and COI DNA from 56 taxa, we reconstructed the phylogeny of Euthyneura utilising Maximum Likelihood and Bayesian inference methods. The evolution of colonization of freshwater and terrestrial habitats by pulmonate Euthyneura, considered crucial in the evolution of this group of Gastropoda, is reconstructed with Bayesian approaches. Results We found several well supported clades within Euthyneura; however, we could not confirm the traditional classification, since Pulmonata are paraphyletic and Opisthobranchia are either polyphyletic or paraphyletic with several clades clearly distinguishable. Sacoglossa appear separately from the rest of the Opisthobranchia as sister taxon to basal Pulmonata. Within Pulmonata, Basommatophora are paraphyletic and Hygrophila and Eupulmonata form monophyletic clades. Pyramidelloidea are placed within Euthyneura rendering the Euthyneura paraphyletic. Conclusion Based on the current phylogeny, it can be proposed for the first time that invasion of freshwater by Pulmonata is a unique evolutionary event and has taken place directly from the marine environment via an aquatic pathway. The origin of colonisation of terrestrial habitats is seeded in marginal zones and has probably occurred via estuaries or semi-terrestrial habitats such as mangroves. PMID:18294406
Krishnan, Neeraja M; Seligmann, Hervé; Stewart, Caro-Beth; De Koning, A P Jason; Pollock, David D
2004-10-01
Reconstruction of ancestral DNA and amino acid sequences is an important means of inferring information about past evolutionary events. Such reconstructions suggest changes in molecular function and evolutionary processes over the course of evolution and are used to infer adaptation and convergence. Maximum likelihood (ML) is generally thought to provide relatively accurate reconstructed sequences compared to parsimony, but both methods lead to the inference of multiple directional changes in nucleotide frequencies in primate mitochondrial DNA (mtDNA). To better understand this surprising result, as well as to better understand how parsimony and ML differ, we constructed a series of computationally simple "conditional pathway" methods that differed in the number of substitutions allowed per site along each branch, and we also evaluated the entire Bayesian posterior frequency distribution of reconstructed ancestral states. We analyzed primate mitochondrial cytochrome b (Cyt-b) and cytochrome oxidase subunit I (COI) genes and found that ML reconstructs ancestral frequencies that are often more different from tip sequences than are parsimony reconstructions. In contrast, frequency reconstructions based on the posterior ensemble more closely resemble extant nucleotide frequencies. Simulations indicate that these differences in ancestral sequence inference are probably due to deterministic bias caused by high uncertainty in the optimization-based ancestral reconstruction methods (parsimony, ML, Bayesian maximum a posteriori). In contrast, ancestral nucleotide frequencies based on an average of the Bayesian set of credible ancestral sequences are much less biased. 
The methods involving simpler conditional pathway calculations have slightly reduced likelihood values compared to full likelihood calculations, but they can provide fairly unbiased nucleotide reconstructions and may be useful in more complex phylogenetic analyses than considered here due to their speed and flexibility. To determine whether biased reconstructions using optimization methods might affect inferences of functional properties, ancestral primate mitochondrial tRNA sequences were inferred and helix-forming propensities for conserved pairs were evaluated in silico. For ambiguously reconstructed nucleotides at sites with high base composition variability, ancestral tRNA sequences from Bayesian analyses were more compatible with canonical base pairing than were those inferred by other methods. Thus, nucleotide bias in reconstructed sequences apparently can lead to serious bias and inaccuracies in functional predictions.
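The bias described above can be illustrated with a toy calculation: when every site is assigned its single most probable base (as in maximum a posteriori reconstruction), minority bases vanish from the reconstructed sequence, whereas averaging the per-site posterior probabilities preserves them. This is only a sketch under assumed inputs; the per-site posterior values and both function names are hypothetical and are not the authors' implementation.

```python
from collections import Counter

BASES = "ACGT"


def map_frequencies(site_posteriors):
    """Base frequencies when each site takes its most probable base."""
    counts = Counter(max(p, key=p.get) for p in site_posteriors)
    n = len(site_posteriors)
    return {b: counts.get(b, 0) / n for b in BASES}


def posterior_mean_frequencies(site_posteriors):
    """Base frequencies averaged over the per-site posterior distributions."""
    n = len(site_posteriors)
    return {b: sum(p.get(b, 0.0) for p in site_posteriors) / n for b in BASES}


# Hypothetical ancestor: ten uncertain sites, each 60% A and 40% G.
sites = [{"A": 0.6, "G": 0.4}] * 10
print(map_frequencies(sites))             # every site is called A
print(posterior_mean_frequencies(sites))  # A near 0.6, G near 0.4
```

In this toy case the optimization-based call reports 100% A although the expected composition is about 60% A, which mirrors the directional bias in nucleotide frequencies that the abstract attributes to high reconstruction uncertainty.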
A Single Early Introduction of HIV-1 Subtype B into Central America Accounts for Most Current Cases
Murillo, Wendy; Veras, Nazle; Prosperi, Mattia; de Rivera, Ivette Lorenzana; Paz-Bailey, Gabriela; Morales-Miranda, Sonia; Juarez, Sandra I.; Yang, Chunfu; DeVos, Joshua; Marín, José Pablo; Mild, Mattias; Albert, Jan
2013-01-01
Human immunodeficiency virus type 1 (HIV-1) variants show considerable geographical separation across the world, but there is limited information from Central America. We provide the first detailed investigation of the genetic diversity and molecular epidemiology of HIV-1 in six Central American countries. Phylogenetic analysis was performed on 625 HIV-1 pol gene sequences collected between 2002 and 2010 in Honduras, El Salvador, Nicaragua, Costa Rica, Panama, and Belize. Published sequences from neighboring countries (n = 57) and the rest of the world (n = 740) were included as controls. Maximum likelihood methods were used to explore phylogenetic relationships. Bayesian coalescence-based methods were used to time HIV-1 introductions. Nearly all (98.9%) Central American sequences were of subtype B. Phylogenetic analysis revealed that 437 (70%) sequences clustered within five significantly supported monophyletic clades formed essentially by Central American sequences. One clade contained 386 (62%) sequences from all six countries; the other four clades were smaller and more country specific, suggesting discrete subepidemics. The existence of one large well-supported Central American clade provides evidence that a single introduction of HIV-1 subtype B in Central America accounts for most current cases. An introduction during the early phase of the HIV-1 pandemic may explain its epidemiological success. Moreover, the smaller clades suggest a subsequent regional spread related to specific transmission networks within each country. PMID:23616665
Ali, Syed Shujait; Yu, Yan; Pfosser, Martin; Wetschnig, Wolfgang
2012-01-01
Background and Aims Subfamily Hyacinthoideae (Hyacinthaceae) comprises more than 400 species. Members are distributed in sub-Saharan Africa, Madagascar, India, eastern Asia, the Mediterranean region and Eurasia. Hyacinthoideae, like many other plant lineages, show disjunct distribution patterns. The aim of this study was to reconstruct the biogeographical history of Hyacinthoideae based on phylogenetic analyses, to find the possible ancestral range of Hyacinthoideae and to identify factors responsible for the current disjunct distribution pattern. Methods Parsimony and Bayesian approaches were applied to obtain phylogenetic trees, based on sequences of the trnL-F region. Biogeographical inferences were obtained by applying statistical dispersal-vicariance analysis (S-DIVA) and Bayesian binary MCMC (BBM) analysis implemented in RASP (Reconstruct Ancestral State in Phylogenies). Key Results S-DIVA and BBM analyses suggest that the Hyacinthoideae clade seems to have originated in sub-Saharan Africa. Dispersal and vicariance played vital roles in creating the disjunct distribution pattern. Results also suggest an early dispersal to the Mediterranean region, and thus the northward route (from sub-Saharan Africa to the Mediterranean) of dispersal is plausible for members of subfamily Hyacinthoideae. Conclusions Biogeographical analyses reveal that subfamily Hyacinthoideae originated in sub-Saharan Africa. S-DIVA indicates an early dispersal event to the Mediterranean region followed by a vicariance event, which resulted in the tribes Hyacintheae and Massonieae. By contrast, BBM analysis favours dispersal to the Mediterranean region, eastern Asia and Europe. Biogeographical analysis suggests that sub-Saharan Africa and the Mediterranean region have played vital roles as centres of diversification and radiation within subfamily Hyacinthoideae. 
In this bimodal distribution pattern, sub-Saharan Africa is the primary centre of diversity and the Mediterranean region is the secondary centre of diversity. Sub-Saharan Africa was the source area for radiation toward Madagascar, the Mediterranean region and India. Radiations occurred from the Mediterranean region to eastern Asia, Europe, western Asia and India. PMID:22039008
Sun, Xiaomin; Zhao, Ruoping; Zhang, Ting; Gong, Jie; Jing, Meidong; Huang, Ling
2017-10-01
Coraciiformes comprises 209 species belonging to ten families with significant divergence in external morphology and lifestyle. The phylogenetic placement of Coraciiformes is still debated. Here, we determined the complete mitochondrial genomes (mitogenomes) of Crested Kingfisher (Ceryle rudis) and Black-capped Kingfisher (Halcyon pileata). The mitogenomes were 17,355 bp (C. rudis) and 17,612 bp (H. pileata) in length, and both contained 37 genes (two rRNA genes, 22 tRNA genes and 13 protein-coding genes) and one control region. The gene organization and characteristics of the two mitogenomes were similar to those of other mitogenomes in Coraciiformes; however, the sizes and nucleotide compositions of the control regions differed significantly among mitogenomes. Phylogenetic trees were constructed with both Bayesian and Maximum Likelihood methods based on mitogenome sequences from 11 families of six orders. The trees based on two different data sets supported the basal position of Psittacidae (Psittaciformes), the closest relationship between Cuculiformes (Cuculidae) and Trogoniformes (Trogonidae), and the close relationship between Coraciiformes and Piciformes. The phylogenetic placement of the clade including Cuculiformes and Trogoniformes was not resolved in the present study and needs further investigation with more molecular markers and species. The mitogenome sequences presented here provide valuable data for further taxonomic studies on Coraciiformes and other related groups.
Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd
2017-01-26
The chloroplast trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationships and to evaluate the use of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences provided phylogenetically informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.
Highly structured genetic diversity of the Mycobacterium tuberculosis population in Djibouti.
Godreuil, S; Renaud, F; Choisy, M; Depina, J J; Garnotel, E; Morillon, M; Van de Perre, P; Bañuls, A L
2010-07-01
Djibouti is an East African country with a high tuberculosis incidence. This study was conducted over a 2-month period in Djibouti, during which 62 consecutive patients with pulmonary tuberculosis (TB) were included. Genetic characterization of Mycobacterium tuberculosis, using mycobacterial interspersed repetitive-unit variable-number tandem-repeat typing and spoligotyping, was performed. The genetic and phylogenetic analysis revealed only three major families (Central Asian, East African Indian and T). The high diversity and linkage disequilibrium within each family suggest a long period of clonal evolution. A Bayesian approach shows that the phylogenetic structure observed in our sample of 62 isolates is very likely to be representative of the phylogenetic structure of the M. tuberculosis population in the total number of TB cases.
Nagy, László G; Urban, Alexander; Orstadius, Leif; Papp, Tamás; Larsson, Ellen; Vágvölgyi, Csaba
2010-12-01
Recently developed comparative phylogenetic methods offer a wide spectrum of applications in evolutionary biology, although it is generally accepted that their statistical properties are incompletely known. Here, we examine and compare the statistical power of the ML and Bayesian methods with regard to selection of best-fit models of fruiting-body evolution and hypothesis testing of ancestral states on a real-life data set of a physiological trait (autodigestion) in the family Psathyrellaceae. Our phylogenies are based on the first multigene data set generated for the family. Two different coding regimes (binary and multistate) and two data sets differing in taxon sampling density are examined. The Bayesian method outperformed Maximum Likelihood with regard to statistical power in all analyses. This is particularly evident if the signal in the data is weak, i.e. in cases when the ML approach does not provide support to choose among competing hypotheses. Results based on binary and multistate coding differed only modestly, although it was evident that multistate analyses were less conclusive in all cases. It seems that increased taxon sampling density has favourable effects on inference of ancestral states, while model parameters are influenced to a smaller extent. The model best fitting our data implies that the rate of losses of deliquescence equals zero, although model selection in ML does not provide proper support to reject three of the four candidate models. The results also support the hypothesis that non-deliquescence (lack of autodigestion) has been ancestral in Psathyrellaceae, and that deliquescent fruiting bodies represent the preferred state, having evolved independently several times during evolution. Copyright © 2010 Elsevier Inc. All rights reserved.
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction
De Oliveira Martins, Leonardo; Mallo, Diego; Posada, David
2016-01-01
Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models. PMID:25281847
GPSit: An automated method for evolutionary analysis of nonculturable ciliated microeukaryotes.
Chen, Xiao; Wang, Yurui; Sheng, Yalan; Warren, Alan; Gao, Shan
2018-05-01
Microeukaryotes are among the most important components of the microbial food web in almost all aquatic and terrestrial ecosystems worldwide. In order to gain a better understanding of their roles and functions in ecosystems, sequencing coupled with phylogenomic analyses of entire genomes or transcriptomes is increasingly used to reconstruct the evolutionary history and classification of these microeukaryotes and thus provide a more robust framework for determining their systematics and diversity. More importantly, phylogenomic research usually requires high levels of hands-on bioinformatics experience. Here, we propose an efficient automated method, "Guided Phylogenomic Search in trees" (GPSit), which starts from predicted protein sequences of newly sequenced species and a well-defined customized orthologous database. Compared with previous protocols, our method streamlines the entire workflow by integrating all essential and other optional operations. In so doing, the manual operation time for reconstructing phylogenetic relationships is reduced from days to several hours. Furthermore, GPSit supports user-defined parameters in most steps and thus allows users to adapt it to their studies. The effectiveness of GPSit is demonstrated by incorporating available online data and new single-cell data of three nonculturable marine ciliates (Anteholosticha monilata, Deviata sp. and Diophrys scutum) under moderate sequencing coverage (~5×). Our results indicate that the former could reconstruct robust "deep" phylogenetic relationships while the latter reveals the presence of intermediate taxa in shallow relationships. Based on empirical phylogenomic data, we also used GPSit to evaluate the impact of different levels of missing data on two commonly used methods of phylogenetic analysis, maximum likelihood (ML) and Bayesian inference (BI). We found that BI is less sensitive to missing data when fast-evolving sites are removed. 
© 2018 John Wiley & Sons Ltd.
Phylogeny and temporal diversification of darters (Percidae: Etheostomatinae).
Near, Thomas J; Bossu, Christen M; Bradburd, Gideon S; Carlson, Rose L; Harrington, Richard C; Hollingsworth, Phillip R; Keck, Benjamin P; Etnier, David A
2011-10-01
Discussions aimed at resolution of the Tree of Life are most often focused on the interrelationships of major organismal lineages. In this study, we focus on the resolution of some of the most apical branches in the Tree of Life through exploration of the phylogenetic relationships of darters, a species-rich clade of North American freshwater fishes. With a near-complete taxon sampling of close to 250 species, we aim to investigate strategies for efficient multilocus data sampling and the estimation of divergence times using relaxed-clock methods when a clade lacks a fossil record. Our phylogenetic data set comprises a single mitochondrial DNA (mtDNA) gene and two nuclear genes sampled from 245 of the 248 darter species. This dense sampling allows us to determine if a modest amount of nuclear DNA sequence data can resolve relationships among closely related animal species. Darters lack a fossil record to provide age calibration priors in relaxed-clock analyses. Therefore, we use a near-complete species-sampled phylogeny of the perciform clade Centrarchidae, which has a rich fossil record, to assess two distinct strategies of external calibration in relaxed-clock divergence time estimates of darters: using ages inferred from the fossil record and molecular evolutionary rate estimates. Comparison of Bayesian phylogenies inferred from mtDNA and nuclear genes reveals that heterospecific mtDNA is present in approximately 12.5% of all darter species. 
We identify three patterns of mtDNA introgression in darters: proximal mtDNA transfer, which involves the transfer of mtDNA among extant and sympatric darter species, indeterminate introgression, which involves the transfer of mtDNA from a lineage that cannot be confidently identified because the introgressed haplotypes are not clearly referable to mtDNA haplotypes in any recognized species, and deep introgression, which is characterized by species diversification within a recipient clade subsequent to the transfer of heterospecific mtDNA. The results of our analyses indicate that DNA sequences sampled from single-copy nuclear genes can provide appreciable phylogenetic resolution for closely related animal species. A well-resolved near-complete species-sampled phylogeny of darters was estimated with Bayesian methods using a concatenated mtDNA and nuclear gene data set with all identified heterospecific mtDNA haplotypes treated as missing data. The relaxed-clock analyses resulted in very similar posterior age estimates across the three sampled genes and methods of calibration and therefore offer a viable strategy for estimating divergence times for clades that lack a fossil record. In addition, an informative rank-free clade-based classification of darters that preserves the rich history of nomenclature in the group and provides formal taxonomic communication of darter clades was constructed using the mtDNA and nuclear gene phylogeny. On the whole, the appeal of mtDNA for phylogeny inference among closely related animal species is diminished by the observations of extensive mtDNA introgression and by finding appreciable phylogenetic signal in a modest sampling of nuclear genes in our phylogenetic analyses of darters.
Guo, Aijiang
2016-08-31
Tapeworms Cladotaenia spp. are among the most important wildlife pathogens in birds of prey. The genus Cladotaenia is placed in the family Paruterinidae based on morphological characteristics and hosts. However, limited molecular information is available for studying the phylogenetic position of this genus in relation to other cestodes. In this study, the complete mitochondrial (mt) genome of Cladotaenia vulturi was amplified using "Long-PCR" and then sequenced by primer walking. Sequence annotation and gene identification were performed by comparison with published flatworm mt genomes. The phylogenetic relationships of C. vulturi with other cestode species were established using the concatenated amino acid sequences of 12 protein-coding genes with Bayesian Inference and Maximum Likelihood methods. The complete mitochondrial genome of Cladotaenia vulturi is 13,411 bp in size and contains 36 genes. The gene arrangement of C. vulturi is identical to those in Anoplocephala spp. (Anoplocephalidae), Hymenolepis spp. (Hymenolepididae) and Dipylidium caninum (Dipylidiidae), but different from that in taeniids owing to the order shift between the tRNA (L1) and tRNA (S2) genes. Phylogenetic analyses based on the amino acid sequences of the concatenated 12 protein-coding genes showed that the species in the Taeniidae form a group and C. vulturi is a sister taxon to the species of the family Taeniidae. To our knowledge, the present study provides the first molecular data to support the early proposal from morphological evidence that the Taeniidae is a sister group to the family Paruterinidae. This novel mt genome sequence will be useful for further investigations into the population genetics, phylogenetics and systematics of the family Paruterinidae and inferring phylogenetic relationships among several lineages within the order Cyclophyllidea.
The dawn of open access to phylogenetic data.
Magee, Andrew F; May, Michael R; Moore, Brian R
2014-01-01
The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation--extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for [Formula: see text] of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.
Calibrated tree priors for relaxed phylogenetics and divergence time estimation.
Heled, Joseph; Drummond, Alexei J
2012-01-01
The use of fossil evidence to calibrate divergence time estimation has a long history. More recently, Bayesian Markov chain Monte Carlo has become the dominant method of divergence time estimation, and fossil evidence has been reinterpreted as the specification of prior distributions on the divergence times of calibration nodes. These so-called "soft calibrations" have become widely used but the statistical properties of calibrated tree priors in a Bayesian setting have not been carefully investigated. Here, we clarify that calibration densities, such as those defined in BEAST 1.5, do not represent the marginal prior distribution of the calibration node. We illustrate this with a number of analytical results on small trees. We also describe an alternative construction for a calibrated Yule prior on trees that allows direct specification of the marginal prior distribution of the calibrated divergence time, with or without the restriction of monophyly. This method requires the computation of the Yule prior conditional on the height of the divergence being calibrated. Unfortunately, a practical solution for multiple calibrations remains elusive. Our results suggest that direct estimation of the prior induced by specifying multiple calibration densities should be a prerequisite of any divergence time dating analysis.
Robles, María del Rosario; Cutillas, Cristina; Panei, Carlos Javier; Callejón, Rocío
2014-01-01
Populations of Trichuris spp. isolated from six species of sigmodontine rodents from Argentina were analyzed based on morphological characteristics and ITS2 (rDNA) region sequences. Molecular data provided an opportunity to discuss the phylogenetic relationships among the Trichuris spp. from North and South America (mainly from Argentina). Trichuris specimens were identified morphologically as Trichuris pardinasi, T. navonae, Trichuris sp. and Trichuris new species, described in this paper. Sequences analyzed by Maximum Parsimony, Maximum Likelihood and Bayesian inference methods showed four main clades corresponding with the four different species regardless of geographical origin and host species. These four species from sigmodontine rodents clustered together and separated from Trichuris species isolated from murine and arvicoline rodents (outgroup). Different genetic lineages were observed among Trichuris species from sigmodontine rodents, which supported the proposal of a new species. Moreover, host distribution showed correspondence with the different tribes within the subfamily Sigmodontinae. PMID:25393618
Castilho, Flávio J D; Torres, Rodrigo A; Barbosa, Aneli M; Dekker, Robert F H; Garcia, José E
2009-02-01
The present study is the first describing the sequencing of a fragment of the copper-oxidase domain of a laccase gene in the family Botryosphaeriaceae. The aim of this work was to assess the degree of genetic and evolutionary relationships of a laccase gene from Botryosphaeria rhodina MAMB-05 with other ascomycete and basidiomycete laccase genes. The 193-amino acid sequences of the copper-oxidase domain from several different fungi, insects, a plant, and a bacterial species were retrieved from GenBank and aligned. Phylogenetic analyses were performed using neighbor-joining, maximum parsimony, and Bayesian inference methods. The organisms studied clustered into five gene clades: fungi (ascomycetes and basidiomycetes), insects, plants, and bacteria. Also, the topologies showed that fungal laccases of the ascomycetes and basidiomycetes are clearly separated into two distinct clusters. This evidence indicated that B. rhodina MAMB-05 and other closely related ascomycetes are a new biological resource given the biotechnological potential of their laccase genes.
Buckley, Christopher D.
2012-01-01
The warp ikat method of making decorated textiles is one of the most geographically widespread in southeast Asia, being used by Austronesian peoples in Indonesia, Malaysia and the Philippines, and Daic peoples on the Asian mainland. In this study a dataset consisting of the decorative characters of 36 of these warp ikat weaving traditions is investigated using Bayesian and Neighbornet techniques, and the results are used to construct a phylogenetic tree and taxonomy for warp ikat weaving in southeast Asia. The results and analysis show that these diverse traditions have a common ancestor amongst neolithic cultures of the Asian mainland, and parallels exist between the patterns of textile weaving descent and linguistic phylogeny for the Austronesian group. Ancestral state analysis is used to reconstruct some of the features of the ancestral weaving tradition. The widely held theory that weaving motifs originated in the late Bronze Age Dong-Son culture is shown to be inconsistent with the data. PMID:23272211
Vidal-Martínez, Victor M.
2017-01-01
The phylogenetic position of three taxa from two trematode genera, belonging to the subfamily Acanthostominae (Opisthorchioidea: Cryptogonimidae), were analysed using partial 28S ribosomal DNA (Domains 1–2) and internal transcribed spacers (ITS1–5.8S–ITS2). Bayesian inference and Maximum likelihood analyses of combined 28S rDNA and ITS1 + 5.8S + ITS2 sequences indicated the monophyly of the genus Acanthostomum (A. cf. americanum and A. burminis) and paraphyly of the Acanthostominae. These phylogenetic relationships were consistent in analyses of 28S alone and concatenated 28S + ITS1 + 5.8S + ITS2 sequences analyses. Based on molecular phylogenetic analyses, the subfamily Acanthostominae is therefore a paraphyletic taxon, in contrast with previous classifications based on morphological data. Phylogenetic patterns of host specificity inferred from adult stages of other cryptogonimid taxa are also well supported. However, analyses using additional genera and species are necessary to support the phylogenetic inferences from this study. Our molecular phylogenetic reconstruction linked two larval stages of A. cf. americanum cercariae and metacercariae. Here, we present the evolutionary and ecological implications of parasitic infections in freshwater and brackish environments. PMID:29250471
Martínez-Aquino, Andrés; Vidal-Martínez, Victor M; Aguirre-Macedo, M Leopoldina
2017-01-01
The phylogenetic position of three taxa from two trematode genera, belonging to the subfamily Acanthostominae (Opisthorchioidea: Cryptogonimidae), were analysed using partial 28S ribosomal DNA (Domains 1-2) and internal transcribed spacers (ITS1-5.8S-ITS2). Bayesian inference and Maximum likelihood analyses of combined 28S rDNA and ITS1 + 5.8S + ITS2 sequences indicated the monophyly of the genus Acanthostomum (A. cf. americanum and A. burminis) and paraphyly of the Acanthostominae. These phylogenetic relationships were consistent in analyses of 28S alone and concatenated 28S + ITS1 + 5.8S + ITS2 sequences analyses. Based on molecular phylogenetic analyses, the subfamily Acanthostominae is therefore a paraphyletic taxon, in contrast with previous classifications based on morphological data. Phylogenetic patterns of host specificity inferred from adult stages of other cryptogonimid taxa are also well supported. However, analyses using additional genera and species are necessary to support the phylogenetic inferences from this study. Our molecular phylogenetic reconstruction linked two larval stages of A. cf. americanum cercariae and metacercariae. Here, we present the evolutionary and ecological implications of parasitic infections in freshwater and brackish environments.
Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J.
2013-01-01
Background Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Methodology/Principal Findings Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. Conclusions/Significance To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide. PMID:23805195
Alfonso-Morales, Abdulahi; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J
2013-01-01
Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide.
FPGA Acceleration of the phylogenetic likelihood function for Bayesian MCMC inference methods.
Zierke, Stephanie; Bakos, Jason D
2010-04-12
Likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations. As such it contains a high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA's on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resultant co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors. We use the MrBayes 3 tool as a framework for designing our co-processor. For large datasets, we estimate that our accelerated MrBayes, if run on a current-generation FPGA, achieves a 10x speedup relative to software running on a state-of-the-art server-class microprocessor. The FPGA-based implementation achieves its performance by deeply pipelining the likelihood computations, performing multiple floating-point operations in parallel, and through a natural log approximation that is chosen specifically to leverage a deeply pipelined custom architecture. Heterogeneous computing, which combines general-purpose processors with special-purpose co-processors such as FPGAs and GPUs, is a promising approach for high-performance phylogeny inference as shown by the growing body of literature in this field. FPGAs in particular are well-suited for this task because of their low power consumption as compared to many-core processors and Graphics Processor Units (GPUs).
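The claim that the PLF is a loop without dependencies between iterations can be illustrated with a small sketch. The Python below is not the MrBayes kernel or the FPGA design described in the abstract; it is a minimal illustration that assumes a Jukes-Cantor substitution model, and the names `jc69_matrix`, `plf_site` and `plf` are hypothetical. It computes the conditional likelihood vector at a parent node from its two children, one alignment site at a time.

```python
import math

def jc69_matrix(t):
    """4x4 Jukes-Cantor transition probabilities for branch length t.

    The Jukes-Cantor model is an assumption made for this sketch only.
    """
    e = math.exp(-4.0 * t / 3.0)
    same = 0.25 + 0.75 * e   # probability of ending in the starting state
    diff = 0.25 - 0.25 * e   # probability of ending in one specific other state
    return [[same if i == j else diff for j in range(4)] for i in range(4)]

def plf_site(left, right, p_left, p_right):
    """Conditional likelihoods at a parent node for a single site."""
    parent = []
    for s in range(4):
        # Sum over child states, weighted by the transition probabilities.
        l = sum(p_left[s][x] * left[x] for x in range(4))
        r = sum(p_right[s][x] * right[x] for x in range(4))
        parent.append(l * r)
    return parent

def plf(left_sites, right_sites, t_left, t_right):
    """Apply plf_site to every site; no iteration depends on another."""
    p_left, p_right = jc69_matrix(t_left), jc69_matrix(t_right)
    return [plf_site(l, r, p_left, p_right)
            for l, r in zip(left_sites, right_sites)]

# Toy tip vectors (state order A, C, G, T): 1.0 for the observed base.
A = [1.0, 0.0, 0.0, 0.0]
C = [0.0, 1.0, 0.0, 0.0]
result = plf([A, A], [A, C], 0.1, 0.2)
```

Because each call to `plf_site` reads only its own site's inputs and the shared transition matrices, the per-site work can in principle be pipelined or run in parallel, which is the property the abstract refers to.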
Phylogenetic analysis of honey bee behavioral evolution.
Raffiudin, Rika; Crozier, Ross H
2007-05-01
DNA sequences from three mitochondrial (rrnL, cox2, nad2) and one nuclear gene (itpr) from all 9 known honey bee species (Apis), a 10th possible species, Apis dorsata binghami, and three outgroup species (Bombus terrestris, Melipona bicolor and Trigona fimbriata) were used to infer Apis phylogenetic relationships using Bayesian analysis. The dwarf honey bees were confirmed as basal, and the giant and cavity-nesting species to be monophyletic. All nodes were strongly supported except that grouping Apis cerana with A. nigrocincta. Two thousand post-burnin trees from the phylogenetic analysis were used in a Bayesian comparative analysis to explore the evolution of dance type, nest structure, comb structure and dance sound within Apis. The ancestral honey bee species was inferred with high support to have nested in the open, and to have more likely than not had a silent vertical waggle dance and a single comb. The common ancestor of the giant and cavity-dwelling bees is strongly inferred to have had a buzzing vertical directional dance. All pairwise combinations of characters showed strong association, but the multiple comparisons problem reduces the ability to infer associations between states between characters. Nevertheless, a buzzing dance is significantly associated with cavity-nesting, several vertical combs, and dancing vertically, a horizontal dance is significantly associated with a nest with a single comb wrapped around the support, and open nesting with a single pendant comb and a silent waggle dance.
USDA-ARS's Scientific Manuscript database
Fusarium (Hypocreales, Nectriaceae) is one of the most economically important and systematically challenging groups of mycotoxigenic phytopathogens and emergent human pathogens. We conducted maximum likelihood (ML), maximum parsimony (MP) and Bayesian (B) analyses on partial RNA polymerase largest (...
2011-01-01
Background Efforts to solve higher-level evolutionary relationships within the class Insecta by using mitochondrial genomic data are hindered due to fast sequence evolution of several groups, most notably Hymenoptera, Strepsiptera, Phthiraptera, Hemiptera and Thysanoptera. Accelerated rates of substitution on their sequences have been shown to have negative consequences in phylogenetic inference. In this study, we tested several methodological approaches to recover phylogenetic signal from whole mitochondrial genomes. As a model, we used two classical problems in insect phylogenetics: The relationships within Paraneoptera and within Holometabola. Moreover, we assessed the mitochondrial phylogenetic signal limits in the deeper Eumetabola dataset, and we studied the contribution of individual genes. Results Long-branch attraction (LBA) artefacts were detected in all the datasets. Methods using Bayesian inference outperformed maximum likelihood approaches, and LBA was avoided in Paraneoptera and Holometabola when using protein sequences and the site-heterogeneous mixture model CAT. The better performance of this method was evidenced by resulting topologies matching generally accepted hypotheses based on nuclear and/or morphological data, and was confirmed by cross-validation and simulation analyses. Using the CAT model, the order Strepsiptera was recovered as sister to Coleoptera for the first time using mitochondrial sequences, in agreement with recent results based on large nuclear and morphological datasets. Also the Hymenoptera-Mecopterida association was obtained, leaving Coleoptera and Strepsiptera as the basal groups of the holometabolan insects, which coincides with one of the two main competing hypotheses. For the Paraneoptera, the currently accepted non-monophyly of Homoptera was documented as a phylogenetic novelty for mitochondrial data.
However, results were not satisfactory when exploring the entire Eumetabola, revealing the limits of the phylogenetic signal that can be extracted from Insecta mitogenomes. Based on the combined use of the five best topology-performing genes we obtained comparable results to whole mitogenomes, highlighting the important role of data quality. Conclusion We show for the first time that mitogenomic data agrees with nuclear and morphological data for several of the most controversial insect evolutionary relationships, adding a new independent source of evidence to study relationships among insect orders. We propose that deeper divergences cannot be inferred with the currently available methods due to sequence saturation and compositional bias inconsistencies. Our exploratory analysis indicates that the CAT model is the best at dealing with LBA and it could be useful for other groups and datasets with similar phylogenetic difficulties. PMID:22032248
Fortunato, Laura; Holden, Clare; Mace, Ruth
2006-12-01
Significant amounts of wealth have been exchanged as part of marriage settlements throughout history. Although various models have been proposed for interpreting these practices, their development over time has not been investigated systematically. In this paper we use a Bayesian MCMC phylogenetic comparative approach to reconstruct the evolution of two forms of wealth transfers at marriage, dowry and bridewealth, for 51 Indo-European cultural groups. Results indicate that dowry is more likely to have been the ancestral practice, and that a minimum of four changes to bridewealth is necessary to explain the observed distribution of the two states across the cultural groups.
Zhou, Xiaoming; Chan, Paul K. S.; Tam, John S.; Tang, Julian W.
2011-01-01
Background Hepatitis C virus (HCV) 6a accounts for 23.6% of all HCV infections of the general population and 58.5% of intravenous drug users in Hong Kong. However, the geographical origin of this highly predominant HCV subgenotype is largely unknown. This study explores a hypothesis for one possible transmission route of HCV 6a to Hong Kong. Methods NS5A sequences derived from 26 HCV 6a samples were chosen from a five year period (1999–2004) from epidemiologically unrelated patients from Hong Kong. Partial-NS5A sequences (513-bp from nt 6728 to 7240) were adopted for Bayesian coalescent analysis to reconstruct the evolutionary history of HCV infections in Hong Kong using the BEAST v1.3 program. A rooted phylogenetic tree was drawn for these sequences by alignment with reference Vietnamese sequences. Demographic data were accessed from “The Statistic Yearbooks of Hong Kong”. Results Bayesian coalescent analysis showed that the rapid increase in 6a infections, which had increased more than 90-fold in Hong Kong from 1986 to 1994, correlated with two peaks of Vietnamese immigration to Hong Kong from 1978 to 1997. The second peak, which occurred from 1987 through 1997, overlapped with the rapid increase of HCV 6a occurrence in Hong Kong. Phylogenetic analyses have further revealed that HCV 6a strains from Vietnam may be ancestral to Hong Kong counterparts. Conclusions The high predominance of HCV 6a infections in Hong Kong was possibly associated with Vietnamese immigration during 1987–1997. PMID:21931867
Knapp, Jenny; Nakao, Minoru; Yanagida, Tetsuya; Okamoto, Munehiro; Saarma, Urmas; Lavikainen, Antti; Ito, Akira
2011-12-01
The family Taeniidae of tapeworms is composed of two genera, Echinococcus and Taenia, which obligately parasitize mammals including humans. Inferring phylogeny via molecular markers is the only way to trace back their evolutionary histories. However, molecular dating approaches are lacking so far. Here we established new markers from nuclear protein-coding genes for RNA polymerase II second largest subunit (rpb2), phosphoenolpyruvate carboxykinase (pepck) and DNA polymerase delta (pold). Bayesian inference and maximum likelihood analyses of the concatenated gene sequences allowed us to reconstruct phylogenetic trees for taeniid parasites. The tree topologies clearly demonstrated that Taenia is paraphyletic and that the clade of Echinococcus oligarthrus and Echinococcus vogeli is sister to all other members of Echinococcus. Both species are endemic in Central and South America, and their definitive hosts originated from carnivores that immigrated from North America after the formation of the Panamanian land bridge about 3 million years ago (Ma). A time-calibrated phylogeny was estimated by a Bayesian relaxed-clock method based on the assumption that the most recent common ancestor of E. oligarthrus and E. vogeli existed during the late Pliocene (3.0 Ma). The results suggest that a clade of Taenia including human-pathogenic species diversified primarily in the late Miocene (11.2 Ma), whereas Echinococcus started to diversify later, at the end of the Miocene (5.8 Ma). Close genetic relationships among the members of Echinococcus imply that the genus is a young group in which speciation and global radiation occurred rapidly. Copyright © 2011 Elsevier Inc. All rights reserved.
Bayesian Analysis of Evolutionary Divergence with Genomic Data under Diverse Demographic Models.
Chung, Yujin; Hey, Jody
2017-06-01
We present a new Bayesian method for estimating demographic and phylogenetic history using population genomic data. Several key innovations are introduced that allow the study of diverse models within an Isolation-with-Migration framework. The new method implements a 2-step analysis, with an initial Markov chain Monte Carlo (MCMC) phase that samples simple coalescent trees, followed by the calculation of the joint posterior density for the parameters of a demographic model. In step 1, the MCMC sampling phase, the method uses a reduced state space, consisting of coalescent trees without migration paths, and a simple importance sampling distribution without the demography of interest. Once obtained, a single sample of trees can be used in step 2 to calculate the joint posterior density for model parameters under multiple diverse demographic models, without having to repeat MCMC runs. Because migration paths are not included in the state space of the MCMC phase, but rather are handled by analytic integration in step 2 of the analysis, the method is scalable to a large number of loci with excellent MCMC mixing properties. With an implementation of the new method in the computer program MIST, we demonstrate the method's accuracy, scalability, and other advantages using simulated data and DNA sequences of two common chimpanzee subspecies: Pan troglodytes (P. t.) troglodytes and P. t. verus. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
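The idea of drawing one sample under a simple distribution and then reusing it for several models can be illustrated with a generic importance-sampling toy. This is not the MIST algorithm: here a "tree" is reduced to a single coalescence time, the reference distribution is an exponential with rate 1, and each "demographic model" is just an exponential with a different rate. All names (`sample_reference`, `reweighted_mean`) and values are assumptions made for illustration.

```python
import math
import random

def sample_reference(n, seed=0):
    """Step 1 (toy): draw coalescence times once from a simple reference Exp(1)."""
    rng = random.Random(seed)
    return [rng.expovariate(1.0) for _ in range(n)]

def reweighted_mean(samples, rate):
    """Step 2 (toy): reuse the same samples under a model with the given rate.

    Each sample t gets weight p(t | rate) / q(t), where p is Exp(rate) and
    q is the Exp(1) reference. The weights are self-normalized before
    estimating the mean coalescence time under the target model.
    """
    weights = [rate * math.exp(-rate * t) / math.exp(-t) for t in samples]
    total = sum(weights)
    return sum(w * t for w, t in zip(weights, samples)) / total

# One sampling run, several models evaluated without resampling.
samples = sample_reference(20000)
estimates = {rate: reweighted_mean(samples, rate) for rate in (1.0, 2.0, 3.0)}
```

Each estimate should lie close to `1 / rate`, the true mean of the corresponding exponential. The target rates are kept at or above the reference rate so that the weights stay bounded; a reference distribution with lighter tails than the target would make the estimates unreliable.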
Lovette, I.J.; Perez-Eman, J. L.; Sullivan, J.P.; Banks, R.C.; Fiorentino, I.; Cordoba-Cordoba, S.; Echeverry-Galvis, M.; Barker, F.K.; Burns, K.J.; Klicka, J.; Lanyon, Scott M.; Bermingham, E.
2010-01-01
The birds in the family Parulidae (commonly termed the New World warblers or wood-warblers) are a classic model radiation for studies of ecological and behavioral differentiation. Although the monophyly of a 'core' wood-warbler clade is well established, no phylogenetic hypothesis for this group has included a full sampling of wood-warbler species diversity. We used parsimony, maximum likelihood, and Bayesian methods to reconstruct relationships among all genera and nearly all wood-warbler species, based on a matrix of mitochondrial DNA (5840 nucleotides) and nuclear DNA (6 loci, 4602 nucleotides) characters. The resulting phylogenetic hypotheses provide a highly congruent picture of wood-warbler relationships, and indicate that the traditional generic classification of these birds recognizes many non-monophyletic groups. We recommend a revised taxonomy in which each of 14 genera (Seiurus, Helmitheros, Mniotilta, Limnothlypis, Protonotaria, Parkesia, Vermivora, Oreothlypis, Geothlypis, Setophaga, Myioborus, Cardellina, Basileuterus, Myiothlypis) corresponds to a well-supported clade; these nomenclatural changes also involve subsuming a number of well-known, traditional wood-warbler genera (Catharopeza, Dendroica, Ergaticus, Euthlypis, Leucopeza, Oporornis, Parula, Phaeothlypis, Wilsonia). We provide a summary phylogenetic hypothesis that will be broadly applicable to investigations of the historical biogeography, processes of diversification, and evolution of trait variation in this well studied avian group. © 2010 Elsevier Inc.
Explaining the Linguistic Diversity of Sahul Using Population Models
Reesink, Ger; Singer, Ruth; Dunn, Michael
2009-01-01
The region of the ancient Sahul continent (present day Australia and New Guinea, and surrounding islands) is home to extreme linguistic diversity. Even apart from the huge Austronesian language family, which spread into the area after the breakup of the Sahul continent in the Holocene, there are hundreds of languages from many apparently unrelated families. On each of the subcontinents, the generally accepted classification recognizes one large, widespread family and a number of unrelatable smaller families. If these language families are related to each other, it is at a depth which is inaccessible to standard linguistic methods. We have inferred the history of structural characteristics of these languages under an admixture model, using a Bayesian algorithm originally developed to discover populations on the basis of recombining genetic markers. This analysis identifies 10 ancestral language populations, some of which can be identified with clearly defined phylogenetic groups. The results also show traces of early dispersals, including hints at ancient connections between Australian languages and some Papuan groups (long hypothesized, never before demonstrated). Systematic language contact effects between members of big phylogenetic groups are also detected, which can in some cases be identified with a diffusional or substrate signal. Most interestingly, however, there remains striking evidence of a phylogenetic signal, with many languages showing negligible amounts of admixture. PMID:19918360
Mitochondrial phylogeny of an Asian tree frog genus Theloderma (Anura: Rhacophoridae).
Nguyen, Tao Thien; Matsui, Masafumi; Eto, Koshiro
2015-04-01
We assessed phylogenetic and systematic relationships among 17 out of 23 species of Theloderma and all three species of Nyctixalus from 2412bp sequences of the mitochondrial DNA genes of 12S rRNA, tRNA(val), and 16S rRNA using maximum likelihood and Bayesian inference methods. With the exception of T. moloch, Theloderma and Nyctixalus are confirmed to form a clade, in which each genus also forms a clade. Theloderma moloch is phylogenetically outside these clades and closer to samples from Chiromantis, Feihyla, Gracixalus, Kurixalus, Philautus, Polypedates, Raorchestes, and Rhacophorus. Within Theloderma, T. horridum and T. stellatum form the sister taxon to a clade comprising the remaining species. The basal split within the latter clade groups T. asperum, T. licin, T. petilum, and T. ryabovi as the sister to a clade comprising T. bicolor, T. chuyangsinense, T. corticale, T. gordoni, T. laeve, T. lateriticum, T. nebulosum, T. rhododiscus, and T. truongsonense. Our phylogenetic results indicate homoplastic evolution of four morphological characters: small vs. large body size, presence of vomerine teeth, presence of a vocal opening in males, and interdigital webbing on hands. The common ancestor of Theloderma and Nyctixalus is inferred to have arisen in the area including the current Sunda region. Copyright © 2015 Elsevier Inc. All rights reserved.
How Accurate and Robust Are the Phylogenetic Estimates of Austronesian Language Relationships?
Greenhill, Simon J.; Drummond, Alexei J.; Gray, Russell D.
2010-01-01
We recently used computational phylogenetic methods on lexical data to test between two scenarios for the peopling of the Pacific. Our analyses of lexical data supported a pulse-pause scenario of Pacific settlement in which the Austronesian speakers originated in Taiwan around 5,200 years ago and rapidly spread through the Pacific in a series of expansion pulses and settlement pauses. We claimed that there was high congruence between traditional language subgroups and those observed in the language phylogenies, and that the estimated age of the Austronesian expansion at 5,200 years ago was consistent with the archaeological evidence. However, the congruence between the language phylogenies and the evidence from historical linguistics was not quantitatively assessed using tree comparison metrics. The robustness of the divergence time estimates to different calibration points was also not investigated exhaustively. Here we address these limitations by using a systematic tree comparison metric to calculate the similarity between the Bayesian phylogenetic trees and the subgroups proposed by historical linguistics, and by re-estimating the age of the Austronesian expansion using only the most robust calibrations. The results show that the Austronesian language phylogenies are highly congruent with the traditional subgroupings, and the date estimates are robust even when calculated using a restricted set of historical calibrations. PMID:20224774
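The abstract does not say which tree comparison metric was used, so the following is only a sketch of one simple way to quantify congruence between a tree and a set of proposed subgroups: the fraction of subgroups that appear as clades in the tree. The nested-tuple tree encoding, the function names `clades` and `subgroup_recovery`, and the placeholder labels A to E are all assumptions; they are not Austronesian data.

```python
def clades(tree):
    """Collect the leaf sets of all internal nodes of a nested-tuple tree."""
    found = set()

    def walk(node):
        if isinstance(node, str):          # a leaf is a plain label
            return frozenset([node])
        # An internal node is a tuple of children; its clade is their union.
        leaves = frozenset().union(*(walk(child) for child in node))
        found.add(leaves)
        return leaves

    walk(tree)
    return found

def subgroup_recovery(tree, subgroups):
    """Fraction of proposed subgroups that occur as clades in the tree."""
    tree_clades = clades(tree)
    hits = sum(1 for group in subgroups if frozenset(group) in tree_clades)
    return hits / len(subgroups)

# Placeholder tree and subgroups (hypothetical labels, not real languages).
tree = ((("A", "B"), "C"), ("D", "E"))
subgroups = [{"A", "B"}, {"D", "E"}, {"A", "C"}]
score = subgroup_recovery(tree, subgroups)
```

In this toy case two of the three subgroups are recovered as clades and `{"A", "C"}` is not. With a posterior sample of trees, the same score could be averaged over the sample to summarize congruence across the whole distribution.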
Hadfield, J D; Nakagawa, S
2010-03-01
Although many of the statistical techniques used in comparative biology were originally developed in quantitative genetics, subsequent development of comparative techniques has progressed in relative isolation. Consequently, many of the new and planned developments in comparative analysis already have well-tested solutions in quantitative genetics. In this paper, we take three recent publications that develop phylogenetic meta-analysis, either implicitly or explicitly, and show how they can be considered as quantitative genetic models. We highlight some of the difficulties with the proposed solutions, and demonstrate that standard quantitative genetic theory and software offer solutions. We also show how results from Bayesian quantitative genetics can be used to create efficient Markov chain Monte Carlo algorithms for phylogenetic mixed models, thereby extending their generality to non-Gaussian data. Of particular utility is the development of multinomial models for analysing the evolution of discrete traits, and the development of multi-trait models in which traits can follow different distributions. Meta-analyses often include a nonrandom collection of species for which the full phylogenetic tree has only been partly resolved. Using missing data theory, we show how the presented models can be used to correct for nonrandom sampling and show how taxonomies and phylogenies can be combined to give a flexible framework with which to model dependence.
Rylková, K; Tůmová, E; Brožová, A; Jankovská, I; Vadlejch, J; Čadková, Z; Frýdlová, J; Peřinková, P; Langrová, I; Chodová, D; Nechybová, S; Scháňková, Š
2015-11-01
Trichuris sp. individuals were collected from Myocastor coypus from fancy breeder farms in the Czech Republic. Using morphological and biometrical methods, 30 female and 30 male nematodes were identified as Trichuris myocastoris. This paper presents the first molecular description of this species. The ribosomal DNA (rDNA) region, consisting of internal transcribed spacer (ITS)-1, the 5.8S gene and ITS-2, was sequenced. Based on an analysis of 651 bp, T. myocastoris was found to be different from any other Trichuris species for which published sequencing of the ITS region is available. The phylogenetic relationships were estimated using maximum parsimony and Bayesian analyses. T. myocastoris was found to be significantly more closely related to Trichuris of rodents than to those of ruminants.
Are humans the initial source of canine mange?
Andriantsoanirina, Valérie; Fang, Fang; Ariey, Frédéric; Izri, Arezki; Foulet, Françoise; Botterel, Françoise; Bernigaud, Charlotte; Chosidow, Olivier; Huang, Weiyi; Guillot, Jacques; Durand, Rémy
2016-03-25
Scabies, or mange as it is called in animals, is an ectoparasitic contagious infestation caused by the mite Sarcoptes scabiei. Sarcoptic mange is an important veterinary disease leading to significant morbidity and mortality in wild and domestic animals. A widely accepted hypothesis, though never substantiated by factual data, suggests that humans were the initial source of the animal contamination. In this study we performed phylogenetic analyses of populations of S. scabiei from humans and from canids to test the hypothesis of a human origin of the mites infecting domestic dogs. Mites from dogs and foxes were obtained from three French sites and from other countries. A part of the cytochrome c oxidase subunit 1 (cox1) gene was amplified and directly sequenced. Other sequences corresponding to mites from humans, raccoon dogs, foxes, jackal and dogs from various geographical areas were retrieved from GenBank. Phylogenetic analyses were performed using the Otodectes cynotis cox1 sequence as outgroup. Maximum Likelihood and Bayesian Inference analysis approaches were used. To visualize the relationship between the haplotypes, a median joining haplotype network was constructed using Network v4.6 according to host. Twenty-one haplotypes were observed among mites collected from five different host species, including humans and canids from nine geographical areas. The phylogenetic trees based on Maximum Likelihood and Bayesian Inference analyses showed similar topologies with few differences in node support values. The results were not consistent with a human origin of S. scabiei mites in dogs and, on the contrary, did not exclude the opposite hypothesis of a host switch from dogs to humans. Phylogenetic relatedness may have an impact in terms of epidemiological control strategy. Our results and other recent studies suggest that the level of transmission between domestic dogs and humans should be re-evaluated.
Hernández-León, Sergio; Gernandt, David S.; Pérez de la Rosa, Jorge A.; Jardón-Barbolla, Lev
2013-01-01
Recent diversification followed by secondary contact and hybridization may explain complex patterns of intra- and interspecific morphological and genetic variation in the North American hard pines (Pinus section Trifoliae), a group of approximately 49 tree species distributed in North and Central America and the Caribbean islands. We concatenated five plastid DNA markers for an average of 3.9 individuals per putative species and assessed the suitability of the five regions as DNA bar codes for species identification, species delimitation, and phylogenetic reconstruction. The ycf1 gene accounted for the greatest proportion of the alignment (46.9%), the greatest proportion of variable sites (74.9%), and the most unique sequences (75 haplotypes). Phylogenetic analysis recovered clades corresponding to subsections Australes, Contortae, and Ponderosae. Sequences for 23 of the 49 species were monophyletic and sequences for another 9 species were paraphyletic. Morphologically similar species within subsections usually grouped together, but there were exceptions consistent with incomplete lineage sorting or introgression. Bayesian relaxed molecular clock analyses indicated that all three subsections diversified relatively recently during the Miocene. The general mixed Yule-coalescent method gave a mixed model estimate of only 22 or 23 evolutionary entities for the plastid sequences, which corresponds to less than half the 49 species recognized based on morphological species assignments. Including more unique haplotypes per species may result in higher estimates, but low mutation rates, recent diversification, and large effective population sizes may limit the effectiveness of this method to detect evolutionary entities. PMID:23936218
A phylogenetic Kalman filter for ancestral trait reconstruction using molecular data.
Lartillot, Nicolas
2014-02-15
Correlation between life history or ecological traits and genomic features such as nucleotide or amino acid composition can be used for reconstructing the evolutionary history of the traits of interest along phylogenies. Thus far, however, such ancestral reconstructions have been done using simple linear regression approaches that do not account for phylogenetic inertia. These reconstructions could instead be seen as a genuine comparative regression problem, such as formalized by classical generalized least-square comparative methods, in which the trait of interest and the molecular predictor are represented as correlated Brownian characters coevolving along the phylogeny. Here, a Bayesian sampler is introduced, representing an alternative and more efficient algorithmic solution to this comparative regression problem, compared with currently existing generalized least-square approaches. Technically, ancestral trait reconstruction based on a molecular predictor is shown to be formally equivalent to a phylogenetic Kalman filter problem, for which backward and forward recursions are developed and implemented in the context of a Markov chain Monte Carlo sampler. The comparative regression method results in more accurate reconstructions and a more faithful representation of uncertainty, compared with simple linear regression. Application to the reconstruction of the evolution of optimal growth temperature in Archaea, using GC composition in ribosomal RNA stems and amino acid composition of a sample of protein-coding genes, confirms previous findings, in particular, pointing to a hyperthermophilic ancestor for the kingdom. The program is freely available at www.phylobayes.org.
Guo, Aijiang
2016-09-05
The cloacal tapeworm Cloacotaenia megalops (Hymenolepididae) is one of the most common cestode parasites of domestic and wild ducks worldwide. However, limited information is available regarding its epidemiology, biology, genetics and systematics. This study provides characterisation of the complete mitochondrial (mt) genome of C. megalops. The complete mt genome of C. megalops was obtained by long PCR, sequenced and annotated. The length of the entire mt genome of C. megalops is 13,887 bp; it contains 12 protein-coding, 2 ribosomal RNA and 22 transfer RNA genes, but lacks an atp8 gene. The mt gene arrangement of C. megalops is identical to that observed in Anoplocephala magna and A. perfoliata (Anoplocephalidae), Dipylidium caninum (Dipylidiidae) and Hymenolepis diminuta (Hymenolepididae), but differs from that reported in taeniids owing to the position shift between the tRNA (L1) and tRNA (S2) genes. The phylogenetic position of C. megalops was inferred using Maximum likelihood and Bayesian inference methods based on the concatenated amino acid data for 12 protein-coding genes. Phylogenetic trees showed that C. megalops is sister to Anoplocephala spp. (Anoplocephalidae) + Pseudanoplocephala crawfordi + Hymenolepis spp. (Hymenolepididae) indicating that the family Hymenolepididae is paraphyletic. The complete mt genome of C. megalops is sequenced. Phylogenetic analyses provided an insight into the phylogenetic relationships among the families Anoplocephalidae, Hymenolepididae, Dipylidiidae and Taeniidae. This novel genomic information also provides the opportunity to develop useful genetic markers for studying the molecular epidemiology, biology, genetics and systematics of C. megalops.
Gottschling, Marc; Soehner, Sylvia; Zinssmeister, Carmen; John, Uwe; Plötner, Jörg; Schweikert, Michael; Aligizaki, Katerina; Elbrächter, Malte
2012-01-01
The phylogenetic relationships of the Dinophyceae (Alveolata) are not sufficiently resolved at present. The Thoracosphaeraceae (Peridiniales) are the only group of the Alveolata that include members with calcareous coccoid stages; this trait is considered apomorphic. Although the coccoid stage apparently is not calcareous, Bysmatrum has been assigned to the Thoracosphaeraceae based on thecal morphology. We tested the monophyly of the Thoracosphaeraceae using large sets of ribosomal RNA sequence data of the Alveolata including the Dinophyceae. Phylogenetic analyses were performed using Maximum Likelihood and Bayesian approaches. The Thoracosphaeraceae were monophyletic, but included also a number of non-calcareous dinophytes (such as Pentapharsodinium and Pfiesteria) and even parasites (such as Duboscquodinium and Tintinnophagus). Bysmatrum had an isolated and uncertain phylogenetic position outside the Thoracosphaeraceae. The phylogenetic relationships among calcareous dinophytes appear complex, and the assumption of the single origin of the potential to produce calcareous structures is challenged. The application of concatenated ribosomal RNA sequence data may prove promising for phylogenetic reconstructions of the Dinophyceae in future. Copyright © 2011 Elsevier GmbH. All rights reserved.
Quantifying MCMC exploration of phylogenetic tree space.
Whidden, Chris; Matsen, Frederick A
2015-05-01
In order to gain an understanding of the effectiveness of phylogenetic Markov chain Monte Carlo (MCMC), it is important to understand how quickly the empirical distribution of the MCMC converges to the posterior distribution. In this article, we investigate this problem on phylogenetic tree topologies with a metric that is especially well suited to the task: the subtree prune-and-regraft (SPR) metric. This metric directly corresponds to the minimum number of MCMC rearrangements required to move between trees in common phylogenetic MCMC implementations. We develop a novel graph-based approach to analyze tree posteriors and find that the SPR metric is much more informative than simpler metrics that are unrelated to MCMC moves. In doing so, we show conclusively that topological peaks do occur in Bayesian phylogenetic posteriors from real data sets as sampled with standard MCMC approaches, investigate the efficiency of Metropolis-coupled MCMC (MCMCMC) in traversing the valleys between peaks, and show that conditional clade distribution (CCD) can have systematic problems when there are multiple peaks. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Bayesian reconstruction of transmission within outbreaks using genomic variants.
De Maio, Nicola; Worby, Colin J; Wilson, Daniel J; Stoesser, Nicole
2018-04-01
Pathogen genome sequencing can reveal details of transmission histories and is a powerful tool in the fight against infectious disease. In particular, within-host pathogen genomic variants identified through heterozygous nucleotide base calls are a potential source of information to identify linked cases and infer direction and time of transmission. However, using such data effectively to model disease transmission presents a number of challenges, including differentiating genuine variants from those observed due to sequencing error, as well as the specification of a realistic model for within-host pathogen population dynamics. Here we propose a new Bayesian approach to transmission inference, BadTrIP (BAyesian epiDemiological TRansmission Inference from Polymorphisms), that explicitly models evolution of pathogen populations in an outbreak, transmission (including transmission bottlenecks), and sequencing error. BadTrIP enables the inference of host-to-host transmission from pathogen sequencing data and epidemiological data. By assuming that genomic variants are unlinked, our method does not require the computationally intensive and unreliable reconstruction of individual haplotypes. Using simulations we show that BadTrIP is robust in most scenarios and can accurately infer transmission events by efficiently combining information from genetic and epidemiological sources; thanks to its realistic model of pathogen evolution and the inclusion of epidemiological data, BadTrIP is also more accurate than existing approaches. BadTrIP is distributed as an open source package (https://bitbucket.org/nicofmay/badtrip) for the phylogenetic software BEAST2. We apply our method to reconstruct transmission history at the early stages of the 2014 Ebola outbreak, showcasing the power of within-host genomic variants to reconstruct transmission events.
Pfeiffer, John M.; Johnson, Nathan A.; Randklev, Charles R.; Howells, Robert G.; Williams, James D.
2016-01-01
The Central Texas endemic freshwater mussel, Quadrula mitchelli (Simpson in Dall, 1896), had been presumed extinct until relict populations were recently rediscovered. To help guide ongoing and future conservation efforts focused on Q. mitchelli we set out to resolve several uncertainties regarding its evolutionary history, specifically its unknown generic position and untested species boundaries. We designed a molecular matrix consisting of two loci (cytochrome c oxidase subunit I and internal transcribed spacer I) and 57 terminal taxa to test the generic position of Q. mitchelli using Bayesian inference and maximum likelihood phylogenetic reconstruction. We also employed two Bayesian species validation methods to test five a priori species models (i.e. hypotheses of species delimitation). Our study is the first to test the generic position of Q. mitchelli and we found robust support for its inclusion in the genus Fusconaia. Accordingly, we introduce the binomial, Fusconaia mitchelli comb. nov., to accurately represent the systematic position of the species. We resolved F. mitchelli individuals in two well supported and divergent clades that were generally distinguished as distinct species using Bayesian species validation methods, although alternative hypotheses of species delineation were also supported. Despite strong evidence of genetic isolation within F. mitchelli, we do not advocate for species-level status of the two clades as they are allopatrically distributed and no morphological, behavioral, or ecological characters are known to distinguish them. These results are discussed in the context of the systematics, distribution, and conservation of F. mitchelli.
Kang, Hae Ji; Bennett, Shannon N.; Dizney, Laurie; Sumibcay, Laarni; Arai, Satoru; Ruedas, Luis A.; Song, Jin-Won; Yanagihara, Richard
2009-01-01
A genetically distinct hantavirus, designated Oxbow virus (OXBV), was detected in tissues of an American shrew mole (Neurotrichus gibbsii), captured in Gresham, Oregon, in September 2003. Pairwise analysis of full-length S- and M- and partial L-segment nucleotide and amino acid sequences of OXBV indicated low sequence similarity with rodent-borne hantaviruses. Phylogenetic analyses using maximum-likelihood and Bayesian methods, and host-parasite evolutionary comparisons, showed that OXBV and Asama virus, a hantavirus recently identified from the Japanese shrew mole (Urotrichus talpoides), were related to soricine shrew-borne hantaviruses from North America and Eurasia, respectively, suggesting parallel evolution associated with cross-species transmission. PMID:19394994
Montgelard, Claudine; Forty, Ellen; Arnal, Véronique; Matthee, Conrad A
2008-11-26
The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using approximately 7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics. Taxonomic sampling included multiple representatives of all five rodent suborders described. Fast-evolving positions for each dataset were identified individually using a discrete gamma rate category and sites belonging to the most rapidly evolving eighth gamma category were removed. Phylogenetic tree reconstructions were performed on individual and combined datasets using Parsimony, Bayesian, and partitioned Maximum Likelihood criteria. Removal of fast-evolving positions enhanced the phylogenetic signal to noise ratio but the improvement in resolution was not consistent across different data types. The results suggested that elimination of fastest sites only improved the support for nodes moderately affected by homoplasy (the deepest nodes for introns and more recent nodes for exons and mitochondrial genes). The present study based on eight DNA fragments supports a fully resolved higher level rodent phylogeny with moderate to significant nodal support. Two inter-suprafamilial associations emerged. The first comprised a monophyletic assemblage containing the Anomaluromorpha (Anomaluridae + Pedetidae) + Myomorpha (Muridae + Dipodidae) as sister clade to the Castorimorpha (Castoridae + Geomyoidea). 
The second suprafamilial clustering identified a novel association between the Sciuromorpha (Gliridae + (Sciuridae + Aplodontidae)) and the Hystricomorpha (Ctenodactylidae + Hystricognathi) which together represent the earliest dichotomy among Rodentia. Molecular time estimates using a relaxed Bayesian molecular clock date the appearance of the five suborders nearly contemporaneously at the KT boundary and this is congruent with suggestions of an early explosion of rodent diversity. Based on these newly proposed phylogenetic relationships, the evolution of the zygomasseteric pattern that has been used for a long time in rodent systematics is evaluated.
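The site-removal step described in this abstract (dropping positions assigned to the fastest of eight discrete gamma rate categories) can be sketched as follows. This is a minimal illustration that assumes the per-site category assignments are already available from some rate-estimation step; the function name `drop_fast_sites`, the toy alignment, and the category list are hypothetical and do not come from the study.

```python
def drop_fast_sites(alignment, site_categories, fastest_category):
    """Return a copy of the alignment without sites in the fastest category.

    alignment: dict mapping taxon name -> aligned sequence (equal lengths).
    site_categories: one rate-category label per alignment column.
    """
    lengths = {len(seq) for seq in alignment.values()}
    if lengths != {len(site_categories)}:
        raise ValueError("sequences and category list must have equal length")
    # Indices of the columns that are NOT in the fastest category.
    keep = [i for i, cat in enumerate(site_categories) if cat != fastest_category]
    return {
        name: "".join(seq[i] for i in keep)
        for name, seq in alignment.items()
    }


# Toy alignment; columns 4 and 8 are assumed to fall in category 8.
alignment = {
    "taxon1": "ACGTACGT",
    "taxon2": "ACGAACGA",
    "taxon3": "ACGCACGG",
}
site_categories = [1, 2, 1, 8, 3, 1, 2, 8]
filtered = drop_fast_sites(alignment, site_categories, fastest_category=8)
```

In this toy case the two removed columns are the only variable ones, so the filtered sequences become identical; with real data the trade-off between removing noise and removing signal would have to be assessed per dataset, as the abstract notes.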
2008-01-01
Background The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using ~7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics. Results Taxonomic sampling included multiple representatives of all five rodent suborders described. Fast-evolving positions for each dataset were identified individually using a discrete gamma rate category and sites belonging to the most rapidly evolving eighth gamma category were removed. Phylogenetic tree reconstructions were performed on individual and combined datasets using Parsimony, Bayesian, and partitioned Maximum Likelihood criteria. Removal of fast-evolving positions enhanced the phylogenetic signal to noise ratio but the improvement in resolution was not consistent across different data types. The results suggested that elimination of fastest sites only improved the support for nodes moderately affected by homoplasy (the deepest nodes for introns and more recent nodes for exons and mitochondrial genes). Conclusion The present study based on eight DNA fragments supports a fully resolved higher level rodent phylogeny with moderate to significant nodal support. Two inter-suprafamilial associations emerged. 
The first comprised a monophyletic assemblage containing the Anomaluromorpha (Anomaluridae + Pedetidae) + Myomorpha (Muridae + Dipodidae) as sister clade to the Castorimorpha (Castoridae + Geomyoidea). The second suprafamilial clustering identified a novel association between the Sciuromorpha (Gliridae + (Sciuridae + Aplodontidae)) and the Hystricomorpha (Ctenodactylidae + Hystricognathi) which together represent the earliest dichotomy among Rodentia. Molecular time estimates using a relaxed Bayesian molecular clock date the appearance of the five suborders nearly contemporaneously at the KT boundary and this is congruent with suggestions of an early explosion of rodent diversity. Based on these newly proposed phylogenetic relationships, the evolution of the zygomasseteric pattern that has been used for a long time in rodent systematics is evaluated. PMID:19036132
Dor, Roi; Carling, Matthew D; Lovette, Irby J; Sheldon, Frederick H; Winkler, David W
2012-10-01
The New World swallow genus Tachycineta comprises nine species that collectively have a wide geographic distribution and remarkable variation both within- and among-species in ecologically important traits. Existing phylogenetic hypotheses for Tachycineta are based on mitochondrial DNA sequences, thus they provide estimates of a single gene tree. In this study we sequenced multiple individuals from each species at 16 nuclear intron loci. We used gene concatenated approaches (Bayesian and maximum likelihood) as well as coalescent-based species tree inference to reconstruct phylogenetic relationships of the genus. We examined the concordance and conflict between the nuclear and mitochondrial trees and between concatenated and coalescent-based inferences. Our results provide an alternative phylogenetic hypothesis to the existing mitochondrial DNA estimate of phylogeny. This new hypothesis provides a more accurate framework in which to explore trait evolution and examine the evolution of the mitochondrial genome in this group. Copyright © 2012 Elsevier Inc. All rights reserved.
Phylogenetic Diversity in the Macromolecular Composition of Microalgae
Finkel, Zoe V.; Follows, Mick J.; Liefer, Justin D.; Brown, Chris M.; Benner, Ina; Irwin, Andrew J.
2016-01-01
The elemental stoichiometry of microalgae reflects their underlying macromolecular composition and influences competitive interactions among species and their role in the food web and biogeochemistry. Here we provide a new estimate of the macromolecular composition of microalgae using a hierarchical Bayesian analysis of data compiled from the literature. The median macromolecular composition of nutrient-sufficient exponentially growing microalgae is 32.2% protein, 17.3% lipid, 15.0% carbohydrate, 17.3% ash, 5.7% RNA, 1.1% chlorophyll-a and 1.0% DNA as percent dry weight. Our analysis identifies significant phylogenetic differences in macromolecular composition undetected by previous studies due to small sample sizes and the large inherent variability in macromolecular pools. The phylogenetic differences in macromolecular composition lead to variations in carbon-to-nitrogen ratios that are consistent with independent observations. These phylogenetic differences in macromolecular and elemental composition reflect adaptations in cellular architecture and biochemistry; specifically in the cell wall, the light harvesting apparatus, and storage pools. PMID:27228080
Zhang, Ziqi; Sun, Tong; Kang, Chunlan; Liu, Yang; Liu, Shaoying; Yue, Bisong; Zeng, Tao
2016-01-01
The complete mitochondrial genome sequence of Cricetulus longicaudatus (Rodentia: Cricetidae: Cricetinae) was determined and deposited in GenBank (GenBank accession no. KM067270). The mitochondrial genome of C. longicaudatus was 16,302 bp in length and contained 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes and one control region, in the same order as in other rodent mitochondrial genomes. The phylogenetic analysis was performed with Bayesian inference based on the concatenated nucleotide sequence of 12 protein-coding genes on the heavy strand. The results showed that the species from Cricetidae and from its two subfamilies (Cricetinae and Arvicolinae) each formed a well-supported monophyletic group. Cricetulus had a close phylogenetic relationship with Tscherskia among three genera (Cricetulus, Cricetulus and Mesocricetus). Neodon irene and Myodes regulus were embedded in Microtus and Eothenomys, respectively. The unusual phylogenetic positions of Neodon irene and Myodes regulus require further study.
A note on probabilistic models over strings: the linear algebra approach.
Bouchard-Côté, Alexandre
2013-12-01
Probabilistic models over strings have played a key role in developing methods that take into consideration indels as phylogenetically informative events. There is an extensive literature on using automata and transducers on phylogenies to do inference on these probabilistic models, in which an important theoretical question is the complexity of computing the normalization of a class of string-valued graphical models. This question has been investigated using tools from combinatorics, dynamic programming, and graph theory, and has practical applications in Bayesian phylogenetics. In this work, we revisit this theoretical question from a different point of view, based on linear algebra. The main contribution is a set of results based on this linear algebra view that facilitate the analysis and design of inference algorithms on string-valued graphical models. As an illustration, we use this method to give a new elementary proof of a known result on the complexity of inference on the "TKF91" model, a well-known probabilistic model over strings. Compared to previous work, our proving method is easier to extend to other models, since it relies on a novel weak condition, triangular transducers, which is easy to establish in practice. The linear algebra view provides a concise way of describing transducer algorithms and their compositions, opens the possibility of transferring fast linear algebra libraries (for example, based on GPUs), as well as low rank matrix approximation methods, to string-valued inference problems.
O'Reilly, Joseph E; Donoghue, Philip C J
2018-03-01
Consensus trees are required to summarize trees obtained through MCMC sampling of a posterior distribution, providing an overview of the distribution of estimated parameters such as topology, branch lengths, and divergence times. Numerous consensus tree construction methods are available, each presenting a different interpretation of the tree sample. The rise of morphological clock and sampled-ancestor methods of divergence time estimation, in which times and topology are coestimated, has increased the popularity of the maximum clade credibility (MCC) consensus tree method. The MCC method assumes that the sampled, fully resolved topology with the highest clade credibility is an adequate summary of the most probable clades, with parameter estimates from compatible sampled trees used to obtain the marginal distributions of parameters such as clade ages and branch lengths. Using both simulated and empirical data, we demonstrate that MCC trees, and trees constructed using the similar maximum a posteriori (MAP) method, often include poorly supported and incorrect clades when summarizing diffuse posterior samples of trees. We demonstrate that the paucity of information in morphological data sets contributes to the inability of MCC and MAP trees to accurately summarise the posterior distribution. Conversely, majority-rule consensus (MRC) trees represent a lower proportion of incorrect nodes when summarizing the same posterior samples of trees. Thus, we advocate the use of MRC trees, in place of MCC or MAP trees, in attempts to summarize the results of Bayesian phylogenetic analyses of morphological data.
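The contrast between MCC and majority-rule summaries can be illustrated with a small sketch. It assumes rooted trees encoded as nested tuples of leaf names, scores each sampled tree by the product of its clade frequencies (one common definition of clade credibility), and uses a strict greater-than-50% rule for the majority-rule clades. The function names and the four-tree sample are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import prod


def clades(tree):
    """Return the leaf sets (frozensets) of all internal nodes of a tree."""
    found = set()

    def walk(node):
        if isinstance(node, str):
            return frozenset([node])
        leaves = frozenset().union(*(walk(child) for child in node))
        found.add(leaves)
        return leaves

    walk(tree)
    return found


def clade_frequencies(trees):
    """Posterior frequency of every clade seen in the sample."""
    counts = Counter()
    for tree in trees:
        counts.update(clades(tree))
    return {clade: n / len(trees) for clade, n in counts.items()}


def majority_rule_clades(trees, threshold=0.5):
    """Clades whose sample frequency strictly exceeds the threshold."""
    return {c for c, f in clade_frequencies(trees).items() if f > threshold}


def mcc_tree(trees):
    """Sampled tree with the highest product of clade frequencies."""
    freqs = clade_frequencies(trees)
    return max(trees, key=lambda t: prod(freqs[c] for c in clades(t)))


# Hypothetical diffuse sample: the resolution inside (A, B, C) is uncertain.
t1 = ((("A", "B"), "C"), "D")
t2 = ((("A", "C"), "B"), "D")
t3 = ((("B", "C"), "A"), "D")
t4 = (("A", "B"), ("C", "D"))
sample = [t1, t2, t3, t4]

mcc = mcc_tree(sample)
mrc = majority_rule_clades(sample)
ab_support = clade_frequencies(sample)[frozenset("AB")]
```

In this toy sample the MCC tree is `t1`, which is fully resolved and therefore contains the clade (A, B) even though that clade occurs in only half of the sampled trees, whereas the majority-rule set keeps only (A, B, C) and the root and leaves the uncertain resolution as a polytomy.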
O’Reilly, Joseph E; Donoghue, Philip C J
2018-01-01
Consensus trees are required to summarize trees obtained through MCMC sampling of a posterior distribution, providing an overview of the distribution of estimated parameters such as topology, branch lengths, and divergence times. Numerous consensus tree construction methods are available, each presenting a different interpretation of the tree sample. The rise of morphological clock and sampled-ancestor methods of divergence time estimation, in which times and topology are coestimated, has increased the popularity of the maximum clade credibility (MCC) consensus tree method. The MCC method assumes that the sampled, fully resolved topology with the highest clade credibility is an adequate summary of the most probable clades, with parameter estimates from compatible sampled trees used to obtain the marginal distributions of parameters such as clade ages and branch lengths. Using both simulated and empirical data, we demonstrate that MCC trees, and trees constructed using the similar maximum a posteriori (MAP) method, often include poorly supported and incorrect clades when summarizing diffuse posterior samples of trees. We demonstrate that the paucity of information in morphological data sets contributes to the inability of MCC and MAP trees to accurately summarise the posterior distribution. Conversely, majority-rule consensus (MRC) trees represent a lower proportion of incorrect nodes when summarizing the same posterior samples of trees. Thus, we advocate the use of MRC trees, in place of MCC or MAP trees, in attempts to summarize the results of Bayesian phylogenetic analyses of morphological data. PMID:29106675
Gomes, Laise de Azevedo; Moraes, Pablo Henrique Gonçalves; do Nascimento, Luciana de Cássia Silva; O'Dwyer, Lucia Helena; Nunes, Márcio Roberto Teixeira; Rossi, Adriana Dos Reis Ponce; Aguiar, Délia Cristina Figueira; Gonçalves, Evonnildo Costa
2016-10-01
This study aimed to optimize molecular methods for detecting DNA of Hepatozoon spp. as well as identify the phylogenetic relationships of Hepatozoon strains naturally infecting domestic dogs in Belém, Pará, northern Brazil. Blood samples were collected from 138 dogs, and screened for Hepatozoon spp. using a new nested PCR assay. Positive samples were subjected to genetic characterization based on amplification and sequencing of approximately 670 bp of the Hepatozoon spp. 18S rRNA. Of the positive dogs, four shared the haplotype Belém 01, one dog presented the haplotype Belém 02 and two dogs shared the haplotype Belém 03. Bayesian inference indicates that haplotypes Belém 01 and Belém 02 are phylogenetically related to H. canis, while Belém 03 is related to H. americanum. Overall, based on the first molecular evidence of H. americanum in Brazilian domestic dogs, the proposed protocol may improve the epidemiological investigation of canine hepatozoonosis. Copyright © 2016 Elsevier GmbH. All rights reserved.
Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun
2016-07-01
Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species from the genus Harmonia with a sequenced mitochondrial genome. The current length of this mitochondrial genome, with a partial A + T-rich region, is 16,387 bp. All the typical genes were sequenced except trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no rearrangement in the sequenced region compared with the putative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with the termination codons TAA, TA and T, respectively. Phylogenetic analysis using a Bayesian method based on the first and second codon positions of the protein-coding genes supported Scirtidae as a basal lineage of Polyphaga. Harmonia and Coccinella are sister lineages. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. Buprestidae was found to be a sister group to Bostrichiformia.
Hrbek, Tomas; Stölting, Kai N; Bardakci, Fevzi; Küçük, Fahrettin; Wildekamp, Rudolf H; Meyer, Axel
2004-07-01
We investigated the phylogenetic relationships of Pseudophoxinus (Cyprinidae: Leuciscinae) species from central Anatolia, Turkey to test the hypothesis of geographic speciation driven by early Pliocene orogenic events. We analyzed 1141 aligned base pairs of the complete cytochrome b mitochondrial gene. Phylogenetic relationships reconstructed by maximum likelihood, Bayesian likelihood, and maximum parsimony methods are identical, and generally well supported. Species and clades are restricted to geologically well-defined units, and are deeply divergent from each other. The basal diversification of central Anatolian Pseudophoxinus is estimated to have occurred approximately 15 million years ago. Our results are in agreement with a previous study of the Anatolian fish genus Aphanius that also shows a diversification pattern driven by the Pliocene orogenic events. The distribution of clades of Aphanius and Pseudophoxinus overlap, and areas of distribution comprise the same geological units. The geological history of Anatolia is likely to have had a major impact on the diversification history of many taxa occupying central Anatolia; many of these taxa are likely to be still unrecognized as distinct. Copyright 2004 Elsevier Inc.
Othman, Ines; Volle, Romain; Elargoubi, Aida; Guediche, Mohamed Neji; Chakroun, Mohamed; Sfar, Mohamed Tahar; Pereira, Bruno; Peigue-Lafeuille, Hélène; Aouni, Mahjoub; Archimbaud, Christine; Bailly, Jean-Luc
2016-02-01
Acute enterovirus (EV) meningitis is a frequent cause of hospitalisation, and over 100 EV serotypes may be involved. A total of 215 patients of all ages with meningitis signs were investigated in 2 Tunisian hospitals. Their cerebrospinal fluid (CSF) was analysed retrospectively for EVs with a TaqMan real-time RT-qPCR. The virus strains were typed, and their evolutionary relationships were determined by Bayesian phylogenetic methods. An EV genome was detected in 21/215 patients (9.8%). The CSF viral loads ranged from 3.27 to 5.63 log10 genome copies/mL. The strains were identified in 13/21 patients and assigned to EV-B types. Viruses identified in Tunisian patients were genetically related to variants detected in France. The viral loads were similar in Tunisian and French patients for most EV types. The phylogenetic data and viral loads determined in Tunisian and French patients suggest that closely related EV variants were involved in aseptic meningitis in the 2 countries over the same period. Copyright © 2016 Elsevier Inc. All rights reserved.
Fossils matter: improved estimates of divergence times in Pinus reveal older diversification.
Saladin, Bianca; Leslie, Andrew B; Wüest, Rafael O; Litsios, Glenn; Conti, Elena; Salamin, Nicolas; Zimmermann, Niklaus E
2017-04-04
The taxonomy of pines (genus Pinus) is widely accepted and a robust gene tree based on entire plastome sequences exists. However, there is a large discrepancy in estimated divergence times of major pine clades among existing studies, mainly due to differences in fossil placement and dating methods used. We currently lack a dated molecular phylogeny that makes use of the rich pine fossil record, and this study is the first to estimate the divergence dates of pines based on a large number of fossils (21) evenly distributed across all major clades, in combination with applying both node and tip dating methods. We present a range of molecular phylogenetic trees of Pinus generated within a Bayesian framework. We find the origin of crown Pinus is likely up to 30 Myr older (Early Cretaceous) than inferred in most previous studies (Late Cretaceous) and propose generally older divergence times for major clades within Pinus than previously thought. Our age estimates vary significantly between the different dating approaches, but the results generally agree on older divergence times. We present a revised list of 21 fossils that are suitable to use in dating or comparative analyses of pines. Reliable estimates of divergence times in pines are essential if we are to link diversification processes and functional adaptation of this genus to geological events or to changing climates. In addition to older divergence times in Pinus, our results also indicate that node age estimates in pines depend on dating approaches and the specific fossil sets used, reflecting inherent differences in various dating approaches. The sets of dated phylogenetic trees of pines presented here provide a way to account for uncertainties in age estimations when applying comparative phylogenetic methods.
The Phylogeny of Little Red Riding Hood
Tehrani, Jamshid J.
2013-01-01
Researchers have long been fascinated by the strong continuities evident in the oral traditions associated with different cultures. According to the ‘historic-geographic’ school, it is possible to classify similar tales into “international types” and trace them back to their original archetypes. However, critics argue that folktale traditions are fundamentally fluid, and that most international types are artificial constructs. Here, these issues are addressed using phylogenetic methods that were originally developed to reconstruct evolutionary relationships among biological species, and which have been recently applied to a range of cultural phenomena. The study focuses on one of the most debated international types in the literature: ATU 333, ‘Little Red Riding Hood’. A number of variants of ATU 333 have been recorded in European oral traditions, and it has been suggested that the group may include tales from other regions, including Africa and East Asia. However, in many of these cases, it is difficult to differentiate ATU 333 from another widespread international folktale, ATU 123, ‘The Wolf and the Kids’. To shed more light on these relationships, data on 58 folktales were analysed using cladistic, Bayesian and phylogenetic network-based methods. The results demonstrate that, contrary to the claims made by critics of the historic-geographic approach, it is possible to identify ATU 333 and ATU 123 as distinct international types. They further suggest that most of the African tales can be classified as variants of ATU 123, while the East Asian tales probably evolved by blending together elements of both ATU 333 and ATU 123. These findings demonstrate that phylogenetic methods provide a powerful set of tools for testing hypotheses about cross-cultural relationships among folktales, and point towards exciting new directions for research into the transmission and evolution of oral narratives. PMID:24236061
Tomasello, Salvatore; Álvarez, Inés; Vargas, Pablo; Oberprieler, Christoph
2015-01-01
The present study provides results of multi-species coalescent species tree analyses of DNA sequences sampled from multiple nuclear and plastid regions to infer the phylogenetic relationships among the members of the subtribe Leucanthemopsidinae (Compositae, Anthemideae). Besides the annual Castrilanthemum debeauxii (Degen, Hervier & É.Rev.) Vogt & Oberp., one of the rarest flowering plant species of the Iberian Peninsula, the subtribe comprises two other unispecific genera (Hymenostemma, Prolongoa) and the polyploid complex of the genus Leucanthemopsis. Based on sequence information from two single- to low-copy nuclear regions (C16, D35, characterised by Chapman et al. (2007)), the multi-copy region of the nrDNA internal transcribed spacer regions ITS1 and ITS2, and two intergenic spacer regions of the cpDNA, gene trees were reconstructed using Bayesian inference methods. For the reconstruction of a multi-locus species tree we applied three different methods: (a) analysis of concatenated sequences using Bayesian inference (MrBayes), (b) a tree reconciliation approach by minimizing the number of deep coalescences (PhyloNet), and (c) a coalescent-based species-tree method in a Bayesian framework (*BEAST). All three species tree reconstruction methods unequivocally support the close relationship of the subtribe with the hitherto unclassified genus Phalacrocarpum, the sister-group relationship of Castrilanthemum with the three remaining genera of the subtribe, and the further sister-group relationship of the clade of Hymenostemma+Prolongoa with a monophyletic genus Leucanthemopsis.
Dating of the *BEAST phylogeny supports the long-lasting (Early Miocene, 15-22 Ma) taxonomical independence of the Castrilanthemum lineage. The switch from the plesiomorphic perennial to the apomorphic annual life-form assumed for this lineage may have occurred no earlier than the Pliocene (3 Ma), when the establishment of a Mediterranean climate with summer droughts triggered evolution towards annuality. Copyright © 2014 Elsevier Inc. All rights reserved.
Major clades of Agaricales: a multilocus phylogenetic overview.
P. Brandon Matheny; Judd M. Curtis; Valerie Hofstetter; M. Catherine Aime; Jean-Marc Moncalvo; Zai-Wei Ge; Zhu-Liang Yang; Joseph F. Ammirati; Timothy J. Baroni; Neale L. Bougher; Karen W. Lodge Hughes; Richard W. Kerrigan; Michelle T. Seidl; Duur K. Aanen; Matthew DeNitis; Graciela M. Daniele; Dennis E. Desjardin; Bradley R. Kropp; Lorelei L. Norvell; Andrew Parker; Else C. Vellinga; Rytas Vilgalys; David S. Hibbett
2006-01-01
An overview of the phylogeny of the Agaricales is presented based on a multilocus analysis of a six-gene region supermatrix. Bayesian analyses of 5611 nucleotide characters of rpb1, rpb1-intron 2, rpb2 and 18S, 25S, and 5.8S ribosomal RNA genes recovered six major clades, which are recognized informally and labeled the Agaricoid, Tricholomatoid, Marasmioid, Pluteoid,...
Recursive algorithms for phylogenetic tree counting.
Gavryushkina, Alexandra; Welch, David; Drummond, Alexei J
2013-10-28
In Bayesian phylogenetic inference we are interested in distributions over a space of trees. The number of trees in a tree space is an important characteristic of the space and is useful for specifying prior distributions. When all samples come from the same time point and no prior information is available on divergence times, the tree counting problem is easy. However, when fossil evidence is used in the inference to constrain the tree or data are sampled serially, new tree spaces arise and counting the number of trees is more difficult. We describe an algorithm that is polynomial in the number of sampled individuals for counting the resolutions of a constraint tree, assuming that the number of constraints is fixed. We generalise this algorithm to counting resolutions of a fully ranked constraint tree. We describe a quadratic algorithm for counting the number of possible fully ranked trees on n sampled individuals. We introduce a new type of tree, called a fully ranked tree with sampled ancestors, and describe a cubic time algorithm for counting the number of such trees on n sampled individuals. These algorithms should be employed for Bayesian Markov chain Monte Carlo inference when fossil data are included or data are serially sampled.
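For orientation, the "easy" case mentioned in this abstract (all samples from one time point, no constraints) has well-known closed forms. The sketch below covers only that case and is not the authors' polynomial, quadratic, or cubic algorithms; the function names are illustrative assumptions. It counts rooted binary labelled trees as (2n-3)!! and fully ranked trees (labelled histories) as n!(n-1)!/2^(n-1), and cross-checks the latter by counting one pairwise coalescence at a time.

```python
from math import comb, factorial

def rooted_binary_trees(n):
    """Number of rooted binary labelled tree topologies on n >= 2 tips: (2n-3)!!"""
    count = 1
    for k in range(3, 2 * n - 2, 2):   # multiply the odd numbers 3, 5, ..., 2n-3
        count *= k
    return count

def ranked_trees(n):
    """Number of fully ranked trees (labelled histories) on n contemporaneous tips."""
    return factorial(n) * factorial(n - 1) // 2 ** (n - 1)

def ranked_trees_recursive(n):
    """Same count built up going back in time: choose which pair of lineages merges next."""
    count = 1
    for k in range(n, 1, -1):          # k lineages remain, comb(k, 2) possible merges
        count *= comb(k, 2)
    return count

for n in range(2, 8):
    print(n, rooted_binary_trees(n), ranked_trees(n))
```

Ranked trees outnumber unranked topologies from n = 4 onward (18 versus 15), because a single topology can admit several orderings of its internal nodes.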
Inoue, Jun G; Kumazawa, Yoshinori; Miya, Masaki; Nishida, Mutsumi
2009-06-01
The continental distributions of freshwater fishes in the family Notopteridae (Osteoglossomorpha) across Africa, India, and Southeast Asia constitute a long standing and enigmatic problem of freshwater biogeography. The migrational pathway of the Asian notopterids has been discussed in light of two competing schemes: the first posits recent transcontinental dispersal while the second relies on distributions being shaped by ancient vicariance associated with plate-tectonic events. In this study, we determined complete mitochondrial DNA sequences from 10 osteoglossomorph fishes to estimate phylogenetic relationships using partitioned Bayesian and maximum likelihood methods and divergence dates of the family Notopteridae with a partitioned Bayesian approach. We used six species representing the major lineages of the Notopteridae and seven species from the remaining osteoglossomorph families. Fourteen more-derived teleosts, nine basal actinopterygians, two coelacanths, and one shark were used as outgroups. Phylogenetic analyses indicated that the African and Asian notopterids formed a sister group to each other and that these notopterids were a sister to a clade comprising two African families (Mormyridae and Gymnarchidae). Estimated divergence time between the African and Asian notopterids dated back to the early Cretaceous when India-Madagascar separated from the African part of Gondwanaland. Thus, estimated time of divergence based on the molecular evidence is at odds with the recent dispersal model. It can be reconciled with the geological and paleontological evidence to support the vicariance model in which the Asian notopterids diverged from the African notopterids in Gondwanaland and migrated into Eurasia on the Indian subcontinent from the Cretaceous to the Tertiary. 
However, we could not exclude an alternative explanation that the African and Asian notopterids diverged in Pangea before its complete separation into Laurasia and Gondwanaland, to which these two lineages were later confined, respectively.
Phylogeny of the Asian spiny frog tribe Paini (Family Dicroglossidae) sensu Dubois.
Che, Jing; Hu, Jian-sheng; Zhou, Wei-wei; Murphy, Robert W; Papenfuss, Theodore J; Chen, Ming-yong; Rao, Ding-qi; Li, Pi-peng; Zhang, Ya-ping
2009-01-01
The anuran tribe Paini, family Dicroglossidae, is known in this group only from Asia. The phylogenetic relationships and often the taxonomic recognition of species are controversial. In order to stabilize the classification, we used approximately 2100 bp of nuclear (rhodopsin, tyrosinase) and mitochondrial (12S, 16S rRNA) DNA sequence data to infer the phylogenetic relationships of these frogs. Phylogenetic trees reconstructed using Bayesian inference and maximum parsimony methods supported a monophyletic tribe Paini. Two distinct groups (I,II) were recovered with the mtDNA alone and the total concatenated data (mtDNA+nuDNA). The recognition of two genera, Quasipaa and Nanorana, was supported. Group I, Quasipaa, is widespread east of the Hengduan Mountain Ranges and consists of taxa from relatively low elevations in southern China, Vietnam and Laos. Group II, Nanorana, contains a mix of species occurring from high to low elevation predominantly in the Qinghai-Tibetan Plateau and Hengduan Mountain Ranges. The occurrence of frogs at high elevations appears to be a derived ecological condition. The composition of some major species groups based on morphological characteristics strongly conflicts with the molecular analysis. Some possible cryptic species are indicated by the molecular analyses. The incorporation of genetic data from type localities helped to resolve some of the taxonomic problems, although further combined analyses of morphological data from type specimens are required. The two nuDNA gene segments proved to be very informative for resolving higher phylogenetic relationships and more nuclear data should be explored to be more confident in the relationships.
Tosh, J.; Dessein, S.; Buerki, S.; Groeninckx, I.; Mouly, A.; Bremer, B.; Smets, E. F.; De Block, P.
2013-01-01
Background and Aims Previous work on the pantropical genus Ixora has revealed an Afro-Madagascan clade, but as yet no study has focused in detail on the evolutionary history and morphological trends in this group. Here the evolutionary history of Afro-Madagascan Ixora spp. (a clade of approx. 80 taxa) is investigated and the phylogenetic trees compared with several key morphological traits in taxa occurring in Madagascar. Methods Phylogenetic relationships of Afro-Madagascan Ixora are assessed using sequence data from four plastid regions (petD, rps16, rpoB-trnC and trnL-trnF) and nuclear ribosomal external transcribed spacer (ETS) and internal transcribed spacer (ITS) regions. The phylogenetic distribution of key morphological characters is assessed. Bayesian inference (implemented in BEAST) is used to estimate the temporal origin of Ixora based on fossil evidence. Key Results Two separate lineages of Madagascan taxa are recovered, one of which is nested in a group of East African taxa. Divergence in Ixora is estimated to have commenced during the mid Miocene, with extensive cladogenesis occurring in the Afro-Madagascan clade during the Pliocene onwards. Conclusions Both lineages of Madagascan Ixora exhibit morphological innovations that are rare throughout the rest of the genus, including a trend towards pauciflorous inflorescences and a trend towards extreme corolla tube length, suggesting that the same ecological and selective pressures are acting upon taxa from both Madagascan lineages. Novel ecological opportunities resulting from climate-induced habitat fragmentation and corolla tube length diversification are likely to have facilitated species radiation on Madagascar. PMID:24142919
Carotenuto, Francesco; Diniz-Filho, José Alexandre F.
2016-01-01
Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes—speciation, extinction and dispersal—in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. PMID:26977061
Inferring 'weak spots' in phylogenetic trees: application to mosasauroid nomenclature.
Madzia, Daniel; Cau, Andrea
2017-01-01
Mosasauroid squamates represented the apex predators within the Late Cretaceous marine and occasionally also freshwater ecosystems. Proper understanding of the origin of their ecological adaptations or paleobiogeographic dispersals requires adequate knowledge of their phylogeny. The studies assessing the position of mosasauroids on the squamate evolutionary tree and their origins have long given conflicting results. The phylogenetic relationships within Mosasauroidea, however, have changed only slightly over the last decades. Considering the substantial improvements in phylogenetic methodology in recent years, which have resulted, among other things, in numerous alterations to the phylogenetic hypotheses of other fossil amniotes, we test the robustness of our understanding of mosasauroid beginnings and their evolutionary history. We re-examined a data set that results from modifications assembled in the course of the last 20 years and performed multiple parsimony analyses and Bayesian tip-dating analysis. Following the inferred topologies and the 'weak spots' in the phylogeny of mosasauroids, we revise the nomenclature of the 'traditionally' recognized mosasauroid clades, to acknowledge the overall weakness among branches and the alternative topologies suggested previously, and discuss several factors that might have an impact on the differing phylogenetic hypotheses and their statistical support.
Villalobos, Fabricio; Carotenuto, Francesco; Raia, Pasquale; Diniz-Filho, José Alexandre F
2016-04-05
Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes--speciation, extinction and dispersal--in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. © 2016 The Author(s).
Inferring ‘weak spots’ in phylogenetic trees: application to mosasauroid nomenclature
2017-01-01
Mosasauroid squamates represented the apex predators within the Late Cretaceous marine and occasionally also freshwater ecosystems. Proper understanding of the origin of their ecological adaptations or paleobiogeographic dispersals requires adequate knowledge of their phylogeny. The studies assessing the position of mosasauroids on the squamate evolutionary tree and their origins have long given conflicting results. The phylogenetic relationships within Mosasauroidea, however, have changed only slightly over the last decades. Considering the substantial improvements in phylogenetic methodology in recent years, which have resulted, among other things, in numerous alterations to the phylogenetic hypotheses of other fossil amniotes, we test the robustness of our understanding of mosasauroid beginnings and their evolutionary history. We re-examined a data set that results from modifications assembled in the course of the last 20 years and performed multiple parsimony analyses and Bayesian tip-dating analysis. Following the inferred topologies and the ‘weak spots’ in the phylogeny of mosasauroids, we revise the nomenclature of the ‘traditionally’ recognized mosasauroid clades, to acknowledge the overall weakness among branches and the alternative topologies suggested previously, and discuss several factors that might have an impact on the differing phylogenetic hypotheses and their statistical support. PMID:28929018
de Oliveira Bünger, Mariana; Fernanda Mazine, Fiorella; Forest, Félix; Leandro Bueno, Marcelo; Renato Stehmann, João; Lucas, Eve J
2016-12-01
Eugenia sect. Phyllocalyx Nied. includes 14 species endemic to the Neotropics, mostly distributed in the Atlantic coastal forests of Brazil. Here the first comprehensive phylogenetic study of this group is presented, and this phylogeny is used as the basis to evaluate the recent infrageneric classification in Eugenia sensu lato (s.l.) to test the history of the evolution of traits in the group and test hypotheses associated with the history of this clade. A total of 42 taxa were sampled, of which 14 were Eugenia sect. Phyllocalyx for one nuclear (ribosomal internal transcribed spacer) and four plastid markers (psbA-trnH, rpl16, trnL-rpl32 and trnQ-rps16). The relationships were reconstructed based on Bayesian analysis and maximum likelihood. Additionally, ancestral area analysis and modelling methods were used to estimate species dispersal, comparing historically climatic stable (refuges) and unstable areas. Maximum likelihood and Bayesian inferences indicate that Eugenia sect. Phyllocalyx is paraphyletic and the two clades recovered are characterized by combinations of morphological characters. Phylogenetic relationships support a link between Cerrado and south-eastern species and a difference in the composition of species from north-eastern and south-eastern Atlantic forest. Refugia and stable areas identified within unstable areas suggest that these areas were important to maintain diversity in the Atlantic forest biodiversity hotspot. This study provides a robust phylogenetic framework to address important historical questions for Eugenia s.l. within an evolutionary context, supporting the need for better taxonomic study of one of the largest genera in the Neotropics. Furthermore, valuable insight is offered into diversification and biome shifts of plant species in the highly environmentally impacted Atlantic forest of South America. 
Evidence is presented that climate stability in the south-eastern Atlantic forest during the Quaternary contributed to the highest levels of plant diversity in this region that acted as a refugium. © The Authors 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved.
Koepfli, Klaus-Peter; Deere, Kerry A; Slater, Graham J; Begg, Colleen; Begg, Keith; Grassman, Lon; Lucherini, Mauro; Veron, Geraldine; Wayne, Robert K
2008-01-01
Background Adaptive radiation, the evolution of ecological and phenotypic diversity from a common ancestor, is a central concept in evolutionary biology and characterizes the evolutionary histories of many groups of organisms. One such group is the Mustelidae, the most species-rich family within the mammalian order Carnivora, encompassing 59 species classified into 22 genera. Extant mustelids display extensive ecomorphological diversity, with different lineages having evolved into an array of adaptive zones, from fossorial badgers to semi-aquatic otters. Mustelids are also widely distributed, with multiple genera found on different continents. As with other groups that have undergone adaptive radiation, resolving the phylogenetic history of mustelids presents a number of challenges because ecomorphological convergence may potentially confound morphologically based phylogenetic inferences, and because adaptive radiations often include one or more periods of rapid cladogenesis that require a large amount of data to resolve. Results We constructed a nearly complete generic-level phylogeny of the Mustelidae using a data matrix comprising 22 gene segments (~12,000 base pairs) analyzed with maximum parsimony, maximum likelihood and Bayesian inference methods. We show that mustelids are consistently resolved with high nodal support into four major clades and three monotypic lineages. Using Bayesian dating techniques, we provide evidence that mustelids underwent two bursts of diversification that coincide with major paleoenvironmental and biotic changes that occurred during the Neogene and correspond with similar bursts of cladogenesis in other vertebrate groups. Biogeographical analyses indicate that most of the extant diversity of mustelids originated in Eurasia and mustelids have colonized Africa, North America and South America on multiple occasions. 
Conclusion Combined with information from the fossil record, our phylogenetic and dating analyses suggest that mustelid diversification may have been spurred by a combination of faunal turnover events and diversification at lower trophic levels, ultimately caused by climatically driven environmental changes. Our biogeographic analyses show Eurasia as the center of origin of mustelid diversity and that mustelids in Africa, North America and South America have been assembled over time largely via dispersal, which has important implications for understanding the ecology of mustelid communities. PMID:18275614
A supermatrix analysis of genomic, morphological, and paleontological data from crown Cetacea
2011-01-01
Background Cetacea (dolphins, porpoises, and whales) is a clade of aquatic species that includes the most massive, deepest diving, and largest brained mammals. Understanding the temporal pattern of diversification in the group as well as the evolution of cetacean anatomy and behavior requires a robust and well-resolved phylogenetic hypothesis. Although a large body of molecular data has accumulated over the past 20 years, DNA sequences of cetaceans have not been directly integrated with the rich, cetacean fossil record to reconcile discrepancies among molecular and morphological characters. Results We combined new nuclear DNA sequences, including segments of six genes (~2800 basepairs) from the functionally extinct Yangtze River dolphin, with an expanded morphological matrix and published genomic data. Diverse analyses of these data resolved the relationships of 74 taxa that represent all extant families and 11 extinct families of Cetacea. The resulting supermatrix (61,155 characters) and its sub-partitions were analyzed using parsimony methods. Bayesian and maximum likelihood (ML) searches were conducted on the molecular partition, and a molecular scaffold obtained from these searches was used to constrain a parsimony search of the morphological partition. Based on analysis of the supermatrix and model-based analyses of the molecular partition, we found overwhelming support for 15 extant clades. When extinct taxa are included, we recovered trees that are significantly correlated with the fossil record. These trees were used to reconstruct the timing of cetacean diversification and the evolution of characters shared by "river dolphins," a non-monophyletic set of species according to all of our phylogenetic analyses. Conclusions The parsimony analysis of the supermatrix and the analysis of morphology constrained to fit the ML/Bayesian molecular tree yielded broadly congruent phylogenetic hypotheses. 
In trees from both analyses, all Oligocene taxa included in our study fell outside crown Mysticeti and crown Odontoceti, suggesting that these two clades radiated in the late Oligocene or later, contra some recent molecular clock studies. Our trees also imply that many character states shared by river dolphins evolved in their oceanic ancestors, contradicting the hypothesis that these characters are convergent adaptations to fluvial habitats. PMID:21518443
A supermatrix analysis of genomic, morphological, and paleontological data from crown Cetacea.
Geisler, Jonathan H; McGowen, Michael R; Yang, Guang; Gatesy, John
2011-04-25
Cetacea (dolphins, porpoises, and whales) is a clade of aquatic species that includes the most massive, deepest diving, and largest brained mammals. Understanding the temporal pattern of diversification in the group as well as the evolution of cetacean anatomy and behavior requires a robust and well-resolved phylogenetic hypothesis. Although a large body of molecular data has accumulated over the past 20 years, DNA sequences of cetaceans have not been directly integrated with the rich, cetacean fossil record to reconcile discrepancies among molecular and morphological characters. We combined new nuclear DNA sequences, including segments of six genes (~2800 basepairs) from the functionally extinct Yangtze River dolphin, with an expanded morphological matrix and published genomic data. Diverse analyses of these data resolved the relationships of 74 taxa that represent all extant families and 11 extinct families of Cetacea. The resulting supermatrix (61,155 characters) and its sub-partitions were analyzed using parsimony methods. Bayesian and maximum likelihood (ML) searches were conducted on the molecular partition, and a molecular scaffold obtained from these searches was used to constrain a parsimony search of the morphological partition. Based on analysis of the supermatrix and model-based analyses of the molecular partition, we found overwhelming support for 15 extant clades. When extinct taxa are included, we recovered trees that are significantly correlated with the fossil record. These trees were used to reconstruct the timing of cetacean diversification and the evolution of characters shared by "river dolphins," a non-monophyletic set of species according to all of our phylogenetic analyses. The parsimony analysis of the supermatrix and the analysis of morphology constrained to fit the ML/Bayesian molecular tree yielded broadly congruent phylogenetic hypotheses. 
In trees from both analyses, all Oligocene taxa included in our study fell outside crown Mysticeti and crown Odontoceti, suggesting that these two clades radiated in the late Oligocene or later, contra some recent molecular clock studies. Our trees also imply that many character states shared by river dolphins evolved in their oceanic ancestors, contradicting the hypothesis that these characters are convergent adaptations to fluvial habitats.
Guillon, Myrtille; Mace, Ruth
2016-01-01
The classification of kin into structured groups is a diverse phenomenon which is ubiquitous in human culture. For populations which are organized into large agropastoral groupings of sedentary residence but not governed within the context of a centralised state, such as our study sample of 83 historical Bantu-speaking groups of sub-Saharan Africa, cultural kinship norms guide all aspects of everyday life and social organization. Such rules operate in part through the use of differing terminological referential systems of familial organization. Although the cross-cultural study of kinship terminology was foundational in Anthropology, few modern studies have made use of statistical advances to further our sparse understanding of the structuring and diversification of terminological systems of kinship over time. In this study we use Bayesian Markov Chain Monte Carlo methods of phylogenetic comparison to investigate the evolution of Bantu kinship terminology and reconstruct the ancestral state and diversification of cousin terminology in this family of sub-Saharan ethnolinguistic groups. Using a phylogenetic tree of Bantu languages, we then test the prominent hypothesis that structured variation in systems of cousin terminology has co-evolved alongside adaptive change in patterns of descent organization, as well as rules of residence. We find limited support for this hypothesis, and argue that the shaping of systems of kinship terminology is a multifactorial process, concluding with possible avenues of future research. PMID:27008364
Weisrock, David W; Macey, J Robert; Matsui, Masafumi; Mulcahy, Daniel G; Papenfuss, Theodore J
2013-01-01
The salamander family Hynobiidae contains over 50 species and has been the subject of a number of molecular phylogenetic investigations aimed at reconstructing branches across the entire family. In general, studies using the greatest amount of sequence data have used reduced taxon sampling, while the study with the greatest taxon sampling has used a limited sequence data set. Here, we provide insights into the phylogenetic history of the Hynobiidae using both dense taxon sampling and a large mitochondrial DNA sequence data set. We report exclusive new mitochondrial DNA data of 2566 aligned bases (151 sites excluded; of the included sites, 1157 are variable and 957 are parsimony informative). This is sampled from two genic regions encoding a 12S-16S region (the 3' end of 12S rRNA, tRNA(Val), and the 5' end of 16S rRNA), and a ND2-COI region (ND2, tRNA(Trp), tRNA(Ala), tRNA(Asn), the origin for light strand replication--O(L), tRNA(Cys), tRNA(Tyr), and the 5' end of COI). Analyses using parsimony, Bayesian, and maximum likelihood optimality criteria produce similar phylogenetic trees, with discordant branches generally receiving low levels of branch support. Monophyly of the Hynobiidae is strongly supported across all analyses, as is the sister relationship and deep divergence between the genus Onychodactylus and all remaining hynobiids. Within this latter grouping our phylogenetic results identify six clades that are relatively divergent from one another, but for which there is minimal support for their phylogenetic placement. This includes the genus Batrachuperus, the genus Hynobius, the genus Pachyhynobius, the genus Salamandrella, a clade containing the genera Ranodon and Paradactylodon, and a clade containing the genera Liua and Pseudohynobius. This latter clade receives low bootstrap support in the parsimony analysis, but is consistent across all three analytical methods. 
Our results also clarify a number of well-supported relationships within the larger Batrachuperus and Hynobius clades. While the relationships identified in this study do much to clarify the phylogenetic history of the Hynobiidae, the poor resolution among major hynobiid clades, and the contrast of mtDNA-derived relationships with recent phylogenetic results from a small number of nuclear genes, highlight the need for continued phylogenetic study with larger numbers of nuclear loci.
Phylogeny of sipunculan worms: A combined analysis of four gene regions and morphology.
Schulze, Anja; Cutler, Edward B; Giribet, Gonzalo
2007-01-01
The intra-phyletic relationships of sipunculan worms were analyzed based on DNA sequence data from four gene regions and 58 morphological characters. Initially we analyzed the data under direct optimization using parsimony as optimality criterion. An implied alignment resulting from the direct optimization analysis was subsequently utilized to perform a Bayesian analysis with mixed models for the different data partitions. For this we applied a doublet model for the stem regions of the 18S rRNA. Both analyses support monophyly of Sipuncula and most of the same clades within the phylum. The analyses differ with respect to the relationships among the major groups but whereas the deep nodes in the direct optimization analysis generally show low jackknife support, they are supported by 100% posterior probability in the Bayesian analysis. Direct optimization has been useful for handling sequences of unequal length and generating conservative phylogenetic hypotheses whereas the Bayesian analysis under mixed models provided high resolution in the basal nodes of the tree.
Li, Xi; Jang, Tae-Soo; Temsch, Eva M; Kato, Hidetoshi; Takayama, Koji; Schneeweiss, Gerald M
2017-03-01
Molecular phylogenetic studies have greatly improved our understanding of phylogenetic relationships of non-photosynthetic parasitic broomrapes (Orobanche and related genera, Orobanchaceae), but a few genera have remained unstudied. One of those is Platypholis, whose sole species, Platypholis boninsimae, is restricted to the Bonin Islands (Ogasawara Islands) about 1000 km southeast of Japan. Based on overall morphological similarity, Platypholis has been merged with Orobanche, but this hypothesis has never been tested with molecular data. Employing maximum likelihood and Bayesian analyses on a family-wide data set (two plastid markers, matK and rps2, and three nuclear markers, ITS, phyA and phyB) as well as on an ITS data set focusing on Orobanche s. str., it is shown that P. boninsimae Maxim. is phylogenetically closely linked to or even nested within Orobanche s. str. This position is supported both by morphological evidence and by the newly obtained chromosome number of 2n = 38, which is characteristic of the genus Orobanche s. str.
Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W.; Samuel, Rosabelle
2016-01-01
Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. 
Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with plastid DNA did not help to resolve the phylogenetic tree because the total number of variable sites was much lower than in the entire plastid genome. The geographical clustering of the individuals against a background of overall low sequence divergence could indicate transfer of plastid genomes due to hybridization and introgression following secondary contact. PMID:27098088
Adaptive MCMC in Bayesian phylogenetics: an application to analyzing partitioned data in BEAST.
Baele, Guy; Lemey, Philippe; Rambaut, Andrew; Suchard, Marc A
2017-06-15
Advances in sequencing technology continue to deliver increasingly large molecular sequence datasets that are often heavily partitioned in order to accurately model the underlying evolutionary processes. In phylogenetic analyses, partitioning strategies involve estimating conditionally independent models of molecular evolution for different genes and different positions within those genes, which requires estimating a large number of evolutionary parameters and increases the computational burden of such analyses. The past two decades have also seen the rise of multi-core processors, both in the central processing unit (CPU) and graphics processing unit (GPU) markets, enabling massively parallel computations that are not yet fully exploited by many software packages for multipartite analyses. We here propose a Markov chain Monte Carlo (MCMC) approach using an adaptive multivariate transition kernel to estimate in parallel a large number of parameters, split across partitioned data, by exploiting multi-core processing. Across several real-world examples, we demonstrate that our approach enables the estimation of these multipartite parameters more efficiently than standard approaches that typically use a mixture of univariate transition kernels. In one case, when estimating the relative rate parameter of the non-coding partition in a heterochronous dataset, MCMC integration efficiency improves by > 14-fold. Our implementation is part of the BEAST code base, a widely used open source software package to perform Bayesian phylogenetic inference.
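The contrast between univariate kernels and an adaptive multivariate kernel can be illustrated with a small sketch. The code below is not the BEAST implementation; it is a generic adaptive Metropolis sampler in plain Python, assuming a Gaussian random-walk proposal whose covariance is learned from the chain history. The function names (`cholesky`, `adaptive_metropolis`) and all tuning parameters (`adapt_start`, `init_sd`, `eps`) are hypothetical choices made for illustration, and the sketch is serial, so it does not show the multi-core aspect described in the abstract.

```python
import math
import random


def cholesky(a):
    """Lower-triangular Cholesky factor of a small symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low


def adaptive_metropolis(log_target, x0, n_iter, adapt_start=200,
                        init_sd=0.1, eps=1e-6, seed=1):
    """Random-walk Metropolis with a joint Gaussian proposal whose covariance
    is estimated from past states (illustrative helper, not BEAST code)."""
    rng = random.Random(seed)
    d = len(x0)
    scale = 2.38 ** 2 / d               # common scaling heuristic (assumption)
    x, lp = list(x0), log_target(x0)
    n = 1                               # states included in the running statistics
    mean = list(x0)
    m2 = [[0.0] * d for _ in range(d)]  # running sums of deviation products
    samples, accepted = [], 0
    for _ in range(n_iter):
        if n > adapt_start:
            # Empirical covariance of past states plus a small ridge for stability.
            cov = [[scale * m2[i][j] / (n - 1) + (scale * eps if i == j else 0.0)
                    for j in range(d)] for i in range(d)]
        else:
            # Fixed diagonal proposal until enough history has accumulated.
            cov = [[init_sd ** 2 if i == j else 0.0 for j in range(d)]
                   for i in range(d)]
        low = cholesky(cov)
        z = [rng.gauss(0.0, 1.0) for _ in range(d)]
        y = [x[i] + sum(low[i][k] * z[k] for k in range(i + 1)) for i in range(d)]
        lp_y = log_target(y)
        # The proposal is symmetric, so only the target ratio enters.
        if math.log(rng.random() + 1e-300) < lp_y - lp:
            x, lp = y, lp_y
            accepted += 1
        # Welford-style update of the running mean and deviation sums.
        n += 1
        delta = [x[i] - mean[i] for i in range(d)]
        mean = [mean[i] + delta[i] / n for i in range(d)]
        for i in range(d):
            for j in range(d):
                m2[i][j] += delta[i] * (x[j] - mean[j])
        samples.append(list(x))
    return samples, accepted / n_iter
```

Calling `adaptive_metropolis(log_target, [0.0, 0.0], 5000)` with a strongly correlated two-parameter target returns the sampled states and the acceptance rate. Proposing all parameters jointly lets the kernel follow correlations that separate univariate moves would have to cross in many small steps.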
Graf, Daniel L; Jones, Hugh; Geneva, Anthony J; Pfeiffer, John M; Klunzinger, Michael W
2015-04-01
The freshwater mussel family Hyriidae (Mollusca: Bivalvia: Unionida) has a disjunct trans-Pacific distribution in Australasia and South America. Previous phylogenetic analyses have estimated the evolutionary relationships of the family and the major infra-familial taxa (Velesunioninae and Hyriinae: Hyridellini in Australia; Hyriinae: Hyriini, Castaliini, and Rhipidodontini in South America), but taxon and character sampling have been too incomplete to support a predictive classification or allow testing of biogeographical hypotheses. We sampled 30 freshwater mussel individuals representing the aforementioned hyriid taxa, as well as outgroup species representing the five other freshwater mussel families and their marine sister group (order Trigoniida). Our ingroup included representatives of all Australian genera. Phylogenetic relationships were estimated from three gene fragments (nuclear 28S, COI and 16S mtDNA) using maximum parsimony, maximum likelihood, and Bayesian inference, and we applied a Bayesian relaxed clock model calibrated with fossil dates to estimate node ages. Our analyses found good support for monophyly of the Hyriidae and the subfamilies and tribes, as well as the paraphyly of the Australasian taxa (Velesunioninae, (Hyridellini, (Rhipidodontini, (Castaliini, Hyriini)))). The Hyriidae was recovered as sister to a clade comprised of all other Recent freshwater mussel families. Our molecular date estimation supported Cretaceous origins of the major hyriid clades, pre-dating the Tertiary isolation of South America from Antarctica/Australia. We hypothesize that early diversification of the Hyriidae was driven by terrestrial barriers on Gondwana rather than marine barriers following disintegration of the super-continent.
Zhao, Zhe; Su, Tian-juan; Chesters, Douglas; Wang, Shi-di; Ho, Simon Y. W.; Zhu, Chao-dong; Chen, Xiao-lin; Zhang, Chun-tian
2013-01-01
Tachinid flies are natural enemies of many lepidopteran and coleopteran pests of forests, crops, and fruit trees. In order to address the lack of genetic data in this economically important group, we sequenced the complete mitochondrial genome of the Palaearctic tachinid fly Elodia flavipalpis Aldrich, 1933. Usually found in Northern China and Japan, this species is one of the primary natural enemies of the leaf-roller moths (Tortricidae), which are major pests of various fruit trees. The 14,932-bp mitochondrial genome was typical of Diptera, with 13 protein-coding genes, 22 tRNA genes, and 2 rRNA genes. However, its control region is only 105 bp in length, which is the shortest found so far in flies. In order to estimate dipteran evolutionary relationships, we conducted a phylogenetic analysis of 58 mitochondrial genomes from 23 families. Maximum-likelihood and Bayesian methods supported the monophyly of both Tachinidae and superfamily Oestroidea. Within the subsection Calyptratae, Muscidae was inferred as the sister group to Oestroidea. Within Oestroidea, Calliphoridae and Sarcophagidae formed a sister clade to Oestridae and Tachinidae. Using a Bayesian relaxed clock calibrated with fossil data, we estimated that Tachinidae originated in the middle Eocene. PMID:23626734
Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences
Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi
2015-01-01
Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera. PMID:25704094
Bärmann, Eva Verena; Rössner, Gertrud Elisabeth; Wörheide, Gert
2013-05-01
Antilopini (gazelles and their allies) are one of the most diverse but phylogenetically controversial groups of bovids. Here we provide a molecular phylogeny of this poorly understood taxon using combined analyses of mitochondrial (CYTB, COIII, 12S, 16S) and nuclear (KCAS, SPTBN1, PRKCI, MC1R, THYR) genes. We explore the influence of data partitioning and different analytical methods, including Bayesian inference, maximum likelihood and maximum parsimony, on the inferred relationships within Antilopini. We achieve increased resolution and support compared to previous analyses especially in the two most problematic parts of their tree. First, taxa commonly referred to as "gazelles" are recovered as paraphyletic, as the genus Gazella appears more closely related to the Indian blackbuck (Antilope cervicapra) than to the other two gazelle genera (Nanger and Eudorcas). Second, we recovered a strongly supported sister relationship between one of the dwarf antelopes (Ourebia) and the Antilopini subgroup Antilopina (Saiga, Gerenuk, Springbok, Blackbuck and gazelles). The assessment of the influence of taxon sampling, outgroup rooting, and data partitioning in Bayesian analyses helps explain the contradictory results of previous studies.
Yu, Farong; Yu, Fahong; Pang, Junfeng; Kilpatrick, C William; McGuire, Peter M; Wang, Yingxiang; Lu, Shunqing; Woods, Charles A
2006-03-01
With modified DNA extraction and purification protocols, the complete cytochrome b gene sequences (1140 bp) were determined from degraded museum specimens. Molecular analysis and morphological examination of cranial characteristics of the giant flying squirrels of Petaurista philippensis complex (P. grandis, P. hainana, and P. yunanensis) and other Petaurista species yielded new insights into long-standing controversies in the Petaurista systematics. Patterns of genetic variations and morphological differences observed in this study indicate that P. hainana, P. albiventer, and P. yunanensis can be recognized as distinct species, and P. grandis and P. petaurista are conspecific populations. Phylogenetic relationships reconstructed by using parsimony, likelihood, and Bayesian methods reveal that, with P. leucogenys as the basal branch, all Petaurista groups formed two distinct clades. Petaurista philippensis, P. hainana, P. yunanensis, and P. albiventer are clustered in the same clade, while P. grandis shows a close relationship to P. petaurista. Deduced divergence times based on Bayesian analysis and the transversional substitution at the third codon suggest that the retreating of glaciers and upheavals or movements of tectonic plates in the Pliocene-Pleistocene were the major factors responsible for the present geographical distributions of Petaurista groups.
On the Adequacy of Bayesian Evaluations of Categorization Models: Reply to Vanpaemel and Lee (2012)
ERIC Educational Resources Information Center
Wills, Andy J.; Pothos, Emmanuel M.
2012-01-01
Vanpaemel and Lee (2012) argued, and we agree, that the comparison of formal models can be facilitated by Bayesian methods. However, Bayesian methods neither precede nor supplant our proposals (Wills & Pothos, 2012), as Bayesian methods can be applied both to our proposals and to their polar opposites. Furthermore, the use of Bayesian methods to…
Bayesian data analysis in population ecology: motivations, methods, and benefits
Dorazio, Robert
2016-01-01
During the 20th century ecologists largely relied on the frequentist system of inference for the analysis of their data. However, in the past few decades ecologists have become increasingly interested in the use of Bayesian methods of data analysis. In this article I provide guidance to ecologists who would like to decide whether Bayesian methods can be used to improve their conclusions and predictions. I begin by providing a concise summary of Bayesian methods of analysis, including a comparison of differences between Bayesian and frequentist approaches to inference when using hierarchical models. Next I provide a list of problems where Bayesian methods of analysis may arguably be preferred over frequentist methods. These problems are usually encountered in analyses based on hierarchical models of data. I describe the essentials required for applying modern methods of Bayesian computation, and I use real-world examples to illustrate these methods. I conclude by summarizing what I perceive to be the main strengths and weaknesses of using Bayesian methods to solve ecological inference problems.
Toussaint, Emmanuel F A; Morinière, Jérôme; Müller, Chris J; Kunte, Krushnamegh; Turlin, Bernard; Hausmann, Axel; Balke, Michael
2015-10-01
The charismatic tropical Polyura Nawab butterflies are distributed across twelve biodiversity hotspots in the Indomalayan/Australasian archipelago. In this study, we tested an array of species delimitation methods and compared the results to existing morphology-based taxonomy. We sequenced two mitochondrial and two nuclear gene fragments to reconstruct phylogenetic relationships within Polyura using both Bayesian inference and maximum likelihood. Based on this phylogenetic framework, we used the recently introduced bGMYC, BPP and PTP methods to investigate species boundaries. Based on our results, we describe two new species, Polyura paulettae Toussaint sp. n. and Polyura smilesi Toussaint sp. n., propose one synonym, and raise five populations to species status. Most of the newly recognized species are single-island endemics likely resulting from the recent highly complex geological history of the Indomalayan-Australasian archipelago. Surprisingly, we also find two newly recognized species in the Indomalayan region where additional biotic or abiotic factors have fostered speciation. Species delimitation methods were largely congruent and succeeded in cross-validating most extant morphological species. PTP and BPP seem to yield more consistent and robust estimations of species boundaries with respect to morphological characters while bGMYC delivered contrasting results depending on the different gene trees considered. Our findings demonstrate the efficiency of comparative approaches using molecular species delimitation methods on empirical data. They also pave the way for the investigation of less well-known groups to unveil patterns of species richness and catalogue Earth's concealed, therefore unappreciated diversity.
Bayesian demography 250 years after Bayes
Bijak, Jakub; Bryant, John
2016-01-01
Bayesian statistics offers an alternative to classical (frequentist) statistics. It is distinguished by its use of probability distributions to describe uncertain quantities, which leads to elegant solutions to many difficult statistical problems. Although Bayesian demography, like Bayesian statistics more generally, is around 250 years old, only recently has it begun to flourish. The aim of this paper is to review the achievements of Bayesian demography, address some misconceptions, and make the case for wider use of Bayesian methods in population studies. We focus on three applications: demographic forecasts, limited data, and highly structured or complex models. The key advantages of Bayesian methods are the ability to integrate information from multiple sources and to describe uncertainty coherently. Bayesian methods also allow for including additional (prior) information next to the data sample. As such, Bayesian approaches are complementary to many traditional methods, which can be productively re-expressed in Bayesian terms. PMID:26902889
A tribal level phylogeny of Lake Tanganyika cichlid fishes based on a genomic multi-marker approach
Meyer, Britta S.; Matschiner, Michael; Salzburger, Walter
2015-01-01
The species-flocks of cichlid fishes in the East African Great Lakes Victoria, Malawi and Tanganyika constitute the most diverse extant adaptive radiations in vertebrates. Lake Tanganyika, the oldest of the lakes, harbors the morphologically and genetically most diverse assemblage of cichlids and contains the highest number of endemic cichlid genera of all African lakes. Based on morphological grounds, the Tanganyikan cichlid species have been grouped into 12–16 distinct lineages, so-called tribes. While the monophyly of most of the tribes is well established, the phylogenetic relationships among the tribes remain largely elusive. Here, we present a new tribal level phylogenetic hypothesis for the cichlid fishes of Lake Tanganyika that is based on the so far largest set of nuclear markers and a total alignment length of close to 18 kb. Using next-generation amplicon sequencing with the 454 pyrosequencing technology, we compiled a dataset consisting of 42 nuclear loci in 45 East African cichlid species, which we subjected to maximum likelihood and Bayesian inference phylogenetic analyses. We analyzed the entire concatenated dataset and each marker individually, and performed a Bayesian concordance analysis and gene tree discordance tests. Overall, we find strong support for a position of the Oreochromini, Boulengerochromini, Bathybatini and Trematocarini outside of a clade combining the substrate spawning Lamprologini and the mouthbrooding tribes of the ‘H-lineage’, which are both strongly supported to be monophyletic. The Eretmodini are firmly placed within the ‘H-lineage’, as sister-group to the most species-rich tribe of cichlids, the Haplochromini. The phylogenetic relationships at the base of the ‘H-lineage’ received less support, which is likely due to high speciation rates in the early phase of the radiation. 
Discordance among gene trees and marker sets further suggests the occurrence of past hybridization and/or incomplete lineage sorting in the cichlid fishes of Lake Tanganyika. PMID:25433288
Appelhans, M. S.; Smets, E.; Razafimandimbison, S. G.; Haevermans, T.; van Marle, E. J.; Couloux, A.; Rabarison, H.; Randrianarivelojosia, M.; Keßler, P. J. A.
2011-01-01
Background and Aims The Spathelia–Ptaeroxylon clade is a group of morphologically diverse plants that have been classified together as a result of molecular phylogenetic studies. The clade is currently included in Rutaceae and recognized at a subfamilial level (Spathelioideae) despite the fact that most of its genera have traditionally been associated with other families and that there are no obvious morphological synapomorphies for the clade. The aim of the present study is to construct phylogenetic trees for the Spathelia–Ptaeroxylon clade and to investigate anatomical characters in order to decide whether it should be kept in Rutaceae or recognized at the familial level. Anatomical characters were plotted on a cladogram to help explain character evolution within the group. Moreover, phylogenetic relationships and generic limits within the clade are also addressed. Methods A species-level phylogenetic analysis of the Spathelia–Ptaeroxylon clade based on five plastid DNA regions (rbcL, atpB, trnL–trnF, rps16 and psbA–trnH) was conducted using Bayesian, maximum parsimony and maximum likelihood methods. Leaf and seed anatomical characters of all genera were (re)investigated by light and scanning electron microscopy. Key Results With the exception of Spathelia, all genera of the Spathelia–Ptaeroxylon clade are monophyletic. The typical leaf and seed anatomical characters of Rutaceae were found. Further, the presence of oil cells in the leaves provides a possible synapomorphy for the clade. Conclusions The Spathelia–Ptaeroxylon clade is well placed in Rutaceae and it is reasonable to unite the genera into one subfamily (Spathelioideae). We propose a new tribal classification of Spathelioideae. A narrow circumscription of Spathelia is established to make the genus monophyletic, and Sohnreyia is resurrected to accommodate the South American species of Spathelia. 
The most recent common ancestor of Spathelioideae probably had leaves with secretory cavities and oil cells, haplostemonous flowers with appendaged staminal filaments, and a tracheidal tegmen. PMID:21610209
The Estimation of Tree Posterior Probabilities Using Conditional Clade Probability Distributions
Larget, Bret
2013-01-01
In this article I introduce the idea of conditional independence of separated subtrees as a principle by which to estimate the posterior probability of trees using conditional clade probability distributions rather than simple sample relative frequencies. I describe an algorithm for these calculations and software which implements these ideas. I show that these alternative calculations are very similar to simple sample relative frequencies for high probability trees but are substantially more accurate for relatively low probability trees. The method allows the posterior probability of unsampled trees to be calculated when these trees contain only clades that are in other sampled trees. Furthermore, the method can be used to estimate the total probability of the set of sampled trees which provides a measure of the thoroughness of a posterior sample. [Bayesian phylogenetics; conditional clade distributions; improved accuracy; posterior probabilities of trees.] PMID:23479066
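The idea in the abstract above can be illustrated with a minimal sketch. It is not the author's software: it assumes rooted binary trees written as nested tuples of taxon names, and the function names (`splits`, `fit_ccd`, `ccd_probability`) are hypothetical. Each tree's probability is estimated as the product, over its internal nodes, of the sample frequency with which the parent clade was divided into that particular pair of child clades.

```python
from collections import Counter

def splits(tree):
    """List (clade, {left_clade, right_clade}) for every internal node
    of a rooted binary tree given as nested 2-tuples of taxon names."""
    found = []

    def walk(node):
        if isinstance(node, str):          # leaf: a single taxon
            return frozenset([node])
        left, right = walk(node[0]), walk(node[1])
        clade = left | right
        found.append((clade, frozenset([left, right])))
        return clade

    walk(tree)
    return found

def fit_ccd(sample):
    """Count how often each clade, and each split of a clade, occurs."""
    clade_counts, split_counts = Counter(), Counter()
    for tree in sample:
        for clade, split in splits(tree):
            clade_counts[clade] += 1
            split_counts[(clade, split)] += 1
    return clade_counts, split_counts

def ccd_probability(tree, clade_counts, split_counts):
    """Product of conditional clade probabilities along the tree."""
    prob = 1.0
    for clade, split in splits(tree):
        if clade_counts[clade] == 0:       # clade never sampled
            return 0.0
        prob *= split_counts[(clade, split)] / clade_counts[clade]
    return prob

# Toy posterior sample (illustrative data only, not from the paper).
t1 = ((("A", "B"), "C"), ("D", "E"))
t2 = ((("A", ("B", "C")), "D"), "E")
clade_counts, split_counts = fit_ccd([t1, t2])

# An unsampled tree built only from sampled clades gets non-zero probability.
t3 = (((("A", "B"), "C"), "D"), "E")
print(ccd_probability(t3, clade_counts, split_counts))
```

In this toy sample the simple relative-frequency estimate for `t3` would be zero, whereas the conditional clade estimate is positive because every clade and split in `t3` appears in some sampled tree.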
Romero, Pedro E; Pfenninger, Markus; Kano, Yasunori; Klussmann-Kolb, Annette
2016-04-01
Gastropods of the family Ellobiidae are an interesting group in which to study transitions from intertidal to terrestrial realms. However, the phylogenetic relationships within this family still lack resolution. We present a phylogenetic hypothesis of the Ellobiidae based on Bayesian and maximum likelihood phylograms. We used nuclear (18S, 28S, H3) and mitochondrial (16S, 12S, COI) data, increasing the numbers of markers and data, and making this the most comprehensive phylogenetic study of the family to date. Our results support phylogenetic hypotheses derived from morphological data, and provide a supported framework to evaluate the internal relationships within Ellobiidae. The resulting phylogenetic trees support the previous hypothesis that the Ellobiidae are monophyletic only if the Trimusculinae (Otina, Smeagol and Trimusculus) are considered part of this family. In addition, we found that the Carychiinae, Ellobiinae and Pythiinae are reciprocally monophyletic and closely related, with the Carychiinae as sister group to Ellobiinae. Relationships within Melampodinae and Pedipedinae and their phylogenetic positions remain unresolved. Land invasion by the Ellobiidae occurred independently in Carychiinae and Pythia during different geological times (Mesozoic and Cenozoic, respectively). Diversification in the family does not appear to be related to past climate and biotic changes, neither the Cretaceous-Paleogene boundary nor the lowering of the sea level in the Oligocene. Copyright © 2015 Elsevier Inc. All rights reserved.
Roth, Steffen; Fromm, Bastian; Gäde, Gerd; Predel, Reinhard
2009-01-01
Background Neuropeptide ligands have to fit exactly into their respective receptors and thus the evolution of the coding regions of their genes is constrained and may be strongly conserved. As such, they may be suitable for the reconstruction of phylogenetic relationships within higher taxa. CAPA peptides of major lineages of cockroaches (Blaberidae, Blattellidae, Blattidae, Polyphagidae, Cryptocercidae) and of the termite Mastotermes darwiniensis were chosen to test the above hypothesis. The phylogenetic relationships within various groups of the taxon Dictyoptera (praying mantids, termites and cockroaches) are still highly disputed. Results Tandem mass spectrometry of neuropeptides from perisympathetic organs was used to obtain sequence data of CAPA peptides from single specimens; the data were analysed by Maximum Parsimony and Bayesian Inference. The resulting cladograms, taking 61 species into account, show a topology which is in general agreement with recent molecular and morphological phylogenetic analyses, including the recent phylogenetic arrangement placing termites within the cockroaches. When sequence data sets from other neuropeptides, viz. adipokinetic hormones and sulfakinins, were included, the general topology of the cladogram did not change but bootstrap values increased considerably. Conclusion This study represents the first comprehensive survey of neuropeptides of insects for solely phylogenetic purposes and concludes that sequences of short neuropeptides are suitable to complement molecular biological and morphological data for the reconstruction of phylogenetic relationships. PMID:19257902
Evidence of two distinct phylogenetic lineages of dog rabies virus circulating in Cambodia.
Mey, Channa; Metlin, Artem; Duong, Veasna; Ong, Sivuth; In, Sotheary; Horwood, Paul F; Reynes, Jean-Marc; Bourhy, Hervé; Tarantola, Arnaud; Buchy, Philippe
2016-03-01
This first extensive retrospective study of the molecular epidemiology of dog rabies in Cambodia included 149 rabies virus (RABV) entire nucleoprotein sequences obtained from 1998-2011. The sequences were analyzed in conjunction with RABVs from other Asian countries. Phylogenetic reconstruction confirmed the South-East Asian phylogenetic clade comprising viruses from Cambodia, Vietnam, Thailand, Laos and Myanmar. The present study represents the first attempt to classify the phylogenetic lineages inside this clade, resulting in the confirmation that all the Cambodian viruses belonged to the South-East Asian (SEA) clade. Three distinct phylogenetic lineages in the region were established, with the majority of viruses from Cambodia closely related to viruses from Thailand, Laos and Vietnam, forming the geographically widespread phylogenetic lineage SEA1. A South-East Asian lineage SEA2, comprising two viruses from Cambodia, was identified; it shared a common ancestor with RABVs originating from Laos. Viruses from Myanmar formed separate phylogenetic lineages within the major SEA clade. Bayesian molecular clock analysis suggested that the time to most recent common ancestor (TMRCA) of all Cambodian RABVs dated to around 1950. The TMRCA of the Cambodian SEA1 lineage was around 1964 and that of the SEA2 lineage was around 1953. The results identified three phylogenetically distinct and geographically separated lineages inside the earlier identified major SEA clade, covering at least five countries in the region. A greater understanding of the molecular epidemiology of rabies in South-East Asia is an important step to monitor progress on the efforts to control canine rabies in the region. Copyright © 2015 Elsevier B.V. All rights reserved.
Probabilistic Graphical Model Representation in Phylogenetics
Höhna, Sebastian; Heath, Tracy A.; Boussau, Bastien; Landis, Michael J.; Ronquist, Fredrik; Huelsenbeck, John P.
2014-01-01
Recent years have seen a rapid expansion of the model space explored in statistical phylogenetics, emphasizing the need for new approaches to statistical model representation and software development. Clear communication and representation of the chosen model is crucial for: (i) reproducibility of an analysis, (ii) model development, and (iii) software design. Moreover, a unified, clear and understandable framework for model representation lowers the barrier for beginners and nonspecialists to grasp complex phylogenetic models, including their assumptions and parameter/variable dependencies. Graphical modeling is a unifying framework that has gained in popularity in the statistical literature in recent years. The core idea is to break complex models into conditionally independent distributions. The strength lies in the comprehensibility, flexibility, and adaptability of this formalism, and the large body of computational work based on it. Graphical models are well-suited to teach statistical models, to facilitate communication among phylogeneticists and in the development of generic software for simulation and statistical inference. Here, we provide an introduction to graphical models for phylogeneticists and extend the standard graphical model representation to the realm of phylogenetics. We introduce a new graphical model component, tree plates, to capture the changing structure of the subgraph corresponding to a phylogenetic tree. We describe a range of phylogenetic models using the graphical model framework and introduce modules to simplify the representation of standard components in large and complex models. Phylogenetic model graphs can be readily used in simulation, maximum likelihood inference, and Bayesian inference using, for example, Metropolis–Hastings or Gibbs sampling of the posterior distribution. [Computation; graphical models; inference; modularization; statistical phylogenetics; tree plate.] PMID:24951559
The genetic diversity of hepatitis A genotype I in Bulgaria
Cella, Eleonora; Golkocheva-Markova, Elitsa N.; Trandeva-Bankova, Diljana; Gregori, Giulia; Bruni, Roberto; Taffon, Stefania; Equestre, Michele; Costantino, Angela; Spoto, Silvia; Curtis, Melissa; Ciccaglione, Anna Rita; Ciccozzi, Massimo; Angeletti, Silvia
2018-01-01
The purpose of this study was to analyze sequences of hepatitis A virus (HAV) Ia and Ib genotypes from Bulgarian patients to investigate the molecular epidemiology of HAV genotype I during the years 2012 to 2014. Around 105 serum samples were collected by the Department of Virology of the National Center of Infectious and Parasitic Diseases in Bulgaria. The sequenced region encompassed the VP1/2A region of the HAV genome. A total of 103 sequences were obtained from the samples. For the phylogenetic analyses, 5 datasets were built to investigate the viral gene in/out flow among distinct HAV subpopulations in different geographic areas and to build a Bayesian dated tree; Bayesian phylogenetic and migration pattern analyses were performed. HAV Ib Bulgarian sequences mostly grouped into a single clade. This indicates that the Bulgarian epidemic is partially compartmentalized. It originated from a limited number of viruses and then spread through fecal-oral local transmission. HAV Ia Bulgarian sequences were intermixed with European sequences, suggesting that an Ia epidemic is not restricted to Bulgaria but can affect other European countries. The time-scaled phylogeny reconstruction dated the root of the tree to 2008 for genotype Ib and to 1999 for genotype Ia, with a second epidemic entrance in 2003. The Bayesian skyline plot for genotype Ib showed a slow but continuous growth, sustained by fecal-oral route transmission. For genotype Ia, there was an exponential growth followed by a plateau, which suggests better infection control. Bidirectional viral flow for Ib genotype, involving different Bulgarian areas, was observed, whereas a unidirectional flow from Sofia to Ihtiman for genotype Ia was highlighted, suggesting the fecal-oral transmission route for Ia. PMID:29504993
Damerau, M; Freese, M; Hanel, R
2018-01-01
In this study, the phylogenetic trees of jacks and pompanos (Carangidae), an ecologically and morphologically diverse, globally distributed fish family, are inferred from a complete, concatenated data set of two mitochondrial (cytochrome c oxidase I, cytochrome b) loci and one nuclear (myosin heavy chain 6) locus. Maximum likelihood and Bayesian inferences are largely congruent and show a clear separation of Carangidae into the four subfamilies: Scomberoidinae, Trachinotinae, Naucratinae and Caranginae. The inclusion of the carangid sister lineages Coryphaenidae (dolphinfishes) and Rachycentridae (cobia), however, renders Carangidae paraphyletic. The phylogenetic trees also show with high statistical support that the monotypic vadigo Campogramma glaycos is the sister to all other species within the Naucratinae. © 2017 The Fisheries Society of the British Isles.
Extreme environments select for reproductive assurance: evidence from evening primroses (Oenothera).
Evans, Margaret E K; Hearn, David J; Theiss, Kathryn E; Cranston, Karen; Holsinger, Kent E; Donoghue, Michael J
2011-07-01
Competing evolutionary forces shape plant breeding systems (e.g. inbreeding depression, reproductive assurance). Which of these forces prevails in a given population or species is predicted to depend upon such factors as life history, ecological conditions, and geographical context. Here, we examined two such predictions: that self-compatibility should be associated with the annual life history or extreme climatic conditions. We analyzed data from a clade of plants remarkable for variation in breeding system, life history and climatic conditions (Oenothera, sections Anogra and Kleinia, Onagraceae). We used a phylogenetic comparative approach and Bayesian or hybrid Bayesian tests to account for phylogenetic uncertainty. Geographic information system (GIS)-based climate data and ecological niche modeling allowed us to quantify climatic conditions. Breeding system and reproductive life span are not correlated in Anogra and Kleinia. Instead, self-compatibility is associated with the extremes of temperature in the coldest part of the year and precipitation in the driest part of the year. In the 60 yr since this pattern was anticipated, this is the first demonstration of a relationship between the evolution of self-compatibility and climatic extremes. We discuss possible explanations for this pattern and possible implications with respect to anthropogenic climate change. © 2011 The Authors. New Phytologist © 2011 New Phytologist Trust.
Shin, Seunggwan; Jung, Sunghoon; Menzel, Frank; Heller, Kai; Lee, Heungsik; Lee, Seunghwan
2013-03-01
The phylogeny of the family Sciaridae is reconstructed, based on maximum likelihood, maximum parsimony, and Bayesian analyses of 4809 bp from two mitochondrial (COI and 16S) and two nuclear (18S and 28S) genes for 100 taxa including the outgroup taxa. According to the present phylogenetic analyses, Sciaridae comprise three subfamilies and two genus groups: Sciarinae, Chaetosciara group, Cratyninae, and Pseudolycoriella group+Megalosphyinae. Our molecular results are largely congruent with one of the former hypotheses based on morphological data with respect to the monophyly of genera and subfamilies (Sciarinae, Megalosphyinae, and part of the postulated "new subfamily"); however, the subfamily Cratyninae is shown to be polyphyletic, and the genera Bradysia, Corynoptera, Leptosciarella, Lycoriella, and Phytosciara are also recognized as non-monophyletic groups. While the ancestral larval habitat state of the family Sciaridae, based on Bayesian inference, is dead plant material (plant litter+rotten wood), the common ancestors of Phytosciara and Bradysia are inferred to have inhabited living plants. Therefore, shifts in larval habitats from dead plant material to living plants may have occurred within the Sciaridae at least once. Based on the results, we discuss phylogenetic relationships within the family, and present an evolutionary scenario of development of larval habitats. Copyright © 2012 Elsevier Inc. All rights reserved.
A Bayesian phylogenetic study of the Dravidian language family
Kolipakam, Vishnupriya
2018-01-01
The Dravidian language family consists of about 80 varieties (Hammarström H. 2016 Glottolog 2.7) spoken by 220 million people across southern and central India and surrounding countries (Steever SB. 1998 In The Dravidian languages (ed. SB Steever), pp. 1–39: 1). Neither the geographical origin of the Dravidian language homeland nor its exact dispersal through time are known. The history of these languages is crucial for understanding prehistory in Eurasia, because despite their current restricted range, these languages played a significant role in influencing other language groups including Indo-Aryan (Indo-European) and Munda (Austroasiatic) speakers. Here, we report the results of a Bayesian phylogenetic analysis of cognate-coded lexical data, elicited first hand from native speakers, to investigate the subgrouping of the Dravidian language family, and provide dates for the major points of diversification. Our results indicate that the Dravidian language family is approximately 4500 years old, a finding that corresponds well with earlier linguistic and archaeological studies. The main branches of the Dravidian language family (North, Central, South I, South II) are recovered, although the placement of languages within these main branches diverges from previous classifications. We find considerable uncertainty with regard to the relationships between the main branches. PMID:29657761
Phylogenetic Information Content of Copepoda Ribosomal DNA Repeat Units: ITS1 and ITS2 Impact
Zagoskin, Maxim V.; Lazareva, Valentina I.; Grishanin, Andrey K.; Mukha, Dmitry V.
2014-01-01
The utility of various regions of the ribosomal repeat unit for phylogenetic analysis was examined in 16 species representing four families, nine genera, and two orders of the subclass Copepoda (Crustacea). Fragments approximately 2000 bp in length containing the ribosomal DNA (rDNA) 18S and 28S gene fragments, the 5.8S gene, and the internal transcribed spacer regions I and II (ITS1 and ITS2) were amplified and analyzed. The DAMBE (Data Analysis in Molecular Biology and Evolution) software was used to analyze the saturation of nucleotide substitutions; this test revealed the suitability of both the 28S gene fragment and the ITS1/ITS2 rDNA regions for the reconstruction of phylogenetic trees. Distance (minimum evolution) and probabilistic (maximum likelihood, Bayesian) analyses of the data revealed that the 28S rDNA and the ITS1 and ITS2 regions are informative markers for inferring phylogenetic relationships among families of copepods and within the Cyclopidae family and associated genera. Split-graph analysis of concatenated ITS1/ITS2 rDNA regions of cyclopoid copepods suggested that the Mesocyclops, Thermocyclops, and Macrocyclops genera share complex evolutionary relationships. This study revealed that the ITS1 and ITS2 regions potentially represent different phylogenetic signals. PMID:25215300
Bayesian random local clocks, or one rate to rule them all
2010-01-01
Background Relaxed molecular clock models allow divergence time dating and "relaxed phylogenetic" inference, in which a time tree is estimated in the face of unequal rates across lineages. We present a new method for relaxing the assumption of a strict molecular clock using Markov chain Monte Carlo to implement Bayesian model averaging over random local molecular clocks. The new method approaches the problem of rate variation among lineages by proposing a series of local molecular clocks, each extending over a subregion of the full phylogeny. Each branch in a phylogeny (subtending a clade) is a possible location for a change of rate from one local clock to a new one. Thus, including both the global molecular clock and the unconstrained model results, there are a total of 2^(2n-2) possible rate models available for averaging with 1, 2, ..., 2n - 2 different rate categories. Results We propose an efficient method to sample this model space while simultaneously estimating the phylogeny. The new method conveniently allows a direct test of the strict molecular clock, in which one rate rules them all, against a large array of alternative local molecular clock models. We illustrate the method's utility on three example data sets involving mammal, primate and influenza evolution. Finally, we explore methods to visualize the complex posterior distribution that results from inference under such models. Conclusions The examples suggest that large sequence datasets may only require a small number of local molecular clocks to reconcile their branch lengths with a time scale. All of the analyses described here are implemented in the open access software package BEAST 1.5.4 (http://beast-mcmc.googlecode.com/). PMID:20807414
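The size of the model space described above follows from a simple count: a rooted binary tree with n tips has 2n - 2 branches, and each branch either carries a rate change or does not, giving 2^(2n-2) configurations. The short sketch below only illustrates that counting argument under those assumptions; it is not BEAST code, and the function names are hypothetical.

```python
from itertools import product

def count_rate_models(n_tips: int) -> int:
    """Number of rate-change configurations on a rooted binary tree:
    one binary indicator per branch, and 2n - 2 branches in total."""
    n_branches = 2 * n_tips - 2
    return 2 ** n_branches

def rate_change_indicators(n_tips: int):
    """Yield every assignment of 0/1 rate-change indicators to branches.
    All zeros corresponds to the strict (global) molecular clock."""
    n_branches = 2 * n_tips - 2
    yield from product((0, 1), repeat=n_branches)

for n in (3, 5, 10, 20):
    print(n, count_rate_models(n))
```

Exhaustive enumeration is only feasible for very small trees, which is why the abstract describes sampling this space rather than listing it.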
Harris, Liam W.; Davies, T. Jonathan
2016-01-01
Explaining the uneven distribution of species richness across the branches of the tree of life has been a major challenge for evolutionary biologists. Advances in phylogenetic reconstruction, allowing the generation of large, well-sampled, phylogenetic trees have provided an opportunity to contrast competing hypotheses. Here, we present a new time-calibrated phylogeny of seed plant families using Bayesian methods and 26 fossil calibrations. While there are various published phylogenetic trees for plants which have a greater density of species sampling, we are still a long way from generating a complete phylogeny for all ~300,000+ plants. Our phylogeny samples all seed plant families and is a useful tool for comparative analyses. We use this new phylogenetic hypothesis to contrast two alternative explanations for differences in species richness among higher taxa: time for speciation versus ecological limits. We calculated net diversification rate for each clade in the phylogeny and assessed the relationship between clade age and species richness. We then fit models of speciation and extinction to individual branches in the tree to identify major rate-shifts. Our data suggest that the majority of lineages are diversifying very slowly while a few lineages, distributed throughout the tree, are diversifying rapidly. Diversification is unrelated to clade age, no matter the age range of the clades being examined, contrary to both the assumption of an unbounded lineage increase through time, and the paradigm of fixed ecological limits. These findings are consistent with the idea that ecology plays a role in diversification, but rather than imposing a fixed limit, it may have variable effects on per lineage diversification rates through time. PMID:27706173
Hochbach, Anne; Schneider, Julia; Röser, Martin
2015-06-01
To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 show the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.
Stamatakis, Alexandros; Ott, Michael
2008-12-27
The continuous accumulation of sequence data, for example, due to novel wet-laboratory techniques such as pyrosequencing, coupled with the increasing popularity of multi-gene phylogenies and emerging multi-core processor architectures that face problems of cache congestion, poses new challenges with respect to the efficient computation of the phylogenetic maximum-likelihood (ML) function. Here, we propose two approaches that can significantly speed up likelihood computations that typically represent over 95 per cent of the computational effort conducted by current ML or Bayesian inference programs. Initially, we present a method and an appropriate data structure to efficiently compute the likelihood score on 'gappy' multi-gene alignments. By 'gappy' we denote sampling-induced gaps owing to missing sequences in individual genes (partitions), i.e. not real alignment gaps. A first proof-of-concept implementation in RAxML indicates that this approach can accelerate inferences on large and gappy alignments by approximately one order of magnitude. Moreover, we present insights and initial performance results on multi-core architectures obtained during the transition from an OpenMP-based to a Pthreads-based fine-grained parallelization of the ML function.
Watts, Joseph; Greenhill, Simon J.; Atkinson, Quentin D.; Currie, Thomas E.; Bulbulia, Joseph; Gray, Russell D.
2015-01-01
Supernatural belief presents an explanatory challenge to evolutionary theorists: it is both costly and prevalent. One influential functional explanation claims that the imagined threat of supernatural punishment can suppress selfishness and enhance cooperation. Specifically, morally concerned supreme deities or ‘moralizing high gods’ (MHGs) have been argued to reduce free-riding in large social groups, enabling believers to build the kind of complex societies that define modern humanity. Previous cross-cultural studies claiming to support the MHG hypothesis rely on correlational analyses only and do not correct for the statistical non-independence of sampled cultures. Here we use a Bayesian phylogenetic approach with a sample of 96 Austronesian cultures to test the MHG hypothesis as well as an alternative supernatural punishment hypothesis that allows punishment by a broad range of moralizing agents. We find evidence that broad supernatural punishment drives political complexity, whereas MHGs follow political complexity. We suggest that the concept of MHGs diffused as part of a suite of traits arising from cultural exchange between complex societies. Our results show the power of phylogenetic methods to address long-standing debates about the origins and functions of religion in human society. PMID:25740888
Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin
2016-07-01
The mitochondrial genome of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length of this mitochondrial genome, including a partial A + T-rich region, is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the putative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of trnY and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven protein-coding genes stop with the termination codon TAA, one with TA, and one with T. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage and then sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.
McDade, Lucinda A; Daniel, Thomas F; Kiel, Carrie A
2008-09-01
Acanthaceae (Asteridae; Lamiales) include ∼4000 species and encompass a range of morphological diversity, habitats, and biogeographic patterns. Although they are important components of tropical and subtropical habitats worldwide, inadequate knowledge of the family's phylogenetic framework has impeded comparative research. In this study, we sampled all known lineages of Acanthaceae including Andrographideae. Also included were eight of 13 genera whose relationships remain enigmatic. We used sequence data from nrITS and four chloroplast noncoding regions, and parsimony and Bayesian methods of analysis. Results strongly support most aspects of relationships including inclusion of Avicennia in Acanthaceae. Excepting Neuracanthus, newly sampled taxa are placed with strong support; Kudoacanthus is in Justicieae, Tetramerium lineage, and the remaining enigmatic genera are in Whitfieldieae or Barlerieae, and Andrographideae are sister to Barlerieae. This last result is unanticipated, but placement of Andrographideae based on structural characters has been elusive. Neuracanthus is monophyletic but placement relative to (Whitfieldieae (Andrographideae + Barlerieae)) is weakly supported. Many clades have clear morphological synapomorphies, but nonmolecular evidence for some remains elusive. Results suggest an Old World origin with multiple dispersal events to the New World. This study informs future work by clarifying sampling strategy and identifying aspects of relationships that require further study.
Chen, Yuan
2017-01-01
In this study, we sequenced fragments of cytochrome oxidase subunit 1 (CO1), internal transcribed spacer 1 (ITS1), and internal transcribed spacer 2 (ITS2) genes from 150 specimens belonging to 16 species of the ant genus Formica from China. Odontoponera transversa from Ponerinae and Polyergus samurai from Formicinae were added as distant relative and close relative outgroups, respectively. Neighbor-joining, maximum parsimony, and Bayesian inference methods were used to analyze their phylogenetic relationships based on CO1 gene sequence as well as combined sequence data of CO1 + ITS1, CO1 + ITS2, and CO1 + ITS1 + ITS2. The results showed that nine Formica species (i.e., Formica sinensis, Formica manchu, Formica uralensis, Formica sanguinea, Formica gagatoides, Formica candida, Formica fusca, Formica glauca, and Formica sp.) formed monophyletic clades, which is in agreement with the results based on morphological taxonomy. By comparing the results of DNA barcoding and morphological taxonomy, we propose that Formica aquilonia may be a junior synonym of F. polyctena and that cryptic species could exist in Formica sinae. Further studies on morphology, biology, and geography are needed to confirm this notion.
Matrilocal residence is ancestral in Austronesian societies
Jordan, Fiona M.; Gray, Russell D.; Greenhill, Simon J.; Mace, Ruth
2009-01-01
The nature of social life in human prehistory is elusive, yet knowing how kinship systems evolve is critical for understanding population history and cultural diversity. Post-marital residence rules specify sex-specific dispersal and kin association, influencing the pattern of genetic markers across populations. Cultural phylogenetics allows us to practise ‘virtual archaeology’ on these aspects of social life that leave no trace in the archaeological record. Here we show that early Austronesian societies practised matrilocal post-marital residence. Using a Markov-chain Monte Carlo comparative method implemented in a Bayesian phylogenetic framework, we estimated the type of residence at each ancestral node in a sample of Austronesian language trees spanning 135 Pacific societies. Matrilocal residence has been hypothesized for proto-Oceanic society (ca 3500 BP), but we find strong evidence that matrilocality was predominant in earlier Austronesian societies ca 5000–4500 BP, at the root of the language family and its early branches. Our results illuminate the divergent patterns of mtDNA and Y-chromosome markers seen in the Pacific. The analysis of present-day cross-cultural data in this way allows us to directly address cultural evolutionary and life-history processes in prehistory. PMID:19324748
Silvestro, Daniele; Zizka, Alexander; Bacon, Christine D; Cascales-Miñana, Borja; Salamin, Nicolas; Antonelli, Alexandre
2016-04-05
Methods in historical biogeography have revolutionized our ability to infer the evolution of ancestral geographical ranges from phylogenies of extant taxa, the rates of dispersals, and biotic connectivity among areas. However, extant taxa are likely to provide limited and potentially biased information about past biogeographic processes, due to extinction, asymmetrical dispersals and variable connectivity among areas. Fossil data hold considerable information about past distribution of lineages, but suffer from largely incomplete sampling. Here we present a new dispersal-extinction-sampling (DES) model, which estimates biogeographic parameters using fossil occurrences instead of phylogenetic trees. The model estimates dispersal and extinction rates while explicitly accounting for the incompleteness of the fossil record. Rates can vary between areas and through time, thus providing the opportunity to assess complex scenarios of biogeographic evolution. We implement the DES model in a Bayesian framework and demonstrate through simulations that it can accurately infer all the relevant parameters. We demonstrate the use of our model by analysing the Cenozoic fossil record of land plants and inferring dispersal and extinction rates across Eurasia and North America. Our results show that biogeographic range evolution is not a time-homogeneous process, as assumed in most phylogenetic analyses, but varies through time and between areas. In our empirical assessment, this is shown by the striking predominance of plant dispersals from Eurasia into North America during the Eocene climatic cooling, followed by a shift in the opposite direction, and finally, a balance in biotic interchange since the middle Miocene. We conclude by discussing the potential of fossil-based analyses to test biogeographic hypotheses and improve phylogenetic methods in historical biogeography. © 2016 The Author(s).
Linguistic Phylogenies Support Back-Migration from Beringia to Asia
Sicoli, Mark A.; Holton, Gary
2014-01-01
Recent arguments connecting Na-Dene languages of North America with Yeniseian languages of Siberia have been used to assert proof for the origin of Native Americans in central or western Asia. We apply phylogenetic methods to test support for this hypothesis against an alternative hypothesis that Yeniseian represents a back-migration to Asia from a Beringian ancestral population. We coded a linguistic dataset of typological features and used neighbor-joining network algorithms and Bayesian model comparison based on Bayes factors to test the fit between the data and the linguistic phylogenies modeling two dispersal hypotheses. Our results support that a Dene-Yeniseian connection more likely represents radiation out of Beringia with back-migration into central Asia than a migration from central or western Asia to North America. PMID:24621925
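As a rough illustration of the Bayes factor comparison mentioned above, the sketch below converts two log marginal likelihoods into 2 ln(BF) and labels the result with the commonly used Kass and Raftery evidence categories. The function names and the numeric inputs are hypothetical and are not taken from the study; how the marginal likelihoods were estimated there is not stated here.

```python
def two_ln_bayes_factor(log_ml_a, log_ml_b):
    """Return 2 * ln(BF) for model A over model B from log marginal likelihoods."""
    return 2.0 * (log_ml_a - log_ml_b)


def evidence_label(two_ln_bf):
    """Kass-Raftery style category for the favoured model (sign is ignored)."""
    magnitude = abs(two_ln_bf)
    if magnitude < 2.0:
        return "not worth more than a bare mention"
    if magnitude < 6.0:
        return "positive"
    if magnitude < 10.0:
        return "strong"
    return "very strong"


# Made-up log marginal likelihoods for two competing dispersal hypotheses.
score = two_ln_bayes_factor(-100.0, -104.0)
print(score, evidence_label(score))
```

A positive score favours the first model and a negative score the second; the label only describes the strength of the evidence.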
Villano, Umbertina; Lo Presti, Alessandra; Equestre, Michele; Cella, Eleonora; Pisani, Giulio; Giovanetti, Marta; Bruni, Roberto; Tritarelli, Elena; Amicosante, Massimo; Grifoni, Alba; Scarcella, Carmelo; El-Hamad, Issa; Pezzoli, Maria Chiara; Angeletti, Silvia; Ciccaglione, Anna Rita; Ciccozzi, Massimo
2015-07-25
Hepatitis B virus (HBV) infection is widespread and is considered a major health problem worldwide. The global distribution of HBV varies significantly between countries and between regions of the world. Among the many factors contributing to the changing epidemiology of viral hepatitis, the movement of people within and between countries is a potentially important one. In Italy, the number of migrants has been increasing during the past 25 years. HBV genotype D has been found throughout the world, although its highest prevalence is in the Mediterranean area, the Middle East and southern Asia. We describe the molecular epidemiology of HBV in a chronically infected population of migrants living in Italy, using phylogenetic analysis. HBV-DNA was amplified and sequenced from 43 chronically infected patients. Phylogenetic and evolutionary analyses were performed using both maximum likelihood and Bayesian methods. Of the 43 HBV S gene isolates from migrants, 25 (58.1%) were classified as genotype D. Maximum likelihood analysis showed intermixing between Moldavian and other foreign sequences, mostly apart from the Italian ones. Italian sequences mostly clustered together in a main clade, separate from all others. The estimated time of the tree's root had a mean value of 17 years ago, dating the origin of the tree back to 1992. The skyline plot showed that the number of infections increased gently until around 2005, after which it reached a plateau. Comparing the phylogenetic data with the migrants' dates of arrival in Italy suggests that the migrants may have arrived in Italy already infected in their country of origin. In conclusion, this is the first paper in which phylogenetic analysis and genetic evolution have been used to characterize the circulation of HBV subgenotype D1 in a selected and homogeneous group of migrants coming from a restricted area of the Balkans, and to approximately define the period of infection in addition to the migration date.
Argue, Debbie; Groves, Colin P; Lee, Michael S Y; Jungers, William L
2017-06-01
Although the diminutive Homo floresiensis has been known for a decade, its phylogenetic status remains highly contentious. A broad range of potential explanations for the evolution of this species has been explored. One view is that H. floresiensis is derived from Asian Homo erectus that arrived on Flores and subsequently evolved a smaller body size, perhaps to survive the constrained resources they faced in a new island environment. Fossil remains of H. erectus, well known from Java, have not yet been discovered on Flores. The second hypothesis is that H. floresiensis is directly descended from an early Homo lineage with roots in Africa, such as Homo habilis; the third is that it is Homo sapiens with pathology. We use parsimony and Bayesian phylogenetic methods to test these hypotheses. Our phylogenetic data build upon those characters previously presented in support of these hypotheses by broadening the range of traits to include the crania, mandibles, dentition, and postcrania of Homo and Australopithecus. The new data and analyses support the hypothesis that H. floresiensis is an early Homo lineage: H. floresiensis is sister either to H. habilis alone or to a clade consisting of at least H. habilis, H. erectus, Homo ergaster, and H. sapiens. A close phylogenetic relationship between H. floresiensis and H. erectus or H. sapiens can be rejected; furthermore, most of the traits separating H. floresiensis from H. sapiens are not readily attributable to pathology (e.g., Down syndrome). The results suggest H. floresiensis is a long-surviving relict of an early (>1.75 Ma) hominin lineage and a hitherto unknown migration out of Africa, and not a recent derivative of either H. erectus or H. sapiens. Copyright © 2017 Elsevier Ltd. All rights reserved.
Prior approval: the growth of Bayesian methods in psychology.
Andrews, Mark; Baguley, Thom
2013-02-01
Within the last few years, Bayesian methods of data analysis in psychology have proliferated. In this paper, we briefly review the history of the Bayesian approach to statistics, and consider the implications that Bayesian methods have for the theory and practice of data analysis in psychology.
An introduction to using Bayesian linear regression with clinical data.
Baldwin, Scott A; Larson, Michael J
2017-11-01
Statistical training in psychology focuses on frequentist methods. Bayesian methods are an alternative to standard frequentist methods. This article provides researchers with an introduction to fundamental ideas in Bayesian modeling. We use data from an electroencephalogram (EEG) and anxiety study to illustrate Bayesian models. Specifically, the models examine the relationship between error-related negativity (ERN), a particular event-related potential, and trait anxiety. Methodological topics covered include: how to set up a regression model in a Bayesian framework, specifying priors, examining convergence of the model, visualizing and interpreting posterior distributions, interval estimates, expected and predicted values, and model comparison tools. We also discuss situations where Bayesian methods can outperform frequentist methods as well as how to specify more complicated regression models. Finally, we conclude with recommendations about reporting guidelines for those using Bayesian methods in their own research. We provide data and R code for replicating our analyses. Copyright © 2017 Elsevier Ltd. All rights reserved.
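To make the idea of a regression model in a Bayesian framework concrete, here is a minimal sketch of the textbook conjugate case: a Normal prior on the coefficients and a known noise variance, which gives a closed-form Normal posterior. This is not the article's R code or its EEG data; the function name, the synthetic data and the prior settings are all assumptions chosen for illustration.

```python
import numpy as np


def posterior_beta(X, y, prior_mean, prior_cov, noise_var):
    """Posterior mean and covariance of beta for y = X @ beta + Normal noise.

    Assumes beta ~ N(prior_mean, prior_cov) and a known noise variance.
    """
    prior_prec = np.linalg.inv(prior_cov)
    post_prec = prior_prec + X.T @ X / noise_var
    post_cov = np.linalg.inv(post_prec)
    post_mean = post_cov @ (prior_prec @ prior_mean + X.T @ y / noise_var)
    return post_mean, post_cov


# Synthetic data (illustrative only): intercept 1.0, slope 2.0, noise sd 0.5.
rng = np.random.default_rng(0)
predictor = rng.normal(size=200)
X = np.column_stack([np.ones_like(predictor), predictor])
y = 1.0 + 2.0 * predictor + rng.normal(scale=0.5, size=200)

# Weakly informative prior centred on zero.
mean, cov = posterior_beta(X, y, np.zeros(2), 100.0 * np.eye(2), noise_var=0.25)

# Approximate 95% credible intervals from the Normal posterior.
sd = np.sqrt(np.diag(cov))
print(np.column_stack([mean - 1.96 * sd, mean, mean + 1.96 * sd]))
```

With a prior this vague the posterior mean is almost identical to the least-squares estimate; a tighter prior would pull the coefficients toward the prior mean. Models with unknown noise variance or non-conjugate priors generally need MCMC, which is where convergence checks become relevant.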
NASA Astrophysics Data System (ADS)
Li, L.; Xu, C.-Y.; Engeland, K.
2012-04-01
With respect to model calibration, parameter estimation and analysis of uncertainty sources, different approaches have been used in hydrological models. The Bayesian method is one of the most widely used methods for uncertainty assessment of hydrological models; it incorporates different sources of information into a single analysis through Bayes' theorem. However, none of these applications treats the uncertainty in extreme flows of hydrological model simulations well. This study proposes a Bayesian modularization approach to uncertainty assessment of conceptual hydrological models that considers the extreme flows. It includes a comprehensive comparison and evaluation of uncertainty assessments by the new Bayesian modularization approach and by traditional Bayesian models, using the Metropolis-Hastings (MH) algorithm with the daily hydrological model WASMOD. Three likelihood functions are used in combination with the traditional Bayesian approach: the AR(1) plus Normal and time period independent model (Model 1), the AR(1) plus Normal and time period dependent model (Model 2) and the AR(1) plus multi-normal model (Model 3). The results reveal that (1) the simulations derived from the Bayesian modularization method are more accurate, with the highest Nash-Sutcliffe efficiency value, and (2) the Bayesian modularization method performs best in uncertainty estimates of entire flows and in terms of application and computational efficiency. The study thus introduces a new approach for reducing the effect of extreme flows on the discharge uncertainty assessment of hydrological models via Bayesian methods. Keywords: extreme flow, uncertainty assessment, Bayesian modularization, hydrological model, WASMOD
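Two quantities named in this abstract can be sketched in a few lines: the Nash-Sutcliffe efficiency used to score simulations, and a log-likelihood for residuals that follow an AR(1) process with Normal innovations. The code below uses the standard textbook forms; the exact likelihoods used with WASMOD (Models 1 to 3) are not given here, so the function names and the stationary treatment of the first residual are assumptions.

```python
import numpy as np


def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 minus squared error over observed variability."""
    observed = np.asarray(observed, dtype=float)
    simulated = np.asarray(simulated, dtype=float)
    sse = np.sum((observed - simulated) ** 2)
    sst = np.sum((observed - observed.mean()) ** 2)
    return 1.0 - sse / sst


def ar1_normal_loglik(observed, simulated, rho, sigma):
    """Log-likelihood of residuals under a stationary AR(1) model, |rho| < 1.

    Residuals e_t = observed_t - simulated_t, with e_t = rho * e_{t-1} + eps_t
    and eps_t ~ N(0, sigma^2).
    """
    resid = np.asarray(observed, dtype=float) - np.asarray(simulated, dtype=float)
    # First residual is drawn from the stationary distribution of the process.
    var0 = sigma**2 / (1.0 - rho**2)
    loglik = -0.5 * (np.log(2.0 * np.pi * var0) + resid[0] ** 2 / var0)
    # Remaining terms use the one-step innovations e_t - rho * e_{t-1}.
    innov = resid[1:] - rho * resid[:-1]
    loglik += np.sum(-0.5 * (np.log(2.0 * np.pi * sigma**2) + innov**2 / sigma**2))
    return float(loglik)
```

A Nash-Sutcliffe value of 1 means a perfect match, and 0 means the simulation is no better than the mean of the observations. In a Metropolis-Hastings sampler, a log-likelihood of this kind would be added to the log-prior to score each proposed parameter set.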
Reyes-Velasco, Jacobo; Manthey, Joseph D.; Bourgeois, Yann; Freilich, Xenia; Boissinot, Stéphane
2018-01-01
Understanding the diversification of biological lineages is central to evolutionary studies. To properly study the process of speciation, it is necessary to link micro-evolutionary studies with macro-evolutionary mechanisms. Micro-evolutionary studies require proper sampling across a taxon's range to adequately infer genetic diversity. Here we use the grass frogs of the genus Ptychadena from the Ethiopian highlands as a model to study the process of lineage diversification in this unique biodiversity hotspot. We used thousands of genome-wide SNPs obtained from double digest restriction site associated DNA sequencing (ddRAD-seq) in populations of the Ptychadena neumanni species complex from the Ethiopian highlands in order to infer their phylogenetic relationships and genetic structure, as well as to study their demographic history. Our genome-wide phylogenetic study supports the existence of approximately 13 lineages clustered into 3 species groups. Our phylogenetic and phylogeographic reconstructions suggest that those endemic lineages diversified in allopatry, and subsequently specialized to different habitats and elevations. Demographic analyses point to a continuous decrease in the population size across the majority of lineages and populations during the Pleistocene, which is consistent with a continuous period of aridification that East Africa experienced since the Pliocene. We discuss the taxonomic implications of our analyses and, in particular, we warn against the recent practice of solely using Bayesian species delimitation methods when proposing taxonomic changes. PMID:29389966
Sites, J.W.; Morando, M.; Highton, R.; Huber, F.; Jung, R.E.
2004-01-01
The Shenandoah salamander (Plethodon shenandoah), known from isolated talus slopes on three of the highest mountains in Shenandoah National Park, is listed as state-endangered in Virginia and federally endangered under the U.S. Endangered Species Act. A 1999 paper by G. R. Thurow described P. shenandoah-like salamanders from three localities further south in the Blue Ridge Physiographic Province, which, if confirmed, would represent a range extension for P. shenandoah of approximately 90 km from its nearest known locality. Samples collected from two of these three localities were included in a molecular phylogenetic study of the known populations of P. shenandoah, and all other recognized species in the Plethodon cinereus group, using a 792 bp region of the mitochondrial cytochrome-b gene. Phylogenetic estimates were based on Bayesian, maximum likelihood, and maximum parsimony methods, and topologies were examined for placement of the new P. shenandoah-like samples relative to all others. All topologies recovered all haplotypes of the P. shenandoah-like animals nested within P. cinereus, and a statistical comparison of the best likelihood tree topology with one with an enforced (Thurow + Shenandoah P. shenandoah) clade revealed that the unconstrained tree had a significantly lower -ln L score (P < 0.05, using the Shimodaira-Hasegawa test) than the constraint tree. This result and other anecdotal information give us no solid reason to consider the Thurow report valid. The current recovery program for P. shenandoah should remain focused on populations in Shenandoah National Park.
Speciation within Columnea section Angustiflora (Gesneriaceae): islands, pollinators and climate.
Schulte, Lacie J; Clark, John L; Novak, Stephen J; Jeffries, Shandra K; Smith, James F
2015-03-01
Despite many advances in evolutionary biology, understanding the proximate mechanisms that lead to speciation for many taxonomic groups remains elusive. Phylogenetic analyses provide a means to generate well-supported estimates of species relationships. Understanding how genetic isolation (restricted gene flow) occurred in the past requires not only a well-supported molecular phylogenetic analysis, but also an understanding of when character states that define species may have changed. In this study, phylogenetic trees resolve species level relationships for fourteen of the fifteen species within Columnea section Angustiflorae (Gesneriaceae). The distributions of sister species pairs are compared and ancestral character states are reconstructed using Bayesian stochastic mapping. Climate variables were also assessed and shifts in ancestral climate conditions were mapped using SEEVA. The relationships between morphological character states and climate variables were assessed with correlation analyses. These results indicate that species in section Angustiflorae have likely diverged as a result of allopatric, parapatric, and sympatric speciation, with both biotic and abiotic forces driving morphological and phenological divergence. Copyright © 2015 Elsevier Inc. All rights reserved.
Derkarabetian, Shahan; Steinmann, David B.; Hedin, Marshal
2010-01-01
Background Many cave-dwelling animal species display similar morphologies (troglomorphism) that have evolved convergently within and among lineages under the similar selective pressures imposed by cave habitats. Here we study such ecomorphological evolution in cave-dwelling Sclerobuninae harvestmen (Opiliones) from the western United States, providing general insights into morphological homoplasy, rates of morphological change, and the temporal context of cave evolution. Methodology/Principal Findings We gathered DNA sequence data from three independent gene regions, and combined these data with Bayesian hypothesis testing, morphometrics analysis, study of penis morphology, and relaxed molecular clock analyses. Using multivariate morphometric analysis, we find that phylogenetically unrelated taxa have convergently evolved troglomorphism; alternative phylogenetic hypotheses involving less morphological convergence are not supported by Bayesian hypothesis testing. In one instance, this morphology is found in specimens from a high-elevation stony debris habitat, suggesting that troglomorphism can evolve in non-cave habitats. We discovered a strong positive relationship between troglomorphy index and relative divergence time, making it possible to predict taxon age from morphology. Most of our time estimates for the origin of highly-troglomorphic cave forms predate the Pleistocene. Conclusions/Significance While several regions in the eastern and central United States are well-known hotspots for cave evolution, few modern phylogenetic studies have addressed the evolution of cave-obligate species in the western United States. Our integrative studies reveal the recurrent evolution of troglomorphism in a perhaps unexpected geographic region, at surprisingly deep time depths, and in sometimes surprising habitats. Because some newly discovered troglomorphic populations represent undescribed species, our findings stress the need for further biological exploration, integrative systematic research, and conservation efforts in western US cave habitats. PMID:20479884
Reynaud, Yann; Rastogi, Nalin
2016-12-01
We recently showed that the Mycobacterium tuberculosis sublineage LAM9 could be subdivided into two distinct subpopulations, each reflecting its unique biogeographical structure and evolutionary history. We subsequently attempted to verify whether this genetic structure could be traced in an enlarged global sample. For this purpose, we analyzed global evolutionary relationships of LAM strains in a large dataset (n = 1923 isolates from 35 countries worldwide) with concomitant spoligotyping and MIRU-VNTR data, followed by a deeper analysis of the LAM9 sublineage (n = 851 isolates). Based on a combination of phylogenetic analysis and Bayesian statistics, a total of three different clusters, tentatively named LAM9C1, C2 and C3, were described in this dataset. Closer inspection of the phylogenetic tree, together with data on the origin of isolates and genetic clustering, revealed LAM9C3 to be the most tightly knit group, found exclusively in the Old World, whereas LAM9C2 was a loosely knit group without any phylogeographical specificity; in LAM9C1, the majority of strains were well clustered, although some isolates intermixed with unrelated LAM clusters. We therefore describe a new M. tuberculosis LAM sublineage, named LAM9C3, with phylogeographical specificity for the Old World. These findings open new perspectives for studying the respective migration histories and adaptation to human hosts of specific M. tuberculosis clones during the exploration and conquest of the New World. We therefore plan to reevaluate the nomenclature and evolutionary history of various LAM sublineages using Whole Genome Sequencing (WGS). Copyright © 2016 Elsevier Ltd. All rights reserved.
Pereira, Sergio L; Johnson, Kevin P; Clayton, Dale H; Baker, Allan J
2007-08-01
Phylogenetic relationships among genera of pigeons and doves (Aves, Columbiformes) have not been fully resolved because of limited sampling of taxa and characters in previous studies. We therefore sequenced multiple nuclear and mitochondrial DNA genes totaling over 9000 bp from 33 of 41 genera plus 8 outgroup taxa, and, together with sequences from 5 other pigeon genera retrieved from GenBank, recovered a strong phylogenetic hypothesis for the Columbiformes. Three major clades were recovered with the combined data set, comprising the basally branching New World pigeons and allies (clade A) that are sister to Neotropical ground doves (clade B), and the Afro-Eurasian and Australasian taxa (clade C). None of these clades supports the monophyly of current families and subfamilies. The extinct, flightless dodo and solitaires (Raphidae) were embedded within pigeons and doves (Columbidae) in clade C, and monophyly of the subfamily Columbinae was refuted because the remaining subfamilies were nested within it. Divergence times estimated using a Bayesian framework suggest that Columbiformes diverged from outgroups such as Apodiformes and Caprimulgiformes in the Cretaceous before the mass extinction that marks the end of this period. Bayesian and maximum likelihood inferences of ancestral areas, accounting for phylogenetic uncertainty and divergence times, respectively, favor an ancient origin of Columbiformes in the Neotropical portion of what was then Gondwana. The radiation of modern genera of Columbiformes started in the Early Eocene to the Middle Miocene, as previously estimated for other avian groups such as ratites, tinamous, galliform birds, penguins, shorebirds, parrots, passerine birds, and toucans. Multiple dispersals of more derived Columbiformes between Australasian and Afro-Eurasian regions are required to explain current distributions.
A Gentle Introduction to Bayesian Analysis: Applications to Developmental Research
van de Schoot, Rens; Kaplan, David; Denissen, Jaap; Asendorpf, Jens B; Neyer, Franz J; van Aken, Marcel AG
2014-01-01
Bayesian statistical methods are becoming ever more popular in applied and fundamental research. In this study a gentle introduction to Bayesian analysis is provided. It is shown under what circumstances it is attractive to use Bayesian estimation, and how to interpret the results properly. First, the ingredients underlying Bayesian methods are introduced using a simplified example. Thereafter, the advantages and pitfalls of the specification of prior knowledge are discussed. To illustrate the Bayesian methods explained in this study, a second example considers a series of studies that examine the theoretical framework of dynamic interactionism. In the Discussion the advantages and disadvantages of using Bayesian statistics are reviewed, and guidelines on how to report on Bayesian statistics are provided. PMID:24116396
Eberle, Jonas; Warnock, Rachel C M; Ahrens, Dirk
2016-05-05
Defining species units can be challenging, especially during the earliest stages of speciation, when phylogenetic inference and delimitation methods may be compromised by incomplete lineage sorting (ILS) or secondary gene flow. Integrative approaches to taxonomy, which combine molecular and morphological evidence, have the potential to be valuable in such cases. In this study we investigated the South African scarab beetle genus Pleophylla using data collected from 110 individuals of eight putative morphospecies. The dataset included four molecular markers (cox1, 16S, rrnL, ITS1) and morphometric data based on male genital morphology. We applied a suite of molecular and morphological approaches to species delimitation, and implemented a novel Bayesian approach in the software iBPP, which enables continuous morphological trait and molecular data to be combined. Traditional morphology-based species assignments were supported quantitatively by morphometric analyses of the male genitalia (eigenshape analysis, CVA, LDA). While the ITS1-based delineation was also broadly congruent with the morphospecies, the cox1 data resulted in over-splitting (GMYC modelling, haplotype networks, PTP, ABGD). In the most extreme case morphospecies shared identical haplotypes, which may be attributable to ILS based on statistical tests performed using the software JML. We found the strongest support for putative morphospecies based on phylogenetic evidence using the combined approach implemented in iBPP. However, support for putative species was sensitive to the use of alternative guide trees and alternative combinations of priors on the population size (θ) and root age (τ0) parameters, especially when the analysis was based on molecular or morphological data alone. We demonstrate that continuous morphological trait data can be extremely valuable in assessing competing hypotheses of species delimitation. In particular, we show that the inclusion of morphological data in an integrative Bayesian framework can improve the resolution of inferred species units. However, we also demonstrate that this approach is extremely sensitive to guide tree and prior parameter choice. These parameters should be chosen with caution and, if possible, based on independent empirical evidence, or careful sensitivity analyses should be performed to assess the robustness of results. Young species provide exemplars for investigating the mechanisms of speciation and for assessing the performance of tools used to delimit species on the basis of molecular and/or morphological evidence.
Sex and the Catasetinae (Darwin's favourite orchids).
Pérez-Escobar, Oscar Alejandro; Gottschling, Marc; Whitten, W Mark; Salazar, Gerardo; Gerlach, Günter
2016-04-01
Two sexual systems are predominant in Catasetinae (Orchidaceae), namely protandry (which has also evolved in other orchid lineages) and environmental sex determination (ESD), which is a unique trait among Orchidaceae. Yet, the lack of a robust phylogenetic framework for Catasetinae has hampered deeper insights into the origin and evolution of sexual systems. To investigate the origins of protandry and ESD in Catasetinae, we sequenced nuclear and chloroplast loci from 77 species, providing the most extensive data matrix of Catasetinae available so far with all major lineages represented. We used Maximum Parsimony, Maximum Likelihood and Bayesian methods to infer phylogenetic relationships and evolution of sexual systems. Irrespective of the methods used, Catasetinae were monophyletic in molecular phylogenies, with all established generic lineages and their relationships resolved and highly supported. According to comparative reconstruction approaches, the last common ancestor of Catasetinae was inferred as having bisexual flowers (i.e., lacking both protandry and ESD), and protandry originated once in core Catasetinae (comprising Catasetum, Clowesia, Cycnoches, Dressleria and Mormodes). In addition, three independent gains of ESD are reliably inferred, linked to a corresponding loss of protandry within core Catasetinae. Thus, prior gain of protandry appears to be the necessary prerequisite for gain of ESD in orchids. Our results contribute to a comprehensive evolutionary scenario for sexual systems in Catasetinae and more generally in orchids. Copyright © 2015 Elsevier Inc. All rights reserved.
Cultural and climatic changes shape the evolutionary history of the Uralic languages.
Honkola, T; Vesakoski, O; Korhonen, K; Lehtinen, J; Syrjänen, K; Wahlberg, N
2013-06-01
Quantitative phylogenetic methods have been used to study the evolutionary relationships and divergence times of biological species, and recently, these have also been applied to linguistic data to elucidate the evolutionary history of language families. In biology, the factors driving macroevolutionary processes are assumed to be either mainly biotic (the Red Queen model) or mainly abiotic (the Court Jester model) or a combination of both. The applicability of these models is assumed to depend on the temporal and spatial scale observed as biotic factors act on species divergence faster and in smaller spatial scale than the abiotic factors. Here, we used the Uralic language family to investigate whether both 'biotic' interactions (i.e. cultural interactions) and abiotic changes (i.e. climatic fluctuations) are also connected to language diversification. We estimated the times of divergence using Bayesian phylogenetics with a relaxed-clock method and related our results to climatic, historical and archaeological information. Our timing results paralleled the previous linguistic studies but suggested a later divergence of Finno-Ugric, Finnic and Saami languages. Some of the divergences co-occurred with climatic fluctuation and some with cultural interaction and migrations of populations. Thus, we suggest that both 'biotic' and abiotic factors contribute either directly or indirectly to the diversification of languages and that both models can be applied when studying language evolution. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.
Liu, Luxian; Jin, Xinjie; Chen, Nan; Li, Xian; Li, Pan; Fu, Chengxin
2015-01-01
Phylogenetic relationships among Chinese species of Morella (Myricaceae) are unresolved. Here, we use restriction site-associated DNA sequencing (RAD-seq) to identify candidate loci that will help in determining phylogenetic relationships among Morella rubra, M. adenophora, M. nana and M. esculenta. Three methods for inferring phylogeny, maximum parsimony (MP), maximum likelihood (ML) and Bayesian concordance, were applied to data sets including as many as 4253 RAD loci with 8360 parsimony informative variable sites. All three methods significantly favored the topology of (((M. rubra, M. adenophora), M. nana), M. esculenta). Two species from North America (M. cerifera and M. pensylvanica) were placed as sister to the four Chinese species. According to BEAST analysis, we deduced speciation of M. rubra to be at about the Miocene-Pliocene boundary (5.28 Ma). Intraspecific divergence in M. rubra occurred in the late Pliocene (3.39 Ma). From pooled data, we assembled 29378, 21902 and 23552 de novo contigs with an average length of 229, 234 and 234 bp for M. rubra, M. nana and M. esculenta respectively. The contigs were used to investigate functional classification of RAD tags in a BLASTX search. Additionally, we identified 3808 unlinked SNP sites across the four populations of M. rubra and discovered genes associated with fruit ripening and senescence, fruit quality and disease/defense metabolism based on KEGG database. PMID:26431030
Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A.
2015-01-01
Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina HiSeq 2500 instrument. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs. PMID:26656830
Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A
2015-01-01
Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina NextSeq 500 instrument [corrected]. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs.
A Gentle Introduction to Bayesian Analysis: Applications to Developmental Research
ERIC Educational Resources Information Center
van de Schoot, Rens; Kaplan, David; Denissen, Jaap; Asendorpf, Jens B.; Neyer, Franz J.; van Aken, Marcel A. G.
2014-01-01
Bayesian statistical methods are becoming ever more popular in applied and fundamental research. In this study a gentle introduction to Bayesian analysis is provided. It is shown under what circumstances it is attractive to use Bayesian estimation, and how to interpret properly the results. First, the ingredients underlying Bayesian methods are…
Dengue Virus Type 4 Phylogenetics in Brazil 2011: Looking beyond the Veil
de Souza, Renato Pereira; Rocco, Iray M.; Maeda, Adriana Y.; Spenassatto, Carine; Bisordi, Ivani; Suzuki, Akemi; Silveira, Vivian R.; Silva, Sarai J. S.; Azevedo, Roberta M.; Tolentino, Fernanda M.; Assis, Jaqueline C.; Bassi, Margarida G.; Dambrós, Bibiana P.; Tumioto, Gabriela L.; Gregianini, Tatiana S.; Souza, Luiza Terezinha M.; Timenetsky, Maria do Carmo S. T.; Santos, Cecília L. S.
2011-01-01
Dengue Fever and Dengue Hemorrhagic Fever are diseases affecting approximately 100 million people/year and are a major concern in developing countries. In the present study, the phylogenetic relationships of six strains from the first autochthonous cases of DENV-4 infection, which occurred in Sao Paulo State, Parana State and Rio Grande do Sul State, Brazil, in 2011, were studied. Nucleotide sequences of the envelope gene were determined and compared with sequences representative of genotypes I, II, III and Sylvatic for DEN4 retrieved from GenBank. We employed a Bayesian phylogenetic approach to reconstruct the phylogenetic relationships of Brazilian DENV-4, and we estimated evolutionary rates and dates of divergence for DENV-4 found in Brazil in 2011. All samples sequenced in this study fell within Genotype II. The studied strains are monophyletic, and our data suggest that they have been evolving separately for at least 4 to 6 years. Our data suggest that the virus might have been present in the region for some time without being noticed by Health Surveillance Services, due to a low level of circulation and a higher prevalence of DENV-1 and DENV-2. PMID:22216365
Li, Wei; Zhang, Xin-Cheng; Zhao, Jian; Shi, Yan; Zhu, Xin-Ping
2015-01-25
Cuora trifasciata has become one of the most critically endangered species in the world. The complete mitochondrial genome of C. trifasciata (Chinese three-striped box turtle) was determined in this study. Its mitochondrial genome is a 16,575-bp-long circular molecule that consists of 37 genes that are typically found in other vertebrates. The basic characteristics of the C. trifasciata mitochondrial genome were also determined. Moreover, a comparison of C. trifasciata with Cuora cyclornata, Cuora pani and Cuora aurocapitata indicated that the four mitogenomes differed in length, codons, overlaps, 13 protein-coding genes (PCGs), ND3, rRNA genes, control region, and other aspects. Phylogenetic analysis with Bayesian inference and maximum likelihood based on 12 protein-coding genes of the genus Cuora indicated the phylogenetic position of C. trifasciata within Cuora. The phylogenetic analysis also showed that C. trifasciata from Vietnam and China formed separate monophyletic clades with different Cuora species. The results of nucleotide base compositions, protein-coding genes and phylogenetic analysis showed that C. trifasciata from these two countries may represent different Cuora species. Copyright © 2014 Elsevier B.V. All rights reserved.
Rearrangement moves on rooted phylogenetic networks
Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2017-01-01
Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network—that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves—named rNNI and rSPR—reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results—separating the contributions of horizontal and vertical moves—we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. 
Our goal is to provide a solid basis for practical phylogenetic network reconstruction. PMID:28763439
Cross-Border Sexual Transmission of the Newly Emerging HIV-1 Clade CRF51_01B
Cheong, Hui Ting; Ng, Kim Tien; Ong, Lai Yee; Chook, Jack Bee; Chan, Kok Gan; Takebe, Yutaka; Kamarulzaman, Adeeba; Tee, Kok Keng
2014-01-01
A novel HIV-1 recombinant clade (CRF51_01B) was recently identified among men who have sex with men (MSM) in Singapore. As cases of sexually transmitted HIV-1 infection increase concurrently in two socioeconomically intimate countries such as Malaysia and Singapore, cross transmission of HIV-1 between said countries is highly probable. In order to investigate the timeline for the emergence of HIV-1 CRF51_01B in Singapore and its possible introduction into Malaysia, 595 HIV-positive subjects recruited in Kuala Lumpur from 2008 to 2012 were screened. Phylogenetic relationship of 485 amplified polymerase gene sequences was determined through neighbour-joining method. Next, near-full length sequences were amplified for genomic sequences inferred to be CRF51_01B and subjected to further analysis implemented through Bayesian Markov chain Monte Carlo (MCMC) sampling and maximum likelihood methods. Based on the near full length genomes, two isolates formed a phylogenetic cluster with CRF51_01B sequences of Singapore origin, sharing identical recombination structure. Spatial and temporal information from Bayesian MCMC coalescent and maximum likelihood analysis of the protease, gp120 and gp41 genes suggest that Singapore is probably the country of origin of CRF51_01B (as early as in the mid-1990s) and featured a Malaysian who acquired the infection through heterosexual contact as host for its ancestral lineages. CRF51_01B then spread rapidly among the MSM in Singapore and Malaysia. Although the importation of CRF51_01B from Singapore to Malaysia is supported by coalescence analysis, the narrow timeframe of the transmission event indicates a closely linked epidemic. Discrepancies in the estimated divergence times suggest that CRF51_01B may have arisen through multiple recombination events from more than one parental lineage. We report the cross transmission of a novel CRF51_01B lineage between countries that involved different sexual risk groups. 
Understanding the cross-border transmission of HIV-1 involving sexual networks is crucial for effective intervention strategies in the region. PMID:25340817
Tarasov, Sergei; Génier, François
2015-01-01
Scarabaeine dung beetles are the dominant dung feeding group of insects and are widely used as model organisms in conservation, ecology and developmental biology. Due to the conflicts among 13 recently published phylogenies dealing with the higher-level relationships of dung beetles, the phylogeny of this lineage remains largely unresolved. In this study, we conduct rigorous phylogenetic analyses of dung beetles, based on an unprecedented taxon sample (110 taxa) and detailed investigation of morphology (205 characters). We provide the description of morphology and thoroughly illustrate the used characters. Along with parsimony, traditionally used in the analysis of morphological data, we also apply the Bayesian method with a novel approach that uses anatomy ontology for matrix partitioning. This approach allows for heterogeneity in evolutionary rates among characters from different anatomical regions. Anatomy ontology generates a number of parameter-partition schemes which we compare using Bayes factor. We also test the effect of inclusion of autapomorphies in the morphological analysis, which hitherto has not been examined. Generally, schemes with more parameters were favored in the Bayesian comparison suggesting that characters located on different body regions evolve at different rates and that partitioning of the data matrix using anatomy ontology is reasonable; however, trees from the parsimony and all the Bayesian analyses were quite consistent. The hypothesized phylogeny reveals many novel clades and provides additional support for some clades recovered in previous analyses. Our results provide a solid basis for a new classification of dung beetles, in which the taxonomic limits of the tribes Dichotomiini, Deltochilini and Coprini are restricted and many new tribes must be described. 
Based on the consistency of the phylogeny with biogeography, we speculate that dung beetles may have originated in the Mesozoic contrary to the traditional view pointing to a Cenozoic origin. PMID:25781019
Nauheimer, Lars; Schley, Rowan J; Clements, Mark A; Micheneau, Claire; Nargar, Katharina
2018-06-02
Australia harbours a rich and highly endemic orchid flora, with c. 90 % of species endemic to the country. Despite that, the biogeographic history of Australasian orchid lineages is only poorly understood. Here we examined evolutionary relationships and the spatio-temporal evolution of the sun orchids (Thelymitra, 119 species), which display disjunct distribution patterns frequently found in Australasian orchid lineages. Phylogenetic analyses were conducted based on one nuclear (ITS) and three plastid markers (matK, psbJ-petA, ycf1) using Maximum Likelihood and Bayesian inference. Divergence time estimations were carried out with a relaxed molecular clock in a Bayesian framework. Ancestral ranges were estimated using the dispersal-extinction-cladogenesis model and an area coding based on major disjunctions. The phylogenetic analyses clarified intergeneric relationships within Thelymitrinae, with Epiblema being sister to Thelymitra plus Calochilus, both of which were well-supported. Within Thelymitra, eight major and several minor clades were retrieved in the nuclear and plastid phylogenetic reconstructions. Five major clades corresponded to species complexes previously recognized based on morphological characters, whereas other previously recognized species groups were found to be paraphyletic. Conflicting signals between the nuclear and plastid phylogenetic reconstructions provided support for hybridization and plastid capture events both in the deeper evolutionary history of the genus and more recently. Divergence time estimation placed the origin of Thelymitra in the late Miocene (c. 10.8 Ma) and the origin of the majority of the main clades within Thelymitra during the late Pliocene and early Pleistocene, with the majority of extant species arising during the Pleistocene. 
Ancestral range reconstruction revealed that the early diversification of the genus in the late Miocene and Pliocene took place predominantly in southwest Australia, where most species with highly restricted distributional ranges occur. Several long-distance dispersal events eastwards across the Nullarbor Plain were inferred, recurrently resulting in lineage divergence within the genus. The predominant eastwards direction of long-distance dispersal events in Thelymitra highlights the importance of the West Wind Drift for the present-day distribution of the genus, giving rise to the Thelymitra floras of Tasmania, New Zealand and New Caledonia, which were inferred to be of comparatively recent origin. Copyright © 2018. Published by Elsevier Inc.
A study of finite mixture model: Bayesian approach on financial time series data
NASA Astrophysics Data System (ADS)
Phoong, Seuk-Yen; Ismail, Mohd Tahir
2014-07-01
Recently, statisticians have emphasized fitting finite mixture models by using Bayesian methods. A finite mixture model is a mixture of distributions used to model a statistical distribution, while the Bayesian method is a statistical method used to fit the mixture model. The Bayesian method is widely used because it has asymptotic properties that provide remarkable results. In addition, the Bayesian method also shows a consistency characteristic, which means the parameter estimates are close to the predictive distributions. In the present paper, the number of components for the mixture model is studied by using the Bayesian Information Criterion. Identifying the number of components is important because a wrong choice may lead to an invalid result. The Bayesian method is then utilized to fit the k-component mixture model in order to explore the relationship between rubber price and stock market price for Malaysia, Thailand, Philippines and Indonesia. Lastly, the results show that there is a negative relationship between rubber price and stock market price for all selected countries.
Teaching Bayesian Statistics in a Health Research Methodology Program
ERIC Educational Resources Information Center
Pullenayegum, Eleanor M.; Thabane, Lehana
2009-01-01
Despite the appeal of Bayesian methods in health research, they are not widely used. This is partly due to a lack of courses in Bayesian methods at an appropriate level for non-statisticians in health research. Teaching such a course can be challenging because most statisticians have been taught Bayesian methods using a mathematical approach, and…
Xylopsora canopeorum (Umbilicariaceae), a new lichen species from the canopy of Sequoia sempervirens
Bendiksby, Mika; Næsborg, Rikke Reese; Timdal, Einar
2018-01-01
Xylopsora canopeorum Timdal, Reese Næsborg & Bendiksby is described as a new species occupying the crowns of large Sequoia sempervirens trees in California, USA. The new species is supported by morphology, anatomy, secondary chemistry and DNA sequence data. While similar in external appearance to X. friesii, it is distinguished by forming smaller, partly coralloid squamules, by the occurrence of soralia and, in some specimens, by the presence of thamnolic acid in addition to friesiic acid in the thallus. Molecular phylogenetic results are based on nuclear (ITS and LSU) as well as mitochondrial (SSU) ribosomal DNA sequence alignments. Phylogenetic hypotheses obtained using Bayesian Inference, Maximum Likelihood and Maximum Parsimony all support X. canopeorum as a distinct evolutionary lineage belonging to the X. caradocensis–X. friesii clade. PMID:29559828
Javadi, Firouzeh; Tun, Ye Tun; Kawase, Makoto; Guan, Kaiyun; Yamaguchi, Hirofumi
2011-08-01
The subgenus Ceratotropis in the genus Vigna is widely distributed from the Himalayan highlands to South, Southeast and East Asia. However, the interspecific and geographical relationships of its members are poorly understood. This study investigates the phylogeny and biogeography of the subgenus Ceratotropis using chloroplast DNA sequence data. Sequence data from four intergenic spacer regions (petA-psbJ, psbD-trnT, trnT-trnE and trnT-trnL) of chloroplast DNA, alone and in combination, were analysed using Bayesian and parsimony methods. Divergence times for major clades were estimated with penalized likelihood. Character evolution was examined by means of parsimony optimization and MacClade. Parsimony and Bayesian phylogenetic analyses on the combined data demonstrated well-resolved species relationships in which 18 Vigna species were divided into two major geographical clades: the East Asia-Southeast Asian clade and the Indian subcontinent clade. Within these two clades, three well-supported eco-geographical groups, temperate and subtropical (the East Asia-Southeast Asian clade) and tropical (the Indian subcontinent clade), are recognized. The temperate group consists of V. minima, V. nepalensis and V. angularis. The subtropical group comprises the V. nakashimae-V. riukiuensis-V. minima subgroup and the V. hirtella-V. exilis-V. umbellata subgroup. The tropical group contains two subgroups: the V. trinervia-V. reflexo-pilosa-V. trilobata subgroup and the V. mungo-V. grandiflora subgroup. An evolutionary rate analysis estimated the divergence time between the East Asia-Southeast Asia clade and the Indian subcontinent clade as 3·62 ± 0·3 million years, and that between the temperate and subtropical groups as 2·0 ± 0·2 million years. The findings provide an improved understanding of the interspecific relationships, and ecological and geographical phylogenetic structure of the subgenus Ceratotropis. 
The quaternary diversification of the subgenus Ceratotropis implicates its geographical dispersal in the south-eastern part of Asia involving adaptation to climatic condition after the collision of the Indian subcontinent with the Asian plate. The phylogenetic results indicate that the epigeal germination is plesiomorphic, and the germination type evolved independently multiple times in this subgenus, implying its limited taxonomic utility.
Motani, Ryosuke; Jiang, Da-Yong; Tintori, Andrea; Ji, Cheng; Huang, Jian-Dong
2017-05-17
The fossil record of a major clade often starts after a mass extinction even though evolutionary rates, molecular or morphological, suggest its pre-extinction emergence (e.g. squamates, placentals and teleosts). The discrepancy is larger for older clades, and the presence of a time-scale-dependent methodological bias has been suggested, yet it has been difficult to avoid the bias using Bayesian phylogenetic methods. This paradox raises the question of whether ecological vacancies, such as those after mass extinctions, prompt the radiations. We addressed this problem by using a unique temporal characteristic of the morphological data and a high-resolution stratigraphic record, for the oldest clade of Mesozoic marine reptiles, Ichthyosauromorpha. The evolutionary rate was fastest during the first few million years of ichthyosauromorph evolution and became progressively slower over time, eventually becoming six times slower. Using the later slower rates, estimates of divergence time become excessively older. The fast, initial rate suggests the emergence of ichthyosauromorphs after the end-Permian mass extinction, matching an independent result from high-resolution stratigraphic confidence intervals. These reptiles probably invaded the sea as a new ecosystem was formed after the end-Permian mass extinction. Lack of information on early evolution biased Bayesian clock rates. © 2017 The Author(s).
Ji, Cheng; Huang, Jian-dong
2017-01-01
The fossil record of a major clade often starts after a mass extinction even though evolutionary rates, molecular or morphological, suggest its pre-extinction emergence (e.g. squamates, placentals and teleosts). The discrepancy is larger for older clades, and the presence of a time-scale-dependent methodological bias has been suggested, yet it has been difficult to avoid the bias using Bayesian phylogenetic methods. This paradox raises the question of whether ecological vacancies, such as those after mass extinctions, prompt the radiations. We addressed this problem by using a unique temporal characteristic of the morphological data and a high-resolution stratigraphic record, for the oldest clade of Mesozoic marine reptiles, Ichthyosauromorpha. The evolutionary rate was fastest during the first few million years of ichthyosauromorph evolution and became progressively slower over time, eventually becoming six times slower. Using the later slower rates, estimates of divergence time become excessively older. The fast, initial rate suggests the emergence of ichthyosauromorphs after the end-Permian mass extinction, matching an independent result from high-resolution stratigraphic confidence intervals. These reptiles probably invaded the sea as a new ecosystem was formed after the end-Permian mass extinction. Lack of information on early evolution biased Bayesian clock rates. PMID:28515201
SIBIS: a Bayesian model for inconsistent protein sequence estimation.
Khenoussi, Walyd; Vanhoutrève, Renaud; Poch, Olivier; Thompson, Julie D
2014-09-01
The prediction of protein coding genes is a major challenge that depends on the quality of genome sequencing, the accuracy of the model used to elucidate the exonic structure of the genes and the complexity of the gene splicing process leading to different protein variants. As a consequence, today's protein databases contain a huge amount of inconsistency, due to both natural variants and sequence prediction errors. We have developed a new method, called SIBIS, to detect such inconsistencies based on the evolutionary information in multiple sequence alignments. A Bayesian framework, combined with Dirichlet mixture models, is used to estimate the probability of observing specific amino acids and to detect inconsistent or erroneous sequence segments. We evaluated the performance of SIBIS on a reference set of protein sequences with experimentally validated errors and showed that the sensitivity is significantly higher than previous methods, with only a small loss of specificity. We also assessed a large set of human sequences from the UniProt database and found evidence of inconsistency in 48% of the previously uncharacterized sequences. We conclude that the integration of quality control methods like SIBIS in automatic analysis pipelines will be critical for the robust inference of structural, functional and phylogenetic information from these sequences. Source code, implemented in C on a linux system, and the datasets of protein sequences are freely available for download at http://www.lbgi.fr/∼julie/SIBIS. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Enhancements to the Bayesian Infrasound Source Location Method
2012-09-01
Omar E. Marcillo, Stephen J. Arrowsmith, Rod W. Whitaker, and Dale N. Anderson, Los... Abstract: We report on R&D that is enabling enhancements to the Bayesian Infrasound Source Location (BISL) method for infrasound event location...
Blanquart, Samuel; Lartillot, Nicolas
2006-11-01
Variations of nucleotidic composition affect phylogenetic inference conducted under stationary models of evolution. In particular, they may cause unrelated taxa sharing similar base composition to be grouped together in the resulting phylogeny. To address this problem, we developed a nonstationary and nonhomogeneous model accounting for compositional biases. Unlike previous nonstationary models, which are branchwise, that is, assume that base composition only changes at the nodes of the tree, in our model, the process of compositional drift is totally uncoupled from the speciation events. In addition, the total number of events of compositional drift distributed across the tree is directly inferred from the data. We implemented the method in a Bayesian framework, relying on Markov Chain Monte Carlo algorithms, and applied it to several nucleotidic data sets. In most cases, the stationarity assumption was rejected in favor of our nonstationary model. In addition, we show that our method is able to resolve a well-known artifact. By Bayes factor evaluation, we compared our model with 2 previously developed nonstationary models. We show that the coupling between speciations and compositional shifts inherent to branchwise models may lead to an overparameterization, resulting in a lesser fit. In some cases, this leads to incorrect conclusions, concerning the nature of the compositional biases. In contrast, our compound model more flexibly adapts its effective number of parameters to the data sets under investigation. Altogether, our results show that accounting for nonstationary sequence evolution may require more elaborate and more flexible models than those currently used.
Kruschke, John K; Liddell, Torrin M
2018-02-01
In the practice of data analysis, there is a conceptual distinction between hypothesis testing, on the one hand, and estimation with quantified uncertainty on the other. Among frequentists in psychology, a shift of emphasis from hypothesis testing to estimation has been dubbed "the New Statistics" (Cumming 2014). A second conceptual distinction is between frequentist methods and Bayesian methods. Our main goal in this article is to explain how Bayesian methods achieve the goals of the New Statistics better than frequentist methods. The article reviews frequentist and Bayesian approaches to hypothesis testing and to estimation with confidence or credible intervals. The article also describes Bayesian approaches to meta-analysis, randomized controlled trials, and power analysis.
NASA Astrophysics Data System (ADS)
Li, Lu; Xu, Chong-Yu; Engeland, Kolbjørn
2013-04-01
Summary: With respect to model calibration, parameter estimation and analysis of uncertainty sources, various regression and probabilistic approaches are used in hydrological modeling. A family of Bayesian methods, which incorporates different sources of information into a single analysis through Bayes' theorem, is widely used for uncertainty assessment. However, none of these approaches treats the impact of high flows in hydrological modeling well. This study proposes a Bayesian modularization uncertainty assessment approach in which the highest streamflow observations are treated as suspect information that should not influence the inference of the main bulk of the model parameters. This study includes a comprehensive comparison and evaluation of uncertainty assessments by our new Bayesian modularization method and standard Bayesian methods using the Metropolis-Hastings (MH) algorithm with the daily hydrological model WASMOD. Three likelihood functions were used in combination with the standard Bayesian method: the AR(1) plus Normal model independent of time (Model 1), the AR(1) plus Normal model dependent on time (Model 2) and the AR(1) plus Multi-normal model (Model 3). The results reveal that the Bayesian modularization method provides the most accurate streamflow estimates, as measured by the Nash-Sutcliffe efficiency, and provides the best uncertainty estimates for low, medium and entire flows compared to standard Bayesian methods. The study thus provides a new approach for reducing the impact of high flows on the discharge uncertainty assessment of hydrological models via a Bayesian method.
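The Nash-Sutcliffe efficiency mentioned above has a standard definition: one minus the ratio of the squared simulation error to the variance of the observations around their mean. The following is a minimal sketch of that formula only; the function name and the toy flow values are hypothetical and are not taken from the study.

```python
from statistics import fmean


def nash_sutcliffe_efficiency(observed, simulated):
    """Return NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    if len(observed) != len(simulated) or not observed:
        raise ValueError("observed and simulated must be non-empty and equal in length")
    mean_obs = fmean(observed)
    # Squared error of the simulation against the observations.
    residual_ss = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    # Squared deviation of the observations from their own mean.
    total_ss = sum((o - mean_obs) ** 2 for o in observed)
    if total_ss == 0:
        raise ValueError("NSE is undefined when all observations are identical")
    return 1.0 - residual_ss / total_ss


# Hypothetical toy series, for illustration only.
observed_flow = [1.0, 2.0, 3.0, 4.0]
perfect = nash_sutcliffe_efficiency(observed_flow, observed_flow)
mean_only = nash_sutcliffe_efficiency(observed_flow, [2.5, 2.5, 2.5, 2.5])
```

A perfect simulation scores 1, and a simulation that always predicts the observed mean scores 0, so higher values indicate more accurate streamflow estimates.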
A total-evidence approach to dating with fossils, applied to the early radiation of the hymenoptera.
Ronquist, Fredrik; Klopfstein, Seraina; Vilhelmsen, Lars; Schulmeister, Susanne; Murray, Debra L; Rasnitsyn, Alexandr P
2012-12-01
Phylogenies are usually dated by calibrating interior nodes against the fossil record. This relies on indirect methods that, in the worst case, misrepresent the fossil information. Here, we contrast such node dating with an approach that includes fossils along with the extant taxa in a Bayesian total-evidence analysis. As a test case, we focus on the early radiation of the Hymenoptera, mostly documented by poorly preserved impression fossils that are difficult to place phylogenetically. Specifically, we compare node dating using nine calibration points derived from the fossil record with total-evidence dating based on 343 morphological characters scored for 45 fossil (4--20 complete) and 68 extant taxa. In both cases we use molecular data from seven markers (∼5 kb) for the extant taxa. Because it is difficult to model speciation, extinction, sampling, and fossil preservation realistically, we develop a simple uniform prior for clock trees with fossils, and we use relaxed clock models to accommodate rate variation across the tree. Despite considerable uncertainty in the placement of most fossils, we find that they contribute significantly to the estimation of divergence times in the total-evidence analysis. In particular, the posterior distributions on divergence times are less sensitive to prior assumptions and tend to be more precise than in node dating. The total-evidence analysis also shows that four of the seven Hymenoptera calibration points used in node dating are likely to be based on erroneous or doubtful assumptions about the fossil placement. With respect to the early radiation of Hymenoptera, our results suggest that the crown group dates back to the Carboniferous, ∼309 Ma (95% interval: 291--347 Ma), and diversified into major extant lineages much earlier than previously thought, well before the Triassic. [Bayesian inference; fossil dating; morphological evolution; relaxed clock; statistical phylogenetics.].
A Total-Evidence Approach to Dating with Fossils, Applied to the Early Radiation of the Hymenoptera
Ronquist, Fredrik; Klopfstein, Seraina; Vilhelmsen, Lars; Schulmeister, Susanne; Murray, Debra L.; Rasnitsyn, Alexandr P.
2012-01-01
Abstract Phylogenies are usually dated by calibrating interior nodes against the fossil record. This relies on indirect methods that, in the worst case, misrepresent the fossil information. Here, we contrast such node dating with an approach that includes fossils along with the extant taxa in a Bayesian total-evidence analysis. As a test case, we focus on the early radiation of the Hymenoptera, mostly documented by poorly preserved impression fossils that are difficult to place phylogenetically. Specifically, we compare node dating using nine calibration points derived from the fossil record with total-evidence dating based on 343 morphological characters scored for 45 fossil (4--20 complete) and 68 extant taxa. In both cases we use molecular data from seven markers (∼5 kb) for the extant taxa. Because it is difficult to model speciation, extinction, sampling, and fossil preservation realistically, we develop a simple uniform prior for clock trees with fossils, and we use relaxed clock models to accommodate rate variation across the tree. Despite considerable uncertainty in the placement of most fossils, we find that they contribute significantly to the estimation of divergence times in the total-evidence analysis. In particular, the posterior distributions on divergence times are less sensitive to prior assumptions and tend to be more precise than in node dating. The total-evidence analysis also shows that four of the seven Hymenoptera calibration points used in node dating are likely to be based on erroneous or doubtful assumptions about the fossil placement. With respect to the early radiation of Hymenoptera, our results suggest that the crown group dates back to the Carboniferous, ∼309 Ma (95% interval: 291--347 Ma), and diversified into major extant lineages much earlier than previously thought, well before the Triassic. [Bayesian inference; fossil dating; morphological evolution; relaxed clock; statistical phylogenetics.] PMID:22723471
Stevens, Nancy J.
2017-01-01
Throughout the Paleogene, most terrestrial carnivore niches in Afro-Arabia were occupied by Hyaenodonta, an extinct lineage of placental mammals. By the end of the Miocene, terrestrial carnivore niches had shifted to members of Carnivora, a clade with Eurasian origins. The transition from a hyaenodont-carnivore fauna to a carnivoran-carnivore fauna coincides with other ecological changes in Afro-Arabia as tectonic conditions in the African Rift System altered climatic conditions and facilitated faunal exchange with Eurasia. Fossil-bearing deposits in the Nsungwe Formation in southwestern Tanzania are precisely dated to ~25.2 Ma (late Oligocene), preserving a late Paleogene Afro-Arabian fauna on the brink of environmental transition, including the earliest fossil evidence of the split between Old World monkeys and apes. Here we describe a new hyaenodont from the Nsungwe Formation, Pakakali rukwaensis gen. et sp. nov., a bobcat-sized taxon known from a portion of the maxilla that preserves a deciduous third premolar and alveoli of dP4 and M1. The crown of dP3 bears an elongate parastyle and metastyle and a small, blade-like metacone. Based on alveolar morphology, the two more distal teeth successively increased in size and had relatively large protocones. Using a hyaenodont character-taxon matrix that includes deciduous dental characters, Bayesian phylogenetic methods resolve Pakakali within the clade Hyainailouroidea. A Bayesian biogeographic analysis of phylogenetic results resolves the Pakakali clade as Afro-Arabian in origin, demonstrating that this small carnivorous mammal was part of an endemic Afro-Arabian lineage that persisted into the Miocene. Notably, Pakakali is in the size range of carnivoran forms that arrived and began to diversify in the region by the early Miocene.
The description of Pakakali is important for exploring hyaenodont ontogeny and potential influences of Afro-Arabian tectonic events upon mammalian evolution, providing a deep time perspective on the stability of terrestrial carnivore niches through time. PMID:29020030
Multiple independent introductions of Plasmodium falciparum in South America
Yalcindag, Erhan; Elguero, Eric; Arnathau, Céline; Durand, Patrick; Akiana, Jean; Anderson, Timothy J.; Aubouy, Agnes; Balloux, François; Besnard, Patrick; Bogreau, Hervé; Carnevale, Pierre; D'Alessandro, Umberto; Fontenille, Didier; Gamboa, Dionicia; Jombart, Thibaut; Le Mire, Jacques; Leroy, Eric; Maestre, Amanda; Mayxay, Mayfong; Ménard, Didier; Musset, Lise; Newton, Paul N.; Nkoghé, Dieudonné; Noya, Oscar; Ollomo, Benjamin; Rogier, Christophe; Veron, Vincent; Wide, Albina; Zakeri, Sedigheh; Carme, Bernard; Legrand, Eric; Chevillon, Christine; Ayala, Francisco J.; Renaud, François; Prugnolle, Franck
2012-01-01
The origin of Plasmodium falciparum in South America is controversial. Some studies suggest a recent introduction during the European colonizations and the transatlantic slave trade. Other evidence—archeological and genetic—suggests a much older origin. We collected and analyzed P. falciparum isolates from different regions of the world, encompassing the distribution range of the parasite, including populations from sub-Saharan Africa, the Middle East, Southeast Asia, and South America. Analyses of microsatellite and SNP polymorphisms show that the populations of P. falciparum in South America are subdivided in two main genetic clusters (northern and southern). Phylogenetic analyses, as well as Approximate Bayesian Computation methods suggest independent introductions of the two clusters from African sources. Our estimates of divergence time between the South American populations and their likely sources favor a likely introduction from Africa during the transatlantic slave trade. PMID:22203975
Sallam, Hesham M; Seiffert, Erik R
2016-01-01
The Fayum Depression of Egypt has yielded fossils of hystricognathous rodents from multiple Eocene and Oligocene horizons that range in age from ∼37 to ∼30 Ma and document several phases in the early evolution of crown Hystricognathi and one of its major subclades, Phiomorpha. Here we describe two new genera and species of basal phiomorphs, Birkamys korai and Mubhammys vadumensis, based on rostra and maxillary and mandibular remains from the terminal Eocene (∼34 Ma) Fayum Locality 41 (L-41). Birkamys is the smallest known Paleogene hystricognath, has very simple molars, and, like derived Oligocene-to-Recent phiomorphs (but unlike contemporaneous and older taxa) apparently retained dP(4)∕4 late into life, with no evidence for P(4)∕4 eruption or formation. Mubhammys is very similar in dental morphology to Birkamys, and also shows no evidence for P(4)∕4 formation or eruption, but is considerably larger. Though parsimony analysis with all characters equally weighted places Birkamys and Mubhammys as sister taxa of extant Thryonomys to the exclusion of much younger relatives of that genus, all other methods (standard Bayesian inference, Bayesian "tip-dating," and parsimony analysis with scaled transitions between "fixed" and polymorphic states) place these species in more basal positions within Hystricognathi, as sister taxa of Oligocene-to-Recent phiomorphs. We also employ tip-dating as a means for estimating the ages of early hystricognath-bearing localities, many of which are not well-constrained by geological, geochronological, or biostratigraphic evidence. By simultaneously taking into account phylogeny, evolutionary rates, and uniform priors that appropriately encompass the range of possible ages for fossil localities, dating of tips in this Bayesian framework allows paleontologists to move beyond vague and assumption-laden "stage of evolution" arguments in biochronology to provide relatively rigorous age assessments of poorly-constrained faunas. 
This approach should become increasingly robust as estimates are combined from multiple independent analyses of distantly related clades, and is broadly applicable across the tree of life; as such it is deserving of paleontologists' close attention. Notably, in the example provided here, hystricognathous rodents from Libya and Namibia that are controversially considered to be of middle Eocene age are instead estimated to be of late Eocene and late Oligocene age, respectively. Finally, we reconstruct the evolution of first lower molar size among Paleogene African hystricognaths using a Bayesian approach; the results of this analysis reconstruct a rapid latest Eocene dwarfing event along the lineage leading to Birkamys.
Wiens, John J; Kuczynski, Caitlin A; Townsend, Ted; Reeder, Tod W; Mulcahy, Daniel G; Sites, Jack W
2010-12-01
Molecular data offer great potential to resolve the phylogeny of living taxa, but can molecular data improve our understanding of relationships of fossil taxa? Simulations suggest that this is possible, but few empirical examples have demonstrated the ability of molecular data to change the placement of fossil taxa. We offer such an example here. We analyze the placement of snakes among squamate reptiles, combining published morphological data (363 characters) and new DNA sequence data (15,794 characters, 22 nuclear loci) for 45 living and 19 fossil taxa. We find several intriguing results. First, some fossil taxa undergo major changes in their phylogenetic position when molecular data are added. Second, most fossil taxa are placed with strong support in the expected clades by the combined data Bayesian analyses, despite each having >98% missing cells and despite recent suggestions that extensive missing data are problematic for Bayesian phylogenetics. Third, morphological data can change the placement of living taxa in combined analyses, even when there is an overwhelming majority of molecular characters. Finally, we find strong but apparently misleading signal in the morphological data, seemingly associated with a burrowing lifestyle in snakes, amphisbaenians, and dibamids. Overall, our results suggest promise for an integrated and comprehensive Tree of Life by combining molecular and morphological data for living and fossil taxa.
Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F
2015-07-01
The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
Staggemeier, Vanessa Graziele; Diniz-Filho, José Alexandre Felizola; Forest, Félix; Lucas, Eve
2015-04-01
Myrcia section Aulomyrcia includes ∼120 species that are endemic to the Neotropics and disjunctly distributed in the moist Amazon and Atlantic coastal forests of Brazil. This paper presents the first comprehensive phylogenetic study of this group and this phylogeny is used as a basis to evaluate recent classification systems and to test alternative hypotheses associated with the history of this clade. Fifty-three taxa were sampled out of the 120 species currently recognized, plus 40 outgroup taxa, for one nuclear marker (ribosomal internal transcribed spacer) and four plastid markers (psbA-trnH, trnL-trnF, trnQ-rpS16 and ndhF). The relationships were reconstructed based on Bayesian and maximum likelihood analyses. Additionally, a likelihood approach, 'geographic state speciation and extinction', was used to estimate region-dependent rates of speciation, extinction and dispersal, comparing historically climatically stable areas (refugia) and unstable areas. Maximum likelihood and Bayesian inferences indicate that Myrcia and Marlierea are polyphyletic, and the internal groupings recovered are characterized by combinations of morphological characters. Phylogenetic relationships support a link between Amazonian and north-eastern species and between north-eastern and south-eastern species. Lower extinction rates within glacial refugia suggest that these areas were important in maintaining diversity in the Atlantic forest biodiversity hotspot. This study provides a robust phylogenetic framework to address important ecological questions for Myrcia s.l. within an evolutionary context, and supports the need to unite taxonomically the two traditional genera Myrcia and Marlierea in an expanded Myrcia s.l.
Furthermore, this study offers valuable insights into the diversification of plant species in the highly impacted Atlantic forest of South America; evidence is presented that the lowest extinction rates are found inside refugia and that range expansion from unstable areas contributes to the highest levels of plant diversity in the Bahian refugium. © The Author 2015. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Estimating Tree Height-Diameter Models with the Bayesian Method
Duan, Aiguo; Zhang, Jianguo; Xiang, Congwei
2014-01-01
Six candidate height-diameter models were used to analyze the height-diameter relationships. The common methods for estimating height-diameter models have taken the classical (frequentist) approach based on the frequency interpretation of probability, for example, the nonlinear least squares method (NLS) and the maximum likelihood method (ML). The Bayesian method has a distinctive advantage over the classical method in that the parameters to be estimated are regarded as random variables. In this study, the classical and Bayesian methods were each used to estimate six height-diameter models. Both the classical method and the Bayesian method showed that the Weibull model was the “best” model using data1. In addition, based on the Weibull model, data2 was used to compare the Bayesian method with informative priors against the Bayesian method with uninformative priors and the classical method. The results showed that the improvement in prediction accuracy with the Bayesian method led to narrower confidence bands of predicted values than those for the classical method, and the credible bands of parameters with informative priors were also narrower than those with uninformative priors and the classical method. The estimated posterior distributions for parameters can be set as new priors in estimating the parameters using data2. PMID:24711733
Estimating tree height-diameter models with the Bayesian method.
Zhang, Xiongqing; Duan, Aiguo; Zhang, Jianguo; Xiang, Congwei
2014-01-01
Six candidate height-diameter models were used to analyze the height-diameter relationships. The common methods for estimating height-diameter models have taken the classical (frequentist) approach based on the frequency interpretation of probability, for example, the nonlinear least squares method (NLS) and the maximum likelihood method (ML). The Bayesian method has a distinctive advantage over the classical method in that the parameters to be estimated are regarded as random variables. In this study, the classical and Bayesian methods were each used to estimate six height-diameter models. Both the classical method and the Bayesian method showed that the Weibull model was the "best" model using data1. In addition, based on the Weibull model, data2 was used to compare the Bayesian method with informative priors against the Bayesian method with uninformative priors and the classical method. The results showed that the improvement in prediction accuracy with the Bayesian method led to narrower confidence bands of predicted values than those for the classical method, and the credible bands of parameters with informative priors were also narrower than those with uninformative priors and the classical method. The estimated posterior distributions for parameters can be set as new priors in estimating the parameters using data2.
Owen, Christopher L; Marshall, David C; Hill, Kathy B R; Simon, Chris
2015-02-01
The Pauropsalta generic complex is a large group of cicadas (72 described spp.; >82 undescribed spp.) endemic to Australia. No previous molecular work on deep level relationships within this complex has been conducted, but a recent morphological revision and phylogenetic analysis proposed relationships among the 11 genera. We present here the first comprehensive molecular phylogeny of the complex using five loci (1 mtDNA, 4 nDNA), two of which are from nuclear genes new to cicada systematics. We compare the molecular phylogeny to the morphological phylogeny. We evaluate the phylogenetic informativeness of the new loci relative to traditional cicada systematics loci to generate a baseline of performance and behavior to aid in gene choice decisions in future systematic and phylogenomic studies. Our maximum likelihood and Bayesian inference phylogenies strongly support the monophyly of most of the newly described genera; however, relationships among genera differ from the morphological phylogeny. A comparison of phylogenetic informativeness among all loci revealed that COI 3rd positions dominate the informativeness profiles relative to all other loci but exhibit some among-taxon nucleotide bias. After removing COI 3rd positions, COI 1st positions dominate near the terminals, while the period intron has the most phylogenetic informativeness near the root. Among the nuclear loci, ARD1 and QtRNA have lower phylogenetic informativeness than period intron and elongation factor 1 alpha intron, but the informativeness increases as you move from the tips to the root. The increase in phylogenetic informativeness deeper in the tree suggests these loci may be useful for resolving older relationships. Copyright © 2015. Published by Elsevier Inc.
Ayres, Daniel L.; Darling, Aaron; Zwickl, Derrick J.; Beerli, Peter; Holder, Mark T.; Lewis, Paul O.; Huelsenbeck, John P.; Ronquist, Fredrik; Swofford, David L.; Cummings, Michael P.; Rambaut, Andrew; Suchard, Marc A.
2012-01-01
Abstract Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting models of evolution, these approaches remain computationally demanding. High-throughput sequencing can yield data for thousands of taxa, but scaling to such problems using serial computing often necessitates the use of nonstatistical or approximate approaches. The recent emergence of graphics processing units (GPUs) provides an opportunity to leverage their excellent floating-point computational performance to accelerate statistical phylogenetic inference. A specialized library for phylogenetic calculation would allow existing software packages to make more effective use of available computer hardware, including GPUs. Adoption of a common library would also make it easier for other emerging computing architectures, such as field programmable gate arrays, to be used in the future. We present BEAGLE, an application programming interface (API) and library for high-performance statistical phylogenetic inference. The API provides a uniform interface for performing phylogenetic likelihood calculations on a variety of compute hardware platforms. The library includes a set of efficient implementations and can currently exploit hardware including GPUs using NVIDIA CUDA, central processing units (CPUs) with Streaming SIMD Extensions and related processor supplementary instruction sets, and multicore CPUs via OpenMP. To demonstrate the advantages of a common API, we have incorporated the library into several popular phylogenetic software packages. The BEAGLE library is free open source software licensed under the Lesser GPL and available from http://beagle-lib.googlecode.com. An example client program is available as public domain software. PMID:21963610
McKenna, Duane D; Farrell, Brian D
2005-10-01
Here, we report the results of a species level phylogenetic study of Cephaloleia beetles designed to clarify relationships and patterns of host plant taxon and tissue use among species. Our study is based on up to 2088 bp of mtDNA sequence data. Maximum parsimony, maximum likelihood, and Bayesian methods of phylogenetic inference consistently recover a monophyletic Cephaloleia outside of a basal clade of primarily palm feeding species (the 'Arecaceae-feeding clade'), and C. irregularis. In all three analyses, the 'Arecaceae-feeding clade' includes Cephaloleia spp. with unusual morphological features, and a few species currently placed in other cassidine genera and tribes. All three analyses also recover a clade that includes all Zingiberales feeding Cephaloleia and most Cephaloleia species (the 'Zingiberales-feeding clade'). Two notable clades are found within the 'Zingiberales-feeding clade.' One is comprised of beetles that normally feed only on the young rolled leaves of plants in the families Heliconiaceae and Marantaceae (the 'Heliconiaceae & Marantaceae-feeding clade'). The other is comprised of relative host tissue generalist, primarily Zingiberales feeding species (the 'generalist-feeding clade'). A few species in the 'generalist-feeding clade' utilize Cyperaceae or Poaceae as hosts. Overall, relatively basal Cephaloleia (e.g., the 'Arecaceae clade') feed on relatively basal monocots (e.g., Cyclanthaceae and Arecaceae), and relatively derived Cephaloleia (e.g., the 'Zingiberales-feeding clade') feed on relatively derived monocots (mostly in the order Zingiberales). Zingiberales feeding and specialization on young rolled Zingiberales leaves have each apparently evolved just once in Cephaloleia.
Liu, Jun; Liu, Helu; Zhang, Haibin
2018-04-22
The marine mussels (Mytilidae) are distributed in the oceans worldwide and occupy various habitats with diverse life styles. However, their taxonomy and phylogeny remain unclear from genus to family level due to equivocal morphological and anatomical characters among some taxa. In this study, we inferred the deep phylogenetic relationships among 42 mytiloid species, 19 genera, and five subfamilies of the extant marine mussels by using two mitochondrial (COI and 16S rRNA) and three nuclear (18S and 28S rRNA, and histone H3) genes. Phylogeny was reconstructed with a combination of five genes using Bayesian inference and the maximum likelihood method, and divergence time was estimated for the major nodes using a relaxed clock model with three fossil calibrations. Phylogenetic trees revealed two major clades (Clades 1 and 2). In Clade 1, the deep-sea mussels (subfamily Bathymodiolinae) were sister to subfamily Modiolinae (represented by Modiolus), and this pair then clustered with Leiosolenus (subfamily Lithophaginae). Clade 2 comprised Lithophaga (Lithophaginae) and subfamily Mytilinae. Additionally, a Modiolus species and Musculus senhousia (subfamily Crenellinae) were positioned within the subfamily Mytilinae. The phylogenetic results strongly indicated monophyly of Mytilidae and Bathymodiolinae, polyphyly of Modiolinae and Lithophaginae, and paraphyly of Mytilinae. Divergence time estimation showed an ancient and gradual divergence in most mussel groups, whereas the deep-sea mussels originated recently and diverged rapidly during the Paleogene. The present study provides new insight into the evolutionary history of the marine mussels, and supports taxonomic revision for this important bivalve group. Copyright © 2018 Elsevier Inc. All rights reserved.
Phylogeny of haemosporidian blood parasites revealed by a multi-gene approach.
Borner, Janus; Pick, Christian; Thiede, Jenny; Kolawole, Olatunji Matthew; Kingsley, Manchang Tanyi; Schulze, Jana; Cottontail, Veronika M; Wellinghausen, Nele; Schmidt-Chanasit, Jonas; Bruchhaus, Iris; Burmester, Thorsten
2016-01-01
The apicomplexan order Haemosporida is a clade of unicellular blood parasites that infect a variety of reptilian, avian and mammalian hosts. Among them are the agents of human malaria, parasites of the genus Plasmodium, which pose a major threat to human health. Illuminating the evolutionary history of Haemosporida may help us in understanding their enormous biological diversity, as well as tracing the multiple host switches and associated acquisitions of novel life-history traits. However, the deep-level phylogenetic relationships among major haemosporidian clades have remained enigmatic because the datasets employed in phylogenetic analyses were severely limited in either gene coverage or taxon sampling. Using a PCR-based approach that employs a novel set of primers, we sequenced fragments of 21 nuclear genes from seven haemosporidian parasites of the genera Leucocytozoon, Haemoproteus, Parahaemoproteus, Polychromophilus and Plasmodium. After addition of genomic data from 25 apicomplexan species, the unreduced alignment comprised 20,580 bp from 32 species. Phylogenetic analyses were performed based on nucleotide, codon and amino acid data employing Bayesian inference, maximum likelihood and maximum parsimony. All analyses resulted in highly congruent topologies. We found consistent support for a basal position of Leucocytozoon within Haemosporida. In contrast to all previous studies, we recovered a sister group relationship between the genera Polychromophilus and Plasmodium. Within Plasmodium, the sauropsid and mammal-infecting lineages were recovered as sister clades. Support for these relationships was high in nearly all trees, revealing a novel phylogeny of Haemosporida, which is robust to the choice of the outgroup and the method of tree inference. Copyright © 2015 Elsevier Inc. All rights reserved.
A SAS Interface for Bayesian Analysis with WinBUGS
ERIC Educational Resources Information Center
Zhang, Zhiyong; McArdle, John J.; Wang, Lijuan; Hamagami, Fumiaki
2008-01-01
Bayesian methods are becoming very popular despite some practical difficulties in implementation. To assist in the practical application of Bayesian methods, we show how to implement Bayesian analysis with WinBUGS as part of a standard set of SAS routines. This implementation procedure is first illustrated by fitting a multiple regression model…
Groth, Katrina M.; Smith, Curtis L.; Swiler, Laura P.
2014-04-05
In the past several years, several international agencies have begun to collect data on human performance in nuclear power plant simulators [1]. This data provides a valuable opportunity to improve human reliability analysis (HRA), but these improvements will not be realized without implementation of Bayesian methods. Bayesian methods are widely used to incorporate sparse data into models in many parts of probabilistic risk assessment (PRA), but Bayesian methods have not been adopted by the HRA community. In this article, we provide a Bayesian methodology to formally use simulator data to refine the human error probabilities (HEPs) assigned by existing HRA methods. We demonstrate the methodology with a case study, wherein we use simulator data from the Halden Reactor Project to update the probability assignments from the SPAR-H method. The case study demonstrates the ability to use performance data, even sparse data, to improve existing HRA methods. Furthermore, this paper also serves as a demonstration of the value of Bayesian methods to improve the technical basis of HRA.
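As an illustration of how sparse simulator data can refine an assigned HEP, the following is a minimal sketch of a conjugate beta-binomial update. The abstract does not state which prior family the authors use, so the beta prior, the function name `update_hep`, and the example counts are all assumptions made for illustration only.

```python
def update_hep(prior_alpha, prior_beta, errors, trials):
    """Beta-binomial update of a human error probability (HEP).

    Assumption: the prior HEP is encoded as Beta(prior_alpha, prior_beta)
    and simulator runs are independent Bernoulli trials. This is a generic
    conjugate update, not necessarily the method used in the article.
    """
    if not 0 <= errors <= trials:
        raise ValueError("errors must lie between 0 and trials")
    post_alpha = prior_alpha + errors            # add observed errors
    post_beta = prior_beta + (trials - errors)   # add observed successes
    post_mean = post_alpha / (post_alpha + post_beta)
    return post_alpha, post_beta, post_mean


# Hypothetical numbers: prior mean 0.01, then 2 errors in 50 simulator runs.
alpha, beta, mean = update_hep(1, 99, errors=2, trials=50)
print(alpha, beta, mean)
```

Even a handful of simulator runs shifts the posterior mean, which is the sense in which sparse data can still inform an existing probability assignment.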
Yu, Jihyun; Nam, Bo-Hye; Yoon, Joon; Kim, Eun Bae; Park, Jung Youn; Kim, Heebal; Yoon, Sook Hee
2017-12-01
To explore the spatio-temporal dynamics of endangered fin whales (Balaenoptera physalus) within the baleen whale (Mysticeti) lineages, we analyzed 148 published mitochondrial genome sequences of baleen whales. We used a Bayesian coalescent approach as well as Bayesian inferences and maximum likelihood methods. The results showed that the fin whales had a single maternal origin, and that there is a significant correlation between geographic location and evolution of global fin whales. The most recent common female ancestor of this species lived approximately 9.88 million years ago (Mya). Here, North Pacific fin whales first appeared about 7.48 Mya, followed by a subsequent divergence in the Southern Hemisphere approximately 6.63 Mya and in the North Atlantic about 4.42 Mya. Relatively recently, approximately 1.76 and 1.42 Mya, there were two additional occurrences of North Pacific populations; one originated from the Southern Hemisphere and the other from an uncertain location. The evolutionary rate of this species was 1.002 × 10⁻³ substitutions/site/My. Our Bayesian skyline plot illustrates that the fin whale population has undergone a rapid expansion since ~2.5 Mya, during the Quaternary glaciation stage. Additionally, this study indicates that the fin whale has a sister group relationship with the humpback whale (Megaptera novaeangliae) within the baleen whale lineages. Of the 16 genomic regions, NADH5 showed the most powerful signal for baleen whale phylogenetics. Interestingly, fin whales have 16 species-specific amino acid residues in eight mitochondrial genes: NADH2, COX2, COX3, ATPase6, ATPase8, NADH4, NADH5, and Cytb.
2013-01-01
Background Protostrongylus rufescens is a metastrongyloid nematode of small ruminants, such as sheep and goats, causing protostrongylosis. In spite of its importance, the ecology and epidemiology of this parasite are not entirely understood. In addition, genetic data are scant for P. rufescens and related metastrongyloids. Methods The mt genome was amplified from a single adult worm of P. rufescens (from sheep) by long-PCR, sequenced using 454-technology and annotated using bioinformatic tools. Amino acid sequences inferred from individual genes of the mt genomes were concatenated and subjected to phylogenetic analysis using Bayesian inference. Results The circular mitochondrial genome was 13,619 bp in length and contained two ribosomal RNA, 12 protein-coding and 22 transfer RNA genes, consistent with nematodes of the order Strongylida for which mt genomes have been determined. Phylogenetic analysis of the concatenated amino acid sequence data for the 12 mt proteins showed that P. rufescens was closely related to Aelurostrongylus abstrusus, Angiostrongylus vasorum, Angiostrongylus cantonensis and Angiostrongylus costaricensis. Conclusions The mt genome determined herein provides a source of markers for future investigations of P. rufescens. Molecular tools, employing such mt markers, are likely to find applicability in studies of the population biology of this parasite and the systematics of lungworms. PMID:24025317
Chen, Zhi-Teng; Du, Yu-Zhou
2018-05-01
The complete mitochondrial genomes (mitogenomes) of Taeniopteryx ugola and Doddsia occidentalis (Plecoptera: Taeniopterygidae) are the first to be sequenced from the family Taeniopterygidae. The 15,353-bp long mitogenome of T. ugola and the 16,020-bp long mitogenome of D. occidentalis each contained 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs) and a control region (CR). The mitochondrial gene arrangement of the two taeniopterygids and other stoneflies was identical with the putative ancestral mitogenome of Drosophila yakuba. Most PCGs used standard ATN start codons and TAN termination codons. Twenty-one of the 22 tRNAs in each mitogenome could fold into the cloverleaf secondary structure, while the dihydrouridine (DHU) arm of trnSer (AGN) was reduced or absent. Stem-loop (SL) structures, a poly-T stretch, a poly-[AT]n stretch and tandem repeats were found in the CRs of the two mitogenomes. The phylogenetic analyses using Bayesian inference (BI) and maximum likelihood (ML) methods generated identical results, both supporting the monophyly of all stonefly families and the two infraorders, Systellognatha and Euholognatha. Taeniopterygidae was grouped with another two families from Euholognatha. The relationships within Plecoptera were recovered as (((Perlidae+Peltoperlidae)+((Pteronarcyidae+Chloroperlidae)+Styloperlidae))+((Capniidae+Taeniopterygidae)+Nemouridae))+Gripopterygidae. Copyright © 2017 Elsevier B.V. All rights reserved.
A comprehensive molecular phylogeny for the hornbills (Aves: Bucerotidae).
Gonzalez, Juan-Carlos T; Sheldon, Ben C; Collar, Nigel J; Tobias, Joseph A
2013-05-01
The hornbills comprise a group of morphologically and behaviorally distinct Palaeotropical bird species that feature prominently in studies of ecology and conservation biology. Although the monophyly of hornbills is well established, previous phylogenetic hypotheses were based solely on mtDNA and limited sampling of species diversity. We used parsimony, maximum likelihood and Bayesian methods to reconstruct relationships among all 61 extant hornbill species, based on nuclear and mtDNA gene sequences extracted largely from historical samples. The resulting phylogenetic trees closely match vocal variation across the family but conflict with current taxonomic treatments. In particular, they highlight a new arrangement for the six major clades of hornbills and reveal that three groups traditionally treated as genera (Tockus, Aceros, Penelopides) are non-monophyletic. In addition, two other genera (Anthracoceros, Ocyceros) were non-monophyletic in the mtDNA gene tree. Our findings resolve some longstanding problems in hornbill systematics, including the placement of 'Penelopides exharatus' (embedded in Aceros) and 'Tockus hartlaubi' (sister to Tropicranus albocristatus). We also confirm that an Asiatic lineage (Berenicornis) is sister to a trio of Afrotropical genera (Tropicranus [including 'Tockus hartlaubi'], Ceratogymna, Bycanistes). We present a summary phylogeny as a robust basis for further studies of hornbill ecology, evolution and historical biogeography. Copyright © 2013. Published by Elsevier Inc.
Genetic diversity and cross-species transmission of kobuviruses in Vietnam
Van Dung, Nguyen; Ivens, Alasdair; O’Toole, Aine; Bryant, Juliet E; Carrique-Mas, Juan; Van Cuong, Nguyen; Anh, Pham Hong; Rabaa, Maia A; Tue, Ngo Tri; Thwaites, Guy E; Baker, Stephen; Simmonds, Peter; Woolhouse, Mark Ej
2018-01-01
Abstract Cross-species transmission of viruses poses a sustained threat to public health. Due to increased contact between humans and other animal species the possibility exists for cross-species transmissions and ensuing disease outbreaks. By using conventional PCR amplification and next generation sequencing, we obtained 130 partial or full genome kobuvirus sequences from humans in a sentinel cohort in Vietnam and various mammalian hosts including bats, rodents, pigs, cats, and civets. The evolution of kobuviruses in different hosts was analysed using Bayesian phylogenetic methods. We estimated and compared time of origin of kobuviruses in different host orders; we also examined the cross-species transmission of kobuviruses within the same host order and between different host orders. Our data provide new knowledge of rodent and bat kobuviruses, which are most closely related to human kobuviruses. The novel bat kobuviruses isolated from bat roosts in Southern Vietnam were genetically distinct from previously described bat kobuviruses, but closely related to kobuviruses found in rodents. We additionally found evidence of frequent cross-species transmissions of kobuviruses within rodents. Overall, our phylogenetic analyses reveal multiple cross-species transmissions both within and among mammalian species, which increases our understanding of kobuvirus genetic diversity and the complexity of their evolutionary history. PMID:29449965
Krawczak, Felipe S; Labruna, Marcelo B; Hecht, Joy A; Paddock, Christopher D; Karpathy, Sandor E
2018-01-01
The bacterium Rickettsia bellii belongs to a basal group of rickettsiae that diverged prior to the pathogenic spotted fever group and typhus group Rickettsia species. Despite a diverse representation of R. bellii across more than 25 species of hard and soft ticks in the American continent, phylogeographical relationships among strains of this basal-group Rickettsia species are unknown; the work described here explores these relationships. DNA was extracted from 30 R. bellii tick isolates: 15 from the United States, 14 from Brazil, and 1 from Argentina. A total of 2,269 aligned nucleotide sites of 3 protein coding genes (gltA, atpA, and coxA) and 2 intergenic regions (rpmE-tRNAfmet and RC1027-xthA2) were concatenated and subjected to phylogenetic analysis by Bayesian methods. Results showed a separation of almost all isolates between North and South Americas, suggesting that they have radiated within their respective continents. Phylogenetic positions of the 30 isolates could be a result of not only their geographical origin but also the tick hosts they have coevolved with. Whether R. bellii originated with ticks in North or South America remains obscure, as our analyses did not show evidence for greater genetic divergence of R. bellii in either continent.
Yang, Caiting; Liu, An; Xu, Yusen; Xu, Yuan; Fan, Xinpeng; Al-Farraj, Saleh A; Ni, Bing; Gu, Fukang
2015-08-18
The morphology and infraciliature of four hypotrichous ciliates, Rigidohymena inquieta (Stokes, 1887) Berger, 2011, Pattersoniella vitiphila Foissner, 1987, Notohymena australis Foissner & O'Donoghue, 1990, and Cyrtohymena (Cyrtohymenides) australis (Foissner, 1995) Foissner, 2004, collected from east China, were investigated using live observation and the protargol impregnation method. An improved diagnosis for R. inquieta is supplied based on descriptions of present and previous populations. New morphological and morphogenetic information based on Chinese populations of the other three hypotrichids is also provided. The small-subunit rRNA (SSU rRNA) gene sequences of the four species were characterized and their phylogenetic positions were revealed by means of Bayesian inference and maximum-likelihood analysis. The analyses show that R. inquieta clusters with other members of the subfamily Stylonychinae, which confirms the monophyly of the subfamily and verifies R. inquieta as a species separate from R. candens, although it differs from the others mainly in body size. C. (C.) australis occupies the basal position of the clade that contains cyrtohymenids and some other groups, which argues against separating Cyrtohymena into two subgenera. Notohymena australis and the Chinese population of Pattersoniella vitiphila each cluster with their congeners, in good agreement with the systematics revealed by morphological similarities.
Watts, Joseph; Greenhill, Simon J; Atkinson, Quentin D; Currie, Thomas E; Bulbulia, Joseph; Gray, Russell D
2015-04-07
Supernatural belief presents an explanatory challenge to evolutionary theorists: it is both costly and prevalent. One influential functional explanation claims that the imagined threat of supernatural punishment can suppress selfishness and enhance cooperation. Specifically, morally concerned supreme deities or 'moralizing high gods' (MHGs) have been argued to reduce free-riding in large social groups, enabling believers to build the kind of complex societies that define modern humanity. Previous cross-cultural studies claiming to support the MHG hypothesis rely on correlational analyses only and do not correct for the statistical non-independence of sampled cultures. Here we use a Bayesian phylogenetic approach with a sample of 96 Austronesian cultures to test the MHG hypothesis as well as an alternative supernatural punishment hypothesis that allows punishment by a broad range of moralizing agents. We find evidence that broad supernatural punishment drives political complexity, whereas MHGs follow political complexity. We suggest that the concept of MHGs diffused as part of a suite of traits arising from cultural exchange between complex societies. Our results show the power of phylogenetic methods to address long-standing debates about the origins and functions of religion in human society. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Sun, Miao-Miao; Han, Liang; Zhang, Fu-Kai; Zhou, Dong-Hui; Wang, Shu-Qing; Ma, Jun; Zhu, Xing-Quan; Liu, Guo-Hua
2018-01-01
Marshallagia marshalli (Nematoda: Trichostrongylidae) infection can lead to serious parasitic gastroenteritis in sheep, goats, and wild ruminants, causing significant socioeconomic losses worldwide. To date, studies of the molecular biology of M. marshalli have been limited. Herein, we sequenced the complete mitochondrial (mt) genome of M. marshalli and examined its phylogenetic relationship with selected members of the superfamily Trichostrongyloidea using Bayesian inference (BI) based on concatenated mt amino acid sequence datasets. The complete mt genome sequence of M. marshalli is 13,891 bp, including 12 protein-coding genes, 22 transfer RNA genes, and 2 ribosomal RNA genes. All protein-coding genes are transcribed in the same direction. Phylogenetic analyses based on concatenated amino acid sequences of the 12 protein-coding genes supported the monophyly of each of the families Haemonchidae, Molineidae, and Dictyocaulidae with strong statistical support, but rejected the monophyly of the family Trichostrongylidae. The determination of the complete mt genome sequence of M. marshalli provides novel genetic markers for studying the systematics, population genetics, and molecular epidemiology of M. marshalli and its congeners.
A “Shallow Phylogeny” of Shallow Barnacles (Chthamalus)
Wares, John P.; Pankey, M. Sabrina; Pitombo, Fabio; Daglio, Liza Gómez; Achituv, Yair
2009-01-01
Background We present a multi-locus phylogenetic analysis of the shallow water (high intertidal) barnacle genus Chthamalus, focusing on member species in the western hemisphere. Understanding the phylogeny of this group improves interpretation of classical ecological work on competition, distributional changes associated with climate change, and the morphological evolution of complex cirripede phenotypes. Methodology and Findings We use traditional and Bayesian phylogenetic and ‘deep coalescent’ approaches to identify a phylogeny that supports the monophyly of the mostly American ‘fissus group’ of Chthamalus, but that also supports a need for taxonomic revision of Chthamalus and Microeuraphia. Two deep phylogeographic breaks were also found within the range of two tropical American taxa (C. angustitergum and C. southwardorum). Conclusions Our data, which include two novel gene regions for phylogenetic analysis of cirripedes, suggest that much more evaluation of the morphological evolutionary history and taxonomy of Chthamalid barnacles is necessary. These data and associated analyses also indicate that the radiation of species in the late Pliocene and Pleistocene was very rapid, and may provide new insights toward speciation via transient allopatry or ecological barriers. PMID:19440543
Defining the phylogenetic position of Amanita species from Andean Colombia.
Vargas, Natalia; Pardo-de La Hoz, Carlos José; Danies, Giovanna; Franco-Molano, Ana Esperanza; Jiménez, Pedro; Restrepo, Silvia; Grajales, Alejandro
2017-01-01
Amanita is a worldwide-distributed fungal genus, with approximately 600 known species. Most species within the genus are ectomycorrhizal (ECM), with some saprotrophic representatives. In this study, we constructed the first comprehensive phylogeny including ECM species from Colombia collected in native Quercus humboldtii forests and in introduced Pinus patula plantations. We included 8 species (A. brunneolocularis, A. colombiana, A. flavoconia, A. fuligineodisca, A. muscaria, A. rubescens, A. sororcula, and A. xylinivolva) out of 16 species reported for the country, two new reports (A. citrina and A. virosa), and a new variety, A. brunneolocularis var. pallida. Morphological taxonomic keys, together with a phylogenetic approach using three nuclear gene regions (partial nuc rDNA 28S, nuc rDNA internal transcribed spacers ITS1 and ITS2, and partial translation elongation factor 1-α gene (TEF1)), were used to classify the specimens. Several highly supported clades were obtained from the phylogenetic hypotheses obtained by Bayesian inference and maximum likelihood approaches, allowing us to position the Colombian collections in a coherent infrageneric level and to contribute to the knowledge of local Amanita diversity.
Popovkin, Alex V.; Mathews, Katherine G.; Santos, José Carlos Mendes; Molina, M. Carmen; Struwe, Lena
2011-01-01
Abstract A new species of Spigelia L. (Loganiaceae), Spigelia genuflexa Popovkin & Struwe, sp. n., from the Atlantic forest of northeastern Bahia, Brazil, is described, being the first reported geocarpic species in the family. During fruit maturation, the basal infructescences bend down towards the ground, depositing the fruit on the surface (and burying it in soft kinds of ground cover, e.g., moss), whereas the upper ones do so slightly but noticeably. The species is a short-lived annual apparently restricted to sandy-soil habitat of the Atlantic forest of northeastern Bahia, with variable and heterogeneous microenvironment and is known from only two restricted localities. A short review of amphi- and geocarpic species is provided. A discussion of comparative morphology within Spigelia with regards to dwarfism, indumentum, and annual habit is included. A phylogenetic parsimony and Bayesian analysis of ITS sequences from 15 Spigelia species plus 17 outgroups in Loganiaceae confirms its independent taxonomic status: on the basis of sequence similarity and phylogenetic topology it is phylogenetically distinct from all Spigelia species sequenced so far. PMID:22287919
Solórzano-García, Brenda; Nadler, Steven A; Pérez-Ponce de León, Gerardo
2016-10-01
Two new species of Trypanoxyuris are described from the intestine of free-ranging howler monkeys in Mexico, Trypanoxyuris multilabiatus n. sp. from the mantled howler Alouatta palliata, and Trypanoxyuris pigrae n. sp. from the black howler Alouatta pigra. An integrative taxonomic approach is followed, where conspicuous morphological traits and phylogenetic trees based on DNA sequences are used to test the validity of the two new species. The mitochondrial cytochrome oxidase subunit 1 gene, and the nuclear ribosomal 18S and 28S rRNA genes were used for evolutionary analyses, with the concatenated dataset of all three genes used for maximum likelihood and Bayesian phylogenetic analyses. The two new species of pinworms from howler monkeys were morphologically distinct and formed reciprocally monophyletic lineages in molecular phylogenetic trees. The three species from howler monkeys, T. multilabiatus n. sp., T. pigrae n. sp., and Trypanoxyuris minutus, formed a monophyletic group with high bootstrap and posterior probability support values. Phylogenetic patterns inferred from sequence data support the hypothesis of a close evolutionary association between these primate hosts and their pinworm parasites. The results suggest that the diversity of pinworm parasites from Neotropical primates might be underestimated. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Merz, Clayton; Catchen, Julian M; Hanson-Smith, Victor; Emerson, Kevin J; Bradshaw, William E; Holzapfel, Christina M
2013-01-01
Herein we tested the repeatability of phylogenetic inference based on high throughput sequencing by increased taxon sampling using our previously published techniques in the pitcher-plant mosquito, Wyeomyia smithii, in North America. We sampled 25 natural populations drawn from different localities near 21 previous collection localities and used these new data to construct a second, independent phylogeny, expressly to test the reproducibility of phylogenetic patterns. Comparison of trees between the two data sets based on both maximum parsimony and maximum likelihood with Bayesian posterior probabilities showed close correspondence in the grouping of the most southern populations into clear clades. However, discrepancies emerged, particularly in the middle of W. smithii's current range near the previous maximum extent of the Laurentide Ice Sheet, especially concerning the most recent common ancestor to mountain and northern populations. Combining all 46 populations from both studies into a single maximum parsimony tree and taking into account the post-glacial historical biogeography of associated flora provided an improved picture of W. smithii's range expansion in North America. In a more general sense, we propose that extensive taxon sampling, especially in areas of known geological disruption, is key to a comprehensive approach to phylogenetics that leads to biologically meaningful phylogenetic inference.
Kim, Sora; Kaila, Lauri; Lee, Seunghwan
2016-08-01
Phylogenetic relationships within the family Oecophoridae have been poorly understood. Consequently, the subfamily- and genus-level classifications within this family are problematic. A comprehensive phylogenetic analysis of Oecophoridae, the concealer moths, was performed based on analysis of 4444 base pairs of mitochondrial COI, nuclear ribosomal RNA genes (18S and 28S) and nuclear protein coding genes (IDH, MDH, Rps5, EF1a and wingless) for 82 taxa. Data were analyzed using maximum likelihood (ML), parsimony (MP) and Bayesian (BP) phylogenetic frameworks. Phylogenetic analyses indicated that (i) the genera Casmara, Tyrolimnas and Pseudodoxia did not belong to Oecophoridae, suggesting that Oecophoridae s. authors was not monophyletic; (ii) other oecophorids comprising two subfamilies, Pleurotinae and Oecophorinae, were nested within the same clade; and (iii) Martyringa, Acryptolechia and Periacmini were clustered with core Xyloryctidae. They appeared to be the sister lineage of core Oecophoridae. BayesTraits was used to reconstruct ancestral character states in order to infer historical microhabitat patterns and the sheltering strategy of larvae. Reconstruction of the ancestral microhabitat of oecophorids indicated that oecophorids might have evolved from dried plant feeders and further convergently specialized. The ancestral larval sheltering strategy of oecophorids might have been the use of a self-made silk tube, shifting from leaf mining. Copyright © 2016 Elsevier Inc. All rights reserved.
Bayesian Inference for Functional Dynamics Exploring in fMRI Data.
Guo, Xuan; Liu, Bing; Chen, Le; Chen, Guantao; Pan, Yi; Zhang, Jing
2016-01-01
This paper aims to review state-of-the-art Bayesian-inference-based methods applied to functional magnetic resonance imaging (fMRI) data. Particularly, we focus on one specific long-standing challenge in the computational modeling of fMRI datasets: how to effectively explore typical functional interactions from fMRI time series and the corresponding boundaries of temporal segments. Bayesian inference is a method of statistical inference which has been shown to be a powerful tool to encode dependence relationships among the variables with uncertainty. Here we provide an introduction to a group of Bayesian-inference-based methods for fMRI data analysis, which were designed to detect magnitude or functional connectivity change points and to infer their functional interaction patterns based on corresponding temporal boundaries. We also provide a comparison of three popular Bayesian models, that is, Bayesian Magnitude Change Point Model (BMCPM), Bayesian Connectivity Change Point Model (BCCPM), and Dynamic Bayesian Variable Partition Model (DBVPM), and give a summary of their applications. We envision that more delicate Bayesian inference models will be emerging and play increasingly important roles in modeling brain functions in the years to come.
Shi, Cheng-Min; Yang, Ziheng
2018-01-01
Abstract The phylogenetic relationships among extant gibbon species remain unresolved despite numerous efforts using morphological, behavioral, and genetic data and the sequencing of whole genomes. A major challenge in reconstructing the gibbon phylogeny is the radiative speciation process, which resulted in extremely short internal branches in the species phylogeny and extensive incomplete lineage sorting with extensive gene-tree heterogeneity across the genome. Here, we analyze two genomic-scale data sets, with ∼10,000 putative noncoding and exonic loci, respectively, to estimate the species tree for the major groups of gibbons. We used the Bayesian full-likelihood method bpp under the multispecies coalescent model, which naturally accommodates incomplete lineage sorting and uncertainties in the gene trees. For comparison, we included three heuristic coalescent-based methods (mp-est, SVDQuartets, and astral) as well as concatenation. From both data sets, we infer the phylogeny for the four extant gibbon genera to be (Hylobates, (Nomascus, (Hoolock, Symphalangus))). We used simulation guided by the real data to evaluate the accuracy of the methods used. Astral, while not as efficient as bpp, performed well in estimation of the species tree even in the presence of excessive incomplete lineage sorting. Concatenation, mp-est and SVDQuartets were unreliable when the species tree contains very short internal branches. A likelihood ratio test of gene flow suggests a small amount of migration from Hylobates moloch to H. pileatus, while cross-genera migration is absent or rare. Our results highlight the utility of coalescent-based methods in addressing challenging species tree problems characterized by short internal branches and rampant gene tree-species tree discordance. PMID:29087487
On Bayesian Testing of Additive Conjoint Measurement Axioms Using Synthetic Likelihood
ERIC Educational Resources Information Center
Karabatsos, George
2017-01-01
This article introduces a Bayesian method for testing the axioms of additive conjoint measurement. The method is based on an importance sampling algorithm that performs likelihood-free, approximate Bayesian inference using a synthetic likelihood to overcome the analytical intractability of this testing problem. This new method improves upon…
Power in Bayesian Mediation Analysis for Small Sample Research
Miočević, Milica; MacKinnon, David P.; Levy, Roy
2018-01-01
It was suggested that Bayesian methods have potential for increasing power in mediation analysis (Koopman, Howe, Hollenbeck, & Sin, 2015; Yuan & MacKinnon, 2009). This paper compares the power of Bayesian credibility intervals for the mediated effect to the power of normal theory, distribution of the product, percentile, and bias-corrected bootstrap confidence intervals at N ≤ 200. Bayesian methods with diffuse priors have power comparable to the distribution of the product and bootstrap methods, and Bayesian methods with informative priors had the most power. Varying degrees of precision of prior distributions were also examined. Increased precision led to greater power only when N ≥ 100 and the effects were small, N < 60 and the effects were large, and N < 200 and the effects were medium. An empirical example from psychology illustrated a Bayesian analysis of the single mediator model from prior selection to interpreting results. PMID:29662296
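The interval being compared here is for the mediated effect, which in the single mediator model is the product of two path coefficients. The sketch below shows how an equal-tailed credibility interval for that product can be read off posterior draws. The coefficient names `a` and `b`, the function name, and the simulated normal draws are assumptions for illustration; real draws would come from an MCMC fit, which is not shown.

```python
import random


def mediated_effect_interval(a_draws, b_draws, level=0.95):
    """Equal-tailed credibility interval for the product a*b.

    Assumption: a_draws[i] and b_draws[i] come from the same posterior
    draw, so multiplying them pairwise yields draws of the mediated effect.
    """
    products = sorted(a * b for a, b in zip(a_draws, b_draws))
    n = len(products)
    lower_index = int((1 - level) / 2 * n)
    upper_index = min(n - 1, int((1 + level) / 2 * n))
    return products[lower_index], products[upper_index]


# Hypothetical posterior draws standing in for MCMC output.
rng = random.Random(0)
a_draws = [rng.gauss(0.5, 0.1) for _ in range(4000)]
b_draws = [rng.gauss(0.4, 0.1) for _ in range(4000)]
lower, upper = mediated_effect_interval(a_draws, b_draws)
print(lower, upper)
```

An effect is typically declared detected when the interval excludes zero, so the power comparison in the abstract amounts to how often that happens across simulated data sets.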
Power in Bayesian Mediation Analysis for Small Sample Research.
Miočević, Milica; MacKinnon, David P; Levy, Roy
2017-01-01
It was suggested that Bayesian methods have potential for increasing power in mediation analysis (Koopman, Howe, Hollenbeck, & Sin, 2015; Yuan & MacKinnon, 2009). This paper compares the power of Bayesian credibility intervals for the mediated effect to the power of normal theory, distribution of the product, percentile, and bias-corrected bootstrap confidence intervals at N ≤ 200. Bayesian methods with diffuse priors have power comparable to the distribution of the product and bootstrap methods, and Bayesian methods with informative priors had the most power. Varying degrees of precision of prior distributions were also examined. Increased precision led to greater power only when N ≥ 100 and the effects were small, N < 60 and the effects were large, and N < 200 and the effects were medium. An empirical example from psychology illustrated a Bayesian analysis of the single mediator model from prior selection to interpreting results.
Spatio-temporal interpolation of precipitation during monsoon periods in Pakistan
NASA Astrophysics Data System (ADS)
Hussain, Ijaz; Spöck, Gunter; Pilz, Jürgen; Yu, Hwa-Lung
2010-08-01
Spatio-temporal estimation of precipitation over a region is essential to the modeling of hydrologic processes for water resources management. The changes of magnitude and space-time heterogeneity of rainfall observations make space-time estimation of precipitation a challenging task. In this paper we propose a Box-Cox transformed hierarchical Bayesian multivariate spatio-temporal interpolation method for the skewed response variable. The proposed method is applied to estimate space-time monthly precipitation in the monsoon periods during 1974-2000, and 27-year monthly average precipitation data are obtained from 51 stations in Pakistan. The results of transformed hierarchical Bayesian multivariate spatio-temporal interpolation are compared to those of non-transformed hierarchical Bayesian interpolation by using cross-validation. The software developed by [11] is used for Bayesian non-stationary multivariate space-time interpolation. It is observed that the transformed hierarchical Bayesian method provides more accuracy than the non-transformed hierarchical Bayesian method.
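The Box-Cox transformation mentioned above maps a skewed positive variable such as monthly precipitation onto a scale closer to normality before interpolation, and predictions are mapped back afterwards. A minimal sketch of the one-parameter transform and its inverse follows; the function names and the example value of the exponent are assumptions, and the abstract does not say how the authors chose or estimated the exponent.

```python
import math


def box_cox(y, lam):
    """One-parameter Box-Cox transform of a positive value y."""
    if y <= 0:
        raise ValueError("Box-Cox requires y > 0")
    if lam == 0:
        return math.log(y)
    return (y ** lam - 1.0) / lam


def inverse_box_cox(z, lam):
    """Map a transformed value z back to the original scale."""
    if lam == 0:
        return math.exp(z)
    return (lam * z + 1.0) ** (1.0 / lam)


# Hypothetical monthly precipitation in mm and an illustrative exponent.
rain_mm = 7.3
z = box_cox(rain_mm, 0.25)
print(z, inverse_box_cox(z, 0.25))
```

Zero-rainfall months need separate handling (for example an offset), since the transform is defined only for positive values.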
Harrison, Luke B; Larsson, Hans C E
2015-03-01
Likelihood-based methods are commonplace in phylogenetic systematics. Although much effort has been directed toward likelihood-based models for molecular data, comparatively less work has addressed models for discrete morphological character (DMC) data. Among-character rate variation (ACRV) may confound phylogenetic analysis, but there have been few analyses of the magnitude and distribution of rate heterogeneity among DMCs. Using 76 data sets covering a range of plants, invertebrate, and vertebrate animals, we used a modified version of MrBayes to test equal, gamma-distributed and lognormally distributed models of ACRV, integrating across phylogenetic uncertainty using Bayesian model selection. We found that in approximately 80% of data sets, unequal-rates models outperformed equal-rates models, especially among larger data sets. Moreover, although most data sets were equivocal, more data sets favored the lognormal rate distribution relative to the gamma rate distribution, lending some support for more complex character correlations than in molecular data. Parsimony estimation of the underlying rate distributions in several data sets suggests that the lognormal distribution is preferred when there are many slowly evolving characters and fewer quickly evolving characters. The commonly adopted four rate category discrete approximation used for molecular data was found to be sufficient to approximate a gamma rate distribution with discrete characters. However, among the two data sets tested that favored a lognormal rate distribution, the continuous distribution was better approximated with at least eight discrete rate categories. Although the effect of rate model on the estimation of topology was difficult to assess across all data sets, it appeared relatively minor between the unequal-rates models for the one data set examined carefully. 
As in molecular analyses, we argue that researchers should test and adopt the most appropriate model of rate variation for the data set in question. As discrete characters are increasingly used in more sophisticated likelihood-based phylogenetic analyses, it is important that these studies be built on the most appropriate and carefully selected underlying models of evolution. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
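To make the idea of a discrete rate-category approximation concrete, the following sketch discretizes a lognormal rate distribution into k equal-probability categories. It assumes the median of each category is used as its rate and that rates are rescaled to a mean of 1; these choices, the function name and sigma = 1.0 are illustrative assumptions, not the procedure of the modified MrBayes used in the study.

```python
import math
from statistics import NormalDist

def lognormal_rate_categories(sigma, k):
    """Return k equal-probability rate categories for a lognormal
    distribution, taking the median of each category and rescaling
    so that the mean rate across categories is 1."""
    if k < 1 or sigma <= 0:
        raise ValueError("need k >= 1 and sigma > 0")
    std_normal = NormalDist()
    # The median of category i sits at quantile (2i + 1) / (2k).
    medians = [
        math.exp(sigma * std_normal.inv_cdf((2 * i + 1) / (2 * k)))
        for i in range(k)
    ]
    mean_rate = sum(medians) / k
    return [r / mean_rate for r in medians]

four = lognormal_rate_categories(sigma=1.0, k=4)
eight = lognormal_rate_categories(sigma=1.0, k=8)
```

With more categories the fastest category reaches further into the long right tail of the lognormal, which is one plausible reason a heavy-tailed distribution may need more categories to be approximated well.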
A Comparison of the β-Substitution Method and a Bayesian Method for Analyzing Left-Censored Data
Huynh, Tran; Quick, Harrison; Ramachandran, Gurumurthy; Banerjee, Sudipto; Stenzel, Mark; Sandler, Dale P.; Engel, Lawrence S.; Kwok, Richard K.; Blair, Aaron; Stewart, Patricia A.
2016-01-01
Classical statistical methods for analyzing exposure data with values below the detection limits are well described in the occupational hygiene literature, but an evaluation of a Bayesian approach for handling such data is currently lacking. Here, we first describe a Bayesian framework for analyzing censored data. We then present the results of a simulation study conducted to compare the β-substitution method with a Bayesian method for exposure datasets drawn from lognormal distributions and mixed lognormal distributions with varying sample sizes, geometric standard deviations (GSDs), and censoring for single and multiple limits of detection. For each set of factors, estimates for the arithmetic mean (AM), geometric mean (GM), GSD, and the 95th percentile (X0.95) of the exposure distribution were obtained. We evaluated the performance of each method using relative bias, the root mean squared error (rMSE), and coverage (the proportion of the computed 95% uncertainty intervals containing the true value). The Bayesian method using non-informative priors and the β-substitution method were generally comparable in bias and rMSE when estimating the AM and GM. For the GSD and the 95th percentile, the Bayesian method with non-informative priors was more biased and had a higher rMSE than the β-substitution method, but use of more informative priors generally improved the Bayesian method’s performance, making both the bias and the rMSE more comparable to the β-substitution method. An advantage of the Bayesian method is that it provided estimates of uncertainty for these parameters of interest and good coverage, whereas the β-substitution method only provided estimates of uncertainty for the AM, and coverage was not as consistent. Selection of one or the other method depends on the needs of the practitioner, the availability of prior information, and the distribution characteristics of the measurement data. 
We suggest the use of Bayesian methods if the practitioner has the computational resources and prior information, as the method would generally provide accurate estimates and also provides the distributions of all of the parameters, which could be useful for making decisions in some applications. PMID:26209598
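The three performance measures named in this abstract (relative bias, rMSE and coverage) can be sketched as follows. The formulas are the common textbook definitions and may differ in scaling from those used in the study; the simulated estimates and intervals are hypothetical.

```python
import math

def relative_bias(estimates, truth):
    """Mean estimate minus the true value, as a fraction of the true value."""
    mean_est = sum(estimates) / len(estimates)
    return (mean_est - truth) / truth

def rmse(estimates, truth):
    """Root mean squared error of the estimates around the true value."""
    return math.sqrt(sum((e - truth) ** 2 for e in estimates) / len(estimates))

def coverage(intervals, truth):
    """Proportion of (lower, upper) intervals that contain the true value."""
    hits = sum(1 for lower, upper in intervals if lower <= truth <= upper)
    return hits / len(intervals)

# Hypothetical output of four simulation replicates for the arithmetic mean.
true_am = 2.0
am_estimates = [1.8, 2.1, 2.3, 1.9]
am_intervals = [(1.2, 2.5), (1.6, 2.8), (2.1, 3.0), (1.1, 2.4)]

bias = relative_bias(am_estimates, true_am)
error = rmse(am_estimates, true_am)
cover = coverage(am_intervals, true_am)
```

In a full simulation study these would be computed per method and per combination of sample size, GSD and censoring level.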
Exact Bayesian Inference for Phylogenetic Birth-Death Models.
Parag, K V; Pybus, O G
2018-04-26
Inferring the rates of change of a population from a reconstructed phylogeny of genetic sequences is a central problem in macro-evolutionary biology, epidemiology, and many other disciplines. A popular solution involves estimating the parameters of a birth-death process (BDP), which links the shape of the phylogeny to its birth and death rates. Modern BDP estimators rely on random Markov chain Monte Carlo (MCMC) sampling to infer these rates. Such methods, while powerful and scalable, cannot be guaranteed to converge, leading to results that may be hard to replicate or difficult to validate. We present a conceptually and computationally different parametric BDP inference approach using flexible and easy-to-implement Snyder filter (SF) algorithms. This method is deterministic so its results are provable, guaranteed, and reproducible. We validate the SF on constant rate BDPs and find that it solves BDP likelihoods known to produce robust estimates. We then examine more complex BDPs with time-varying rates. Our estimates compare well with a recently developed parametric MCMC inference method. Lastly, we perform model selection on an empirical Agamid species phylogeny, obtaining results consistent with the literature. The SF makes no approximations, beyond those required for parameter quantisation and numerical integration, and directly computes the posterior distribution of model parameters. It is a promising alternative inference algorithm that may serve either as a standalone Bayesian estimator or as a useful diagnostic reference for validating more involved MCMC strategies. The Snyder filter is implemented in Matlab and the time-varying BDP models are simulated in R. The source code and data are freely available at https://github.com/kpzoo/snyder-birth-death-code. Contact: kris.parag@zoo.ox.ac.uk. Supplementary material is available at Bioinformatics online.
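The notion of a deterministic posterior computed over a quantised parameter can be illustrated with a much simpler stand-in. The sketch below is not the Snyder filter and is not taken from the authors' repository; it assumes a pure-birth (Yule) process, a flat prior over a fixed grid of birth rates, and hypothetical waiting times between branching events.

```python
import math

def yule_grid_posterior(waiting_times, start_lineages, grid):
    """Posterior over a quantised birth rate for a pure-birth process.

    With k lineages present, the waiting time to the next branching
    event is exponential with rate k * lam. A flat prior over the grid
    is assumed, so the posterior is the normalised likelihood."""
    log_post = []
    for lam in grid:
        log_lik = 0.0
        k = start_lineages
        for t in waiting_times:
            log_lik += math.log(k * lam) - k * lam * t
            k += 1  # each branching event adds one lineage
        log_post.append(log_lik)
    peak = max(log_post)  # subtract the peak for numerical stability
    weights = [math.exp(lp - peak) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical inputs: a grid of rates 0.01..5.00 and five waiting times.
grid = [0.01 * i for i in range(1, 501)]
waiting_times = [0.50, 0.30, 0.20, 0.25, 0.10]

post = yule_grid_posterior(waiting_times, start_lineages=2, grid=grid)
post_mean = sum(lam * p for lam, p in zip(grid, post))
map_rate = grid[post.index(max(post))]
```

Because no random sampling is involved, repeated runs give identical results; the only approximation is the resolution and range of the grid.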
Martínez-Azorín, Mario; Crespo, Manuel B.; Juan, Ana; Fay, Michael F.
2011-01-01
Background and Aims The taxonomic arrangement within subfamily Ornithogaloideae (Hyacinthaceae) has been a matter of controversy in recent decades: several new taxonomic treatments have been proposed, based exclusively on plastid DNA sequences, and these have resulted in classifications which are to a great extent contradictory. Some authors have recognized only a single genus Ornithogalum for the whole subfamily, including 250–300 species of variable morphology, whereas others have recognized many genera. In the latter case, the genera are inevitably much smaller and they are better defined morphologically. However, some are not monophyletic as circumscribed. Methods Phylogenetic analyses of Ornithogaloideae were based on nucleotide sequences of four plastid regions (trnL intron, trnL-F spacer, rbcL and matK) and a nuclear region (ITS). Eighty species covering all relevant taxonomic groups previously recognized in the subfamily were sampled. Parsimony and Bayesian analyses were performed. The molecular data were compared with a matrix of 34 morphological characters. Key Results Combinations of plastid and nuclear data yielded phylogenetic trees which are better resolved than those obtained with any plastid region alone or plastid regions in combination. Three main clades are found, corresponding to the previously recognized tribes Albuceae, Dipcadieae and Ornithogaleae. In these, up to 19 clades are described which are definable by morphology and biogeography. These mostly correspond to previously described taxa, though some need recircumscription. Morphological characters are assessed for their diagnostic value for taxonomy in the subfamily. 
Conclusions On the basis of the phylogenetic analyses, 19 monophyletic genera are accepted within Ornithogaloideae: Albuca, Avonsera, Battandiera, Cathissa, Coilonox, Dipcadi, Eliokarmos, Elsiea, Ethesia, Galtonia, Honorius, Loncomelos, Melomphis, Neopatersonia, Nicipe, Ornithogalum, Pseudogaltonia, Stellarioides and Trimelopter. Each of these has a particular syndrome of morphological characters. As a result, 105 new combinations are made and two new names are proposed to accommodate the taxa studied in the new arrangement. A short morphological diagnosis, synonymy, details of distribution and an identification key are presented. PMID:21163815
Appelhans, M S; Smets, E; Razafimandimbison, S G; Haevermans, T; van Marle, E J; Couloux, A; Rabarison, H; Randrianarivelojosia, M; Kessler, P J A
2011-06-01
The Spathelia-Ptaeroxylon clade is a group of morphologically diverse plants that have been classified together as a result of molecular phylogenetic studies. The clade is currently included in Rutaceae and recognized at a subfamilial level (Spathelioideae) despite the fact that most of its genera have traditionally been associated with other families and that there are no obvious morphological synapomorphies for the clade. The aim of the present study is to construct phylogenetic trees for the Spathelia-Ptaeroxylon clade and to investigate anatomical characters in order to decide whether it should be kept in Rutaceae or recognized at the familial level. Anatomical characters were plotted on a cladogram to help explain character evolution within the group. Moreover, phylogenetic relationships and generic limits within the clade are also addressed. A species-level phylogenetic analysis of the Spathelia-Ptaeroxylon clade based on five plastid DNA regions (rbcL, atpB, trnL-trnF, rps16 and psbA-trnH) was conducted using Bayesian, maximum parsimony and maximum likelihood methods. Leaf and seed anatomical characters of all genera were (re)investigated by light and scanning electron microscopy. With the exception of Spathelia, all genera of the Spathelia-Ptaeroxylon clade are monophyletic. The typical leaf and seed anatomical characters of Rutaceae were found. Further, the presence of oil cells in the leaves provides a possible synapomorphy for the clade. The Spathelia-Ptaeroxylon clade is well placed in Rutaceae and it is reasonable to unite the genera into one subfamily (Spathelioideae). We propose a new tribal classification of Spathelioideae. A narrow circumscription of Spathelia is established to make the genus monophyletic, and Sohnreyia is resurrected to accommodate the South American species of Spathelia. 
The most recent common ancestor of Spathelioideae probably had leaves with secretory cavities and oil cells, haplostemonous flowers with appendaged staminal filaments, and a tracheidal tegmen.
Li, Qin-Qin; Zhou, Song-Dong; He, Xing-Jin; Yu, Yan; Zhang, Yu-Cheng; Wei, Xian-Qin
2010-01-01
Background and Aims The genus Allium comprises more than 800 species, placing it among the largest monocotyledonous genera. It is a variable group that is spread widely across the Holarctic region. Previous studies of Allium have been useful in identifying and assessing its evolutionary lineages. However, there are still many gaps in our knowledge of infrageneric taxonomy and evolution of Allium. Further understanding of its phylogeny and biogeography will be achieved only through continued phylogenetic studies, especially of those species endemic to China that have often been excluded from previous analyses. Earlier molecular studies have shown that Chinese Allium is not monophyletic, so the goal of the present study was to infer the phylogeny and biogeography of Allium and to provide a classification of Chinese Allium by placement of Chinese species in the context of the entire phylogeny. Methods Phylogenetic studies were based on sequence data of the nuclear ribosomal internal transcribed spacer (ITS) and chloroplast rps16 intron, analysed using parsimony and Bayesian approaches. Biogeographical analyses were conducted using statistical dispersal–vicariance analysis (S-DIVA). Key Results Phylogenetic analyses indicate that Allium is monophyletic and consists of three major clades. Optimal reconstructions have favoured the ancestors of Amerallium, Anguinum, Vvedenskya, Porphyroprason and Melanocrommyum as originating in eastern Asia. Conclusions Phylogenetic analyses reveal that Allium is monophyletic but that some subgenera are not. The large genetic distances imply that Allium is of ancient origin. Molecular data suggest that its evolution proceeded along three separate evolutionary lines. S-DIVA indicates that the ancestor of Amerallium, Anguinum, Vvedenskya, Porphyroprason and Melanocrommyum originated from eastern Asia and underwent different biogeographical pathways. 
A taxonomic synopsis of Chinese Allium at sectional level is given, which divides Chinese Allium into 13 subgenera and 34 sections. PMID:20966186
Phylogeny and Evolutionary Patterns in the Dwarf Crayfish Subfamily (Decapoda: Cambarellinae)
Pedraza-Lara, Carlos; Doadrio, Ignacio; Breinholt, Jesse W.; Crandall, Keith A.
2012-01-01
The dwarf crayfish, or Cambarellinae, is a morphologically singular subfamily of decapod crustaceans that contains only one genus, Cambarellus. Its intriguing distribution, along the river basins of the Gulf Coast of the United States (Gulf Group) and into Central México (Mexican Group), has until now lacked a satisfactory explanation. This study provides a comprehensive sampling of most of the extant species of Cambarellus and sheds light on its evolutionary history, systematics and biogeography. We tested the impact of Gulf Group versus Mexican Group geography on rates of cladogenesis using a maximum likelihood framework, testing different models of birth/extinction of lineages. We propose a comprehensive phylogenetic hypothesis for the subfamily based on mitochondrial and nuclear loci (3,833 bp) using Bayesian and Maximum Likelihood methods. The phylogenetic structure found two phylogenetic groups associated with the two main geographic components (Gulf Group and Mexican Group) and is partially consistent with the historical structure of river basins. The previous hypothesis, which divided the genus into three subgenera based on genitalia morphology, was only partially supported (P = 0.047), resulting in a paraphyletic subgenus Pandicambarus. We found at least two cases in which phylogenetic structure failed to recover monophyly of recognized species while detecting several cases of cryptic diversity, corresponding to lineages not assigned to any described species. Cladogenetic patterns in the entire subfamily are better explained by an allopatric model of speciation. Diversification analyses showed similar cladogenesis patterns between both groups and did not significantly differ from the constant rate models. While cladogenesis in the Gulf Group is coincident in time with changes in the sea levels, in the Mexican Group, cladogenesis is congruent with the formation of the Trans-Mexican Volcanic Belt. 
Our results show how similar allopatric divergence in freshwater organisms can be promoted through diverse vicariant factors. PMID:23155379
2014-01-01
Background Limited available sequence information has greatly impeded population genetics, phylogenetics and systematics studies in the subclass Acari (mites and ticks). Mitochondrial (mt) DNA is well known to provide genetic markers for investigations in these areas, but complete mt genomic data have been lacking for many Acari species. Herein, we present the complete mt genome of the scab mite Psoroptes cuniculi. Methods P. cuniculi was collected from a naturally infected New Zealand white rabbit from China and identified by morphological criteria. The complete mt genome of P. cuniculi was amplified by PCR and then sequenced. The relationships of this scab mite with selected members of the Acari were assessed by phylogenetic analysis of concatenated amino acid sequence datasets by Bayesian inference (BI), maximum likelihood (ML) and maximum parsimony (MP). Results This mt genome (14,247 bp) is circular and consists of 37 genes: 13 protein-coding genes, 22 tRNA genes and 2 rRNA genes. The gene arrangement in the mt genome of P. cuniculi is the same as those of Dermatophagoides farinae (Pyroglyphidae) and Aleuroglyphus ovatus (Acaridae), but distinct from those of Steganacarus magnus (Steganacaridae) and Panonychus citri (Tetranychidae). Phylogenetic analyses using concatenated amino acid sequences of 12 protein-coding genes, with three different computational algorithms (BI, ML and MP), showed the division of subclass Acari into two superorders and supported the monophyly of both superorders (Parasitiformes and Acariformes) and of three orders (Ixodida, Mesostigmata and Astigmata), but rejected the monophyly of the order Prostigmata. Conclusions The mt genome of P. cuniculi represents the first mt genome of any member of the family Psoroptidae. Analysis of mt genome sequences in the present study has provided new insights into the phylogenetic relationships among several major lineages of Acari species. PMID:25052180
Kitahara, Marcelo V.; Cairns, Stephen D.; Stolarski, Jarosław; Blair, David; Miller, David J.
2010-01-01
Background Classical morphological taxonomy places the approximately 1400 recognized species of Scleractinia (hard corals) into 27 families, but many aspects of coral evolution remain unclear despite the application of molecular phylogenetic methods. In part, this may be a consequence of such studies focusing on the reef-building (shallow water and zooxanthellate) Scleractinia, and largely ignoring the large number of deep-sea species. To better understand broad patterns of coral evolution, we generated molecular data for a broad and representative range of deep sea scleractinians collected off New Caledonia and Australia during the last decade, and conducted the most comprehensive molecular phylogenetic analysis to date of the order Scleractinia. Methodology Partial (595 bp) sequences of the mitochondrial cytochrome oxidase subunit 1 (CO1) gene were determined for 65 deep-sea (azooxanthellate) scleractinians and 11 shallow-water species. These new data were aligned with 158 published sequences, generating a 234 taxon dataset representing 25 of the 27 currently recognized scleractinian families. Principal Findings/Conclusions There was a striking discrepancy between the taxonomic validity of coral families consisting predominantly of deep-sea or shallow-water species. Most families composed predominantly of deep-sea azooxanthellate species were monophyletic in both maximum likelihood and Bayesian analyses but, by contrast (and consistent with previous studies), most families composed predominantly of shallow-water zooxanthellate taxa were polyphyletic, although Acroporidae, Poritidae, Pocilloporidae, and Fungiidae were exceptions to this general pattern. One factor contributing to this inconsistency may be the greater environmental stability of deep-sea environments, effectively removing taxonomic “noise” contributed by phenotypic plasticity. 
Our phylogenetic analyses imply that the most basal extant scleractinians are azooxanthellate solitary corals from deep-water, their divergence predating that of the robust and complex corals. Deep-sea corals are likely to be critical to understanding anthozoan evolution and the origins of the Scleractinia. PMID:20628613
Evolution of exceptional species richness among lineages of fleshy-fruited Myrtaceae
Biffin, Ed; Lucas, Eve J.; Craven, Lyn A.; Ribeiro da Costa, Itayguara; Harrington, Mark G.; Crisp, Michael D.
2010-01-01
Background and Aims The angiosperm family Myrtaceae comprises 17 tribes with more than half of the estimated 5500 species being referred to the fleshy-fruited and predominantly rainforest associated Syzygieae and Myrteae. Previous studies suggest that fleshy fruits have evolved separately in these lineages, whereas, more generally, shifts in fruit morphology have been variously implicated in diversification rate shifts among angiosperms. A phylogenetic hypothesis and estimated divergence times for Myrtaceae are developed as a basis to explore the evidence for, and drivers of, elevated diversification rates among the fleshy-fruited tribes of Myrtaceae. Methods Bayesian phylogenetic analyses of plastid and nuclear DNA sequences were used to estimate intertribal relationships and lineage divergence times in Myrtaceae. Focusing on the fleshy-fruited tribes, a variety of statistical approaches were used to assess diversification rates and diversification rate shifts across the family. Key Results Analyses of the sequence data provide a strongly supported phylogenetic hypothesis for Myrtaceae. Relative to previous studies, substantially younger ages for many of the clades are reported, and it is argued that the use of flexible calibrations to incorporate fossil data provides more realistic divergence estimates than the use of errorless point calibrations. It is found that Syzygieae and Myrteae have experienced elevated diversification rates relative to other lineages of Myrtaceae. Positive shifts in diversification rate have occurred separately in each lineage, associated with a shift from dry to fleshy fruit. Conclusions Fleshy fruits have evolved independently in Syzygieae and Myrteae, and this is accompanied by exceptional diversification rate shifts in both instances, suggesting that the evolution of fleshy fruits is a key innovation for rainforest Myrtaceae. 
Noting the scale dependency of this hypothesis, more complex explanations may be required to explain diversification rate shifts occurring within the fleshy-fruited tribes, and the suggested phylogenetic hypothesis provides an appropriate framework for this undertaking. PMID:20462850
Bayesian flood forecasting methods: A review
NASA Astrophysics Data System (ADS)
Han, Shasha; Coulibaly, Paulin
2017-08-01
Over the past few decades, floods have been seen as one of the most common and widely distributed natural disasters in the world. If floods could be accurately forecast in advance, their negative impacts could be greatly reduced. It is widely recognized that quantification and reduction of the uncertainty associated with hydrologic forecasts is of great importance for flood estimation and rational decision making. The Bayesian forecasting system (BFS) offers an ideal theoretical framework for uncertainty quantification that can be developed for probabilistic flood forecasting via any deterministic hydrologic model. It provides a suitable theoretical structure, empirically validated models and a reasonable analytic-numerical computation method, and can be developed into various Bayesian forecasting approaches. This paper presents a comprehensive review of Bayesian forecasting approaches applied in flood forecasting from 1999 to the present. The review starts with an overview of the fundamentals of BFS and recent advances in BFS, followed by BFS applications in river stage forecasting and real-time flood forecasting; it then moves to a critical analysis evaluating the advantages and limitations of Bayesian forecasting methods and other predictive uncertainty assessment approaches in flood forecasting, and finally discusses future research directions in Bayesian flood forecasting. Results show that the Bayesian flood forecasting approach is an effective and advanced way to estimate floods: it considers all sources of uncertainty and produces a predictive distribution of the river stage, river discharge or runoff, and thus gives more accurate and reliable flood forecasts. Some emerging Bayesian forecasting methods (e.g. ensemble Bayesian forecasting system, Bayesian multi-model combination) were shown to overcome the limitations of a single model or fixed model weights and to effectively reduce predictive uncertainty. 
In recent years, various Bayesian flood forecasting approaches have been developed and widely applied, but there is still room for improvements. Future research in the context of Bayesian flood forecasting should be on assimilation of various sources of newly available information and improvement of predictive performance assessment methods.
Papasotiropoulos, Vasilis; Klossa-Kilia, Elena; Alahiotis, Stamatis N; Kilias, George
2007-08-01
Mitochondrial DNA sequence analysis has been used to explore genetic differentiation and phylogenetic relationships among five species of the Mugilidae family, Mugil cephalus, Chelon labrosus, Liza aurata, Liza ramada, and Liza saliens. DNA was isolated from samples originating from the Messolongi Lagoon in Greece. Three mtDNA segments (12s rRNA, 16s rRNA, and CO I) were PCR amplified and sequenced. Sequencing analysis revealed that the greatest genetic differentiation was observed between M. cephalus and all the other species studied, while C. labrosus and L. aurata were the closest taxa. Dendrograms obtained by the neighbor-joining method and Bayesian inference analysis exhibited the same topology. According to this topology, M. cephalus is the most distinct species and the remaining taxa are clustered together, with C. labrosus and L. aurata forming a single group. The latter result brings into question the monophyletic origin of the genus Liza.
Hu, Chao; Tian, Huaizhen; Li, Hongqing; Hu, Aiqun; Xing, Fuwu; Bhattacharjee, Avishek; Hsu, Tianchuan; Kumar, Pankaj; Chung, Shihwen
2016-01-01
A molecular phylogeny of Asiatic species of Goodyera (Orchidaceae, Cranichideae, Goodyerinae) based on the nuclear ribosomal internal transcribed spacer (ITS) region and two chloroplast loci (matK and trnL-F) is presented. Thirty-five species represented by 132 samples of Goodyera were analyzed, along with 48 species from 27 other genera, using Pterostylis longifolia and Chloraea gaudichaudii as outgroups. Bayesian inference, maximum parsimony and maximum likelihood methods were used to reveal the intrageneric relationships of Goodyera and its intergeneric relationships to related genera. The results indicate that: 1) Goodyera is not monophyletic; 2) Goodyera could be divided into four sections, viz., Goodyera, Otosepalum, Reticulum and a new section; 3) sect. Reticulum can be further divided into two subsections, viz., Reticulum and Foliosum, whereas sect. Goodyera can in turn be divided into subsections Goodyera and a new subsection. PMID:26927946
Ghosh, Sujit K
2010-01-01
Bayesian methods are rapidly becoming popular tools for making statistical inference in various fields of science including biology, engineering, finance, and genetics. One of the key aspects of Bayesian inferential method is its logical foundation that provides a coherent framework to utilize not only empirical but also scientific information available to a researcher. Prior knowledge arising from scientific background, expert judgment, or previously collected data is used to build a prior distribution which is then combined with current data via the likelihood function to characterize the current state of knowledge using the so-called posterior distribution. Bayesian methods allow the use of models of complex physical phenomena that were previously too difficult to estimate (e.g., using asymptotic approximations). Bayesian methods offer a means of more fully understanding issues that are central to many practical problems by allowing researchers to build integrated models based on hierarchical conditional distributions that can be estimated even with limited amounts of data. Furthermore, advances in numerical integration methods, particularly those based on Monte Carlo methods, have made it possible to compute the optimal Bayes estimators. However, there is a reasonably wide gap between the background of the empirically trained scientists and the full weight of Bayesian statistical inference. Hence, one of the goals of this chapter is to bridge the gap by offering elementary to advanced concepts that emphasize linkages between standard approaches and full probability modeling via Bayesian methods.
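The prior-to-posterior step described above can be shown with the simplest conjugate case. This sketch assumes a Beta prior on a success probability and binomial data; the prior parameters and counts are hypothetical and are not taken from the chapter.

```python
def beta_binomial_update(alpha, beta, successes, failures):
    """Combine a Beta(alpha, beta) prior with binomial data.

    Because the Beta prior is conjugate to the binomial likelihood,
    the posterior is Beta(alpha + successes, beta + failures)."""
    return alpha + successes, beta + failures

def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return alpha / (alpha + beta)

# Hypothetical prior knowledge: Beta(2, 2), centred on 0.5.
prior = (2.0, 2.0)

# Hypothetical current data: 7 successes and 3 failures.
posterior = beta_binomial_update(*prior, successes=7, failures=3)

prior_mean = beta_mean(*prior)
posterior_mean = beta_mean(*posterior)
```

The posterior mean (9/14) lies between the prior mean (0.5) and the observed proportion (0.7), which is the sense in which prior information and current data are combined.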
Natanegara, Fanni; Neuenschwander, Beat; Seaman, John W; Kinnersley, Nelson; Heilmann, Cory R; Ohlssen, David; Rochester, George
2014-01-01
Bayesian applications in medical product development have recently gained popularity. Despite many advances in Bayesian methodology and computations, increase in application across the various areas of medical product development has been modest. The DIA Bayesian Scientific Working Group (BSWG), which includes representatives from industry, regulatory agencies, and academia, has adopted the vision to ensure Bayesian methods are well understood, accepted more broadly, and appropriately utilized to improve decision making and enhance patient outcomes. As Bayesian applications in medical product development are wide ranging, several sub-teams were formed to focus on various topics such as patient safety, non-inferiority, prior specification, comparative effectiveness, joint modeling, program-wide decision making, analytical tools, and education. The focus of this paper is on the recent effort of the BSWG Education sub-team to administer a Bayesian survey to statisticians across 17 organizations involved in medical product development. We summarize results of this survey, from which we provide recommendations on how to accelerate progress in Bayesian applications throughout medical product development. The survey results support findings from the literature and provide additional insight on regulatory acceptance of Bayesian methods and information on the need for a Bayesian infrastructure within an organization. The survey findings support the claim that only modest progress in areas of education and implementation has been made recently, despite substantial progress in Bayesian statistical research and software availability. Copyright © 2013 John Wiley & Sons, Ltd.
Paleogene Radiation of a Plant Pathogenic Mushroom
Coetzee, Martin P. A.; Bloomer, Paulette; Wingfield, Michael J.; Wingfield, Brenda D.
2011-01-01
Background The global movement and speciation of fungal plant pathogens is important, especially because of the economic losses they cause and the ease with which they are able to spread across large areas. Understanding the biogeography and origin of these plant pathogens can provide insights regarding their dispersal and current day distribution. We tested the hypothesis of a Gondwanan origin of the plant pathogenic mushroom genus Armillaria and the currently accepted premise that vicariance accounts for the extant distribution of the species. Methods The phylogeny of a selection of Armillaria species was reconstructed based on Maximum Parsimony (MP), Maximum Likelihood (ML) and Bayesian Inference (BI). A timeline was then placed on the divergence of lineages using a Bayesian relaxed molecular clock approach. Results Phylogenetic analyses of sequenced data for three combined nuclear regions provided strong support for three major geographically defined clades: Holarctic, South American-Australasian and African. Molecular dating placed the initial radiation of the genus at 54 million years ago within the Early Paleogene, postdating the tectonic break-up of Gondwana. Conclusions The distribution of extant Armillaria species is the result of ancient long-distance dispersal rather than vicariance due to continental drift. As these findings are contrary to most prior vicariance hypotheses for fungi, our results highlight the important role of long-distance dispersal in the radiation of fungal pathogens from the Southern Hemisphere. PMID:22216099
Faurby, Søren; Svenning, Jens-Christian
2015-03-01
Across large clades, two problems are generally encountered in the estimation of species-level phylogenies: (a) the number of taxa involved is generally so high that computation-intensive approaches cannot readily be utilized and (b) even for clades that have received intense study (e.g., mammals), attention has been centered on relatively few selected species, and most taxa must therefore be positioned on the basis of very limited genetic data. Here, we describe a new heuristic-hierarchical Bayesian approach and use it to construct a species-level phylogeny for all extant and late Quaternary extinct mammals. In this approach, species with large quantities of genetic data are placed nearly freely in the mammalian phylogeny according to these data, whereas the placement of species with lower quantities of data is performed with steadily stricter restrictions for decreasing data quantities. The advantages of the proposed method include (a) an improved ability to incorporate phylogenetic uncertainty in downstream analyses based on the resulting phylogeny, (b) a reduced potential for long-branch attraction or other types of errors that place low-data taxa far from their true position, while maintaining minimal restrictions for better-studied taxa, and (c) likely improved placement of low-data taxa due to the use of closer outgroups. Copyright © 2014 Elsevier Inc. All rights reserved.
Wright, Jeremy J; David, Solomon R; Near, Thomas J
2012-06-01
Extant gars represent the remaining members of a formerly diverse assemblage of ancient ray-finned fishes and have been the subject of multiple phylogenetic analyses using morphological data. Here, we present the first hypothesis of phylogenetic relationships among living gar species based on molecular data, through the examination of gene tree heterogeneity and coalescent species tree analyses of a portion of one mitochondrial (COI) and seven nuclear (ENC1, myh6, plagl2, S7 ribosomal protein intron 1, sreb2, tbr1, and zic1) genes. Individual gene trees displayed varying degrees of resolution with regards to species-level relationships, and the gene trees inferred from COI and the S7 intron were the only two that were completely resolved. Coalescent species tree analyses of nuclear genes resulted in a well-resolved and strongly supported phylogenetic tree of living gar species, for which Bayesian posterior node support was further improved by the inclusion of the mitochondrial gene. Species-level relationships among gars inferred from our molecular data set were highly congruent with previously published morphological phylogenies, with the exception of the placement of two species, Lepisosteus osseus and L. platostomus. Re-examination of the character coding used by previous authors provided partial resolution of this topological discordance, resulting in broad concordance in the phylogenies inferred from individual genes, the coalescent species tree analysis, and morphology. The completely resolved phylogeny inferred from the molecular data set with strong Bayesian posterior support at all nodes provided insights into the potential for introgressive hybridization and patterns of allopatric speciation in the evolutionary history of living gars, as well as a solid foundation for future examinations of functional diversification and evolutionary stasis in a "living fossil" lineage. Copyright © 2012 Elsevier Inc. All rights reserved.
Mahony, Stephen; Foley, Nicole M; Biju, S D; Teeling, Emma C
2017-03-01
Molecular dating studies typically need fossils to calibrate the analyses. Unfortunately, the fossil record is extremely poor or presently nonexistent for many species groups, rendering such dating analysis difficult. One such group is the Asian horned frogs (Megophryinae). Sampling all generic nomina, we combined a novel ∼5 kb dataset composed of four nuclear and three mitochondrial gene fragments to produce a robust phylogeny, with an extensive external morphological study to produce a working taxonomy for the group. Expanding the molecular dataset to include out-groups of fossil-represented ancestral anuran families, we compared the priorless RelTime dating method with the widely used prior-based Bayesian timetree method, MCMCtree, utilizing a novel combination of fossil priors for anuran phylogenetic dating. The phylogeny was then subjected to ancestral phylogeographic analyses, and dating estimates were compared with likely biogeographic vicariant events. Phylogenetic analyses demonstrated that previously proposed systematic hypotheses were incorrect due to the paraphyly of genera. Molecular phylogenetic, morphological, and timetree results support the recognition of Megophryinae as a single genus, Megophrys, with a subgenus level classification. Timetree results using RelTime better corresponded with the known fossil record for the out-group anuran tree. For the priorless in-group, it also outperformed MCMCtree when node date estimates were compared with likely influential historical biogeographic events, providing novel insights into the evolutionary history of this pan-Asian anuran group. Given a relatively small molecular dataset, and limited prior knowledge, this study demonstrates that the computationally rapid RelTime dating tool may outperform more popular and complex prior reliant timetree methodologies. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. 
Model Diagnostics for Bayesian Networks
ERIC Educational Resources Information Center
Sinharay, Sandip
2006-01-01
Bayesian networks are frequently used in educational assessments, primarily for learning about students' knowledge and skills. Little work exists on assessing the fit of Bayesian networks. This article employs the posterior predictive model checking method, a popular Bayesian model checking tool, to assess the fit of simple Bayesian networks. A…
A guide to Bayesian model selection for ecologists
Hooten, Mevin B.; Hobbs, N.T.
2015-01-01
The steady upward trend in the use of model selection and Bayesian methods in ecological research has made it clear that both approaches to inference are important for modern analysis of models and data. However, in teaching Bayesian methods and in working with our research colleagues, we have noticed a general dissatisfaction with the available literature on Bayesian model selection and multimodel inference. Students and researchers new to Bayesian methods quickly find that the published advice on model selection is often preferential in its treatment of options for analysis, frequently advocating one particular method above others. The recent appearance of many articles and textbooks on Bayesian modeling has provided welcome background on relevant approaches to model selection in the Bayesian framework, but most of these are either very narrowly focused in scope or inaccessible to ecologists. Moreover, the methodological details of Bayesian model selection approaches are spread thinly throughout the literature, appearing in journals from many different fields. Our aim with this guide is to condense the large body of literature on Bayesian approaches to model selection and multimodel inference and present it specifically for quantitative ecologists as neutrally as possible. We also bring to light a few important and fundamental concepts relating directly to model selection that seem to have gone unnoticed in the ecological literature. Throughout, we provide only a minimal discussion of philosophy, preferring instead to examine the breadth of approaches as well as their practical advantages and disadvantages. This guide serves as a reference for ecologists using Bayesian methods, so that they can better understand their options and can make an informed choice that is best aligned with their goals for inference.
Hofman, Sebastian; Pabijan, Maciej; Osikowski, Artur; Litvinchuk, Spartak N; Szymura, Jacek M
2016-09-01
We present the full-length mitogenome sequences of four European water frog species: Pelophylax cypriensis, P. epeiroticus, P. kurtmuelleri and P. shqipericus. The mtDNA size varied from 17,363 to 17,895 bp, and its organization with the LPTF tRNA gene cluster preceding the 12S rRNA gene displayed the typical Neobatrachian arrangement. Maximum likelihood and Bayesian inference revealed a well-resolved mtDNA phylogeny of seven European Pelophylax species. The uncorrected p-distance among Pelophylax mitogenomes was 9.6 (range 0.01-0.13). Most divergent was the P. shqipericus mitogenome, clustering with the "P. lessonae" group, in contrast to the other three new Pelophylax mitogenomes related to the "P. bedriagae/ridibundus" lineage. The new mitogenomes resolve ambiguities of the phylogenetic placement of P. cretensis and P. epeiroticus.
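As a rough illustration of the uncorrected p-distance mentioned in the abstract above, the following minimal Python sketch computes the proportion of differing sites between two aligned sequences. The function name and the choice to skip gaps and ambiguous bases are assumptions made for illustration; the abstract does not say how the authors treated such sites.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected p-distance: fraction of compared sites that differ.

    Assumption (not from the abstract): sites where either sequence has a
    gap or an ambiguous base are skipped rather than counted.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in valid and b in valid:
            compared += 1
            if a != b:
                differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Toy sequences, not real Pelophylax data: 2 of 10 sites differ.
print(p_distance("ACGTACGTAC", "ACGTTCGTAA"))
```

No multiple-hit correction is applied, which is what "uncorrected" refers to; model-based distances would be larger for divergent sequences.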
Stephen, Alexa A; Leone, Angelique M; Toplon, David E; Archer, Linda L; Wellehan, James F X
2016-12-01
A juvenile female bald eagle (Haliaeetus leucocephalus) was presented with emaciation and proliferative periocular lesions. The eagle did not respond to supportive therapy and was euthanatized. Histopathologic examination of the skin lesions revealed plaques of marked epidermal hyperplasia, parakeratosis, marked acanthosis and spongiosis, and eosinophilic intracytoplasmic inclusion bodies. Novel polymerase chain reaction (PCR) assays were done to amplify and sequence DNA polymerase and rpo147 genes. The 4b gene was also analyzed by a previously developed assay. Bayesian and maximum likelihood phylogenetic analyses of the obtained sequences found the virus to be a poxvirus of the genus Avipoxvirus that clustered with other raptor isolates. Better phylogenetic resolution was found in rpo147 than in the commonly used DNA polymerase. The novel consensus rpo147 PCR assay will create more accurate phylogenetic trees and allow better insight into poxvirus history.
DNA barcoding and the identification of tree frogs (Amphibia: Anura: Rhacophoridae).
Dang, Ning-Xin; Sun, Feng-Hui; Lv, Yun-Yun; Zhao, Bo-Han; Wang, Ji-Chao; Murphy, Robert W; Wang, Wen-Zhi; Li, Jia-Tang
2016-07-01
The DNA barcoding gene COI (cytochrome c oxidase subunit I) effectively identifies many species. Herein, we barcoded 172 individuals from 37 species belonging to nine genera in Rhacophoridae to test if the gene serves equally well to identify species of tree frogs. Phenetic neighbor joining and phylogenetic Bayesian inference were used to construct phylogenetic trees, which resolved all nine genera as monophyletic taxa except for Rhacophorus, two new matrilines for Liuixalus, and Polypedates leucomystax species complex. Intraspecific genetic distances ranged from 0.000 to 0.119 and interspecific genetic distances ranged from 0.015 to 0.334. Within Rhacophorus and Kurixalus, the intra- and interspecific genetic distances did not reveal an obvious barcode gap. Notwithstanding, we found that COI sequences unambiguously identified rhacophorid species and helped to discover likely new cryptic species via the synthesis of genealogical relationships and divergence patterns. Our results supported that COI is an effective DNA barcoding marker for Rhacophoridae.
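The barcode-gap idea in the abstract above can be sketched in a few lines of Python. This is only an illustration under an assumed definition (gap = smallest interspecific distance minus largest intraspecific distance); the function name is hypothetical, and the inputs are the pooled ranges quoted in the abstract, not the underlying per-genus data.

```python
def barcode_gap(intraspecific, interspecific):
    """Return min(interspecific) - max(intraspecific).

    A positive value means a barcode gap exists; a negative value means
    the two distance distributions overlap.
    """
    return min(interspecific) - max(intraspecific)


# Pooled ranges quoted in the abstract (endpoints only).
gap = barcode_gap([0.000, 0.119], [0.015, 0.334])
print(gap < 0)  # True: the pooled ranges overlap, so there is no global gap
```

A negative pooled gap does not prevent identification of individual species, which is consistent with the abstract's report that COI still identified rhacophorid species.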
Expected time-invariant effects of biological traits on mammal species duration.
Smits, Peter D
2015-10-20
Determining which biological traits influence differences in extinction risk is vital for understanding the differential diversification of life and for making predictions about species' vulnerability to anthropogenic impacts. Here I present a hierarchical Bayesian survival model of North American Cenozoic mammal species durations in relation to species-level ecological factors, time of origination, and phylogenetic relationships. I find support for the survival of the unspecialized as a time-invariant generalization of trait-based extinction risk. Furthermore, I find that phylogenetic and temporal effects are both substantial factors associated with differences in species durations. Finally, I find that the estimated effects of these factors are partially incongruous with how these factors are correlated with extinction risk of the extant species. These findings parallel previous observations that background extinction is a poor predictor of mass extinction events and suggest that attention should be focused on mass extinctions to gain insight into modern species loss.
Phylogenetic Status and Timescale for the Diversification of Steno and Sotalia Dolphins
Cunha, Haydée A.; Moraes, Lucas C.; Medeiros, Bruna V.; Lailson-Brito, José; da Silva, Vera M. F.; Solé-Cava, Antonio M.; Schrago, Carlos G.
2011-01-01
Molecular data have provided many insights into cetacean evolution but some unsettled issues still remain. We estimated the topology and timing of cetacean evolutionary relationships using Bayesian and maximum likelihood analyses of complete mitochondrial genomes. In order to clarify the phylogenetic placement of Sotalia and Steno within the Delphinidae, we sequenced three new delphinid mitogenomes. Our analyses support three delphinid clades: one joining Steno and Sotalia (supporting the revised subfamily Stenoninae); another placing Sousa within the Delphininae; and a third, the Globicephalinae, which includes Globicephala, Feresa, Pseudorca, Peponocephala and Grampus. We also conclude that Orcinus does not belong in the Globicephalinae, but Orcaella may be part of that subfamily. Divergence dates were estimated using the relaxed molecular clock calibrated with fossil data. We hypothesise that the timing of separation of the marine and Amazonian Sotalia species (2.3 Ma) coincided with the establishment of the modern Amazon River basin. PMID:22163290
da Cruz, Marcos de O R; Weksler, Marcelo
2018-02-01
The use of genetic data and tree-based algorithms to delimit evolutionary lineages is becoming an important practice in taxonomic identification, especially in morphologically cryptic groups. The effects of different phylogenetic and/or coalescent models in the analyses of species delimitation, however, are not clear. In this paper, we assess the impact of different evolutionary priors in phylogenetic estimation, species delimitation, and molecular dating of the genus Oligoryzomys (Mammalia: Rodentia), a group with complex taxonomy and morphologically cryptic species. Phylogenetic and coalescent analyses included 20 of the 24 recognized species of the genus, comprising 416 Cytochrome b sequences, 26 Cytochrome c oxidase I sequences, and 27 Beta-Fibrinogen Intron 7 sequences. For species delimitation, we employed the General Mixed Yule Coalescent (GMYC) and Bayesian Poisson tree processes (bPTP) analyses, and contrasted 4 genealogical and phylogenetic models: Pure-birth (Yule), Constant Population Size Coalescent, Multiple Species Coalescent, and a mixed Yule-Coalescent model. GMYC analyses of trees from different genealogical models resulted in similar species delimitation and phylogenetic relationships, with incongruence restricted to areas of poor nodal support. bPTP results, however, significantly differed from GMYC for 5 taxa. Oligoryzomys early diversification was estimated to have occurred in the Early Pleistocene, between 0.7 and 2.6 MYA. The mixed Yule-Coalescent model, however, recovered younger dating estimates for Oligoryzomys diversification, and for the threshold for the speciation-coalescent horizon in GMYC. Eight of the 20 included Oligoryzomys species were identified as having two or more independent evolutionary units, indicating that current taxonomy of Oligoryzomys is still unsettled. Copyright © 2017 Elsevier Inc. All rights reserved.
A Comparison of the β-Substitution Method and a Bayesian Method for Analyzing Left-Censored Data.
Huynh, Tran; Quick, Harrison; Ramachandran, Gurumurthy; Banerjee, Sudipto; Stenzel, Mark; Sandler, Dale P; Engel, Lawrence S; Kwok, Richard K; Blair, Aaron; Stewart, Patricia A
2016-01-01
Classical statistical methods for analyzing exposure data with values below the detection limits are well described in the occupational hygiene literature, but an evaluation of a Bayesian approach for handling such data is currently lacking. Here, we first describe a Bayesian framework for analyzing censored data. We then present the results of a simulation study conducted to compare the β-substitution method with a Bayesian method for exposure datasets drawn from lognormal distributions and mixed lognormal distributions with varying sample sizes, geometric standard deviations (GSDs), and censoring for single and multiple limits of detection. For each set of factors, estimates for the arithmetic mean (AM), geometric mean, GSD, and the 95th percentile (X0.95) of the exposure distribution were obtained. We evaluated the performance of each method using relative bias, the root mean squared error (rMSE), and coverage (the proportion of the computed 95% uncertainty intervals containing the true value). The Bayesian method using non-informative priors and the β-substitution method were generally comparable in bias and rMSE when estimating the AM and GM. For the GSD and the 95th percentile, the Bayesian method with non-informative priors was more biased and had a higher rMSE than the β-substitution method, but use of more informative priors generally improved the Bayesian method's performance, making both the bias and the rMSE more comparable to the β-substitution method. An advantage of the Bayesian method is that it provided estimates of uncertainty for these parameters of interest and good coverage, whereas the β-substitution method only provided estimates of uncertainty for the AM, and coverage was not as consistent. Selection of one or the other method depends on the needs of the practitioner, the availability of prior information, and the distribution characteristics of the measurement data. 
We suggest the use of Bayesian methods if the practitioner has the computational resources and prior information, as the method would generally provide accurate estimates and also provides the distributions of all of the parameters, which could be useful for making decisions in some applications. © The Author 2015. Published by Oxford University Press on behalf of the British Occupational Hygiene Society.
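The three performance measures named in the abstract above (relative bias, rMSE, and coverage) can be written out as a short Python sketch. The function names and the exact formulas are common textbook definitions assumed here for illustration; the abstract does not give the authors' precise definitions, and the numbers below are toy values, not results from the study.

```python
import math


def relative_bias(estimates, true_value):
    """(mean of estimates - true value) / true value."""
    mean_est = sum(estimates) / len(estimates)
    return (mean_est - true_value) / true_value


def rmse(estimates, true_value):
    """Root mean squared error of the estimates around the true value."""
    sq_err = sum((e - true_value) ** 2 for e in estimates)
    return math.sqrt(sq_err / len(estimates))


def coverage(intervals, true_value):
    """Proportion of (lower, upper) intervals that contain the true value."""
    hits = sum(1 for lo, hi in intervals if lo <= true_value <= hi)
    return hits / len(intervals)


# Toy simulation output for a parameter whose true value is 10.
estimates = [9.0, 11.0, 10.0, 12.0]
intervals = [(8.0, 11.0), (10.5, 12.0), (9.0, 10.0)]
print(relative_bias(estimates, 10.0), rmse(estimates, 10.0), coverage(intervals, 10.0))
```

In a simulation study of this kind, each function would be applied once per combination of sample size, GSD, and censoring level, with one estimate and one interval per simulated dataset.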
2012-01-01
Background The Nymphaeales (waterlily and relatives) lineage has diverged as the second branch of basal angiosperms and comprises two families: Cabombaceae and Nymphaeaceae. The classification of Nymphaeales and phylogeny within the flowering plants are quite intriguing as several systems (Thorne system, Dahlgren system, Cronquist system, Takhtajan system and APG III system (Angiosperm Phylogeny Group III system)) have attempted to redefine the Nymphaeales taxonomy. There have also been fossil records consisting especially of seeds, pollen, stems, leaves and flowers as early as the lower Cretaceous. Here we present an in silico study of the order Nymphaeales taking maturaseK (matK) and internal transcribed spacer (ITS2) as biomarkers for phylogeny reconstruction (using character-based methods and Bayesian approach) and identification of motifs for DNA barcoding. Results The Maximum Likelihood (ML) and Bayesian approach yielded congruent fully resolved and well-supported trees using a concatenated (ITS2+ matK) supermatrix aligned dataset. The taxon sampling corroborates the monophyly of Cabombaceae. Nuphar emerges as a monophyletic clade in the family Nymphaeaceae while there are slight discrepancies in the monophyletic nature of the genus Nymphaea owing to Victoria-Euryale and Ondinea grouping in the same node of Nymphaeaceae. ITS2 secondary structure alignment corroborates the primary sequence analysis. Hydatellaceae emerged as a sister clade to Nymphaeaceae and had a basal lineage amongst the water lily clades. Species from Cycas and Ginkgo were taken as outgroups and were rooted in the overall tree topology from various methods. Conclusions MatK genes are fast evolving highly variant regions of plant chloroplast DNA that can serve as potential biomarkers for DNA barcoding and also in generating primers for angiosperms with identification of unique motif regions. We have reported unique genus specific motif regions in the Order Nymphaeales from matK dataset which can be further validated for barcoding and designing of PCR primers. Our analysis using a novel approach of sequence-structure alignment and phylogenetic reconstruction using molecular morphometrics are congruent with the current placement of Hydatellaceae within the early-divergent angiosperm order Nymphaeales. The results underscore the fact that more diverse genera, if not fully resolved to be monophyletic, should be represented by all major lineages. PMID:23282079
Haklová-Kočíková, Božena; Hižňanová, Adriana; Majláth, Igor; Račka, Karol; Harris, David James; Földvári, Gábor; Tryjanowski, Piotr; Kokošová, Natália; Malčeková, Beáta; Majláthová, Viktória
2014-12-10
Blood parasites of the genus Karyolysus Labbé, 1894 (Apicomplexa: Adeleida: Karyolysidae) represent the protozoan haemogregarines found in various genera of lizards, including Lacerta, Podarcis, Darevskia (Lacertidae) and Mabouia (Scincidae). The vectors of parasites are gamasid mites from the genus Ophionyssus. A total of 557 individuals of lacertid lizards were captured in four different localities in Europe (Hungary, Poland, Romania and Slovakia) and blood was collected. Samples were examined using both microscopic and molecular methods, and phylogenetic relationships of all isolates of Karyolysus sp. were assessed for the first time. Karyolysus sp. 18S rRNA isolates were evaluated using Bayesian and Maximum Likelihood analyses. A total of 520 blood smears were examined microscopically and unicellular protozoan parasites were found in 116 samples (22.3% prevalence). The presence of two Karyolysus species, K. latus and K. lacazei was identified. In total, of 210 samples tested by polymerase chain reaction (PCR), the presence of parasites was observed in 64 individuals (prevalence 30.5%). Results of phylogenetic analyses revealed the existence of four haplotypes, all part of the same lineage, with other parasites identified as belonging to the genus Hepatozoon. Classification of these parasites using current taxonomy is complex - they were identified in both mites and ticks that typically are considered to host Karyolysus and Hepatozoon respectively. Furthermore although distortions to the intermediate host erythrocyte nuclei were observed, the defining characteristic of Karyolysus, the haplotypes were nearly identical to those reported from lizards in the Iberian Peninsula, where such distortions were not reported and which were thus identified as Hepatozoon. Based on the phylogenetic analyses, neither vertebrate host, nor geographical patterns of the studied blood parasites could be established.
Duchêne, Sebastián; Duchêne, David; Holmes, Edward C; Ho, Simon Y W
2015-07-01
Rates and timescales of viral evolution can be estimated using phylogenetic analyses of time-structured molecular sequences. This involves the use of molecular-clock methods, calibrated by the sampling times of the viral sequences. However, the spread of these sampling times is not always sufficient to allow the substitution rate to be estimated accurately. We conducted Bayesian phylogenetic analyses of simulated virus data to evaluate the performance of the date-randomization test, which is sometimes used to investigate whether time-structured data sets have temporal signal. An estimate of the substitution rate passes this test if its mean does not fall within the 95% credible intervals of rate estimates obtained using replicate data sets in which the sampling times have been randomized. We find that the test sometimes fails to detect rate estimates from data with no temporal signal. This error can be minimized by using a more conservative criterion, whereby the 95% credible interval of the estimate with correct sampling times should not overlap with those obtained with randomized sampling times. We also investigated the behavior of the test when the sampling times are not uniformly distributed throughout the tree, which sometimes occurs in empirical data sets. The test performs poorly in these circumstances, such that a modification to the randomization scheme is needed. Finally, we illustrate the behavior of the test in analyses of nucleotide sequences of cereal yellow dwarf virus. Our results validate the use of the date-randomization test and allow us to propose guidelines for interpretation of its results. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.
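The two pass/fail criteria for the date-randomization test described in the abstract above can be sketched as follows. This is a minimal Python illustration of the decision rules only; the function names are hypothetical, the rate values are invented toy numbers, and the Bayesian analyses that would produce the estimates and credible intervals are assumed to have been run elsewhere.

```python
def passes_standard(mean_rate, randomized_intervals):
    """Standard criterion: the mean rate estimated with the correct sampling
    times lies outside every 95% credible interval from the randomized
    replicates."""
    return all(not (lo <= mean_rate <= hi) for lo, hi in randomized_intervals)


def passes_conservative(real_interval, randomized_intervals):
    """Conservative criterion: the 95% credible interval obtained with the
    correct sampling times overlaps none of the randomized intervals."""
    real_lo, real_hi = real_interval
    return all(real_hi < lo or hi < real_lo for lo, hi in randomized_intervals)


# Toy substitution rates (per site per year), not values from the study.
mean_rate = 1.0e-3
real_interval = (8.0e-4, 1.2e-3)
randomized = [(1.0e-5, 5.0e-4), (2.0e-5, 9.0e-4)]

print(passes_standard(mean_rate, randomized))          # True
print(passes_conservative(real_interval, randomized))  # False
```

The toy example shows why the conservative rule is stricter: the mean lies outside both randomized intervals, but the second randomized interval still overlaps the lower end of the real interval.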
Prenner, Gerhard; Cardoso, Domingos
2017-01-01
Background and Aims The study of floral morphology and ontogeny and the re-investigation of existing data help to uncover potential synapomorphic characters and foster our understanding of phylogenetic relationships that rely primarily on molecular analyses. Goniorrhachis marginata is a monotypic caesalpinioid legume (Leguminosae) that shows some interesting floral features, such as a long hypanthium and regular Rosaceae-like flowers. We studied the ontogeny and morphology of the flowers in detail and present our results in a broad phylogenetic context. Methods Flower buds were collected in the field, fixed in 70 % ethanol and investigated using scanning electron microscopy. Older buds in spirit were carefully opened to investigate the direction of style bending. Characters of the style from 131 taxa from the main legume lineages were analysed and mapped on a Bayesian molecular phylogeny. Key Results The tetramerous calyx is the result of complete loss of one sepal. The formation of the radially symmetrical corolla starts in a typical caesalpinioid pattern with the adaxial petal innermost (ascending aestivation). The young style bends in the abaxial direction, which is a character found exclusively in all studied detarioid legumes and therefore a newly described synapomorphy for the clade. Conclusions We show that investigation of unstudied taxa and reinvestigation of published data can uncover new, previously overlooked and important characters. Curvature of the style can be detected in young buds with a hand lens and therefore is an important character for field botanists. Our study reveals the importance of including poorly studied and/or phylogenetically enigmatic taxa in molecular phylogenies and in detailed morphological and ontogenetic analyses. PMID:28025284
Fraga, Aline Padilha de; Gräf, Tiago; Pereira, Cleiton Schneider; Ikuta, Nilo; Fonseca, André Salvador Kazantzi; Lunge, Vagner Ricardo
2018-07-01
Avian infectious bronchitis virus (IBV) is the etiological agent of a highly contagious disease, which results in severe economic losses to the poultry industry. The spike protein (S1 subunit) is responsible for the molecular diversity of the virus and many sero/genotypes are described around the world. Recently a new standardized classification of the IBV molecular diversity was conducted, based on phylogenetic analysis of the S1 gene sequences sampled worldwide. Brazil is one of the biggest poultry producers in the world and the present study aimed to review the molecular diversity and reconstruct the evolutionary history of IBV in the country. All IBV S1 gene sequences with location and year of collection information available on GenBank were retrieved. Phylogenetic analyses were carried out based on a maximum likelihood method for the classification of genotypes occurring in Brazil, according to the new classification. Bayesian phylogenetic analyses were performed with the Brazilian clade and related international sequences to determine the evolutionary history of IBV in Brazil. A total of 143 Brazilian sequences were classified as GI-11 and 46 as GI-1 (Mass). Within the GI-11 clade, we have identified a potential recombinant strain circulating in Brazil. Phylodynamic analysis demonstrated that IBV GI-11 lineage was introduced in Brazil in the 1950s (1951, 1917-1975 95% HPD) and population dynamics was mostly constant throughout the time. Despite the national vaccination protocols, our results show the widespread dissemination and maintenance of the IBV GI-11 lineage in Brazil and highlight the importance of continuous surveillance to evaluate the impact of currently used vaccine strains on the observed viral diversity of the country. Copyright © 2018 Elsevier B.V. All rights reserved.
Li, Juan; Zhu, Jin-long; Lou, Shi-di; Wang, Ping; Zhang, You-sen; Wang, Lin; Yin, Ruo-chun; Zhang, Ping-ping
2018-01-01
Coptotermes suzhouensis (Isoptera: Rhinotermitidae) is a significant subterranean termite pest of wooden structures and is widely distributed in southeastern China. The complete mitochondrial DNA sequence of C. suzhouensis was analyzed in this study. The mitogenome was a circular molecule of 15,764 bp in length, which contained 13 protein-coding genes (PCGs), 22 transfer RNA genes, two ribosomal RNA genes, and an A+T-rich region with a gene arrangement typical of Isoptera mitogenomes. All PCGs were initiated by ATN codons and terminated by complete termination codons (TAA), except COX2, ND5, and Cytb, which ended with an incomplete termination codon T. All tRNAs displayed a typical clover-leaf structure, except for tRNASer(AGN), which did not contain the stem-loop structure in the DHU arm. The A+T content (69.23%) of the A+T-rich region (949 bp) was higher than that of the entire mitogenome (65.60%), and two different sets of repeat units (A+B) were distributed in this region. Comparison of complete mitogenome sequences with those of Coptotermes formosanus indicated that the two taxa have very high genetic similarity. Forty-one representative termite species were used to construct phylogenetic trees by maximum likelihood, maximum parsimony, and Bayesian inference methods. The phylogenetic analyses also strongly supported (BPP, MLBP, and MPBP = 100%) that all C. suzhouensis and C. formosanus samples gathered into one clade with genetic distances between 0.000 and 0.002. This study provides molecular evidence for a more robust phylogenetic position of C. suzhouensis and infers that C. suzhouensis is a synonym of C. formosanus. PMID:29718488
Humphreys-Pereira, Danny A; Elling, Axel A
2014-01-01
Root-knot nematodes (Meloidogyne spp.) are among the most important plant pathogens. In this study, the mitochondrial (mt) genomes of the root-knot nematodes M. chitwoodi and M. incognita were sequenced. PCR analyses suggest that both mt genomes are circular, with an estimated size of 19.7 and 18.6-19.1 kb, respectively. The mt genomes each contain a large non-coding region with tandem repeats and the control region. The mt gene arrangement of M. chitwoodi and M. incognita is unlike that of other nematodes. Sequence alignments of the two Meloidogyne mt genomes showed three translocations; two in transfer RNAs and one in cox2. Compared with other nematode mt genomes, the gene arrangement of M. chitwoodi and M. incognita was most similar to Pratylenchus vulnus. Phylogenetic analyses (Maximum Likelihood and Bayesian inference) were conducted using 78 complete mt genomes of diverse nematode species. Analyses based on nucleotides and amino acids of the 12 protein-coding mt genes showed strong support for the monophyly of class Chromadorea, but only amino acid-based analyses supported the monophyly of class Enoplea. The suborder Spirurina was not monophyletic in any of the phylogenetic analyses, contradicting the Clade III model, which groups Ascaridomorpha, Spiruromorpha and Oxyuridomorpha based on the small subunit ribosomal RNA gene. Importantly, comparisons of mt gene arrangement and tree-based methods placed Meloidogyne as the sister taxon of Pratylenchus, a migratory plant endoparasitic nematode, and not with the sedentary endoparasitic Heterodera. Thus, comparative analyses of mt genomes suggest that sedentary endoparasitism in Meloidogyne and Heterodera is based on convergent evolution. Copyright © 2014 Elsevier B.V. All rights reserved.
Kaden, Jennifer; Joshi, Jyoti; Bhattarai, Susmita; Kusi, Naresh; Sillero-Zubiri, Claudio; Macdonald, David W.
2017-01-01
Wolves in the Himalayan region form a monophyletic lineage distinct from the present-day Holarctic grey wolf Canis lupus spp. (Linnaeus 1758) found across Eurasia and North America. Here, we analyse phylogenetic relationships and the geographic distribution of mitochondrial DNA haplotypes of the contemporary Himalayan wolf (proposed in previous studies as Canis himalayensis) found in Central Asia. We combine genetic data from a living Himalayan wolf population collected in northwestern Nepal in this study with already published genetic data, and confirm the Himalayan wolf lineage based on mitochondrial genomic data (508 bp cytochrome b and 242 bp D-loop), and X- and Y-linked zinc-finger protein gene (ZFX and ZFY) sequences. We then compare the genetic profile of the Himalayan wolf lineage found in northwestern Nepal with canid reference sequences from around the globe with maximum likelihood and Bayesian phylogeny building methods to demonstrate that the Himalayan wolf forms a distinct monophyletic clade supported by posterior probabilities/bootstrap for D-loop of greater than 0.92/85 and cytochrome b greater than 0.99/93. The Himalayan wolf shows a unique Y-chromosome (ZFY) haplotype, and shares an X-chromosome haplotype (ZFX) with the newly postulated African wolf. Our results imply that the Himalayan wolf distribution range extends from the Himalayan range north across the Tibetan Plateau up to the Qinghai Lakes region in Qinghai Province in the People's Republic of China. Based on its phylogenetic distinction and its older age of divergence relative to the Holarctic grey wolf, the Himalayan wolf merits formal classification as a distinct taxon of special conservation concern. PMID:28680672
Reviewing the History of HIV-1: Spread of Subtype B in the Americas
Junqueira, Dennis Maletich; de Medeiros, Rúbia Marília; Matte, Maria Cristina Cotta; Araújo, Leonardo Augusto Luvison; Chies, Jose Artur Bogo; Ashton-Prolla, Patricia; Almeida, Sabrina Esteves de Matos
2011-01-01
The dispersal of HIV-1 subtype B (HIV-1B) is a reflection of the movement of human populations in response to social, political, and geographical issues. The initial dissemination of HIV-1B outside Africa seems to have included the passive involvement of human populations from the Caribbean in spreading the virus to the United States. However, the exact pathways taken during the establishment of the pandemic in the Americas remain unclear. Here, we propose a geographical scenario for the dissemination of HIV-1B in the Americas, based on phylogenetic and genetic statistical analyses of 313 available sequences of the pol gene from 27 countries. Maximum likelihood and Bayesian inference methods were used to explore the phylogenetic relationships between HIV-1B sequences, and molecular variance estimates were analyzed to infer the genetic structure of the viral population. We found that the initial dissemination and subsequent spread of subtype B in the Americas occurred via a single introduction event in the Caribbean around 1964 (1950–1967). Phylogenetic trees present evidence of several primary outbreaks in countries in South America, directly seeded by the Caribbean epidemic. Cuba is an exception insofar as its epidemic seems to have been introduced from South America. One clade comprising isolates from different countries emerged in the most-derived branches, reflecting the intense circulation of the virus throughout the American continents. Statistical analysis supports the genetic compartmentalization of the virus among the Americas, with a close relationship between the South American and Caribbean epidemics. These findings reflect the complex establishment of the HIV-1B pandemic and contribute to our understanding of the relationship between the migration process of human populations and virus diffusion. PMID:22132104
Sessa, Emily B; Juslén, Aino; Väre, Henry; Chambers, Sally M
2017-03-01
Our goal was to infer the phylogenetic relationships and historical biogeography of the genus Dryopteris with a focus on taxa in sub-Saharan Africa and neighboring islands. In general, little is known about the relationships between African fern species and their congeners in other geographic regions, and our aim was to determine whether the sub-Saharan African species of Dryopteris are monophyletic and evolved within Africa or arrived there via repeated dispersals into Africa from other regions. We obtained sequence data for five chloroplast markers from 214 species of Dryopteris and 18 outgroups. We performed phylogenetic and molecular dating analyses using a Bayesian relaxed clock method in BEAST with fossil and secondary calibration points and estimated ancestral ranges for the genus globally by comparing multiple models in BioGeoBEARS. We found that 22 of 27 accessions of sub-Saharan African Dryopteris belong to a large clade of 31 accessions that also includes taxa from Indian and Atlantic Ocean islands. Additional accessions of taxa from our regions of interest have Asian, Hawaiian, European, or North American species as their closest relatives. The majority of sub-Saharan African Dryopteris species are descended from a shared common ancestor that dispersed to Africa from Asia approximately 10 Ma. There have been subsequent dispersal events from the African mainland to islands in the Atlantic and Indian Oceans, including Madagascar. Several additional species are estimated to have descended from ancestors that reached Africa via separate events over the last roughly 20 million years. © 2017 Sessa et al. Published by the Botanical Society of America. This work is licensed under a Creative Commons Attribution License (CC-BY-NC).
Physalis and physaloids: A recent and complex evolutionary history.
Zamora-Tavares, María Del Pilar; Martínez, Mahinda; Magallón, Susana; Guzmán-Dávalos, Laura; Vargas-Ponce, Ofelia
2016-07-01
The complex evolutionary history of the subtribe Physalinae is reflected in the poor resolution of the relationships of Physalis and the physaloid genera. We hypothesize that this low resolution is caused by recent evolutionary history in a complex geographic setting. The aims of this study were twofold: (1) To determine the phylogenetic relationships of the current genera recognized in Physalinae in order to identify monophyletic groups and resolve the physaloid grade; and (2) to determine the probable causes of the recent divergence in Physalinae. We conducted phylogenetic analyses with maximum likelihood (ML) and Bayesian inference with 50 Physalinae species and 19 others as outgroups, using morphological and molecular data from five plastid and two nuclear regions. A relaxed molecular clock was obtained from the ML topology and ancestral area reconstruction was conducted using the DEC model. The genera Chamaesaracha, Leucophysalis, and Physalis subgenus Rydbergis were recovered as monophyletic. Three clades, Alkekengi-Calliphysalis, Schraderanthus-Tzeltalia, and Witheringia-Brachistus, also received good support. However, even with morphological data and that of the DNA of seven regions, the tree was not completely resolved and many clades remained unsupported. Physalinae diverged at the end of the Miocene (∼9.22Mya) with one trend indicating that the greatest diversification within the subtribe occurred during the last 5My. The Neotropical region presented the highest probability (45%) of being the ancestral area of Physalinae followed by the Mexican Transition Zone (35%). During the Pliocene and Pleistocene, the geographical areas where species were found experienced significant geological and climatic changes, giving rise to rapid and relatively recent diversification events in Physalinae. 
Thus, recent origin, high diversification, and morphological complexity have contributed, at least with the currently available methods, to the inability to completely disentangle the phylogenetic relationships of Physalinae. Copyright © 2016 Elsevier Inc. All rights reserved.
Molecular phylogenetics and biogeography of the Neotropical redstarts (Myioborus; Aves, Parulinae).
Pérez-Emán, Jorge L
2005-11-01
Montane areas in the Neotropics are characterized by high diversity and endemism of birds and other groups. The avian genus Myioborus (Parulinae) is a group of insectivorous warblers, characteristic of cloud forests, that represents one of the few Parulinae genera (New World warblers) that has radiated substantially in South America. The genus is distributed throughout most montane regions from the southwestern United States to northern Argentina. Here, I use mitochondrial sequences from the cytochrome b, ND2, and ND3 genes to present the first hypothesis of phylogenetic relationship among all Myioborus species level taxa. Phylogenetic reconstructions based on maximum parsimony, maximum likelihood, and Bayesian methods produced similar results and suggest a northern origin for the genus Myioborus with subsequent colonization of the Neotropical Montane Region. The lower-montane species, M. miniatus, is the sister taxon to a clade in which all taxa occupy upper-montane habitats. These "highland" taxa diverged early in the history of the genus and produced two well-defined monophyletic lineages, a Central-northern Andean clade formed by M. albifrons, M. ornatus, and M. melanocephalus, and a Pantepui (table-mountains of southern Venezuela, northern Brazil, and western Guyana) clade consisting of M. castaneocapillus, M. albifacies, and M. cardonai, and probably M. pariae. M. brunniceps, M. flavivertex, and M. torquatus were included in this upper-montane clade but without clear relationships to other taxa. Lack of resolution of nodes defining the upper-montane species clade is likely to result from a period of rapid diversification mediated by geological and climatic events during the Late Pliocene. These results suggest that an interplay of dispersal and vicariance has shaped the current biogeographic patterns of Myioborus.
Mendes, Joana; Harris, D James; Carranza, Salvador; Salvi, Daniele
2016-07-01
Estimating the phylogeny of lacertid lizards, and particularly the tribe Lacertini has been challenging, possibly due to the fast radiation of this group resulting in a hard polytomy. However this is still an open question, as concatenated data primarily from mitochondrial markers have been used so far whereas in a recent phylogeny based on a compilation of these data within a squamate supermatrix the basal polytomy seems to be resolved. In this study, we estimate phylogenetic relationships between all Lacertini genera using for the first time DNA sequences from five fast evolving nuclear genes (acm4, mc1r, pdc, βfib and reln) and two mitochondrial genes (nd4 and 12S). We generated a total of 529 sequences from 88 species and used Maximum Likelihood and Bayesian Inference methods based on concatenated multilocus dataset as well as a coalescent-based species tree approach with the aim of (i) shedding light on the basal relationships of Lacertini (ii) assessing the monophyly of genera which were previously questioned, and (iii) discussing differences between estimates from this and previous studies based on different markers, and phylogenetic methods. Results uncovered (i) a new phylogenetic clade formed by the monotypic genera Archaeolacerta, Zootoca, Teira and Scelarcis; and (ii) support for the monophyly of the Algyroides clade, with two sister species pairs represented by western (A. marchi and A. fitzingeri) and eastern (A. nigropunctatus and A. moreoticus) lineages. In both cases the members of these groups show peculiar morphology and very different geographical distributions, suggesting that they are relictual groups that were once diverse and widespread. They probably originated about 11-13 million years ago during early events of speciation in the tribe, and the split between their members is estimated to be only slightly older. 
This scenario may explain why mitochondrial markers (possibly saturated at higher divergence levels) or slower nuclear markers used in previous studies (likely lacking enough phylogenetic signal) failed to recover these relationships. Finally, the phylogenetic position of most remaining genera was unresolved, corroborating the hypothesis of a hard polytomy in the Lacertini phylogeny due to a fast radiation. This is in agreement with all previous studies but in sharp contrast with a recent squamate megaphylogeny. We show that the supermatrix approach may provide high support for incorrect nodes that are not supported either by original sequence data or by new data from this study. This finding suggests caution when using megaphylogenies to integrate inter-generic relationships in comparative ecological and evolutionary studies. Copyright © 2016 Elsevier Inc. All rights reserved.
2016-01-01
The Fayum Depression of Egypt has yielded fossils of hystricognathous rodents from multiple Eocene and Oligocene horizons that range in age from ∼37 to ∼30 Ma and document several phases in the early evolution of crown Hystricognathi and one of its major subclades, Phiomorpha. Here we describe two new genera and species of basal phiomorphs, Birkamys korai and Mubhammys vadumensis, based on rostra and maxillary and mandibular remains from the terminal Eocene (∼34 Ma) Fayum Locality 41 (L-41). Birkamys is the smallest known Paleogene hystricognath, has very simple molars, and, like derived Oligocene-to-Recent phiomorphs (but unlike contemporaneous and older taxa) apparently retained dP4∕4 late into life, with no evidence for P4∕4 eruption or formation. Mubhammys is very similar in dental morphology to Birkamys, and also shows no evidence for P4∕4 formation or eruption, but is considerably larger. Though parsimony analysis with all characters equally weighted places Birkamys and Mubhammys as sister taxa of extant Thryonomys to the exclusion of much younger relatives of that genus, all other methods (standard Bayesian inference, Bayesian “tip-dating,” and parsimony analysis with scaled transitions between “fixed” and polymorphic states) place these species in more basal positions within Hystricognathi, as sister taxa of Oligocene-to-Recent phiomorphs. We also employ tip-dating as a means for estimating the ages of early hystricognath-bearing localities, many of which are not well-constrained by geological, geochronological, or biostratigraphic evidence. By simultaneously taking into account phylogeny, evolutionary rates, and uniform priors that appropriately encompass the range of possible ages for fossil localities, dating of tips in this Bayesian framework allows paleontologists to move beyond vague and assumption-laden “stage of evolution” arguments in biochronology to provide relatively rigorous age assessments of poorly-constrained faunas. 
This approach should become increasingly robust as estimates are combined from multiple independent analyses of distantly related clades, and is broadly applicable across the tree of life; as such it is deserving of paleontologists’ close attention. Notably, in the example provided here, hystricognathous rodents from Libya and Namibia that are controversially considered to be of middle Eocene age are instead estimated to be of late Eocene and late Oligocene age, respectively. Finally, we reconstruct the evolution of first lower molar size among Paleogene African hystricognaths using a Bayesian approach; the results of this analysis reconstruct a rapid latest Eocene dwarfing event along the lineage leading to Birkamys. PMID:26966657
Du, Yuanwei; Guo, Yubin
2015-01-01
The intrinsic mechanism of multimorbidity is difficult to recognize, which in turn makes prediction and diagnosis difficult. Bayesian networks can help to diagnose multimorbidity in health care, but the conditional probability table (CPT) is difficult to obtain because clinical statistical data are lacking. Today, expert knowledge and experience are increasingly used in training Bayesian networks to help predict or diagnose diseases, but the resulting CPTs are often irrational or ineffective because they ignore realistic constraints, especially in multimorbidity. To solve these problems, an evidence reasoning (ER) approach is employed to extract and fuse inference data from experts using a belief distribution and a recursive ER algorithm; on this basis, an ER method for constructing conditional probability tables in a Bayesian network of multimorbidity is presented step by step. A numerical multimorbidity example is used to demonstrate the method and show its feasibility and applicability. The Bayesian network can be determined as long as each expert provides an inference assessment according to his/her knowledge or experience. Our method extracts expert inference data more accurately than existing methods and fuses them effectively to construct CPTs in a Bayesian network of multimorbidity.
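The idea of fusing expert belief distributions into a CPT can be illustrated with a minimal sketch. It does not implement the recursive ER algorithm named in the abstract; it substitutes a much simpler weighted linear opinion pool, and the function names, the dictionary layout, and the expert weights are all hypothetical choices made for illustration.

```python
def pool_beliefs(expert_beliefs, weights):
    """Weighted linear pool of belief distributions (NOT the recursive ER algorithm).

    expert_beliefs: list of dicts mapping each child state to one expert's belief.
    weights: one non-negative reliability weight per expert (hypothetical values).
    """
    total_weight = sum(weights)
    states = list(expert_beliefs[0].keys())
    pooled = {
        state: sum(w * b[state] for w, b in zip(weights, expert_beliefs)) / total_weight
        for state in states
    }
    # Renormalise so that each CPT row sums to one even if experts' beliefs did not.
    norm = sum(pooled.values())
    return {state: p / norm for state, p in pooled.items()}


def build_cpt(assessments, weights):
    """Build one CPT row per parent configuration from the experts' assessments.

    assessments: dict mapping a parent configuration to a list of belief dicts,
    one per expert, in the same order as `weights`.
    """
    return {config: pool_beliefs(beliefs, weights) for config, beliefs in assessments.items()}
```

Each key of the returned dictionary is one parent configuration and each value is a normalised distribution over the child node's states, which is the shape a CPT row needs; how the beliefs are actually combined is the part the ER approach would replace.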
Molitor, John
2012-03-01
Bayesian methods have seen an increase in popularity in a wide variety of scientific fields, including epidemiology. One of the main reasons for their widespread application is the power of the Markov chain Monte Carlo (MCMC) techniques generally used to fit these models. As a result, researchers often implicitly associate Bayesian models with MCMC estimation procedures. However, Bayesian models do not always require Markov-chain-based methods for parameter estimation. This is important, as MCMC estimation methods, while generally quite powerful, are complex and computationally expensive and suffer from convergence problems related to the manner in which they generate correlated samples used to estimate probability distributions for parameters of interest. In this issue of the Journal, Cole et al. (Am J Epidemiol. 2012;175(5):368-375) present an interesting paper that discusses non-Markov-chain-based approaches to fitting Bayesian models. These methods, though limited, can overcome some of the problems associated with MCMC techniques and promise to provide simpler approaches to fitting Bayesian models. Applied researchers will find these estimation approaches intuitively appealing and will gain a deeper understanding of Bayesian models through their use. However, readers should be aware that other non-Markov-chain-based methods are currently in active development and have been widely published in other fields.
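As a hedged illustration of fitting a Bayesian model without Markov chains, the sketch below evaluates the posterior of a binomial proportion on a regular grid. This is one simple non-MCMC technique and is not necessarily among the approaches Cole et al. discuss; the Beta prior parameters, grid size, and function names are assumptions made for the example.

```python
def grid_posterior(successes, trials, prior_a=1.0, prior_b=1.0, n_grid=2001):
    """Posterior of a binomial proportion on a midpoint grid (no MCMC involved)."""
    step = 1.0 / n_grid
    # Midpoints keep the grid strictly inside (0, 1).
    grid = [(i + 0.5) * step for i in range(n_grid)]
    # Unnormalised posterior: Beta(prior_a, prior_b) kernel times binomial likelihood kernel.
    weights = [
        p ** (prior_a - 1 + successes) * (1 - p) ** (prior_b - 1 + trials - successes)
        for p in grid
    ]
    total = sum(weights)
    return grid, [w / total for w in weights]


def summarize(grid, probs, level=0.95):
    """Posterior mean and equal-tailed credible interval from the grid weights."""
    mean = sum(p * w for p, w in zip(grid, probs))
    lower_target = (1 - level) / 2
    upper_target = 1 - lower_target
    cumulative, lower, upper = 0.0, grid[0], grid[-1]
    found_lower = False
    for p, w in zip(grid, probs):
        cumulative += w
        if not found_lower and cumulative >= lower_target:
            lower, found_lower = p, True
        if cumulative >= upper_target:
            upper = p
            break
    return mean, (lower, upper)
```

For this conjugate case the posterior mean is also available in closed form, (prior_a + successes) / (prior_a + prior_b + trials), which gives a direct check on the grid result.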
Kang, Seokha; Sultana, Tahera; Eom, Keeseon S; Park, Yung Chul; Soonthornpong, Nathan; Nadler, Steven A; Park, Joong-Ki
2009-01-15
The complete mitochondrial genome sequence was determined for the human pinworm Enterobius vermicularis (Oxyurida: Nematoda) and used to infer its phylogenetic relationship to other major groups of chromadorean nematodes. The E. vermicularis genome is a 14,010-bp circular DNA molecule that encodes 36 genes (12 proteins, 22 tRNAs, and 2 rRNAs). This mtDNA genome lacks atp8, as reported for almost all other nematode species investigated. Phylogenetic analyses (maximum parsimony, maximum likelihood, neighbor joining, and Bayesian inference) of nucleotide sequences for the 12 protein-coding genes of 25 nematode species placed E. vermicularis, a representative of the order Oxyurida, as sister to the main Ascaridida+Rhabditida group. Tree topology comparisons using statistical tests rejected an alternative hypothesis favoring a closer relationship among Ascaridida, Spirurida, and Oxyurida, which has been supported by most studies based on nuclear ribosomal DNA sequences. Unlike the relatively conserved gene arrangement found for most chromadorean taxa, the E. vermicularis mtDNA gene order is unique, sharing no similarity with that of any other nematode species reported to date. This lack of gene order similarity may represent idiosyncratic gene rearrangements unique to this specific lineage of the oxyurids. To more fully understand the extent of gene rearrangement and its evolutionary significance within the nematode phylogenetic framework, additional mitochondrial genomes representing a greater evolutionary diversity of species must be characterized.
Shaw, A Jonathan; Cox, Cymon J; Buck, William R; Devos, Nicolas; Buchanan, Alex M; Cave, Lynette; Seppelt, Rodney; Shaw, Blanka; Larraín, Juan; Andrus, Richard; Greilhuber, Johann; Temsch, Eva M
2010-09-01
The Sphagnopsida, an early-diverging lineage of mosses (phylum Bryophyta), are morphologically and ecologically unique and have profound impacts on global climate. The Sphagnopsida are currently classified in two genera, Sphagnum (peat mosses) with some 350-500 species and Ambuchanania with one species. An analysis of phylogenetic relationships among species and genera in the Sphagnopsida was conducted to resolve major lineages and relationships among species within the Sphagnopsida. Phylogenetic analyses of nucleotide sequences from the nuclear, plastid, and mitochondrial genomes (11 704 nucleotides total) were conducted and analyzed using maximum likelihood and Bayesian inference employing seven different substitution models of varying complexity. Phylogenetic analyses resolved three lineages within the Sphagnopsida: (1) Sphagnum sericeum, (2) S. inretortum plus Ambuchanania leucobryoides, and (3) all remaining species of Sphagnum. Sister group relationships among these three clades could not be resolved, but the phylogenetic results indicate that the highly divergent morphology of A. leucobryoides is derived within the Sphagnopsida rather than plesiomorphic. A new classification is proposed for class Sphagnopsida, with one order (Sphagnales), three families, and four genera. The Sphagnopsida are an old lineage within the phylum Bryophyta, but the extant species of Sphagnum represent a relatively recent radiation. It is likely that additional species critical to understanding the evolution of peat mosses await discovery, especially in the southern hemisphere.
Phylogenetic Analysis and Epidemic History of Hepatitis C Virus Genotype 2 in Tunisia, North Africa
Rajhi, Mouna; Ghedira, Kais; Chouikha, Anissa; Djebbi, Ahlem; Cheikh, Imed; Ben Yahia, Ahlem; Sadraoui, Amel; Hammami, Walid; Azouz, Msaddek; Ben Mami, Nabil; Triki, Henda
2016-01-01
HCV genotype 2 (HCV-2) has a worldwide distribution with prevalence rates that vary from country to country. High genetic diversity and long-term endemicity were suggested in West African countries. A global dispersal of HCV-2 would have occurred during the 20th century, especially in European countries. In Tunisia, genotype 2 was the second most prevalent genotype after genotype 1, and most isolates belong to subtypes 2c and 2k. In this study, phylogenetic analyses based on the NS5B genomic sequences of 113 Tunisian HCV isolates from subtypes 2c and 2k were carried out. A Bayesian coalescent-based framework was used to estimate the origin and the spread of these subtypes circulating in Tunisia. Phylogenetic analyses of HCV-2c sequences suggest the absence of country-specific or time-specific variants. In contrast, the phylogenetic grouping of HCV-2k sequences shows the existence of two major genetic clusters that may represent two distinct circulating variants. Coalescent analysis indicated a most recent common ancestor (tMRCA) of Tunisian HCV-2c around 1886 (1869–1902) before the introduction of HCV-2k in 1901 (1867–1931). Our findings suggest that the introduction of HCV-2c in Tunisia is possibly a result of population movements between Tunisia and European populations following the French colonization. PMID:27100294
A Bayesian approach to meta-analysis of plant pathology studies.
Mila, A L; Ngugi, H K
2011-01-01
Bayesian statistical methods are used for meta-analysis in many disciplines, including medicine, molecular biology, and engineering, but have not yet been applied for quantitative synthesis of plant pathology studies. In this paper, we illustrate the key concepts of Bayesian statistics and outline the differences between Bayesian and classical (frequentist) methods in the way parameters describing population attributes are considered. We then describe a Bayesian approach to meta-analysis and present a plant pathological example based on studies evaluating the efficacy of plant protection products that induce systemic acquired resistance for the management of fire blight of apple. In a simple random-effects model assuming a normal distribution of effect sizes and no prior information (i.e., a noninformative prior), the results of the Bayesian meta-analysis are similar to those obtained with classical methods. Implementing the same model with a Student's t distribution and a noninformative prior for the effect sizes, instead of a normal distribution, yields similar results for all but acibenzolar-S-methyl (Actigard) which was evaluated only in seven studies in this example. Whereas both the classical (P = 0.28) and the Bayesian analysis with a noninformative prior (95% credibility interval [CRI] for the log response ratio: -0.63 to 0.08) indicate a nonsignificant effect for Actigard, specifying a t distribution resulted in a significant, albeit variable, effect for this product (CRI: -0.73 to -0.10). These results confirm the sensitivity of the analytical outcome (i.e., the posterior distribution) to the choice of prior in Bayesian meta-analyses involving a limited number of studies. We review some pertinent literature on more advanced topics, including modeling of among-study heterogeneity, publication bias, analyses involving a limited number of studies, and methods for dealing with missing data, and show how these issues can be approached in a Bayesian framework. 
Bayesian meta-analysis can readily include information not easily incorporated in classical methods, and allow for a full evaluation of competing models. Given the power and flexibility of Bayesian methods, we expect them to become widely adopted for meta-analysis of plant pathology studies.
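A random-effects model with normally distributed effect sizes, as described above, can be sketched without MCMC by integrating over the between-study standard deviation on a grid. The sketch assumes a flat prior on the overall mean and a uniform prior on the between-study standard deviation over [0, tau_max]; these priors, the grid settings, and the function name are illustrative assumptions, not the authors' implementation.

```python
import math


def meta_posterior(effects, std_errs, tau_max=2.0, n_grid=400):
    """Normal-normal random-effects meta-analysis via a grid over tau.

    effects: observed study effect sizes (e.g. log response ratios).
    std_errs: their standard errors, treated as known.
    Returns (posterior mean of mu, posterior sd of mu, posterior mean of tau).
    """
    taus = [tau_max * (i + 0.5) / n_grid for i in range(n_grid)]
    log_post, mu_hats, v_mus = [], [], []
    for tau in taus:
        variances = [s * s + tau * tau for s in std_errs]
        precisions = [1.0 / v for v in variances]
        v_mu = 1.0 / sum(precisions)  # Var(mu | tau, data)
        mu_hat = v_mu * sum(w * y for w, y in zip(precisions, effects))  # E(mu | tau, data)
        # Log marginal posterior of tau (up to a constant) with mu integrated out.
        lp = 0.5 * math.log(v_mu)
        for y, v in zip(effects, variances):
            lp -= 0.5 * math.log(v) + (y - mu_hat) ** 2 / (2.0 * v)
        log_post.append(lp)
        mu_hats.append(mu_hat)
        v_mus.append(v_mu)
    # Normalise on the log scale to avoid underflow.
    peak = max(log_post)
    weights = [math.exp(lp - peak) for lp in log_post]
    total = sum(weights)
    probs = [w / total for w in weights]
    mu_mean = sum(p * m for p, m in zip(probs, mu_hats))
    # Law of total variance: E[Var(mu | tau)] + Var(E[mu | tau]).
    mu_var = sum(p * (v + m * m) for p, v, m in zip(probs, v_mus, mu_hats)) - mu_mean ** 2
    tau_mean = sum(p * t for p, t in zip(probs, taus))
    return mu_mean, math.sqrt(mu_var), tau_mean
```

With only a handful of studies the posterior for tau is wide, so the choice of tau_max and of the prior on tau can move the result noticeably, which is the sensitivity the abstract reports for products evaluated in few studies.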
2017-09-01
efficacy of statistical post-processing methods downstream of these dynamical model components with a hierarchical multivariate Bayesian approach to... Bayesian hierarchical modeling, Markov chain Monte Carlo methods, Metropolis algorithm, machine learning, atmospheric prediction... scale processes. However, this dissertation explores the efficacy of statistical post-processing methods downstream of these dynamical model components
Smith, Geoff M.; Hutson, Jarod M.; Kindler, Lutz; Garcia-Moreno, Alejandro; Villaluenga, Aritza; Turner, Elaine
2017-01-01
Background Ancient protein sequences are increasingly used to elucidate the phylogenetic relationships between extinct and extant mammalian taxa. Here, we apply these recent developments to Middle Pleistocene bone specimens of the rhinoceros genus Stephanorhinus. No biomolecular sequence data is currently available for this genus, leaving phylogenetic hypotheses on its evolutionary relationships to extant and extinct rhinoceroses untested. Furthermore, recent phylogenies based on Rhinocerotidae (partial or complete) mitochondrial DNA sequences differ in the placement of the Sumatran rhinoceros (Dicerorhinus sumatrensis). Therefore, studies utilising ancient protein sequences from Middle Pleistocene contexts have the potential to provide further insights into the phylogenetic relationships between extant and extinct species, including Stephanorhinus and Dicerorhinus. Methods ZooMS screening (zooarchaeology by mass spectrometry) was performed on several Late and Middle Pleistocene specimens from the genus Stephanorhinus, subsequently followed by liquid chromatography-tandem mass spectrometry (LC-MS/MS) to obtain ancient protein sequences from a Middle Pleistocene Stephanorhinus specimen. We performed parallel analysis on a Late Pleistocene woolly rhinoceros specimen and extant species of rhinoceroses, resulting in the availability of protein sequence data for five extant species and two extinct genera. Phylogenetic analysis additionally included all extant Perissodactyla genera (Equus, Tapirus), and was conducted using Bayesian (MrBayes) and maximum-likelihood (RAxML) methods. Results Various ancient proteins were identified in both the Middle and Late Pleistocene rhinoceros samples. Protein degradation and proteome complexity are consistent with an endogenous origin of the identified proteins. 
Phylogenetic analysis of informative proteins resolved the Perissodactyla phylogeny in agreement with previous studies with regard to the placement of the families Equidae, Tapiridae, and Rhinocerotidae. Stephanorhinus is shown to be most closely related to the genera Coelodonta and Dicerorhinus. The protein sequence data further places the Sumatran rhino in a clade together with the genus Rhinoceros, as opposed to forming a clade with the black and white rhinoceros species. Discussion The first biomolecular dataset available for Stephanorhinus places this genus together with the extinct genus Coelodonta and the extant genus Dicerorhinus. This is in agreement with morphological studies, although we are unable to resolve the order of divergence between these genera based on the protein sequences available. Our data supports the placement of the genus Dicerorhinus in a clade together with extant Rhinoceros species. Finally, the availability of protein sequence data for both extinct European rhinoceros genera allows future investigations into their geographic distribution and extinction chronologies. PMID:28316883
Díaz-Jaimes, Píndaro; Bayona-Vásquez, Natalia J.; Adams, Douglas H.; Uribe-Alcocer, Manuel
2015-01-01
Elasmobranchs are one of the most diverse groups in the marine realm, represented by 18 orders, 55 families and about 1200 reported species, but also one of the most vulnerable to exploitation and to climate change. Phylogenetic relationships among main orders have been controversial since the emergence of the Hypnosqualean hypothesis by Shirai (1992) that considered batoids as a sister group of sharks. The use of the complete mitochondrial DNA (mtDNA) may shed further light on this hypothesis by increasing the number of informative characters. We report the mtDNA genome of the bonnethead shark Sphyrna tiburo, and compare it with mitogenomes of 48 other species to assess phylogenetic relationships. The mtDNA genome of S. tiburo is quite similar in size to that of congeneric species but also similar to the reported mtDNA genome of other Carcharhinidae species. Like most vertebrate mitochondrial genomes, it contained 13 protein coding genes, two rRNA genes and 22 tRNA genes and the control region of 1086 bp (D-loop). The Bayesian analysis of the 49 mitogenomes supported the view that sharks and batoids are separate groups. PMID:27014583
Siriwut, Warut; Edgecombe, Gregory D.; Sutcharit, Chirasak; Panha, Somsak
2015-01-01
Seven Scolopendra species from the Southeast Asian mainland delimited based on standard external morphological characters represent monophyletic groups in phylogenetic trees inferred from concatenated sequences of three gene fragments (cytochrome c oxidase subunit 1, 16S rRNA and 28S rRNA) using Maximum likelihood and Bayesian inference. Geometric morphometric description of shape variation in the cephalic plate, forcipular coxosternite, and tergite of the ultimate leg-bearing segment provides additional criteria for distinguishing species. Colouration patterns in some Scolopendra species show a high degree of fit to phylogenetic trees at the population level. The most densely sampled species, Scolopendra dehaani Brandt, 1840, has three subclades with allopatric distributions in mainland SE Asia. The molecular phylogeny of S. pinguis Pocock, 1891, indicated ontogenetic colour variation among its populations. The taxonomic validation of S. dawydoffi Kronmüller, 2012, S. japonica Koch, 1878, and S. dehaani Brandt, 1840, each a former subspecies of S. subspinipes Leach, 1814 sensu Lewis, 2010, as full species was supported by molecular information and additional morphological data. Species delimitation in these taxonomically challenging animals is facilitated by an integrative approach that draws on both morphology and molecular phylogeny. PMID:26270342
Phylogenetic position of Loricifera inferred from nearly complete 18S and 28S rRNA gene sequences.
Yamasaki, Hiroshi; Fujimoto, Shinta; Miyazaki, Katsumi
2015-01-01
Loricifera is an enigmatic metazoan phylum; its morphology appeared to place it with Priapulida and Kinorhyncha in the group Scalidophora which, along with Nematoida (Nematoda and Nematomorpha), comprised the group Cycloneuralia. Scarce molecular data have suggested an alternative phylogenetic hypothesis, that the phylum Loricifera is a sister taxon to Nematomorpha, although the actual phylogenetic position of the phylum remains unclear. Ecdysozoan phylogeny was reconstructed through maximum-likelihood (ML) and Bayesian inference (BI) analyses of nuclear 18S and 28S rRNA gene sequences from 60 species representing all eight ecdysozoan phyla, and including a newly collected loriciferan species. Ecdysozoa comprised two clades with high support values in both the ML and BI trees. One consisted of Priapulida and Kinorhyncha, and the other of Loricifera, Nematoida, and Panarthropoda (Tardigrada, Onychophora, and Arthropoda). The relationships between Loricifera, Nematoida, and Panarthropoda were not well resolved. Loricifera appears to be closely related to Nematoida and Panarthropoda, rather than grouping with Priapulida and Kinorhyncha, as had been suggested by previous studies. Thus, both Scalidophora and Cycloneuralia are polyphyletic or paraphyletic groups. In addition, Loricifera and Nematomorpha did not emerge as sister groups.
Phylogenetic study of Class Armophorea (Alveolata, Ciliophora) based on 18S-rDNA data
da Silva Paiva, Thiago; do Nascimento Borges, Bárbara; da Silva-Neto, Inácio Domingos
2013-01-01
The 18S rDNA phylogeny of Class Armophorea, a group of anaerobic ciliates, is proposed based on an analysis of 44 sequences (out of 195) retrieved from the NCBI/GenBank database. Emphasis was placed on the use of two nucleotide alignment criteria that involved variation in the gap-opening and gap-extension parameters and the use of rRNA secondary structure to orientate multiple-alignment. A sensitivity analysis of 76 data sets was run to assess the effect of variations in indel parameters on tree topologies. Bayesian inference, maximum likelihood and maximum parsimony phylogenetic analyses were used to explore how different analytic frameworks influenced the resulting hypotheses. A sensitivity analysis revealed that the relationships among higher taxa of the Intramacronucleata were dependent upon how indels were determined during multiple-alignment of nucleotides. The phylogenetic analyses rejected the monophyly of the Armophorea most of the time and consistently indicated that the Metopidae and Nyctotheridae were related to the Litostomatea. There was no consensus on the placement of the Caenomorphidae, which could be a sister group of the Metopidae + Nyctotheridae, or could have diverged at the base of the Spirotrichea branch or the Intramacronucleata tree. PMID:24385862
Two C++ Libraries for Counting Trees on a Phylogenetic Terrace.
Biczok, R; Bozsoky, P; Eisenmann, P; Ernst, J; Ribizel, T; Scholz, F; Trefzer, A; Weber, F; Hamann, M; Stamatakis, A
2018-05-08
The presence of terraces in phylogenetic tree space, that is, a potentially large number of distinct tree topologies that have exactly the same analytical likelihood score, was first described by Sanderson et al. (2011). However, popular software tools for maximum likelihood and Bayesian phylogenetic inference do not yet routinely report whether inferred phylogenies reside on a terrace. We believe this is due to the lack of an efficient library to (i) determine if a tree resides on a terrace, (ii) calculate how many trees reside on a terrace, and (iii) enumerate all trees on a terrace. In our bioinformatics practical, which is set up as a programming contest, we developed two efficient and independent C++ implementations of the SUPERB algorithm by Constantinescu and Sankoff (1995) for counting and enumerating trees on a terrace. Both implementations yield exactly the same results, are more than one order of magnitude faster, and require one order of magnitude less memory than a previous third-party Python implementation. The source code is available under GNU GPL at https://github.com/terraphast. Contact: Alexandros.Stamatakis@h-its.org. Supplementary data are available at Bioinformatics online.
Application of Bayesian Approach in Cancer Clinical Trial
Bhattacharjee, Atanu
2014-01-01
The Bayesian approach is becoming more useful than classical methods in clinical trials. It is beneficial from the design phase through the analysis phase. It allows a straightforward statement to be made about the treatment effect of a drug. Complex computational problems are simple to handle with Bayesian techniques. The technique is feasible only when prior information about the data is available. Inference is established through posterior estimates. However, the method has some limitations. The objective of this work was to explore the merits and demerits of the Bayesian approach in cancer research. This review will help clinical researchers in oncology explore the limitations and power of Bayesian techniques. PMID:29147387
Riser, James P; Cardinal-McTeague, Warren M; Hall, Jocelyn C; Hahn, William J; Sytsma, Kenneth J; Roalson, Eric H
2013-10-01
A monophyletic group composed of five genera of the Cleomaceae represents an intriguing lineage with outstanding taxonomic and evolutionary questions. Generic boundaries are poorly defined, and historical hypotheses regarding the evolution of fruit type and phylogenetic relationships provide testable questions. This is the first detailed phylogenetic investigation of all 22 species in this group. We use this phylogenetic framework to assess generic monophyly and test Iltis's evolutionary "reduction series" hypothesis regarding phylogeny and fruit type/seed number. • Maximum likelihood and Bayesian analyses of four plastid intergenic spacer region sequences (rpl32-trnL, trnQ-rps16, ycf1-rps15, and psbA-trnH) and one nuclear (ITS) region were used to reconstruct phylogenetic relationships among the NA cleomoid species. Stochastic mapping and ancestral-state reconstruction were used to study the evolution of fruit type. • Both analyses recovered nearly identical phylogenies. Three of the currently recognized genera (Wislizenia, Carsonia, and Oxystylis) are monophyletic while two (Cleomella and Peritoma) are para- or polyphyletic. There was a single origin of the two-seeded schizocarp in the ancestor of the Oxystylis-Wislizenia clade and a secondary derivation of elongated capsule-type fruits in Peritoma from a truncated capsule state in Cleomella. • Our well-resolved phylogeny supports most of the current species circumscriptions but not current generic circumscriptions. Additionally, our results are inconsistent with Iltis's hypothesis of species with elongated many-seed fruits giving rise to species with truncated few-seeded fruits. Instead, we find support for the reversion to elongated multiseeded fruits from a truncate few-seeded ancestor in Peritoma.
Vieira, Willie A S; Lima, Waléria G; Nascimento, Eduardo S; Michereff, Sami J; Câmara, Marcos P S; Doyle, Vinson P
2017-01-01
Developing a comprehensive and reliable taxonomy for the Colletotrichum gloeosporioides species complex will require adopting data standards on the basis of an understanding of how methodological choices impact morphological evaluations and phylogenetic inference. We explored the impact of methodological choices in a morphological and molecular evaluation of Colletotrichum species associated with banana in Brazil. The choice of alignment filtering algorithm has a significant impact on topological inference and the retention of phylogenetically informative sites. Similarly, the choice of phylogenetic marker affects the delimitation of species boundaries, particularly if low phylogenetic signal is confounded with strong discordance, and inference of the species tree from multiple-gene trees. According to both phylogenetic informativeness profiling and Bayesian concordance analyses, the most informative loci are DNA lyase (APN2), intergenic spacer (IGS) between DNA lyase and the mating-type locus MAT1-2-1 (APN2/MAT-IGS), calmodulin (CAL), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), glutamine synthetase (GS), β-tubulin (TUB2), and a new marker, the intergenic spacer between GAPDH and a hypothetical protein (GAP2-IGS). Cornmeal agar minimizes the variance in conidial dimensions compared with potato dextrose agar and synthetic nutrient-poor agar, such that species are more readily distinguishable based on phenotypic differences. We apply these insights to investigate the diversity of Colletotrichum species associated with banana anthracnose in Brazil and report C. musae, C. tropicale, C. theobromicola, and C. siamense in association with banana anthracnose. One lineage did not cluster with any previously described species and is described here as C. chrysophilum.
Davaalkham, Jagdagsuren; Unenchimeg, Puntsag; Baigalmaa, Chultem; Erdenetuya, Gombo; Nyamkhuu, Dulmaa; Shiino, Teiichiro; Tsuchiya, Kiyoto; Hayashida, Tsunefusa; Gatanaga, Hiroyuki; Oka, Shinichi
2011-10-01
We investigated the current molecular epidemiological status of HIV-1 in Mongolia, a country with very low incidence of HIV-1 though with rapid expansion in recent years. HIV-1 pol (1065 nt) and env (447 nt) genes were sequenced to construct phylogenetic trees. The evolutionary rates, molecular clock phylogenies, and other evolutionary parameters were estimated from heterochronous genomic sequences of HIV-1 subtype B by the Bayesian Markov chain Monte Carlo method. We obtained 41 sera from 56 reported HIV-1-positive cases as of May 2009. The main route of infection was men who have sex with men (MSM). Dominant subtypes were subtype B in 32 cases (78%) followed by subtype CRF02_AG (9.8%). The phylogenetic analysis of the pol gene identified two clusters in subtype B sequences. Cluster 1 consisted of 21 cases including MSM and other routes of infection, and cluster 2 consisted of eight MSM cases. The tree analyses demonstrated very short branch lengths in cluster 1, suggesting a surprisingly active expansion of HIV-1 transmission during a short period with the same ancestor virus. Evolutionary analysis indicated that the outbreak started around the early 2000s. This study identified a current hot spot of HIV-1 transmission and potential seed of the epidemic in Mongolia. Comprehensive preventive measures targeting this group are urgently needed.
Amado, Yulien; Patiño-Uzcátegui, Anelvi; Cepero de García, Maria C; Tabima, Javier; Motta, Adriana; Cárdenas, Martha; Bernal, Adriana; Restrepo, Silvia; Celis, Adriana
2013-11-01
Seborrheic dermatitis (SD) is a chronic, widespread skin condition, which is considered a multifactorial disease influenced, in part, by Malassezia spp. opportunistic activities, as well as various endogenous and exogenous factors. Malassezia species are lipophilic, lipid-dependent yeasts that are members of the normal mycobiota of the human skin. Their isolation from SD lesions varies around the world and the study of the relationship among factors such as gender, age, immunosuppressive condition of the patient and SD development, can lead to a better understanding of this disease. To elucidate the association of age and gender with the development of SD and to precisely determine the Malassezia species involved in the disease, samples were obtained from 134 individuals, including individuals without lesions, human immunodeficiency virus positive patients, individuals with seborrheic dermatitis, and HIV patients with seborrheic dermatitis. Malassezia spp. were identified by phenotypic and genotypic methods and a phylogenetic analysis was performed using Bayesian inference. This study revealed that age and gender are not predisposing factors for SD development, and that the most frequent species of Malassezia related to SD development among the Colombian population is M. restricta. We also report the isolation of M. yamatoensis for the first time in Colombia, and propose an ITS2 secondary structure from Malassezia taxa that can be used for precise identification and to establish more robust phylogenetic relationships.
Rodriguez-Roche, Rosmari; Villegas, Elci; Cook, Shelley; Poh Kim, Pauline A.W.; Hinojosa, Yoandri; Rosario, Delfina; Villalobos, Iris; Bendezu, Herminia; Hibberd, Martin L.; Guzman, Maria G.
2012-01-01
During the past three decades there has been a notable increase in dengue disease severity in Venezuela. Nevertheless, the population structure of the viruses being transmitted in this country is not well understood. Here, we present a molecular epidemiological study on dengue viruses (DENV) circulating in Aragua State, Venezuela during 2006–2007. Twenty-one DENV full-length genomes representing all of the four serotypes were amplified and sequenced directly from the serum samples. Notably, only DENV-2 was associated with severe disease. Phylogenetic trees constructed using Bayesian methods indicated that only one genotype was circulating for each serotype. However, extensive viral genetic diversity was found in DENV isolated from the same area during the same period, indicating significant in situ evolution since the introduction of these genotypes. Collectively, the results suggest that the non-structural (NS) proteins may play an important role in DENV evolution, particularly NS1, NS2A and NS4B proteins. The phylogenetic data provide evidence to suggest that multiple introductions of DENV have occurred from the Latin American region into Venezuela and vice versa. The implications of the significant viral genetic diversity generated during hyperendemic transmission, particularly in NS proteins, are discussed and considered in the context of future development and use of human monoclonal antibodies as antivirals and tetravalent vaccines. PMID:22197765
Shirley, Matthew H.; Vliet, Kent A.; Carr, Amanda N.; Austin, James D.
2014-01-01
Accurate species delimitation is a central assumption of biology that, in groups such as the Crocodylia, is often hindered by highly conserved morphology and frequent introgression. In Africa, crocodilian systematics has been hampered by complex regional biogeography and confounded taxonomic history. We used rigorous molecular and morphological species delimitation methods to test the hypothesis that the slender-snouted crocodile (Mecistops cataphractus) is composed of multiple species corresponding to the Congolian and Guinean biogeographic zones. Speciation probability was assessed by using 11 mitochondrial and nuclear genes, and cranial morphology for over 100 specimens, representing the full geographical extent of the species distribution. Molecular Bayesian and phylogenetic species delimitation showed unanimous support for two Mecistops species isolated to the Upper Guinean and Congo (including Lower Guinean) biomes that were supported by 13 cranial characters capable of unambiguously diagnosing each species. Fossil-calibrated phylogenetic reconstruction estimated that the species split ± 6.5–7.5 Ma, which is congruent with intraspecies divergence within the sympatric crocodile genus Osteolaemus and the formation of the Cameroon Volcanic Line. Our results underscore the necessity of comprehensive phylogeographic analyses within currently recognized taxa to detect cryptic species within the Crocodylia. We recommend that the community of crocodilian researchers reconsider the conceptualization of crocodilian species especially in the light of the conservation ramifications for this economically and ecologically important group. PMID:24335982
Dediu, Dan; Levinson, Stephen C.
2012-01-01
Language is the best example of a cultural evolutionary system, able to retain a phylogenetic signal over many thousands of years. The temporal stability (conservatism) of basic vocabulary is relatively well understood, but the stability of the structural properties of language (phonology, morphology, syntax) is still unclear. Here we report an extensive Bayesian phylogenetic investigation of the structural stability of numerous features across many language families and we introduce a novel method for analyzing the relationships between the “stability profiles” of language families. We found that there is a strong universal component across language families, suggesting the existence of universal linguistic, cognitive and genetic constraints. Against this background, however, each language family has a distinct stability profile, and these profiles cluster by geographic area and likely deep genealogical relationships. These stability profiles seem to show, for example, the ancient historical relationships between the Siberian and American language families, presumed to be separated by at least 12,000 years, and possible connections between the Eurasian families. We also found preliminary support for the punctuated evolution of structural features of language across families, types of features and geographic areas. Thus, such higher-level properties of language seen as an evolutionary system might allow the investigation of ancient connections between languages and shed light on the peopling of the world. PMID:23028843
Gehring, Philip-Sebastian; Tolley, Krystal A; Eckhardt, Falk Sebastian; Townsend, Ted M; Ziegler, Thomas; Ratsoavina, Fanomezana; Glaw, Frank; Vences, Miguel
2012-01-01
We conducted a comprehensive molecular phylogenetic study for a group of chameleons from Madagascar (Chamaeleonidae: Calumma nasutum group, comprising seven nominal species) to examine the genetic and species diversity in this widespread genus. Based on DNA sequences of the mitochondrial gene (ND2) from 215 specimens, we reconstructed the phylogeny using a Bayesian approach. Our results show deep divergences among several unnamed mitochondrial lineages that are difficult to identify morphologically. We evaluated lineage diversification using a number of statistical phylogenetic methods (general mixed Yule-coalescent model; SpeciesIdentifier; net p-distances) to objectively delimit lineages that we here consider as operational taxonomic units (OTUs), and for which the taxonomic status remains largely unknown. In addition, we compared molecular and morphological differentiation in detail for one particularly diverse clade (the C. boettgeri complex) from northern Madagascar. To assess the species boundaries within this group we used an integrative taxonomic approach, combining evidence from two independent molecular markers (ND2 and CMOS), together with genital and other external morphological characters, and conclude that some of the newly discovered OTUs are separate species (confirmed candidate species, CCS), while others should best be considered as deep conspecific lineages (DCLs). Our analysis supports a total of 33 OTUs, of which seven correspond to described species, suggesting that the taxonomy of the C. nasutum group is in need of revision. PMID:22957155
Phylogenetic analysis of the light-harvesting system in Chromera velia.
Pan, Hao; Slapeta, Jan; Carter, Dee; Chen, Min
2012-03-01
Chromera velia is a newly discovered photosynthetic eukaryotic alga that has functional chloroplasts closely related to the apicoplast of apicomplexan parasites. Recently, the chloroplast in C. velia was shown to be derived from the red algal lineage. Light-harvesting protein complexes (LHC), which are a group of proteins involved in photon capture and energy transfer in photosynthesis, are important for photosynthesis efficiency, photo-adaptation/accumulation and photo-protection. Although these proteins are encoded by genes located in the nucleus, LHC peptides migrate and function in the chloroplast, hence the LHC may have a different evolutionary history compared to chloroplast evolution. Here, we compare the phylogenetic relationship of the C. velia LHCs to LHCs from other photosynthetic organisms. Twenty-three LHC homologues retrieved from C. velia EST sequences were aligned according to their conserved regions. The C. velia LHCs are positioned in four separate groups on trees constructed by neighbour-joining, maximum likelihood and Bayesian methods. A major group of seventeen LHCs from C. velia formed a separate cluster that was closest to dinoflagellate LHC, and to LHC and fucoxanthin chlorophyll-binding proteins from diatoms. One C. velia LHC sequence grouped with LI1818/LI818-like proteins, which were recently identified as environmental stress-induced protein complexes. Only three LHC homologues from C. velia grouped with the LHCs from red algae.
Barth, Julia M. I.; Matschiner, Michael; Robertson, Bruce C.
2013-01-01
The New Zealand Dotterel (Charadrius obscurus), an endangered shorebird of the family Charadriidae, is endemic to New Zealand where two subspecies are recognized. These subspecies are not only separated geographically, with C. o. aquilonius being distributed in the New Zealand North Island and C. o. obscurus mostly restricted to Stewart Island, but also differ substantially in morphology and behavior. Despite these divergent traits, previous work has failed to detect genetic differentiation between the subspecies, and the question of when and where the two populations separated is still open. Here, we use mitochondrial and nuclear markers to address molecular divergence between the subspecies, and apply maximum likelihood and Bayesian methods to place C. obscurus within the non-monophyletic genus Charadrius. Despite very little overall differentiation, distinct haplotypes for the subspecies were detected, thus supporting molecular separation of the northern and southern populations. Phylogenetic analysis recovers a monophyletic clade combining the New Zealand Dotterel with two other New Zealand endemic shorebirds, the Wrybill and the Double-Banded Plover, thus suggesting a single dispersal event as the origin of this group. Divergence dates within Charadriidae were estimated with BEAST 2, and our results indicate a Middle Miocene origin of New Zealand endemic Charadriidae, a Late Miocene emergence of the lineage leading to the New Zealand Dotterel, and a Middle to Late Pleistocene divergence of the two New Zealand Dotterel subspecies. PMID:24205094
Comparing interval estimates for small sample ordinal CFA models
Natesan, Prathiba
2015-01-01
Robust maximum likelihood (RML) and asymptotically generalized least squares (AGLS) methods have been recommended for fitting ordinal structural equation models. Studies show that some of these methods underestimate standard errors. However, these studies have not investigated the coverage and bias of interval estimates. An estimate with a reasonable standard error could still be severely biased. This can only be known by systematically investigating the interval estimates. The present study compares Bayesian, RML, and AGLS interval estimates of factor correlations in ordinal confirmatory factor analysis models (CFA) for small sample data. Six sample sizes, 3 factor correlations, and 2 factor score distributions (multivariate normal and multivariate mildly skewed) were studied. Two Bayesian prior specifications, informative and relatively less informative, were studied. Undercoverage of confidence intervals and underestimation of standard errors were common in non-Bayesian methods. Underestimated standard errors may lead to inflated Type-I error rates. Non-Bayesian intervals were more often positively biased than negatively biased, that is, most intervals that did not contain the true value were greater than the true value. Some non-Bayesian methods had non-converging and inadmissible solutions for small samples and non-normal data. Bayesian empirical standard error estimates for informative and relatively less informative priors were closer to the average standard errors of the estimates. The coverage of Bayesian credibility intervals was closer to what was expected with overcoverage in a few cases. Although some Bayesian credibility intervals were wider, they reflected the nature of statistical uncertainty that comes with the data (e.g., small sample). Bayesian point estimates were also more accurate than non-Bayesian estimates. The results illustrate the importance of analyzing coverage and bias of interval estimates, and how ignoring interval estimates can be misleading. 
Therefore, editors and policymakers should continue to emphasize the inclusion of interval estimates in research. PMID:26579002
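As a minimal sketch of how the coverage and bias direction of interval estimates described in this abstract might be tallied, the Python snippet below counts how many intervals contain a known true value and how many miss it from above or below. The function name `interval_coverage` and the example intervals are hypothetical and made up for illustration; they are not taken from the study.

```python
from typing import Iterable, Tuple


def interval_coverage(
    intervals: Iterable[Tuple[float, float]], true_value: float
) -> Tuple[float, float, float]:
    """Return (coverage, share_above, share_below) for (lower, upper) intervals.

    coverage    : share of intervals that contain true_value
    share_above : share lying entirely above true_value (positively biased misses)
    share_below : share lying entirely below true_value (negatively biased misses)
    """
    pairs = list(intervals)
    if not pairs:
        raise ValueError("need at least one interval")
    n = len(pairs)
    covered = sum(1 for lo, hi in pairs if lo <= true_value <= hi)
    above = sum(1 for lo, hi in pairs if lo > true_value)
    below = sum(1 for lo, hi in pairs if hi < true_value)
    return covered / n, above / n, below / n


# Made-up intervals for a factor correlation whose true value is assumed to be 0.3.
example = [(0.10, 0.50), (0.35, 0.60), (0.20, 0.40), (0.05, 0.25)]
coverage, share_above, share_below = interval_coverage(example, 0.3)
print(coverage, share_above, share_below)
```

In a simulation study, the same tally would be run over the intervals from every replication and compared with the nominal level (for example 0.95); undercoverage shows up as a coverage share below that level.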
Comparing interval estimates for small sample ordinal CFA models.
Natesan, Prathiba
2015-01-01
Robust maximum likelihood (RML) and asymptotically generalized least squares (AGLS) methods have been recommended for fitting ordinal structural equation models. Studies show that some of these methods underestimate standard errors. However, these studies have not investigated the coverage and bias of interval estimates. An estimate with a reasonable standard error could still be severely biased. This can only be known by systematically investigating the interval estimates. The present study compares Bayesian, RML, and AGLS interval estimates of factor correlations in ordinal confirmatory factor analysis models (CFA) for small sample data. Six sample sizes, 3 factor correlations, and 2 factor score distributions (multivariate normal and multivariate mildly skewed) were studied. Two Bayesian prior specifications, informative and relatively less informative were studied. Undercoverage of confidence intervals and underestimation of standard errors was common in non-Bayesian methods. Underestimated standard errors may lead to inflated Type-I error rates. Non-Bayesian intervals were more positive biased than negatively biased, that is, most intervals that did not contain the true value were greater than the true value. Some non-Bayesian methods had non-converging and inadmissible solutions for small samples and non-normal data. Bayesian empirical standard error estimates for informative and relatively less informative priors were closer to the average standard errors of the estimates. The coverage of Bayesian credibility intervals was closer to what was expected with overcoverage in a few cases. Although some Bayesian credibility intervals were wider, they reflected the nature of statistical uncertainty that comes with the data (e.g., small sample). Bayesian point estimates were also more accurate than non-Bayesian estimates. The results illustrate the importance of analyzing coverage and bias of interval estimates, and how ignoring interval estimates can be misleading. 
Therefore, editors and policymakers should continue to emphasize the inclusion of interval estimates in research.
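The notions of interval coverage and direction of bias used in the abstract above can be illustrated with a small simulation. The sketch below is not the study's design (which involved ordinal CFA models); it assumes a much simpler setting, the mean of normal data with a normal-approximation interval, and the function name `interval_coverage` and all parameter values are hypothetical choices made for illustration only.

```python
import random
import statistics

def interval_coverage(n, reps=5000, true_mean=0.0, true_sd=1.0, z=1.96, seed=1):
    """Estimate coverage of the interval mean +/- z * s / sqrt(n) by simulation."""
    rng = random.Random(seed)
    covered = above = below = 0
    for _ in range(reps):
        sample = [rng.gauss(true_mean, true_sd) for _ in range(n)]
        center = statistics.mean(sample)
        half_width = z * statistics.stdev(sample) / n ** 0.5
        lower, upper = center - half_width, center + half_width
        if lower > true_mean:
            above += 1      # whole interval lies above the true value
        elif upper < true_mean:
            below += 1      # whole interval lies below the true value
        else:
            covered += 1    # interval contains the true value
    return covered / reps, above / reps, below / reps

coverage, miss_above, miss_below = interval_coverage(n=10)
print(f"coverage={coverage:.3f} above={miss_above:.3f} below={miss_below:.3f}")
```

With n = 10, intervals built this way tend to cover the true mean less often than the nominal 95%, because 1.96 is a normal quantile rather than a t quantile. Comparing the "above" and "below" miss rates is one simple way to check whether the non-covering intervals lean in one direction.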
A new prior for Bayesian anomaly detection: application to biosurveillance.
Shen, Y; Cooper, G F
2010-01-01
Bayesian anomaly detection computes posterior probabilities of anomalous events by combining prior beliefs and evidence from data. However, the specification of prior probabilities can be challenging. This paper describes a Bayesian prior in the context of disease outbreak detection. The goal is to provide a meaningful, easy-to-use prior that yields a posterior probability of an outbreak that performs at least as well as a standard frequentist approach. If this goal is achieved, the resulting posterior could be usefully incorporated into a decision analysis about how to act in light of a possible disease outbreak. This paper describes a Bayesian method for anomaly detection that combines learning from data with a semi-informative prior probability over patterns of anomalous events. A univariate version of the algorithm is presented here for ease of illustration of the essential ideas. The paper describes the algorithm in the context of disease-outbreak detection, but it is general and can be used in other anomaly detection applications. For this application, the semi-informative prior specifies that an increased count over baseline is expected for the variable being monitored, such as the number of respiratory chief complaints per day at a given emergency department. The semi-informative prior is derived from the baseline prior, which is estimated from historical data. The evaluation reported here used semi-synthetic data to compare the detection performance of the proposed Bayesian method with that of a control chart method, which is a standard frequentist algorithm that is closest to the Bayesian method in terms of the type of data it uses. The disease-outbreak detection performance of the Bayesian method was statistically significantly better than that of the control chart method when proper baseline periods were used to estimate the baseline behavior to avoid seasonal effects. 
When using longer baseline periods, the Bayesian method performed as well as the control chart method. The time complexity of the Bayesian algorithm is linear in the number of the observed events being monitored, due to a novel, closed-form derivation that is introduced in the paper. This paper introduces a novel prior probability for Bayesian outbreak detection that is expressive, easy to apply, computationally efficient, and performs as well as or better than a standard frequentist method.
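The general idea of turning a prior outbreak probability and an observed daily count into a posterior outbreak probability can be sketched as follows. This is not the semi-informative prior or the closed-form derivation of the paper above; it is a deliberately simplified two-hypothesis example that assumes Poisson counts with a known baseline rate and a single hypothetical elevated outbreak rate. The names `poisson_logpmf` and `outbreak_posterior` and all numeric values are illustrative assumptions.

```python
import math

def poisson_logpmf(k, lam):
    """Log probability of observing count k under a Poisson(lam) model."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)

def outbreak_posterior(count, baseline_rate, outbreak_rate, prior_outbreak):
    """Posterior probability of 'outbreak' versus 'no outbreak' via Bayes' rule."""
    log_no = math.log(1.0 - prior_outbreak) + poisson_logpmf(count, baseline_rate)
    log_yes = math.log(prior_outbreak) + poisson_logpmf(count, outbreak_rate)
    m = max(log_no, log_yes)  # subtract the maximum for numerical stability
    w_no, w_yes = math.exp(log_no - m), math.exp(log_yes - m)
    return w_yes / (w_no + w_yes)

# Hypothetical setting: 10 complaints per day at baseline, 20 during an outbreak,
# and a 1% prior probability that an outbreak is under way.
for observed in (10, 15, 20, 25):
    p = outbreak_posterior(observed, baseline_rate=10.0, outbreak_rate=20.0,
                           prior_outbreak=0.01)
    print(observed, round(p, 4))
```

In this toy setting a count at the baseline level pushes the posterior below the prior, while counts well above baseline raise it, which is the qualitative behavior a decision analysis would rely on.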
Li, De-Zhu
2011-01-01
Background Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies. Methodology/Principal Findings Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae. Conclusions/Significance The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. 
The difficulty in resolving the diversification among three clades of temperate woody bamboos, even with complete cp genome sequences, suggests that these lineages may have diverged very rapidly. PMID:21655229
Alfonso-Morales, Abdulahi; Rios, Liliam; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Ganges, Llilianne; Díaz de Arce, Heidy; Majó, Natàlia; Núñez, José I; Pérez, Lester J
2015-01-01
Infectious bursal disease (IBD) is a highly contagious and acute viral disease, which has caused high mortality rates in birds and considerable economic losses in different parts of the world for more than two decades, and it still represents a considerable threat to poultry. The current study was designed to rigorously measure the reliability of a phylogenetic marker included in segment B. This marker can facilitate molecular epidemiology studies, incorporating this segment of the viral genome, to better explain the links between emergence, spreading and maintenance of the very virulent IBD virus (vvIBDV) strains worldwide. Sequences of the segment B gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank Database; Cuban sequences were obtained in the current work. A phylogenetic marker named B-marker was assessed by different phylogenetic principles such as saturation of substitution, phylogenetic noise and high consistency. This last parameter is based on the ability of B-marker to reconstruct the same topology as the complete segment B of the viral genome. From the results obtained with B-marker, the demographic history of both main lineages of IBDV with respect to segment B was inferred by Bayesian skyline plot analysis. Phylogenetic analysis for both segments of the IBDV genome was also performed, revealing the presence of a natural reassortant strain with segment A from vvIBDV strains and segment B from non-vvIBDV strains within the Cuban IBDV population. This study contributes to a better understanding of the emergence of vvIBDV strains, describing the molecular epidemiology of IBDV using state-of-the-art methodology for phylogenetic reconstruction. This study also revealed the presence of a novel natural reassortant strain as a possible manifestation of change in the genetic structure and stability of the vvIBDV strains. 
Therefore, it highlights the need to obtain information about both genome segments of IBDV for molecular epidemiology studies.
Kuch, Ulrich; Keogh, J Scott; Weigel, John; Smith, Laurie A; Mebs, Dietrich
2005-03-01
King brown snakes or mulga snakes (Pseudechis australis) are the largest and among the most dangerous and wide-ranging venomous snakes in Australia and New Guinea. They occur in diverse habitats, are important predators, and exhibit considerable morphological variation. We infer the relationships and historical biogeography of P. australis based on phylogenetic analysis of 1,249 base pairs from the mitochondrial cytochrome b, NADH dehydrogenase subunit 4 and three adjacent tRNA genes using Bayesian, maximum-likelihood, and maximum-parsimony methods. All methods reveal deep phylogenetic structure with four strongly supported clades comprising snakes from New Guinea (I), localities all over Australia (II), the Kimberleys of Western Australia (III), and north-central Australia (IV), suggesting a much more ancient radiation than previously believed. This conclusion is robust to different molecular clock estimations indicating divergence in Pliocene or Late Miocene, after landbridge dispersal to New Guinea had occurred. While members of clades I, III and IV are medium-sized, slender snakes, those of clade II attain large sizes and a robust build, rendering them top predators in their ecosystems. Genetic differentiation within clade II is low and haplotype distribution largely incongruent with geography or colour morphs, suggesting Pleistocene dispersal and recent ecomorph evolution. Significant haplotype diversity exists in clades III and IV, implying that clade IV comprises two species. Members of clade II are broadly sympatric with members of both northern Australian clades. Thus, our data support the recognition of at least five species from within P. australis (auct.) under various criteria. We discuss biogeographical, ecological and medical implications of our findings.
Crandell, Jamie L.; Voils, Corrine I.; Chang, YunKyung; Sandelowski, Margarete
2010-01-01
The possible utility of Bayesian methods for the synthesis of qualitative and quantitative research has been repeatedly suggested but insufficiently investigated. In this project, we developed and used a Bayesian method for synthesis, with the goal of identifying factors that influence adherence to HIV medication regimens. We investigated the effect of 10 factors on adherence. Recognizing that not all factors were examined in all studies, we considered standard methods for dealing with missing data and chose a Bayesian data augmentation method. We were able to summarize, rank, and compare the effects of each of the 10 factors on medication adherence. This is a promising methodological development in the synthesis of qualitative and quantitative research. PMID:21572970
NASA Astrophysics Data System (ADS)
Granade, Christopher; Combes, Joshua; Cory, D. G.
2016-03-01
In recent years, Bayesian methods have been proposed as a solution to a wide range of issues in quantum state and process tomography. State-of-the-art Bayesian tomography solutions suffer from three problems: numerical intractability, a lack of informative prior distributions, and an inability to track time-dependent processes. Here, we address all three problems. First, we use modern statistical methods, as pioneered by Huszár and Houlsby (2012 Phys. Rev. A 85 052120) and by Ferrie (2014 New J. Phys. 16 093035), to make Bayesian tomography numerically tractable. Our approach allows for practical computation of Bayesian point and region estimators for quantum states and channels. Second, we propose the first priors on quantum states and channels that allow for including useful experimental insight. Finally, we develop a method that allows tracking of time-dependent states and estimates the drift and diffusion processes affecting a state. We provide source code and animated visual examples for our methods.
Bayesian data analysis in observational comparative effectiveness research: rationale and examples.
Olson, William H; Crivera, Concetta; Ma, Yi-Wen; Panish, Jessica; Mao, Lian; Lynch, Scott M
2013-11-01
Many comparative effectiveness research and patient-centered outcomes research studies will need to be observational for one or both of two reasons: first, randomized trials are expensive and time-consuming; and second, only observational studies can answer some research questions. It is generally recognized that there is a need to increase the scientific validity and efficiency of observational studies. Bayesian methods for the design and analysis of observational studies are scientifically valid and offer many advantages over frequentist methods, including, importantly, the ability to conduct comparative effectiveness research/patient-centered outcomes research more efficiently. Bayesian data analysis is being introduced into outcomes studies that we are conducting. Our purpose here is to describe our view of some of the advantages of Bayesian methods for observational studies and to illustrate both realized and potential advantages by describing studies we are conducting in which various Bayesian methods have been or could be implemented.
Markov Chain Monte Carlo Methods for Bayesian Data Analysis in Astronomy
NASA Astrophysics Data System (ADS)
Sharma, Sanjib
2017-08-01
Markov Chain Monte Carlo based Bayesian data analysis has now become the method of choice for analyzing and interpreting data in almost all disciplines of science. In astronomy, over the last decade, we have also seen a steady increase in the number of papers that employ Monte Carlo based Bayesian analysis. New, efficient Monte Carlo based methods are continuously being developed and explored. In this review, we first explain the basics of Bayesian theory and discuss how to set up data analysis problems within this framework. Next, we provide an overview of various Monte Carlo based methods for performing Bayesian data analysis. Finally, we discuss advanced ideas that enable us to tackle complex problems and thus hold great promise for the future. We also distribute downloadable computer software (available at https://github.com/sanjibs/bmcmc/ ) that implements some of the algorithms and examples discussed here.
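As a minimal illustration of the kind of Monte Carlo method the review above surveys, the sketch below implements a random-walk Metropolis sampler for the mean of normally distributed data. It is not taken from the bmcmc package referenced in the abstract; the model (known unit noise, wide normal prior), the function names and the toy data are all assumptions chosen for illustration.

```python
import math
import random

def log_posterior(mu, data, sigma=1.0, prior_sd=10.0):
    """Log posterior (up to a constant) of a normal mean with a normal prior."""
    log_prior = -0.5 * (mu / prior_sd) ** 2
    log_like = -0.5 * sum(((x - mu) / sigma) ** 2 for x in data)
    return log_prior + log_like

def metropolis(data, n_steps=20000, step_sd=0.5, seed=0):
    """Random-walk Metropolis sampler for the mean mu."""
    rng = random.Random(seed)
    mu = 0.0
    current = log_posterior(mu, data)
    samples = []
    for _ in range(n_steps):
        proposal = mu + rng.gauss(0.0, step_sd)
        candidate = log_posterior(proposal, data)
        # Accept with probability min(1, posterior ratio); 1 - random() lies in (0, 1].
        if math.log(1.0 - rng.random()) < candidate - current:
            mu, current = proposal, candidate
        samples.append(mu)
    return samples

data = [4.8, 5.1, 5.3, 4.9, 5.2, 5.0, 4.7, 5.4]  # hypothetical measurements
samples = metropolis(data)
burned = samples[5000:]  # discard burn-in
posterior_mean = sum(burned) / len(burned)
print(round(posterior_mean, 2))
```

For this conjugate toy model the posterior mean is available analytically (the data sum divided by n plus the prior precision), which makes it a convenient check that the sampler behaves sensibly before moving to models without closed forms.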
NASA Astrophysics Data System (ADS)
Felgaer, Pablo; Britos, Paola; García-Martínez, Ramón
A Bayesian network is a directed acyclic graph in which each node represents a variable and each arc a probabilistic dependency; such networks provide a compact representation of knowledge and flexible methods of reasoning. Learning a network from data is a process divided into two steps: structural learning and parametric learning. In this paper we define an automatic learning method that optimizes Bayesian networks applied to classification, using a hybrid learning method that combines the advantages of decision-tree induction techniques (TDIDT-C4.5) with those of Bayesian networks. The resulting method is applied to prediction in the health domain.
Chancey, Caren; Ball, Christopher; Akolkar, Namita; Land, Kevin J.; Winkelman, Valerie; Stramer, Susan L.; Kramer, Laura D.; Rios, Maria
2013-01-01
West Nile virus (WNV), an arbovirus maintained in a bird-mosquito enzootic cycle, can infect other vertebrates including humans. WNV was first reported in the US in 1999 where, to date, three genotypes belonging to WNV lineage I have been described (NY99, WN02, SW/WN03). We report here the WNV sequences obtained from two birds, one mosquito, and 29 selected human samples acquired during the US epidemics from 2006–2011 and our examination of the evolutionary dynamics in the open-reading frame of WNV isolates reported from 1999–2011. Maximum-likelihood and Bayesian methods were used to perform the phylogenetic analyses and selection pressure analyses were conducted with the HyPhy package. Phylogenetic analysis identified human WNV isolates within the main WNV genotypes that have circulated in the US. Within genotype SW/WN03, we have identified a cluster with strains derived from blood donors and birds from Idaho and North Dakota collected during 2006–2007, termed here MW/WN06. Using different codon-based and branch-site selection models, we detected a number of codons subjected to positive pressure in WNV genes. The mean nucleotide substitution rate for WNV isolates obtained from humans was calculated to be 5.06×10⁻⁴ substitutions/site/year (s/s/y). The Bayesian skyline plot shows that after a period of high genetic variability following the introduction of WNV into the US, the WNV population appears to have reached genetic stability. The establishment of WNV in the US represents a unique opportunity to understand how an arbovirus adapts and evolves in a naïve environment. We describe a novel, well-supported cluster of WNV formed by strains collected from humans and birds from Idaho and North Dakota. Adequate genetic surveillance is essential to public health since new mutants could potentially affect viral pathogenesis, decrease performance of diagnostic assays, and negatively impact the efficacy of vaccines and the development of specific therapies. PMID:23738027
Lund, Eric K; O'Connor, Patrick M; Loewen, Mark A; Jinnah, Zubair A
2016-01-01
The Upper Cretaceous (middle-late Campanian) Wahweap Formation of southern Utah contains the oldest diagnostic evidence of ceratopsids (to date, all centrosaurines) in North America, with a number of specimens recovered from throughout a unit that spans between 81 and 77 Ma. Only a single specimen has been formally named, Diabloceratops eatoni, from the lower middle member of the formation. Machairoceratops cronusi gen. et sp. nov., a new centrosaurine ceratopsid from the upper member of the Wahweap Formation, is here described based on cranial material representing a single individual recovered from a calcareous mudstone. The specimen consists of two curved and elongate orbital horncores, a left jugal, a nearly complete, slightly deformed braincase, the left squamosal, and a mostly complete parietal ornamented by posteriorly projected, anterodorsally curved, elongate spikes on either side of a midline embayment. The fan-shaped, stepped-squamosal is diagnostic of Centrosaurinae, however, this element differs from the rectangular squamosal in Diabloceratops. Machairoceratops also differs in the possession of two anterodorsally (rather than laterally) curved epiparietal ornamentations on either side of a midline embayment that are distinguished by a posteromedially-oriented sulcus along the entire length of the epiparietal. Additionally, the parietosquamosal frill is lacking any other epiossifications along its periphery. Machairoceratops shares a triangular (rather than round) frill and spike-like epiparietal loci (p1) ornamentation with the stratigraphically lower Diabloceratops. Both parsimony and Bayesian phylogenetic analyses place Machairoceratops as an early-branching centrosaurine. However, the parsimony-based analysis provides little resolution for the position of the new taxon, placing it in an unresolved polytomy with Diabloceratops. 
The resultant Bayesian topology yielded better resolution, aligning Machairoceratops as the definitive sister taxon to a clade formed by Diabloceratops and Albertaceratops. Considered together, both phylogenetic methods unequivocally place Machairoceratops as an early-branching centrosaurine, and given the biostratigraphic position of Machairoceratops, these details increase the known ceratopsid diversity from both the Wahweap Formation and the southern portion of Laramidia. Finally, the unique morphology of the parietal ornamentation highlights the evolutionary disparity of frill ornamentation near the base of Centrosaurinae.
2016-01-01
Background Cloud forests, characterized by a persistent, frequent or seasonal low-level cloud cover and fragmented distribution, are one of the most threatened habitats, especially in the Neotropics. Tree ferns are among the most conspicuous elements in these forests, and ferns are restricted to regions in which minimum temperatures rarely drop below freezing and rainfall is high and evenly distributed around the year. Current phylogeographic data suggest that some of the cloud forest-adapted species remained in situ or expanded to the lowlands during glacial cycles and contracted allopatrically during the interglacials. Although the observed genetic signals of population size changes of cloud forest-adapted species including tree ferns correspond to predicted changes by Pleistocene climate change dynamics, the observed patterns of intraspecific lineage divergence showed temporal incongruence. Methods Here we combined phylogenetic analyses, ancestral area reconstruction, and divergence time estimates with climatic and altitudinal data (environmental space) for phenotypic traits of tree fern species to make inferences about evolutionary processes in deep time. We used phylogenetic Bayesian inference and geographic and altitudinal distribution of tree ferns to investigate ancestral area and elevation and environmental preferences of Mesoamerican tree ferns. The phylogeny was then used to estimate divergence times and ask whether the ancestral area and elevation and environmental shifts were linked to climatic events and historical climatic preferences. Results Bayesian trees retrieved Cyathea, Alsophyla, Gymnosphaera and Sphaeropteris in monophyletic clades. Splits for species in these genera found in Mesoamerican cloud forests are recent, from the Neogene to the Quaternary, Australia was identified as the ancestral area for the clades of these genera, except for Gymnosphaera that was Mesoamerica. 
Climate tolerance was not divergent from hypothesized ancestors for the most significant variables or elevation. For elevational shifts, we found repeated change from low to high elevations. Conclusions Our data suggest that representatives of Cyatheaceae main lineages migrated from Australia to Mesoamerican cloud forests at different times and have persisted in these environmentally unstable areas, but extant species diverged recently from their ancestors. PMID:27896030
Rodriguez, Juanita; Pitts, James P; Florez, Jaime A; Bond, Jason E; von Dohlen, Carol D
2016-01-01
Pompilinae is one of the largest subfamilies of spider wasps (Pompilidae). Most pompilines are generalist spider predators at the family level, but some taxa exhibit ecological specificity (i.e., to spider-host guild). Here we present the first molecular phylogenetic analysis of Pompilinae, toward the aim of evaluating the monophyly of tribes and genera. We further test whether changes in the rate of diversification are associated with host-guild shifts. Molecular data were collected from five nuclear loci (28S, EF1-F2, LWRh, Wg, Pol2) for 76 taxa in 39 genera. Data were analyzed using maximum likelihood (ML) and Bayesian inference (BI). The phylogenetic results were compared with previous hypotheses of subfamilial and tribal classification, as well as generic relationships in the subfamily. The classification of Pompilus and Agenioideus is also discussed. A Bayesian relaxed molecular clock analysis was used to examine divergence times. Diversification rate-shift tests accounted for taxon-sampling bias using ML and BI approaches. Ancestral host family and host guild were reconstructed using MP and ML methods. Ancestral host guild for all Pompilinae, for the ancestor at the node where a diversification rate-shift was detected, and two more nodes back in time was inferred using BI. In the resulting phylogenies, Aporini was the only previously proposed monophyletic tribe. Several genera (e.g., Pompilus, Microphadnus and Schistonyx) are also not monophyletic. Dating analyses produced a well-supported chronogram consistent with topologies from BI and ML results. The BI ancestral host-use reconstruction inferred the use of spiders belonging to the guild "other hunters" (frequenting the ground and vegetation) as the ancestral state for Pompilinae. This guild had the highest probability for the ML reconstruction and was equivocal for the MP reconstruction; various switching events to other guilds occurred throughout the evolution of the group. 
The diversification of Pompilinae shows one main rate-shift coinciding with a shift to ground-hunter spiders, as reconstructed by the BI ancestral character-state analysis. Copyright © 2015 Elsevier Inc. All rights reserved.
Bayesian Analysis of Longitudinal Data Using Growth Curve Models
ERIC Educational Resources Information Center
Zhang, Zhiyong; Hamagami, Fumiaki; Wang, Lijuan Lijuan; Nesselroade, John R.; Grimm, Kevin J.
2007-01-01
Bayesian methods for analyzing longitudinal data in social and behavioral research are recommended for their ability to incorporate prior information in estimating simple and complex models. We first summarize the basics of Bayesian methods before presenting an empirical example in which we fit a latent basis growth curve model to achievement data…
Yang, Jingjing; Cox, Dennis D; Lee, Jong Soo; Ren, Peng; Choi, Taeryon
2017-12-01
Functional data are defined as realizations of random functions (mostly smooth functions) varying over a continuum, which are usually collected on discretized grids with measurement errors. In order to accurately smooth noisy functional observations and deal with the issue of high-dimensional observation grids, we propose a novel Bayesian method based on the Bayesian hierarchical model with a Gaussian-Wishart process prior and basis function representations. We first derive an induced model for the basis-function coefficients of the functional data, and then use this model to conduct posterior inference through Markov chain Monte Carlo methods. Compared to the standard Bayesian inference that suffers serious computational burden and instability in analyzing high-dimensional functional data, our method greatly improves the computational scalability and stability, while inheriting the advantage of simultaneously smoothing raw observations and estimating the mean-covariance functions in a nonparametric way. In addition, our method can naturally handle functional data observed on random or uncommon grids. Simulation and real studies demonstrate that our method produces similar results to those obtainable by the standard Bayesian inference with low-dimensional common grids, while efficiently smoothing and estimating functional data with random and high-dimensional observation grids when the standard Bayesian inference fails. In conclusion, our method can efficiently smooth and estimate high-dimensional functional data, providing one way to resolve the curse of dimensionality for Bayesian functional data analysis with Gaussian-Wishart processes. © 2017, The International Biometric Society.
Conceptual issues in Bayesian divergence time estimation
2016-01-01
Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized. This article is part of the themed issue ‘Dating species divergences using rocks and clocks’. PMID:27325831
Conceptual issues in Bayesian divergence time estimation.
Rannala, Bruce
2016-07-19
Bayesian inference of species divergence times is an unusual statistical problem, because the divergence time parameters are not identifiable unless both fossil calibrations and sequence data are available. Commonly used marginal priors on divergence times derived from fossil calibrations may conflict with node order on the phylogenetic tree causing a change in the prior on divergence times for a particular topology. Care should be taken to avoid confusing this effect with changes due to informative sequence data. This effect is illustrated with examples. A topology-consistent prior that preserves the marginal priors is defined and examples are constructed. Conflicts between fossil calibrations and relative branch lengths (based on sequence data) can cause estimates of divergence times that are grossly incorrect, yet have a narrow posterior distribution. An example of this effect is given; it is recommended that overly narrow posterior distributions of divergence times should be carefully scrutinized. This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Author(s).
A taxonomic monograph of Nearctic Scolytus Geoffroy (Coleoptera, Curculionidae, Scolytinae).
Smith, Sarah M; Cognato, Anthony I
2014-01-01
The Nearctic bark beetle genus Scolytus Geoffroy was revised based in part on a molecular and morphological phylogeny. Monophyly of the native species was tested using mitochondrial (COI) and nuclear (28S, CAD, ArgK) genes and 43 morphological characters in parsimony and Bayesian phylogenetic analyses. Parsimony analyses of molecular and combined datasets provided mixed results while Bayesian analysis recovered most nodes with posterior probabilities >90%. Native hardwood- and conifer-feeding Scolytus species were recovered as paraphyletic. Native Nearctic species were recovered as paraphyletic with hardwood-feeding species sister to Palearctic hardwood-feeding species rather than to native conifer-feeding species. The Nearctic conifer-feeding species were monophyletic. Twenty-five species were recognized. Four new synonyms were discovered: Scolytus praeceps LeConte, 1868 (= Scolytus abietis Blackman, 1934; = Scolytus opacus Blackman, 1934), Scolytus reflexus Blackman, 1934 (= Scolytus virgatus Bright, 1972; = Scolytus wickhami Blackman, 1934). Two species were reinstated: Scolytus fiskei Blackman, 1934 and Scolytus silvaticus Bright, 1972. A diagnosis, description, distribution, host records and images were provided for each species and a key is presented to all species.
Steingroever, Helen; Pachur, Thorsten; Šmíra, Martin; Lee, Michael D
2018-06-01
The Iowa Gambling Task (IGT) is one of the most popular experimental paradigms for comparing complex decision-making across groups. Most commonly, IGT behavior is analyzed using frequentist tests to compare performance across groups, and to compare inferred parameters of cognitive models developed for the IGT. Here, we present a Bayesian alternative based on Bayesian repeated-measures ANOVA for comparing performance, and a suite of three complementary model-based methods for assessing the cognitive processes underlying IGT performance. The three model-based methods involve Bayesian hierarchical parameter estimation, Bayes factor model comparison, and Bayesian latent-mixture modeling. We illustrate these Bayesian methods by applying them to test the extent to which differences in intuitive versus deliberate decision style are associated with differences in IGT performance. The results show that intuitive and deliberate decision-makers behave similarly on the IGT, and the modeling analyses consistently suggest that both groups of decision-makers rely on similar cognitive processes. Our results challenge the notion that individual differences in intuitive and deliberate decision styles have a broad impact on decision-making. They also highlight the advantages of Bayesian methods, especially their ability to quantify evidence in favor of the null hypothesis, and that they allow model-based analyses to incorporate hierarchical and latent-mixture structures.
Restricted Gene Flow among Hospital Subpopulations of Enterococcus faecium
Willems, Rob J. L.; Top, Janetta; van Schaik, Willem; Leavis, Helen; Bonten, Marc; Sirén, Jukka; Hanage, William P.; Corander, Jukka
2012-01-01
ABSTRACT Enterococcus faecium has recently emerged as an important multiresistant nosocomial pathogen. Defining population structure in this species is required to provide insight into the existence, distribution, and dynamics of specific multiresistant or pathogenic lineages in particular environments, like the hospital. Here, we probe the population structure of E. faecium using Bayesian-based population genetic modeling implemented in Bayesian Analysis of Population Structure (BAPS) software. The analysis involved 1,720 isolates belonging to 519 sequence types (STs) (491 for E. faecium and 28 for Enterococcus faecalis). E. faecium isolates grouped into 13 BAPS (sub)groups, but the large majority (80%) of nosocomial isolates clustered in two subgroups (2-1 and 3-3). Phylogenetic and eBURST analysis of BAPS groups 2 and 3 confirmed the existence of three separate hospital lineages (17, 18, and 78), highlighting different evolutionary trajectories for BAPS 2-1 (lineage 78) and 3-3 (lineage 17 and lineage 18) isolates. Phylogenomic analysis of 29 E. faecium isolates showed agreement between BAPS assignment of STs and their relative positions in the phylogenetic tree. Odds ratio calculation confirmed the significant association between hospital isolates with BAPS 3-3 and lineages 17, 18, and 78. Admixture analysis showed a scarce number of recombination events between the different BAPS groups. For the E. faecium hospital population, we propose an evolutionary model in which strains with a high propensity to colonize and infect hospitalized patients arise through horizontal gene transfer. Once adapted to the distinct hospital niche, this subpopulation becomes isolated, and recombination with other populations declines. PMID:22807567
Zhou, Xiaoming; Chan, Paul K S; Tam, John S; Tang, Julian W
2011-01-01
Hepatitis C virus (HCV) 6a accounts for 23.6% of all HCV infections of the general population and 58.5% of intravenous drug users in Hong Kong. However, the geographical origin of this highly predominant HCV subgenotype is largely unknown. This study explores a hypothesis for one possible transmission route of HCV 6a to Hong Kong. NS5A sequences derived from 26 HCV 6a samples were chosen from a five year period (1999-2004) from epidemiologically unrelated patients from Hong Kong. Partial-NS5A sequences (513-bp from nt 6728 to 7240) were adopted for Bayesian coalescent analysis to reconstruct the evolutionary history of HCV infections in Hong Kong using the BEAST v1.3 program. A rooted phylogenetic tree was drawn for these sequences by alignment with reference Vietnamese sequences. Demographic data were accessed from "The Statistic Yearbooks of Hong Kong". Bayesian coalescent analysis showed that the rapid increase in 6a infections, which had increased more than 90-fold in Hong Kong from 1986 to 1994, correlated with two peaks of Vietnamese immigration to Hong Kong from 1978 to 1997. The second peak, which occurred from 1987 through 1997, overlapped with the rapid increase of HCV 6a occurrence in Hong Kong. Phylogenetic analyses have further revealed that HCV 6a strains from Vietnam may be ancestral to Hong Kong counterparts. The high predominance of HCV 6a infections in Hong Kong was possibly associated with Vietnamese immigration during 1987-1997.
Haake, David A.; Suchard, Marc A.; Kelley, Melissa M.; Dundoo, Manjula; Alt, David P.; Zuerner, Richard L.
2004-01-01
Leptospires belong to a genus of parasitic bacterial spirochetes that have adapted to a broad range of mammalian hosts. Mechanisms of leptospiral molecular evolution were explored by sequence analysis of four genes shared by 38 strains belonging to the core group of pathogenic Leptospira species: L. interrogans, L. kirschneri, L. noguchii, L. borgpetersenii, L. santarosai, and L. weilii. The 16S rRNA and lipL32 genes were highly conserved, and the lipL41 and ompL1 genes were significantly more variable. Synonymous substitutions are distributed throughout the ompL1 gene, whereas nonsynonymous substitutions are clustered in four variable regions encoding surface loops. While phylogenetic trees for the 16S, lipL32, and lipL41 genes were relatively stable, 8 of 38 (20%) ompL1 sequences had mosaic compositions consistent with horizontal transfer of DNA between related bacterial species. A novel Bayesian multiple change point model was used to identify the most likely sites of recombination and to determine the phylogenetic relatedness of the segments of the mosaic ompL1 genes. Segments of the mosaic ompL1 genes encoding two of the surface-exposed loops were likely acquired by horizontal transfer from a peregrine allele of unknown ancestry. Identification of the most likely sites of recombination with the Bayesian multiple change point model, an approach which has not previously been applied to prokaryotic gene sequence analysis, serves as a model for future studies of recombination in molecular evolution of genes. PMID:15090524
Sequential Probability Ratio Test for Collision Avoidance Maneuver Decisions
NASA Technical Reports Server (NTRS)
Carpenter, J. Russell; Markley, F. Landis
2010-01-01
When facing a conjunction between space objects, decision makers must choose whether to maneuver for collision avoidance or not. We apply a well-known decision procedure, the sequential probability ratio test, to this problem. We propose two approaches to the problem solution, one based on a frequentist method, and the other on a Bayesian method. The frequentist method does not require any prior knowledge concerning the conjunction, while the Bayesian method assumes knowledge of prior probability densities. Our results show that both methods achieve desired missed detection rates, but the frequentist method's false alarm performance is inferior to the Bayesian method's.
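For readers unfamiliar with the sequential probability ratio test named in this abstract, the following is a minimal sketch of Wald's generic procedure, not the authors' conjunction-specific formulation. The function name `sprt`, the labels `accept_H0`/`accept_H1`, and the default error targets are assumptions chosen for illustration; the per-observation log-likelihood ratios are taken as given.

```python
import math

def sprt(log_likelihood_ratios, alpha=0.01, beta=0.01):
    """Generic Wald SPRT (illustrative sketch, hypothetical names).

    alpha: target false alarm probability (accepting H1 when H0 is true).
    beta:  target missed detection probability (accepting H0 when H1 is true).
    Returns a (decision, number_of_observations_used) tuple.
    """
    upper = math.log((1.0 - beta) / alpha)   # crossing this accepts H1
    lower = math.log(beta / (1.0 - alpha))   # crossing this accepts H0
    total = 0.0
    n = 0
    for llr in log_likelihood_ratios:
        n += 1
        total += llr                          # accumulate the evidence
        if total >= upper:
            return "accept_H1", n
        if total <= lower:
            return "accept_H0", n
    return "continue", n                      # no threshold crossed yet
```

Wald's thresholds depend only on the two target error rates, which is why the test can stop as soon as the accumulated evidence is decisive in either direction.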
Informative priors on fetal fraction increase power of the noninvasive prenatal screen.
Xu, Hanli; Wang, Shaowei; Ma, Lin-Lin; Huang, Shuai; Liang, Lin; Liu, Qian; Liu, Yang-Yang; Liu, Ke-Di; Tan, Ze-Min; Ban, Hao; Guan, Yongtao; Lu, Zuhong
2017-11-09
Purpose Noninvasive prenatal screening (NIPS) sequences a mixture of the maternal and fetal cell-free DNA. Fetal trisomy can be detected by examining chromosomal dosages estimated from sequencing reads. The traditional method uses the Z-test, which compares a subject against a set of euploid controls, where the information of fetal fraction is not fully utilized. Here we present a Bayesian method that leverages informative priors on the fetal fraction. Method Our Bayesian method combines the Z-test likelihood and informative priors of the fetal fraction, which are learned from the sex chromosomes, to compute Bayes factors. Bayesian framework can account for nongenetic risk factors through the prior odds, and our method can report individual positive/negative predictive values. Results Our Bayesian method has more power than the Z-test method. We analyzed 3,405 NIPS samples and spotted at least 9 (of 51) possible Z-test false positives. Conclusion Bayesian NIPS is more powerful than the Z-test method, is able to account for nongenetic risk factors through prior odds, and can report individual positive/negative predictive values. Genetics in Medicine advance online publication, 9 November 2017; doi:10.1038/gim.2017.186.
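The link between a Bayes factor, prior odds, and an individual predictive value mentioned in this abstract follows from the standard identity posterior odds = Bayes factor × prior odds. The sketch below illustrates only that identity; the function name and the example numbers are hypothetical and are not taken from the paper, which derives its Bayes factors from the Z-test likelihood and fetal-fraction priors.

```python
def posterior_probability(bayes_factor, prior_probability):
    """Convert a Bayes factor and a prior probability into a posterior probability.

    Illustrative helper (hypothetical name): uses
    posterior odds = Bayes factor * prior odds.
    """
    if not 0.0 < prior_probability < 1.0:
        raise ValueError("prior_probability must lie strictly between 0 and 1")
    prior_odds = prior_probability / (1.0 - prior_probability)
    posterior_odds = bayes_factor * prior_odds
    return posterior_odds / (1.0 + posterior_odds)

# Made-up example: a prior risk of 1% combined with a Bayes factor of 10
print(posterior_probability(10.0, 0.01))
```

Changing the prior probability is how a nongenetic risk factor would enter such a calculation, with the Bayes factor left untouched.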
2013-01-01
Background Radiation in some plant groups has occurred on islands and due to the characteristic rapid pace of phenotypic evolution, standard molecular markers often provide insufficient variation for phylogenetic reconstruction. To resolve relationships within a clade of 21 closely related New Caledonian Diospyros species and evaluate species boundaries we analysed genome-wide DNA variation via amplified fragment length polymorphisms (AFLP). Results A neighbour-joining (NJ) dendrogram based on Dice distances shows all species except D. minimifolia, D. parviflora and D. vieillardii to form unique clusters of genetically similar accessions. However, there was little variation between these species clusters, resulting in unresolved species relationships and a star-like general NJ topology. Correspondingly, analyses of molecular variance showed more variation within species than between them. A Bayesian analysis with BEAST produced a similar result. Another Bayesian method, this time a clustering method, Structure, demonstrated the presence of two groups, highly congruent with those observed in a principal coordinate analysis (PCO). Molecular divergence between the two groups is low and does not correspond to any hypothesised taxonomic, ecological or geographical patterns. Conclusions We hypothesise that such a pattern could have been produced by rapid and complex evolution involving a widespread progenitor for which an initial split into two groups was followed by subsequent fragmentation into many diverging populations, which was followed by range expansion of then divergent entities. Overall, this process resulted in an opportunistic pattern of phenotypic diversification. The time since divergence was probably insufficient for some species to become genetically well-differentiated, resulting in progenitor/derivative relationships being exhibited in a few cases. 
In other cases, our analyses may have revealed evidence for the existence of cryptic species, for which more study of morphology and ecology are now required. PMID:24330478
NASA Technical Reports Server (NTRS)
Kraft, Ralph P.; Burrows, David N.; Nousek, John A.
1991-01-01
Two different methods, classical and Bayesian, for determining confidence intervals involving Poisson-distributed data are compared. Particular consideration is given to cases where the number of counts observed is small and is comparable to the mean number of background counts. Reasons for preferring the Bayesian over the classical method are given. Tables of confidence limits calculated by the Bayesian method are provided for quick reference.
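As an illustration of the Bayesian side of this comparison, the sketch below computes a shortest credible interval for a Poisson source rate S when N counts are observed on top of a known mean background B. It assumes a flat prior on S ≥ 0, so the posterior is proportional to exp(-(S+B)) (S+B)^N, and it uses a simple numerical grid. The function name, grid size, and upper cut-off are assumptions made for illustration, and the output is not meant to reproduce the published tables.

```python
import math

def poisson_signal_interval(n_obs, background, cl=0.9, steps=20000):
    """Shortest credible interval for a Poisson signal rate (grid sketch).

    Assumes a flat prior on the signal S >= 0 and a known mean background.
    """
    s_max = n_obs + 10.0 * math.sqrt(n_obs + 1.0) + 10.0  # generous cut-off
    ds = s_max / steps
    grid = [(i + 0.5) * ds for i in range(steps)]          # cell midpoints
    # Unnormalised log posterior: N * log(S + B) - (S + B)
    log_post = [n_obs * math.log(s + background) - (s + background) for s in grid]
    peak = max(log_post)
    dens = [math.exp(lp - peak) for lp in log_post]
    total = sum(dens)
    # Add cells in order of decreasing density until the credible level is reached
    order = sorted(range(steps), key=lambda i: dens[i], reverse=True)
    mass = 0.0
    lo_idx = hi_idx = order[0]
    for i in order:
        mass += dens[i]
        lo_idx = min(lo_idx, i)
        hi_idx = max(hi_idx, i)
        if mass >= cl * total:
            break
    return grid[lo_idx] - 0.5 * ds, grid[hi_idx] + 0.5 * ds
```

Because the prior excludes negative rates, the interval stays physical even when the observed count is below the expected background, which is the low-count regime the abstract singles out.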
NASA Astrophysics Data System (ADS)
Gogu, C.; Haftka, R.; LeRiche, R.; Molimard, J.; Vautrin, A.; Sankar, B.
2008-11-01
The basic formulation of the least squares method, based on the L2 norm of the misfit, is still widely used today for identifying elastic material properties from experimental data. An alternative statistical approach is the Bayesian method. We seek here situations with significant difference between the material properties found by the two methods. For a simple three bar truss example we illustrate three such situations in which the Bayesian approach leads to more accurate results: different magnitude of the measurements, different uncertainty in the measurements and correlation among measurements. When all three effects add up, the Bayesian approach can have a large advantage. We then compared the two methods for identification of elastic constants from plate vibration natural frequencies.
Bayesian methods in reliability
NASA Astrophysics Data System (ADS)
Sander, P.; Badoux, R.
1991-11-01
The present proceedings from a course on Bayesian methods in reliability encompass Bayesian statistical methods and their computational implementation, models for analyzing censored data from nonrepairable systems, the traits of repairable systems and growth models, the use of expert judgment, and a review of the problem of forecasting software reliability. Specific issues addressed include the use of Bayesian methods to estimate the leak rate of a gas pipeline, approximate analyses under great prior uncertainty, reliability estimation techniques, and a nonhomogeneous Poisson process. Also addressed are the calibration sets and seed variables of expert judgment systems for risk assessment, experimental illustrations of the use of expert judgment for reliability testing, and analyses of the predictive quality of software-reliability growth models such as the Weibull order statistics.
An introduction to Bayesian statistics in health psychology.
Depaoli, Sarah; Rus, Holly M; Clifton, James P; van de Schoot, Rens; Tiemensma, Jitske
2017-09-01
The aim of the current article is to provide a brief introduction to Bayesian statistics within the field of health psychology. Bayesian methods are increasing in prevalence in applied fields, and they have been shown in simulation research to improve the estimation accuracy of structural equation models, latent growth curve (and mixture) models, and hierarchical linear models. Likewise, Bayesian methods can be used with small sample sizes since they do not rely on large sample theory. In this article, we discuss several important components of Bayesian statistics as they relate to health-based inquiries. We discuss the incorporation and impact of prior knowledge into the estimation process and the different components of the analysis that should be reported in an article. We present an example implementing Bayesian estimation in the context of blood pressure changes after participants experienced an acute stressor. We conclude with final thoughts on the implementation of Bayesian statistics in health psychology, including suggestions for reviewing Bayesian manuscripts and grant proposals. We have also included an extensive amount of online supplementary material to complement the content presented here, including Bayesian examples using many different software programmes and an extensive sensitivity analysis examining the impact of priors.
Mishra, Priyanka; Kumar, Amit; Rodrigues, Vereena; Shukla, Ashutosh K; Sundaresan, Velusamy
2016-01-01
The internal transcribed spacer (ITS) region is situated between 18S and 26S in a polycistronic rRNA precursor transcript. It had been proved to be the most commonly sequenced region across plant species to resolve phylogenetic relationships ranging from shallow to deep taxonomic levels. Despite several taxonomical revisions in Cassiinae, a stable phylogeny remains elusive at the molecular level, particularly concerning the delineation of species in the genera Cassia, Senna and Chamaecrista. This study addresses the comparative potential of ITS datasets (ITS1, ITS2 and concatenated) in resolving the underlying morphological disparity in the highly complex genera, to assess their discriminatory power as potential barcode candidates in Cassiinae. A combination of experimental data and an in-silico approach based on threshold genetic distances, sequence similarity based and hierarchical tree-based methods was performed to decipher the discriminating power of ITS datasets on 18 different species of Cassiinae complex. Lab-generated sequences were compared against those available in the GenBank using BLAST and were aligned through MUSCLE 3.8.31 and analysed in PAUP 4.0 and BEAST1.8 using parsimony ratchet, maximum likelihood and Bayesian inference (BI) methods of gene and species tree reconciliation with bootstrapping. DNA barcoding gap was realized based on the Kimura two-parameter distance model (K2P) in TaxonDNA and MEGA. Based on the K2P distance, significant divergences between the inter- and intra-specific genetic distances were observed, while the presence of a DNA barcoding gap was obvious. The ITS1 region efficiently identified 81.63% and 90% of species using TaxonDNA and BI methods, respectively. The PWG-distance method based on simple pairwise matching indicated the significance of ITS1 whereby highest number of variable (210) and informative sites (206) were obtained. 
The BI tree-based methods outperformed the similarity-based methods producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. The reticulated phylogenetic hypothesis using the ITS1 region mainly supported the relationship between the species of Cassiinae established by traditional morphological methods. The ITS1 region showed a higher discrimination power and desirable characteristics as compared to ITS2 and ITS1 + 2, thereby concluding to be the locus of choice. Considering the complexity of the group and the underlying biological ambiguities, the results presented here are encouraging for developing DNA barcoding as a useful tool for resolving taxonomical challenges in corroboration with morphological framework.
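The Kimura two-parameter (K2P) distance used in this abstract has a closed form: with P the proportion of sites differing by a transition and Q the proportion differing by a transversion, d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q). The sketch below applies that formula to two pre-aligned sequences. The function name is hypothetical, and skipping gap or ambiguous sites is an assumption that may differ from how TaxonDNA or MEGA treat them.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences (sketch)."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue                      # assumption: skip gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1              # A<->G or C<->T
        else:
            transversions += 1            # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    term1 = 1.0 - 2.0 * p - q
    term2 = 1.0 - 2.0 * q
    if term1 <= 0.0 or term2 <= 0.0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)
```

A barcoding gap analysis of the kind described would compare such distances within species against distances between species; that step is not shown here.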
Origin and Evolution of the Unique Hepatitis C Virus Circulating Recombinant Form 2k/1b
Thomas, Xiomara V.; Koekkoek, Sylvie M.; Schinkel, Janke; Molenkamp, Richard; van de Laar, Thijs J.; Takebe, Yutaka; Tanaka, Yasuhito; Mizokami, Masashi; Rambaut, Andrew
2012-01-01
Since its initial identification in St. Petersburg, Russia, the recombinant hepatitis C virus (HCV) 2k/1b has been isolated from several countries throughout Eurasia. The 2k/1b strain is the only recombinant HCV to have spread widely, raising questions about the epidemiological background in which it first appeared. In order to further understand the circumstances by which HCV recombinants might be formed and spread, we estimated the date of the recombination event that generated the 2k/1b strain using a Bayesian phylogenetic approach. Our study incorporates newly isolated 2k/1b strains from Amsterdam, The Netherlands, and has employed a hierarchical Bayesian framework to combine information from different genomic regions. We estimate that 2k/1b originated sometime between 1923 and 1956, substantially before the first detection of the strain in 1999. The timescale and the geographic spread of 2k/1b suggest that it originated in the former Soviet Union at about the time that the world's first centralized national blood transfusion and storage service was being established. We also reconstructed the epidemic history of 2k/1b using coalescent theory-based methods, matching patterns previously reported for other epidemic HCV subtypes. This study demonstrates the practicality of jointly estimating dates of recombination from flanking regions of the breakpoint and further illustrates that rare genetic-exchange events can be particularly informative about the underlying epidemiological processes. PMID:22114341
Kwon, Deukwoo; Hoffman, F Owen; Moroz, Brian E; Simon, Steven L
2016-02-10
Most conventional risk analysis methods rely on a single best estimate of exposure per person, which does not allow for adjustment for exposure-related uncertainty. Here, we propose a Bayesian model averaging method to properly quantify the relationship between radiation dose and disease outcomes by accounting for shared and unshared uncertainty in estimated dose. Our Bayesian risk analysis method utilizes multiple realizations of sets (vectors) of doses generated by a two-dimensional Monte Carlo simulation method that properly separates shared and unshared errors in dose estimation. The exposure model used in this work is taken from a study of the risk of thyroid nodules among a cohort of 2376 subjects who were exposed to fallout from nuclear testing in Kazakhstan. We assessed the performance of our method through an extensive series of simulations and comparisons against conventional regression risk analysis methods. When the estimated doses contain relatively small amounts of uncertainty, the Bayesian method using multiple a priori plausible draws of dose vectors gave similar results to the conventional regression-based methods of dose-response analysis. However, when large and complex mixtures of shared and unshared uncertainties are present, the Bayesian method using multiple dose vectors had significantly lower relative bias than conventional regression-based risk analysis methods and better coverage, that is, a markedly increased capability to include the true risk coefficient within the 95% credible interval of the Bayesian-based risk estimate. An evaluation of the dose-response using our method is presented for an epidemiological study of thyroid disease following radiation exposure. Copyright © 2015 John Wiley & Sons, Ltd.
Douglas, Hume B.
2017-01-01
Abstract The prior genus-level classification of Cardiophorinae had never been assessed phylogenetically, and not revised since 1906. A phylogeny for Cardiophorinae and Negastriinae is inferred by Bayesian analyses of 163 adult morphological characters to revise the generic classification. Parsimony analysis is also performed to assess the sensitivity of the Bayesian results to the choice of optimality criterion. Bayesian hypothesis testing rejected monophyly for: Negastriinae; Cardiophorinae (but monophyletic after addition of four taxa); Cardiophorini; cardiophorine genera Aphricus LeConte, 1853; Aptopus Eschscholtz, 1829; Cardiophorus Eschscholtz, 1829; Cardiotarsus Eschscholtz, 1836; Paracardiophorus Schwarz, 1895; Phorocardius Fleutiaux, 1931; Dicronychus sensu Platia, 1994; Dicronychus sensu Méquignon, 1931; Craspedostethus sensu Schwarz, 1906 (i.e., including Tropidiplus Fleutiaux, 1903); Paracardiophorus sensu Cobos, 1970, although well-supported alternative classifications were available for only some. Based on taxonomic interpretation of phylogenetic results: Nyctorini is syn. n. of Cardiophorini; Globothorax Fleutiaux, 1891 (Physodactylinae), Margogastrius Schwarz, 1903 (Physodactylinae), and Pachyelater Lesne, 1897 (Dendrometrinae) are transferred to Cardiophorinae. The following changes are proposed for cardiophorine genera: Aptopus Eschscholtz, 1829 is redefined to exclude Horistonotus-like species; Coptostethus Wollaston, 1854 is subgenus of Cardiophorus; Dicronychus Brullé, 1832 and Diocarphus Fleutiaux, 1947, Metacardiophorus Gurjeva, 1966, Platynychus Motschulsky, 1858, and Zygocardiophorus Iablokoff-Khnzorian and Mardjanian, 1981 are placed at genus rank; Paracardiophorus Schwarz, 1895 is redefined based on North American and Eurasian species only; Horistonotus Candèze, 1860 redefined to include species with multiple apices on each side of their tarsal claws; Patriciella Van Zwaluwenburg, 1953 is syn. n. 
of Aphricus LeConte, 1853; Teslasena Fleutiaux, 1892 (Physodactylinae) is syn. n. of Globothorax Fleutiaux, 1891. The following new genera are described: Austrocardiophorus (type species: Cardiophorus humeralis Fairmaire and Germain, 1860); Chileaphricus (type species: Aphricus chilensis Fleutiaux, 1940); Floridelater (type species: Coptostethus americanus Horn, 1871, transferred from Negastriinae to Cardiophorinae). Paradicronychus (nomen nudum), is syn. n. of Cardiophorus Eschscholtz, 1829. Generic reassignments to make Cardiodontulus, Cardiophorus, Cardiotarsus, Paracardiophorus consistent with phylogenetically revised genus concepts resulted in 84 new combinations. Lectotypes are designated for 29 type species to fix generic concepts: Anelastes femoralis Lucas, 1857; Aphricus chilensis Fleutiaux, 1940; Athous argentatus Abeille de Perrin, 1894; Cardiophorus adjutor Candèze, 1875; Cardiophorus florentini Fleutiaux, 1895; Cardiophorus inflatus Candèze, 1882; Cardiophorus luridipes Candèze, 1860; Cardiophorus mirabilis Candèze, 1860; Cardiophorus musculus Erichson, 1840; Cardiotarsus capensis Candèze, 1860; Cardiotarsus vitalisi Fleutiaux, 1918; Craspedostethus rufiventris Schwarz, 1898; Elater cinereus Herbst, 1784; Elater minutissimus Germar, 1817; Elater sputator Linnaeus, 1758; Elater thoracicus Fabricius, 1801; Eniconyx pullatus Horn, 1884; Esthesopus castaneus Eschscholtz, 1829; Gastrimargus schneideri Schwarz, 1902; Globothorax chevrolati Fleutiaux, 1891; Horistonotus flavidus Candèze, 1860; Horistonotus simplex LeConte, 1863; Lesnelater madagascariensis Fleutiaux, 1935; Oedostethus femoralis LeConte, 1853; Phorocardius solitarius Fleutiaux, 1931; Platynychus indicus Motschulsky, 1858; Platynychus mixtus Fleutiaux, 1931; Triplonychus acuminatus Candèze, 1860; Tropidiplus tellinii Fleutiaux, 1903. A key to genera and diagnoses are provided for all genera and subgenera. 
A bibliographic synonymy includes references for all taxonomic changes to genera and new species through 2015. PMID:28331397
Schirtzinger, Erin E.; Matsumoto, Tania; Eberhard, Jessica R.; Graves, Gary R.; Sanchez, Juan J.; Capelli, Sara; Müller, Heinrich; Scharpegge, Julia; Chambers, Geoffrey K.; Fleischer, Robert C.
2008-01-01
The question of when modern birds (Neornithes) first diversified has generated much debate among avian systematists. Fossil evidence generally supports a Tertiary diversification, whereas estimates based on molecular dating favor an earlier diversification in the Cretaceous period. In this study, we used an alternate approach, the inference of historical biogeographic patterns, to test the hypothesis that the initial radiation of the Order Psittaciformes (the parrots and cockatoos) originated on the Gondwana supercontinent during the Cretaceous. We utilized broad taxonomic sampling (representatives of 69 of the 82 extant genera and 8 outgroup taxa) and multilocus molecular character sampling (3,941 bp from mitochondrial DNA (mtDNA) genes cytochrome oxidase I and NADH dehydrogenase 2 and nuclear introns of rhodopsin intron 1, tropomyosin alpha-subunit intron 5, and transforming growth factor ß-2) to generate phylogenetic hypotheses for the Psittaciformes. Analyses of the combined character partitions using maximum parsimony, maximum likelihood, and Bayesian criteria produced well-resolved and topologically similar trees in which the New Zealand taxa Strigops and Nestor (Psittacidae) were sister to all other psittaciforms and the cockatoo clade (Cacatuidae) was sister to a clade containing all remaining parrots (Psittacidae). Within this large clade of Psittacidae, some traditionally recognized tribes and subfamilies were monophyletic (e.g., Arini, Psittacini, and Loriinae), whereas several others were polyphyletic (e.g., Cyclopsittacini, Platycercini, Psittaculini, and Psittacinae). Ancestral area reconstructions using our Bayesian phylogenetic hypothesis and current distributions of genera supported the hypothesis of an Australasian origin for the Psittaciformes. 
Separate analyses of the timing of parrot diversification constructed with both Bayesian relaxed-clock and penalized likelihood approaches showed better agreement between geologic and diversification events in the chronograms based on a Cretaceous dating of the basal split within parrots than the chronograms based on a Tertiary dating of this split, although these data are more equivocal. Taken together, our results support a Cretaceous origin of Psittaciformes in Gondwana after the separation of Africa and the India/Madagascar block with subsequent diversification through both vicariance and dispersal. These well-resolved molecular phylogenies will be of value for comparative studies of behavior, ecology, and life history in parrots. PMID:18653733
Applying Bayesian statistics to the study of psychological trauma: A suggestion for future research.
Yalch, Matthew M
2016-03-01
Several contemporary researchers have noted the virtues of Bayesian methods of data analysis. Although debates continue about whether conventional or Bayesian statistics is the "better" approach for researchers in general, there are reasons why Bayesian methods may be well suited to the study of psychological trauma in particular. This article describes how Bayesian statistics offers practical solutions to the problems of data non-normality, small sample size, and missing data common in research on psychological trauma. After a discussion of these problems and the effects they have on trauma research, this article explains the basic philosophical and statistical foundations of Bayesian statistics and how it provides solutions to these problems using an applied example. Results of the literature review and the accompanying example indicate the utility of Bayesian statistics in addressing problems common in trauma research. Bayesian statistics provides a set of methodological tools and a broader philosophical framework that is useful for trauma researchers. Methodological resources are also provided so that interested readers can learn more. (c) 2016 APA, all rights reserved.
Bayesian data analysis for newcomers.
Kruschke, John K; Liddell, Torrin M
2018-02-01
This article explains the foundational concepts of Bayesian data analysis using virtually no mathematical notation. Bayesian ideas already match your intuitions from everyday reasoning and from traditional data analysis. Simple examples of Bayesian data analysis are presented that illustrate how the information delivered by a Bayesian analysis can be directly interpreted. Bayesian approaches to null-value assessment are discussed. The article clarifies misconceptions about Bayesian methods that newcomers might have acquired elsewhere. We discuss prior distributions and explain how they are not a liability but an important asset. We discuss the relation of Bayesian data analysis to Bayesian models of mind, and we briefly discuss what methodological problems Bayesian data analysis is not meant to solve. After you have read this article, you should have a clear sense of how Bayesian data analysis works and the sort of information it delivers, and why that information is so intuitive and useful for drawing conclusions from data.
Bayes in biological anthropology.
Konigsberg, Lyle W; Frankenberg, Susan R
2013-12-01
In this article, we both contend and illustrate that biological anthropologists, particularly in the Americas, often think like Bayesians but act like frequentists when it comes to analyzing a wide variety of data. In other words, while our research goals and perspectives are rooted in probabilistic thinking and rest on prior knowledge, we often proceed to use statistical hypothesis tests and confidence interval methods unrelated (or tenuously related) to the research questions of interest. We advocate for applying Bayesian analyses to a number of different bioanthropological questions, especially since many of the programming and computational challenges to doing so have been overcome in the past two decades. To facilitate such applications, this article explains Bayesian principles and concepts, and provides concrete examples of Bayesian computer simulations and statistics that address questions relevant to biological anthropology, focusing particularly on bioarchaeology and forensic anthropology. It also simultaneously reviews the use of Bayesian methods and inference within the discipline to date. This article is intended to act as a primer to Bayesian methods and inference in biological anthropology, explaining the relationships of various methods to likelihoods or probabilities and to classical statistical models. Our contention is not that traditional frequentist statistics should be rejected outright, but that there are many situations where biological anthropology is better served by taking a Bayesian approach. To this end it is hoped that the examples provided in this article will assist researchers in choosing from among the broad array of statistical methods currently available. Copyright © 2013 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Mustac, M.; Kim, S.; Tkalcic, H.; Rhie, J.; Chen, Y.; Ford, S. R.; Sebastian, N.
2015-12-01
Conventional approaches to inverse problems suffer from non-linearity and non-uniqueness in estimations of seismic structures and source properties. Estimated results and associated uncertainties are often biased by applied regularizations and additional constraints, which are commonly introduced to solve such problems. Bayesian methods, however, provide statistically meaningful estimations of models and their uncertainties constrained by data information. In addition, hierarchical and trans-dimensional (trans-D) techniques are inherently implemented in the Bayesian framework to account for involved error statistics and model parameterizations, and, in turn, allow more rigorous estimations of the same. Here, we apply Bayesian methods throughout the entire inference process to estimate seismic structures and source properties in Northeast Asia including east China, the Korean peninsula, and the Japanese islands. Ambient noise analysis is first performed to obtain a base three-dimensional (3-D) heterogeneity model using continuous broadband waveforms from more than 300 stations. As for the tomography of surface wave group and phase velocities in the 5-70 s band, we adopt a hierarchical and trans-D Bayesian inversion method using Voronoi partition. The 3-D heterogeneity model is further improved by joint inversions of teleseismic receiver functions and dispersion data using a newly developed high-efficiency Bayesian technique. The obtained model is subsequently used to prepare 3-D structural Green's functions for the source characterization. A hierarchical Bayesian method for point source inversion using regional complete waveform data is applied to selected events from the region. The seismic structure and source characteristics with rigorously estimated uncertainties from the novel Bayesian methods provide enhanced monitoring and discrimination of seismic events in northeast Asia.
ERIC Educational Resources Information Center
Marcoulides, Katerina M.
2018-01-01
This study examined the use of Bayesian analysis methods for the estimation of item parameters in a two-parameter logistic item response theory model. Using simulated data under various design conditions with both informative and non-informative priors, the parameter recovery of Bayesian analysis methods was examined. Overall results showed that…
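For readers unfamiliar with the model named in this abstract, the standard two-parameter logistic (2PL) item response function can be sketched as follows. This is only an illustration of the textbook 2PL form, not the study's estimation procedure; the function names and the example parameter values are assumptions made for illustration.

```python
import math

def p_correct(theta, a, b):
    """2PL item response function: probability of a correct response
    for ability theta, item discrimination a, and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def log_likelihood(responses, thetas, a, b):
    """Log-likelihood of 0/1 responses to one item, given each person's ability."""
    total = 0.0
    for y, theta in zip(responses, thetas):
        p = p_correct(theta, a, b)
        total += math.log(p) if y == 1 else math.log(1.0 - p)
    return total

# Hypothetical item: discrimination 1.5, difficulty 0.0
print(p_correct(0.0, a=1.5, b=0.0))  # ability equal to difficulty gives 0.5
```

In a Bayesian treatment, priors on a and b would be combined with this likelihood; the abstract does not say which priors were used, so none are shown here.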
Tree Biomass Estimation of Chinese fir (Cunninghamia lanceolata) Based on Bayesian Method
Zhang, Jianguo
2013-01-01
Chinese fir (Cunninghamia lanceolata (Lamb.) Hook.) is the most important conifer species for timber production, with a huge distribution area in southern China. Accurate estimation of biomass is required for accounting and monitoring Chinese forest carbon stocking. In this study, an allometric equation was used to analyze tree biomass of Chinese fir. The common methods for estimating allometric models have taken the classical approach based on the frequency interpretation of probability. However, many different biotic and abiotic factors introduce variability into the Chinese fir biomass model, suggesting that the parameters of the biomass model are better represented by probability distributions than by fixed values as in the classical method. To deal with this problem, a Bayesian method was used to estimate the Chinese fir biomass model. In the Bayesian framework, two kinds of priors were introduced: non-informative priors and informative priors. For the informative priors, 32 biomass equations of Chinese fir were collected from the published literature. The parameter distributions from the published literature were regarded as prior distributions in the Bayesian model for estimating Chinese fir biomass. The Bayesian method with informative priors performed better than both the non-informative priors and the classical method, providing a reasonable method for estimating Chinese fir biomass. PMID:24278198
Tree biomass estimation of Chinese fir (Cunninghamia lanceolata) based on Bayesian method.
Zhang, Xiongqing; Duan, Aiguo; Zhang, Jianguo
2013-01-01
Chinese fir (Cunninghamia lanceolata (Lamb.) Hook.) is the most important conifer species for timber production, with a huge distribution area in southern China. Accurate estimation of biomass is required for accounting and monitoring Chinese forest carbon stocking. In this study, the allometric equation $W = a(D^{2}H)^{b}$ was used to analyze tree biomass of Chinese fir. The common methods for estimating allometric models have taken the classical approach based on the frequency interpretation of probability. However, many different biotic and abiotic factors introduce variability into the Chinese fir biomass model, suggesting that the parameters of the biomass model are better represented by probability distributions than by fixed values as in the classical method. To deal with this problem, a Bayesian method was used to estimate the Chinese fir biomass model. In the Bayesian framework, two kinds of priors were introduced: non-informative priors and informative priors. For the informative priors, 32 biomass equations of Chinese fir were collected from the published literature. The parameter distributions from the published literature were regarded as prior distributions in the Bayesian model for estimating Chinese fir biomass. The Bayesian method with informative priors performed better than both the non-informative priors and the classical method, providing a reasonable method for estimating Chinese fir biomass.
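To make the role of informative priors concrete, the allometric model in this abstract can be linearized as ln W = ln a + b ln(D^2 H) and fitted with a conjugate normal update. The sketch below is a simplified illustration, not the authors' procedure: it assumes a known noise variance, independent normal priors, and simulated trees, and every numeric value (true parameters, prior means and variances, sample size) is hypothetical.

```python
import math
import random

def normal_posterior(x, y, prior_mean, prior_var, noise_var):
    """Conjugate update for y = b0 + b1*x + e, with e ~ N(0, noise_var)
    and independent normal priors on b0 and b1.
    Returns (posterior mean, 2x2 posterior covariance)."""
    n = len(x)
    sx, sxx = sum(x), sum(v * v for v in x)
    sy, sxy = sum(y), sum(u * v for u, v in zip(x, y))
    # Posterior precision = prior precision + X'X / noise_var
    p00 = 1.0 / prior_var[0] + n / noise_var
    p01 = sx / noise_var
    p11 = 1.0 / prior_var[1] + sxx / noise_var
    # Right-hand side = prior precision * prior mean + X'y / noise_var
    r0 = prior_mean[0] / prior_var[0] + sy / noise_var
    r1 = prior_mean[1] / prior_var[1] + sxy / noise_var
    det = p00 * p11 - p01 * p01
    cov = ((p11 / det, -p01 / det), (-p01 / det, p00 / det))
    mean = (cov[0][0] * r0 + cov[0][1] * r1,
            cov[1][0] * r0 + cov[1][1] * r1)
    return mean, cov

# Simulated trees (all values hypothetical): diameter D in cm, height H in m.
rng = random.Random(1)
true_a, true_b, noise_sd = 0.05, 0.9, 0.1
x, y = [], []
for _ in range(12):
    d, h = rng.uniform(5, 30), rng.uniform(4, 25)
    x.append(math.log(d * d * h))
    y.append(math.log(true_a) + true_b * x[-1] + rng.gauss(0, noise_sd))

vague = normal_posterior(x, y, (0.0, 0.0), (1e6, 1e6), noise_sd ** 2)
informative = normal_posterior(x, y, (math.log(0.05), 0.9), (0.25, 0.01), noise_sd ** 2)
print("vague prior:       b =", round(vague[0][1], 3))
print("informative prior: b =", round(informative[0][1], 3))
```

Because an informative prior adds precision, the posterior variance of b under the informative prior is never larger than under the vague one, which is one way to read the abstract's claim that informative priors help.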
Application of a data-mining method based on Bayesian networks to lesion-deficit analysis
NASA Technical Reports Server (NTRS)
Herskovits, Edward H.; Gerring, Joan P.
2003-01-01
Although lesion-deficit analysis (LDA) has provided extensive information about structure-function associations in the human brain, LDA has suffered from the difficulties inherent to the analysis of spatial data, i.e., there are many more variables than subjects, and data may be difficult to model using standard distributions, such as the normal distribution. We herein describe a Bayesian method for LDA; this method is based on data-mining techniques that employ Bayesian networks to represent structure-function associations. These methods are computationally tractable, and can represent complex, nonlinear structure-function associations. When applied to the evaluation of data obtained from a study of the psychiatric sequelae of traumatic brain injury in children, this method generates a Bayesian network that demonstrates complex, nonlinear associations among lesions in the left caudate, right globus pallidus, right side of the corpus callosum, right caudate, and left thalamus, and subsequent development of attention-deficit hyperactivity disorder, confirming and extending our previous statistical analysis of these data. Furthermore, analysis of simulated data indicates that methods based on Bayesian networks may be more sensitive and specific for detecting associations among categorical variables than methods based on chi-square and Fisher exact statistics.
Sironi, Emanuele; Taroni, Franco; Baldinotti, Claudio; Nardi, Cosimo; Norelli, Gian-Aristide; Gallidabino, Matteo; Pinchi, Vilma
2017-11-14
The present study aimed to investigate the performance of a Bayesian method in the evaluation of dental age-related evidence collected by means of a geometrical approximation procedure of the pulp chamber volume. Measurement of this volume was based on three-dimensional cone beam computed tomography images. The Bayesian method was applied by means of a probabilistic graphical model, namely a Bayesian network. Performance of that method was investigated in terms of accuracy and bias of the decisional outcomes. Influence of an informed elicitation of the prior belief of chronological age was also studied by means of a sensitivity analysis. Outcomes in terms of accuracy were adequate with standard requirements for forensic adult age estimation. Findings also indicated that the Bayesian method does not show a particular tendency towards under- or overestimation of the age variable. Outcomes of the sensitivity analysis showed that results on estimation are improved with a rational elicitation of the prior probabilities of age.
Nantón, Ana; Ruiz-Ruano, Francisco J.; Camacho, Juan Pedro M.; Méndez, Josefina
2017-01-01
Background Four species of the genus Donax (D. semistriatus, D. trunculus, D. variegatus and D. vittatus) are common on Iberian Peninsula coasts. Nevertheless, despite their economic importance and overexploitation, scarce genetic resources are available. In this work, we newly determined the complete mitochondrial genomes of these four representatives of the family Donacidae, with the aim of helping to unveil phylogenetic relationships within the Veneroida order, and of developing genetic markers useful in wedge clam identification and authentication, and aquaculture stock management. Principal findings The complete female mitochondrial genomes of the four species vary in size from 17,044 to 17,365 bp, and encode 13 protein-coding genes (including the atp8 gene), 2 rRNAs and 22 tRNAs, all located on the same strand. A long non-coding region was identified in each of the four Donax species between cob and cox2 genes, presumably corresponding to the Control Region. The Bayesian and Maximum Likelihood phylogenetic analyses of the Veneroida order indicate that all four species of Donax form a single clade as a sister group of other bivalves within the Tellinoidea superfamily. However, although Tellinoidea is actually monophyletic, none of its families are monophyletic. Conclusions Sequencing of complete mitochondrial genomes provides highly valuable information to establish the phylogenetic relationships within the Veneroida order. Furthermore, we provide here significant genetic resources for further research and conservation of this commercially important fishing resource. PMID:28886105
Rosas-Valdez, Rogelio; Morrone, Juan J; García-Varela, Martín
2012-08-01
Species of Floridosentis (Acanthocephala) are common parasites of mullets (Mugil spp., Mugilidae) found in tropical marine and brackish water in the Americas. Floridosentis includes 2 species distributed in Mexico, i.e., Floridosentis pacifica, restricted to the Pacific Ocean near Salina Cruz, Oaxaca, and Floridosentis mugilis, distributed along the coast of the Pacific Ocean and the Gulf of Mexico. We sampled 18 populations of F. mugilis and F. pacifica (12 from the Pacific and 6 from the Gulf of Mexico) and sequenced a fragment of the rDNA large subunit to evaluate phylogenetic relationships of populations of Floridosentis spp. from Mexico. Species identification of museum specimens of F. mugilis from the Pacific Ocean was confirmed by examination of morphological traits. Phylogenetic trees inferred with maximum parsimony, maximum likelihood, and Bayesian inference indicate that Floridosentis is monophyletic, comprising 2 major well-supported clades, the first clade corresponding to F. mugilis from the Gulf of Mexico, and the second to F. pacifica from the Pacific Ocean. Genetic divergence between species ranged from 7.68 to 8.60%. Intraspecific divergence ranged from 0.14 to 0.86% for F. mugilis and from 1.72 to 4.49% for F. pacifica. Data obtained from diagnostic characters indicate that specimens from the Pacific Ocean in Mexico have differences in some traits among locations. These results are consistent with the phylogenetic hypothesis, indicating that F. pacifica is distributed in the Pacific Ocean in Mexico with 3 major lineages.
Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W
2018-05-25
Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.
[Bayesian approach for the cost-effectiveness evaluation of healthcare technologies].
Berchialla, Paola; Gregori, Dario; Brunello, Franco; Veltri, Andrea; Petrinco, Michele; Pagano, Eva
2009-01-01
The development of Bayesian statistical methods for the assessment of the cost-effectiveness of health care technologies is reviewed. Although many studies adopt a frequentist approach, several authors have advocated the use of Bayesian methods in health economics. Emphasis has been placed on the advantages of the Bayesian approach, which include: (i) the ability to make more intuitive and meaningful inferences; (ii) the ability to tackle complex problems, such as allowing for the inclusion of patients who generate no cost, thanks to the availability of powerful computational algorithms; (iii) the importance of a full use of quantitative and structural prior information to produce realistic inferences. Much literature comparing the cost-effectiveness of two treatments is based on the incremental cost-effectiveness ratio. However, new methods are arising with the purpose of decision making. These methods are based on a net benefits approach. In the present context, the cost-effectiveness acceptability curves have been pointed out to be intrinsically Bayesian in their formulation. They plot the probability of a positive net benefit against the threshold cost of a unit increase in efficacy. A case study is presented in order to illustrate the use of Bayesian statistics in cost-effectiveness analysis. Emphasis is placed on the cost-effectiveness acceptability curves. Advantages and disadvantages of the method described in this paper have been compared to frequentist methods and discussed.
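The cost-effectiveness acceptability curve described in this abstract can be sketched directly from posterior draws: for each threshold, count the share of draws with a positive net benefit. The sketch below assumes the usual incremental net benefit form, threshold times incremental effect minus incremental cost; the function name and the four posterior draws are hypothetical and chosen only so the arithmetic is easy to follow.

```python
def ceac(draws, thresholds):
    """For each willingness-to-pay threshold lam, return the fraction of
    posterior draws (delta_effect, delta_cost) with positive net benefit
    lam * delta_effect - delta_cost."""
    curve = []
    for lam in thresholds:
        positive = sum(1 for d_eff, d_cost in draws if lam * d_eff - d_cost > 0)
        curve.append(positive / len(draws))
    return curve

# Four hypothetical posterior draws of (incremental effect, incremental cost).
draws = [(1.0, 100.0), (0.5, 100.0), (-0.2, 50.0), (2.0, 100.0)]
for lam, prob in zip([0, 150, 1000], ceac(draws, [0, 150, 1000])):
    print(lam, prob)  # prints 0 0.0, then 150 0.5, then 1000 0.75
```

In practice the draws would come from an MCMC fit of the cost and effectiveness model, and the curve would be evaluated on a fine grid of thresholds.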
NASA Astrophysics Data System (ADS)
Wang, Hongrui; Wang, Cheng; Wang, Ying; Gao, Xiong; Yu, Chen
2017-06-01
This paper presents a Bayesian approach using the Metropolis-Hastings Markov Chain Monte Carlo algorithm and applies this method to daily river flow rate forecasting and uncertainty quantification for the Zhujiachuan River, using data collected from Qiaotoubao Gage Station and 13 other gage stations in the Zhujiachuan watershed in China. The proposed method is also compared with conventional maximum likelihood estimation (MLE) for parameter estimation and quantification of associated uncertainties. While the Bayesian method performs similarly in estimating the mean value of the daily flow rate, it outperforms the conventional MLE method on uncertainty quantification, providing a relatively narrower reliable interval than the MLE confidence interval and thus more precise estimation by using the related information from regional gage stations. The Bayesian MCMC method might be more favorable in uncertainty analysis and risk management.
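As a rough illustration of the sampler named in this abstract, the following is a minimal random-walk Metropolis-Hastings sketch. It is a toy, not the paper's river-flow model: it assumes normal data with known standard deviation and a flat prior on the mean, and the data, step size, and iteration counts are all simulated or hypothetical.

```python
import math
import random

def log_posterior(mu, data, sigma):
    """Log posterior of mu (up to a constant) for normal data with known
    sigma and a flat prior on mu."""
    return -sum((v - mu) ** 2 for v in data) / (2.0 * sigma ** 2)

def metropolis_hastings(data, sigma, n_iter, step, rng):
    """Random-walk Metropolis-Hastings sampler for mu."""
    mu = sum(data) / len(data)  # start at the sample mean
    current = log_posterior(mu, data, sigma)
    samples = []
    for _ in range(n_iter):
        proposal = mu + rng.gauss(0.0, step)
        candidate = log_posterior(proposal, data, sigma)
        # Symmetric proposal: accept with probability min(1, posterior ratio)
        if math.log(1.0 - rng.random()) < candidate - current:
            mu, current = proposal, candidate
        samples.append(mu)
    return samples

rng = random.Random(0)
data = [rng.gauss(3.0, 1.0) for _ in range(50)]  # simulated, not river data
samples = metropolis_hastings(data, sigma=1.0, n_iter=5000, step=0.3, rng=rng)
kept = sorted(samples[500:])  # drop burn-in, then sort for quantiles
post_mean = sum(kept) / len(kept)
interval = (kept[int(0.025 * len(kept))], kept[int(0.975 * len(kept))])
print(round(post_mean, 2), interval)
```

The interval taken from the sorted draws is the kind of posterior interval that the abstract compares with the MLE confidence interval.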
Ma, Peng-Fei; Vorontsova, Maria S; Nanjarisoa, Olinirina Prisca; Razanatsoa, Jacqueline; Guo, Zhen-Hua; Haevermans, Thomas; Li, De-Zhu
2017-12-21
Heterogeneous rates of molecular evolution are universal across the tree of life, posing challenges for phylogenetic inference. The temperate woody bamboos (tribe Arundinarieae, Poaceae) are noted for their extremely slow molecular evolutionary rates, supposedly caused by their mysterious monocarpic reproduction. However, the correlation between substitution rates and flowering cycles has not been formally tested. Here we present 15 newly sequenced plastid genomes of temperate woody bamboos, including the first genomes ever sequenced from Madagascar representatives. A data matrix of 46 plastid genomes representing all 12 lineages of Arundinarieae was assembled for phylogenetic and molecular evolutionary analyses. We conducted phylogenetic analyses using different sequences (e.g., coding and noncoding) combined with different data partitioning schemes, revealing conflicting relationships involving internodes among several lineages. A great difference in branch lengths was observed among the major lineages, and topological inconsistency could be attributed to long-branch attraction (LBA). Using clock model-fitting by maximum likelihood and Bayesian approaches, we furthermore demonstrated extensive rate variation among these major lineages. Rate accelerations mainly occurred for the isolated lineages with limited species diversification, totaling 11 rate shifts during the tribe's evolution. Using linear regression analysis, we found a negative correlation between rates of molecular evolution and flowering cycles for Arundinarieae, notwithstanding that the correlation may be insignificant when taking the phylogenetic structure into account. Using the temperate woody bamboos as an example, we found further evidence that rate heterogeneity is universal in plants, suggesting that this will pose a challenge for phylogenetic reconstruction of bamboos.
The bamboos with longer flowering cycles tend to evolve more slowly than those with shorter flowering cycles, in accordance with a putative generation time effect.
Carvalho-Sobrinho, Jefferson G; Alverson, William S; Alcantara, Suzana; Queiroz, Luciano P; Mota, Aline C; Baum, David A
2016-08-01
Bombacoideae (Malvaceae) is a clade of deciduous trees with a marked dominance in many forests, especially in the Neotropics. The historical lack of a well-resolved phylogenetic framework for Bombacoideae hinders studies in this ecologically important group. We reexamined phylogenetic relationships in this clade based on a matrix of 6465 nuclear (ETS, ITS) and plastid (matK, trnL-trnF, trnS-trnG) DNA characters. We used maximum parsimony, maximum likelihood, and Bayesian inference to infer relationships among 108 species (∼70% of the total number of known species). We analyzed the evolution of selected morphological traits: trunk or branch prickles, calyx shape, endocarp type, seed shape, and seed number per fruit, using ML reconstructions of their ancestral states to identify possible synapomorphies for major clades. Novel phylogenetic relationships emerged from our analyses, including three major lineages marked by fruit or seed traits: the winged-seed clade (Bernoullia, Gyranthera, and Huberodendron), the spongy endocarp clade (Adansonia, Aguiaria, Catostemma, Cavanillesia, and Scleronema), and the Kapok clade (Bombax, Ceiba, Eriotheca, Neobuchia, Pachira, Pseudobombax, Rhodognaphalon, and Spirotheca). The Kapok clade, the most diverse lineage of the subfamily, includes sister relationships (i) between Pseudobombax and "Pochota fendleri" a historically incertae sedis taxon, and (ii) between the Paleotropical genera Bombax and Rhodognaphalon, implying just two bombacoid dispersals to the Old World, the other one involving Adansonia. This new phylogenetic framework offers new insights and a promising avenue for further evolutionary studies. In view of this information, we present a new tribal classification of the subfamily, accompanied by an identification key. Copyright © 2016 Elsevier Inc. All rights reserved.
The Psychology of Bayesian Reasoning
2014-10-21
The psychology of Bayesian reasoning David R. Mandel* Socio-Cognitive Systems Section, Defence Research and Development Canada and Department...belief revision, subjective probability, human judgment, psychological methods. Most psychological research on Bayesian reasoning since the 1970s has...attention to some important problems with the conventional approach to studying Bayesian reasoning in psychology that has been dominant since the
Bayesian survival analysis in clinical trials: What methods are used in practice?
Brard, Caroline; Le Teuff, Gwénaël; Le Deley, Marie-Cécile; Hampson, Lisa V
2017-02-01
Background Bayesian statistics are an appealing alternative to the traditional frequentist approach to designing, analysing, and reporting of clinical trials, especially in rare diseases. Time-to-event endpoints are widely used in many medical fields. There are additional complexities to designing Bayesian survival trials which arise from the need to specify a model for the survival distribution. The objective of this article was to critically review the use and reporting of Bayesian methods in survival trials. Methods A systematic review of clinical trials using Bayesian survival analyses was performed through PubMed and Web of Science databases. This was complemented by a full text search of the online repositories of pre-selected journals. Cost-effectiveness, dose-finding studies, meta-analyses, and methodological papers using clinical trials were excluded. Results In total, 28 articles met the inclusion criteria, 25 were original reports of clinical trials and 3 were re-analyses of a clinical trial. Most trials were in oncology (n = 25), were randomised controlled (n = 21) phase III trials (n = 13), and half considered a rare disease (n = 13). Bayesian approaches were used for monitoring in 14 trials and for the final analysis only in 14 trials. In the latter case, Bayesian survival analyses were used for the primary analysis in four cases, for the secondary analysis in seven cases, and for the trial re-analysis in three cases. Overall, 12 articles reported fitting Bayesian regression models (semi-parametric, n = 3; parametric, n = 9). Prior distributions were often incompletely reported: 20 articles did not define the prior distribution used for the parameter of interest. Over half of the trials used only non-informative priors for monitoring and the final analysis (n = 12) when it was specified. Indeed, no articles fitting Bayesian regression models placed informative priors on the parameter of interest. 
The prior for the treatment effect was based on historical data in only four trials. Decision rules were pre-defined in eight cases when trials used Bayesian monitoring, and in only one case when trials adopted a Bayesian approach to the final analysis. Conclusion Few trials implemented a Bayesian survival analysis and few incorporated external data into priors. There is scope to improve the quality of reporting of Bayesian methods in survival trials. Extension of the Consolidated Standards of Reporting Trials statement for reporting Bayesian clinical trials is recommended.
Kim, Joo-Hwan; Kim, Dong-Kap; Forest, Felix; Fay, Michael F.; Chase, Mark W.
2010-01-01
Background Previous phylogenetic studies of Asparagales, although extensive and generally well supported, have left several sets of taxa unclearly placed and have not addressed all relationships within certain clades thoroughly (some clades were relatively sparsely sampled). One of the most important of these is sampling within and placement of Nolinoideae (Ruscaceae s.l.) of Asparagaceae sensu Angiosperm Phylogeny Group (APG) III, which subfamily includes taxa previously referred to Convallariaceae, Dracaenaceae, Eriospermaceae, Nolinaceae and Ruscaceae. Methods A phylogenetic analysis of a combined data set for 126 taxa of Ruscaceae s.l. and related groups in Asparagales based on three nuclear and plastid DNA coding genes, 18S rDNA (1796 bp), rbcL (1338 bp) and matK (1668 bp), representing a total of approx. 4·8 kb is presented. Parsimony and Bayesian inference analyses were conducted to elucidate relationships of Ruscaceae s.l. and related groups, and parsimony bootstrap analysis was performed to assess support of clades. Key Results The combination of the three genes results in the most highly resolved and strongly supported topology yet obtained for Asparagales including Ruscaceae s.l. Asparagales relationships are nearly congruent with previous combined gene analyses, which were reflected in the APG III classification. Parsimony and Bayesian analyses yield identical relationships except for some slight variation among the core asparagoid families, which nevertheless form a strongly supported group in both types of analyses. In core asparagoids, five major clades are identified: (1) Alliaceae s.l. (sensu APG III, Amaryllidaceae–Agapanthaceae–Alliaceae); (2) Asparagaceae–Laxmanniaceae–Ruscaceae s.l.; (3) Themidaceae; (4) Hyacinthaceae; (5) Anemarrhenaceae–Behniaceae–Herreriaceae–Agavaceae (clades 2–5 collectively Asparagaceae s.l. sensu APG III).
The position of Aphyllanthes is labile, but it is sister to Themidaceae in the combined maximum-parsimony tree and sister to Anemarrhenaceae in the Bayesian analysis. The highly supported clade of Xanthorrhoeaceae s.l. (sensu APG III, including Asphodelaceae and Hemerocallidaceae) is sister to the core asparagoids. Ruscaceae s.l. are a well-supported group. Asparagaceae s.s. are sister to Ruscaceae s.l., even though the clade of the two families is weakly supported; Laxmanniaceae are strongly supported as sister to Ruscaceae s.l. and Asparagaceae. Ruscaceae s.l. include six principal clades that often reflect previously named groups: (1) tribe Polygonateae (excluding Disporopsis); (2) tribe Ophiopogoneae; (3) tribe Convallarieae (excluding Theropogon); (4) Ruscaceae s.s. + Dracaenaceae + Theropogon + Disporopsis + Comospermum; (5) Nolinaceae, (6) Eriospermum. Conclusions The analyses here were largely conducted with new data collected for the same loci as in previous studies, but in this case from different species/DNA accessions and greater sampling in many cases than in previously published analyses; nonetheless, the results largely mirror those of previously conducted studies. This demonstrates the robustness of these results and answers questions often raised about reproducibility of DNA results, given the often sparse sampling of taxa in some studies, particularly the earliest ones. The results also provide a clear set of patterns on which to base a new classification of the subfamilies of Asparagaceae s.l., particularly Ruscaceae s.l. (= Nolinoideae of Asparagaceae s.l.), and examine other putatively important characters of Asparagales. PMID:20929900
Teves, Simone Caldas; Gardim, Sueli; Carbajal de la Fuente, Ana Laura; Lopes, Catarina Macedo; Gonçalves, Teresa Cristina Monte; Mallet, Jacenir Reis dos Santos; da Rosa, João Aristeu; Almeida, Carlos Eduardo
2016-01-01
Triatoma jatai was described using a set of morphological structures from specimens collected in Paranã municipality of Tocantins State, Brazil. Under a Bayesian framework and using two mitochondrial genes (16S and COI), phylogenetic analysis recovered T. jatai as a sister species to Triatoma costalimai with higher genetic distances than between other well-recognized species. Our results agree with previous suggestions based on morphometric analysis. In the light of the non-monophyly of Matogrossensis subcomplex, the inclusion of T. jatai shall be considered for reevaluating this group. PMID:26787157
Boopathi, Thangavelu; Faria, Daphne Georgina; Cheon, Ju-Yong; Youn, Seok Hyun; Ki, Jang-Seu
2015-01-01
The small and large nuclear subunit molecular phylogeny of the genus Prorocentrum demonstrated that the species are dichotomized into two clades. These two clades were significantly different (one-factor ANOVA, p < 0.01) with patterns compatible for both small and large subunit Bayesian phylogenetic trees, and for a larger taxon sampled dinoflagellate phylogeny. Evaluation of the molecular divergence levels showed that intraspecies genetic variations were significantly lower (t-test, p < 0.05) than those for interspecies variations (> 2.9% and > 26.8% dissimilarity in the small and large subunit [D1/D2], respectively). Based on the calculated molecular divergence, the genus comprises two genetically distinct groups that should be considered as two separate genera, thereby setting the pace for major systematic changes for the genus Prorocentrum sensu Dodge. Moreover, the information presented in this study would be useful for improving species identification and detection of novel clades from environmental samples. © 2015 The Author(s) Journal of Eukaryotic Microbiology © 2015 International Society of Protistologists.
Liu, Guo-Hua; Li, Sheng; Zou, Feng-Cai; Wang, Chun-Ren; Zhu, Xing-Quan
2016-01-01
Passalurus ambiguus (Nematoda: Oxyuridae) is a common pinworm that parasitizes the caecum and colon of rabbits. Despite its significance as a pathogen, the epidemiology, genetics, systematics, and biology of this pinworm remain poorly understood. In the present study, we sequenced the complete mitochondrial (mt) genome of P. ambiguus. The circular mt genome is 14,023 bp in size and encodes 36 genes, including 12 protein-coding, two ribosomal RNA, and 22 transfer RNA genes. The mt gene order of P. ambiguus is the same as that of Wellcomia siamensis, but distinct from that of Enterobius vermicularis. Phylogenetic analyses based on concatenated amino acid sequences of 12 protein-coding genes by Bayesian inference (BI) showed that P. ambiguus was more closely related to W. siamensis than to E. vermicularis. This mt genome provides novel genetic markers for studying the molecular epidemiology, population genetics, and systematics of pinworms of animals and humans, and should have implications for the diagnosis, prevention, and control of passaluriasis in rabbits and other animals.
SpreaD3: Interactive Visualization of Spatiotemporal History and Trait Evolutionary Processes.
Bielejec, Filip; Baele, Guy; Vrancken, Bram; Suchard, Marc A; Rambaut, Andrew; Lemey, Philippe
2016-08-01
Model-based phylogenetic reconstructions increasingly consider spatial or phenotypic traits in conjunction with sequence data to study evolutionary processes. Alongside parameter estimation, visualization of ancestral reconstructions represents an integral part of these analyses. Here, we present a complete overhaul of the spatial phylogenetic reconstruction of evolutionary dynamics software, now called SpreaD3 to emphasize the use of data-driven documents, as an analysis and visualization package that primarily complements Bayesian inference in BEAST (http://beast.bio.ed.ac.uk, last accessed 9 May 2016). The integration of JavaScript D3 libraries (www.d3.org, last accessed 9 May 2016) offers novel interactive web-based visualization capacities that are not restricted to spatial traits and extend to any discrete or continuously valued trait for any organism of interest. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
NASA Astrophysics Data System (ADS)
Zuki, Ameyra Aman; Mohammed, Muhamad Azmi; Md-Zain, Badrul Munir; Yaakop, Salmah
2018-04-01
The phylogenetic relationships of Microgastrinae remain unclear, although some studies have been conducted to resolve them. Because Microgastrinae are endoparasitoids of lepidopteran larvae, members of this subfamily are promising candidates for use as biological control agents against crop pests. In this study, a total of 13 microgastrine samples representing 13 genera were collected from nine localities throughout Peninsular Malaysia. Two molecular regions, the 28S nuclear marker and the 16S mitochondrial marker, were used to examine the effectiveness of those regions in resolving the relationships within Microgastrinae. A total of 36 sequences were used in the NJ, MP and Bayesian analyses for both markers. Results obtained from this study were supported by morphological and biological characters. The outcome of this study provides evidence for the effectiveness of the 28S and 16S molecular markers in studying the phylogenetic relationships of Microgastrinae from Malaysia specifically and the Oriental region generally.
Baker, Kate S.; Dallman, Timothy J.; Behar, Adi; Weill, François-Xavier; Gouali, Malika; Sobel, Jeremy; Fookes, Maria; Valinsky, Lea; Gal-Mor, Ohad; Connor, Thomas R.; Nissan, Israel; Bertrand, Sophie; Parkhill, Julian; Jenkins, Claire; Cohen, Dani
2016-01-01
Shigellae are sensitive indicator species for studying trends in the international transmission of antimicrobial-resistant Enterobacteriaceae. Orthodox Jewish communities (OJCs) are a known risk group for shigellosis; Shigella sonnei is cyclically epidemic in OJCs in Israel, and sporadic outbreaks occur in OJCs elsewhere. We generated whole-genome sequences for 437 isolates of S. sonnei from OJCs and non-OJCs collected over 22 years in Europe (the United Kingdom, France, and Belgium), the United States, Canada, and Israel and analyzed these within a known global genomic context. Through phylogenetic and genomic analysis, we showed that strains from outbreaks in OJCs outside of Israel are distinct from strains in the general population and relate to a single multidrug-resistant sublineage of S. sonnei that prevails in Israel. Further Bayesian phylogenetic analysis showed that this strain emerged approximately 30 years ago, demonstrating the speed at which antimicrobial drug–resistant pathogens can spread widely through geographically dispersed, but internationally connected, communities. PMID:27532625
Baker, Kate S; Dallman, Timothy J; Behar, Adi; Weill, François-Xavier; Gouali, Malika; Sobel, Jeremy; Fookes, Maria; Valinsky, Lea; Gal-Mor, Ohad; Connor, Thomas R; Nissan, Israel; Bertrand, Sophie; Parkhill, Julian; Jenkins, Claire; Cohen, Dani; Thomson, Nicholas R
2016-09-01
Shigellae are sensitive indicator species for studying trends in the international transmission of antimicrobial-resistant Enterobacteriaceae. Orthodox Jewish communities (OJCs) are a known risk group for shigellosis; Shigella sonnei is cyclically epidemic in OJCs in Israel, and sporadic outbreaks occur in OJCs elsewhere. We generated whole-genome sequences for 437 isolates of S. sonnei from OJCs and non-OJCs collected over 22 years in Europe (the United Kingdom, France, and Belgium), the United States, Canada, and Israel and analyzed these within a known global genomic context. Through phylogenetic and genomic analysis, we showed that strains from outbreaks in OJCs outside of Israel are distinct from strains in the general population and relate to a single multidrug-resistant sublineage of S. sonnei that prevails in Israel. Further Bayesian phylogenetic analysis showed that this strain emerged approximately 30 years ago, demonstrating the speed at which antimicrobial drug-resistant pathogens can spread widely through geographically dispersed, but internationally connected, communities.
Canedo, Clarissa; Haddad, Célio F B
2012-11-01
We present a phylogenetic hypothesis of the anuran clade Terrarana based on partial sequences of nuclear (Tyr and RAG1) and mitochondrial (12S, tRNA-Val, and 16S) genes, testing the monophyly of Ischnocnema and its species series. We performed maximum parsimony, maximum likelihood, and Bayesian inference analyses on 364 terminals: 11 outgroup terminals and 353 ingroup Terrarana terminals, including 139 Ischnocnema terminals (accounting for 29 of the 35 named Ischnocnema species) and 214 other Terrarana terminals within the families Brachycephalidae, Ceuthomantidae, Craugastoridae, and Eleutherodactylidae. Different optimality criteria produced similar results and mostly recovered the currently accepted families and genera. According to these topologies, Ischnocnema is not a monophyletic group. We propose new combinations for three species, relocating them to Pristimantis, and render Eleutherodactylus bilineatus Bokermann, 1975 incertae sedis status within Holoadeninae. The rearrangements in Ischnocnema place it outside the northernmost Brazilian Atlantic rainforest, where the fauna of Terrarana comprises typical Amazonian genera. Copyright © 2012 Elsevier Inc. All rights reserved.
Is probabilistic bias analysis approximately Bayesian?
MacLehose, Richard F.; Gustafson, Paul
2011-01-01
Case-control studies are particularly susceptible to differential exposure misclassification when exposure status is determined following incident case status. Probabilistic bias analysis methods have been developed as ways to adjust standard effect estimates based on the sensitivity and specificity of exposure misclassification. The iterative sampling method advocated in probabilistic bias analysis bears a distinct resemblance to a Bayesian adjustment; however, it is not identical. Furthermore, without a formal theoretical framework (Bayesian or frequentist), the results of a probabilistic bias analysis remain somewhat difficult to interpret. We describe, both theoretically and empirically, the extent to which probabilistic bias analysis can be viewed as approximately Bayesian. While the differences between probabilistic bias analysis and Bayesian approaches to misclassification can be substantial, these situations often involve unrealistic prior specifications and are relatively easy to detect. Outside of these special cases, probabilistic bias analysis and Bayesian approaches to exposure misclassification in case-control studies appear to perform equally well. PMID:22157311
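The iterative sampling idea described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' method: it assumes a 2x2 case-control table, hypothetical uniform distributions for sensitivity and specificity, and the standard back-calculation of true exposure counts from misclassified ones. All function names, counts, and prior ranges are illustrative assumptions, and random sampling error is deliberately not added.

```python
import random
import statistics

def corrected_exposed(observed_exposed, total, se, sp):
    """Back-calculate the expected true number exposed from a misclassified count."""
    return (observed_exposed - (1.0 - sp) * total) / (se + sp - 1.0)

def corrected_odds_ratio(a, b, c, d, se_case, sp_case, se_ctrl, sp_ctrl):
    """Bias-adjusted odds ratio, or None if any adjusted cell is not positive.

    a, b: exposed / unexposed cases; c, d: exposed / unexposed controls.
    Separate se/sp for cases and controls allow differential misclassification.
    """
    n_case, n_ctrl = a + b, c + d
    adj_a = corrected_exposed(a, n_case, se_case, sp_case)
    adj_c = corrected_exposed(c, n_ctrl, se_ctrl, sp_ctrl)
    adj_b, adj_d = n_case - adj_a, n_ctrl - adj_c
    if min(adj_a, adj_b, adj_c, adj_d) <= 0:
        return None  # draw is incompatible with the observed table
    return (adj_a * adj_d) / (adj_b * adj_c)

def probabilistic_bias_analysis(a, b, c, d, n_draws=5000, seed=1):
    """Return (median, 2.5th percentile, 97.5th percentile) of adjusted ORs."""
    rng = random.Random(seed)
    adjusted = []
    for _ in range(n_draws):
        # Hypothetical priors; a real analysis would justify these ranges.
        est = corrected_odds_ratio(
            a, b, c, d,
            se_case=rng.uniform(0.85, 0.95), sp_case=rng.uniform(0.90, 0.99),
            se_ctrl=rng.uniform(0.80, 0.90), sp_ctrl=rng.uniform(0.90, 0.99),
        )
        if est is not None:
            adjusted.append(est)
    cuts = statistics.quantiles(adjusted, n=40)  # cut points at 2.5% steps
    return statistics.median(adjusted), cuts[0], cuts[-1]

# Hypothetical observed table: 60/140 exposed/unexposed cases, 200/800 controls.
median_or, lower, upper = probabilistic_bias_analysis(60, 140, 200, 800)
```

Discarding draws that yield non-positive cells is one place where this kind of procedure departs from a formal Bayesian posterior, which is the sort of difference the abstract examines.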
Ritual human sacrifice promoted and sustained the evolution of stratified societies.
Watts, Joseph; Sheehan, Oliver; Atkinson, Quentin D; Bulbulia, Joseph; Gray, Russell D
2016-04-14
Evidence for human sacrifice is found throughout the archaeological record of early civilizations, the ethnographic records of indigenous world cultures, and the texts of the most prolific contemporary religions. According to the social control hypothesis, human sacrifice legitimizes political authority and social class systems, functioning to stabilize such social stratification. Support for the social control hypothesis is largely limited to historical anecdotes of human sacrifice, where the causal claims have not been subject to rigorous quantitative cross-cultural tests. Here we test the social control hypothesis by applying Bayesian phylogenetic methods to a geographically and socially diverse sample of 93 traditional Austronesian cultures. We find strong support for models in which human sacrifice stabilizes social stratification once stratification has arisen, and promotes a shift to strictly inherited class systems. Whilst evolutionary theories of religion have focused on the functionality of prosocial and moral beliefs, our results reveal a darker link between religion and the evolution of modern hierarchical societies.
Molecular Phylogeny of Hantaviruses Harbored by Insectivorous Bats in Côte d’Ivoire and Vietnam
Gu, Se Hun; Lim, Burton K.; Kadjo, Blaise; Arai, Satoru; Kim, Jeong-Ah; Nicolas, Violaine; Lalis, Aude; Denys, Christiane; Cook, Joseph A.; Dominguez, Samuel R.; Holmes, Kathryn V.; Urushadze, Lela; Sidamonidze, Ketevan; Putkaradze, Davit; Kuzmin, Ivan V.; Kosoy, Michael Y.; Song, Jin-Won; Yanagihara, Richard
2014-01-01
The recent discovery of genetically distinct hantaviruses in multiple species of shrews and moles prompted a further exploration of their host diversification by analyzing frozen, ethanol-fixed and RNAlater®-preserved archival tissues and fecal samples from 533 bats (representing seven families, 28 genera and 53 species in the order Chiroptera), captured in Asia, Africa and the Americas in 1981–2012, using RT-PCR. Hantavirus RNA was detected in Pomona roundleaf bats (Hipposideros pomona) (family Hipposideridae), captured in Vietnam in 1997 and 1999, and in banana pipistrelles (Neoromicia nanus) (family Vespertilionidae), captured in Côte d’Ivoire in 2011. Phylogenetic analysis, based on the full-length S- and partial M- and L-segment sequences using maximum likelihood and Bayesian methods, demonstrated that the newfound hantaviruses formed highly divergent lineages, comprising other recently recognized bat-borne hantaviruses in Sierra Leone and China. The detection of bat-associated hantaviruses opens a new era in hantavirology and provides insights into their evolutionary origins. PMID:24784569
Genetic Relationships among Different Chemotypes of Lupinus sulphureus.
Cook, Daniel; Mott, Ivan W; Larson, Steven R; Lee, Stephen T; Johnson, Robert; Stonecipher, Clinton A
2018-02-28
Lupines (Lupinus spp.) are common legumes found on western U.S. rangelands. Lupinus spp. may contain quinolizidine and/or piperidine alkaloids that can be toxic and/or teratogenic to grazing livestock. Alkaloid profiles may vary between and within a species. The objectives of this study were to (1) further explore the characteristic alkaloid profiles of Lupinus sulphureus using field collections and (2) explore the phylogenetic relationship of the different populations and chemotypes of L. sulphureus using the amplified fragment length polymorphism method of DNA fingerprinting, thus providing possible explanations for the phenomenon of multiple chemotypes within a species. A total of 49 accessions of L. sulphureus were classified into seven chemotypes. The DNA profiles showed that one L. sulphureus chemotype, chemotype A, is genetically divergent from the other chemotypes of L. sulphureus, suggesting that it represents an unresolved lupine taxon, possibly a new lupine species. Additionally, the different chemotypes of L. sulphureus represented different genetic groups, as shown by Bayesian cluster analysis and principal component analysis.