Science.gov

Sample records for phylogenetic inference depends

  1. Quantifying the impact of dependent evolution among sites in phylogenetic inference.

    PubMed

    Nasrallah, Chris A; Mathews, David H; Huelsenbeck, John P

    2011-01-01

    Nearly all commonly used methods of phylogenetic inference assume that characters in an alignment evolve independently of one another. This assumption is attractive for simplicity and computational tractability but is not biologically reasonable for RNAs and proteins that have secondary and tertiary structures. Here, we simulate RNA and protein-coding DNA sequence data under a general model of dependence in order to assess the robustness of traditional methods of phylogenetic inference to violation of the assumption of independence among sites. We find that the accuracy of independence-assuming methods is reduced by the dependence among sites; for proteins this reduction is relatively mild, but for RNA this reduction may be substantial. We introduce the concept of effective sequence length and its utility for considering information content in phylogenetics.

  2. Impacts of Terraces on Phylogenetic Inference.

    PubMed

    Sanderson, Michael J; McMahon, Michelle M; Stamatakis, Alexandros; Zwickl, Derrick J; Steel, Mike

    2015-09-01

    Terraces are sets of trees with precisely the same likelihood or parsimony score, which can be induced by missing sequences in partitioned multi-locus phylogenetic data matrices. The potentially large set of trees on a terrace can be characterized by enumeration algorithms or consensus methods that exploit the pattern of partial taxon coverage in the data, independent of the sequence data themselves. Terraces can add ambiguity and complexity to phylogenetic inference, particularly in settings where inference is already challenging: data sets with many taxa and relatively few loci. In this article we present five new findings about terraces and their impacts on phylogenetic inference. First, we clarify assumptions about partitioning scheme model parameters that are necessary for the existence of terraces. Second, we explore the dependence of terrace size on partitioning scheme and indicate how to find the partitioning scheme associated with the largest terrace containing a given tree. Third, we highlight the impact of terrace size on bootstrap estimates of confidence limits in clades, and characterize the surprising result that the bootstrap proportion for a clade, as it is usually calculated, can be entirely determined by the frequency of bipartitions on a terrace, with some bipartitions receiving high support even when incorrect. Fourth, we dissect some effects of prior distributions of edge lengths on the computed posterior probabilities of clades on terraces, to understand an example in which long edges "attract" each other in Bayesian inference. Fifth, we describe how assuming relationships between edge-lengths of different loci, as an attempt to avoid terraces, can also be problematic when taxon coverage is partial, specifically when heterotachy is present. Finally, we discuss strategies for remediation of some of these problems. One promising approach finds a minimal set of taxa which, when deleted from the data matrix, reduces the size of a terrace to a

  3. Phylogenetic Inference From Conserved sites Alignments

    SciTech Connect

    grundy, W.N.; Naylor, G.J.P.

    1999-08-15

    Molecular sequences provide a rich source of data for inferring the phylogenetic relationships among species. However, recent work indicates that even an accurate multiple alignment of a large sequence set may yield an incorrect phylogeny and that the quality of the phylogenetic tree improves when the input consists only of the highly conserved, motif regions of the alignment. This work introduces two methods of producing multiple alignments that include only the conserved regions of the initial alignment. The first method retains conserved motifs, whereas the second retains individual conserved sites in the initial alignment. Using parsimony analysis on a mitochondrial data set containing 19 species among which the phylogenetic relationships are widely accepted, both conserved alignment methods produce better phylogenetic trees than the complete alignment. Unlike any of the 19 inference methods used before to analyze this data, both methods produce trees that are completely consistent with the known phylogeny. The motif-based method employs far fewer alignment sites for comparable error rates. For a larger data set containing mitochondrial sequences from 39 species, the site-based method produces a phylogenetic tree that is largely consistent with known phylogenetic relationships and suggests several novel placements.

  4. Probabilistic phylogenetic inference with insertions and deletions.

    PubMed

    Rivas, Elena; Eddy, Sean R

    2008-01-01

    A fundamental task in sequence analysis is to calculate the probability of a multiple alignment given a phylogenetic tree relating the sequences and an evolutionary model describing how sequences change over time. However, the most widely used phylogenetic models only account for residue substitution events. We describe a probabilistic model of a multiple sequence alignment that accounts for insertion and deletion events in addition to substitutions, given a phylogenetic tree, using a rate matrix augmented by the gap character. Starting from a continuous Markov process, we construct a non-reversible generative (birth-death) evolutionary model for insertions and deletions. The model assumes that insertion and deletion events occur one residue at a time. We apply this model to phylogenetic tree inference by extending the program dnaml in phylip. Using standard benchmarking methods on simulated data and a new "concordance test" benchmark on real ribosomal RNA alignments, we show that the extended program dnamlepsilon improves accuracy relative to the usual approach of ignoring gaps, while retaining the computational efficiency of the Felsenstein peeling algorithm. PMID:18787703

  5. The evolution of HIV: Inferences using phylogenetics

    PubMed Central

    Castro-Nallar, Eduardo; Pérez-Losada, Marcos; Burton, Gregory F.; Crandall, Keith A.

    2011-01-01

    Molecular phylogenetics has revolutionized the study of not only evolution but also disparate fields such as genomics, bioinformatics, epidemiology, ecology, microbiology, molecular biology and biochemistry. Particularly significant are its achievements in population genetics as a result of the development of coalescent theory, which have contributed to more accurate model-based parameter estimation and explicit hypothesis testing. The study of the evolution of many microorganisms, and HIV in particular, have benefited from these new methodologies. HIV is well suited for such sophisticated population analyses because of its large population sizes, short generation times, high substitution rates and relatively small genomes. All these factors make HIV an ideal and fascinating model to study molecular evolution in real time. Here we review the significant advances made in HIV evolution through the application of phylogenetic approaches. We first examine the relative roles of mutation and recombination on the molecular evolution of HIV and its adaptive response to drug therapy and tissue allocation. We then review some of the fundamental questions in HIV evolution in relation to its origin and diversification and describe some of the insights gained using phylogenies. Finally, we show how phylogenetic analysis has advanced our knowledge of HIV dynamics (i.e., phylodynamics). PMID:22138161

  6. Verification of phylogenetic inference programs using metamorphic testing.

    PubMed

    Sadi, Md Shaik; Kuo, Fei-Ching; Ho, Joshua W K; Charleston, Michael A; Chen, T Y

    2011-12-01

    Many phylogenetic inference programs are available to infer evolutionary relationships among taxa using aligned sequences of characters, typically DNA or amino acids. These programs are often used to infer the evolutionary history of species. However, in most cases it is impossible to systematically verify the correctness of the tree returned by these programs, as the correct evolutionary history is generally unknown and unknowable. In addition, it is nearly impossible to verify whether any non-trivial tree is correct in accordance to the specification of the often complicated search and scoring algorithms. This difficulty is known as the oracle problem of software testing: there is no oracle that we can use to verify the correctness of the returned tree. This makes it very challenging to test the correctness of any phylogenetic inference programs. Here, we demonstrate how to apply a simple software testing technique, called Metamorphic Testing, to alleviate the oracle problem in testing phylogenetic inference programs. We have used both real and randomly generated test inputs to evaluate the effectiveness of metamorphic testing, and found that metamorphic testing can detect failures effectively in faulty phylogenetic inference programs with both types of test inputs.

  7. Inferring Phylogenetic Networks with Maximum Pseudolikelihood under Incomplete Lineage Sorting

    PubMed Central

    Solís-Lemus, Claudia; Ané, Cécile

    2016-01-01

    Phylogenetic networks are necessary to represent the tree of life expanded by edges to represent events such as horizontal gene transfers, hybridizations or gene flow. Not all species follow the paradigm of vertical inheritance of their genetic material. While a great deal of research has flourished into the inference of phylogenetic trees, statistical methods to infer phylogenetic networks are still limited and under development. The main disadvantage of existing methods is a lack of scalability. Here, we present a statistical method to infer phylogenetic networks from multi-locus genetic data in a pseudolikelihood framework. Our model accounts for incomplete lineage sorting through the coalescent model, and for horizontal inheritance of genes through reticulation nodes in the network. Computation of the pseudolikelihood is fast and simple, and it avoids the burdensome calculation of the full likelihood which can be intractable with many species. Moreover, estimation at the quartet-level has the added computational benefit that it is easily parallelizable. Simulation studies comparing our method to a full likelihood approach show that our pseudolikelihood approach is much faster without compromising accuracy. We applied our method to reconstruct the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), which is characterized by widespread hybridizations. PMID:26950302

  8. Indel reliability in indel-based phylogenetic inference.

    PubMed

    Ashkenazy, Haim; Cohen, Ofir; Pupko, Tal; Huchon, Dorothée

    2014-12-01

    It is often assumed that it is unlikely that the same insertion or deletion (indel) event occurred at the same position in two independent evolutionary lineages, and thus, indel-based inference of phylogeny should be less subject to homoplasy compared with standard inference which is based on substitution events. Indeed, indels were successfully used to solve debated evolutionary relationships among various taxonomical groups. However, indels are never directly observed but rather inferred from the alignment and thus indel-based inference may be sensitive to alignment errors. It is hypothesized that phylogenetic reconstruction would be more accurate if it relied only on a subset of reliable indels instead of the entire indel data. Here, we developed a method to quantify the reliability of indel characters by measuring how often they appear in a set of alternative multiple sequence alignments. Our approach is based on the assumption that indels that are consistently present in most alternative alignments are more reliable compared with indels that appear only in a small subset of these alignments. Using simulated and empirical data, we studied the impact of filtering and weighting indels by their reliability scores on the accuracy of indel-based phylogenetic reconstruction. The new method is available as a web-server at http://guidance.tau.ac.il/RELINDEL/.

  9. Inferring the evolutionary history of Mo-dependent nitrogen fixation from phylogenetic studies of nifK and nifDK.

    PubMed

    Hartmann, Linda S; Barnum, Susan R

    2010-07-01

    The ability to fix nitrogen is widely, but sporadically distributed among the Bacteria and Archaea suggesting either a vertically inherited, ancient function with widespread loss across genera or an adaptive feature transferred laterally between co-inhabitants of nitrogen-poor environments. As previous phylogenetic studies of nifH and nifD have not completely resolved the evolutionary history of nitrogenase, sixty nifD, nifK, and combined nifDK genes were analyzed using Bayesian, maximum likelihood, and parsimony algorithms to determine whether the individual and combined datasets could provide additional information. The results show congruence between the 16S and nifDK phylogenies at the phyla level and generally support vertical descent with loss. However, statistically significant differences between tree topographies suggest a complex evolutionary history with the underlying pattern of vertical descent obscured by recurring lateral transfer events and different patterns of evolution between the genes. Results support inheritance from the Last Common ancestor or an ancient lateral transfer of the nif genes between Bacteria and Archaea, ongoing gene transfer between cohabitants of similar biogeographic regions, acquisition of nitrogen-fixing capability via symbiosis islands, possible xenologous displacement of one gene in the operon, and possible retention of ancestral genes in heterocystous cyanobacteria. Analyses support the monophyly of the Cyanobacteria, alphabetagamma-Proteobacteria, and Actinobacteria (Frankia) and provide strong support for the placement of Frankia nif genes at the base of combined the Cyanobacteria/Proteobacteria clades.

  10. Explanation and falsification in phylogenetic inference: exercises in Popperian philosophy.

    PubMed

    Kluge, Arnold G

    2009-06-01

    Deduction leads to causal explanation in phylogenetic inference when the evidence, the systematic character, is conceptualized as a transformation series. Also, the deductive entailment of modus tollens is satisfied when those kinds of events are operationalized as patristic difference. Arguments to the contrary are based largely on the premise that character-states are defined intensionally as objects, in terms of similarity relations. However, such relations leave biologists without epistemological access to the causal explanation and explanatory power of historical statements. Moreover, the prediction-making to which those kinds of relations are limited in practice can lead to a category error--the mental conversion of an abstraction (the classes defined in terms of similarity relations) into a thing (such as an historical individual). The latter practices and problems characterize pattern cladistics, taxa being interpreted as homeostatic property cluster natural kinds, and other instrumentalist research programs. PMID:19190984

  11. Inferring influenza global transmission networks without complete phylogenetic information

    PubMed Central

    Aris-Brosou, Stéphane

    2014-01-01

    Influenza is one of the most severe respiratory infections affecting humans throughout the world, yet the dynamics of its global transmission network are still contentious. Here, I describe a novel combination of phylogenetics, time series, and graph theory to analyze 14.25 years of data stratified in space and in time, focusing on the main target of the human immune response, the hemagglutinin gene. While bypassing the complete phylogenetic inference of huge data sets, the method still extracts information suggesting that waves of genetic or of nucleotide diversity circulate continuously around the globe for subtypes that undergo sustained transmission over several seasons, such as H3N2 and pandemic H1N1/09, while diversity of prepandemic H1N1 viruses had until 2009 a noncontinuous transmission pattern consistent with a source/sink model. Irrespective of the shift in the structure of H1N1 diversity circulation with the emergence of the pandemic H1N1/09 strain, US prevalence peaks during the winter months when genetic diversity is at its lowest. This suggests that a dominant strain is generally responsible for epidemics and that monitoring genetic and/or nucleotide diversity in real time could provide public health agencies with an indirect estimate of prevalence. PMID:24665342

  12. Inferring the phylogenetic position of Boa constrictor among the Boinae.

    PubMed

    Burbrink, Frank T

    2005-01-01

    Snakes of the subfamily Boinae are found in Madagascar, the Papuan-Pacific Islands, and the Neotropics. It has been suggested that genera within each of these particular areas do not form monophyletic groups. Further, it was proposed that the New World Boa constrictor is more closely related to boine genera in Madagascar than to boines in the Neotropics. Along with inferring the relationship of all boine genera using data from the cytochrome b gene and morphology, the placement of Boa was also examined. Phylogenetic inferences using maximum likelihood and Bayesian (BI) methods for combined data analyses and separate analyses of DNA sequence and morphological data were conducted. Priors, parametric bootstraps, and the Shimodaira-Hasegawa test were used to examine the previously proposed placement of Boa with Madagascan taxa using these DNA data. DNA data and combined data analyses strongly reject the hypothesis that Boa is more closely related to Old World genera than to other New World genera. Additionally, strong tree support suggests that all species within Madagascar, the Papuan-Pacific Islands, and the Neotropics each form a monophyletic group with respect to their geographic region.

  13. Do we need many genes for phylogenetic inference?

    PubMed

    Aleshin, V V; Konstantinova, A V; Mikhailov, K V; Nikitin, M A; Petrov, N B

    2007-12-01

    Fifty-six nuclear protein coding genes from Taxonomically Broad EST Database and other databases were selected for phylogenomic-based examination of alternative phylogenetic hypotheses concerning intergroup relationship between multicellular animals (Metazoa) and other representatives of Opisthokonta. The results of this work support sister group relationship between Metazoa and Choanoflagellata. Both of these groups form the taxon Holozoa along with the monophyletic Ichthyosporea or Mesomycetozoea (a group that includes Amoebidium parasiticum, Sphaeroforma arctica, and Capsaspora owczarzaki). These phylogenetic hypotheses receive high statistical support both when utilizing whole alignment and when only 5000 randomly selected alignment positions are used. The presented results suggest subdivision of Fungi into Eumycota and lower fungi, Chytridiomycota. The latter form a monophyletic group that comprises Chytridiales+Spizellomycetales+Blastocladiales (Batrachochytrium, Spizellomyces, Allomyces, Blastocladiella), contrary to the earlier reports based on the analysis of 18S rRNA and a limited set of protein coding genes. The phylogenetic distribution of genes coding for a ubiquitin-fused ribosomal protein S30 implies at least three independent cases of gene fusion: in the ancestors of Holozoa, in heterotrophic Heterokonta (Oomycetes and Blastocystis) and in the ancestors of Cryptophyta and Glaucophyta. Ubiquitin-like sequences fused with ribosomal protein S30 outside of Holozoa are not FUBI orthologs. Two independent events of FUBI replacement by the ubiquitin sequence were detected in the lineage of C. owczarzaki and in the monophyletic group of nematode worms Tylenchomorpha+Cephalobidae. Bursaphelenchus xylophilus (Aphelenchoidoidea) retains a state typical of the rest of the Metazoa. The data emphasize the fact that the reliability of phylogenetic reconstructions depends on the number of analyzed genes to a lesser extent than on our ability to recognize

  14. Robustness of phylogenetic inference based on minimum evolution.

    PubMed

    Pardi, Fabio; Guillemot, Sylvain; Gascuel, Olivier

    2010-10-01

    Minimum evolution is the guiding principle of an important class of distance-based phylogeny reconstruction methods, including neighbor-joining (NJ), which is the most cited tree inference algorithm to date. The minimum evolution principle involves searching for the tree with minimum length, where the length is estimated using various least-squares criteria. Since evolutionary distances cannot be known precisely but only estimated, it is important to investigate the robustness of phylogenetic reconstruction to imprecise estimates for these distances. The safety radius is a measure of this robustness: it consists of the maximum relative deviation that the input distances can have from the correct distances, without compromising the reconstruction of the correct tree structure. Answering some open questions, we here derive the safety radius of two popular minimum evolution criteria: balanced minimum evolution (BME) and minimum evolution based on ordinary least squares (OLS + ME). Whereas BME has a radius of 1/2, which is the best achievable, OLS + ME has a radius tending to 0 as the number of taxa increases. This difference may explain the gap in reconstruction accuracy observed in practice between OLS + ME and BME (which forms the basis of popular programs such as NJ and FastME).

  15. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference

    PubMed Central

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-01-01

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal

  16. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

    PubMed

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-01-01

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal

  17. T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks.

    PubMed

    Boc, Alix; Diallo, Alpha Boubacar; Makarenkov, Vladimir

    2012-07-01

    T-REX (Tree and reticulogram REConstruction) is a web server dedicated to the reconstruction of phylogenetic trees, reticulation networks and to the inference of horizontal gene transfer (HGT) events. T-REX includes several popular bioinformatics applications such as MUSCLE, MAFFT, Neighbor Joining, NINJA, BioNJ, PhyML, RAxML, random phylogenetic tree generator and some well-known sequence-to-distance transformation models. It also comprises fast and effective methods for inferring phylogenetic trees from complete and incomplete distance matrices as well as for reconstructing reticulograms and HGT networks, including the detection and validation of complete and partial gene transfers, inference of consensus HGT scenarios and interactive HGT identification, developed by the authors. The included methods allows for validating and visualizing phylogenetic trees and networks which can be built from distance or sequence data. The web server is available at: www.trex.uqam.ca.

  18. Model inadequacy and mistaken inferences of trait-dependent speciation.

    PubMed

    Rabosky, Daniel L; Goldberg, Emma E

    2015-03-01

    Species richness varies widely across the tree of life, and there is great interest in identifying ecological, geographic, and other factors that affect rates of species proliferation. Recent methods for explicitly modeling the relationships among character states, speciation rates, and extinction rates on phylogenetic trees- BiSSE, QuaSSE, GeoSSE, and related models-have been widely used to test hypotheses about character state-dependent diversification rates. Here, we document the disconcerting ease with which neutral traits are inferred to have statistically significant associations with speciation rate. We first demonstrate this unfortunate effect for a known model assumption violation: shifts in speciation rate associated with a character not included in the model. We further show that for many empirical phylogenies, characters simulated in the absence of state-dependent diversification exhibit an even higher Type I error rate, indicating that the method is susceptible to additional, unknown model inadequacies. For traits that evolve slowly, the root cause appears to be a statistical framework that does not require replicated shifts in character state and diversification. However, spurious associations between character state and speciation rate arise even for traits that lack phylogenetic signal, suggesting that phylogenetic pseudoreplication alone cannot fully explain the problem. The surprising severity of this phenomenon suggests that many trait-diversification relationships reported in the literature may not be real. More generally, we highlight the need for diagnosing and understanding the consequences of model inadequacy in phylogenetic comparative methods. PMID:25601943

  19. Event inference in multidomain families with phylogenetic reconciliation

    PubMed Central

    2015-01-01

    Background Reconstructing evolution provides valuable insights into the processes of gene evolution and function. However, while there have been great advances in algorithms and software to reconstruct the history of gene families, these tools do not model the domain shuffling events (domain duplication, insertion, transfer, and deletion) that drive the evolution of multidomain protein families. Protein evolution through domain shuffling events allows for rapid exploration of functions by introducing new combinations of existing folds. This powerful mechanism was key to some significant evolutionary innovations, such as multicellularity and the vertebrate immune system. A method for reconstructing this important evolutionary process is urgently needed. Results Here, we introduce a novel, event-based framework for studying multidomain evolution by reconciling a domain tree with a gene tree, with additional information provided by the species tree. In the context of this framework, we present the first reconciliation algorithms to infer domain shuffling events, while addressing the challenges inherent in the inference of evolution across three levels of organization. Conclusions We apply these methods to the evolution of domains in the Membrane associated Guanylate Kinase family. These case studies reveal a more vivid and detailed evolutionary history than previously provided. Our algorithms have been implemented in software, freely available at http://www.cs.cmu.edu/˜durand/Notung. PMID:26451642

  20. Comparative cytogenetic analysis of some species of the Dendropsophus microcephalus group (Anura, Hylidae) in the light of phylogenetic inferences

    PubMed Central

    2013-01-01

    observe that the number of telocentric chromosomes is useful to distinguish among valid species in some cases, although it is unchanged in species that are not necessarily closely related phylogenetically. Therefore, inferences based on this chromosomal character must be made with caution; a proper evolutionary analysis of the karyotypic variation in Dendropsophus depends on further characterization of the telocentric chromosomes found in this group. PMID:23822759

  1. Phylogenetic analysis in Myrcia section Aulomyrcia and inferences on plant diversity in the Atlantic rainforest

    PubMed Central

    Staggemeier, Vanessa Graziele; Diniz-Filho, José Alexandre Felizola; Forest, Félix; Lucas, Eve

    2015-01-01

    Background and Aims Myrcia section Aulomyrcia includes ∼120 species that are endemic to the Neotropics and disjunctly distributed in the moist Amazon and Atlantic coastal forests of Brazil. This paper presents the first comprehensive phylogenetic study of this group and this phylogeny is used as a basis to evaluate recent classification systems and to test alternative hypotheses associated with the history of this clade. Methods Fifty-three taxa were sampled out of the 120 species currently recognized, plus 40 outgroup taxa, for one nuclear marker (ribosomal internal transcribed spacer) and four plastid markers (psbA-trnH, trnL-trnF, trnQ-rpS16 and ndhF). The relationships were reconstructed based on Bayesian and maximum likelihood analyses. Additionally, a likelihood approach, ‘geographic state speciation and extinction’, was used to estimate region- dependent rates of speciation, extinction and dispersal, comparing historically climatic stable areas (refugia) and unstable areas. Key Results Maximum likelihood and Bayesian inferences indicate that Myrcia and Marlierea are polyphyletic, and the internal groupings recovered are characterized by combinations of morphological characters. Phylogenetic relationships support a link between Amazonian and north-eastern species and between north-eastern and south-eastern species. Lower extinction rates within glacial refugia suggest that these areas were important in maintaining diversity in the Atlantic forest biodiversity hotspot. Conclusions This study provides a robust phylogenetic framework to address important ecological questions for Myrcia s.l. within an evolutionary context, and supports the need to unite taxonomically the two traditional genera Myrcia and Marlierea in an expanded Myrcia s.l. Furthermore, this study offers valuable insights into the diversification of plant species in the highly impacted Atlantic forest of South America; evidence is presented that the lowest extinction rates are found inside

  2. Geometric ergodicity of a hybrid sampler for Bayesian inference of phylogenetic branch lengths.

    PubMed

    Spade, David A; Herbei, Radu; Kubatko, Laura S

    2015-10-01

    One of the fundamental goals in phylogenetics is to make inferences about the evolutionary pattern among a group of individuals, such as genes or species, using present-day genetic material. This pattern is represented by a phylogenetic tree, and as computational methods have caught up to the statistical theory, Bayesian methods of making inferences about phylogenetic trees have become increasingly popular. Bayesian inference of phylogenetic trees requires sampling from intractable probability distributions. Common methods of sampling from these distributions include Markov chain Monte Carlo (MCMC) and Sequential Monte Carlo (SMC) methods, and one way that both of these methods can proceed is by first simulating a tree topology and then taking a sample from the posterior distribution of the branch lengths given the tree topology and the data set. In many MCMC methods, it is difficult to verify that the underlying Markov chain is geometrically ergodic, and thus, it is necessary to rely on output-based convergence diagnostics in order to assess convergence on an ad hoc basis. These diagnostics suffer from several important limitations, so in an effort to circumvent these limitations, this work establishes geometric convergence for a particular Markov chain that is used to sample branch lengths under a fairly general class of nucleotide substitution models and provides a numerical method for estimating the time this Markov chain takes to converge.

  3. A standard operating procedure for phylogenetic inference (SOPPI) using (rRNA) marker genes.

    PubMed

    Peplies, Jörg; Kottmann, Renzo; Ludwig, Wolfgang; Glöckner, Frank Oliver

    2008-09-01

    Phylogenetic analysis is currently used worldwide for taxonomic classification and identification of microorganisms. However, despite the countless trees that have been reconstructed and published in recent decades, so far, no user-friendly compilation of recommendations to standardize the data analysis and tree reconstruction process has been published. Consequently, this standard operating procedure for phylogenetic inference (SOPPI) offers a helping hand for working through the process from sampling in the field to phylogenetic tree reconstruction and publication. It is not meant to be authoritative or comprehensive, but should help to make phylogenetic inference and diversity analysis more reliable and comparable between different laboratories. It is mainly focused on using the ribosomal RNA as a universal phylogenetic marker, but the principles and recommendations can be applied to any valid marker gene. Feedback and suggestions from the scientific community are welcome in order to improve these guidelines further. Any updates will be made available on the SILVA webpage at http://www.arb-silva.de/projects/soppi.

  4. Phylogenetic relationships among the Nymphalidae (Lepidoptera) inferred from partial sequences of the wingless gene.

    PubMed Central

    Brower, A V

    2000-01-01

    A cladistic analysis was performed on a 378 bp region of the wingless gene from 103 nymphalid species and three pierid outgroups in order to infer higher level patterns of relationship among nymphalid subfamilies and tribes. Although the data are highly homoplastic, in many instances the most parsimonious cladograms corroborate traditionally recognized groups. The results suggest that this short gene region provides a useful source of data for phylogenetic inference, provided that adequate effort is made to sample a diversity of taxa. PMID:10902686

  5. Phi-LOG: a domain specific language for solving phylogenetic inference problems.

    PubMed

    Pontelli, E; Ranjan, D; Milligan, B; Gupta, G

    2002-01-01

    Domain experts think and reason at a high level of abstraction when they solve problems in their domain of expertise. We present the design and motivation behind a domain specific language called Phi-LOG to enable biologists (domain experts) to program solutions to phylogenetic inference problems at a very high level of abstraction. The implementation infrastructure (interpreter, compiler, debugger) for the DSL is automatically obtained through a software engineering framework based on Denotational Semantics and Logic Programming.

  6. Design and implementation of a domain specific language for phylogenetic inference.

    PubMed

    Pontelli, Enrico; Ranjan, Desh; Gupta, Gopal; Milligan, Brook

    2003-07-01

    Domain experts think and reason at a high level of abstraction when they solve problems in their domain of expertise. We present the design and motivation behind a domain specific language, called phi LOG, to enable biologists to program solutions to phylogenetic inference problems at a very high level of abstraction. The implementation infrastructure (interpreter, compiler, debugger) for the DSL is automatically obtained through a software engineering framework based on Denotational Semantics and Logic Programming.

  7. Phylogenetic and molecular clock inferences of cyanobacterial strains within Rivulariaceae from distant environments.

    PubMed

    Domínguez-Escobar, Julia; Beltrán, Yislem; Bergman, Birgitta; Díez, Beatriz; Ininbergs, Karolina; Souza, Valeria; Falcón, Luisa I

    2011-03-01

    Heterocyst-forming cyanobacteria are important players at both evolutionary and ecological scales, but to date it has been difficult to establish their phylogenetic affiliations. We present data from a phylogenetic and molecular clock analysis of heterocystous cyanobacteria within the family Rivulariaceae, including the genera Calothrix, Rivularia, Gloeotrichia and Tolypothrix. The strains were isolated from distant geographic regions including fresh and brackish water bodies, microbial mats from beach rock, microbialites, pebble beaches, plus PCC strains 7103 and 7504. Phylogenetic inferences (distance, likelihood and Bayesian) suggested the monophyly of genera Calothrix and Rivularia. Molecular clock estimates indicate that Calothrix and Rivularia originated ∼1500 million years ago (MYA) ago and species date back to 400-300 MYA while Tolypothrix and Gloeotrichia are younger genera (600-400 MYA).

  8. Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference

    PubMed Central

    Tan, Ge; Muffato, Matthieu; Ledergerber, Christian; Herrero, Javier; Goldman, Nick; Gil, Manuel; Dessimoz, Christophe

    2015-01-01

    Phylogenetic inference is generally performed on the basis of multiple sequence alignments (MSA). Because errors in an alignment can lead to errors in tree estimation, there is a strong interest in identifying and removing unreliable parts of the alignment. In recent years several automated filtering approaches have been proposed, but despite their popularity, a systematic and comprehensive comparison of different alignment filtering methods on real data has been lacking. Here, we extend and apply recently introduced phylogenetic tests of alignment accuracy on a large number of gene families and contrast the performance of unfiltered versus filtered alignments in the context of single-gene phylogeny reconstruction. Based on multiple genome-wide empirical and simulated data sets, we show that the trees obtained from filtered MSAs are on average worse than those obtained from unfiltered MSAs. Furthermore, alignment filtering often leads to an increase in the proportion of well-supported branches that are actually wrong. We confirm that our findings hold for a wide range of parameters and methods. Although our results suggest that light filtering (up to 20% of alignment positions) has little impact on tree accuracy and may save some computation time, contrary to widespread practice, we do not generally recommend the use of current alignment filtering methods for phylogenetic inference. By providing a way to rigorously and systematically measure the impact of filtering on alignments, the methodology set forth here will guide the development of better filtering algorithms. PMID:26031838

  9. Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference.

    PubMed

    Tan, Ge; Muffato, Matthieu; Ledergerber, Christian; Herrero, Javier; Goldman, Nick; Gil, Manuel; Dessimoz, Christophe

    2015-09-01

    Phylogenetic inference is generally performed on the basis of multiple sequence alignments (MSA). Because errors in an alignment can lead to errors in tree estimation, there is a strong interest in identifying and removing unreliable parts of the alignment. In recent years several automated filtering approaches have been proposed, but despite their popularity, a systematic and comprehensive comparison of different alignment filtering methods on real data has been lacking. Here, we extend and apply recently introduced phylogenetic tests of alignment accuracy on a large number of gene families and contrast the performance of unfiltered versus filtered alignments in the context of single-gene phylogeny reconstruction. Based on multiple genome-wide empirical and simulated data sets, we show that the trees obtained from filtered MSAs are on average worse than those obtained from unfiltered MSAs. Furthermore, alignment filtering often leads to an increase in the proportion of well-supported branches that are actually wrong. We confirm that our findings hold for a wide range of parameters and methods. Although our results suggest that light filtering (up to 20% of alignment positions) has little impact on tree accuracy and may save some computation time, contrary to widespread practice, we do not generally recommend the use of current alignment filtering methods for phylogenetic inference. By providing a way to rigorously and systematically measure the impact of filtering on alignments, the methodology set forth here will guide the development of better filtering algorithms. PMID:26031838

  10. Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling

    PubMed Central

    Shin, Junha; Lee, Insuk

    2015-01-01

    Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co

  11. Phylogenetic relationships of Nembrothinae (Mollusca: Doridacea: Polyceridae) inferred from morphology and mitochondrial DNA.

    PubMed

    Pola, Marta; Cervera, J Lucas; Gosliner, Terrence M

    2007-06-01

    Within the Polyceridae, Nembrothinae includes some of the most striking and conspicuous sea slugs known, although several features of their biology and phylogenetic relationships remain unknown. This paper reports a phylogenetic analysis based on partial sequences of two mitochondrial genes (cytochrome c oxidase subunit I and 16S rRNA) and morphology for most species included in Nembrothinae. Our phylogenetic reconstructions using both molecular and combined morphological and molecular data support the taxonomic splitting of Nembrothinae into several taxa. Excluding one species (Tambja tentaculata), the monophyly of Roboastra was supported by all the phylogenetic analyses of the combined molecular data. Nembrotha was monophyletic both in the morphological and molecular analyses, always with high support. However, Tambja was recovered as para- or polyphyletic, depending on the analysis performed. Our study also rejects the monophyly of "phanerobranch" dorids based on molecular data.

  12. Hypothesis tests for phylogenetic quartets, with applications to coalescent-based species tree inference.

    PubMed

    Gaither, Jeff; Kubatko, Laura

    2016-11-01

    Numerous statistical methods have been developed to estimate evolutionary relationships among a collection of present-day species, typically represented by a phylogenetic tree, using the information contained in the DNA sequences sampled from representatives of each species. In the current era of high-throughput genome sequencing, the models underlying such methods have become increasingly sophisticated, and the resulting computations are often prohibitive. Here we consider the problem of rigorously testing the phylogenetic relationships among collections of four species under the multispecies coalescent model that accommodates both multi-locus datasets and SNP data. Our test employs a new statistic - the summed absolute differences between certain columns in flattened phylogenetic matrices - as well as a previously used statistic that measures the distance of a flattened matrix from the space of rank-10 matrices. We derive distributional results for both statistics and study the performance of the corresponding hypothesis tests using both simulated and empirical data. We discuss how these tests may be used to improve inference of phylogenetic relationships for larger samples of species under the multispecies coalescent model, a problem that has until recently been computationally intractable. PMID:27521524

  13. Inferring epidemiological parameters from phylogenetic information for the HIV-1 epidemic among MSM

    NASA Astrophysics Data System (ADS)

    Quax, Rick; van de Vijver, David A. M. C.; Frentz, Dineke; Sloot, Peter M. A.

    2013-09-01

    The HIV-1 epidemic in Europe is primarily sustained by a dynamic topology of sexual interactions among MSM who have individual immune systems and behavior. This epidemiological process shapes the phylogeny of the virus population. Both fields of epidemic modeling and phylogenetics have a long history, however it remains difficult to use phylogenetic data to infer epidemiological parameters such as the structure of the sexual network and the per-act infectiousness. This is because phylogenetic data is necessarily incomplete and ambiguous. Here we show that the cluster-size distribution indeed contains information about epidemiological parameters using detailed numberical experiments. We simulate the HIV epidemic among MSM many times using the Monte Carlo method with all parameter values and their ranges taken from literature. For each simulation and the corresponding set of parameter values we calculate the likelihood of reproducing an observed cluster-size distribution. The result is an estimated likelihood distribution of all parameters from the phylogenetic data, in particular the structure of the sexual network, the per-act infectiousness, and the risk behavior reduction upon diagnosis. These likelihood distributions encode the knowledge provided by the observed cluster-size distrbution, which we quantify using information theory. Our work suggests that the growing body of genetic data of patients can be exploited to understand the underlying epidemiological process.

  14. Phylogenetic relationships among the family Ommastrephidae (Mollusca: Cephalopoda) inferred from two mitochondrial DNA gene sequences.

    PubMed

    Wakabayashi, T; Suzuki, N; Sakai, M; Ichii, T; Chow, S

    2012-09-01

    Squids of the family Ommastrephidae are distributed worldwide, and the family includes many species of commercial importance. To investigate phylogenetic relationships among squid species of the family Ommastrephidae, partial nucleotide sequences of two mitochondrial gene loci (cytochrome c oxidase subunit I [1277bp] and 16S rRNA [443bp]) of 15 ommastrephid species and two outgroup species from the families Loliginidae and Enoploteuthidae were determined and used to construct parsimony and distance based phylogenetic trees. The molecular data provided several new phylogenetic inferences. The monophyletic status of three subfamilies (Illicinae, Todarodinae and Ommastrephinae) was well supported, although phylogenetic relationships between the subfamilies were not resolved. Inclusion of a problematic species, Ornithoteuthis volatilis, to Todarodinae was indicated. Within Todarodinae, the Japanese common squid Todarodes pacificus was observed to have much closer relationship to the species of the genus Nototodarus than to its congener (Todarodes filippovae). These results indicate that re-evaluation of several morphological key characters for ommastrephid taxonomy may be necessary.

  15. Phylogenetic Relationships of Pseudorasbora, Pseudopungtungia, and Pungtungia (Teleostei; Cypriniformes; Gobioninae) Inferred from Multiple Nuclear Gene Sequences

    PubMed Central

    Kim, Keun-Yong; Ko, Myeong-Hun; Liu, Huanzhang; Tang, Qiongying; Chen, Xianglin; Bang, In-Chul

    2013-01-01

    Gobionine species belonging to the genera Pseudorasbora, Pseudopungtungia, and Pungtungia (Teleostei; Cypriniformes; Cyprinidae) have been heavily studied because of problems on taxonomy, threats of extinction, invasion, and human health. Nucleotide sequences of three nuclear genes, that is, recombination activating protein gene 1 (rag1), recombination activating gene 2 (rag2), and early growth response 1 gene (egr1), from Pseudorasbora, Pseudopungtungia, and Pungtungia species residing in China, Japan, and Korea, were analyzed to elucidate their intergeneric and interspecific phylogenetic relationships. In the phylogenetic tree inferred from their multiple gene sequences, Pseudorasbora, Pseudopungtungia and Pungtungia species ramified into three phylogenetically distinct clades; the “tenuicorpa” clade composed of Pseudopungtungia tenuicorpa, the “parva” clade composed of all Pseudorasbora species/subspecies, and the “herzi” clade composed of Pseudopungtungia nigra, and Pungtungia herzi. The genus Pseudorasbora was recovered as monophyletic, while the genus Pseudopungtungia was recovered as polyphyletic. Our phylogenetic result implies the unstable taxonomic status of the genus Pseudopungtungia. PMID:24106702

  16. Phylogenetic position of Cryothecomonas inferred from nuclear-encoded small subunit ribosomal RNA.

    PubMed

    Kühn, S; Lange, M; Medlin, L K

    2000-12-01

    The systematic position of the genus Cryothecomonas has been determined from an analysis of the nuclear-encoded small subunit ribosomal RNA gene of Cryothecomonas longipes and two strains of Cryothecomonas aestivalis. Our phylogenetic trees inferred from maximum likelihood, distance and maximum parsimony methods robustly show that the genus Cryothecomonas clusters within the phylum Cercozoa, and is related to the sarcomonad flagellate Heteromita globosa. Morphological data supporting the taxonomic placement of Cryothecomonas near the sarcomonad flagellates has been compiled from the literature. The high number of nucleotide substitutions found between two morphologically indistinguishable strains of Cryothecomonas aestivalis suggests the possibility of cryptic species within Cryothecomonas aestivalis. PMID:11212894

  17. Inferring the evolutionary history of gene clusters from phylogenetic and gene order data.

    PubMed

    Lajoie, Mathieu; Bertrand, Denis; El-Mabrouk, Nadia

    2010-04-01

    Gene duplication is frequent within gene clusters and plays a fundamental role in evolution by providing a source of new genetic material upon which natural selection can act. Although classical phylogenetic inference methods provide some insight into the evolutionary history of a gene cluster, they are not sufficient alone to differentiate single- from multiple gene duplication events and to answer other questions regarding the nature and size of evolutionary events. In this paper, we present an algorithm allowing to infer a set of optimal evolutionary histories for a gene cluster in a single species, according to a general cost model involving variable length duplications (in tandem or inverted), deletions, and inversions. We applied our algorithm to the human olfactory receptor and protocadherin gene clusters, showing that the duplication size distribution differs significantly between the two gene families. The algorithm is available through a web interface at http://www-lbit.iro.umontreal.ca/DILTAG/.

  18. Phylogenetic relationships of yellowjackets inferred from nine loci (Hymenoptera: Vespidae, Vespinae, Vespula and Dolichovespula).

    PubMed

    Lopez-Osorio, Federico; Pickett, Kurt M; Carpenter, James M; Ballif, Bryan A; Agnarsson, Ingi

    2014-04-01

    Eusociality has arisen repeatedly and independently in the history of insects, often leading to evolutionary success and ecological dominance. Eusocial wasps of the genera Vespula and Dolichovespula, or yellowjackets, have developed advanced social traits in a relatively small number of species. The origin of traits such as effective paternity and colony size has been interpreted with reference to an established phylogenetic hypothesis that is based on phenotypic data, while the application of molecular evidence to phylogenetic analysis within yellowjackets has been limited. Here, we investigate the evolutionary history of yellowjackets on the basis of mitochondrial and nuclear markers (nuclear: 28S, EF1α, Pol II, and wg; mitochondrial: 12S, 16S, COI, COII, and Cytb). We use these data to test the monophyly of yellowjackets and species groups, and resolve species-level relationships within each genus using parsimony and Bayesian inference. Our results indicate that a yellowjacket clade is either weakly supported (parsimony) or rejected (Bayesian inference). However, the monophyly of each yellowjacket genus as well as species groups are strongly supported and concordant between methods. Our results agree with previous studies regarding the monophyly of the Vespula vulgaris group and the sister relationship between the V. rufa and V. squamosa groups. This suggests convergence of large colony size and high effective paternity in the vulgaris group and V. squamosa, or a single origin of both traits in the most recent common ancestor of all Vespula species and their evolutionary reversal in the rufa group. PMID:24462637

  19. Assessing the passerine "Tapestry": phylogenetic relationships of the Muscicapoidea inferred from nuclear DNA sequences.

    PubMed

    Cibois, Alice; Cracraft, Joel

    2004-07-01

    This study presents new comparative sequence data from the nuclear RAG-1 gene for an increased taxon sample in order to investigate phylogenetic relationships among a diverse songbird superfamily, the Muscicapoidea, which has variously included the waxwings, silky flycatchers, Palm Chat, dippers, starlings, mockingbirds, thrushes, chats, and Old World flycatchers. At the same time, our results provide a test of the often-cited relationships inferred from the phenetic studies of Sibley and Ahlquist [Phylogeny and Classification of Birds: A Study in Molecular Evolution. Yale University Press, New Haven, 1990] using DNA hybridization distances. Nuclear DNA sequences confirm the monophyly of the "core muscicapoid" group, as defined by Barker et al. [Proc. R. Soc. Lond. B 269 (2002) 295] and also support the sister-group relationship of the Sturnidae and Mimidae, on the one hand, and the large-bodied thrushes (Turdini)+the Old World flycatchers and robins, on the other. The results of the phylogenetic analysis allow preliminary inferences about muscicapoid biogeographic history.

  20. RevBayes: Bayesian Phylogenetic Inference Using Graphical Models and an Interactive Model-Specification Language

    PubMed Central

    Höhna, Sebastian; Landis, Michael J.

    2016-01-01

    Programs for Bayesian inference of phylogeny currently implement a unique and fixed suite of models. Consequently, users of these software packages are simultaneously forced to use a number of programs for a given study, while also lacking the freedom to explore models that have not been implemented by the developers of those programs. We developed a new open-source software package, RevBayes, to address these problems. RevBayes is entirely based on probabilistic graphical models, a powerful generic framework for specifying and analyzing statistical models. Phylogenetic-graphical models can be specified interactively in RevBayes, piece by piece, using a new succinct and intuitive language called Rev. Rev is similar to the R language and the BUGS model-specification language, and should be easy to learn for most users. The strength of RevBayes is the simplicity with which one can design, specify, and implement new and complex models. Fortunately, this tremendous flexibility does not come at the cost of slower computation; as we demonstrate, RevBayes outperforms competing software for several standard analyses. Compared with other programs, RevBayes has fewer black-box elements. Users need to explicitly specify each part of the model and analysis. Although this explicitness may initially be unfamiliar, we are convinced that this transparency will improve understanding of phylogenetic models in our field. Moreover, it will motivate the search for improvements to existing methods by brazenly exposing the model choices that we make to critical scrutiny. RevBayes is freely available at http://www.RevBayes.com. [Bayesian inference; Graphical models; MCMC; statistical phylogenetics.] PMID:27235697

  1. RevBayes: Bayesian Phylogenetic Inference Using Graphical Models and an Interactive Model-Specification Language.

    PubMed

    Höhna, Sebastian; Landis, Michael J; Heath, Tracy A; Boussau, Bastien; Lartillot, Nicolas; Moore, Brian R; Huelsenbeck, John P; Ronquist, Fredrik

    2016-07-01

    Programs for Bayesian inference of phylogeny currently implement a unique and fixed suite of models. Consequently, users of these software packages are simultaneously forced to use a number of programs for a given study, while also lacking the freedom to explore models that have not been implemented by the developers of those programs. We developed a new open-source software package, RevBayes, to address these problems. RevBayes is entirely based on probabilistic graphical models, a powerful generic framework for specifying and analyzing statistical models. Phylogenetic-graphical models can be specified interactively in RevBayes, piece by piece, using a new succinct and intuitive language called Rev. Rev is similar to the R language and the BUGS model-specification language, and should be easy to learn for most users. The strength of RevBayes is the simplicity with which one can design, specify, and implement new and complex models. Fortunately, this tremendous flexibility does not come at the cost of slower computation; as we demonstrate, RevBayes outperforms competing software for several standard analyses. Compared with other programs, RevBayes has fewer black-box elements. Users need to explicitly specify each part of the model and analysis. Although this explicitness may initially be unfamiliar, we are convinced that this transparency will improve understanding of phylogenetic models in our field. Moreover, it will motivate the search for improvements to existing methods by brazenly exposing the model choices that we make to critical scrutiny. RevBayes is freely available at http://www.RevBayes.com [Bayesian inference; Graphical models; MCMC; statistical phylogenetics.].

  2. RevBayes: Bayesian Phylogenetic Inference Using Graphical Models and an Interactive Model-Specification Language.

    PubMed

    Höhna, Sebastian; Landis, Michael J; Heath, Tracy A; Boussau, Bastien; Lartillot, Nicolas; Moore, Brian R; Huelsenbeck, John P; Ronquist, Fredrik

    2016-07-01

    Programs for Bayesian inference of phylogeny currently implement a unique and fixed suite of models. Consequently, users of these software packages are simultaneously forced to use a number of programs for a given study, while also lacking the freedom to explore models that have not been implemented by the developers of those programs. We developed a new open-source software package, RevBayes, to address these problems. RevBayes is entirely based on probabilistic graphical models, a powerful generic framework for specifying and analyzing statistical models. Phylogenetic-graphical models can be specified interactively in RevBayes, piece by piece, using a new succinct and intuitive language called Rev. Rev is similar to the R language and the BUGS model-specification language, and should be easy to learn for most users. The strength of RevBayes is the simplicity with which one can design, specify, and implement new and complex models. Fortunately, this tremendous flexibility does not come at the cost of slower computation; as we demonstrate, RevBayes outperforms competing software for several standard analyses. Compared with other programs, RevBayes has fewer black-box elements. Users need to explicitly specify each part of the model and analysis. Although this explicitness may initially be unfamiliar, we are convinced that this transparency will improve understanding of phylogenetic models in our field. Moreover, it will motivate the search for improvements to existing methods by brazenly exposing the model choices that we make to critical scrutiny. RevBayes is freely available at http://www.RevBayes.com [Bayesian inference; Graphical models; MCMC; statistical phylogenetics.]. PMID:27235697

  3. Phylogenetic analysis accounting for age-dependent death and sampling with applications to epidemics.

    PubMed

    Lambert, Amaury; Alexander, Helen K; Stadler, Tanja

    2014-07-01

    The reconstruction of phylogenetic trees based on viral genetic sequence data sequentially sampled from an epidemic provides estimates of the past transmission dynamics, by fitting epidemiological models to these trees. To our knowledge, none of the epidemiological models currently used in phylogenetics can account for recovery rates and sampling rates dependent on the time elapsed since transmission, i.e. age of infection. Here we introduce an epidemiological model where infectives leave the epidemic, by either recovery or sampling, after some random time which may follow an arbitrary distribution. We derive an expression for the likelihood of the phylogenetic tree of sampled infectives under our general epidemiological model. The analytic concept developed in this paper will facilitate inference of past epidemiological dynamics and provide an analytical framework for performing very efficient simulations of phylogenetic trees under our model. The main idea of our analytic study is that the non-Markovian epidemiological model giving rise to phylogenetic trees growing vertically as time goes by can be represented by a Markovian "coalescent point process" growing horizontally by the sequential addition of pairs of coalescence and sampling times. As examples, we discuss two special cases of our general model, described in terms of influenza and HIV epidemics. Though phrased in epidemiological terms, our framework can also be used for instance to fit macroevolutionary models to phylogenies of extant and extinct species, accounting for general species lifetime distributions. PMID:24607743

  4. Phylogenetic inference of calyptrates, with the first mitogenomes for Gasterophilinae (Diptera: Oestridae) and Paramacronychiinae (Diptera: Sarcophagidae)

    PubMed Central

    Zhang, Dong; Yan, Liping; Zhang, Ming; Chu, Hongjun; Cao, Jie; Li, Kai; Hu, Defu; Pape, Thomas

    2016-01-01

    The complete mitogenome of the horse stomach bot fly Gasterophilus pecorum (Fabricius) and a near-complete mitogenome of Wohlfahrt's wound myiasis fly Wohlfahrtia magnifica (Schiner) were sequenced. The mitogenomes contain the typical 37 mitogenes found in metazoans, organized in the same order and orientation as in other cyclorrhaphan Diptera. Phylogenetic analyses of mitogenomes from 38 calyptrate taxa with and without two non-calyptrate outgroups were performed using Bayesian Inference and Maximum Likelihood. Three sub-analyses were performed on the concatenated data: (1) not partitioned; (2) partitioned by gene; (3) 3rd codon positions of protein-coding genes omitted. We estimated the contribution of each of the mitochondrial genes for phylogenetic analysis, as well as the effect of some popular methodologies on calyptrate phylogeny reconstruction. In the favoured trees, the Oestroidea are nested within the muscoid grade. Relationships at the family level within Oestroidea are (remaining Calliphoridae (Sarcophagidae (Oestridae, Pollenia + Tachinidae))). Our mito-phylogenetic reconstruction of the Calyptratae presents the most extensive taxon coverage so far, and the risk of long-branch attraction is reduced by an appropriate selection of outgroups. We find that in the Calyptratae the ND2, ND5, ND1, COIII, and COI genes are more phylogenetically informative compared with other mitochondrial protein-coding genes. Our study provides evidence that data partitioning and the inclusion of conserved tRNA genes have little influence on calyptrate phylogeny reconstruction, and that the 3rd codon positions of protein-coding genes are not saturated and therefore should be included. PMID:27019632

  5. Comparative analysis of mitochondrial genomes in Diplura (hexapoda, arthropoda): taxon sampling is crucial for phylogenetic inferences.

    PubMed

    Chen, Wan-Jun; Koch, Markus; Mallatt, Jon M; Luan, Yun-Xia

    2014-01-01

    Two-pronged bristletails (Diplura) are traditionally classified into three major superfamilies: Campodeoidea, Projapygoidea, and Japygoidea. The interrelationships of these three superfamilies and the monophyly of Diplura have been much debated. Few previous studies included Projapygoidea in their phylogenetic considerations, and its position within Diplura still is a puzzle from both morphological and molecular points of view. Until now, no mitochondrial genome has been sequenced for any projapygoid species. To fill in this gap, we determined and annotated the complete mitochondrial genome of Octostigma sinensis (Octostigmatidae, Projapygoidea), and of three more dipluran species, one each from the Campodeidae, Parajapygidae, and Japygidae. All four newly sequenced dipluran mtDNAs encode the same set of genes in the same gene order as shared by most crustaceans and hexapods. Secondary structure truncations have occurred in trnR, trnC, trnS1, and trnS2, and the reduction of transfer RNA D-arms was found to be taxonomically correlated, with Campodeoidea having experienced the most reduction. Partitioned phylogenetic analyses, based on both amino acids and nucleotides of the protein-coding genes plus the ribosomal RNA genes, retrieve significant support for a monophyletic Diplura within Pancrustacea, with Projapygoidea more closely related to Campodeoidea than to Japygoidea. Another key finding is that monophyly of Diplura cannot be recovered unless Projapygoidea is included in the phylogenetic analyses; this explains the dipluran polyphyly found by past mitogenomic studies. Including Projapygoidea increased the sampling density within Diplura and probably helped by breaking up a long-branch-attraction artifact. This finding provides an example of how proper sampling is significant for phylogenetic inference.

  6. Bryozoans are returning home: recolonization of freshwater ecosystems inferred from phylogenetic relationships

    PubMed Central

    Koletić, Nikola; Novosel, Maja; Rajević, Nives; Franjević, Damjan

    2015-01-01

    Bryozoans are aquatic invertebrates that inhabit all types of aquatic ecosystems. They are small animals that form large colonies by asexual budding. Colonies can reach the size of several tens of centimeters, while individual units within a colony are the size of a few millimeters. Each individual within a colony works as a separate zooid and is genetically identical to each other individual within the same colony. Most freshwater species of bryozoans belong to the Phylactolaemata class, while several species that tolerate brackish water belong to the Gymnolaemata class. Tissue samples for this study were collected in the rivers of Adriatic and Danube basin and in the wetland areas in the continental part of Croatia (Europe). Freshwater and brackish taxons of bryozoans were genetically analyzed for the purpose of creating phylogenetic relationships between freshwater and brackish taxons of the Phylactolaemata and Gymnolaemata classes and determining the role of brackish species in colonizing freshwater and marine ecosystems. Phylogenetic relationships inferred on the genes for 18S rRNA, 28S rRNA, COI, and ITS2 region confirmed Phylactolaemata bryozoans as radix bryozoan group. Phylogenetic analysis proved Phylactolaemata bryozoan's close relations with taxons from Phoronida phylum as well as the separation of the Lophopodidae family from other families within the Plumatellida genus. Comparative analysis of existing knowledge about the phylogeny of bryozoans and the expansion of known evolutionary hypotheses is proposed with the model of settlement of marine and freshwater ecosystems by the bryozoans group during their evolutionary past. In this case study, brackish bryozoan taxons represent a link for this ecological phylogenetic hypothesis. Comparison of brackish bryozoan species Lophopus crystallinus and Conopeum seurati confirmed a dual colonization of freshwater ecosystems throughout evolution of this group of animals. PMID:25691955

  7. Bryozoans are returning home: recolonization of freshwater ecosystems inferred from phylogenetic relationships.

    PubMed

    Koletić, Nikola; Novosel, Maja; Rajević, Nives; Franjević, Damjan

    2015-01-01

    Bryozoans are aquatic invertebrates that inhabit all types of aquatic ecosystems. They are small animals that form large colonies by asexual budding. Colonies can reach the size of several tens of centimeters, while individual units within a colony are the size of a few millimeters. Each individual within a colony works as a separate zooid and is genetically identical to each other individual within the same colony. Most freshwater species of bryozoans belong to the Phylactolaemata class, while several species that tolerate brackish water belong to the Gymnolaemata class. Tissue samples for this study were collected in the rivers of Adriatic and Danube basin and in the wetland areas in the continental part of Croatia (Europe). Freshwater and brackish taxons of bryozoans were genetically analyzed for the purpose of creating phylogenetic relationships between freshwater and brackish taxons of the Phylactolaemata and Gymnolaemata classes and determining the role of brackish species in colonizing freshwater and marine ecosystems. Phylogenetic relationships inferred on the genes for 18S rRNA, 28S rRNA, COI, and ITS2 region confirmed Phylactolaemata bryozoans as radix bryozoan group. Phylogenetic analysis proved Phylactolaemata bryozoan's close relations with taxons from Phoronida phylum as well as the separation of the Lophopodidae family from other families within the Plumatellida genus. Comparative analysis of existing knowledge about the phylogeny of bryozoans and the expansion of known evolutionary hypotheses is proposed with the model of settlement of marine and freshwater ecosystems by the bryozoans group during their evolutionary past. In this case study, brackish bryozoan taxons represent a link for this ecological phylogenetic hypothesis. Comparison of brackish bryozoan species Lophopus crystallinus and Conopeum seurati confirmed a dual colonization of freshwater ecosystems throughout evolution of this group of animals. PMID:25691955

  8. Bryozoans are returning home: recolonization of freshwater ecosystems inferred from phylogenetic relationships.

    PubMed

    Koletić, Nikola; Novosel, Maja; Rajević, Nives; Franjević, Damjan

    2015-01-01

    Bryozoans are aquatic invertebrates that inhabit all types of aquatic ecosystems. They are small animals that form large colonies by asexual budding. Colonies can reach the size of several tens of centimeters, while individual units within a colony are the size of a few millimeters. Each individual within a colony works as a separate zooid and is genetically identical to each other individual within the same colony. Most freshwater species of bryozoans belong to the Phylactolaemata class, while several species that tolerate brackish water belong to the Gymnolaemata class. Tissue samples for this study were collected in the rivers of Adriatic and Danube basin and in the wetland areas in the continental part of Croatia (Europe). Freshwater and brackish taxons of bryozoans were genetically analyzed for the purpose of creating phylogenetic relationships between freshwater and brackish taxons of the Phylactolaemata and Gymnolaemata classes and determining the role of brackish species in colonizing freshwater and marine ecosystems. Phylogenetic relationships inferred on the genes for 18S rRNA, 28S rRNA, COI, and ITS2 region confirmed Phylactolaemata bryozoans as radix bryozoan group. Phylogenetic analysis proved Phylactolaemata bryozoan's close relations with taxons from Phoronida phylum as well as the separation of the Lophopodidae family from other families within the Plumatellida genus. Comparative analysis of existing knowledge about the phylogeny of bryozoans and the expansion of known evolutionary hypotheses is proposed with the model of settlement of marine and freshwater ecosystems by the bryozoans group during their evolutionary past. In this case study, brackish bryozoan taxons represent a link for this ecological phylogenetic hypothesis. Comparison of brackish bryozoan species Lophopus crystallinus and Conopeum seurati confirmed a dual colonization of freshwater ecosystems throughout evolution of this group of animals.

  9. Streptococcus massiliensis in the human mouth: a phylogenetic approach for the inference of bacterial habitats.

    PubMed

    Póntigo, F; Silva, C; Moraga, M; Flores, S V

    2015-12-29

    Streptococcus is a diverse bacterial lineage. Species of this genus occupy a myriad of environments inside humans and other animals. Despite the elucidation of several of these habitats, many remain to be identified. Here, we explore a methodological approach to reveal unknown bacterial environments. Specifically, we inferred the phylogeny of the Mitis group by analyzing the sequences of eight genes. In addition, information regarding habitat use of species belonging to this group was obtained from the scientific literature. The oral cavity emerged as a potential, previously unknown, environment of Streptococcus massiliensis. This phylogeny-based prediction was confirmed by species-specific polymerase chain reaction (PCR) amplification. We propose employing a similar approach, i.e., use of bibliographic data and molecular phylogenetics as predictive methods, and species-specific PCR as confirmation, in order to reveal other unknown habitats in further bacterial taxa.

  10. Phylogenetic relationships of Pseudohynobius (Urodela, Hynobiidae) inferred from DNA barcoding analysis.

    PubMed

    Zhao, Y-Y; Su, L-N; Zhang, Z-M; Wang, X-Y

    2016-01-01

    As a proven tool, DNA barcoding can identify species rapidly and unambiguously. In this study, we used mtDNA cyt b, COI, and 16s rRNA sequences of six species of Pseudohynobius, Protohynobius puxiongensis, Liua shihi, Ranodon sibiricus, and Pachyhynobius shangchengensis, to reconstruct the phylogenetic relationships using Bayesian inference and maximum likelihood methods. Approximate lineage divergence times were also estimated, the divergence between them was calculated to have taken place mainly in Miocene. Our results showed that: 1) Ps. guizhouensis is an independent and valid species that is a sister species to Ps. kuankuoshuiensis; 2) five Pseudohynobius species formed a monophyletic group; 3) Ps. tsinpaensis is different from L. shihi, and should be classified as belonging to the Liua genus; and 4) Pr. puxiongensis is the sister lineage to all Pseudohynobius species, and should therefore be named Pseudohynobius puxiongensis. PMID:27420996

  11. Phylogenetic relationships in the coral family acroporidae, reassessed by inference from mitochondrial genes.

    PubMed

    Fukami, H; Omori, M; Hatta, M

    2000-07-01

    Phylogenetic relationships within the dominant reef coral family Acroporidae were inferred from the mitochondrial genes cytochrome b and ATPase 6. The rate of nucleotide substitution in the genes gave proper resolution to deduce genetic relationships between the genera in this family. The molecular phylogeny divided this family into three major lineages: the genera Astreopora, Montipora and Acropora. The genus Anacropora was included in the same clade as the genus Montipora, suggesting its recent speciation from Montipora. The subgenus Isopora was significantly distant from the subgenus Acropora. Taken together with morphological and reproductive differences, we propose that these two subgenera be classified as independent genera. The divergence times deduced from the genetic distances were consistent with the fossil record for the major genera. The results also suggest that the extant reef corals speciated and expanded very recently, probably after the Miocene, from single lineage which survived repeated extinction by climate changes during the Cenozoic era. PMID:18517306

  12. Nucleotide sequence of nifD from Frankia alni strain ArI3: phylogenetic inferences.

    PubMed

    Normand, P; Gouy, M; Cournoyer, B; Simonet, P

    1992-05-01

    The complete nucleotide sequence of the nifD gene encoding the alpha subunit of component I of nitrogenase from Frankia alni strain ArI3 was determined. The coding region is 1,458 bp in length and encodes a polypeptide of 486 residues with a predicted molecular weight of 53,500. Phylogenetic inferences with 12 complete published nifD sequences were drawn using a variety of approaches. Frankia nifD clusters with proteobacteria rather than with Clostridium pasteurianum, the other Gram-positive bacterium studied. Extant eubacterial nif genes seem to have at least three distinct evolutionary origins as a result of ancient gene duplications. Within the Gram-positive bacterial phylum, functional nif genes descend from different duplicates. PMID:1584016

  13. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenge phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p > or = 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sistergroup of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  14. Phylogenetic inferences reveal a large extent of novel biodiversity in chemically rich tropical marine cyanobacteria.

    PubMed

    Engene, Niclas; Gunasekera, Sarath P; Gerwick, William H; Paul, Valerie J

    2013-03-01

    Benthic marine cyanobacteria are known for their prolific biosynthetic capacities to produce structurally diverse secondary metabolites with biomedical application and their ability to form cyanobacterial harmful algal blooms. In an effort to provide taxonomic clarity to better guide future natural product drug discovery investigations and harmful algal bloom monitoring, this study investigated the taxonomy of tropical and subtropical natural product-producing marine cyanobacteria on the basis of their evolutionary relatedness. Our phylogenetic inferences of marine cyanobacterial strains responsible for over 100 bioactive secondary metabolites revealed an uneven taxonomic distribution, with a few groups being responsible for the vast majority of these molecules. Our data also suggest a high degree of novel biodiversity among natural product-producing strains that was previously overlooked by traditional morphology-based taxonomic approaches. This unrecognized biodiversity is primarily due to a lack of proper classification systems since the taxonomy of tropical and subtropical, benthic marine cyanobacteria has only recently been analyzed by phylogenetic methods. This evolutionary study provides a framework for a more robust classification system to better understand the taxonomy of tropical and subtropical marine cyanobacteria and the distribution of natural products in marine cyanobacteria.

  15. The Interrelationships of Placental Mammals and the Limits of Phylogenetic Inference.

    PubMed

    Tarver, James E; Dos Reis, Mario; Mirarab, Siavash; Moran, Raymond J; Parker, Sean; O'Reilly, Joseph E; King, Benjamin L; O'Connell, Mary J; Asher, Robert J; Warnow, Tandy; Peterson, Kevin J; Donoghue, Philip C J; Pisani, Davide

    2016-02-01

    Placental mammals comprise three principal clades: Afrotheria (e.g., elephants and tenrecs), Xenarthra (e.g., armadillos and sloths), and Boreoeutheria (all other placental mammals), the relationships among which are the subject of controversy and a touchstone for debate on the limits of phylogenetic inference. Previous analyses have found support for all three hypotheses, leading some to conclude that this phylogenetic problem might be impossible to resolve due to the compounded effects of incomplete lineage sorting (ILS) and a rapid radiation. Here we show, using a genome scale nucleotide data set, microRNAs, and the reanalysis of the three largest previously published amino acid data sets, that the root of Placentalia lies between Atlantogenata and Boreoeutheria. Although we found evidence for ILS in early placental evolution, we are able to reject previous conclusions that the placental root is a hard polytomy that cannot be resolved. Reanalyses of previous data sets recover Atlantogenata + Boreoeutheria and show that contradictory results are a consequence of poorly fitting evolutionary models; instead, when the evolutionary process is better-modeled, all data sets converge on Atlantogenata. Our Bayesian molecular clock analysis estimates that marsupials diverged from placentals 157-170 Ma, crown Placentalia diverged 86-100 Ma, and crown Atlantogenata diverged 84-97 Ma. Our results are compatible with placental diversification being driven by dispersal rather than vicariance mechanisms, postdating early phases in the protracted opening of the Atlantic Ocean. PMID:26733575

  16. The Interrelationships of Placental Mammals and the Limits of Phylogenetic Inference

    PubMed Central

    Tarver, James E.; dos Reis, Mario; Mirarab, Siavash; Moran, Raymond J.; Parker, Sean; O’Reilly, Joseph E.; King, Benjamin L.; O’Connell, Mary J.; Asher, Robert J.; Warnow, Tandy; Peterson, Kevin J.; Donoghue, Philip C.J.; Pisani, Davide

    2016-01-01

    Placental mammals comprise three principal clades: Afrotheria (e.g., elephants and tenrecs), Xenarthra (e.g., armadillos and sloths), and Boreoeutheria (all other placental mammals), the relationships among which are the subject of controversy and a touchstone for debate on the limits of phylogenetic inference. Previous analyses have found support for all three hypotheses, leading some to conclude that this phylogenetic problem might be impossible to resolve due to the compounded effects of incomplete lineage sorting (ILS) and a rapid radiation. Here we show, using a genome scale nucleotide data set, microRNAs, and the reanalysis of the three largest previously published amino acid data sets, that the root of Placentalia lies between Atlantogenata and Boreoeutheria. Although we found evidence for ILS in early placental evolution, we are able to reject previous conclusions that the placental root is a hard polytomy that cannot be resolved. Reanalyses of previous data sets recover Atlantogenata + Boreoeutheria and show that contradictory results are a consequence of poorly fitting evolutionary models; instead, when the evolutionary process is better-modeled, all data sets converge on Atlantogenata. Our Bayesian molecular clock analysis estimates that marsupials diverged from placentals 157–170 Ma, crown Placentalia diverged 86–100 Ma, and crown Atlantogenata diverged 84–97 Ma. Our results are compatible with placental diversification being driven by dispersal rather than vicariance mechanisms, postdating early phases in the protracted opening of the Atlantic Ocean. PMID:26733575

  17. Divergent Evolution Within Protein Superfolds Inferred from Profile-based Phylogenetics

    PubMed Central

    Theobald, Douglas L.; Wuttke, Deborah S.

    2006-01-01

    Many dissimilar protein sequences fold into similar structures. A central and persistent challenge facing protein structural analysis is the discrimination between homology and convergence for structurally similar domains that lack significant sequence similarity. Classic examples are the OB-fold and SH3 domains, both small, modular β-barrel protein superfolds. The similarities among these domains have variously been attributed to common descent or to convergent evolution. Using a sequence profile-based phylogenetic technique, we analyzed all structurally characterized OB-fold, SH3, and PDZ domains with less than 40% mutual sequence identity. An all-against-all, profile-versus-profile analysis of these domains revealed many previously undetectable significant interrelationships. The matrices of scores were used to infer phylogenies based on our derivation of the relationships between sequence similarity E-values and evolutionary distances. The resulting clades of domains correlate remarkably well with biological function, as opposed to structural similarity, indicating that the functionally distinct sub-families within these superfolds are homologous. This method extends phylogenetics into the challenging “twilight zone” of sequence similarity, providing the first objective resolution of deep evolutionary relationships among distant protein families. PMID:16266719

  18. Phylogenetic analysis of Leymus (Poaceae: Triticeae) inferred from nuclear rDNA ITS sequences.

    PubMed

    Sha, Li-Na; Yang, Rui-Wu; Fan, Xing; Wang, Xiao-Li; Zhou, Yong-Hong

    2008-10-01

    To investigate the phylogenetic relationships of polyploid Leymus (Poaceae: Triticeae), sequences of the nuclear rDNA internal transcribed spacer region (ITS) were analyzed for 34 Leymus accessions representing 25 species, together with three Psathyrostachys species (Ns genome), two Pseudoroegneria (St genome) species, Lophopyrum elongatum (E(e) genome), and Thinopyrum bessarabicum (E(b) genome). The phylogenetic analyses (maximum likelihood and Bayesian inference) supported two major clades, one including 21 Leymus species and three Psathyrostachys species, the other with nine Leymus species and four diploid species. The ITS RNA secondary structure of the Leymus species was compared with that of their putative diploid donor. It is suggested that (1) the species from the same areas or neighboring geographic regions are closely related to each other; (2) L. coreanus, L. duthiei, L. duthiei var. longearistatus, and L. komarovii are closely related to other Leymus species, and it is reasonable to transfer these species from the genus Hystrix to Leymus; (3) the ITS sequences of Leymus are evolutionarily distinct; (4) the different Leymus species and different distribution of a species derived their Ns genome from different Psathyrostachys species; and (5) there is a close relationship among Leymus, Pseudoroegneria, Lophopyrum, and Thinopyrum, but it is difficult to presume that the St, E(e), and E(b) genome may be the Xm genome donor of the Leymus species.

  19. Multi-locus phylogenetic inference among New World Vultures (Aves: Cathartidae).

    PubMed

    Johnson, Jeff A; Brown, Joseph W; Fuchs, Jérôme; Mindell, David P

    2016-12-01

    New World Vultures are large-bodied carrion feeding birds in the family Cathartidae, currently consisting of seven species from five genera with geographic distributions in North and South America. No study to date has included all cathartid species in a single phylogenetic analysis. In this study, we investigated the phylogenetic relationships among all cathartid species using five nuclear (nuc; 4060bp) and two mitochondrial (mt; 2165bp) DNA loci with fossil calibrated gene tree (27 outgroup taxa) and coalescent-based species tree (2 outgroup taxa) analyses. We also included an additional four nuclear loci (2578bp) for the species tree analysis to explore changes in nodal support values. Although the stem lineage is inferred to have originated ∼69 million years ago (Ma; 74.5-64.9 credible interval), a more recent basal split within Cathartidae was recovered at ∼14Ma (17.1-11.1 credible interval). Two primary clades were identified: (1) Black Vulture (Coragyps atratus) together with the three Cathartes species (Lesser C. burrovianus and Greater C. melambrotus Yellow-headed Vultures, and Turkey Vulture C. aura), and (2) King Vulture (Sarcoramphus papa), California (Gymnogyps californianus) and Andean (Vultur gryphus) Condors. Support for taxon relationships within the two basal clades were inconsistent between analyses with the exception of Black Vulture sister to a monophyletic Cathartes clade. Increased support for a yellow-headed vulture clade was recovered in the species tree analysis using the four additional nuclear loci. Overall, these results are in agreement with cathartid life history (e.g. olfaction ability and behavior) and contrasting habitat affinities among sister taxa with overlapping geographic distributions. More research is needed using additional molecular loci to further resolve the phylogenetic relationships within the two basal cathartid clades, as speciation appeared to have occurred in a relatively short period of time.

  20. Phylogenetic relationship among genera of Polymorphidae (Acanthocephala), inferred from nuclear and mitochondrial gene sequences.

    PubMed

    García-Varela, Martín; Pérez-Ponce de León, Gerardo; Aznar, Francisco J; Nadler, Steven A

    2013-08-01

    Acanthocephalans of the family Polymorphidae Meyer, 1931 are obligate endoparasites with complex life cycles. These worms use vertebrates (marine mammals, fish-eating birds and waterfowl) as definitive hosts and invertebrates (amphipods, decapods and euphausiids) as intermediate hosts to complete their life cycle. Polymorphidae has a wordwide distribution, containing 12 genera, with approximately 127 species. The family is diagnosed by having a spinose trunk, bulbose proboscis, double-walled proboscis receptacle, and usually four to eight tubular cement glands. To conduct a phylogenetic analysis, in the current study sequences of the small (18S) and large-subunit (28S) ribosomal RNA, and cytochrome c oxidase subunit 1 (cox 1) were generated for 27 taxa representing 10 of 12 genera of Polymorphidae, plus three additional species of acanthocephalans that were used as outgroups. Maximum likelihood (ML), maximum parsimony (MP), and Bayesian analyses were conducted on a combined nuclear rRNA (18S+28S) data set and on a concatenated dataset of nuclear plus one mitochondrial gene (18S+28S+cox 1). Phylogenetic analyses inferred with the concatenated dataset of three genes support the monophyly of nine genera (Andracantha, Corynosoma, Bolbosoma, Profilicollis, Pseudocorynosoma, Southwellina, Arhythmorhynchus, Hexaglandula and Ibirhynchus). However, the four sampled species of Polymorphus were nested within several clades, indicating that these species do not share a common ancestor, requiring further taxonomic revision using phylogenetic systematics, and reexamination of morphological and ecological data. By mapping definitive and intermediate host association onto the resulting cladogram, we observe that aquatic birds were the ancestral definitive hosts for the family with a secondary colonization and diversification to marine mammals. Whereas amphipods were ancestral intermediate hosts and that the association with decapods represent episodes of secondary colonization

  1. Multi-locus phylogenetic inference among New World Vultures (Aves: Cathartidae).

    PubMed

    Johnson, Jeff A; Brown, Joseph W; Fuchs, Jérôme; Mindell, David P

    2016-12-01

    New World Vultures are large-bodied carrion feeding birds in the family Cathartidae, currently consisting of seven species from five genera with geographic distributions in North and South America. No study to date has included all cathartid species in a single phylogenetic analysis. In this study, we investigated the phylogenetic relationships among all cathartid species using five nuclear (nuc; 4060bp) and two mitochondrial (mt; 2165bp) DNA loci with fossil calibrated gene tree (27 outgroup taxa) and coalescent-based species tree (2 outgroup taxa) analyses. We also included an additional four nuclear loci (2578bp) for the species tree analysis to explore changes in nodal support values. Although the stem lineage is inferred to have originated ∼69 million years ago (Ma; 74.5-64.9 credible interval), a more recent basal split within Cathartidae was recovered at ∼14Ma (17.1-11.1 credible interval). Two primary clades were identified: (1) Black Vulture (Coragyps atratus) together with the three Cathartes species (Lesser C. burrovianus and Greater C. melambrotus Yellow-headed Vultures, and Turkey Vulture C. aura), and (2) King Vulture (Sarcoramphus papa), California (Gymnogyps californianus) and Andean (Vultur gryphus) Condors. Support for taxon relationships within the two basal clades were inconsistent between analyses with the exception of Black Vulture sister to a monophyletic Cathartes clade. Increased support for a yellow-headed vulture clade was recovered in the species tree analysis using the four additional nuclear loci. Overall, these results are in agreement with cathartid life history (e.g. olfaction ability and behavior) and contrasting habitat affinities among sister taxa with overlapping geographic distributions. More research is needed using additional molecular loci to further resolve the phylogenetic relationships within the two basal cathartid clades, as speciation appeared to have occurred in a relatively short period of time. PMID:27601346

  2. Phylogenetic inference in Rafflesiales: the influence of rate heterogeneity and horizontal gene transfer

    PubMed Central

    Nickrent, Daniel L; Blarer, Albert; Qiu, Yin-Long; Vidal-Russell, Romina; Anderson, Frank E

    2004-01-01

    Background The phylogenetic relationships among the holoparasites of Rafflesiales have remained enigmatic for over a century. Recent molecular phylogenetic studies using the mitochondrial matR gene placed Rafflesia, Rhizanthes and Sapria (Rafflesiaceae s. str.) in the angiosperm order Malpighiales and Mitrastema (Mitrastemonaceae) in Ericales. These phylogenetic studies did not, however, sample two additional groups traditionally classified within Rafflesiales (Apodantheaceae and Cytinaceae). Here we provide molecular phylogenetic evidence using DNA sequence data from mitochondrial and nuclear genes for representatives of all genera in Rafflesiales. Results Our analyses indicate that the phylogenetic affinities of the large-flowered clade and Mitrastema, ascertained using mitochondrial matR, are congruent with results from nuclear SSU rDNA when these data are analyzed using maximum likelihood and Bayesian methods. The relationship of Cytinaceae to Malvales was recovered in all analyses. Relationships between Apodanthaceae and photosynthetic angiosperms varied depending upon the data partition: Malvales (3-gene), Cucurbitales (matR) or Fabales (atp1). The latter incongruencies suggest that horizontal gene transfer (HGT) may be affecting the mitochondrial gene topologies. The lack of association between Mitrastema and Ericales using atp1 is suggestive of HGT, but greater sampling within eudicots is needed to test this hypothesis further. Conclusions Rafflesiales are not monophyletic but composed of three or four independent lineages (families): Rafflesiaceae, Mitrastemonaceae, Apodanthaceae and Cytinaceae. Long-branch attraction appears to be misleading parsimony analyses of nuclear small-subunit rDNA data, but model-based methods (maximum likelihood and Bayesian analyses) recover a topology that is congruent with the mitochondrial matR gene tree, thus providing compelling evidence for organismal relationships. Horizontal gene transfer appears to be influencing only

  3. Automatic detection of key innovations, rate shifts, and diversity-dependence on phylogenetic trees.

    PubMed

    Rabosky, Daniel L

    2014-01-01

    A number of methods have been developed to infer differential rates of species diversification through time and among clades using time-calibrated phylogenetic trees. However, we lack a general framework that can delineate and quantify heterogeneous mixtures of dynamic processes within single phylogenies. I developed a method that can identify arbitrary numbers of time-varying diversification processes on phylogenies without specifying their locations in advance. The method uses reversible-jump Markov Chain Monte Carlo to move between model subspaces that vary in the number of distinct diversification regimes. The model assumes that changes in evolutionary regimes occur across the branches of phylogenetic trees under a compound Poisson process and explicitly accounts for rate variation through time and among lineages. Using simulated datasets, I demonstrate that the method can be used to quantify complex mixtures of time-dependent, diversity-dependent, and constant-rate diversification processes. I compared the performance of the method to the MEDUSA model of rate variation among lineages. As an empirical example, I analyzed the history of speciation and extinction during the radiation of modern whales. The method described here will greatly facilitate the exploration of macroevolutionary dynamics across large phylogenetic trees, which may have been shaped by heterogeneous mixtures of distinct evolutionary processes. PMID:24586858

  4. Inferring models of multiscale copy number evolution for single-tumor phylogenetics

    PubMed Central

    Chowdhury, Salim Akhter; Gertz, E. Michael; Wangsa, Darawalee; Heselmeyer-Haddad, Kerstin; Ried, Thomas; Schäffer, Alejandro A.; Schwartz, Russell

    2015-01-01

    Motivation: Phylogenetic algorithms have begun to see widespread use in cancer research to reconstruct processes of evolution in tumor progression. Developing reliable phylogenies for tumor data requires quantitative models of cancer evolution that include the unusual genetic mechanisms by which tumors evolve, such as chromosome abnormalities, and allow for heterogeneity between tumor types and individual patients. Previous work on inferring phylogenies of single tumors by copy number evolution assumed models of uniform rates of genomic gain and loss across different genomic sites and scales, a substantial oversimplification necessitated by a lack of algorithms and quantitative parameters for fitting to more realistic tumor evolution models. Results: We propose a framework for inferring models of tumor progression from single-cell gene copy number data, including variable rates for different gain and loss events. We propose a new algorithm for identification of most parsimonious combinations of single gene and single chromosome events. We extend it via dynamic programming to include genome duplications. We implement an expectation maximization (EM)-like method to estimate mutation-specific and tumor-specific event rates concurrently with tree reconstruction. Application of our algorithms to real cervical cancer data identifies key genomic events in disease progression consistent with prior literature. Classification experiments on cervical and tongue cancer datasets lead to improved prediction accuracy for the metastasis of primary cervical cancers and for tongue cancer survival. Availability and implementation: Our software (FISHtrees) and two datasets are available at ftp://ftp.ncbi.nlm.nih.gov/pub/FISHtrees. Contact: russells@andrew.cmu.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26072490

  5. Phylogenetic inference and comparative evolution of a complex microsatellite and its flanking regions in carnivores.

    PubMed

    Domingo-Roura, Xavier; López-Giráldez, Francesc; Saeki, Midori; Marmi, Josep

    2005-06-01

    We sequenced locus Mel 08, with complex short repetitive motifs, in 24 carnivore species belonging to five different families in order to explore mutational changes in the region in the context of locus and species evolution. This non-coding locus includes up to four different parts or repetitive motifs showing size variability. The variability consists of repeat additions and deletions; substitutions, insertions and/or deletions creating interruptions in the repeat; and substitutions, insertions and deletions in the flanking regions. The locus has different repeat expansions in different carnivore subfamilies. We hypothesize that the complexity of this locus is due to a high mutation rate at an ancestral DNA sequence and, thus, prompts the emergence of repeats at mutational hotspots. High levels of homoplasy were evident, with nine electromorphs representing 28 haplotypes never shared across species. The variability in flanking regions was informative for phylogenetic inference and their evolutionary content. Tree topologies were congruent with relevant hypotheses on current conflicts in carnivore phylogenies, such as: (i) the monophyly of Lutrinae, (ii) the paraphyly of Mustelinae, (iii) the basal position of the Eurasian badger, Meles meles , in the Mustelidae, (iv) the classification of skunks as a separate family, Mephitidae, and (v) the placement of the red panda, Ailurus fulgens , as a monotypic family, Ailuridae, at a basal position in the Musteloidea.

  6. Bayesian Inference of the Evolution of a Phenotype Distribution on a Phylogenetic Tree

    PubMed Central

    Ansari, M. Azim; Didelot, Xavier

    2016-01-01

    The distribution of a phenotype on a phylogenetic tree is often a quantity of interest. Many phenotypes have imperfect heritability, so that a measurement of the phenotype for an individual can be thought of as a single realization from the phenotype distribution of that individual. If all individuals in a phylogeny had the same phenotype distribution, measured phenotypes would be randomly distributed on the tree leaves. This is, however, often not the case, implying that the phenotype distribution evolves over time. Here we propose a new model based on this principle of evolving phenotype distribution on the branches of a phylogeny, which is different from ancestral state reconstruction where the phenotype itself is assumed to evolve. We develop an efficient Bayesian inference method to estimate the parameters of our model and to test the evidence for changes in the phenotype distribution. We use multiple simulated data sets to show that our algorithm has good sensitivity and specificity properties. Since our method identifies branches on the tree on which the phenotype distribution has changed, it is able to break down a tree into components for which this distribution is unique and constant. We present two applications of our method, one investigating the association between HIV genetic variation and human leukocyte antigen and the other studying host range distribution in a lineage of Salmonella enterica, and we discuss many other potential applications. PMID:27412711

  7. Inferring the temperature dependence of population parameters: the effects of experimental design and inference algorithm

    PubMed Central

    Palamara, Gian Marco; Childs, Dylan Z; Clements, Christopher F; Petchey, Owen L; Plebani, Marco; Smith, Matthew J

    2014-01-01

    Understanding and quantifying the temperature dependence of population parameters, such as intrinsic growth rate and carrying capacity, is critical for predicting the ecological responses to environmental change. Many studies provide empirical estimates of such temperature dependencies, but a thorough investigation of the methods used to infer them has not been performed yet. We created artificial population time series using a stochastic logistic model parameterized with the Arrhenius equation, so that activation energy drives the temperature dependence of population parameters. We simulated different experimental designs and used different inference methods, varying the likelihood functions and other aspects of the parameter estimation methods. Finally, we applied the best performing inference methods to real data for the species Paramecium caudatum. The relative error of the estimates of activation energy varied between 5% and 30%. The fraction of habitat sampled played the most important role in determining the relative error; sampling at least 1% of the habitat kept it below 50%. We found that methods that simultaneously use all time series data (direct methods) and methods that estimate population parameters separately for each temperature (indirect methods) are complementary. Indirect methods provide a clearer insight into the shape of the functional form describing the temperature dependence of population parameters; direct methods enable a more accurate estimation of the parameters of such functional forms. Using both methods, we found that growth rate and carrying capacity of Paramecium caudatum scale with temperature according to different activation energies. Our study shows how careful choice of experimental design and inference methods can increase the accuracy of the inferred relationships between temperature and population parameters. The comparison of estimation methods provided here can increase the accuracy of model predictions, with important

  8. Inferring the temperature dependence of population parameters: the effects of experimental design and inference algorithm.

    PubMed

    Palamara, Gian Marco; Childs, Dylan Z; Clements, Christopher F; Petchey, Owen L; Plebani, Marco; Smith, Matthew J

    2014-12-01

    Understanding and quantifying the temperature dependence of population parameters, such as intrinsic growth rate and carrying capacity, is critical for predicting the ecological responses to environmental change. Many studies provide empirical estimates of such temperature dependencies, but a thorough investigation of the methods used to infer them has not been performed yet. We created artificial population time series using a stochastic logistic model parameterized with the Arrhenius equation, so that activation energy drives the temperature dependence of population parameters. We simulated different experimental designs and used different inference methods, varying the likelihood functions and other aspects of the parameter estimation methods. Finally, we applied the best performing inference methods to real data for the species Paramecium caudatum. The relative error of the estimates of activation energy varied between 5% and 30%. The fraction of habitat sampled played the most important role in determining the relative error; sampling at least 1% of the habitat kept it below 50%. We found that methods that simultaneously use all time series data (direct methods) and methods that estimate population parameters separately for each temperature (indirect methods) are complementary. Indirect methods provide a clearer insight into the shape of the functional form describing the temperature dependence of population parameters; direct methods enable a more accurate estimation of the parameters of such functional forms. Using both methods, we found that growth rate and carrying capacity of Paramecium caudatum scale with temperature according to different activation energies. Our study shows how careful choice of experimental design and inference methods can increase the accuracy of the inferred relationships between temperature and population parameters. The comparison of estimation methods provided here can increase the accuracy of model predictions, with important

  9. Phylogenetic inference of reciprocal effects between geographic range evolution and diversification.

    PubMed

    Goldberg, Emma E; Lancaster, Lesley T; Ree, Richard H

    2011-07-01

    Geographic characters--traits describing the spatial distribution of a species-may both affect and be affected by processes associated with lineage birth and death. This is potentially confounding to comparative analyses of species distributions because current models do not allow reciprocal interactions between the evolution of ranges and the growth of phylogenetic trees. Here, we introduce a likelihood-based approach to estimating region-dependent rates of speciation, extinction, and range evolution from a phylogeny, using a new model in which these processes are interdependent. We demonstrate the method with simulation tests that accurately recover parameters relating to the mode of speciation and source-sink dynamics. We then apply it to the evolution of habitat occupancy in Californian plant communities, where we find higher rates of speciation in chaparral than in forests and evidence for expanding habitat tolerances.

  10. Conflicting genomic signals affect phylogenetic inference in four species of North American pines

    PubMed Central

    Koralewski, Tomasz E.; Mateos, Mariana; Krutovsky, Konstantin V.

    2016-01-01

    Adaptive evolutionary processes in plants may be accompanied by episodes of introgression, parallel evolution and incomplete lineage sorting that pose challenges in untangling species evolutionary history. Genus Pinus (pines) is one of the most abundant and most studied groups among gymnosperms, and a good example of a lineage where these phenomena have been observed. Pines are among the most ecologically and economically important plant species. Some, such as the pines of the southeastern USA (southern pines in subsection Australes), are subjects of intensive breeding programmes. Despite numerous published studies, the evolutionary history of Australes remains ambiguous and often controversial. We studied the phylogeny of four major southern pine species: shortleaf (Pinus echinata), slash (P. elliottii), longleaf (P. palustris) and loblolly (P. taeda), using sequences from 11 nuclear loci and maximum likelihood and Bayesian methods. Our analysis encountered resolution difficulties similar to earlier published studies. Although incomplete lineage sorting and introgression are two phenomena presumptively underlying our results, the phylogenetic inferences seem to be also influenced by the genes examined, with certain topologies supported by sets of genes sharing common putative functionalities. For example, genes involved in wood formation supported the clade echinata–taeda, genes linked to plant defence supported the clade echinata–elliottii and genes linked to water management properties supported the clade echinata–palustris. The support for these clades was very high and consistent across methods. We discuss the potential factors that could underlie these observations, including incomplete lineage sorting, hybridization and parallel or adaptive evolution. Our results likely reflect the relatively short evolutionary history of the subsection that is thought to have begun during the middle Miocene and has been influenced by climate fluctuations. PMID

  11. Phylogenetics.

    PubMed

    Sleator, Roy D

    2011-04-01

    The recent rapid expansion in the DNA and protein databases, arising from large-scale genomic and metagenomic sequence projects, has forced significant development in the field of phylogenetics: the study of the evolutionary relatedness of the planet's inhabitants. Advances in phylogenetic analysis have greatly transformed our view of the landscape of evolutionary biology, transcending the view of the tree of life that has shaped evolutionary theory since Darwinian times. Indeed, modern phylogenetic analysis no longer focuses on the restricted Darwinian-Mendelian model of vertical gene transfer, but must also consider the significant degree of lateral gene transfer, which connects and shapes almost all living things. Herein, I review the major tree-building methods, their strengths, weaknesses and future prospects. PMID:21249334

  12. Bears in a forest of gene trees: phylogenetic inference is complicated by incomplete lineage sorting and gene flow.

    PubMed

    Kutschera, Verena E; Bidon, Tobias; Hailer, Frank; Rodi, Julia L; Fain, Steven R; Janke, Axel

    2014-08-01

    Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal.

  13. Bears in a Forest of Gene Trees: Phylogenetic Inference Is Complicated by Incomplete Lineage Sorting and Gene Flow

    PubMed Central

    Kutschera, Verena E.; Bidon, Tobias; Hailer, Frank; Rodi, Julia L.; Fain, Steven R.; Janke, Axel

    2014-01-01

    Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. PMID:24903145

  14. Bears in a forest of gene trees: phylogenetic inference is complicated by incomplete lineage sorting and gene flow.

    PubMed

    Kutschera, Verena E; Bidon, Tobias; Hailer, Frank; Rodi, Julia L; Fain, Steven R; Janke, Axel

    2014-08-01

    Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. PMID:24903145

  15. Phylogenetic relationships and biogeography of the desert plant genus Fagonia (Zygophyllaceae), inferred by parsimony and Bayesian model averaging.

    PubMed

    Beier, B-A; Nylander, J A A; Chase, M W; Thulin, M

    2004-10-01

    Phylogenetic relationships within Fagonia were inferred from analyses of plastid trnL intron and nuclear ribosomal ITS DNA sequences. Sampling of the genus was nearly complete, including 32 of 34 species. Phylogenetic analysis was carried out using parsimony, and Bayesian model averaging. The latter method allows model-based inference while accounting for model-selection uncertainty, and is here used for the first time in phylogenetic analyses. All species of Fagonia in the Old World, except F. cretica, form a weakly supported clade, and all Fagonia species of the New World, except F. scoparia, are well supported as sister to the Old World clade. Fagonia scoparia, from Mexico, and F. cretica, from Northern Africa, are well supported as sisters to all other Fagonia species. Vicariance-dispersal analysis, using DIVA, indicated that the occurrences of Fagonia in South America and southern Africa are due to dispersals, and also, that the ancestor of Fagonia had a distribution compatible with the boreotropics hypothesis. PMID:15324841

  16. Gene sampling can bias multi-gene phylogenetic inferences: the relationship between red algae and green plants as a case study.

    PubMed

    Inagaki, Yuji; Nakajima, Yoshihiro; Sato, Mitsuhisa; Sakaguchi, Miako; Hashimoto, Tetsuo

    2009-05-01

    The monophyly of Plantae including glaucophytes, red algae, and green plants (green algae plus land plants) has been recovered in recent phylogenetic analyses of large multi-gene data sets (e.g., those including >30,000 amino acid [aa] positions). On the other hand, Plantae monophyly has not been stably reconstructed in inferences from multi-gene data sets with fewer than 10,000 aa positions. An analysis of 5,216 aa positions in Nozaki et al. (Nozaki H, Iseki M, Hasegawa M, Misawa K, Nakada T, Sasaki N, Watanabe M. 2007. Phylogeny of primary photosynthetic eukaryotes as deduced from slowly evolving nuclear genes. Mol Biol Evol. 24:1592-1595.) strongly rejected the monophyly of Plantae, whereas Hackett et al. (Hackett JD, Yoon HS, Li S, Reyes-Prieto A, Rummele SE, Bhattacharya D. 2007. Phylogenomic analysis supports the monophyly of cryptophytes and haptophytes and the association of rhizaria with chromalveolates. Mol Biol Evol. 24:1702-1713.) robustly recovered the Plantae clade in an analysis of 6,735 aa positions. We suspected that the significant incongruity observed between the two studies was attributable to a bias generally overlooked in multi-gene phylogenetic estimation, rather than data size, taxon sampling, or methods for tree reconstruction. Although glaucophytes were excluded from our analyses due to a shortage of sequence data, we found that the recovery of a sister-group relationship between red algae and green plants primarily depends on gene sampling in phylogenetic inferences from <10,000 aa positions. Phylogenetic analyses of data sets with fewer than 10,000 aa positions, which can be prepared without large-scale sequencing (e.g., expressed sequence tag analyses), are practical in challenging various unresolved issues in eukaryotic evolution. However, our results indicate that severe biases can arise from gene sampling in multi-gene inferences from <10,000 aa positions. We also address the validity of fast-evolving gene exclusion in multi

  17. Learning Probabilistic Inference through Spike-Timing-Dependent Plasticity123

    PubMed Central

    Pecevski, Dejan

    2016-01-01

    Abstract Numerous experimental data show that the brain is able to extract information from complex, uncertain, and often ambiguous experiences. Furthermore, it can use such learnt information for decision making through probabilistic inference. Several models have been proposed that aim at explaining how probabilistic inference could be performed by networks of neurons in the brain. We propose here a model that can also explain how such neural network could acquire the necessary information for that from examples. We show that spike-timing-dependent plasticity in combination with intrinsic plasticity generates in ensembles of pyramidal cells with lateral inhibition a fundamental building block for that: probabilistic associations between neurons that represent through their firing current values of random variables. Furthermore, by combining such adaptive network motifs in a recursive manner the resulting network is enabled to extract statistical information from complex input streams, and to build an internal model for the distribution p* that generates the examples it receives. This holds even if p* contains higher-order moments. The analysis of this learning process is supported by a rigorous theoretical foundation. Furthermore, we show that the network can use the learnt internal model immediately for prediction, decision making, and other types of probabilistic inference. PMID:27419214

  18. Phylogenetic relationships and ancestral areas of the bustards (Gruiformes: Otididae), inferred from mitochondrial DNA and nuclear intron sequences.

    PubMed

    Pitra, Christian; Lieckfeldt, Dietmar; Frahnert, Sylke; Fickel, Joerns

    2002-04-01

    The taxonomy of the bustards is poorly understood phylogenetically and has not been extensively evaluated using molecular methods. We sequenced part of the mitochondrial cytochrome b gene, the control region (central domain II), and an intron-exon crossing fragment of the nuclear chromo-helicase-DNA binding gene (CHD1) in 27 bustard taxa (including multiple subspecies) representing 11 genera and four gruiform outgroup species. Molecular datings suggest a Miocene origin for the family. Inferred phylogenetic relationships include the following: (i) the basal polytomy consists of 10 branches (mostly consistent with traditional genera), suggesting a rapid early radiation; (ii) sister relationships between several couplets of genera include Ardeotis with Neotis, Afrotis with Eupodotis (excluding E. rueppellii), Otis with Chlamydotis, and Houbaropsis with Sypheotides; (iii) the genus Eupodotis may be polyphyletic; and (iv) the currently delimited genera Ardeotis and Neotis do not form independent monophyletic lineages. Molecular evidence for the Afro-tropical origin of the Otididae is provided. PMID:12182403

  19. Computerized assessment of communication for cognitive stimulation for people with cognitive decline using spectral-distortion measures and phylogenetic inference.

    PubMed

    Pham, Tuan D; Oyama-Higa, Mayumi; Truong, Cong-Thang; Okamoto, Kazushi; Futaba, Terufumi; Kanemoto, Shigeru; Sugiyama, Masahide; Lampe, Lisa

    2015-01-01

    Therapeutic communication and interpersonal relationships in care homes can help people to improve their mental wellbeing. Assessment of the efficacy of these dynamic and complex processes are necessary for psychosocial planning and management. This paper presents a pilot application of photoplethysmography in synchronized physiological measurements of communications between the care-giver and people with dementia. Signal-based evaluations of the therapy can be carried out using the measures of spectral distortion and the inference of phylogenetic trees. The proposed computational models can be of assistance and cost-effectiveness in caring for and monitoring people with cognitive decline. PMID:25803586

  20. Phylogenetic relationships of extant zokors (Myospalacinae) (Rodentia, Spalacidae) inferred from mitochondrial DNA sequences.

    PubMed

    Su, Junhu; Ji, Weihong; Wang, Jing; Gleeson, Dianne M; Zhou, Janwei; Hua, Limin; Wei, Yanming

    2014-04-01

    In this study, we use three mitochondrial markers, cytochrome b gene (Cyt b), NADH dehydrogenase subunit 4 (ND4) and control region (D-loop) to investigate the phylogenetic relationships of extant zokor species in Mysopalacinae. The phylogenetic tree constructed based on Cyt b strongly supports the monophyly genera Eospalax and Myospalax with E. fontanierii being the most ancient species in Eospalax. Further phylogenetic analyses of four species of Eospalax based on ND4 and D-loop sequences revealed two clades that correspond to two geographical distributions. The basal clade includes E. cansus which is mainly found on Loess Plateau (LP) and another clade including E. baileyi, E. smithii and E. rufescens that inhabits areas above 2000 m on Qinghai-Tibetan Plateau (QTP) and Qinling Mountains. Geographical events of QTP and LP may have played a major role in the diversification and evolution of Mysopalacinae.

  1. Molecular Phylogenetics of the Genus Trichosporon Inferred from Mitochondrial Cytochrome b Gene Sequences

    PubMed Central

    Biswas, Swarajit Kumar; Wang, Li; Yokoyama, Koji; Nishimura, Kazuko

    2005-01-01

    Mitochondrial cytochrome b (cyt b) genes of 42 strains representing 23 species of the genus Trichosporon were partially sequenced to determine their molecular phylogenetic relationships. Almost half of the 22 strains investigated (from 11 different species) contained introns in their sequences. Analysis of a 396-bp coding sequence from each strain of Trichosporon under investigation showed a total of 141 (35.6%) variable nucleotide sites. A phylogenetic tree based on the cyt b gene sequences revealed that all species of Trichosporon except Trichosporon domesticum and Trichosporon montevideense had species-specific cyt b genes. Trichosporon sp. strain CBS 5581 was identified as Trichosporon pullulans, and one clinical isolate, IFM 48794, was identified as Trichosporon faecale. Analysis of 132-bp deduced amino acid sequences showed a total of 34 (25.75%) variable amino acid sites. T. domesticum and T. montevideense, Trichosporon asahii and Trichosporon asteroides, and Trichosporon gracile and Trichosporon guehoae had identical amino acid sequences. A phylogenetic tree constructed with the ascomycetes Saccharomyces douglasii and Candida glabrata taken as outgroup species and including representative species from closely related genera species of Trichosporon clustered with other basidiomycetous yeasts that contain xylose in their cell wall compositions. These results indicate the effectiveness of mitochondrial cyt b gene sequences for both species identification and the phylogenetic analysis of Trichosporon species. PMID:16207980

  2. Genetic variation and phylogenetic relationship analysis of Jatropha curcas L. inferred from nrDNA ITS sequences.

    PubMed

    Guo, Guo-Ye; Chen, Fang; Shi, Xiao-Dong; Tian, Yin-Shuai; Yu, Mao-Qun; Han, Xue-Qin; Yuan, Li-Chun; Zhang, Ying

    2016-01-01

    Genetic variation and phylogenetic relationships among 102 Jatropha curcas accessions from Asia, Africa, and the Americas were assessed using the internal transcribed spacer region of nuclear ribosomal DNA (nrDNA ITS). The average G+C content (65.04%) was considerably higher than the A+T (34.96%) content. The estimated genetic diversity revealed moderate genetic variation. The pairwise genetic divergences (GD) between haplotypes were evaluated and ranged from 0.000 to 0.017, suggesting a higher level of genetic differentiation in Mexican accessions than those of other regions. Phylogenetic relationships and intraspecific divergence were inferred by Bayesian inference (BI), maximum parsimony (MP), and median joining (MJ) network analysis and were generally resolved. The J. curcas accessions were consistently divided into three lineages, groups A, B, and C, which demonstrated distant geographical isolation and genetic divergence between American accessions and those from other regions. The MJ network analysis confirmed that Central America was the possible center of origin. The putative migration route suggested that J. curcas was distributed from Mexico or Brazil, via Cape Verde and then split into two routes. One route was dispersed to Spain, then migrated to China, eventually spreading to southeastern Asia, while the other route was dispersed to Africa, via Madagascar and migrated to China, later spreading to southeastern Asia. PMID:27461559

  3. Phylogenetic relationships of true butterflies (Lepidoptera: Papilionoidea) inferred from COI, 16S rRNA and EF-1α sequences.

    PubMed

    Kim, Man Il; Wan, Xinlong; Kim, Min Jee; Jeong, Heon Cheon; Ahn, Neung-Ho; Kim, Ki-Gyoung; Han, Yeon Soo; Kim, Iksoo

    2010-11-01

    The molecular phylogenetic relationships among true butterfly families (superfamily Papilionoidea) have been a matter of substantial controversy; this debate has led to several competing hypotheses. Two of the most compelling of those hypotheses involve the relationships of (Nymphalidae + Lycaenidae) + (Pieridae + Papilionidae) and (((Nymphalidae + Lycaenidae) + Pieridae) + Papilionidae). In this study, approximately 3,500 nucleotide sequences from cytochrome oxidase subunit I (COI), 16S ribosomal RNA (16S rRNA), and elongation factor-1 alpha (EF-1α) were sequenced from 83 species belonging to four true butterfly families, along with those of three outgroup species belonging to three lepidopteran superfamilies. These sequences were subjected to phylogenetic reconstruction via Bayesian Inference (BI), Maximum Likelihood (ML), and Maximum Parsimony (MP) algorithms. The monophyletic Pieridae and monophyletic Papilionidae evidenced good recovery in all analyses, but in some analyses, the monophylies of the Lycaenidae and Nymphalidae were hampered by the inclusion of single species of the lycaenid subfamily Miletinae and the nymphalid subfamily Danainae. Excluding those singletons, all phylogenetic analyses among the four true butterfly families clearly identified the Nymphalidae as the sister to the Lycaenidae and identified this group as a sister to the Pieridae, with the Papilionidae identified as the most basal linage to the true butterfly, thus supporting the hypothesis: (Papilionidae + (Pieridae + (Nymphalidae + Lycaenidae))).

  4. Inferring the direction of implied motion depends on visual awareness

    PubMed Central

    Faivre, Nathan; Koch, Christof

    2014-01-01

    Visual awareness of an event, object, or scene is, by essence, an integrated experience, whereby different visual features composing an object (e.g., orientation, color, shape) appear as an unified percept and are processed as a whole. Here, we tested in human observers whether perceptual integration of static motion cues depends on awareness by measuring the capacity to infer the direction of motion implied by a static visible or invisible image under continuous flash suppression. Using measures of directional adaptation, we found that visible but not invisible implied motion adaptors biased the perception of real motion probes. In a control experiment, we found that invisible adaptors implying motion primed the perception of subsequent probes when they were identical (i.e., repetition priming), but not when they only shared the same direction (i.e., direction priming). Furthermore, using a model of visual processing, we argue that repetition priming effects are likely to arise as early as in the primary visual cortex. We conclude that although invisible images implying motion undergo some form of nonconscious processing, visual awareness is necessary to make inferences about motion direction. PMID:24706951

  5. Phylogenetic relationships and timing of diversification in gonorynchiform fishes inferred using nuclear gene DNA sequences (Teleostei: Ostariophysi).

    PubMed

    Near, Thomas J; Dornburg, Alex; Friedman, Matt

    2014-11-01

    The Gonorynchiformes are the sister lineage of the species-rich Otophysi and provide important insights into the diversification of ostariophysan fishes. Phylogenies of gonorynchiforms inferred using morphological characters and mtDNA gene sequences provide differing resolutions with regard to the sister lineage of all other gonorynchiforms (Chanos vs. Gonorynchus) and support for monophyly of the two miniaturized lineages Cromeria and Grasseichthys. In this study the phylogeny and divergence times of gonorynchiforms are investigated with DNA sequences sampled from nine nuclear genes and a published morphological character matrix. Bayesian phylogenetic analyses reveal substantial congruence among individual gene trees with inferences from eight genes placing Gonorynchus as the sister lineage to all other gonorynchiforms. Seven gene trees resolve Cromeria and Grasseichthys as a clade, supporting previous inferences using morphological characters. Phylogenies resulting from either concatenating the nuclear genes, performing a multispecies coalescent species tree analysis, or combining the morphological and nuclear gene DNA sequences resolve Gonorynchus as the living sister lineage of all other gonorynchiforms, strongly support the monophyly of Cromeria and Grasseichthys, and resolve a clade containing Parakneria, Cromeria, and Grasseichthys. The morphological dataset, which includes 13 gonorynchiform fossil taxa that range in age from Early Cretaceous to Eocene, was analyzed in combination with DNA sequences from the nine nuclear genes and a relaxed molecular clock to estimate times of evolutionary divergence. This "tip dating" strategy accommodates uncertainty in the phylogenetic resolution of fossil taxa that provide calibration information in the relaxed molecular clock analysis. The estimated age of the most recent common ancestor (MRCA) of living gonorynchiforms is slightly older than estimates from previous node dating efforts, but the molecular tip dating

  6. Phylogenetic Relationships among Asian species of Petaurista (Rodentia, Sciuridae), Inferred from Mitochondrial Cytochrome b Gene Sequences.

    PubMed

    Oshida, T; Lin, L K; Masuda, R; Yoshida, M C

    2000-01-01

    To elucidate the phylogenetic relationships among four species belonging to the genus Petaurista (P. alborufus castaneus, P. alborufus lena, P. leucogenys leucogenys, P. leucogenys nikkonis, P. petaurista melanotus, and P. philippensis grandis), we investigated the partial sequences (1,068 bp) of the mitochondrial cytochrome b gene for these giant flying squirrels. Phylogenetic trees (NJ, MP, and ML trees) constructed from cytochrome b sequences indicated that P. leucogenys was grouped independently with other species, and that P. philippensis was most closely related to P. petaurista with 99-100% bootstrap values. In addition, two subspecies of P. alborufus did not form a single clade: P. alborufus castaneus from China was most distantly related to the other species, whereas P. alborufus lena from Taiwan was closely related to P. petaurista and P. philippensis with 82-90% bootstrap values. This result suggests that it is reasonable to regard P. alborufus lena as a distinct species from P. alborufus castaneus. PMID:18494567

  7. Phylometrics: a pipeline for inferring phylogenetic trees from a sequence relationship network perspective

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of the 16S rRNA gene is frequently used to characterize the microbial diversity of environmental samples. However, sequence similarities do not always imply functional or evolutionary relatedness due to many factors, including unequal rates of change and convergence. Thus, relying on top BLASTN hits for phylogenetic studies may misrepresent the diversity of these constituents. Furthermore, attempts to circumvent this issue by including a large number of BLASTN hits per sequence in one tree to explore their relatedness presents other problems. For instance, the multiple sequence alignment will be poor and computationally costly if not relying on manual alignment, and it may be difficult to derive meaningful relationships from the resulting tree. Analyzing sequence relationship networks within collective BLASTN results, however, reveal sequences that are closely related despite low rank. Results We have developed a web application, Phylometrics, that relies on networks of collective BLASTN results (rather than single BLASTN hits) to facilitate the process of building phylogenetic trees in an automated, high-throughput fashion while offering novel tools to find sequences that are of significant phylogenetic interest with minimal human involvement. The application, which can be installed locally in a laboratory or hosted remotely, utilizes a simple wizard-style format to guide the user through the pipeline without necessitating a background in programming. Furthermore, Phylometrics implements an independent job queuing system that enables users to continue to use the system while jobs are run with little or no degradation in performance. Conclusions Phylometrics provides a novel data mining method to screen supplied DNA sequences and to identify sequences that are of significant phylogenetic interest using powerful analytical tools. Sequences that are identified as being similar to a number of supplied sequences may provide key

  8. PHYML Online—a web server for fast maximum likelihood-based phylogenetic inference

    PubMed Central

    Guindon, Stéphane; Lethiec, Franck; Duroux, Patrice; Gascuel, Olivier

    2005-01-01

    PHYML Online is a web interface to PHYML, a software that implements a fast and accurate heuristic for estimating maximum likelihood phylogenies from DNA and protein sequences. This tool provides the user with a number of options, e.g. nonparametric bootstrap and estimation of various evolutionary parameters, in order to perform comprehensive phylogenetic analyses on large datasets in reasonable computing time. The server and its documentation are available at . PMID:15980534

  9. PHYML Online--a web server for fast maximum likelihood-based phylogenetic inference.

    PubMed

    Guindon, Stéphane; Lethiec, Franck; Duroux, Patrice; Gascuel, Olivier

    2005-07-01

    PHYML Online is a web interface to PHYML, a software that implements a fast and accurate heuristic for estimating maximum likelihood phylogenies from DNA and protein sequences. This tool provides the user with a number of options, e.g. nonparametric bootstrap and estimation of various evolutionary parameters, in order to perform comprehensive phylogenetic analyses on large datasets in reasonable computing time. The server and its documentation are available at http://atgc.lirmm.fr/phyml.

  10. Molecular characterization and phylogenetic inferences of Dermanyssus gallinae isolates in Italy within an European framework.

    PubMed

    Marangi, M; Cantacessi, C; Sparagano, O A E; Camarda, A; Giangaspero, A

    2014-12-01

    In order to investigate the genetic relationships between Dermanyssus gallinae (Metastigmata: Dermanyssidae) (de Geer) isolates from poultry farms in Italy and other European countries, phylogenetic analysis was performed using a portion of the cytochrome c oxidase subunit 1 (cox1) gene of the mitochondrial DNA and the internal transcribed spacers (ITS1+5.8S+ITS2) of the ribosomal DNA. A total of 360 cox1 sequences and 360 ITS+ sequences were obtained from mites collected on 24 different poultry farms in 10 different regions of Northern and Southern Italy. Phylogenetic analysis of the cox1 sequences resulted in the clustering of two groups (A and B), whereas phylogenetic analysis of the ITS+ resulted in largely unresolved clusters. Knowledge of the genetic make-up of mite populations within countries, together with comparative analyses of D. gallinae isolates from different countries, will provide better understanding of the population dynamics of D. gallinae. This will also allow the identification of genetic markers of emerging acaricide resistance and the development of alternative strategies for the prevention and treatment of infestations.

  11. Multilocus phylogenetic inference in subfamily Chlorogaloideae and related genera of Agavaceae - informing questions in taxonomy at multiple ranks.

    PubMed

    Archibald, Jenny K; Kephart, Susan R; Theiss, Kathryn E; Petrosky, Anna L; Culley, Theresa M

    2015-03-01

    A series of taxonomic questions at the subfamilial, generic, and intrageneric levels have remained within subfamily Chlorogaloideae s.s. (comprising Camassia, Chlorogalum, Hastingsia, and Schoenolirion) and relatives in Agavaceae. We present the first phylogenetic hypotheses focused on Chlorogaloideae that are based on multiple independent loci and include a wide sampling of outgroups across Agavaceae. In addition to chloroplast regions ndhF and trnL-trnF, we used nrDNA ITS for phylogenetic inference. Incomplete concerted evolution of the latter is indicated by intra-individual site polymorphisms for nearly half of the individuals. Comparisons of four coding and analysis methods for these characters indicate that the region remains phylogenetically informative. Our results confirm that Chlorogaloideae s.s. is not monophyletic, due to the close relationship of Schoenolirion with Hesperaloe and Hesperoyucca, as well as the likely sister relationship between Hesperocallis and core Chlorogaloideae (Camassia, Chlorogalum, and Hastingsia). Chlorogalum is also not monophyletic, being divided with strong support into vespertine and diurnal clades. This study produced the first phylogenetic hypotheses across Hesperaloe, allowing initial tests of several taxonomic disagreements within this genus. Our results reveal the lack of cohesion of H. funifera, indicating that H. funifera ssp. funifera may be more closely related to H. campanulata than to H. funifera ssp. chiangii (=H. chiangii). With potential gene flow between many members of Hesperaloe and a possible hybrid origin for H. campanulata, the genetic relationships within this genus appear complex. Further population-level investigation of many of the taxa in Chlorogaloideae s.l. would benefit our understanding of the evolution and taxonomy of these groups; Camassia and Hastingsia are the current focus of ongoing study.

  12. Dermacentor everestianus Hirst, 1926 (Acari: Ixodidae): phylogenetic status inferred from molecular characteristics.

    PubMed

    Chen, Ze; Li, Youquan; Ren, Qiaoyun; Luo, Jin; Liu, Zhijie; Zhou, Xun; Liu, Guangyuan; Luo, Jianxun; Yin, Hong

    2014-10-01

    Dermacentor everestianus Hirst, 1926, is only reported in Northwestern China and Nepal. Few researches about this species have been involved, especially for molecular characteristics. The taxonomy studies of D.everestianus are mainly based on morphological features, and its taxonomic status is an ongoing controversy. To clarify the molecular characteristics and phylogenetic status of D.everestianus and other related species, the sequences of mitochondrial 16S ribosomal DNA (rDNA) and cox1 fragments were analyzed in the present study. Analysis of 16S rDNA and cox1 sequences showed 99.3-100% identity within D.everestianus individuals, with the genetic divergence among them was 0-0.0086. The interspecific distance of 16S rDNA and cox1 between D.everestianus and some other Palaearctic species including D. silvarum, D. nuttalli, and D. marginatus was much smaller than that between D.everestianus and Nearctic Dermacentor ticks (D.albipictus, D.nitens, and D.variabilis). Such relationships of these ticks were also verified in the phylogenetic analysis. Two major clades were recovered within Dermacentor spp. with more than 90% bootstrap support in the phylogenetic trees. D.everestianus together with D.silvarum, D.nuttalli, and D.marginatus were included in the clade I (Eurasia lineage). Other analyzed tick species including D.variabilis, D.nitens, and D.albipictus formed clade II, which are distributed in Nearctic realm. These indicated that the genus Dermacentor was at least composed of two lineages. Thus, further researches including additionally molecular markers on all Dermacentor species globally should be taken to precisely resolve relationships within Dermacentor. PMID:25049051

  13. Phylogenetic relationships among higher Nemertean (Nemertea) Taxa inferred from 18S rDNA sequences.

    PubMed

    Sundberg, P; Turbeville, J M; Lindh, S

    2001-09-01

    We estimated the phylogenetic relationships of 15 nemertean (phylum Nemertea) species from the four subclasses Hoplo-, Hetero-, Palaeo-, and Bdellonemertea with 18S rDNA sequence data. Three outgroup taxa were used for rooting: Annelida, Platyhelminthes, and Mollusca. Parsimony and maximum-likelihood analyses supported the monophyletic status of the Heteronemertea and a taxon consisting of hoplonemerteans and Bdellonemertea, while indicating that Palaeonemertea is paraphyletic. The monophyletic status of the two nemertean classes Anopla and Enopla is not supported by the data. The unambiguous clades are well supported, as assessed by a randomization test (bootstrapping) and branch support values.

  14. Phylogenetic positions of insectivora in eutheria inferred from mitochondrial cytochrome c oxidase subunit II gene.

    PubMed

    Onuma, M; Kusakabe, T; Kusakabe, S

    1998-02-01

    For the elucidation of the phylogenetic position of insectivora in eutheria, we have sequenced the cytochrome c oxidase subunit II (COII) gene of mitochondria for three insectivoran species [musk screw (Suncus murinus), shrew mole (Urotrichus talpoides), Japanese mole (Mogera wogura)] and analyzed these amino acid sequences with neighbor-joining (NJ) method and maximum likelihood (ML) method. NJ analysis shows polyphyly of Insectivora and Chiroptera. Assuming that each of Primates, Ferungulata, Chiroptera, Insectivora and Rodentia is a monophyletic group, ML analysis suggests that Chiroptera is a sister group of Insectivora and that Ferungulata is the closest outgroup to the (Insectivora and Chiroptera) clade.

  15. Phylogenetic position of foraminifera inferred from LSU rRNA gene sequences.

    PubMed

    Pawlowski, J; Bolivar, I; Guiard-Maffia, J; Gouy, M

    1994-11-01

    A 5'-terminal region of 1600-1800 base pairs was amplified, cloned, and sequenced in the large subunit rDNA (LSU rDNA) of four species of foraminifera. These sequences were compared with the homologous regions of 16 eukaryotic taxa in order to establish the phylogenetic position of foraminifera. Analysis of 610 unambiguously aligned bases shows that foraminifera branch closely to plasmodial and cellular slime molds in the middle of the eukaryotic tree--that is, much earlier than suggested by the fossil record. These data, the first DNA sequences reported for foraminifera, will help analyze this class of protists and the early evolution of eukaryotes.

  16. Phylogenetic relationships of the Hucul horse from Romania inferred from mitochondrial D-loop variation.

    PubMed

    Georgescu, S E; Manea, M A; Dudu, A; Costache, M

    2011-10-31

    The existence of the Hucul horse on Romanian territory has been documented from the very distant past; today Hucul is a unique breed that is part of the FAO Program for the Preservation of Animal Genetic Resources. We compared Hucul with several primitive European and Asiatic breeds in order to elucidate the origin of these horses. We analyzed a 683-bp mitochondrial DNA (mtDNA) D-loop fragment in a population of Hucul horses and compared the polymorphic sites with sequences from other primitive breeds, including Exmoor, Icelandic Pony, Sorraia, Przewalski Horse, Mongolian Wild Horse, Konik, and Shetland Pony, as well as with Arabian, Akhal Teke and Caspian Pony. The sequences were truncated to 247 bp to accommodate short sequence data for the other species. Eighty horses were analyzed; 35 polymorphic sites representing 33 haplotypes were observed. The mean percentage of polymorphic sites was 14.2% for this mtDNA fragment. A neighbor-joining phylogenetic tree was constructed based on Kimura two-parameter distances and the Network 3.111 software was used for phylogenetic analysis. The Hucul horse was classified separately from all other primitive breeds. It is possible that the Hucul horse is not part of the pony class, as it segregated apart from all primitive pony breeds. We found multiple origins in the maternal lineage of domestic horse breeds and demonstrated the uniqueness of the Hucul breed; its origins remain unclear.

  17. Evolutionary relationships among Magnetospirillum strains inferred from phylogenetic analysis of 16S rDNA sequences.

    PubMed Central

    Burgess, J G; Kawaguchi, R; Sakaguchi, T; Thornhill, R H; Matsunaga, T

    1993-01-01

    We have investigated the evolutionary relationships between two facultatively anaerobic Magnetospirillum strains (AMB-1 and MGT-1) and fastidious, obligately microaerophilic species, such as Magnetospirillum magnetotacticum, using a molecular phylogenetic approach. Genomic DNA from strains MGT-1 and AMB-1 was used as a template for amplification of the genes coding for 16S rRNA (16S rDNA) by the polymerase chain reaction. Amplified DNA fragments were sequenced (1,424 bp) and compared with sequences for M. magnetotacticum MS-1 and Magnetospirillum gryphiswaldense MSR-1. Phylogenetic analysis of the aligned 16S rDNA sequences indicated that the two new magnetic spirilla, AMB-1 and MGT-1, lie within the alpha subdivision (alpha-1) of the eubacterial group Proteobacteria and are closely related to Rhodospirillum fulvum and to several endosymbiotic bacteria. Strains AMB-1, MGT-1, and MS-1 formed a cluster, termed group I, in which they were more closely related to each other than to group II, which contained M. gryphiswaldense MSR-1. Group I strains were also physiologically distinct from strain MSR-1. Sequence alignment studies allowed elucidation of genus-specific regions of the 16S rDNA, and oligonucleotide primers complementary to two of these regions were used to develop a specific polymerase chain reaction assay for detection of magnetic spirilla in natural samples. Images PMID:7691800

  18. Phylogenetic relationships of the Hucul horse from Romania inferred from mitochondrial D-loop variation.

    PubMed

    Georgescu, S E; Manea, M A; Dudu, A; Costache, M

    2011-01-01

    The existence of the Hucul horse on Romanian territory has been documented from the very distant past; today Hucul is a unique breed that is part of the FAO Program for the Preservation of Animal Genetic Resources. We compared Hucul with several primitive European and Asiatic breeds in order to elucidate the origin of these horses. We analyzed a 683-bp mitochondrial DNA (mtDNA) D-loop fragment in a population of Hucul horses and compared the polymorphic sites with sequences from other primitive breeds, including Exmoor, Icelandic Pony, Sorraia, Przewalski Horse, Mongolian Wild Horse, Konik, and Shetland Pony, as well as with Arabian, Akhal Teke and Caspian Pony. The sequences were truncated to 247 bp to accommodate short sequence data for the other species. Eighty horses were analyzed; 35 polymorphic sites representing 33 haplotypes were observed. The mean percentage of polymorphic sites was 14.2% for this mtDNA fragment. A neighbor-joining phylogenetic tree was constructed based on Kimura two-parameter distances and the Network 3.111 software was used for phylogenetic analysis. The Hucul horse was classified separately from all other primitive breeds. It is possible that the Hucul horse is not part of the pony class, as it segregated apart from all primitive pony breeds. We found multiple origins in the maternal lineage of domestic horse breeds and demonstrated the uniqueness of the Hucul breed; its origins remain unclear. PMID:22057995

  19. PHYLOViZ Online: web-based tool for visualization, phylogenetic inference, analysis and sharing of minimum spanning trees.

    PubMed

    Ribeiro-Gonçalves, Bruno; Francisco, Alexandre P; Vaz, Cátia; Ramirez, Mário; Carriço, João André

    2016-07-01

    High-throughput sequencing methods generated allele and single nucleotide polymorphism information for thousands of bacterial strains that are publicly available in online repositories and created the possibility of generating similar information for hundreds to thousands of strains more in a single study. Minimum spanning tree analysis of allelic data offers a scalable and reproducible methodological alternative to traditional phylogenetic inference approaches, useful in epidemiological investigations and population studies of bacterial pathogens. PHYLOViZ Online was developed to allow users to do these analyses without software installation and to enable easy accessing and sharing of data and analyses results from any Internet enabled computer. PHYLOViZ Online also offers a RESTful API for programmatic access to data and algorithms, allowing it to be seamlessly integrated into any third party web service or software. PHYLOViZ Online is freely available at https://online.phyloviz.net. PMID:27131357

  20. PHYLOViZ Online: web-based tool for visualization, phylogenetic inference, analysis and sharing of minimum spanning trees

    PubMed Central

    Ribeiro-Gonçalves, Bruno; Francisco, Alexandre P.; Vaz, Cátia; Ramirez, Mário; Carriço, João André

    2016-01-01

    High-throughput sequencing methods generated allele and single nucleotide polymorphism information for thousands of bacterial strains that are publicly available in online repositories and created the possibility of generating similar information for hundreds to thousands of strains more in a single study. Minimum spanning tree analysis of allelic data offers a scalable and reproducible methodological alternative to traditional phylogenetic inference approaches, useful in epidemiological investigations and population studies of bacterial pathogens. PHYLOViZ Online was developed to allow users to do these analyses without software installation and to enable easy accessing and sharing of data and analyses results from any Internet enabled computer. PHYLOViZ Online also offers a RESTful API for programmatic access to data and algorithms, allowing it to be seamlessly integrated into any third party web service or software. PHYLOViZ Online is freely available at https://online.phyloviz.net. PMID:27131357

  1. Molecular phylogenetic inference from saber-toothed cat fossils of Rancho La Brea.

    PubMed Central

    Janczewski, D N; Yuhki, N; Gilbert, D A; Jefferson, G T; O'Brien, S J

    1992-01-01

    A method for the successful extraction of sequestered cellular DNA from 14,000-year-old fossil bones was developed and applied to asphalt-preserved specimens of the extinct saber-toothed cat, Smilodon fatalis. Two distinct gene segments, the mitochondrial gene for 12S rRNA and nuclear FLA-I (the feline class I major histocompatibility complex gene), from three different individual fossil specimens were cloned and sequenced after PCR amplification. Comparison of fossil-derived DNA sequences to homologous regions in 15 living carnivorous species, including 9 species of Felidae and 6 nonfelids, affirmed the phylogenetic placement of Smilodon within the modern radiation of Felidae distinct from the Miocene paleofelid (Nimravidae) saber-toothed "cat" species. These results raise the prospect of obtaining genetically informative DNA from preserved bones of extinct fossil species, particularly among the 2 million specimens excavated from the asphaltic sediments at Rancho La Brea in metropolitan Los Angeles. PMID:1409696

  2. Reversible polymorphism-aware phylogenetic models and their application to tree inference.

    PubMed

    Schrempf, Dominik; Minh, Bui Quang; De Maio, Nicola; von Haeseler, Arndt; Kosiol, Carolin

    2016-10-21

    We present a reversible Polymorphism-Aware Phylogenetic Model (revPoMo) for species tree estimation from genome-wide data. revPoMo enables the reconstruction of large scale species trees for many within-species samples. It expands the alphabet of DNA substitution models to include polymorphic states, thereby, naturally accounting for incomplete lineage sorting. We implemented revPoMo in the maximum likelihood software IQ-TREE. A simulation study and an application to great apes data show that the runtimes of our approach and standard substitution models are comparable but that revPoMo has much better accuracy in estimating trees, divergence times and mutation rates. The advantage of revPoMo is that an increase of sample size per species improves estimations but does not increase runtime. Therefore, revPoMo is a valuable tool with several applications, from speciation dating to species tree reconstruction. PMID:27480613

  3. Spiral cleavage and early embryology of a loxosomatid entoproct and the usefulness of spiralian apical cross patterns for phylogenetic inferences

    PubMed Central

    2012-01-01

    Background Among the four major bilaterian clades, Deuterostomia, Acoelomorpha, Ecdysozoa, and Lophotrochozoa, the latter shows an astonishing diversity of bodyplans. While the largest lophotrochozoan assemblage, the Spiralia, which at least comprises Annelida, Mollusca, Entoprocta, Platyhelminthes, and Nemertea, show a spiral cleavage pattern, Ectoprocta, Brachiopoda and Phoronida (the Lophophorata) cleave radially. Despite a vast amount of recent molecular phylogenetic analyses, the interrelationships of lophotrochozoan phyla remain largely unresolved. Thereby, Entoprocta play a key role, because they have frequently been assigned to the Ectoprocta, despite their differently cleaving embryos. However, developmental data on entoprocts employing modern methods are virtually non-existent and the data available rely exclusively on sketch drawings, thus calling for thorough re-investigation. Results By applying fluorescence staining in combination with confocal microscopy and 3D-imaging techniques, we analyzed early embryonic development of a basal loxosomatid entoproct. We found that cleavage is asynchronous, equal, and spiral. An apical rosette, typical for most spiralian embryos, is formed. We also identified two cross-like cellular arrangements that bear similarities to both, a "molluscan-like" as well as an "annelid-like" cross, respectively. Conclusions A broad comparison of cleavage types and apical cross patterns across Lophotrochozoa shows high plasticity of these character sets and we therefore argue that these developmental traits should be treated and interpreted carefully when used for phylogenetic inferences. PMID:22458754

  4. Cryptic intercontinental colonization in water fleas Daphnia pulicaria inferred from phylogenetic analysis of mitochondrial DNA variation.

    PubMed

    Marková, Silvia; Dufresne, France; Rees, David J; Cerný, Martin; Kotlík, Petr

    2007-07-01

    The water fleas of the Daphnia pulex complex play a key role in freshwater ecosystems throughout the northern hemisphere. Despite the fact that they have been the subject of study for numerous biological disciplines, their phylogeny and species delimitation remain controversial. We used DNA sequence variation of the mitochondrial ND5 gene to reconstruct the phylogenetic relationships of D. pulicaria Forbes, a widespread member of this complex from North America and Europe. Populations from the two continents respectively split into two evolutionary lineages, Eastern Nearctic and European, which each belong to another main clade within the D. pulex complex (the pulicaria and tenebrosa groups, respectively). Unexpectedly, melanin and carotenoid pigmented D. pulicaria populations from European high-mountain lakes were not allied with the transparent populations inhabiting the same lakes and the lowland ponds and reservoirs throughout Europe, but were included with the samples from Canada and Greenland in the Eastern Nearctic lineage. Until now populations belonging to this lineage were known only from Canada and North Atlantic islands, but not from mainland Europe. Independent data from microsatellite markers supported the genetic distinctiveness of the sympatric carotenoid pigmented and transparent populations and suggested that they may have undergone transition to obligate parthenogenesis, possibly as a consequence of past introgressive hybridization. Two different taxa are therefore confused under the name D. pulicaria in Europe. The close phylogenetic relationships of European populations with those from Canada and Greenland suggest that the Nearctic lineage is of recent origin in Europe via intercontinental dispersal from the North America. It has evolved melanin and carotenoid pigmentation as adaptations against the UV light stress, which enable it to share habitat occupied by the transparent European species. The Nearctic D. pulicaria thus provides a new model

  5. Clarification of the Phylogenetic Framework of the Tribe Baorini (Lepidoptera: Hesperiidae: Hesperiinae) Inferred from Multiple Gene Sequences.

    PubMed

    Fan, Xiaoling; Chiba, Hideyuki; Huang, Zhenfu; Fei, Wen; Wang, Min; Sáfián, Szabolcs

    2016-01-01

    Members of the skipper tribe Baorini generally resemble each other and are characterized by dark brown wings with hyaline white spots. These shared characteristics have caused difficulties with revealing the relationships among genera and species in the group, and some conflicting taxonomic views remain unresolved. The present study aims to infer a more comprehensive phylogeny of the tribe using molecular data, to test the monophyly of the tribe as well as the genera it includes in order to clarify their taxonomic status, and finally to revise the current classification of the group. In order to reconstruct a phylogenetic tree, the mitochondrial COI-COII and 16S genes as well as the nuclear EF-1α and 28S genes were analyzed using parsimony, maximum likelihood, and Bayesian inference. The analysis included 67 specimens of 41 species, and we confirmed the monophyly of Baorini, and revealed that 14 genera are well supported. The genus Borbo is separated into three clades: Borbo, Pseudoborbo, and Larsenia gen. nov. We confirmed that Polytremis is polyphyletic and separated into three genera: Polytremis, Zinaida, and Zenonoida gen. nov., and also confirmed that the genus Prusiana is a member of the tribe. Relationships among some genera were strongly supported. For example, Zenonia and Zenonoida were found to be sister taxa, closely related to Zinaida and Iton, while Pelopidas and Baoris were also found to cluster together. PMID:27463803

  6. Clarification of the Phylogenetic Framework of the Tribe Baorini (Lepidoptera: Hesperiidae: Hesperiinae) Inferred from Multiple Gene Sequences

    PubMed Central

    Fan, Xiaoling; Chiba, Hideyuki; Huang, Zhenfu; Fei, Wen; Wang, Min; Sáfián, Szabolcs

    2016-01-01

    Members of the skipper tribe Baorini generally resemble each other and are characterized by dark brown wings with hyaline white spots. These shared characteristics have caused difficulties with revealing the relationships among genera and species in the group, and some conflicting taxonomic views remain unresolved. The present study aims to infer a more comprehensive phylogeny of the tribe using molecular data, to test the monophyly of the tribe as well as the genera it includes in order to clarify their taxonomic status, and finally to revise the current classification of the group. In order to reconstruct a phylogenetic tree, the mitochondrial COI-COII and 16S genes as well as the nuclear EF-1α and 28S genes were analyzed using parsimony, maximum likelihood, and Bayesian inference. The analysis included 67 specimens of 41 species, and we confirmed the monophyly of Baorini, and revealed that 14 genera are well supported. The genus Borbo is separated into three clades: Borbo, Pseudoborbo, and Larsenia gen. nov. We confirmed that Polytremis is polyphyletic and separated into three genera: Polytremis, Zinaida, and Zenonoida gen. nov., and also confirmed that the genus Prusiana is a member of the tribe. Relationships among some genera were strongly supported. For example, Zenonia and Zenonoida were found to be sister taxa, closely related to Zinaida and Iton, while Pelopidas and Baoris were also found to cluster together. PMID:27463803

  7. Phylogenetic inference of the porcine Rotavirus A origin of the human G1 VP7 gene.

    PubMed

    Do, Loan Phuong; Nakagomi, Toyoko; Otaki, Hiroki; Agbemabiese, Chantal Ama; Nakagomi, Osamu; Tsunemitsu, Hiroshi

    2016-06-01

    Rotavirus A (RVA) is an important cause of acute gastroenteritis in children worldwide. The most common VP7 genotype of human RVA is G1, but G1 is rarely detected in porcine strains. To understand the evolutionary relationships between human and porcine G1 VP7 genes, we sequenced the VP7 genes of three Japanese G1 porcine strains; the first two (PRV2, S80B) were isolated in 1980 and the third (Kyusyu-14) was isolated in 2001. Then, we performed phylogenetic and in-silico structural analyses. All three VP7 sequences clustered into lineage VI, and the mean nucleotide sequence identity between any pair of porcine G1 VP7 sequences belonging to lineage VI was 91.9%. In contrast, the mean nucleotide sequence identity between any pair of human G1 VP7 sequences belonging to lineages I-V was 95.5%. While the mean nucleotide sequence identity between any pair of porcine lineage VI strain and human lineage I-V strain was 85.4%, the VP7 genes of PRV2 and a rare porcine-like human G1P[6] strain (AU19) were 98% identical, strengthening the porcine RVA origin of AU19. The phylogenetic tree suggests that human G1 VP7 genes originated from porcine G1 VP7 genes. The time of their most recent common ancestor was estimated to be 1948, and human and porcine RVA strains evolved along independent pathways. In-silico structural analyses identified 7 amino acid residues within the known neutralisation epitopes that show differences in electric charges and shape between different porcine and human G1 strains. When compared with much divergent porcine G1 VP7 lineages, monophyletic, less divergent human G1 VP7 lineages support the hypothesis that all human G1 VP7 genes included in this study originated from a rare event of a porcine RVA transmitting to humans that was followed by successful adaptation to the human host. By contrast, AU19 represents interspecies transmission that terminated in dead-end infection. PMID:26961591

  8. Phylogenetic relationships of Indian caecilians (Amphibia: Gymnophiona) inferred from mitochondrial rRNA gene sequences.

    PubMed

    Wilkinson, Mark; A Sheps, Jonathan; Oommen, Oommen V; Cohen, Bernard L

    2002-06-01

    India has a diverse caecilian fauna, including representatives of three of the six currently recognized families, the Caeciliidae, Ichthyophiidae, the endemic Uraeotyphlidae, but previous molecular phylogenetic studies of caecilians have not included sequences for any Indian caecilians. Partial 12S and 16S mitochondrial gene sequences were obtained for a single representative of each of the caecilian families found in India and aligned against previously reported sequences for 13 caecilian species. The resulting alignment (16 taxa, 1200 sites, of which 288 cannot be aligned unambiguously) was analyzed using parsimony, maximum-likelihood, and distance methods. As judged by bootstrap proportions, decay indices, and leaf stabilities, well-supported relationships of the Indian caecilians are recovered from the alignment. The data (1) corroborate the hypothesis, based on morphology, that the Uraeotyphlidae and Ichthyophiidae are sister taxa, (2) recover a monophyletic Ichthyophiidae, including Indian and South East Asian representatives, and (3) place the Indian caeciliid Gegeneophis ramaswamii as the sister group of the caeciliid caecilians of the Seychelles. Rough estimates of divergence times suggest an origin of the Uraeotyphlidae and Ichthyophiidae while India was isolated from Laurasia and Africa and are most consistent with an Indian origin of these families and subsequent dispersal of ichthyophiids into South East Asia.

  9. A New Miniature Characid (Ostariophysi: Characiformes: Characidae), with Phylogenetic Position Inferred from Morphological and Molecular Data

    PubMed Central

    Netto-Ferreira, André Luiz; Birindelli, José Luís Olivan; de Sousa, Leandro Melo; Mariguela, Tatiane Casagrande; Oliveira, Claudio

    2013-01-01

    Erythrocharax altipinnis is described from the Serra do Cachimbo, Pará, Brazil. The new taxon is distinguished from all of the Characidae genera by having the pelvic bones firmly attached through the isquiatic processes; a nearly triangular hiatus in the musculature covering the anterior chamber of the swim bladder between the first and second pleural ribs (pseudotympanum); the pedunculate, notably expanded and distally compressed teeth in both jaws; circumorbital series represented by antorbital and four infraorbital bones with laterosensory canals not enclosed; a single tooth row in the premaxillary with the teeth perfectly aligned and similar in shape and cusp number; the first three branched dorsal-fin rays distinctly elongate in males; a bright red adipose and caudal fins in life; a conspicuous dark midlateral stripe extending from the opercle to the tip of the median caudal-fin rays; and by the absence of a humeral spot. The phylogenetic position of the new taxon is discussed using morphological and molecular datasets, with conflicting results of both approaches discussed. Additionally, a summarized discussion on the current problems in the Characidae taxonomy is presented and the principal biases in the morphological dataset are also discussed. PMID:23300963

  10. Phylogenetic relationships of the genus Chamaecyparis inferred from leaf essential oil.

    PubMed

    Chen, Ying-Ju; Lin, Chun-Ya; Cheng, Seng-Sung; Chang, Shang-Tzen

    2011-06-01

    The species differentiation between Chamaecyparis formosensis, C. obtusa var. formosana, and C. obtusa, based on the composition of the leaf essential oils, was studied. The characterization of the oils by GC-FID and GC/MS analyses showed remarkable differences between these three essential oils. Cluster analysis (CA) and principal-component analysis (PCA) distinguished three groups of essential oils. The C. formosensis oil was dominated by α-pinene while those isolated from C. obtusa var. formosana and C. obtusa were characterized by high levels of (-)-thujopsene and α-terpinyl acetate, respectively. Moreover, the phylogenetic relationships of the genus Chamaecyparis were in agreement with previous findings based on morphological and molecular evidence. In addition, the essential oils from C. obtusa var. formosana could be classified into three chemical types, according to their different characteristic main compounds (β-elemol, (-)-thujopsene, and cis-thujopsenal). The biochemical correlations between the major constituents of the Chamaecyparis species were examined and their relationship is discussed. PMID:21674781

  11. Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes

    PubMed Central

    Hodgson, Jason A.; Burrell, Andrew S.; Sterner, Kirstin N.; Raaum, Ryan L.; Disotell, Todd R.

    2014-01-01

    The origins and the divergence times of the most basal lineages within primates have been difficult to resolve mainly due to the incomplete sampling of early fossil taxa. The main source of contention is related to the discordance between molecular and fossil estimates: while there are no crown primate fossils older than 56 Ma, most molecule-based estimates extend the origins of crown primates into the Cretaceous. Here we present a comprehensive mitogenomic study of primates. We assembled 87 mammalian mitochondrial genomes, including 62 primate species representing all the families of the order. We newly sequenced eleven mitochondrial genomes, including eight Old World monkeys and three strepsirrhines. Phylogenetic analyses support a strong topology, confirming the monophyly for all the major primate clades. In contrast to previous mitogenomic studies, the positions of tarsiers and colugos relative to strepsirrhines and anthropoids are well resolved. In order to improve our understanding of how fossil calibrations affect age estimates within primates, we explore the effect of seventeen fossil calibrations across primates and other mammalian groups and we select a subset of calibrations to date our mitogenomic tree. The divergence date estimates of the Strepsirrhine/Haplorhine split support an origin of crown primates in the Late Cretaceous, at around 74 Ma. This result supports a short fuse model of primate origins, whereby relatively little time passed between the origin of the order and the diversification of its major clades. It also suggests that the early primate fossil record is likely poorly sampled. PMID:24583291

  12. Algae or protozoa: phylogenetic position of euglenophytes and dinoflagellates as inferred from mitochondrial sequences.

    PubMed

    Inagaki, Y; Hayashi-Ishimaru, Y; Ehara, M; Igarashi, I; Ohama, T

    1997-09-01

    The chloroplasts of euglenophytes and dinoflagellates have been suggested to be the vestiges of endosymbiotic algae acquired during the process of evolution. However, the evolutionary positions of these organisms are still inconclusive, and they have been tentatively classified as both algae and protozoa. A representative gene of the mitochondrial genome, cytochrome oxidase subunit I (coxI), was chosen and sequenced to clarify the phylogenetic positions of four dinoflagellates, two euglenophytes and one apicomplexan protist. This is the first report of mitochondrial DNA sequences for dinoflagellates and euglenophytes. Our COXI tree shows clearly that dinoflagellates are closely linked to apicomplexan parasites but not with algae. Euglenophytes and algae appear to be only remotely related, with euglenophytes sharing a possible evolutionary link with kinetoplastids. The COXI tree is in general agreement with the tree based on the nuclear encoded small subunit of ribosomal RNA (SSU rRNA) genes, but conflicts with that based on plastid genes. These results support the interpretation that chloroplasts present in euglenophytes and dinoflagellates were captured from algae through endosymbioses, while their mitochondria were inherited from the host cell. We suggest that dinoflagellates and euglenophytes were originally heterotrophic protists and that their chloroplasts are remnants of endosymbiotic algae.

  13. Phylogenetic position of Gromia oviformis Dujardin inferred from nuclear-encoded small subunit ribosomal DNA.

    PubMed

    Burki, Fabien; Berney, Cédric; Pawlowski, Jan

    2002-09-01

    Gromia oviformis Dujardin is a common marine protist characterised by a large, globular test and filose pseudopodia. First considered a foraminifer, Gromia was later placed within the Filosea and recently included among amoebae of uncertain affinities. In order to clarify the phylogenetic position of this genus, we sequenced the complete small-subunit ribosomal DNA gene of G. oviformis collected at five different geographic localities. The high divergence of obtained sequences suggests that G. oviformis is a species complex composed of several genetically distinct sibling species. Sequence analyses show Gromia to be a member of the Cercozoa, a heterogeneous assemblage which includes filose amoebae, the amoeboflagellate cercomonads, the chlorarachniophytes and the plasmodiophorid plant pathogens. Contrary to traditional classification, Gromia is not closely related to other testate filose amoebae (the Euglyphida), but seems to branch early among the Cercozoa. Our analyses also show a close relationship between the Cercozoa and the Acantharea. Because the Cercozoa are related to the Foraminifera based on other molecular data, we propose that most protists possessing filopodia, reticulopodia and axopodia have a common origin.

  14. Phylogenetic inferences in Avena based on analysis of FL intron2 sequences.

    PubMed

    Peng, Yuan-Ying; Wei, Yu-Ming; Baum, Bernard R; Yan, Ze-Hong; Lan, Xiu-Jin; Dai, Shou-Fen; Zheng, You-Liang

    2010-09-01

    The development and application of molecular methods in oats has been relatively slow compared with other crops. Results from the previous analyses have left many questions concerning species evolutionary relationships unanswered, especially regarding the origins of the B and D genomes, which are only known to be present in polyploid oat species. To investigate the species and genome relationships in genus Avena, among 13 diploid (A and C genomes), we used the second intron of the nuclear gene FLORICAULA/LEAFY (FL int2) in seven tetraploid (AB and AC genomes), and five hexaploid (ACD genome) species. The Avena FL int2 is rather long, and high levels of variation in length and sequence composition were found. Evidence for more than one copy of the FL int2 sequence was obtained for both the A and C genome groups, and the degree of divergence of the A genome copies was greater than that observed within the C genome sequences. Phylogenetic analysis of the FL int2 sequences resulted in topologies that contained four major groups; these groups reemphasize the major genomic divergence between the A and C genomes, and the close relationship among the A, B, and D genomes. However, the D genome in hexaploids more likely originated from a C genome diploid rather than the generally believed A genome, and the C genome diploid A. clauda may have played an important role in the origination of both the C and D genome in polyploids.

  15. Phylogenetic relationships of Indian caecilians (Amphibia: Gymnophiona) inferred from mitochondrial rRNA gene sequences.

    PubMed

    Wilkinson, Mark; A Sheps, Jonathan; Oommen, Oommen V; Cohen, Bernard L

    2002-06-01

    India has a diverse caecilian fauna, including representatives of three of the six currently recognized families, the Caeciliidae, Ichthyophiidae, the endemic Uraeotyphlidae, but previous molecular phylogenetic studies of caecilians have not included sequences for any Indian caecilians. Partial 12S and 16S mitochondrial gene sequences were obtained for a single representative of each of the caecilian families found in India and aligned against previously reported sequences for 13 caecilian species. The resulting alignment (16 taxa, 1200 sites, of which 288 cannot be aligned unambiguously) was analyzed using parsimony, maximum-likelihood, and distance methods. As judged by bootstrap proportions, decay indices, and leaf stabilities, well-supported relationships of the Indian caecilians are recovered from the alignment. The data (1) corroborate the hypothesis, based on morphology, that the Uraeotyphlidae and Ichthyophiidae are sister taxa, (2) recover a monophyletic Ichthyophiidae, including Indian and South East Asian representatives, and (3) place the Indian caeciliid Gegeneophis ramaswamii as the sister group of the caeciliid caecilians of the Seychelles. Rough estimates of divergence times suggest an origin of the Uraeotyphlidae and Ichthyophiidae while India was isolated from Laurasia and Africa and are most consistent with an Indian origin of these families and subsequent dispersal of ichthyophiids into South East Asia. PMID:12099794

  16. Use of Endogenous Retroviral Sequences (ERVs) and structural markers for retroviral phylogenetic inference and taxonomy

    PubMed Central

    Jern, Patric; Sperber, Göran O; Blomberg, Jonas

    2005-01-01

    Background Endogenous retroviral sequences (ERVs) are integral parts of most eukaryotic genomes and vastly outnumber exogenous retroviruses (XRVs). ERVs with a relatively complete structure were retrieved from the genetic archives of humans and chickens, diametrically opposite representatives of vertebrate retroviruses (over 3300 proviruses), and analyzed, using a bioinformatic program, RetroTector©, developed by us. This rich source of proviral information, accumulated in a local database, and a collection of XRV sequences from the literature, allowed the reconstruction of a Pol based phylogenetic tree, more extensive than previously possible. The aim was to find traits useful for classification and evolutionary studies of retroviruses. Some of these traits have been used by others, but they are here tested in a wider context than before. Results In the ERV collection we found sequences similar to the XRV-based genera: alpha-, beta-, gamma-, epsilon- and spumaretroviruses. However, the occurrence of intermediates between them indicated an evolutionary continuum and suggested that taxonomic changes eventually will be necessary. No delta or lentivirus representatives were found among ERVs. Classification based on Pol similarity is congruent with a number of structural traits. Acquisition of dUTPase occurred three times in retroviral evolution. Loss of one or two NC zinc fingers appears to have occurred several times during evolution. Nucleotide biases have been described earlier for lenti-, delta- and betaretroviruses and were here confirmed in a larger context. Conclusion Pol similarities and other structural traits contribute to a better understanding of retroviral phylogeny. "Global" genomic properties useful in phylogenies are i.) translational strategy, ii.) number of Gag NC zinc finger motifs, iii.) presence of Pro N-terminal dUTPase (dUTPasePro), iv.) presence of Pro C-terminal G-patch and v.) presence of a GPY/F motif in the Pol integrase (IN) C

  17. Phylogenetic Relationships of Japanese Auritibicen Species (Hemiptera: Cicadidae: Cryptotympanini) Inferred from Mitochondrial and Nuclear Gene Sequences.

    PubMed

    Sota, Teiji; Kojima, Takanori; Lee, Young June; Lin, Chung-Ping

    2016-08-01

    We investigated the phylogenetic relationships and divergence times within the genus Auritibicen(Cicadidae: Cicadinae: Cryptotympanini), analyzing five Japanese species (A. japonicus, A. bihamatus,A. kyushyuensis, A. esakii and A. flammatus) and three species from East Asian mainland and Taiwan (A. atrofasciatus, A. intermedius and A. chujoi) using mitochondrial cytochrome oxidase subunit I (COI) and nuclear elongation factor 1-alpha (EF-1a) gene sequences. Although the EF-1a gene tree did not resolve the relationships among these Auritibicen species, the trees based on COI gene and the combined data set showed that Japanese taxa comprised three distinct lineages: the individual species A. flammatus and A. bihamatus, and the A. japonicus group, comprising A. japonicus, A. esakii and A. kyushyuensis from Japan and A. intermedius from Korea. In A. kyushyuensis, which comprises three populations in Kyushu, western Honshu and Shikoku, the specimens from western Honshu and Shikoku were closely related to each other, but not to the specimen from Kyushu; instead, they were sister to the Korean A. intermedius. The incongruence between the gene tree and species tree necessitates further population genetic and morphological studies to confirm the classification and species status of the western Honshu and Shikoku populations of A. kyushyuensis, which were originally described as two independent species. Divergence time estimation suggested that the most recent common ancestor of Auritibicen species studied dated back to the late Pliocene and that the species of the A. japonicus group diverged during the mid Pleistocene. Thus, the Pleistocene climatic fluctuation may have promoted the divergence of the Auritibicen species. PMID:27498799

  18. Phylogenetic relationships among diploid Aegilops species inferred from 5S rDNA units.

    PubMed

    Baum, B R; Edwards, T; Johnson, D A

    2009-10-01

    Relationships among the currently recognized 11 diploid species within the genus Aegilops have been investigated. Sequence similarity analysis, based upon 363 sequenced 5S rDNA clones from 44 accessions plus 15 sequences retrieved from GenBank, depicted two unit classes labeled the long AE1 and short AE1. Several different analytical methods were applied to infer relationships within haplomes, between haplomes and among the species, including maximum parsimony and maximum likelihood analyses of consensus sequences, "total evidence" phylogeny analysis and "matrix representation with parsimony" analysis. None were able to depict suites of markers or unit classes that could discern among the seven haplomes as is observed among established haplomes in other genera within the tribe Triticeae; however, most species could be separated when displayed on gene trees. These results suggest that the haplomes currently recognized are so refined that they may be relegated as sub-haplomes or haplome variants. Amblyopyrum shares the same 5S rDNA unit classes with the diploid Aegilops species suggesting that it belongs within the latter. Comparisons of the Aegilops sequences with those of Triticum showed that the long AE1 unit class of Ae. tauschii shared the clade with the equivalent long D1 unit class, i.e., the putative D haplome donor, but the short AE1 unit class did not. The long AE1 unit class but not the short, of Ae. speltoides and Ae. searsii both share the clade with the previously identified long {S1 and long G1 unit classes meaning that both Aegilops species can be equally considered putative B haplome donors to tetraploid Triticum species. The semiconserved nature of the nontranscribed spacer in Aegilops and in Triticeae in general is discussed in view that it may have originated by processes of incomplete gene conversion or biased gene conversion or birth-and-death evolution.

  19. Phylogenetic relationships and biogeography of the genus Algansea Girard (Cypriniformes: Cyprinidae) of central Mexico inferred from molecular data

    PubMed Central

    Pérez-Rodríguez, Rodolfo; Domínguez-Domínguez, Omar; de León, Gerardo Pérez Ponce; Doadrio, Ignacio

    2009-01-01

    Background The genus Algansea is one of the most representative freshwater fish groups in central Mexico due to its wide geographic distribution and unusual level of endemicity. Despite the small number of species, this genus has had an unsettled taxonomic history due to high levels of intraspecific morphological variation. Moreover, several phylogenetic hypotheses among congeners have been proposed but have had the following shortcomings: the use of homoplasious morphological characters, the use of character codification and polarisation methods that lacked objectivity, and incomplete taxonomic sampling. In this study, a phylogenetic analysis among species of Algansea is presented. This analysis is based upon two molecular markers, the mitochondrial gene cytochrome b and the first intron of the ribosomal protein S7 gene. Results Bayesian analysis based on a combined matrix (cytochrome b and first intron S7) showed that Algansea is a monophyletic group and that Agosia chrysogaster is the sister group. Divergence times dated the origin of the genus around 16.6 MYA, with subsequent cladogenetic events occurring between 6.4 and 2.8 MYA. When mapped onto the molecular phylogenetic hypothesis, the character states of three morphological characters did not support previous hypotheses on the evolution of morphological traits in the genus Algansea, whereas the character states of the remaining six characters partially corroborated those hypotheses. Conclusion Monophyly of the genus Algansea was corroborated in this study. Tree topology shows the genus consists of three main lineages: Central-Eastern, Western, and Southern clades. However, the relationships among these clades remained unresolved. Congruence found between the available geological and climatic history and the divergence times made it possible to infer the biogeographical history of Algansea, which suggested that vicariance events were responsible for the evolutionary history of the genus. Interestingly, this

  20. Phylogenetic relationships among Octopodidae species in coastal waters of China inferred from two mitochondrial DNA gene sequences.

    PubMed

    Lü, Z M; Cui, W T; Liu, L Q; Li, H M; Wu, C W

    2013-09-19

    Octopus in the family Octopodidae (Mollusca: Cephalopoda) has been generally recognized as a "catch-all" genus. The monophyly of octopus species in China's coastal waters has not yet been studied. In this paper, we inferred the phylogeny of 11 octopus species (family Octopodidae) in China's coastal waters using nucleotide sequences of two mitochondrial DNA genes: cytochrome c oxidase subunit I (COI) and 16S rRNA. Sequence analysis of both genes revealed that the 11 species of Octopodidae fell into four distinct groups, which were genetically distant from one another and exhibited identical phylogenetic resolution. The phylogenies indicated strongly that the genus Octopus in China's coastal waters is also not monophyletic, and it is therefore clear that the Octopodidae systematics in this area requires major revision. It is demonstrated that partial sequence information of both the mitochondrial genes 16S rRNA and COI could be used as diagnostic molecular markers in the identification and resolution of the taxonomic ambiguity of Octopodidae species.

  1. Molecular inference of phylogenetic relationships among Decapodiformes (Mollusca: Cephalopoda) with special focus on the squid order Oegopsida.

    PubMed

    Lindgren, Annie R

    2010-07-01

    Squids, cuttlefish and bobtail squids comprise the molluscan superorder Decapodiformes (Mollusca: Cephalopoda). Although these animals exemplify the morphological and ecological diversity seen in Cephalopoda, no previous study has focused resolving decapodiform relationships, particularly within Oegopsida, a large order comprised of pelagic squid. To further clarify the phylogenetic history of Decapodiformes, and Oegopsida in particular, molecular data for five genes (18S rRNA, 28S rRNA, Histone H3, 16S rRNA, COI) was collected for 90 taxa representing all major lineages and families and evaluated using parsimony, likelihood, and Bayesian analysis. Although ordinal relationships were sensitive to analytical method, several conclusions can be inferred: the pelagic order Myopsida is closely related to the benthic sepioids, whose relationships were ambiguous, and Bathyteuthoidea is distinct from Oegopsida. Within Oegopsida several clades are consistently recovered, some with previous morphological support (e.g. chiroteuthid, lepidoteuthid, histioteuthid families) while others suggest novel relationships (e.g. Architeuthidae+Neoteuthidae). This study, with its broad coverage of taxa, provides the first in-depth analysis of Decapodiformes with special focus on the morphologically and biogeographically diverse Oegopsida, confirms several sister-taxon relationships, and provides new hypotheses of cephalopod evolution in the open ocean.

  2. Identification and phylogenetic inferences on stocks of sharks affected by the fishing industry off the Northern coast of Brazil.

    PubMed

    da Silva Rodrigues-Filho, Luis Fernando; da Rocha, Tainá Carreira; do Rêgo, Péricles Sena; Schneider, Horacio; Sampaio, Iracilda; Vallinoto, Marcelo

    2009-04-01

    The ongoing decline in abundance and diversity of shark stocks, primarily due to uncontrolled fishery exploitation, is a worldwide problem. An additional problem for the development of conservation and management programmes is the identification of species diversity within a given area, given the morphological similarities among shark species, and the typical disembarkation of processed carcasses which are almost impossible to differentiate. The main aim of the present study was to identify those shark species being exploited off northern Brazil, by using the 12S-16S molecular marker. For this, DNA sequences were obtained from 122 specimens collected on the docks and the fish market in Bragança, in the Brazilian state of Pará. We identified at least 11 species. Three-quarters of the specimens collected were either Carcharhinus porosus or Rhizoprionodon sp, while a notable absence was the daggernose shark, Isogomphodon oxyrhyncus, previously one of the most common species in local catches. The study emphasises the value of molecular techniques for the identification of cryptic shark species, and the potential of the 12S-16S marker as a tool for phylogenetic inferences in a study of elasmobranchs.

  3. Identification and phylogenetic inferences on stocks of sharks affected by the fishing industry off the Northern coast of Brazil

    PubMed Central

    2009-01-01

    The ongoing decline in abundance and diversity of shark stocks, primarily due to uncontrolled fishery exploitation, is a worldwide problem. An additional problem for the development of conservation and management programmes is the identification of species diversity within a given area, given the morphological similarities among shark species, and the typical disembarkation of processed carcasses which are almost impossible to differentiate. The main aim of the present study was to identify those shark species being exploited off northern Brazil, by using the 12S-16S molecular marker. For this, DNA sequences were obtained from 122 specimens collected on the docks and the fish market in Bragança, in the Brazilian state of Pará. We identified at least 11 species. Three-quarters of the specimens collected were either Carcharhinus porosus or Rhizoprionodon sp, while a notable absence was the daggernose shark, Isogomphodon oxyrhyncus, previously one of the most common species in local catches. The study emphasises the value of molecular techniques for the identification of cryptic shark species, and the potential of the 12S-16S marker as a tool for phylogenetic inferences in a study of elasmobranchs. PMID:21637699

  4. Phylogenetic inference and SSR characterization of tropical woody bamboos tribe Bambuseae (Poaceae: Bambusoideae) based on complete plastid genome sequences.

    PubMed

    Vieira, Leila do Nascimento; Dos Anjos, Karina Goulart; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Greco, Thiago Machado; Pedrosa, Fábio de Oliveira; de Souza, Emanuel Maltempi; Rogalski, Marcelo; de Souza, Robson Francisco; Guerra, Miguel Pedro

    2016-05-01

    The complete plastome sequencing is an efficient option for increasing phylogenetic resolution and evolutionary studies, as well as may greatly facilitate the use of plastid DNA markers in plant population genetic studies. Merostachys and Guadua stand out as the most common and the highest potential utilization bamboos indigenous of Brazil. Here, we sequenced the complete plastome sequences of the Brazilian Guadua chacoensis and Merostachys sp. to perform full plastome phylogeny and characterize the occurrence, type, and distribution of SRRs using 20 Bambuseae species. The determined plastome sequence of Merostachys sp. and G. chacoensis is 136,334 and 135,403 bp in size, respectively, with an identical gene content and typical quadripartite structure consisting of a pair of IRs separated by the LSC and SSC regions. The Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of Paleotropical and Neotropical Bamboos clades. The Neotropical bamboos segregated into three well-supported lineages, Chusqueinae, Guaduinae, and Arthrostylidiinae, with the last two forming a well-supported sister relationship. Paleotropical bamboos segregated into two well-supported lineages, Hickeliinae and Bambusinae + Melocanninae. We identified 141.8 cpSSR in Bambuseae plastomes and an inferior value (38.15) for plastome coding sequences. Among them, we identified 16 polymorphic SSR loci, with number of alleles varying from 3 to 10. These 16 polymorphic cpSSR loci in Bambuseae plastome can be assessed for the intraspecific level of polymorphism, leading to innovative highly sensitive phylogeographic and population genetics studies for this tribe.

  5. Phylogenetic Diversity of the Enteric Pathogen Salmonella enterica subsp. enterica Inferred from Genome-Wide Reference-Free SNP Characters

    PubMed Central

    Timme, Ruth E.; Pettengill, James B.; Allard, Marc W.; Strain, Errol; Barrangou, Rodolphe; Wehnes, Chris; Van Kessel, JoAnn S.; Karns, Jeffrey S.; Musser, Steven M.; Brown, Eric W.

    2013-01-01

    The enteric pathogen Salmonella enterica is one of the leading causes of foodborne illness in the world. The species is extremely diverse, containing more than 2,500 named serovars that are designated for their unique antigen characters and pathogenicity profiles—some are known to be virulent pathogens, while others are not. Questions regarding the evolution of pathogenicity, significance of antigen characters, diversity of clustered regularly interspaced short palindromic repeat (CRISPR) loci, among others, will remain elusive until a strong evolutionary framework is established. We present the first large-scale S. enterica subsp. enterica phylogeny inferred from a new reference-free k-mer approach of gathering single nucleotide polymorphisms (SNPs) from whole genomes. The phylogeny of 156 isolates representing 78 serovars (102 were newly sequenced) reveals two major lineages, each with many strongly supported sublineages. One of these lineages is the S. Typhi group; well nested within the phylogeny. Lineage-through-time analyses suggest there have been two instances of accelerated rates of diversification within the subspecies. We also found that antigen characters and CRISPR loci reveal different evolutionary patterns than that of the phylogeny, suggesting that a horizontal gene transfer or possibly a shared environmental acquisition might have influenced the present character distribution. Our study also shows the ability to extract reference-free SNPs from a large set of genomes and then to use these SNPs for phylogenetic reconstruction. This automated, annotation-free approach is an important step forward for bacterial disease tracking and in efficiently elucidating the evolutionary history of highly clonal organisms. PMID:24158624

  6. How does habitat filtering affect the detection of conspecific and phylogenetic density dependence?

    PubMed

    Wu, Junjie; Swenson, Nathan G; Brown, Calum; Zhang, Caicai; Yang, Jie; Ci, Xiuqin; Li, Jie; Sha, Liqing; Cao, Min; Lin, Luxiang

    2016-05-01

    Conspecific negative density dependence (CNDD) has been recognized as a key mechanism underlying species coexistence, especially in tropical forests. Recently, some studies have reported that seedling survival is also negatively correlated with the phylogenetic relatedness between neighbors and focal individuals, termed phylogenetic negative density dependence (PNDD). In contrast to CNDD or PNDD, shared habitat requirements between closely related individuals are thought to be a cause of observed positive effects of closely related neighbors, which may affect the strength and detectability of CNDD or PNDD. In order to investigate the relative importance of these mechanisms for tropical tree seedling survival, we used generalized linear mixed models to analyze how the survival of more than 10 000 seedlings of woody plant species related to neighborhood and habitat variables in a tropical rainforest in southwest China. By comparing models with and without habitat variables, we tested how habitat filtering affected the detection of CNDD and PNDD. The best-fitting model suggested that CNDD and habitat filtering played key roles in seedling survival; but that, contrary to our expectations, phylogenetic positive density dependence (PPDD) had a distinct and important effect. While habitat filtering affected the detection of CNDD by decreasing its apparent strength, it did not explain the positive effects of closely related neighbors. Our results demonstrate that a failure to control for habitat variables and phylogenetic relationships may obscure the importance of conspecific and heterospecific neighbor densities for seedling survival. PMID:27349095

  7. How does habitat filtering affect the detection of conspecific and phylogenetic density dependence?

    PubMed

    Wu, Junjie; Swenson, Nathan G; Brown, Calum; Zhang, Caicai; Yang, Jie; Ci, Xiuqin; Li, Jie; Sha, Liqing; Cao, Min; Lin, Luxiang

    2016-05-01

    Conspecific negative density dependence (CNDD) has been recognized as a key mechanism underlying species coexistence, especially in tropical forests. Recently, some studies have reported that seedling survival is also negatively correlated with the phylogenetic relatedness between neighbors and focal individuals, termed phylogenetic negative density dependence (PNDD). In contrast to CNDD or PNDD, shared habitat requirements between closely related individuals are thought to be a cause of observed positive effects of closely related neighbors, which may affect the strength and detectability of CNDD or PNDD. In order to investigate the relative importance of these mechanisms for tropical tree seedling survival, we used generalized linear mixed models to analyze how the survival of more than 10 000 seedlings of woody plant species related to neighborhood and habitat variables in a tropical rainforest in southwest China. By comparing models with and without habitat variables, we tested how habitat filtering affected the detection of CNDD and PNDD. The best-fitting model suggested that CNDD and habitat filtering played key roles in seedling survival; but that, contrary to our expectations, phylogenetic positive density dependence (PPDD) had a distinct and important effect. While habitat filtering affected the detection of CNDD by decreasing its apparent strength, it did not explain the positive effects of closely related neighbors. Our results demonstrate that a failure to control for habitat variables and phylogenetic relationships may obscure the importance of conspecific and heterospecific neighbor densities for seedling survival.

  8. The Impact of Reconstruction Methods, Phylogenetic Uncertainty and Branch Lengths on Inference of Chromosome Number Evolution in American Daisies (Melampodium, Asteraceae).

    PubMed

    McCann, Jamie; Schneeweiss, Gerald M; Stuessy, Tod F; Villaseñor, Jose L; Weiss-Schneeweiss, Hanna

    2016-01-01

    Chromosome number change (polyploidy and dysploidy) plays an important role in plant diversification and speciation. Investigating chromosome number evolution commonly entails ancestral state reconstruction performed within a phylogenetic framework, which is, however, prone to uncertainty, whose effects on evolutionary inferences are insufficiently understood. Using the chromosomally diverse plant genus Melampodium (Asteraceae) as model group, we assess the impact of reconstruction method (maximum parsimony, maximum likelihood, Bayesian methods), branch length model (phylograms versus chronograms) and phylogenetic uncertainty (topological and branch length uncertainty) on the inference of chromosome number evolution. We also address the suitability of the maximum clade credibility (MCC) tree as single representative topology for chromosome number reconstruction. Each of the listed factors causes considerable incongruence among chromosome number reconstructions. Discrepancies between inferences on the MCC tree from those made by integrating over a set of trees are moderate for ancestral chromosome numbers, but severe for the difference of chromosome gains and losses, a measure of the directionality of dysploidy. Therefore, reliance on single trees, such as the MCC tree, is strongly discouraged and model averaging, taking both phylogenetic and model uncertainty into account, is recommended. For studying chromosome number evolution, dedicated models implemented in the program ChromEvol and ordered maximum parsimony may be most appropriate. Chromosome number evolution in Melampodium follows a pattern of bidirectional dysploidy (starting from x = 11 to x = 9 and x = 14, respectively) with no prevailing direction.

  9. The Impact of Reconstruction Methods, Phylogenetic Uncertainty and Branch Lengths on Inference of Chromosome Number Evolution in American Daisies (Melampodium, Asteraceae)

    PubMed Central

    McCann, Jamie; Stuessy, Tod F.; Villaseñor, Jose L.; Weiss-Schneeweiss, Hanna

    2016-01-01

    Chromosome number change (polyploidy and dysploidy) plays an important role in plant diversification and speciation. Investigating chromosome number evolution commonly entails ancestral state reconstruction performed within a phylogenetic framework, which is, however, prone to uncertainty, whose effects on evolutionary inferences are insufficiently understood. Using the chromosomally diverse plant genus Melampodium (Asteraceae) as model group, we assess the impact of reconstruction method (maximum parsimony, maximum likelihood, Bayesian methods), branch length model (phylograms versus chronograms) and phylogenetic uncertainty (topological and branch length uncertainty) on the inference of chromosome number evolution. We also address the suitability of the maximum clade credibility (MCC) tree as single representative topology for chromosome number reconstruction. Each of the listed factors causes considerable incongruence among chromosome number reconstructions. Discrepancies between inferences on the MCC tree from those made by integrating over a set of trees are moderate for ancestral chromosome numbers, but severe for the difference of chromosome gains and losses, a measure of the directionality of dysploidy. Therefore, reliance on single trees, such as the MCC tree, is strongly discouraged and model averaging, taking both phylogenetic and model uncertainty into account, is recommended. For studying chromosome number evolution, dedicated models implemented in the program ChromEvol and ordered maximum parsimony may be most appropriate. Chromosome number evolution in Melampodium follows a pattern of bidirectional dysploidy (starting from x = 11 to x = 9 and x = 14, respectively) with no prevailing direction. PMID:27611687

  10. The Impact of Reconstruction Methods, Phylogenetic Uncertainty and Branch Lengths on Inference of Chromosome Number Evolution in American Daisies (Melampodium, Asteraceae).

    PubMed

    McCann, Jamie; Schneeweiss, Gerald M; Stuessy, Tod F; Villaseñor, Jose L; Weiss-Schneeweiss, Hanna

    2016-01-01

    Chromosome number change (polyploidy and dysploidy) plays an important role in plant diversification and speciation. Investigating chromosome number evolution commonly entails ancestral state reconstruction performed within a phylogenetic framework, which is, however, prone to uncertainty, whose effects on evolutionary inferences are insufficiently understood. Using the chromosomally diverse plant genus Melampodium (Asteraceae) as model group, we assess the impact of reconstruction method (maximum parsimony, maximum likelihood, Bayesian methods), branch length model (phylograms versus chronograms) and phylogenetic uncertainty (topological and branch length uncertainty) on the inference of chromosome number evolution. We also address the suitability of the maximum clade credibility (MCC) tree as single representative topology for chromosome number reconstruction. Each of the listed factors causes considerable incongruence among chromosome number reconstructions. Discrepancies between inferences on the MCC tree from those made by integrating over a set of trees are moderate for ancestral chromosome numbers, but severe for the difference of chromosome gains and losses, a measure of the directionality of dysploidy. Therefore, reliance on single trees, such as the MCC tree, is strongly discouraged and model averaging, taking both phylogenetic and model uncertainty into account, is recommended. For studying chromosome number evolution, dedicated models implemented in the program ChromEvol and ordered maximum parsimony may be most appropriate. Chromosome number evolution in Melampodium follows a pattern of bidirectional dysploidy (starting from x = 11 to x = 9 and x = 14, respectively) with no prevailing direction. PMID:27611687

  11. Molecular phylogenetics of the species-rich angiosperm genus Goniothalamus (Annonaceae) inferred from nine chloroplast DNA regions: Synapomorphies and putative correlated evolutionary changes in fruit and seed morphology.

    PubMed

    Tang, Chin Cheung; Thomas, Daniel C; Saunders, Richard M K

    2015-11-01

    A phylogenetic study of the genus Goniothalamus (Annonaceae) is presented using maximum parsimony, maximum likelihood and Bayesian approaches, with 65 species sampled (48.5% of the genus) based on sequences of nine chloroplast DNA regions (11,214 aligned positions). The resultant phylogeny clearly indicates that Goniothalamus is monophyletic. Preliminary research initially focused on identifying synapomorphies and estimating the phylogenetic signal of selected morphological characters based on parsimony and likelihood ancestral character state reconstructions. This prescreening of characters enabled 40 to be selected for further study, and of these 15 are shown here to demonstrate significant phylogenetic signal and to provide clear synapomorphies for several infrageneric clades. Although floral structure in Goniothalamus is comparatively uniform, suggesting a common basic pattern of pollination ecology, fruit and seed morphology in the genus is very diverse and is presumably associated with different patterns of frugivory. The present study assesses correlations amongst fruit and seed characters which are putatively of functional importance with regard to frugivory and dispersal. One-way phylogenetic ANOVA indicates significant phylogenetically independent correlation between the following fruit and seed characters: fruits borne on older branches and/or on the main trunk have larger monocarps than fruits borne on young branches; and monocarps that contain seeds with a hairy testa are larger than those with glabrous seeds. We discuss fruit morphologies and potential explanations for the inferred correlations, and suggest that they may be the result of adaptation to different frugivores (birds, larger non-volant animal and primate seed dispersers, respectively).

  12. Phylogenetic relationships of the Cobitoidea (Teleostei: Cypriniformes) inferred from mitochondrial and nuclear genes with analyses of gene evolution.

    PubMed

    Liu, Si-Qing; Mayden, Richard L; Zhang, Jia-Bo; Yu, Dan; Tang, Qiong-Ying; Deng, Xin; Liu, Huan-Zhang

    2012-10-15

    The superfamily Cobitoidea of the order Cypriniformes is a diverse group of fishes, inhabiting freshwater ecosystems across Eurasia and North Africa. The phylogenetic relationships of this well-corroborated natural group and diverse clade are critical to not only informing scientific communities of the phylogeny of the order Cypriniformes, the world's largest freshwater fish order, but are key to every area of comparative biology examining the evolution of traits, functional structures, and breeding behaviors to their biogeographic histories, speciation, anagenetic divergence, and divergence time estimates. In the present study, two mitochondrial gene sequences (COI, ND4+5) and four single-copy nuclear gene segments (RH1, RAG1, EGR2B, IRBP) were used to infer the phylogenetic relationships of the Cobitoidea as reconstructed from maximum likelihood (ML) and partitioned Bayesian Analysis (BA). Analyses of the combined mitochondrial/nuclear gene datasets revealed five strongly supported monophyletic Cobitoidea families and their sister-group relationships: Botiidae+(Vaillantellidae+(Cobitidae+(Nemacheilidae+Balitoridae))). These recovered relationships are in agreement with previous systematic studies on the order Cypriniformes and/or those focusing on the superfamily Cobitoidea. Using these relationships, our analyses revealed pattern lineage- or ecological-group-specific evolution of these genes for the Cobitoidea. These observations and results corroborate the hypothesis that these group-specific-ancestral ecological characters have contributed in the diversification and/or adaptations within these groups. Positive selections were detected in RH1 of nemacheilids and in RAG1 of nemacheilids and genus Vaillantella, which indicated that evolution of RH1 (related to eye's optic sense) and RAG1 (related to immunity) genes appeared to be important for the diversification of these groups. The balitorid lineage (those species inhabiting fast-flowing riverine habitats) had

  13. Molecular and phylogenetic analysis of PCR-amplified cyclin-dependent kinase (CDK) family sequences from representatives of the earliest available lineages of eukaryotes.

    PubMed

    Riley, D E; Krieger, J N

    1995-10-01

    Cyclin-dependent kinase (CDK) and cell division control (CDC2) sequences are strongly conserved among eukaryotes and may complement the use of other sequence families in eukaryotic phylogenetic inference. We synthesized degenerate PCR primers to amplify the catalytic region of CDK homologs in representatives of the earliest available lineages of eukaryotes. CDK family sequence-based, maximum-likelihood distance measurements with neighbor-joining, and Fitch-Margoliash least-squares analyses produced unrooted dendrograms that included protists, yeasts, and higher eukaryotes. Bootstrap confidence estimates supported CDK-based phylogenetic groupings among the protists, fungi, and vertebrates although resolution within these groups was often insignificant. However, Trichomonas vaginalis and Giardia lamblia exhibited two of the most divergent CDK-like sequences consistent with rRNA-phylogenetic inference of early divergence of these eukaryotic lineages. In the evolution from unicellular to multicellular organisms, a constellation of amino acid residues aligning with the human, CDK N-terminal beta sheet may have undergone abrupt replacement.

  14. Phylogenetic analysis of different breeds of domestic chickens in selected area of Peninsular Malaysia inferred from partial cytochrome b gene information and RAPD markers.

    PubMed

    Yap, Fook Choy; Yan, Yap Jin; Loon, Kiung Teh; Zhen, Justina Lee Ning; Kamau, Nelly Warau; Kumaran, Jayaraj Vijaya

    2010-10-01

    The present investigation was carried out in an attempt to study the phylogenetic analysis of different breeds of domestic chickens in Peninsular Malaysia inferred from partial cytochrome b gene information and random amplified polymorphic DNA (RAPD) markers. Phylogenetic analysis using both neighbor-joining (NJ) and maximum parsimony (MP) methods produced three clusters that encompassed Type-I village chickens, the red jungle fowl subspecies and the Japanese Chunky broilers. The phylogenetic analysis also revealed that majority of the Malaysian commercial chickens were randomly assembled with the Type-II village chickens. In RAPD assay, phylogenetic analysis using neighbor-joining produced six clusters that were completely distinguished based on the locality of chickens. High levels of genetic variations were observed among the village chickens, the commercial broilers, and between the commercial broilers and layer chickens. In this study, it was found that Type-I village chickens could be distinguished from the commercial chickens and Type-II village chickens at the position of the 27th nucleotide of the 351 bp cytochrome b gene. This study also revealed that RAPD markers were unable to differentiate the type of chickens, but it showed the effectiveness of RAPD in evaluating the genetic variation and the genetic relationships between chicken lines and populations.

  15. Dependency-dependent interference: NPI interference, agreement attraction, and global pragmatic inferences

    PubMed Central

    Xiang, Ming; Grove, Julian; Giannakidou, Anastasia

    2013-01-01

    Previous psycholinguistics studies have shown that when forming a long distance dependency in online processing, the parser sometimes accepts a sentence even though the required grammatical constraints are only partially met. A mechanistic account of how such errors arise sheds light on both the underlying linguistic representations involved and the processing mechanisms that put such representations together. In the current study, we contrast the negative polarity items (NPI) interference effect, as shown by the acceptance of an ungrammatical sentence like “The bills that democratic senators have voted for will ever become law,” with the well-known phenomenon of agreement attraction (“The key to the cabinets are … ”). On the surface, these two types of errors look alike and thereby can be explained as being driven by the same source: similarity based memory interference. However, we argue that the linguistic representations involved in NPI licensing are substantially different from those of subject-verb agreement, and therefore the interference effects in each domain potentially arise from distinct sources. In particular, we show that NPI interference at least partially arises from pragmatic inferences. In a self-paced reading study with an acceptability judgment task, we showed NPI interference was modulated by participants' general pragmatic communicative skills, as quantified by the Autism-Spectrum Quotient (AQ, Baron-Cohen et al., 2001), especially in offline tasks. Participants with more autistic traits were actually less prone to the NPI interference effect than those with fewer autistic traits. This result contrasted with agreement attraction conditions, which were not influenced by individual pragmatic skill differences. We also show that different NPI licensors seem to have distinct interference profiles. We discuss two kinds of interference effects for NPI licensing: memory-retrieval based and pragmatically triggered. PMID:24109468

  16. Target dependency of brain mechanism involved in dispositional inference: a PET study.

    PubMed

    Sugiura, Motoaki; Gotoh, Ryoi; Okada, Ken; Yamaguchi, Keiichiroh; Itoh, Masatoshi; Fukuda, Hiroshi; Kawashima, Ryuta

    2004-04-01

    The cognitive mechanism for inference of personal dispositions, such as personality traits and abilities, is postulated to be dependent on the amount of episodic memory concerning target persons. To examine whether there is such target dependency in the brain mechanism during dispositional inference, we measured brain activity of normal volunteers while they were performing seven dispositional inference tasks, each for a target person in different categories, using positron emission tomography (PET). Effect of the target-person category on activation was significant in the posterodorsal, polar, and ventral subdivisions of medial prefrontal cortex, right orbitoinsular junction, left temporal pole and superior temporal sulcus, cerebellum, and thalamus, suggesting the existence of target dependency in activation during dispositional inference. The amount of episodic memory concerning a target person measured using the self-evaluative questionnaire was positively correlated with the activation in the polar subdivision of the medial prefrontal cortex, and negatively with that in a region in the left superior temporal sulcus. Together with the available knowledge on the functional roles of these regions and the proposed cognitive model in social psychology, our results suggest that these two regions play roles supplementary to each other in dispositional inference; a region in the superior temporal sulcus is involved in the processing of relevant episodic exemplar and the polar subdivision of the medial prefrontal cortex in the processing of summarized value information about the target person. PMID:15050563

  17. Phylogenetic position of symbiotic protist Dinenympha [correction of Dinemympha] exilis in the hindgut of the termite Reticulitermes speratus inferred from the protein phylogeny of elongation factor 1 alpha.

    PubMed

    Moriya, S; Ohkuma, M; Kudo, T

    1998-04-14

    The phylogenetic position of the symbiotic oxymonad Dinenympha exilis, found in the hindgut of the lower termite Reticulitermes speratus, was determined by analysis of translation elongation factor 1 alpha (EF-1 alpha). cDNA corresponding to a major part of the amino acid coding region of EF-1 alpha mRNA was amplified by the reverse transcription polymerase chain reaction (RT-PCR) method from total mRNA of termite hindgut microorganisms without cultivation. The product was cloned into a plasmid vector, pGEM-T, and the clones were isolated and sequenced. One of the EF-1 alpha clones isolated was assigned to the protist D. exilis by whole-cell in-situ hybridization using a specific oligonucleotide probe with enzymatic signal amplification. The deduced amino acid sequence was aligned with those of other eukaryotic and archaeabacterial EF-1 alpha s, and the phylogenetic relationships among early branching eukaryotes were inferred by using the distance matrix method and the maximum parsimony method. The phylogenetic analysis indicated that the D. exilis offshoot occurred before mitochondria-containing organisms and D. exilis branched out after the diplomonads clade. These results indicate that the oxymonad D. exilis is one of the early branching organisms and suggest that the oxymonads form a lineage independent of other early branching organisms. PMID:9573371

  18. Sorting through the chaff, nDNA gene trees for phylogenetic inference and hybrid identification of annual sunflowers (Helianthus sect. Helianthus).

    PubMed

    Moody, Michael L; Rieseberg, Loren H

    2012-07-01

    The annual sunflowers (Helianthus sect. Helianthus) present a formidable challenge for phylogenetic inference because of ancient hybrid speciation, recent introgression, and suspected issues with deep coalescence. Here we analyze sequence data from 11 nuclear DNA (nDNA) genes for multiple genotypes of species within the section to (1) reconstruct the phylogeny of this group, (2) explore the utility of nDNA gene trees for detecting hybrid speciation and introgression; and (3) test an empirical method of hybrid identification based on the phylogenetic congruence of nDNA gene trees from tightly linked genes. We uncovered considerable topological heterogeneity among gene trees with or without three previously identified hybrid species included in the analyses, as well as a general lack of reciprocal monophyly of species. Nonetheless, partitioned Bayesian analyses provided strong support for the reciprocal monophyly of all species except H. annuus (0.89 PP), the most widespread and abundant annual sunflower. Previous hypotheses of relationships among taxa were generally strongly supported (1.0 PP), except among taxa typically associated with H. annuus, apparently due to the paraphyly of the latter in all gene trees. While the individual nDNA gene trees provided a useful means for detecting recent hybridization, identification of ancient hybridization was problematic for all ancient hybrid species, even when linkage was considered. We discuss biological factors that affect the efficacy of phylogenetic methods for hybrid identification.

  19. Local dependence in random graph models: characterization, properties and statistical inference

    PubMed Central

    Schweinberger, Michael; Handcock, Mark S.

    2015-01-01

    Summary Dependent phenomena, such as relational, spatial and temporal phenomena, tend to be characterized by local dependence in the sense that units which are close in a well-defined sense are dependent. In contrast with spatial and temporal phenomena, though, relational phenomena tend to lack a natural neighbourhood structure in the sense that it is unknown which units are close and thus dependent. Owing to the challenge of characterizing local dependence and constructing random graph models with local dependence, many conventional exponential family random graph models induce strong dependence and are not amenable to statistical inference. We take first steps to characterize local dependence in random graph models, inspired by the notion of finite neighbourhoods in spatial statistics and M-dependence in time series, and we show that local dependence endows random graph models with desirable properties which make them amenable to statistical inference. We show that random graph models with local dependence satisfy a natural domain consistency condition which every model should satisfy, but conventional exponential family random graph models do not satisfy. In addition, we establish a central limit theorem for random graph models with local dependence, which suggests that random graph models with local dependence are amenable to statistical inference. We discuss how random graph models with local dependence can be constructed by exploiting either observed or unobserved neighbourhood structure. In the absence of observed neighbourhood structure, we take a Bayesian view and express the uncertainty about the neighbourhood structure by specifying a prior on a set of suitable neighbourhood structures. We present simulation results and applications to two real world networks with ‘ground truth’. PMID:26560142

  20. Evolutionary history and phylogenetic relationship between Auxis thazard and Auxis rochei inferred from COI sequences of mtDNA.

    PubMed

    Kumar, Girish; Kunal, Swaraj Priyaranjan; Shyama, S K

    2013-01-01

    Tunas of the genus Auxis are cosmopolitan species and the smallest members of the tribe Thunnini, the true tunas. In the present study, COI sequences of mtDNA were employed to examine the evolutionary history and phylogenetic relationship between A. thazard and A. rochei. A total of 29 COI sequences were retrieved from NCBI. Historic demographic analyses of sequence data showed that A. thazard has undergone sudden population expansion in the past while population size of A. rochei has been remain constant for long period. Non-significant value of Tajimas's D (P = 0.22400) and Fu's FS (P = 0.21400) test fail to reject the null hypothesis of neutral evolution for A. rochei. Phylogenetic analyses of nucleotide sequences demonstrated separate clusters for both species and are strongly supported by 98% bootstrap value. The results of the present study suggest the recent founding of A. thazard in world ocean while A. rochei represents the ancestral species. PMID:24084241

  1. Phylogenetic relationships of Paradiclybothrium pacificum and Diclybothrium armatum (Monogenoidea: Diclybothriidae) inferred from 18S rDNA sequence data.

    PubMed

    Rozhkovan, Konstantin V; Shedko, Marina B

    2015-10-01

    The Diclybothriidae (Monogenoidea: Oligonchoinea) includes specific parasites of fishes assigned to the ancient order Acipenseriformes. Phylogeny of the Diclybothriidae is still unclear despite several systematic studies based on morphological characters. Together with the closely related Hexabothriidae represented by parasites of sharks and ray-fishes, the position of Diclybothriidae in different taxonomical systems has been matter of discussion. Here, we present the first molecular data on Diclybothriidae. The SSU rRNA gene was used to investigate the phylogenetic position of Paradiclybothrium pacificum and Diclybothrium armatum among the other Oligonchoinea. Complete nucleotide sequences of P. pacificum and D. armatum demonstrated high identity (98.53%) with no intraspecific sequence variability. Specimens of D. armatum were obtained from different hosts (Acipenser schrenckii and Huso dauricus); however, variation by host was not detected. The sequence divergence and phylogenetic trees data show that Diclybothriidae and Hexabothriidae are more closely related to each other than with other representatives of Oligonchoinea.

  2. Phylogenetic analysis of hampala fishes (subfamily Cyprininae) in Malaysia inferred from partial mitochondrial cytochrome B DNA sequences.

    PubMed

    Ryan, Jeffrine R J; Esa, Yuzine B

    2006-10-01

    This study examined 396 base pairs of the mitochondrial cytochrome b gene from 110 individuals belonging to the genus Hampala, a group of freshwater cyprinids that inhabit Southeast Asia. The samples were taken from various locations throughout Sarawak, Sabah, and peninsular Malaysia. The nucleotide sequences were subjected to phylogenetic analyses by using the neighbor-joining, maximum parsimony, and maximum likelihood methods. All three methods revealed the reciprocally monophyletic relationship of Hampala macrolepidota to the other Hampala forms, thus strongly supporting its status as a distinct species. Phylogenetic analysis also discovered the existence of two H. bimaculata lineages endemic to Borneo: (1) a newly identified species from the southern and central part of Sarawak assigned as H. bimaculata Type A and (2) the previously described H. bimaculata from northern Sarawak and the west coast of Sabah assigned as H. bimaculata Type B. However, the status of H. sabana and an intermediate form were not elucidated. The results suggest that the intermediate form from the Tawau population is actually a subpopulation of H. sabana, while the highly divergent intermediate form from Kalabakan could represent a cryptic species. The sharing of H. macrolepidota haplotypes in the southern peninsular Malaysia and southern and central Sarawak samples (Hm1 and Hm2) reflected the recent disconnection of the two regions, during the late Pleistocene. Overall, the partial sequence of the mitochondrial cytochrome b gene was useful for resolving the phylogenetic relationships among Hampala fishes in Malaysia.

  3. Phylogenetic relationships of North American phoxinins (Actinopterygii: Cypriniformes: Leuciscidae) as inferred from S7 nuclear DNA sequences.

    PubMed

    Bufalino, Angelo P; Mayden, Richard L

    2010-04-01

    The phylogenetic systematics of North American (NA) phoxinins, and family Leuciscidae (formerly Cyprinidae, in part) in general, have had a very dynamic history marked by uncertainty regarding the validity of many taxonomic designations. Recent molecular studies, based on mitochondrial DNA sequences, of NA phoxinins have provided the first comprehensive hypotheses for the group and suggest that they are comprised of three major lineages: creek chub-plagopterin (CC-P), western, and open posterior myodome clades. A number of phylogenetic evaluations, using both mitochondrial and nuclear DNA sequences, of the formerly recognized family Cyprinidae (=Cyprinoidea) have failed to support the traditional subfamily designations. This study, based on sequences from the first intron of the nuclear gene S7, supports many of the findings of the earlier molecular studies of NA phoxinins, but also suggests that European Leuciscidae evolved within the NA phoxinin lineage and are most likely descended from the common ancestor of the CC-P clade. As in some other recent phylogenetic studies of the Cyprinoidea, analyses of this study fail to support the broader subfamily designations within the formerly recognized family Cyprinidae.

  4. Molecular and phylogenetic analysis of pyridoxal phosphate-dependent acyltransferase of Exiguobacterium acetylicum.

    PubMed

    Rajendran, Narayanan; Smith, Colby; Mazhawidza, Williard

    2009-01-01

    The pyridoxal-5'-phosphate (PLP)-dependent family of enzymes is a very diverse group of proteins that metabolize small molecules like amino acids and sugars, and synthesize cofactors for other metabolic pathways through transamination, decarboxylation, racemization, and substitution reactions. In this study we employed degenerated primer-based PCR amplification, using genomic DNA isolated from the soil bacterium Exiguobacterium acetylicum strain SN as template. We revealed the presence of a PLP-dependent family of enzymes, such as PLP-dependent acyltransferase, and similarity to 8-amino-7-oxononoate synthase. Sequencing analysis and multiple alignment of the thymidine-adenine-cloned PCR amplicon revealed PLP-dependent family enzymes with specific confering codes and consensus amino acid residues specific to this group of functional proteins. Amino acid residues common to the majority of PLP-dependent enzymes were also revealed by the Lasergene MegAlign software. A phylogenetic tree was constructed. Its analysis revealed a close relationship of E. acetylicum to other bacteria isolated from extreme environments suggesting similarities in anabolic adaptability and evolutionary development. PMID:20158163

  5. Phylogenetic Relationships of Tribes Within Harpalinae (Coleoptera: Carabidae) as Inferred from 28S Ribosomal DNA and the Wingless Gene

    PubMed Central

    Ober, Karen A.; Maddison, David R.

    2008-01-01

    Harpalinae is a large, monophyletic subfamily of carabid ground beetles containing more than 19,000 species in approximately 40 tribes. The higher level phylogenetic relationships within harpalines were investigated based on nucleotide data from two nuclear genes, wingless and 28S rDNA. Phylogenetic analyses of combined data indicate that many harpaline tribes are monophyletic, however the reconstructed trees showed little support for deeper nodes. In addition, our results suggest that the Lebiomorph Assemblage (tribes Lebiini, Cyclosomini, Graphipterini, Perigonini, Odacanthini, Lachnophorini, Pentagonicini, Catapiesini and Calophaenini), which is united by a morphological synapomorphy, is not monophyletic, and the tribe Lebiini is paraphyletic with respect to members of Cyclosomini. Two unexpected clades of tribes were supported: the Zuphiitae, comprised of Anthiini, Zuphiini, Helluonini, Dryptini, Galeritini, and Physocrotaphini; and a clade comprised of Orthogoniini, Pseudomorphini, and Graphipterini. The data presented in this study represent a dense sample of taxa to examine the molecular phylogeny of Harpalinae and provide a useful framework to examine the origin and evolution of morphological and ecological diversity in this group. PMID:20302528

  6. Phylogenetic Relationships and Morphological Character Evolution of Photosynthetic Euglenids (Excavata) Inferred from Taxon-rich Analyses of Five Genes.

    PubMed

    Karnkowska, Anna; Bennett, Matthew S; Watza, Donovan; Kim, Jong Im; Zakryś, Bożena; Triemer, Richard E

    2015-01-01

    Photosynthetic euglenids acquired chloroplasts by secondary endosymbiosis, which resulted in changes to their mode of nutrition and affected the evolution of their morphological characters. Mapping morphological characters onto a reliable molecular tree could elucidate major trends of those changes. We analyzed nucleotide sequence data from regions of three nuclear-encoded genes (nSSU, nLSU, hsp90), one chloroplast-encoded gene (cpSSU) and one nuclear-encoded chloroplast gene (psbO) to estimate phylogenetic relationships among 59 photosynthetic euglenid species. Our results were consistent with previous works; most genera were monophyletic, except for the polyphyletic genus Euglena, and the paraphyletic genus Phacus. We also analyzed character evolution in photosynthetic euglenids using our phylogenetic tree and eight morphological traits commonly used for generic and species diagnoses, including: characters corresponding to well-defined clades, apomorphies like presence of lorica and mucilaginous stalks, and homoplastic characters like rigid cells and presence of large paramylon grains. This research indicated that pyrenoids were lost twice during the evolution of phototrophic euglenids, and that mucocysts, which only occur in the genus Euglena, evolved independently at least twice. In contrast, the evolution of cell shape and chloroplast morphology was difficult to elucidate, and could not be unambiguously reconstructed in our analyses.

  7. Phylogenetic relationships and historical biogeography of neotropical parrots (Psittaciformes: Psittacidae: Arini) inferred from mitochondrial and nuclear DNA sequences.

    PubMed

    Tavares, Erika Sendra; Baker, Allan J; Pereira, Sérgio Luiz; Miyaki, Cristina Yumi

    2006-06-01

    Previous hypotheses of phylogenetic relationships among Neotropical parrots were based on limited taxon sampling and lacked support for most internal nodes. In this study we increased the number of taxa (29 species belonging to 25 of the 30 genera) and gene sequences (6388 base pairs of RAG-1, cyt b, NADH2, ATPase 6, ATPase 8, COIII, 12S rDNA, and 16S rDNA) to obtain a stronger molecular phylogenetic hypothesis for this group of birds. Analyses of the combined gene sequences using maximum likelihood and Bayesian methods resulted in a well-supported phylogeny and indicated that amazons and allies are a sister clade to macaws, conures, and relatives, and these two clades are in turn a sister group to parrotlets. Key morphological and behavioral characters used in previous classifications were mapped on the molecular tree and were phylogenetically uninformative. We estimated divergence times of taxa using the molecular tree and Bayesian and penalized likelihood methods that allow for rate variation in DNA substitutions among sites and taxa. Our estimates suggest that the Neotropical parrots shared a common ancestor with Australian parrots 59 Mya (million of years ago; 95% credibility interval (CrI) 66, 51 Mya), well before Australia separated from Antarctica and South America, implying that ancestral parrots were widespread in Gondwanaland. Thus, the divergence of Australian and Neotropical parrots could be attributed to vicariance. The three major clades of Neotropical parrots originated about 50 Mya (95% CrI 57, 41 Mya), coinciding with periods of higher sea level when both Antarctica and South America were fragmented with transcontinental seaways, and likely isolated the ancestors of modern Neotropical parrots in different regions in these continents. The correspondence between major paleoenvironmental changes in South America and the diversification of genera in the clade of amazons and allies between 46 and 16 Mya suggests they diversified exclusively in South

  8. Evolutionary history of tall fescue morphotypes inferred from molecular phylogenetics of the Lolium-Festuca species complex

    PubMed Central

    2010-01-01

    Background The agriculturally important pasture grass tall fescue (Festuca arundinacea Schreb. syn. Lolium arundinaceum (Schreb.) Darbysh.) is an outbreeding allohexaploid, that may be more accurately described as a species complex consisting of three major (Continental, Mediterranean and rhizomatous) morphotypes. Observation of hybrid infertility in some crossing combinations between morphotypes suggests the possibility of independent origins from different diploid progenitors. This study aims to clarify the evolutionary relationships between each tall fescue morphotype through phylogenetic analysis using two low-copy nuclear genes (encoding plastid acetyl-CoA carboxylase [Acc1] and centroradialis [CEN]), the nuclear ribosomal DNA internal transcribed spacer (rDNA ITS) and the chloroplast DNA (cpDNA) genome-located matK gene. Other taxa within the closely related Lolium-Festuca species complex were also included in the study, to increase understanding of evolutionary processes in a taxonomic group characterised by multiple inter-specific hybridisation events. Results Putative homoeologous sequences from both nuclear genes were obtained from each polyploid species and compared to counterparts from 15 diploid taxa. Phylogenetic reconstruction confirmed F. pratensis and F. arundinacea var. glaucescens as probable progenitors to Continental tall fescue, and these species are also likely to be ancestral to the rhizomatous morphotype. However, these two morphotypes are sufficiently distinct to be located in separate clades based on the ITS-derived data set. All four of the generated data sets suggest independent evolution of the Mediterranean and Continental morphotypes, with minimal affinity between cognate sequence haplotypes. No obvious candidate progenitor species for Mediterranean tall fescues were identified, and only two putative sub-genome-specific haplotypes were identified for this morphotype. Conclusions This study describes the first phylogenetic analysis of

  9. Phylogenetic relationships in the Niviventer-Chiromyscus complex (Rodentia, Muridae) inferred from molecular data, with description of a new species

    PubMed Central

    Balakirev, Alexander E.; Abramov, Alexei V.; Rozhnov, Viatcheslav V.

    2014-01-01

    Abstract Based on molecular data for mitochondrial (Cyt b, COI) and nuclear (IRBP, GHR) genes, and morphological examinations of museum specimens, we examined diversity, species boundaries, and relationships within and between the murine genera Chiromyscus and Niviventer. Phylogenetic patterns recovered demonstrate that Niviventer sensu lato is not monophyletic but instead includes Chiromyscus chiropus, the only previously recognized species of Chiropus. To maintain the genera Niviventer and Chiropus as monophyletic lineages, the scope and definition of the genus Chiromyscus is revised to include at least three distinct species: Chiromyscus chiropus (the type species of Chiromyscus), Chiromyscus langbianis (previously regarded as a species of Niviventer), and a new species, described in this paper under the name Chiromyscus thomasi sp. n. PMID:25493050

  10. Differential introgression and effective size of marker type influence phylogenetic inference of a recently divergent avian group (Phasianidae: Tympanuchus).

    PubMed

    Galla, Stephanie J; Johnson, Jeff A

    2015-03-01

    Life history strategies can influence the effective population size (Ne) of loci differently based on their mode of inheritance. Recognizing how this may affect the rate of lineage sorting among marker types is important for studies focused on resolving phylogenetic relationships among recently divergent taxa. In this study, we use gene tree, coalescent-based species tree, and isolation-with-migration analyses to explore the differences between marker types (autosomal, Z-linked, and mitochondrial) in resolving phylogenetic relationships among North American prairie grouse (Tympanuchus). We found that Z-linked loci were more likely to identify monophyletic relationships among prairie grouse species compared to autosomal and mtDNA loci in both species and gene tree analyses, with species tree analyses outperforming gene trees. These results were further supported with isolation-with-migration analyses, where Z-linked loci largely followed a strict isolation model while autosomal loci were more likely to fit a model with gene flow between species following population divergence. While accounting for differences in inheritance pattern (or Ne) for marker type, results suggest that additional factors, such as strong sexual selection and sex-biased introgression (i.e., male-biased postzygotic hybrid behavioral isolation or "unsexy son"), may further explain the decreased diversity levels and increased rate of lineage sorting observed with the Z-linked loci relative to autosomal and mtDNA loci. In fact, to our knowledge no hybrid male prairie grouse have been observed breeding in the wild, yet hybrid females along with backcross females are known to produce viable offspring. Overall, this study highlights that more work is needed to determine how complex models of gene flow (i.e., sex biased introgression) and differences in the effective size among marker types based on differing life history strategies influence divergence date estimation and species delimitation.

  11. Evolutionary history of the Afro-Madagascan Ixora species (Rubiaceae): species diversification and distribution of key morphological traits inferred from dated molecular phylogenetic trees

    PubMed Central

    Tosh, J.; Dessein, S.; Buerki, S.; Groeninckx, I.; Mouly, A.; Bremer, B.; Smets, E. F.; De Block, P.

    2013-01-01

    Background and Aims Previous work on the pantropical genus Ixora has revealed an Afro-Madagascan clade, but as yet no study has focused in detail on the evolutionary history and morphological trends in this group. Here the evolutionary history of Afro-Madagascan Ixora spp. (a clade of approx. 80 taxa) is investigated and the phylogenetic trees compared with several key morphological traits in taxa occurring in Madagascar. Methods Phylogenetic relationships of Afro-Madagascan Ixora are assessed using sequence data from four plastid regions (petD, rps16, rpoB-trnC and trnL-trnF) and nuclear ribosomal external transcribed spacer (ETS) and internal transcribed spacer (ITS) regions. The phylogenetic distribution of key morphological characters is assessed. Bayesian inference (implemented in BEAST) is used to estimate the temporal origin of Ixora based on fossil evidence. Key Results Two separate lineages of Madagascan taxa are recovered, one of which is nested in a group of East African taxa. Divergence in Ixora is estimated to have commenced during the mid Miocene, with extensive cladogenesis occurring in the Afro-Madagascan clade during the Pliocene onwards. Conclusions Both lineages of Madagascan Ixora exhibit morphological innovations that are rare throughout the rest of the genus, including a trend towards pauciflorous inflorescences and a trend towards extreme corolla tube length, suggesting that the same ecological and selective pressures are acting upon taxa from both Madagascan lineages. Novel ecological opportunities resulting from climate-induced habitat fragmentation and corolla tube length diversification are likely to have facilitated species radiation on Madagascar. PMID:24142919

  12. Inferring Phylogenetic Relationships of Indian Citron (Citrus medica L.) based on rbcL and matK Sequences of Chloroplast DNA.

    PubMed

    Uchoi, Ajit; Malik, Surendra Kumar; Choudhary, Ravish; Kumar, Susheel; Rohini, M R; Pal, Digvender; Ercisli, Sezai; Chaudhury, Rekha

    2016-06-01

    Phylogenetic relationships of Indian Citron (Citrus medica L.) with other important Citrus species have been inferred through sequence analyses of rbcL and matK gene region of chloroplast DNA. The study was based on 23 accessions of Citrus genotypes representing 15 taxa of Indian Citrus, collected from wild, semi-wild, and domesticated stocks. The phylogeny was inferred using the maximum parsimony (MP) and neighbor-joining (NJ) methods. Both MP and NJ trees separated all the 23 accessions of Citrus into five distinct clusters. The chloroplast DNA (cpDNA) analysis based on rbcL and matK sequence data carried out in Indian taxa of Citrus was useful in differentiating all the true species and species/varieties of probable hybrid origin in distinct clusters or groups. Sequence analysis based on rbcL and matK gene provided unambiguous identification and disposition of true species like C. maxima, C. medica, C. reticulata, and related hybrids/cultivars. The separation of C. maxima, C. medica, and C. reticulata in distinct clusters or sub-clusters supports their distinctiveness as the basic species of edible Citrus. However, the cpDNA sequence analysis of rbcL and matK gene could not find any clear cut differentiation between subgenera Citrus and Papeda as proposed in Swingle's system of classification. PMID:26956119

  13. Time scale for cyclostome evolution inferred with a phylogenetic diagnosis of hagfish and lamprey cDNA sequences.

    PubMed

    Kuraku, Shigehiro; Kuratani, Shigeru

    2006-12-01

    The Cyclostomata consists of the two orders Myxiniformes (hagfishes) and Petromyzoniformes (lampreys), and its monophyly has been unequivocally supported by recent molecular phylogenetic studies. Under this updated vertebrate phylogeny, we performed in silico evolutionary analyses using currently available cDNA sequences of cyclostomes. We first calculated the GC-content at four-fold degenerate sites (GC(4)), which revealed that an extremely high GC-content is shared by all the lamprey species we surveyed, whereas no striking pattern in GC-content was observed in any of the hagfish species surveyed. We then estimated the timing of diversification in cyclostome evolution using nucleotide and amino acid sequences. We obtained divergence times of 470-390 million years ago (Mya) in the Ordovician-Silurian-Devonian Periods for the interordinal split between Myxiniformes and Petromyzoniformes; 90-60 Mya in the Cretaceous-Tertiary Periods for the split between the two hagfish subfamilies, Myxininae and Eptatretinae; 280-220 Mya in the Permian-Triassic Periods for the split between the two lamprey subfamilies, Geotriinae and Petromyzoninae; and 30-10 Mya in the Tertiary Period for the split between the two lamprey genera, Petromyzon and Lethenteron. This evolutionary configuration indicates that Myxiniformes and Petromyzoniformes diverged shortly after the common ancestor of cyclostomes split from the future gnathostome lineage. Our results also suggest that intra-subfamilial diversification in hagfish and lamprey lineages (especially those distributed in the northern hemisphere) occurred in the Cretaceous or Tertiary Periods.

  14. Myo-anatomy of juvenile and adult loxosomatid Entoprocta and the use of muscular body plans for phylogenetic inferences.

    PubMed

    Wanninger, Andreas

    2004-08-01

    The anatomy of the muscle bauplan in juvenile buds and adult specimens of the solitary loxosomatid entoprocts Loxosoma nielseni and L. annelidicola was studied by means of fluorescence staining of F-actin and confocal laser scanning microscopy. Although the general myo-anatomy of the body wall shows numerous similarities, both species express significant variations in the arrangement of their pedal muscles. In addition, L. annelidicola alone shows a distinct pair of rectum retractor muscles. Circular muscles are absent in the entire body wall of both species, as well as in previously investigated colonial taxa, which is therefore regarded as basal for Entoprocta. This is in striking contrast to the conditions found in other spiralian or lophotrochozoan taxa. The simple morphology of entoproct tentacle muscles, however, coincides with the phoronid-ectoproct condition and may be due to functional constraints of a simple filter-feeding system. This work shows that variations in the muscular anatomy provide useful characters for systematic analyses on species as well as phylum level and thus allow significant insight regarding metazoan body plan evolution. The phenomenon of phenotypic plasticity and its consequences for phylogenetic interpretations, however, must be carefully considered.

  15. Intraspecific differentiation of Paramecium novaurelia strains (Ciliophora, Protozoa) inferred from phylogenetic analysis of ribosomal and mitochondrial DNA variation.

    PubMed

    Tarcz, Sebastian

    2013-01-01

    Paramecium novaurelia Beale and Schneller, 1954, was first found in Scotland and is known to occur mainly in Europe, where it is the most common species of the P. aurelia complex. In recent years, two non-European localities have been described: Turkey and the United States of America. This article presents the analysis of intraspecific variability among 25 strains of P. novaurelia with the application of ribosomal and mitochondrial loci (ITS1-5.8S-ITS2, 5' large subunit rDNA (5'LSU rDNA) and cytochrome c oxidase subunit 1 (COI) mtDNA). The mean distance observed for all of the studied P. novaurelia sequence pairs was p=0.008/0.016/0.092 (ITS1-5.8S-ITS2/5'LSU rDNA/COI). Phylogenetic trees (NJ/MP/BI) based on a comparison of all of the analysed sequences show that the studied strains of P. novaurelia form a distinct clade, separate from the P. caudatum outgroup, and are divided into two clusters (A and B) and two branches (C and D). The occurrence of substantial genetic differentiation within P. novaurelia, confirmed by the analysed DNA fragments, indicates a rapid evolution of particular species within the Paramecium genus.

  16. Time scale for cyclostome evolution inferred with a phylogenetic diagnosis of hagfish and lamprey cDNA sequences.

    PubMed

    Kuraku, Shigehiro; Kuratani, Shigeru

    2006-12-01

    The Cyclostomata consists of the two orders Myxiniformes (hagfishes) and Petromyzoniformes (lampreys), and its monophyly has been unequivocally supported by recent molecular phylogenetic studies. Under this updated vertebrate phylogeny, we performed in silico evolutionary analyses using currently available cDNA sequences of cyclostomes. We first calculated the GC-content at four-fold degenerate sites (GC(4)), which revealed that an extremely high GC-content is shared by all the lamprey species we surveyed, whereas no striking pattern in GC-content was observed in any of the hagfish species surveyed. We then estimated the timing of diversification in cyclostome evolution using nucleotide and amino acid sequences. We obtained divergence times of 470-390 million years ago (Mya) in the Ordovician-Silurian-Devonian Periods for the interordinal split between Myxiniformes and Petromyzoniformes; 90-60 Mya in the Cretaceous-Tertiary Periods for the split between the two hagfish subfamilies, Myxininae and Eptatretinae; 280-220 Mya in the Permian-Triassic Periods for the split between the two lamprey subfamilies, Geotriinae and Petromyzoninae; and 30-10 Mya in the Tertiary Period for the split between the two lamprey genera, Petromyzon and Lethenteron. This evolutionary configuration indicates that Myxiniformes and Petromyzoniformes diverged shortly after the common ancestor of cyclostomes split from the future gnathostome lineage. Our results also suggest that intra-subfamilial diversification in hagfish and lamprey lineages (especially those distributed in the northern hemisphere) occurred in the Cretaceous or Tertiary Periods. PMID:17261918

  17. Phylogenetic position of Mediterranean Astereae and character evolution of daisies (Bellis, Asteraceae) inferred from nrDNA ITS sequences.

    PubMed

    Fiz, Omar; Valcárcel, Virginia; Vargas, Pablo

    2002-10-01

    Phylogenetic analyses of nrITS sequences of Asteraceae revealed that the Bellis group is a natural assemblage comprising all the species of Bellis and Bellium, but not Rhynchospermum. In contrast, we propose to include the genera Bellis, Bellium, and Bellidastrum in the subtribe Bellidinae in the interest of circumscribing natural groups. Our results also suggest an early diversification in the western Mediterranean Basin of two monophyletic lineages, Bellis and Bellium. Three major groups can be distinguished within BELLIS: (1) the B. perennis group, containing five annual and perennial species with three ploidy levels (diploid, octoploid, and decaploid), which are distributed throughout the Mediterranean Basin despite lack of pappus; (2) the Bellis sylvestris group, with five annual and perennial species primarily from the western Mediterranean, in which there are five ploidy levels (diploid, tetraploid, hexaploid, octoploid, and decaploid); and (3) a basal grade consisting of three diploid, perennial species which displays remarkable diversification of morphologies. Striking characteristics, such as an annual life form, polyploidy, and loss of pappus, seem to have occurred in parallel several times and in different geographical areas during the early diversification of Bellis species in the western Mediterranean. Character evolution reconstructions allow us to describe a putative ancestor of the genus Bellis (proto-Bellis). PMID:12383758

  18. Phylogenetically resolving epidemiologic linkage

    DOE PAGES

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-02-22

    The use of phylogenetic trees in epidemiological investigations has become commonplace, but their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the truemore » transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. Moreover, we confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results.« less

  19. Phylogenetically resolving epidemiologic linkage.

    PubMed

    Romero-Severson, Ethan O; Bulla, Ingo; Leitner, Thomas

    2016-03-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals' HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  20. Phylogenetically resolving epidemiologic linkage

    PubMed Central

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-01-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  1. Arthropod Phylogenetics in Light of Three Novel Millipede (Myriapoda: Diplopoda) Mitochondrial Genomes with Comments on the Appropriateness of Mitochondrial Genome Sequence Data for Inferring Deep Level Relationships

    PubMed Central

    Brewer, Michael S.; Swafford, Lynn; Spruill, Chad L.; Bond, Jason E.

    2013-01-01

    Background Arthropods are the most diverse group of eukaryotic organisms, but their phylogenetic relationships are poorly understood. Herein, we describe three mitochondrial genomes representing orders of millipedes for which complete genomes had not been characterized. Newly sequenced genomes are combined with existing data to characterize the protein coding regions of myriapods and to attempt to reconstruct the evolutionary relationships within the Myriapoda and Arthropoda. Results The newly sequenced genomes are similar to previously characterized millipede sequences in terms of synteny and length. Unique translocations occurred within the newly sequenced taxa, including one half of the Appalachioria falcifera genome, which is inverted with respect to other millipede genomes. Across myriapods, amino acid conservation levels are highly dependent on the gene region. Additionally, individual loci varied in the level of amino acid conservation. Overall, most gene regions showed low levels of conservation at many sites. Attempts to reconstruct the evolutionary relationships suffered from questionable relationships and low support values. Analyses of phylogenetic informativeness show the lack of signal deep in the trees (i.e., genes evolve too quickly). As a result, the myriapod tree resembles previously published results but lacks convincing support, and, within the arthropod tree, well established groups were recovered as polyphyletic. Conclusions The novel genome sequences described herein provide useful genomic information concerning millipede groups that had not been investigated. Taken together with existing sequences, the variety of compositions and evolution of myriapod mitochondrial genomes are shown to be more complex than previously thought. Unfortunately, the use of mitochondrial protein-coding regions in deep arthropod phylogenetics appears problematic, a result consistent with previously published studies. Lack of phylogenetic signal renders the

  2. Sex-dependent competitive dominance of phylogenetic group B2 Escherichia coli strains within human hosts.

    PubMed

    Blyton, Michaela D J; Cornall, Samantha J; Kennedy, Karina; Colligon, Peter; Gordon, David M

    2014-12-01

    Escherichia coli can be divided into several distinct phylogenetic groups that differ in their capacity to cause disease. However, what drives the relative abundance of these different phylogenetic groups in the commensal intestinal community of humans is poorly understood. This study investigated how host age and sex influences E. coli community structure in humans. Faecal samples were collected from 205 outpatients in Australia. Different strains within each sample were identified using rep-PCR profiles and their phylogenetic group membership was determined by quadruplex PCR. Female individuals carrying a dominant B2 strain were found to possess fewer strains than those carrying dominant A or B1 strains. Additionally, strains from the same phylogenetic group were more likely to co-occur in females. By contrast, strain diversity and phylogenetic group associations did not differ significantly from random in males. Host age was found to have a significant effect on the phylogenetic group of the dominant strain. Together these findings indicate that the distribution of the different phylogenetic groups within the human intestinal tract may be mediated by a complex interaction between the host environment and the competitive interactions between strains.

  3. Molecular detection, identification and phylogenetic inference of Leishmania spp. in some desert lizards from Northwest China by using internal transcribed spacer 1 (ITS1) sequences.

    PubMed

    Zhang, Jun-Rong; Guo, Xian-Guang; Liu, Jin-Long; Zhou, Tian-He; Gong, Xiong; Chen, Da-Li; Chen, Jian-Ping

    2016-10-01

    Leishmaniasis caused by Leishmania is still endemic in Northwest China. It has been thought that reptiles could be a reservoir for mammalian leishmaniasis. However, data are still scarce on natural infection of lizards with Leishmania spp. in China. The present study deals with detection, identification and phylogenetic inference of Leishmania parasites at species and intraspecies levels isolated from six desert lizard species from 10 geographical locations in Northwest China using amplification and sequencing of ITS-rDNA. In total, 83 haplotypes were found among 137 ITS1 sequences obtained from up to 64.6% of all captured lizards. Representative sequences of Leishmania available in GenBank were compiled for comparison with the obtained haplotypes. Tree-based species delimitation was achieved by using Bayesian phylogenitc analyses and maximum parsimony approach. Phylogenetic trees congruently supported that the haplotypes were found to belong to three Leishmania species including L. (sauroleishmania) sp., Leishmania tropica and Leishmania donovani complex. A network approach revealed paraphyletic populations of L. (sauroleishmania) sp. and L. tropica at intraspecies level regarding geographical origin and low host specificity. Chinese L. tropica from lizards showed significant heterogeneity as the obtained haplotypes were distributed in different clusters from other countries. Common ancestry was observed between some sequences of L. tropica from lizards and other sequence types from clinical samples from other countries. This may lend support to the potential reservoir role of lizards for human leishmaniasis. Our results appear to be the first molecular evidence for natural infection of lizards in Northwest China with reptilian Leishmania and mammalian Leishmania species. Desert lizards may be considered as putative reservoir hosts for Leishmania in China. Further studies on persistence of the Leishmania parasites in lizards and sandflies are recommended for the

  4. Molecular detection, identification and phylogenetic inference of Leishmania spp. in some desert lizards from Northwest China by using internal transcribed spacer 1 (ITS1) sequences.

    PubMed

    Zhang, Jun-Rong; Guo, Xian-Guang; Liu, Jin-Long; Zhou, Tian-He; Gong, Xiong; Chen, Da-Li; Chen, Jian-Ping

    2016-10-01

    Leishmaniasis caused by Leishmania is still endemic in Northwest China. It has been thought that reptiles could be a reservoir for mammalian leishmaniasis. However, data are still scarce on natural infection of lizards with Leishmania spp. in China. The present study deals with detection, identification and phylogenetic inference of Leishmania parasites at species and intraspecies levels isolated from six desert lizard species from 10 geographical locations in Northwest China using amplification and sequencing of ITS-rDNA. In total, 83 haplotypes were found among 137 ITS1 sequences obtained from up to 64.6% of all captured lizards. Representative sequences of Leishmania available in GenBank were compiled for comparison with the obtained haplotypes. Tree-based species delimitation was achieved by using Bayesian phylogenitc analyses and maximum parsimony approach. Phylogenetic trees congruently supported that the haplotypes were found to belong to three Leishmania species including L. (sauroleishmania) sp., Leishmania tropica and Leishmania donovani complex. A network approach revealed paraphyletic populations of L. (sauroleishmania) sp. and L. tropica at intraspecies level regarding geographical origin and low host specificity. Chinese L. tropica from lizards showed significant heterogeneity as the obtained haplotypes were distributed in different clusters from other countries. Common ancestry was observed between some sequences of L. tropica from lizards and other sequence types from clinical samples from other countries. This may lend support to the potential reservoir role of lizards for human leishmaniasis. Our results appear to be the first molecular evidence for natural infection of lizards in Northwest China with reptilian Leishmania and mammalian Leishmania species. Desert lizards may be considered as putative reservoir hosts for Leishmania in China. Further studies on persistence of the Leishmania parasites in lizards and sandflies are recommended for the

  5. Probabilistic Inference: Task Dependency and Individual Differences of Probability Weighting Revealed by Hierarchical Bayesian Modeling.

    PubMed

    Boos, Moritz; Seer, Caroline; Lange, Florian; Kopp, Bruno

    2016-01-01

    Cognitive determinants of probabilistic inference were examined using hierarchical Bayesian modeling techniques. A classic urn-ball paradigm served as experimental strategy, involving a factorial two (prior probabilities) by two (likelihoods) design. Five computational models of cognitive processes were compared with the observed behavior. Parameter-free Bayesian posterior probabilities and parameter-free base rate neglect provided inadequate models of probabilistic inference. The introduction of distorted subjective probabilities yielded more robust and generalizable results. A general class of (inverted) S-shaped probability weighting functions had been proposed; however, the possibility of large differences in probability distortions not only across experimental conditions, but also across individuals, seems critical for the model's success. It also seems advantageous to consider individual differences in parameters of probability weighting as being sampled from weakly informative prior distributions of individual parameter values. Thus, the results from hierarchical Bayesian modeling converge with previous results in revealing that probability weighting parameters show considerable task dependency and individual differences. Methodologically, this work exemplifies the usefulness of hierarchical Bayesian modeling techniques for cognitive psychology. Theoretically, human probabilistic inference might be best described as the application of individualized strategic policies for Bayesian belief revision.

  6. Probabilistic Inference: Task Dependency and Individual Differences of Probability Weighting Revealed by Hierarchical Bayesian Modeling

    PubMed Central

    Boos, Moritz; Seer, Caroline; Lange, Florian; Kopp, Bruno

    2016-01-01

    Cognitive determinants of probabilistic inference were examined using hierarchical Bayesian modeling techniques. A classic urn-ball paradigm served as experimental strategy, involving a factorial two (prior probabilities) by two (likelihoods) design. Five computational models of cognitive processes were compared with the observed behavior. Parameter-free Bayesian posterior probabilities and parameter-free base rate neglect provided inadequate models of probabilistic inference. The introduction of distorted subjective probabilities yielded more robust and generalizable results. A general class of (inverted) S-shaped probability weighting functions had been proposed; however, the possibility of large differences in probability distortions not only across experimental conditions, but also across individuals, seems critical for the model's success. It also seems advantageous to consider individual differences in parameters of probability weighting as being sampled from weakly informative prior distributions of individual parameter values. Thus, the results from hierarchical Bayesian modeling converge with previous results in revealing that probability weighting parameters show considerable task dependency and individual differences. Methodologically, this work exemplifies the usefulness of hierarchical Bayesian modeling techniques for cognitive psychology. Theoretically, human probabilistic inference might be best described as the application of individualized strategic policies for Bayesian belief revision. PMID:27303323

  7. The origin of land plants: phylogenetic relationships among charophytes, bryophytes, and vascular plants inferred from complete small-subunit ribosomal RNA gene sequences.

    PubMed

    Kranz, H D; Miks, D; Siegler, M L; Capesius, I; Sensen, C W; Huss, V A

    1995-07-01

    Complete nuclear-encoded small-subunit 18S rRNA (= SSU rRNA) gene sequences were determined for the prasinophyte green alga Mantoniella squamata; the charophycean green algae Chara foetida, Coleochaete scutata, Klebsormidium flaccidum, and Mougeotia scalaris; the bryophytes Marchantia polymorpha, Fossombronia pusilla, and Funaria hygrometrica; and the lycopod Selaginella galleottii to get a better insight into the sequential evolution from green algae to land plants. The sequences were aligned with several previously published SSU rRNA sequences from chlorophytic and charophytic algae as well as from land plants to infer the evolutionary relationships for major evolutionary lineages within the Chlorobionta by distance matrix, maximum parsimony, and maximum likelihood analyses. Phylogenetic trees created by the different methods consistently placed the Charophyceae on the branch leading to the land plants. The Charophyceae were shown to be polyphyletic with the Charales ("charalean" algae) diverging earlier than the Coleochaetales, Klebsormidiales, Chlorokybales, and Zygnematales ("charophycean" algae) which branch from a point closer to the land plants in most analyses. Maximum parsimony and maximum likelihood analyses imply a successive evolution from "charophycean" algae, particularly Coleochaetales, to bryophytes, lycopods, and seed plants. In contrast, distance matrix methods group the bryophytes together with the "charophycean" algae, suggesting a separate evolution of these organisms compared with the club moss and the seed plants.

  8. Inference of stress and texture from angular dependence of ultrasonic plate mode velocities

    NASA Technical Reports Server (NTRS)

    Thompson, R. B.; Smith, J. F.; Lee, S. S.

    1986-01-01

    The theory for the angular dependence of the ultrasonic wave velocity in a symmetry plane of an orthorhombic, stressed material is presented. The two waves having polarizations in this plane are shown to have velocities which can be estimated from measurements of the SH sub 0 and S sub 0 guided modes of a thin plate: the relationship being exact for the SH sub 0 mode and requiring a 10% correction for the S sub 0 mode at long wavelength. It is then shown how stress and texture can be independently inferred from various features of the angular dependence of these two velocities. From the SH sub 0 data, the ability to determine the directions and differences in magnitudes of principal stresses is described and supported by experimental data on several materials. From a combination of the SH sub 0 and S sub 0 data, a procedure is proposed for determining the coefficients W sub 400, W sub 420 and W sub 440 of an expansion of the crystallite orientation distribution function in terms of generalized Legendre functions. Possible applications in process control are indicated.

  9. Probabilistic Graphical Model Representation in Phylogenetics

    PubMed Central

    Höhna, Sebastian; Heath, Tracy A.; Boussau, Bastien; Landis, Michael J.; Ronquist, Fredrik; Huelsenbeck, John P.

    2014-01-01

    Recent years have seen a rapid expansion of the model space explored in statistical phylogenetics, emphasizing the need for new approaches to statistical model representation and software development. Clear communication and representation of the chosen model is crucial for: (i) reproducibility of an analysis, (ii) model development, and (iii) software design. Moreover, a unified, clear and understandable framework for model representation lowers the barrier for beginners and nonspecialists to grasp complex phylogenetic models, including their assumptions and parameter/variable dependencies. Graphical modeling is a unifying framework that has gained in popularity in the statistical literature in recent years. The core idea is to break complex models into conditionally independent distributions. The strength lies in the comprehensibility, flexibility, and adaptability of this formalism, and the large body of computational work based on it. Graphical models are well-suited to teach statistical models, to facilitate communication among phylogeneticists and in the development of generic software for simulation and statistical inference. Here, we provide an introduction to graphical models for phylogeneticists and extend the standard graphical model representation to the realm of phylogenetics. We introduce a new graphical model component, tree plates, to capture the changing structure of the subgraph corresponding to a phylogenetic tree. We describe a range of phylogenetic models using the graphical model framework and introduce modules to simplify the representation of standard components in large and complex models. Phylogenetic model graphs can be readily used in simulation, maximum likelihood inference, and Bayesian inference using, for example, Metropolis–Hastings or Gibbs sampling of the posterior distribution. [Computation; graphical models; inference; modularization; statistical phylogenetics; tree plate.] PMID:24951559

  10. Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1

    PubMed Central

    Figliuzzi, Matteo; Jacquier, Hervé; Schug, Alexander; Tenaillon, Oliver; Weigt, Martin

    2016-01-01

    The quantitative characterization of mutational landscapes is a task of outstanding importance in evolutionary and medical biology: It is, for example, of central importance for our understanding of the phenotypic effect of mutations related to disease and antibiotic drug resistance. Here we develop a novel inference scheme for mutational landscapes, which is based on the statistical analysis of large alignments of homologs of the protein of interest. Our method is able to capture epistatic couplings between residues, and therefore to assess the dependence of mutational effects on the sequence context where they appear. Compared with recent large-scale mutagenesis data of the beta-lactamase TEM-1, a protein providing resistance against beta-lactam antibiotics, our method leads to an increase of about 40% in explicative power as compared with approaches neglecting epistasis. We find that the informative sequence context extends to residues at native distances of about 20 Å from the mutated site, reaching thus far beyond residues in direct physical contact. PMID:26446903

  11. Phylogenetic relationships among Japanese species of the family Sciuridae (Mammalia, Rodentia), inferred from nucleotide sequences of mitochondrial 12S ribosomal RNA genes.

    PubMed

    Oshida, T; Masuda, R; Yoshida, M C

    1996-08-01

    In order to investigate phylogenetic relationships of the family Sciuridae living in Japan, we sequenced partial regions (379 bases) of mitochondrial 12S rRNA genes in six species of Japanese and other Asian squirrels. Phylogenetic trees constructed by sequence data indicated that two genera of flying squirrels (Petaurista and Pteromys) were clustered in a group distinct from non-flying squirrels, suggesting a possible monophyletic relationships of these flying squirrels. The evolutionary distance between the Japanese squirrel (Sciurus lis) from Honshu island and the Eurasian red squirrel (Sciurus vulgaris) from Hokkaido island was comparable to intraspecific distances of the remaining species examined. PMID:8940915

  12. Phylogenetic position of Indian termites (Isoptera: Termitidae) with their respective genera inferred from DNA sequence analysis of the mitochondrial cytochrome oxidase gene subunit I compared to subunit II.

    PubMed

    Sharma, Vijay Lakshmi; Singla, Mandakini; Sobti, Ranbir Chander

    2013-08-20

    The present work was aimed to investigate the phylogenetic analysis of different species of Indian termites belonging to the family termitidae based on mitochondrial genes COI and COII. The sequences so obtained from public database revealed grouping of termites according to their ecological distribution. The sequences of the species under investigation were characterized on the basis of frequencies of nucleotide bases and in most of the species, a significantly high percentage of A+T base composition was observed. Phylogenetic tree revealed positioning of species according to the analysis of their cytochrome oxidase subunits.

  13. Detection and phylogenetic analysis of coastal bioaerosols using culture dependent and independent techniques

    NASA Astrophysics Data System (ADS)

    Urbano, R.; Palenik, B.; Gaston, C. J.; Prather, K. A.

    2010-08-01

    Bioaerosols are emerging as important yet poorly understood players in atmospheric processes. Microorganisms in the atmosphere have great potential to impact chemical and physical processes that influence global climateby participating in both ice nucleation and cloud droplet formation. The role of microorganisms in atmospheric processes is thought to be species-specific and, potentially, dependent on the viability of the cell; however, few simultaneous measurements of both parameters exist. Using a coastal pier monitoring site as a sampling platform to investigate the exchange of airborne microorganisms at the air-sea interface, culture independent (i.e. DNA clone libraries from filters) and culture dependent approaches (i.e. agar plates) were combined with 18S rRNA and 16S rRNA gene targeting to determine the microbial diversity. The results indicate that in these coastal air samples two fungal phyla, Basidiomycota and Ascomycota, predominate among eukaryotes while Firmicutes and Proteobacteria predominate among bacteria. Furthermore, our culture dependent study verifies the viability of microbes from all four phyla detected through our culture independent study. Contrary to our expectations and despite oceanic air mass sources, common marine planktonic bacteria and phytoplankton were not abundantly found in our air samples indicating the potential importance of bioaerosols derived from beaches and/or coastal erosion processes.

  14. Detection and phylogenetic analysis of coastal bioaerosols using culture dependent and independent techniques

    NASA Astrophysics Data System (ADS)

    Urbano, R.; Palenik, B.; Gaston, C. J.; Prather, K. A.

    2011-02-01

    Bioaerosols are emerging as important yet poorly understood players in atmospheric processes. Microorganisms can impact atmospheric chemistry through metabolic reactions and can potentially influence physical processes by participating in ice nucleation and cloud droplet formation. Microbial roles in atmospheric processes are thought to be species-specific and potentially dependent on cell viability. Using a coastal pier monitoring site as a sampling platform, culture-dependent (i.e. agar plates) and culture-independent (i.e. DNA clone libraries from filters) approaches were combined with 18S rRNA and 16S rRNA gene targeting to obtain insight into the local atmospheric microbial composition. From 13 microbial isolates and 42 DNA library clones, a total of 55 sequences were obtained representing four independent sampling events. Sequence analysis revealed that in these coastal samples two fungal phyla, Ascomycota and Basidiomycota, predominate among eukaryotes while Firmicutes and Proteobacteria predominate among bacteria. Furthermore, our culture-dependent study verifies the viability of microbes from all four phyla detected through our culture-independent study. Contrary to our expectations and despite oceanic air mass sources, common marine planktonic bacteria and phytoplankton were not typically found. The abundance of terrestrial and marine sediment-associated microorganisms suggests a potential importance for bioaerosols derived from beaches and/or coastal erosion processes.

  15. Phylogenetic utility of nuclear nitrate reductase: a multi-locus comparison of nuclear and chloroplast sequence data for inference of relationships among American Lycieae (Solanaceae).

    PubMed

    Levin, Rachel A; Blanton, Jessica; Miller, Jill S

    2009-03-01

    There has been considerable interest and research into the benefits of multiple low-copy nuclear regions for phylogenetic studies at low taxonomic levels. In this study, the phylogenetic utility of DNA sequence data from two low-copy nuclear genes, nitrate reductase (NIA) and granule-bound starch synthase I (GBSSI), was compared with data from nuclear ribosomal ITS and the cpDNA spacers trnT-trnF and trnD-trnT across 33 closely related taxa in tribe Lycieae (Solanaceae). The NIA data (introns 1-2) had the most parsimony-informative characters, with over twice the number provided by GBSSI, but NIA also had greater homoplasy. Although gene trees were generally concordant across the four regions, there were some notable areas of incongruence, suggesting both incomplete lineage sorting as well as possible reticulate origins.

  16. Discordant 16S and 23S rRNA gene phylogenies for the genus Helicobacter: implications for phylogenetic inference and systematics.

    PubMed

    Dewhirst, Floyd E; Shen, Zeli; Scimeca, Michael S; Stokes, Lauren N; Boumenna, Tahani; Chen, Tsute; Paster, Bruce J; Fox, James G

    2005-09-01

    Analysis of 16S rRNA gene sequences has become the primary method for determining prokaryotic phylogeny. Phylogeny is currently the basis for prokaryotic systematics. Therefore, the validity of 16S rRNA gene-based phylogenetic analyses is of fundamental importance for prokaryotic systematics. Discrepancies between 16S rRNA gene analyses and DNA-DNA hybridization and phenotypic analyses have been noted in the genus Helicobacter. To clarify these discrepancies, we sequenced the 23S rRNA genes for 55 helicobacter strains representing 41 taxa (>2,700 bases per sequence). Phylogenetic-tree construction using neighbor-joining, parsimony, and maximum likelihood methods for 23S rRNA gene sequence data yielded stable trees which were consistent with other phenotypic and genotypic methods. The 16S rRNA gene sequence-derived trees were discordant with the 23S rRNA gene trees and other data. Discrepant 16S rRNA gene sequence data for the helicobacters are consistent with the horizontal transfer of 16S rRNA gene fragments and the creation of mosaic molecules with loss of phylogenetic information. These results suggest that taxonomic decisions must be supported by other phylogenetically informative macromolecules, such as the 23S rRNA gene, when 16S rRNA gene-derived phylogeny is discordant with other credible phenotypic and genotypic methods. This study found Wolinella succinogenes to branch with the unsheathed-flagellum cluster of helicobacters by 23S rRNA gene analyses and whole-genome comparisons. This study also found intervening sequences (IVSs) in the 23S rRNA genes of strains of 12 Helicobacter species. IVSs were found in helices 10, 25, and 45, as well as between helices 31' and 27'. Simultaneous insertion of IVSs at three sites was found in H. mesocricetorum. PMID:16109952

  17. Phylogenetic relationships of the enigmatic angiosperm family Podostemaceae inferred from 18S rDNA and rbcL sequence data.

    PubMed

    Soltis, D E; Mort, M E; Soltis, P S; Hibsch-Jetter, C; Zimmer, E A; Morgan, D

    1999-03-01

    The phylogenetic relationships of some angiosperm families have remained enigmatic despite broad phylogenetic analyses of rbcL sequences. One example is the aquatic family Podostemaceae, the relationships of which have long been controversial because of major morphological modifications associated with their aquatic habit. Podostemaceae have variously been associated with Piperaceae, Nepenthaceae, Polygonaceae, Caryophyllaceae, Scrophulariaceae, Rosaceae, Crassulaceae, and Saxifragaceae. Two recent analyses of rbcL sequences suggest a possible sister-group relationship of Podostemaceae to Crassulaceae (Saxifragales). However, the branch leading to Podostemaceae was long, and use of different outgroups resulted in alternative placements. We explored the phylogenetic relationships of Podostemaceae using 18S rDNA sequences and a combined rbcL + 18S rDNA matrix representing over 250 angiosperms. In analyses based on 18S rDNA data, Podostemaceae are not characterized by a long branch; the family consistently appears as part of a Malpighiales clade that also includes Malpighiaceae, Turneraceae, Passifloraceae, Salicaceae, Euphorbiaceae, Violaceae, Linaceae, Chrysobalanaceae, Trigoniaceae, Humiriaceae, and Ochnaceae. Phylogenetic analyses based on a combined 18S rDNA + rbcL data set (223 ingroup taxa) with basal angiosperms as the outgroup also suggest that Podostemaceae are part of a Malpighiales clade. These searches swapped to completion, and the shortest trees showed enhanced resolution and increased internal support compared to those based on 18S rDNA or rbcL alone. However, when Gnetales are used as the outgroup, Podostemaceae appear with members of the nitrogen fixing clade (e.g., Elaeagnaceae, Ulmaceae, Rhamnaceae, Cannabaceae, Moraceae, and Urticaceae). None of the relationships suggested here for Podostemaceae receives strong bootstrap support. Our analyses indicate that Podostemaceae are not closely allied with Crassulaceae or with other members of the

  18. Host defense peptides from Lithobates forreri, Hylarana luctuosa, and Hylarana signata (Ranidae): phylogenetic relationships inferred from primary structures of ranatuerin-2 and brevinin-2 peptides.

    PubMed

    Conlon, J Michael; Kolodziejek, Jolanta; Mechkarska, Milena; Coquet, Laurent; Leprince, Jérôme; Jouenne, Thierry; Vaudry, Hubert; Nielsen, Per F; Nowotny, Norbert; King, Jay D

    2014-03-01

    The primary structures of host-defense peptides present in frog skin secretions constitute useful molecular markers for establishing taxonomic classifications and investigating phylogenetic relationships between species within a particular genus. Peptidomic analysis has led to the characterization of multiple host-defense peptides in norepinephrine-stimulated skin secretions of three species of frogs from the family Ranidae: Lithobates forreri (Boulenger, 1883), Hylarana luctuosa (Peters, 1871), and Hylarana signata (Günther, 1872). The L. forreri secretions contain ranatuerin-2 (2 peptides), brevinin-1 (4 peptides), and temporin (1 peptide). The H. luctuosa secretions contain brevinin-2 (4 peptides), esculentin-1 (1 peptide), esculentin-2 (1 peptide), palustrin-2 (2 peptides), and temporin (2 peptides). The H. signata secretions contain brevinin-2 (4 peptides), brevinin-1 (5 peptides), palustrin-2 (1 peptide), and temporin (2 peptides). Cladistic analysis based upon the primary structures of 44 ranatuerin-2 peptides from 20 Lithobates species indicates a close phylogenetic relationship between L. forreri, Lithobates onca, and Lithobates yavapaiensis. A similar cladistic analysis based upon the primary structures of 27 brevinin-2 peptides from 8 Hylarana species provides support for a close phylogenetic relationship between H. signata and Hylarana picturata, while showing that the species are not conspecific, with H. luctuosa more distantly related. PMID:24463457

  19. Host defense peptides from Lithobates forreri, Hylarana luctuosa, and Hylarana signata (Ranidae): phylogenetic relationships inferred from primary structures of ranatuerin-2 and brevinin-2 peptides.

    PubMed

    Conlon, J Michael; Kolodziejek, Jolanta; Mechkarska, Milena; Coquet, Laurent; Leprince, Jérôme; Jouenne, Thierry; Vaudry, Hubert; Nielsen, Per F; Nowotny, Norbert; King, Jay D

    2014-03-01

    The primary structures of host-defense peptides present in frog skin secretions constitute useful molecular markers for establishing taxonomic classifications and investigating phylogenetic relationships between species within a particular genus. Peptidomic analysis has led to the characterization of multiple host-defense peptides in norepinephrine-stimulated skin secretions of three species of frogs from the family Ranidae: Lithobates forreri (Boulenger, 1883), Hylarana luctuosa (Peters, 1871), and Hylarana signata (Günther, 1872). The L. forreri secretions contain ranatuerin-2 (2 peptides), brevinin-1 (4 peptides), and temporin (1 peptide). The H. luctuosa secretions contain brevinin-2 (4 peptides), esculentin-1 (1 peptide), esculentin-2 (1 peptide), palustrin-2 (2 peptides), and temporin (2 peptides). The H. signata secretions contain brevinin-2 (4 peptides), brevinin-1 (5 peptides), palustrin-2 (1 peptide), and temporin (2 peptides). Cladistic analysis based upon the primary structures of 44 ranatuerin-2 peptides from 20 Lithobates species indicates a close phylogenetic relationship between L. forreri, Lithobates onca, and Lithobates yavapaiensis. A similar cladistic analysis based upon the primary structures of 27 brevinin-2 peptides from 8 Hylarana species provides support for a close phylogenetic relationship between H. signata and Hylarana picturata, while showing that the species are not conspecific, with H. luctuosa more distantly related.

  20. Phylogenetic relationships of the soybean sudden death syndrome pathogen Fusarium solani f. sp. phaseoli inferred from rDNA sequence data and PCR primers for its identification.

    PubMed

    O'Donnell, K; Gray, L E

    1995-01-01

    Phylogenetic relationships of several species within the Fusarium solani-complex were investigated using characters from the nuclear ribosomal DNA. Genetic variation within 24 isolates, including 5 soybean sudden death syndrome (SDS) strains, was assessed using rDNA sequence data and restriction fragment length polymorphic markers. By these techniques, the causal agent of soybean SDS was identified as F. solani f. sp. phaseoli. In separate cladistic analyses, Plectosphaerella cucumerina and Nectria cinnabarina or F. ventricosum were used for rooting purposes. Monophyly of the F. solani-complex was strongly supported by bootstrap and decay analyses. Parsimony analysis indicates that this complex is composed of a number of phylogenetically distinct species, including Neocosmospora vasinfecta, F. solani f. sp. phaseoli, and biological species designated as MPI, MPV, and MPVI of N. haematococca. The results demonstrate complete congruence between biological and phylogenetic species within the N. haematococca-complex. In addition, DNA sequence data were used to design a PCR primer pair which could specifically amplify DNA from isolates of the SDS pathogen from infected plants. PMID:7579615

  1. The feasibility of using time-dependent photochemical calculations to infer radical species concentrations from solar occultation absorption measurements

    NASA Technical Reports Server (NTRS)

    Larsen, J. C.; Boughner, R. E.

    1981-01-01

    In connection with fast chemical reactions, short-lived stratospheric species experience rapid concentration variations at sunset and sunrise. For solar occultation absorption measurements, these rapid concentration variations may introduce significant errors with respect to the inference of atmospheric abundances for some species due to asymmetrical concentration distributions. Most retrieval algorithms assume that concentration distributions are spherically symmetric. The effect of this assumption on the accuracy of inferred concentrations has been studied by Kerr et al. (1977). The present investigation considers the feasibility of using a time-dependent one-dimensional photochemical model to provide detailed information about the asymmetrical distribution for use in the retrieval procedure. As shown by Boughner et al. (1980), diurnal effects can be represented by an inhomogeneity factor. It is found that the NO retrieval improves considerably with the inclusion of a correction factor containing the asymmetrical variations.

  2. One- and two-sample nonparametric inference procedures in the presence of a mixture of independent and dependent censoring.

    PubMed

    Park, Yuhyun; Tian, Lu; Wei, L J

    2006-04-01

    In survival analysis, the event time T is often subject to dependent censorship. Without assuming a parametric model between the failure and censoring times, the parameter Theta of interest, for example, the survival function of T, is generally not identifiable. On the other hand, the collection Omega of all attainable values for Theta may be well defined. In this article, we present nonparametric inference procedures for Omega in the presence of a mixture of dependent and independent censoring variables. By varying the criteria of classifying censoring to the dependent or independent category, our proposals can be quite useful for the so-called sensitivity analysis of censored failure times. The case that the failure time is subject to possibly dependent interval censorship is also discussed in this article. The new proposals are illustrated with data from two clinical studies on HIV-related diseases. PMID:16282531

  3. A multi-locus analysis of phylogenetic relationships within grass subfamily Pooideae (Poaceae) inferred from sequences of nuclear single copy gene regions compared with plastid DNA.

    PubMed

    Hochbach, Anne; Schneider, Julia; Röser, Martin

    2015-06-01

    To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 shows the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa.

  4. Phylogenetic position of phylum Nemertini, inferred from 18S rRNA sequences: molecular data as a test of morphological character homology.

    PubMed

    Turbeville, J M; Field, K G; Raff, R A

    1992-03-01

    Partial 18S rRNA sequence of the nemertine Cerebratulus lacteus was obtained and compared with those of coelomate metazoans and acoelomate platyhelminths to test whether nemertines share a most recent common ancestor with the platyhelminths, as traditionally has been implied, or whether nemertines lie within a protostome coelomate clade, as suggested by more recent morphological analyses. Maximum-parsimony analysis supports the inclusion of the nemertine within a protostome-coelomate clade that falls within a more inclusive coelomate clade. Bootstrap analysis indicates strong support for a monophyletic Coelomata composed of a deuterostome and protostome-coelomate clade. Support for a monophyletic protostome Coelomata is weak. Inference by distance analysis is consistent with that of maximum parsimony. Analysis of down-weighted paired sites by maximum parsimony reveals variation in topology only within the protostome-coelomate clade. The relationships among the protostome coelomates cannot be reliably inferred from the partial sequences, suggesting that coelomate protostomes diversified rapidly. Results with evolutionary parsimony are consistent with the inclusion of the nemertine in a coelomate clade. The molecular inference corroborates recent morphological character analyses that reveal no synapomorphies of nemertines and flatworms but instead suggest that the circulatory system and rhynchocoel of nemertines are homologous to coelomic cavities of protostome coelomates, thus supporting the corresponding hypothesis that nemertines belong within a protostome-coelomate clade. The sequence data provide an independent test of morphological character homology.

  5. From a comb to a tree: phylogenetic relationships of the comb-footed spiders (Araneae, Theridiidae) inferred from nuclear and mitochondrial genes.

    PubMed

    Arnedo, Miquel A; Coddington, Jonathan; Agnarsson, Ingi; Gillespie, Rosemary G

    2004-04-01

    The family Theridiidae is one of the most diverse assemblages of spiders, from both a morphological and ecological point of view. The family includes some of the very few cases of sociality reported in spiders, in addition to bizarre foraging behaviors such as kleptoparasitism and araneophagy, and highly diverse web architecture. Theridiids are one of the seven largest families in the Araneae, with about 2200 species described. However, this species diversity is currently grouped in half the number of genera described for other spider families of similar species richness. Recent cladistic analyses of morphological data have provided an undeniable advance in identifying the closest relatives of the theridiids as well as establishing the family's monophyly. Nevertheless, the comb-footed spiders remain an assemblage of poorly defined genera, among which hypothesized relationships have yet to be examined thoroughly. Providing a robust cladistic structure for the Theridiidae is an essential step towards the clarification of the taxonomy of the group and the interpretation of the evolution of the diverse traits found in the family. Here we present results of a molecular phylogenetic analysis of a broad taxonomic sample of the family (40 taxa in 33 of the 79 currently recognized genera) and representatives of nine additional araneoid families, using approximately 2.5kb corresponding to fragments of three nuclear genes (Histone 3, 18SrDNA, and 28SrDNA) and two mitochondrial genes (16SrDNA and CoI). Several methods for incorporating indel information into the phylogenetic analysis are explored, and partition support for the different clades and sensitivity of the results to different assumptions of the analysis are examined as well. Our results marginally support theridiid monophyly, although the phylogenetic structure of the outgroup is unstable and largely contradicts current phylogenetic hypotheses based on morphological data. Several groups of theridiids receive strong

  6. Phylogenetics and systematics of Angiostrongylus lungworms and related taxa (Nematoda: Metastrongyloidea) inferred from the nuclear small subunit (SSU) ribosomal DNA sequences.

    PubMed

    Eamsobhana, P; Lim, P E; Yong, H S

    2015-05-01

    The Angiostrongylus lungworms are of public health and veterinary concern in many countries. At the family level, the Angiostrongylus lungworms have been included in the family Angiostrongylidae or the family Metastrongylidae. The present study was undertaken to determine the usefulness and suitability of the nuclear 18S (small subunit, SSU) rDNA sequences for differentiating various taxa of the genus Angiostrongylus, as well as to determine the systematics and phylogenetic relationship of Angiostrongylus species and other metastrongyloid taxa. This study revealed six 18S (SSU) haplotypes in A. cantonensis, indicating considerable genetic diversity. The uncorrected pairwise 'p' distances among A. cantonensis ranged from 0 to 0.86%. The 18S (SSU) rDNA sequences unequivocally distinguished the five Angiostrongylus species, confirmed the close relationship of A. cantonensis and A. malaysiensis and that of A. costaricensis and A. dujardini, and were consistent with the family status of Angiostrongylidae and Metastrongylidae. In all cases, the congeneric metastrongyloid species clustered together. There was no supporting evidence to include the genus Skrjabingylus as a member of Metastrongylidae. The genera Aelurostrongylus and Didelphostrongylus were not recovered with Angiostrongylus, indicating polyphyly of the Angiostrongylidae. Of the currently recognized families of Metastrongyloidea, only Crenosomatidae appeared to be monophyletic. In view of the unsettled questions regarding the phylogenetic relationships of various taxa of the metastrongyloid worms, further analyses using more markers and more taxa are warranted.

  7. Dependence of SuperDARN-inferred cross polar cap potential upon the solar wind electric field and magnetopause subsolar distance

    NASA Astrophysics Data System (ADS)

    Koustov, Alexandre; Khachikjan, Galina; Sofko, George

    Analysis of statistically significant data set on the cross polar cap potential (CPCP) derived from Super Dual Auroral Radar Network (SuperDARN) measurements for the geoeffective solar wind electric field Ey of up to 12 mV/m is performed. It is shown that if all CPCP data are plotted against Ey, there is a linear increase between 0 and 5 mV/m and saturation at larger values. If the same data are arranged in bins of RMS values, the CPCP depends on Ey linearly in each bin with the slope and intercept of the regression line related to RMS. The inferred linear dependencies CPCP (Ey) for various fixed bins of RMS intersect each other at Ey 2.5-3.0 mV/m, and the character of the CPCP dependence upon the solar wind rum pressure changes at this value. For this reason, the value of 3 mV/m is interpreted as the true threshold of CPCP saturation. This value also corresponds to the condition of the solar wind magnetic field at the subsolar point to be equal to the Earth's dipole field as shown in published MHD simulations. It is argued that the initially inferred saturation threshold of 5 mV/m (for the case of the CPCP data being not sorted according to RMS) is an apparent effect. Presented analysis favors the hypothesis that the CPCP saturates due to shortening of the merging x-line

  8. Phylogenetic affinities of Diplonema within the Euglenozoa as inferred from the SSU rRNA gene and partial COI protein sequences.

    PubMed

    Maslov, D A; Yasuhira, S; Simpson, L

    1999-03-01

    In order to shed light on the phylogenetic position of diplonemids within the phylum Euglenozoa, we have sequenced small subunit rRNA (SSU rRNA) genes from Diplonema (syn. Isonema) papillatum and Diplonema sp. We have also analyzed a partial sequence of the mitochondrial gene for cytochrome c oxidase subunit I from D. papillatum. With both markers, the maximum likelihood method favored a closer grouping of diplonemids with kinetoplastids, while the parsimony and distance suggested a closer relationship of diplonemids with euglenoids. In each case, the differences between the best tree and the alternative trees were small. The frequency of codon usage in the partial D. papillatum COI was different from both related groups; however, as is the case in kinetoplastids but not in Euglena, both the non-canonical UGA codon and the canonical UGG codon were used to encode tryptophan in Diplonema. PMID:10724517

  9. A smaller Macadamia from a more vagile tribe: inference of phylogenetic relationships, divergence times, and diaspore evolution in Macadamia and relatives (tribe Macadamieae; Proteaceae).

    PubMed

    Mast, Austin R; Willis, Crystal L; Jones, Eric H; Downs, Katherine M; Weston, Peter H

    2008-07-01

    Tribe Macadamieae (91 spp., 16 genera; Proteaceae) is widespread across the southern hemisphere on all major fragments of Gondwana except New Zealand and India. Macadamia is cultivated outside its natural range as a "nut" crop (notably in Hawaii, where it is the principal orchard crop). We sampled seven DNA regions and 53 morphological characters from the tribe to infer its phylogeny and address the common assumption that the distribution of the extant diversity of the tribe arose by the rafting of ancestors on Gondwanan fragments. Macadamia proves to be paraphyletic with respect to the African genus Brabejum, the South American genus Panopsis, and the Australian species Orites megacarpus. We erect two new generic names, Nothorites and Lasjia, to produce monophyly at that rank. The earliest disjunctions in the tribe are inferred to be the result of long-distance dispersal out of Australia (with one possible exception), rather than vicariance. Evolution of tardy fruit dehiscence is correlated with these dispersals, and the onset of the Antarctic Circumpolar Current (ACC) precedes them. We suggest that the ancestors of extant diversity arrived on their respective continents via the ACC, and we recognize that this is a mechanism precluded, rather than facilitated, by Gondwana's terrestrial continuity.

  10. A smaller Macadamia from a more vagile tribe: inference of phylogenetic relationships, divergence times, and diaspore evolution in Macadamia and relatives (tribe Macadamieae; Proteaceae).

    PubMed

    Mast, Austin R; Willis, Crystal L; Jones, Eric H; Downs, Katherine M; Weston, Peter H

    2008-07-01

    Tribe Macadamieae (91 spp., 16 genera; Proteaceae) is widespread across the southern hemisphere on all major fragments of Gondwana except New Zealand and India. Macadamia is cultivated outside its natural range as a "nut" crop (notably in Hawaii, where it is the principal orchard crop). We sampled seven DNA regions and 53 morphological characters from the tribe to infer its phylogeny and address the common assumption that the distribution of the extant diversity of the tribe arose by the rafting of ancestors on Gondwanan fragments. Macadamia proves to be paraphyletic with respect to the African genus Brabejum, the South American genus Panopsis, and the Australian species Orites megacarpus. We erect two new generic names, Nothorites and Lasjia, to produce monophyly at that rank. The earliest disjunctions in the tribe are inferred to be the result of long-distance dispersal out of Australia (with one possible exception), rather than vicariance. Evolution of tardy fruit dehiscence is correlated with these dispersals, and the onset of the Antarctic Circumpolar Current (ACC) precedes them. We suggest that the ancestors of extant diversity arrived on their respective continents via the ACC, and we recognize that this is a mechanism precluded, rather than facilitated, by Gondwana's terrestrial continuity. PMID:21632410

  11. Intricate patterns of phylogenetic relationships in the olive family as inferred from multi-locus plastid and nuclear DNA sequence analyses: a close-up on Chionanthus and Noronhia (Oleaceae).

    PubMed

    Hong-Wa, Cynthia; Besnard, Guillaume

    2013-05-01

    Noronhia represents the most successful radiation of the olive family (Oleaceae) in Madagascar with more than 40 named endemic species distributed in all ecoregions from sea level to high mountains. Its position within the subtribe Oleinae has, however, been largely unresolved and its evolutionary history has remained unexplored. In this study, we generated a dataset of plastid (trnL-F, trnT-L, trnS-G, trnK-matK) and nuclear (internal transcribed spacer [ITS]) DNA sequences to infer phylogenetic relationships within Oleinae and to examine evolutionary patterns within Noronhia. Our sample included most species of Noronhia and representatives of the ten other extant genera within the subtribe with an emphasis on Chionanthus. Bayesian inferences and maximum likelihood analyses of plastid and nuclear data indicated several instances of paraphyly and polyphyly within Oleinae, with some geographic signal. Both plastid and ITS data showed a polyphyletic Noronhia that included Indian Ocean species of Chionanthus. They also found close relationships between Noronhia and African Chionanthus. However, the plastid data showed little clear differentiation between Noronhia and the African Chionanthus whereas relationships suggested by the nuclear ITS data were more consistent with taxonomy and geography. We used molecular dating to discriminate between hybridization and lineage sorting/gene duplication as alternative explanations for these topological discordances and to infer the biogeographic history of Noronhia. Hybridization between African Chionanthus and Noronhia could not be ruled out. However, Noronhia has long been established in Madagascar after a likely Cenozoic dispersal from Africa, suggesting any hybridization between representatives of African and Malagasy taxa was ancient. In any case, the African and Indian Ocean Chionanthus and Noronhia together formed a strongly supported monophyletic clade distinct and distant from other Chionanthus, which calls for a revised

  12. Analysis of kinetoplast cytochrome b gene of 16 Leishmania isolates from different foci of China: different species of Leishmania in China and their phylogenetic inference

    PubMed Central

    2013-01-01

    Background Leishmania species belong to the family Trypanosomatidae and cause leishmaniasis, a geographically widespread disease that infects humans and other vertebrates. This disease remains endemic in China. Due to the large geographic area and complex ecological environment, the taxonomic position and phylogenetic relationship of Chinese Leishmania isolates remain uncertain. A recent internal transcribed spacer 1 and cytochrome oxidase II phylogeny of Chinese Leishmania isolates has challenged some aspects of their traditional taxonomy as well as cladistics hypotheses of their phylogeny. The current study was designed to provide further disease background and sequence analysis. Methods We systematically analyzed 50 cytochrome b (cyt b) gene sequences of 19 isolates (16 from China, 3 from other countries) sequenced after polymerase chain reaction (PCR) using a special primer for cyt b as well as 31 sequences downloaded from GenBank. After alignment, the data were analyzed using the maximum parsimony, Bayesian and netwok methods. Results Sequences of six haplotypes representing 10 Chinese isolates formed a monophyletic group and clustered with Leishmania tarentolae. The isolates GS1, GS7, XJ771 of this study from China clustered with other isolates of Leishmania donovani complex. The isolate JS1 was a sister to Leishmania tropica, which represented an L. tropica complex instead of clustering with L. donovani complex or with the other 10 Chinese isolates. The isolates KXG-2 and GS-GER20 formed a monophyletic group with Leishmania turanica from central Asia. In the different phylogenetic trees, all of the Chinese isolates occurred in at least four groups regardless of geographic distribution. Conclusions The undescribed Leishmania species of China, which are clearly causative agents of canine leishmaniasis and human visceral leishmaniasis and are related to Sauroleishmania, may have evolved from a common ancestral parasite that came from the Americas and may have

  13. Latitudinal Density Dependence of Magnetic Field Lines Inferred from Polar Plasma Wave Data

    NASA Technical Reports Server (NTRS)

    Goldstein, J.; Denton, R. E.; Hudson, M. K.; Miftakhova, E. G.; Menietti, J. D.; Gallagher, D. L.

    2000-01-01

    Using observations of the electron density, n(sub e), based on measurement of the upper hybrid resonance frequency by the Polar spacecraft Plasma Wave Instrument, we have examined the radial density dependence along field lines in the outer plasmasphere and the near plasmatrough. Sampled L values range from 2.5 to 6.6. Our technique depends on the fact that Polar crosses particular L values at two different points with different radial distance R. In our plasmaspheric data set (n(sub e) > 100/cm3), we find that on average n(sub e) is flat along field lines from the equator up to the latitudes sampled by Polar (R approximately equal to or > 2.0). In the plasmatrough data set (n(sub e) < 100/cm-3), there is on average a mild radial dependence n(sub e) varies as R(exp -1.7).

  14. Temperature dependent spin transport properties of platinum inferred from spin Hall magnetoresistance measurements

    SciTech Connect

    Meyer, Sibylle Althammer, Matthias; Geprägs, Stephan; Opel, Matthias; Goennenwein, Sebastian T. B.; Gross, Rudolf

    2014-06-16

    We study the temperature dependence of the spin Hall magnetoresistance (SMR) in yttrium iron garnet/platinum hybrid structures via magnetization orientation dependent magnetoresistance measurements. Our experiments show a decrease of the SMR magnitude with decreasing temperature. Using the sensitivity of the SMR to the spin transport properties of the normal metal, we interpret our data in terms of a decrease of the spin Hall angle in platinum from 0.11 at room temperature to 0.075 at 10 K, while the spin diffusion length and the spin mixing conductance of the ferrimagnetic insulator/normal metal interface remain almost constant.

  15. Stress dependence of fracturing-stimulated permeability inferred from microseismic signatures. (Invited)

    NASA Astrophysics Data System (ADS)

    Shapiro, S. A.; Hummel, N.

    2013-12-01

    For developments of hydrocarbon or geothermal reservoirs knowledge of the hydraulic transport is of importance. A substantial contribution in understanding of the permeability evolution as an anisotropic function of stress can be obtained from the evolution of stimulation-induced seismicity in space and time. We focus on analyzing spatio-temporal characteristics of seismicity recorded during different hydraulic fracturing treatments in tight hydrocarbon reservoirs. Pore pressure diffusion based on a power-law pressure dependence of permeability frequently explains well corresponding spatio-temporal features of induced seismicity. We show that in hydraulically anisotropic shale a model of the factorized anisotropic pressure-dependent permeability is in agreement with the microseismic data. Hydraulic stimulation of rocks is accompanied by microseismic activity not only during but also after termination of the fluid injection. The evolution of the growing from the injection borehole seismic quiescence zone (which is enveloped by the so-called back-front of the induced seismicity) yields additional information on the permeability dependence on stress. We observe that under some conditions a power-law pressure dependence of the stimulated permeability can well explain this evolution. However, frequently an exponential pressure dependence of the permeability seems to be more adequate. To explain these different observations we compare the behavior of permeability and elastic properties of rocks as functions of the tectonic stress and pore pressure. We propose that different parts of the pore and fracture space differently contribute to a control of these two types of physical properties of rocks. The stress dependence of elastic moduli of drained rocks is mainly controlled by deformations of the compliant part of the pore (including fractures) space. This leads to saturating exponential functions expressing load dependencies of seismic velocities and of related seismic

  16. The phylogenetic relationships of insectivores with special reference to the lesser hedgehog tenrec as inferred from the complete sequence of their mitochondrial genome.

    PubMed

    Nikaido, Masato; Cao, Ying; Okada, Norihiro; Hasegawa, Masami

    2003-02-01

    The complete mitochondrial genome of a lesser hedgehog tenrec Echinops telfairi was determined in this study. It is an endemic African insectivore that is found specifically in Madagascar. The tenrec's back is covered with hedgehog-like spines. Unlike other spiny mammals, such as spiny mice, spiny rats, spiny dormice and porcupines, lesser hedgehog tenrecs look amazingly like true hedgehogs (Erinaceidae). However, they are distinguished morphologically from hedgehogs by the absence of a jugal bone. We determined the complete sequence of the mitochondrial genome of a lesser hedgehog tenrec and analyzed the results phylogenetically to determine the relationships between the tenrec and other insectivores (moles, shrews and hedgehogs), as well as the relationships between the tenrec and endemic African mammals, classified as Afrotheria, that have recently been shown by molecular analysis to be close relatives of the tenrec. Our data confirmed the afrotherian status of the tenrec, and no direct relation was recovered between the tenrec and the hedgehog. Comparing our data with those of others, we found that within-species variations in the mitochondrial DNA of lesser hedgehog tenrecs appear to be the largest recognized to date among mammals, apart from orangutans, which might be interesting from the view point of evolutionary history of tenrecs on Madagascar.

  17. Phylogenetic relationships among domesticated and wild species of Cucurbita (Cucurbitaceae) inferred from a mitochondrial gene: Implications for crop plant evolution and areas of origin.

    PubMed

    Sanjur, Oris I; Piperno, Dolores R; Andres, Thomas C; Wessel-Beaver, Linda

    2002-01-01

    We have investigated the phylogenetic relationships among six wild and six domesticated taxa of Cucurbita using as a marker an intron region from the mitochondrial nad1 gene. Our study represents one of the first successful uses of a mtDNA gene in resolving inter- and intraspecific taxonomic relationships in Angiosperms and yields several important insights into the origins of domesticated Cucurbita. First, our data suggest at least six independent domestication events from distinct wild ancestors. Second, Cucurbita argyrosperma likely was domesticated from a wild Mexican gourd, Cucurbita sororia, probably in the same region of southwest Mexico that gave rise to maize. Third, the wild ancestor of Cucurbita moschata is still unknown, but mtDNA data combined with other sources of information suggest that it will probably be found in lowland northern South America. Fourth, Cucurbita andreana is supported as the wild progenitor of Cucurbita maxima, but humid lowland regions of Bolivia in addition to warmer temperate zones in South America from where C. andreana was originally described should possibly be considered as an area of origin for C. maxima. Fifth, our data support other molecular results that indicate two separate domestications in the Cucurbita pepo complex. The potential zone of domestication for one of the domesticated subspecies, C. pepo subsp. ovifera, includes eastern North America and should be extended to northeastern Mexico. The wild ancestor of the other domesticated subspecies, C. pepo subsp. pepo, is undiscovered but is closely related to C. pepo subsp. fraterna and possibly will be found in southern Mexico.

  18. A new myco-heterotrophic genus, Yunorchis, and the molecular phylogenetic relationships of the tribe Calypsoeae (Epidendroideae, Orchidaceae) inferred from plastid and nuclear DNA sequences.

    PubMed

    Zhang, Guo-Qiang; Li, Ming-He; Su, Yong-Yu; Chen, Li-Jun; Lan, Si-Ren; Liu, Zhong-Jian

    2015-01-01

    We identified a new holomycotrophic orchid that is related to the myco-heterotrophic Calypsoeae. Because chloroplast genes are primarily lacking or are highly divergent, key morphological characters are either reduced or lost from many myco-heterotrophs, and the phylogenetic relationships of weakly supported paraphyletic Calypsoeae within Epidendroideae have been poorly understood in previous molecular systematic studies. Using chloroplast rbcL, psaB, and matK and nuclear Xdh and ITS sequences, we determined the circumscription and systematic positions of the new orchid and the tribe. The results indicate that the epidendroid taxa include most of the clades that are successively sister to the grade of clades representing previously recognized tribes. Calypsoeae comprising four well-supported clades with 12 genera (except for the previous temporarily placed Wullschlaegelia) is supported as a monophyletic and sister clade to Epidendreae (excluding Coeliinae). The new orchid is nested in Calypsoeae and is a sister to Dactylostalix and/or Calypso. This new holomycotrophic orchid presents a subumbel inflorescence that grows underground, and flower with a long pedicel reputing the ground to open and two fragments at the base of the hook, which are obviously morphologically different from those of Calypsoeae. To accommodate this species in the current generic circumscription, a new genus Yunorchis was created. PMID:25902264

  19. Natural history of human respiratory syncytial virus inferred from phylogenetic analysis of the attachment (G) glycoprotein with a 60-nucleotide duplication.

    PubMed

    Trento, Alfonsina; Viegas, Mariana; Galiano, Mónica; Videla, Cristina; Carballal, Guadalupe; Mistchenko, Alicia S; Melero, José A

    2006-01-01

    A total of 47 clinical samples were identified during an active surveillance program of respiratory infections in Buenos Aires (BA) (1999 to 2004) that contained sequences of human respiratory syncytial virus (HRSV) with a 60-nucleotide duplication in the attachment (G) protein gene. This duplication was analogous to that previously described for other three viruses also isolated in Buenos Aires in 1999 (A. Trento et al., J. Gen. Virol. 84:3115-3120, 2003). Phylogenetic analysis indicated that BA sequences with that duplication shared a common ancestor (dated about 1998) with other HRSV G sequences reported worldwide after 1999. The duplicated nucleotide sequence was an exact copy of the preceding 60 nucleotides in early viruses, but both copies of the duplicated segment accumulated nucleotide substitutions in more recent viruses at a rate apparently higher than in other regions of the G protein gene. The evolution of the viruses with the duplicated G segment apparently followed the overall evolutionary pattern previously described for HRSV, and this genotype has replaced other prevailing antigenic group B genotypes in Buenos Aires and other places. Thus, the duplicated segment represents a natural tag that can be used to track the dissemination and evolution of HRSV in an unprecedented setting. We have taken advantage of this situation to reexamine the molecular epidemiology of HRSV and to explore the natural history of this important human pathogen.

  20. Bayesian Inference for Growth Mixture Models with Latent Class Dependent Missing Data

    ERIC Educational Resources Information Center

    Lu, Zhenqiu Laura; Zhang, Zhiyong; Lubke, Gitta

    2011-01-01

    "Growth mixture models" (GMMs) with nonignorable missing data have drawn increasing attention in research communities but have not been fully studied. The goal of this article is to propose and to evaluate a Bayesian method to estimate the GMMs with latent class dependent missing data. An extended GMM is first presented in which class…

  1. Bayesian inference for an illness-death model for stroke with cognition as a latent time-dependent risk factor.

    PubMed

    van den Hout, Ardo; Fox, Jean-Paul; Klein Entink, Rinke H

    2015-12-01

    Longitudinal data can be used to estimate the transition intensities between healthy and unhealthy states prior to death. An illness-death model for history of stroke is presented, where time-dependent transition intensities are regressed on a latent variable representing cognitive function. The change of this function over time is described by a linear growth model with random effects. Occasion-specific cognitive function is measured by an item response model for longitudinal scores on the Mini-Mental State Examination, a questionnaire used to screen for cognitive impairment. The illness-death model will be used to identify and to explore the relationship between occasion-specific cognitive function and stroke. Combining a multi-state model with the latent growth model defines a joint model which extends current statistical inference regarding disease progression and cognitive function. Markov chain Monte Carlo methods are used for Bayesian inference. Data stem from the Medical Research Council Cognitive Function and Ageing Study in the UK (1991-2005). PMID:22080595

  2. Bayesian inference for an illness-death model for stroke with cognition as a latent time-dependent risk factor

    PubMed Central

    Fox, Jean-Paul; Klein Entink, Rinke H

    2015-01-01

    Longitudinal data can be used to estimate the transition intensities between healthy and unhealthy states prior to death. An illness-death model for history of stroke is presented, where time-dependent transition intensities are regressed on a latent variable representing cognitive function. The change of this function over time is described by a linear growth model with random effects. Occasion-specific cognitive function is measured by an item response model for longitudinal scores on the Mini-Mental State Examination, a questionnaire used to screen for cognitive impairment. The illness-death model will be used to identify and to explore the relationship between occasion-specific cognitive function and stroke. Combining a multi-state model with the latent growth model defines a joint model which extends current statistical inference regarding disease progression and cognitive function. Markov chain Monte Carlo methods are used for Bayesian inference. Data stem from the Medical Research Council Cognitive Function and Ageing Study in the UK (1991–2005). PMID:22080595

  3. EMPIRICAL LIKELIHOOD INFERENCE FOR THE COX MODEL WITH TIME-DEPENDENT COEFFICIENTS VIA LOCAL PARTIAL LIKELIHOOD

    PubMed Central

    Sun, Yanqing; Sundaram, Rajeshwari; Zhao, Yichuan

    2009-01-01

    The Cox model with time-dependent coefficients has been studied by a number of authors recently. In this paper, we develop empirical likelihood (EL) pointwise confidence regions for the time-dependent regression coefficients via local partial likelihood smoothing. The EL simultaneous confidence bands for a linear combination of the coefficients are also derived based on the strong approximation methods. The empirical likelihood ratio is formulated through the local partial log-likelihood for the regression coefficient functions. Our numerical studies indicate that the EL pointwise/simultaneous confidence regions/bands have satisfactory finite sample performances. Compared with the confidence regions derived directly based on the asymptotic normal distribution of the local constant estimator, the EL confidence regions are overall tighter and can better capture the curvature of the underlying regression coefficient functions. Two data sets, the gastric cancer data and the Mayo Clinic primary biliary cirrhosis data, are analyzed using the proposed method. PMID:19838322

  4. Bayesian Inference on the Effect of Density Dependence and Weather on a Guanaco Population from Chile

    PubMed Central

    Zubillaga, María; Skewes, Oscar; Soto, Nicolás; Rabinovich, Jorge E.; Colchero, Fernando

    2014-01-01

    Understanding the mechanisms that drive population dynamics is fundamental for management of wild populations. The guanaco (Lama guanicoe) is one of two wild camelid species in South America. We evaluated the effects of density dependence and weather variables on population regulation based on a time series of 36 years of population sampling of guanacos in Tierra del Fuego, Chile. The population density varied between 2.7 and 30.7 guanaco/km2, with an apparent monotonic growth during the first 25 years; however, in the last 10 years the population has shown large fluctuations, suggesting that it might have reached its carrying capacity. We used a Bayesian state-space framework and model selection to determine the effect of density and environmental variables on guanaco population dynamics. Our results show that the population is under density dependent regulation and that it is currently fluctuating around an average carrying capacity of 45,000 guanacos. We also found a significant positive effect of previous winter temperature while sheep density has a strong negative effect on the guanaco population growth. We conclude that there are significant density dependent processes and that climate as well as competition with domestic species have important effects determining the population size of guanacos, with important implications for management and conservation. PMID:25514510

  5. Bayesian inference on the effect of density dependence and weather on a guanaco population from Chile.

    PubMed

    Zubillaga, María; Skewes, Oscar; Soto, Nicolás; Rabinovich, Jorge E; Colchero, Fernando

    2014-01-01

    Understanding the mechanisms that drive population dynamics is fundamental for management of wild populations. The guanaco (Lama guanicoe) is one of two wild camelid species in South America. We evaluated the effects of density dependence and weather variables on population regulation based on a time series of 36 years of population sampling of guanacos in Tierra del Fuego, Chile. The population density varied between 2.7 and 30.7 guanaco/km2, with an apparent monotonic growth during the first 25 years; however, in the last 10 years the population has shown large fluctuations, suggesting that it might have reached its carrying capacity. We used a Bayesian state-space framework and model selection to determine the effect of density and environmental variables on guanaco population dynamics. Our results show that the population is under density dependent regulation and that it is currently fluctuating around an average carrying capacity of 45,000 guanacos. We also found a significant positive effect of previous winter temperature while sheep density has a strong negative effect on the guanaco population growth. We conclude that there are significant density dependent processes and that climate as well as competition with domestic species have important effects determining the population size of guanacos, with important implications for management and conservation.

  6. Phylogenetic relationships and generic delimitation of Eurasian Aster (Asteraceae: Astereae) inferred from ITS, ETS and trnL-F sequence data

    PubMed Central

    Li, Wei-Ping; Yang, Fu-Sheng; Jivkova, Todorka; Yin, Gen-Shen

    2012-01-01

    Background and Aims The classification and phylogeny of Eurasian (EA) Aster (Asterinae, Astereae, Asteraceae) remain poorly resolved. Some taxonomists adopt a broad definition of EA Aster, whereas others favour a narrow generic concept. The present study aims to delimit EA Aster sensu stricto (s.s.), elucidate the phylogenetic relationships of EA Aster s.s. and segregate genera. Methods The internal and external transcribed spacers of nuclear ribosomal DNA and the plastid DNA trnL-F region were used to reconstruct the phylogeny of EA Aster through maximum parsimony and Bayesian analyses. Key Results The analyses strongly support an Aster clade including the genera Sheareria, Rhynchospermum, Kalimeris (excluding Kalimeris longipetiolata), Heteropappus, Miyamayomena, Turczaninowia, Rhinactinidia, eastern Asian Doellingeria, Asterothamnus and Arctogeron. Many well-recognized species of Chinese Aster s.s. lie outside of the Aster clade. Conclusions The results reveal that EA Aster s.s. is both paraphyletic and polyphyletic. Sheareria, Rhynchospermum, Kalimeris (excluding K. longipetiolata), Heteropappus, Miyamayomena, Turczaninowia, Rhinactinidia, eastern Asian Doellingeria, Asterothamnus and Arctogeron should be included in Aster, whereas many species of Chinese Aster s.s. should be excluded. The recircumscribed Aster should be divided into two subgenera and nine sections. Kalimeris longipetiolata, Aster batangensis, A. ser. Albescentes, A. series Hersileoides, a two-species group composed of A. senecioides and A. fuscescens, and a six-species group including A. asteroides, should be elevated to generic level. With the Aster clade, they belong to the Australasian lineages. The generic status of Callistephus should be maintained. Whether Galatella (including Crinitina) and Tripolium should remain as genera or be merged into a single genus remains to be determined. In addition, the taxonomic status of A. auriculatus and the A. pycnophyllus–A. panduratus clade remains

  7. Molecular phylogenetic analysis of Chinese indigenous blue-shelled chickens inferred from whole genomic region of the SLCO1B3 gene.

    PubMed

    Dalirsefat, Seyed Benyamin; Dong, Xianggui; Deng, Xuemei

    2015-08-01

    In total, 246 individuals from 8 Chinese indigenous blue- and brown-shelled chicken populations (Yimeng Blue, Wulong Blue, Lindian Blue, Dongxiang Blue, Lushi Blue, Jingmen Blue, Dongxiang Brown, and Lushi Brown) were genotyped for 21 SNP markers from the SLCO1B3 gene to evaluate phylogenetic relationships. As a representative of nonblue-shelled breeds, White Leghorn was included in the study for reference. A high proportion of SNP polymorphism was observed in Chinese chicken populations, ranging from 89% in Jingmen Blue to 100% in most populations, with a mean of 95% across all populations. The White Leghorn breed showed the lowest polymorphism, accounting for 43% of total SNPs. The mean expected heterozygosity varied from 0.11 in Dongxiang Blue to 0.46 in Yimeng Blue. Analysis of molecular variation (AMOVA) for 2 groups of Chinese chickens based on eggshell color type revealed 52% within-group and 43% between-group variations of the total genetic variation. As expected, FST and Reynolds' genetic distance were greatest between White Leghorn and Chinese chicken populations, with average values of 0.40 and 0.55, respectively. The first and second principal coordinates explained approximately 92% of the total variation and supported the clustering of the populations according to their eggshell color type and historical origins. STRUCTURE analysis showed a considerable source of variation among populations for the clustering into blue-shelled and nonblue-shelled chicken populations. The low estimation of genetic differentiation (FST) between Chinese chicken populations is possibly due to a common historical origin and high gene flow. Remarkably similar population classifications were obtained with all methods used in the study. Aligning endogenous avian retroviral (EAV)-HP insertion sequences showed no difference among the blue-shelled chickens.

  8. Molecular evolution inferred from small subunit rRNA sequences: what does it tell us about phylogenetic relationships and taxonomy of the parabasalids?

    NASA Technical Reports Server (NTRS)

    Viscogliosi, E.; Edgcomb, V. P.; Gerbod, D.; Noel, C.; Delgado-Viscogliosi, P.; Sogin, M. L. (Principal Investigator)

    1999-01-01

    The Parabasala are a primitive group of protists divided into two classes: the trichomonads and the hypermastigids. Until recently, phylogeny and taxonomy of parabasalids were mainly based on the comparative analysis of morphological characters primarily linked to the development of their cytoskeleton. Recent use of molecular markers, such as small subunit (SSU) rRNA has led to now insights into the systematics of the Parabasala and other groups of prolists. An updated phylogeny based on SSU rRNA is provided and compared to that inferred from ultrastructural data. The SSU rRNA phylogeny contradicts the dogma equating simple characters with pumitive characters. Hypermastigids, possessing a hyperdeveloped cytoskeleton, exhibit the most basal emergence in the parabasalid lineage. Other observations emerge from the SSU rRNA analysis, such as the secondary loss of some cytoskeleton structures in all representatives of the Monocercomonadidae, the existence of secondarily free living taxa (reversibility of parasitism) and the evidence against the co-evolution of the endobiotic parabasalids and their animal hosts. According to phylogenies based on SSU rRNA, all the trichomonad families are not monophyletic groups, putting into question the validity of current taxonomic assignments. The precise branching order of some taxa remains unclear, but this issue can possibly be addressed by the molecular analysis of additional parabasalids. The goal of such additional analyses would be to propose, in a near future, a revision of the taxonomy of this group of protists that takes into account both molecular and morphological data.

  9. Molecular evolution inferred from small subunit rRNA sequences: what does it tell us about phylogenetic relationships and taxonomy of the parabasalids?

    PubMed

    Viscogliosi, E; Edgcomb, V P; Gerbod, D; Noël, C; Delgado-Viscogliosi, P

    1999-12-01

    The Parabasala are a primitive group of protists divided into two classes: the trichomonads and the hypermastigids. Until recently, phylogeny and taxonomy of parabasalids were mainly based on the comparative analysis of morphological characters primarily linked to the development of their cytoskeleton. Recent use of molecular markers, such as small subunit (SSU) rRNA has led to now insights into the systematics of the Parabasala and other groups of prolists. An updated phylogeny based on SSU rRNA is provided and compared to that inferred from ultrastructural data. The SSU rRNA phylogeny contradicts the dogma equating simple characters with pumitive characters. Hypermastigids, possessing a hyperdeveloped cytoskeleton, exhibit the most basal emergence in the parabasalid lineage. Other observations emerge from the SSU rRNA analysis, such as the secondary loss of some cytoskeleton structures in all representatives of the Monocercomonadidae, the existence of secondarily free living taxa (reversibility of parasitism) and the evidence against the co-evolution of the endobiotic parabasalids and their animal hosts. According to phylogenies based on SSU rRNA, all the trichomonad families are not monophyletic groups, putting into question the validity of current taxonomic assignments. The precise branching order of some taxa remains unclear, but this issue can possibly be addressed by the molecular analysis of additional parabasalids. The goal of such additional analyses would be to propose, in a near future, a revision of the taxonomy of this group of protists that takes into account both molecular and morphological data.

  10. Ionospheric convection inferred from interplanetary magnetic field-dependent Birkeland currents

    NASA Technical Reports Server (NTRS)

    Rasmussen, C. E.; Schunk, R. W.

    1988-01-01

    Computer simulations of ionospheric convection have been performed, combining empirical models of Birkeland currents with a model of ionospheric conductivity in order to investigate IMF-dependent convection characteristics. Birkeland currents representing conditions in the northern polar cap of the negative IMF By component are used. Two possibilities are considered: (1) the morning cell shifting into the polar cap as the IMF turns northward, and this cell and a distorted evening cell providing for sunward flow in the polar cap; and (2) the existence of a three-cell pattern when the IMF is strongly northward.

  11. Structure, expression profile and phylogenetic inference of chalcone isomerase-like genes from the narrow-leafed lupin (Lupinus angustifolius L.) genome

    PubMed Central

    Przysiecka, Łucja; Książkiewicz, Michał; Wolko, Bogdan; Naganowska, Barbara

    2015-01-01

    Lupins, like other legumes, have a unique biosynthesis scheme of 5-deoxy-type flavonoids and isoflavonoids. A key enzyme in this pathway is chalcone isomerase (CHI), a member of CHI-fold protein family, encompassing subfamilies of CHI1, CHI2, CHI-like (CHIL), and fatty acid-binding (FAP) proteins. Here, two Lupinus angustifolius (narrow-leafed lupin) CHILs, LangCHIL1 and LangCHIL2, were identified and characterized using DNA fingerprinting, cytogenetic and linkage mapping, sequencing and expression profiling. Clones carrying CHIL sequences were assembled into two contigs. Full gene sequences were obtained from these contigs, and mapped in two L. angustifolius linkage groups by gene-specific markers. Bacterial artificial chromosome fluorescence in situ hybridization approach confirmed the localization of two LangCHIL genes in distinct chromosomes. The expression profiles of both LangCHIL isoforms were very similar. The highest level of transcription was in the roots of the third week of plant growth; thereafter, expression declined. The expression of both LangCHIL genes in leaves and stems was similar and low. Comparative mapping to reference legume genome sequences revealed strong syntenic links; however, LangCHIL2 contig had a much more conserved structure than LangCHIL1. LangCHIL2 is assumed to be an ancestor gene, whereas LangCHIL1 probably appeared as a result of duplication. As both copies are transcriptionally active, questions arise concerning their hypothetical functional divergence. Screening of the narrow-leafed lupin genome and transcriptome with CHI-fold protein sequences, followed by Bayesian inference of phylogeny and cross-genera synteny survey, identified representatives of all but one (CHI1) main subfamilies. They are as follows: two copies of CHI2, FAPa2 and CHIL, and single copies of FAPb and FAPa1. Duplicated genes are remnants of whole genome duplication which is assumed to have occurred after the divergence of Lupinus, Arachis, and Glycine

  12. Phylogenetic position of Magnivitellinum Kloss, 1966 and Perezitrema Baruš & Moravec, 1967 (Trematoda: Plagiorchioidea: Macroderoididae) inferred from partial 28S rDNA sequences, with the establishment of Alloglossidiidae n. fam.

    PubMed

    Hernández-Mena, David Iván; Mendoza-Garfias, Berenit; Ornelas-García, Claudia Patricia; Pérez-Ponce de León, Gerardo

    2016-07-01

    The systematic position of two genera of Macroderoididae McMullen, 1937, Perezitrema Baruš & Moravec, 1967 and Magnivitellinum Kloss, 1966 is reviewed based on a phylogenetic analysis of the interrelationships of 15 species of the family allocated into six genera, along with 44 species of plagiorchioid trematodes, using partial sequences of the 28S rRNA gene. Sequences were analysed through parsimony, maximum likelihood and Bayesian inference. The obtained topologies show Perezitrema as the sister taxon of three species of Macroderoides Pearse, 1924; the latter genus appears to be paraphyletic since another three species are not included in this group. Instead, Magnivitellinum was placed as the sister taxon of Alloglossidium Simer, 1929. These relationships are well supported by high bootstrap and posterior probability values. The resulting trees demonstrate that the family Macroderoididae, as currently conceived in taxonomic treatments, is not monophyletic. Magnivitellinum simplex Kloss, 1966 and Alloglossidium spp. were nested as sister taxa of members of the family Leptophallidae Dayal, 1938, whereas Perezitrema bychowskii Baruš & Moravec, 1967 and species of Macroderoides and Paramacroderoides Venard, 1941 were grouped with Auridistomum chelydrae (Stafford, 1900), a monotypic member of Auridistomidae Stunkard, 1924. Based on our results, a new family, Alloglossidiidae n. fam. was established to accommodate the genera Magnivitellinum and Alloglossidium. PMID:27307166

  13. Utility of ITS1-5.8S-ITS2 and 16S mitochondrial DNA sequences for species identification and phylogenetic inference within the Rhinonyssus coniventris species complex (Acari: Rhinonyssidae).

    PubMed

    de Rojas, Manuel; Ubeda, José Manuel; Cutillas, Cristina; Mora, Maria Dolores; Ariza, Concepción; Guevara, Diego

    2007-04-01

    The complete internal transcribed spacer 1 (ITS1), 5.8S rDNA, and ITS2 region of the ribosomal DNA and a 390-bp region of the 16S rDNA gene from five taxa belonging to Rhinonyssus (Rhinonyssus vanellus, Rhinonyssus tringae, Rhinonyssus neglectus, Rhinonyssus echinipes from Kentish plover, and Rhinonyssus echinipes from grey plover) were sequenced to examine the level of sequence variation and the taxonomic levels to show utility in phylogeny estimation. Our data show that these molecular markers can help to discriminate between species and populations included in the Rhinonyssus coniventris complex (R. tringae, R. neglectus, R. echinipes), which are morphologically very close and difficult to separate by classic methods. A comparative study with sequences from other rhinonyssid mites previously published was also carried out. The resulting phylogenetic tree inferred from ITS1-5.8S-ITS2 region sequences obtained in this paper, together with those from other 11 taxa of rhinonyssid, shows slight differences from the current taxonomy of the Rhinonyssidae. This study appeals for the revision of the taxonomic status of the R. coniventris complex, as well as for the species included within it.

  14. The Phylogenetic Diversity of Metagenomes

    PubMed Central

    Kembel, Steven W.; Eisen, Jonathan A.; Pollard, Katherine S.; Green, Jessica L.

    2011-01-01

    Phylogenetic diversity—patterns of phylogenetic relatedness among organisms in ecological communities—provides important insights into the mechanisms underlying community assembly. Studies that measure phylogenetic diversity in microbial communities have primarily been limited to a single marker gene approach, using the small subunit of the rRNA gene (SSU-rRNA) to quantify phylogenetic relationships among microbial taxa. In this study, we present an approach for inferring phylogenetic relationships among microorganisms based on the random metagenomic sequencing of DNA fragments. To overcome challenges caused by the fragmentary nature of metagenomic data, we leveraged fully sequenced bacterial genomes as a scaffold to enable inference of phylogenetic relationships among metagenomic sequences from multiple phylogenetic marker gene families. The resulting metagenomic phylogeny can be used to quantify the phylogenetic diversity of microbial communities based on metagenomic data sets. We applied this method to understand patterns of microbial phylogenetic diversity and community assembly along an oceanic depth gradient, and compared our findings to previous studies of this gradient using SSU-rRNA gene and metagenomic analyses. Bacterial phylogenetic diversity was highest at intermediate depths beneath the ocean surface, whereas taxonomic diversity (diversity measured by binning sequences into taxonomically similar groups) showed no relationship with depth. Phylogenetic diversity estimates based on the SSU-rRNA gene and the multi-gene metagenomic phylogeny were broadly concordant, suggesting that our approach will be applicable to other metagenomic data sets for which corresponding SSU-rRNA gene sequences are unavailable. Our approach opens up the possibility of using metagenomic data to study microbial diversity in a phylogenetic context. PMID:21912589

  15. Inference for binomial probability based on dependent Bernoulli random variables with applications to meta-analysis and group level studies.

    PubMed

    Bakbergenuly, Ilyas; Kulinskaya, Elena; Morgenthaler, Stephan

    2016-07-01

    We study bias arising as a result of nonlinear transformations of random variables in random or mixed effects models and its effect on inference in group-level studies or in meta-analysis. The findings are illustrated on the example of overdispersed binomial distributions, where we demonstrate considerable biases arising from standard log-odds and arcsine transformations of the estimated probability p̂, both for single-group studies and in combining results from several groups or studies in meta-analysis. Our simulations confirm that these biases are linear in ρ, for small values of ρ, the intracluster correlation coefficient. These biases do not depend on the sample sizes or the number of studies K in a meta-analysis and result in abysmal coverage of the combined effect for large K. We also propose bias-correction for the arcsine transformation. Our simulations demonstrate that this bias-correction works well for small values of the intraclass correlation. The methods are applied to two examples of meta-analyses of prevalence. PMID:27192062

  16. Inference for binomial probability based on dependent Bernoulli random variables with applications to meta‐analysis and group level studies

    PubMed Central

    Bakbergenuly, Ilyas; Morgenthaler, Stephan

    2016-01-01

    We study bias arising as a result of nonlinear transformations of random variables in random or mixed effects models and its effect on inference in group‐level studies or in meta‐analysis. The findings are illustrated on the example of overdispersed binomial distributions, where we demonstrate considerable biases arising from standard log‐odds and arcsine transformations of the estimated probability p^, both for single‐group studies and in combining results from several groups or studies in meta‐analysis. Our simulations confirm that these biases are linear in ρ, for small values of ρ, the intracluster correlation coefficient. These biases do not depend on the sample sizes or the number of studies K in a meta‐analysis and result in abysmal coverage of the combined effect for large K. We also propose bias‐correction for the arcsine transformation. Our simulations demonstrate that this bias‐correction works well for small values of the intraclass correlation. The methods are applied to two examples of meta‐analyses of prevalence. PMID:27192062

  17. The phylogenetic likelihood library.

    PubMed

    Flouri, T; Izquierdo-Carrasco, F; Darriba, D; Aberer, A J; Nguyen, L-T; Minh, B Q; Von Haeseler, A; Stamatakis, A

    2015-03-01

    We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2-10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL).

  18. Is it possible to infer the frequency-dependent seismic attenuation of fractured materials from high-strain creep tests?

    NASA Astrophysics Data System (ADS)

    mallet, celine; quintal, beatriz; caspari, eva; holliger, klaus

    2016-04-01

    The seismic and hydraulic characterization of fractured rocks is an important objective for reservoir development in general and the production of geothermal energy in particular. The attenuation of seismic waves in saturated fractured media is governed by local displacements of the fluid relative to the solid induced by the compressions and extensions associated with the passing wavefield. This phenomenon is generally referred to as wave-induced fluid flow (WIFF). Recent evidence suggests that this energy dissipation mechanism is sensitive to the interconnectivity of the fractures, which offers the perspective of linking seismic observations to the hydraulic properties of fractured rocks. Here, we consider the results of laboratory experiments, which are referred to as creep tests. Such tests consist of applying a constant stress to a water-saturated thermally cracked glass sample and recording the resulting strain response as a function of time. The primary advantages of the considered material are (i) that the fracture network is well documented and (ii) that the homogeneous and non-porous glass matrix limits WIFF to the fracture network. Due to the high stress levels as well as other technical issues, creep tests are not commonly used for laboratory-based measurements of energy dissipation. Therefore, an objective of this study is to explore whether and to what extent such data can be interpreted in terms of the seismic attenuation characteristics of the probed samples, as this might open access to a vast reservoir of corresponding data, notably for cracked materials. Transforming the observed time-dependent stress-strain relation into the Fourier domain, allows us to infer the corresponding frequency-dependent attenuation characteristics, which we then seek to interpret through numerical simulations based on Biot's quasi-static poroelastic equations. The 2D geometry of the fracture network considered in these simulations is derived from a scanning electron

  19. Water relations traits of C4 grasses depend on phylogenetic lineage, photosynthetic pathway, and habitat water availability.

    PubMed

    Liu, Hui; Osborne, Colin P

    2015-02-01

    The repeated evolution of C4 photosynthesis in independent lineages has resulted in distinct biogeographical distributions in different phylogenetic lineages and the variants of C4 photosynthesis. However, most previous studies have only considered C3/C4 differences without considering phylogeny, C4 subtype, or habitat characteristics. We hypothesized that independent lineages of C4 grasses have structural and physiological traits that adapt them to environments with differing water availability. We measured 40 traits of 33 species from two major C4 grass lineages in a common glasshouse environment. Chloridoideae species were shorter, with narrower and longer leaves, smaller but denser stomata, and faster curling leaves than Panicoideae species, but overall differences in leaf hydraulic and gas exchange traits between the two lineages were weak. Chloridoideae species had two different ways to reach higher drought resistance potential than Panicoideae; NAD-ME species used water saving, whereas PCK species used osmotic adjustment. These patterns could be explained by the interactions of lineage×C4 subtype and lineage×habitat water availability in affected traits. Specifically, phylogeny tended to have a stronger influence on structural traits, and C4 subtype had more important effects on physiological traits. Although hydraulic traits did not differ consistently between lineages, they showed strong covariation and relationships with leaf structure. Thus, phylogenetic lineage, photosynthetic pathway, and adaptation to habitat water availability act together to influence the leaf water relations traits of C4 grasses. This work expands our understanding of ecophysiology in major C4 grass lineages, with implications for explaining their regional and global distributions in relation to climate.

  20. Water relations traits of C4 grasses depend on phylogenetic lineage, photosynthetic pathway, and habitat water availability

    PubMed Central

    Liu, Hui; Osborne, Colin P.

    2015-01-01

    The repeated evolution of C4 photosynthesis in independent lineages has resulted in distinct biogeographical distributions in different phylogenetic lineages and the variants of C4 photosynthesis. However, most previous studies have only considered C3/C4 differences without considering phylogeny, C4 subtype, or habitat characteristics. We hypothesized that independent lineages of C4 grasses have structural and physiological traits that adapt them to environments with differing water availability. We measured 40 traits of 33 species from two major C4 grass lineages in a common glasshouse environment. Chloridoideae species were shorter, with narrower and longer leaves, smaller but denser stomata, and faster curling leaves than Panicoideae species, but overall differences in leaf hydraulic and gas exchange traits between the two lineages were weak. Chloridoideae species had two different ways to reach higher drought resistance potential than Panicoideae; NAD-ME species used water saving, whereas PCK species used osmotic adjustment. These patterns could be explained by the interactions of lineage×C4 subtype and lineage×habitat water availability in affected traits. Specifically, phylogeny tended to have a stronger influence on structural traits, and C4 subtype had more important effects on physiological traits. Although hydraulic traits did not differ consistently between lineages, they showed strong covariation and relationships with leaf structure. Thus, phylogenetic lineage, photosynthetic pathway, and adaptation to habitat water availability act together to influence the leaf water relations traits of C4 grasses. This work expands our understanding of ecophysiology in major C4 grass lineages, with implications for explaining their regional and global distributions in relation to climate. PMID:25504656

  1. Habitat species pools for phylogenetic structure in microbes.

    PubMed

    Wang, Jianjun; Soininen, Janne; Shen, Ji

    2013-06-01

    The processes underlying the phylogenetic structure of the biotic communities are typically scale-dependent and thus often poorly resolved. Illustrated by the study of macroorganisms, it is suggested that the relative influence of ecological processes on the phylogenetic structure of the communities can be inferred by the geographical definition of the species pools. However, given the high dispersal ability of microbes, the spatial delineation of the species pool may not be that practical for microbial taxa. This idea is supported by the observational data on bacteria along an elevational gradient. Significant negative values of standardized effect size of the mean nearest taxon distance were consistently observed for different sized species pools considered. Reviewing the reports on microbial phylogenetic structure so far, we suggested that the 'habitat species pools' are perhaps more important for microbes than spatial delineated 'regional species pools'. PMID:23754726

  2. Phylogenetic trees in bioinformatics

    SciTech Connect

    Burr, Tom L

    2008-01-01

    Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use ofprobabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length, and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that fmding the best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.

  3. Inference in `poor` languages

    SciTech Connect

    Petrov, S.

    1996-10-01

    Languages with a solvable implication problem but without complete and consistent systems of inference rules (`poor` languages) are considered. The problem of existence of finite complete and consistent inference rule system for a ``poor`` language is stated independently of the language or rules syntax. Several properties of the problem arc proved. An application of results to the language of join dependencies is given.

  4. CVTree: a phylogenetic tree reconstruction tool based on whole genomes.

    PubMed

    Qi, Ji; Luo, Hong; Hao, Bailin

    2004-07-01

    Composition Vector Tree (CVTree) implements a systematic method of inferring evolutionary relatedness of microbial organisms from the oligopeptide content of their complete proteomes (http://cvtree.cbi.pku.edu.cn). Since the first bacterial genomes were sequenced in 1995 there have been several attempts to infer prokaryote phylogeny from complete genomes. Most of them depend on sequence alignment directly or indirectly and, in some cases, need fine-tuning and adjustment. The composition vector method circumvents the ambiguity of choosing the genes for phylogenetic reconstruction and avoids the necessity of aligning sequences of essentially different length and gene content. This new method does not contain 'free' parameter and 'fine-tuning'. A bootstrap test for a phylogenetic tree of 139 organisms has shown the stability of the branchings, which support the small subunit ribosomal RNA (SSU rRNA) tree of life in its overall structure and in many details. It may provide a quick reference in prokaryote phylogenetics whenever the proteome of an organism is available, a situation that will become commonplace in the near future.

  5. The Association of Selected Conative Variables to Field-Dependence with Inferences for Reasoning Characteristics in Marketing Education.

    ERIC Educational Resources Information Center

    Fritz, Robert L.

    A study examined the association between field-dependence and its related information processing characteristics, and educational cognitive style as a model of conative influence. Data were collected from 145 secondary marketing education students in nothern Georgia during spring 1991. Descriptive statistics, Pearson product moment correlations,…

  6. Satellite-Based Evidence of Wavelength-Dependent Aerosol Absorption in Biomass Burning Smoke Inferred from Ozone Monitoring Instrument

    NASA Technical Reports Server (NTRS)

    Jethva, H.; Torres, O.

    2012-01-01

    We provide satellite-based evidence of the spectral dependence of absorption in biomass burning aerosols over South America using near-UV measurements made by the Ozone Monitoring Instrument (OMI) during 2005-2007. In the current near-UV OMI aerosol algorithm (OMAERUV), it is implicitly assumed that the only absorbing component in carbonaceous aerosols is black carbon whose imaginary component of the refractive index is wavelength independent. With this assumption, OMI-derived aerosol optical depth (AOD) is found to be significantly over-estimated compared to that of AERONET at several sites during intense biomass burning events (August-September). Other well-known sources of error affecting the near-UV method of aerosol retrieval do not explain the large observed AOD discrepancies between the satellite and the ground-based observations. A number of studies have revealed strong spectral dependence in carbonaceous aerosol absorption in the near-UV region suggesting the presence of organic carbon in biomass burning generated aerosols. A sensitivity analysis examining the importance of accounting for the presence of wavelength-dependent aerosol absorption in carbonaceous particles in satellite-based remote sensing was carried out in this work. The results convincingly show that the inclusion of spectrally-dependent aerosol absorption in the radiative transfer calculations leads to a more accurate characterization of the atmospheric load of carbonaceous aerosols.

  7. High-resolution phylogenetic microbial community profiling

    PubMed Central

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; Bowman, Brett; Bowers, Robert M; Levy, Asaf; Gies, Esther A; Cheng, Jan-Fang; Copeland, Alex; Klenk, Hans-Peter; Hallam, Steven J; Hugenholtz, Philip; Tringe, Susannah G; Woyke, Tanja

    2016-01-01

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential. PMID:26859772

  8. The dependence of PGA and PGV on distance and magnitude inferred from Northern California ShakeMap data

    USGS Publications Warehouse

    Boatwright, J.; Bundock, H.; Luetgert, J.; Seekins, L.; Gee, L.; Lombard, P.

    2003-01-01

    We analyze peak ground velocity (PGV) and peak ground acceleration (PGA) data from 95 moderate (3.5 ??? M 100 km, the peak motions attenuate more rapidly than a simple power law (that is, r-??) can fit. Instead, we use an attenuation function that combines a fixed power law (r-0.7) with a fitted exponential dependence on distance, which is estimated as expt(-0.0063r) and exp(-0.0073r) for PGV and PGA, respectively, for moderate earthquakes. We regress log(PGV) and log(PGA) as functions of distance and magnitude. We assume that the scaling of log(PGV) and log(PGA) with magnitude can differ for moderate and large earthquakes, but must be continuous. Because the frequencies that carry PGV and PGA can vary with earthquake size for large earthquakes, the regression for large earthquakes incorporates a magnitude dependence in the exponential attenuation function. We fix the scaling break between moderate and large earthquakes at M 5.5; log(PGV) and log(PGA) scale as 1.06M and 1.00M, respectively, for moderate earthquakes and 0.58M and 0.31M for large earthquakes.

  9. Gravity wave activity in the thermosphere inferred from GOCE data, and its dependence on solar flux conditions.

    NASA Astrophysics Data System (ADS)

    Garcia, Raphael F.; Bruinsma, Sean; Doornbos, Eelco; Massarweh, Lotfi

    2016-04-01

    This study is focused on the effect of solar flux conditions on the dynamics of Gravity Waves (GW) in thermosphere. Air density and cross-wind in situ estimates from the Gravity Field and Steady-State Ocean Circulation Explorer (GOCE) accelerometers are analyzed for the whole mission duration. The analysis was performed in the Fourier spectral domain averaging spectral results over periods of 2 months close to solstices. First the Amplitude Spectral Density (ASD) and the Magnitude Squared Coherence (MSC) of physical parameters are linked to local gravity waves. Then, a new GW marker (called Cf3) was introduced here to constrain GWs activity under Low, Medium and High solar flux conditions, showing a clear solar dumping effect on GW activity. Most of GW signal has been found in a spectral range above 8 mHz in GOCE data, meaning a maximum horizontal wavelength around 1000 km. The level GW activity at GOCE altitude is strongly decreasing with increasing solar flux. Furthermore, a shift in the dominant frequency with solar flux conditions has been noted, leading to a larger horizontal wavelengths (from 200 to 500 km) during high solar flux conditions. The influence of correlated error sources, between air density and cross-winds, is discussed. Consistency of the spectral domain results has been verified in time-domain with a global mapping of high frequency perturbations along GOCE orbit. This analysis shows a clear dependence with geomagnetic latitude with strong perturbations at magnetic poles, and an extension to lower latitudes favoured by low solar activity conditions. Various possible causes of this spatial trend are discussed.

  10. Phylogenetic and morphological relationships between nonvolant small mammals reveal assembly processes at different spatial scales

    PubMed Central

    Luza, André Luís; Gonçalves, Gislene Lopes; Hartz, Sandra Maria

    2015-01-01

    The relative roles of historical processes, environmental filtering, and ecological interactions in the organization of species assemblages vary depending on the spatial scale. We evaluated the phylogenetic and morphological relationships between species and individuals (i.e., inter- and intraspecific variability) of Neotropical nonvolant small mammals coexisting in grassland-forest ecotones, in landscapes and in regions, that is, three different scales. We used a phylogenetic tree to infer evolutionary relationships, and morphological traits as indicators of performance and niche similarities between species and individuals. Subsequently, we applied phylogenetic and morphologic indexes of diversity and distance between species to evaluate small mammal assemblage structures on the three scales. The results indicated a repulsion pattern near forest edges, showing that phylogenetically similar species coexisted less often than expected by chance. The strategies for niche differentiation might explain the phylogenetic repulsion observed at the edge. Phylogenetic and morphological clustering in the grassland and at the forest interior indicated the coexistence of closely related and ecologically similar species and individuals. Coexistence patterns were similar whether species-trait values or individual values were used. At the landscape and regional scales, assemblages showed a predominant pattern of phylogenetic and morphological clustering. Environmental filters influenced the coexistence patterns at three scales, showing the importance of phylogenetically conserved ecological tolerances in enabling taxa co-occurrence. Evidence of phylogenetic repulsion in one region indicated that other processes beyond environmental filtering are important for community assembly at broad scales. Finally, ecological interactions and environmental filtering seemed important at the local scale, while environmental filtering and historical colonization seemed important for community

  11. Phylogeny and the inference of evolutionary trajectories.

    PubMed

    Hancock, Lillian; Edwards, Erika J

    2014-07-01

    Most important organismal adaptations are not actually single traits, but complex trait syndromes that are evolutionarily integrated into a single emergent phenotype. Two alternative photosynthetic pathways, C4 photosynthesis and crassulacean acid metabolism (CAM), are primary plant adaptations of this sort, each requiring multiple biochemical and anatomical modifications. Phylogenetic methods are a promising approach for teasing apart the order of character acquisition during the evolution of complex traits, and the phylogenetic placement of intermediate phenotypes as sister taxa to fully optimized syndromes has been taken as good evidence of an 'ordered' evolutionary trajectory. But how much power does the phylogenetic approach have to detect ordered evolution? This study simulated ordered and unordered character evolution across a diverse set of phylogenetic trees to understand how tree size, models of evolution, and sampling efforts influence the ability to detect an evolutionary trajectory. The simulations show that small trees (15 taxa) do not contain enough information to correctly infer either an ordered or unordered trajectory, although inference improves as tree size and sampling increases. However, even when working with a 1000-taxon tree, the possibility of inferring the incorrect evolutionary model (type I/type II error) remains. Caution is needed when interpreting the phylogenetic placement of intermediate phenotypes, especially in small lineages. Such phylogenetic patterns can provide a line of evidence for the existence of a particular evolutionary trajectory, but they should be coupled with other types of data to infer the stepwise evolution of a complex character trait.

  12. Cancer evolution: mathematical models and computational inference.

    PubMed

    Beerenwinkel, Niko; Schwarz, Roland F; Gerstung, Moritz; Markowetz, Florian

    2015-01-01

    Cancer is a somatic evolutionary process characterized by the accumulation of mutations, which contribute to tumor growth, clinical progression, immune escape, and drug resistance development. Evolutionary theory can be used to analyze the dynamics of tumor cell populations and to make inference about the evolutionary history of a tumor from molecular data. We review recent approaches to modeling the evolution of cancer, including population dynamics models of tumor initiation and progression, phylogenetic methods to model the evolutionary relationship between tumor subclones, and probabilistic graphical models to describe dependencies among mutations. Evolutionary modeling helps to understand how tumors arise and will also play an increasingly important prognostic role in predicting disease progression and the outcome of medical interventions, such as targeted therapy.

  13. Cancer Evolution: Mathematical Models and Computational Inference

    PubMed Central

    Beerenwinkel, Niko; Schwarz, Roland F.; Gerstung, Moritz; Markowetz, Florian

    2015-01-01

    Cancer is a somatic evolutionary process characterized by the accumulation of mutations, which contribute to tumor growth, clinical progression, immune escape, and drug resistance development. Evolutionary theory can be used to analyze the dynamics of tumor cell populations and to make inference about the evolutionary history of a tumor from molecular data. We review recent approaches to modeling the evolution of cancer, including population dynamics models of tumor initiation and progression, phylogenetic methods to model the evolutionary relationship between tumor subclones, and probabilistic graphical models to describe dependencies among mutations. Evolutionary modeling helps to understand how tumors arise and will also play an increasingly important prognostic role in predicting disease progression and the outcome of medical interventions, such as targeted therapy. PMID:25293804

  14. Extreme gender flexibility: using a phylogenetic framework to infer the evolution of variation in sex allocation, phylogeography, and speciation in a genus of bidirectional sex changing fishes(Lythrypnus, Gobiidae).

    PubMed

    Maxfield, Jessica M; Van Tassell, James L; St Mary, Colette M; Joyeux, Jean-Christophe; Crow, Karen D

    2012-09-01

    The genus Lythrypnus is a group of marine gobies that exhibit extreme gender flexibility as bidirectional sex changers. The genus consists of 20 described species and several undescribed species that are distributed in the Americas. Five species have been characterized with respect to sex allocation and gonad morphology. The hormonal, morphological, and behavioral aspects of sex change have been studied extensively for one species, L. dalli. These data, however, have not been interpreted in an evolutionary context because a phylogenetic hypothesis has not previously been proposed for the genus Lythrypnus. We propose the first phylogenetic hypothesis for the genus based on molecular data from three mitochondrial genes (12s, ND2, and Cytb), one nuclear gene (Rag1) and one nuclear intron (S7). We also include three previously undescribed Lythrypnus species. Our results support the monophyly of the genus with L. heterochroma, an Atlantic species, as the basal taxon. After the divergence of L. heterochroma, there are two main clades, one comprised of species distributed in the Atlantic, the other comprised of species distributed in the Pacific. These data indicate an Atlantic origin for the genus, followed by divergence after the closure of the Isthmus of Panama. Our data also support the monophyly of three previously described species complexes, the L. rhizophora complex and L. dalli complex in the Pacific, and the L. mowbrayi complex in the Atlantic. We mapped patterns of sex allocation within this genus onto the fully resolved and supported topology, and found that sexual plasticity and gender flexibility is likely a synapomorphy for the genus. Overall our results create a well-supported framework to understand the phylogeography of the genus, and to interpret the evolution of sex allocation in Lythrypnus gobies.

  15. Phylogenetic position of Linguatula arctica and Linguatula serrata (Pentastomida) as inferred from the nuclear 18S rRNA gene and the mitochondrial cytochrome c oxidase subunit I gene.

    PubMed

    Gjerde, Bjørn

    2013-10-01

    Genomic DNA was isolated from a Linguatula serrata female expelled from a dog imported to Norway from Romania and from four Linguatula arctica females collected from semi-domesticated reindeer from northern Norway and subjected to PCR amplification of the complete nuclear 18S rRNA gene and a 1,045-bp portion of the mitochondrial cytochrome c oxidase subunit I gene (cox1). The two species differed at two of 1,830 nucleotide positions (99.9% identity) of the complete 18S rRNA gene sequences and at 102 of 1,045 nucleotide positions (90.2% identity) of the partial cox1 sequences. The four isolates of L. arctica showed no genetic variation in either gene. The new cox1 primers may facilitate the diagnosis of various developmental stages of L. arctica and L. serrata in their hosts. In separate phylogenetic analyses using the maximum likelihood method on sequence data from either gene, L. arctica and L. serrata clustered with members of the order Cephalobaenida rather than with members of the order Porocephalida, in which the genus Linguatula is currently placed based on morphological characters. The phylogenetic relationship of L. arctica, L. serrata and other pentastomids to other metazoan groups could not be clearly resolved, but the pentastomids did not seem to have a sister relationship to crustaceans of the subclass Branchiura as found in other studies. A more extensive taxon sampling, including molecular characterisation of more pentastomid taxa across different genera, seems to be necessary in order to estimate the true relationship of the Pentastomida to other metazoan groups.

  16. Inferring biotic interactions from proxies.

    PubMed

    Morales-Castilla, Ignacio; Matias, Miguel G; Gravel, Dominique; Araújo, Miguel B

    2015-06-01

    Inferring biotic interactions from functional, phylogenetic and geographical proxies remains one great challenge in ecology. We propose a conceptual framework to infer the backbone of biotic interaction networks within regional species pools. First, interacting groups are identified to order links and remove forbidden interactions between species. Second, additional links are removed by examination of the geographical context in which species co-occur. Third, hypotheses are proposed to establish interaction probabilities between species. We illustrate the framework using published food-webs in terrestrial and marine systems. We conclude that preliminary descriptions of the web of life can be made by careful integration of data with theory.

  17. Morphological and molecular convergences in mammalian phylogenetics

    PubMed Central

    Zou, Zhengting; Zhang, Jianzhi

    2016-01-01

    Phylogenetic trees reconstructed from molecular sequences are often considered more reliable than those reconstructed from morphological characters, in part because convergent evolution, which confounds phylogenetic reconstruction, is believed to be rarer for molecular sequences than for morphologies. However, neither the validity of this belief nor its underlying cause is known. Here comparing thousands of characters of each type that have been used for inferring the phylogeny of mammals, we find that on average morphological characters indeed experience much more convergences than amino acid sites, but this disparity is explained by fewer states per character rather than an intrinsically higher susceptibility to convergence for morphologies than sequences. We show by computer simulation and actual data analysis that a simple method for identifying and removing convergence-prone characters improves phylogenetic accuracy, potentially enabling, when necessary, the inclusion of morphologies and hence fossils for reliable tree inference. PMID:27585543

  18. Morphological and molecular convergences in mammalian phylogenetics.

    PubMed

    Zou, Zhengting; Zhang, Jianzhi

    2016-01-01

    Phylogenetic trees reconstructed from molecular sequences are often considered more reliable than those reconstructed from morphological characters, in part because convergent evolution, which confounds phylogenetic reconstruction, is believed to be rarer for molecular sequences than for morphologies. However, neither the validity of this belief nor its underlying cause is known. Here comparing thousands of characters of each type that have been used for inferring the phylogeny of mammals, we find that on average morphological characters indeed experience much more convergences than amino acid sites, but this disparity is explained by fewer states per character rather than an intrinsically higher susceptibility to convergence for morphologies than sequences. We show by computer simulation and actual data analysis that a simple method for identifying and removing convergence-prone characters improves phylogenetic accuracy, potentially enabling, when necessary, the inclusion of morphologies and hence fossils for reliable tree inference. PMID:27585543

  19. Phylogenetic approaches to natural product structure prediction.

    PubMed

    Ziemert, Nadine; Jensen, Paul R

    2012-01-01

    Phylogenetics is the study of the evolutionary relatedness among groups of organisms. Molecular phylogenetics uses sequence data to infer these relationships for both organisms and the genes they maintain. With the large amount of publicly available sequence data, phylogenetic inference has become increasingly important in all fields of biology. In the case of natural product research, phylogenetic relationships are proving to be highly informative in terms of delineating the architecture and function of the genes involved in secondary metabolite biosynthesis. Polyketide synthases and nonribosomal peptide synthetases provide model examples in which individual domain phylogenies display different predictive capacities, resolving features ranging from substrate specificity to structural motifs associated with the final metabolic product. This chapter provides examples in which phylogeny has proven effective in terms of predicting functional or structural aspects of secondary metabolism. The basics of how to build a reliable phylogenetic tree are explained along with information about programs and tools that can be used for this purpose. Furthermore, it introduces the Natural Product Domain Seeker, a recently developed Web tool that employs phylogenetic logic to classify ketosynthase and condensation domains based on established enzyme architecture and biochemical function.

  20. Phylogenetic and biogeographic implications inferred by mitochondrial intergenic region analyses and ITS1-5.8S-ITS2 of the entomopathogenic fungi Beauveria bassiana and B. brongniartii

    PubMed Central

    2010-01-01

    Background The entomopathogenic fungi of the genus Beauveria are cosmopolitan with a variety of different insect hosts. The two most important species, B. bassiana and B. brongniartii, have already been used as biological control agents of pests in agriculture and as models for the study of insect host - pathogen interactions. Mitochondrial (mt) genomes, due to their properties to evolve faster than the nuclear DNA, to contain introns and mobile elements and to exhibit extended polymorphisms, are ideal tools to examine genetic diversity within fungal populations and genetically identify a species or a particular isolate. Moreover, mt intergenic region can provide valuable phylogenetic information to study the biogeography of the fungus. Results The complete mt genomes of B. bassiana (32,263 bp) and B. brongniartii (33,920 bp) were fully analysed. Apart from a typical gene content and organization, the Beauveria mt genomes contained several introns and had longer intergenic regions when compared with their close relatives. The phylogenetic diversity of a population of 84 Beauveria strains -mainly B. bassiana (n = 76) - isolated from temperate, sub-tropical and tropical habitats was examined by analyzing the nucleotide sequences of two mt intergenic regions (atp6-rns and nad3-atp9) and the nuclear ITS1-5.8S-ITS2 domain. Mt sequences allowed better differentiation of strains than the ITS region. Based on mt and the concatenated dataset of all genes, the B. bassiana strains were placed into two main clades: (a) the B. bassiana s. l. and (b) the "pseudobassiana". The combination of molecular phylogeny with criteria of geographic and climatic origin showed for the first time in entomopathogenic fungi, that the B. bassiana s. l. can be subdivided into seven clusters with common climate characteristics. Conclusions This study indicates that mt genomes and in particular intergenic regions provide molecular phylogeny tools that combined with criteria of geographic and

  1. Phylogenetic relationships of Russian far eastern flatfish (Pleuronectiformes, Pleuronectidae) based on two mitochondrial gene sequences, Co-1 and Cyt-b, with inferences in order phylogeny using complete mitogenome data.

    PubMed

    Kartavtsev, Yuri Phedorovich; Sharina, Svetlana N; Saitoh, Kenji; Imoto, Junichi M; Hanzawa, Naoto; Redin, Alexander D

    2016-01-01

    The systematics and phylogeny of flatfish is investigated on the complete sequence of nucleotides at subunit 1 cytochrome c oxidase (Co-1) and cytochrome b (Cyt-b) genes. In total 17 species from our collection and some species from GenBank were analyzed. Four types of trees were built: Bayesian (BA), maximum likelihood (ML), maximum parsimony (MP), and neighbor joining (NJ). These trees showed similar topology. Two separate clusters on the trees support subfamily Hippoglossoidinae and Hippoglossinae subdivision and monophyletic status of these taxa. The subfamily Pleuronectinae also can be considered monophyletic, if the tribe Microstomini is excluded from it and genus Lepidopsetta is moved into the tribe Pleuronectini. Mitogenomes represented by 25 complete sequences from NCBI GenBank were analyzed. After alignment two sets of nucleotide sequences were formed and investigated independently. One set included 13 structural genes (14,886 bp), the second set comprised by the mtDNA without ND6 gene (10,457 bp). Both data sets give congruent phylogenetic signal that agreed with conventional views on the taxonomy of the order Pleuronectiformes; however, the first set gives better topology. In BA gene tree there are two well supported nodes which include the representatives of suborders Pleuronectoidei and Psettoidei. Within Pleuronectoidei two superfamilies, Pleuronectoidea and Soleidea are highly supported in BA but in all four kinds of gene trees (BA, ML, MP and NJ) the only superfamily Pleuronectoidea is well supported. At the top of hierarchy, all flatfishes belonging to the order Pleuronectiformes forming also a monophyletic clade in our data, with support level of 100% but in BA tree only. The monophyly of the family Pleuronectidae is well supported both by single gene data and by complete mtDNA sequences.

  2. Evolution of prokaryote and eukaryote lines inferred from sequence evidence

    NASA Technical Reports Server (NTRS)

    Hunt, L. T.; George, D. G.; Yeh, L.-S.; Dayhoff, M. O.

    1984-01-01

    This paper describes the evolution of prokaryotes and early eukaryotes, including their symbiotic relationships, as inferred from phylogenetic trees of bacterial ferredoxin, 5S ribosomal RNA, ribulose-1,5-biphosphate carboxylase large chain, and mitochondrial cytochrome oxidase polypeptide II.

  3. Ultrafast approximation for phylogenetic bootstrap.

    PubMed

    Minh, Bui Quang; Nguyen, Minh Anh Thi; von Haeseler, Arndt

    2013-05-01

    Nonparametric bootstrap has been a widely used tool in phylogenetic analysis to assess the clade support of phylogenetic trees. However, with the rapidly growing amount of data, this task remains a computational bottleneck. Recently, approximation methods such as the RAxML rapid bootstrap (RBS) and the Shimodaira-Hasegawa-like approximate likelihood ratio test have been introduced to speed up the bootstrap. Here, we suggest an ultrafast bootstrap approximation approach (UFBoot) to compute the support of phylogenetic groups in maximum likelihood (ML) based trees. To achieve this, we combine the resampling estimated log-likelihood method with a simple but effective collection scheme of candidate trees. We also propose a stopping rule that assesses the convergence of branch support values to automatically determine when to stop collecting candidate trees. UFBoot achieves a median speed up of 3.1 (range: 0.66-33.3) to 10.2 (range: 1.32-41.4) compared with RAxML RBS for real DNA and amino acid alignments, respectively. Moreover, our extensive simulations show that UFBoot is robust against moderate model violations and the support values obtained appear to be relatively unbiased compared with the conservative standard bootstrap. This provides a more direct interpretation of the bootstrap support. We offer an efficient and easy-to-use software (available at http://www.cibiv.at/software/iqtree) to perform the UFBoot analysis with ML tree inference.

  4. Phylogenetic analysis of adenovirus sequences.

    PubMed

    Harrach, Balázs; Benko, Mária

    2007-01-01

    Members of the family Adenoviridae have been isolated from a large variety of hosts, including representatives from every major vertebrate class from fish to mammals. The high prevalence, together with the fairly conserved organization of the central part of their genomes, make the adenoviruses one of (if not the) best models for studying viral evolution on a larger time scale. Phylogenetic calculation can infer the evolutionary distance among adenovirus strains on serotype, species, and genus levels, thus helping the establishment of a correct taxonomy on the one hand, and speeding up the process of typing new isolates on the other. Initially, four major lineages corresponding to four genera were recognized. Later, the demarcation criteria of lower taxon levels, such as species or types, could also be defined with phylogenetic calculations. A limited number of possible host switches have been hypothesized and convincingly supported. Application of the web-based BLAST and MultAlin programs and the freely available PHYLIP package, along with the TreeView program, enables everyone to make correct calculations. In addition to step-by-step instruction on how to perform phylogenetic analysis, critical points where typical mistakes or misinterpretation of the results might occur will be identified and hints for their avoidance will be provided. PMID:17656792

  5. Phylogenetic analysis of adenovirus sequences.

    PubMed

    Harrach, Balázs; Benko, Mária

    2007-01-01

    Members of the family Adenoviridae have been isolated from a large variety of hosts, including representatives from every major vertebrate class from fish to mammals. The high prevalence, together with the fairly conserved organization of the central part of their genomes, make the adenoviruses one of (if not the) best models for studying viral evolution on a larger time scale. Phylogenetic calculation can infer the evolutionary distance among adenovirus strains on serotype, species, and genus levels, thus helping the establishment of a correct taxonomy on the one hand, and speeding up the process of typing new isolates on the other. Initially, four major lineages corresponding to four genera were recognized. Later, the demarcation criteria of lower taxon levels, such as species or types, could also be defined with phylogenetic calculations. A limited number of possible host switches have been hypothesized and convincingly supported. Application of the web-based BLAST and MultAlin programs and the freely available PHYLIP package, along with the TreeView program, enables everyone to make correct calculations. In addition to step-by-step instruction on how to perform phylogenetic analysis, critical points where typical mistakes or misinterpretation of the results might occur will be identified and hints for their avoidance will be provided.

  6. Relaxed Phylogenetics and Dating with Confidence

    PubMed Central

    Ho, Simon Y. W; Phillips, Matthew J

    2006-01-01

    In phylogenetics, the unrooted model of phylogeny and the strict molecular clock model are two extremes of a continuum. Despite their dominance in phylogenetic inference, it is evident that both are biologically unrealistic and that the real evolutionary process lies between these two extremes. Fortunately, intermediate models employing relaxed molecular clocks have been described. These models open the gate to a new field of “relaxed phylogenetics.” Here we introduce a new approach to performing relaxed phylogenetic analysis. We describe how it can be used to estimate phylogenies and divergence times in the face of uncertainty in evolutionary rates and calibration times. Our approach also provides a means for measuring the clocklikeness of datasets and comparing this measure between different genes and phylogenies. We find no significant rate autocorrelation among branches in three large datasets, suggesting that autocorrelated models are not necessarily suitable for these data. In addition, we place these datasets on the continuum of clocklikeness between a strict molecular clock and the alternative unrooted extreme. Finally, we present analyses of 102 bacterial, 106 yeast, 61 plant, 99 metazoan, and 500 primate alignments. From these we conclude that our method is phylogenetically more accurate and precise than the traditional unrooted model while adding the ability to infer a timescale to evolution. PMID:16683862

  7. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  8. Genomic repeat abundances contain phylogenetic signal.

    PubMed

    Dodsworth, Steven; Chase, Mark W; Kelly, Laura J; Leitch, Ilia J; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution.

  9. Phylogenetic analysis with the iPlant discovery environment.

    PubMed

    Matasci, Naim; McKay, Sheldon

    2013-06-01

    The iPlant Collaborative's Discovery Environment is a unified Web portal to many bioinformatics applications and analytical workflows, including various methods of phylogenetic analysis. This unit describes example protocols for phylogenetic analyses, starting at sequence retrieval from the GenBank sequence database, through to multiple sequence alignment inference and visualization of phylogenetic trees. Methods for extracting smaller sub-trees from very large phylogenies, and the comparative method of continuous ancestral character state reconstruction based on observed morphology of extant species related to their phylogenetic relationships, are also presented.

  10. Inferring horizontal gene transfer.

    PubMed

    Ravenhall, Matt; Škunca, Nives; Lassalle, Florent; Dessimoz, Christophe

    2015-05-01

    Horizontal or Lateral Gene Transfer (HGT or LGT) is the transmission of portions of genomic DNA between organisms through a process decoupled from vertical inheritance. In the presence of HGT events, different fragments of the genome are the result of different evolutionary histories. This can therefore complicate the investigations of evolutionary relatedness of lineages and species. Also, as HGT can bring into genomes radically different genotypes from distant lineages, or even new genes bearing new functions, it is a major source of phenotypic innovation and a mechanism of niche adaptation. For example, of particular relevance to human health is the lateral transfer of antibiotic resistance and pathogenicity determinants, leading to the emergence of pathogenic lineages. Computational identification of HGT events relies upon the investigation of sequence composition or evolutionary history of genes. Sequence composition-based ("parametric") methods search for deviations from the genomic average, whereas evolutionary history-based ("phylogenetic") approaches identify genes whose evolutionary history significantly differs from that of the host species. The evaluation and benchmarking of HGT inference methods typically rely upon simulated genomes, for which the true history is known. On real data, different methods tend to infer different HGT events, and as a result it can be difficult to ascertain all but simple and clear-cut HGT events. PMID:26020646

  11. Inferring Horizontal Gene Transfer

    PubMed Central

    Lassalle, Florent; Dessimoz, Christophe

    2015-01-01

    Horizontal or Lateral Gene Transfer (HGT or LGT) is the transmission of portions of genomic DNA between organisms through a process decoupled from vertical inheritance. In the presence of HGT events, different fragments of the genome are the result of different evolutionary histories. This can therefore complicate the investigations of evolutionary relatedness of lineages and species. Also, as HGT can bring into genomes radically different genotypes from distant lineages, or even new genes bearing new functions, it is a major source of phenotypic innovation and a mechanism of niche adaptation. For example, of particular relevance to human health is the lateral transfer of antibiotic resistance and pathogenicity determinants, leading to the emergence of pathogenic lineages [1]. Computational identification of HGT events relies upon the investigation of sequence composition or evolutionary history of genes. Sequence composition-based ("parametric") methods search for deviations from the genomic average, whereas evolutionary history-based ("phylogenetic") approaches identify genes whose evolutionary history significantly differs from that of the host species. The evaluation and benchmarking of HGT inference methods typically rely upon simulated genomes, for which the true history is known. On real data, different methods tend to infer different HGT events, and as a result it can be difficult to ascertain all but simple and clear-cut HGT events. PMID:26020646

  12. Phylogenetic mixture models for proteins.

    PubMed

    Le, Si Quang; Lartillot, Nicolas; Gascuel, Olivier

    2008-12-27

    Standard protein substitution models use a single amino acid replacement rate matrix that summarizes the biological, chemical and physical properties of amino acids. However, site evolution is highly heterogeneous and depends on many factors: genetic code; solvent exposure; secondary and tertiary structure; protein function; etc. These impact the substitution pattern and, in most cases, a single replacement matrix is not enough to represent all the complexity of the evolutionary processes. This paper explores in maximum-likelihood framework phylogenetic mixture models that combine several amino acid replacement matrices to better fit protein evolution.We learn these mixture models from a large alignment database extracted from HSSP, and test the performance using independent alignments from TREEBASE.We compare unsupervised learning approaches, where the site categories are unknown, to supervised ones, where in estimations we use the known category of each site, based on its exposure or its secondary structure. All our models are combined with gamma-distributed rates across sites. Results show that highly significant likelihood gains are obtained when using mixture models compared with the best available single replacement matrices. Mixtures of matrices also improve over mixtures of profiles in the manner of the CAT model. The unsupervised approach tends to be better than the supervised one, but it appears difficult to implement and highly sensitive to the starting values of the parameters, meaning that the supervised approach is still of interest for initialization and model comparison. Using an unsupervised model involving three matrices, the average AIC gain per site with TREEBASE test alignments is 0.31, 0.49 and 0.61 compared with LG (named after Le & Gascuel 2008 Mol. Biol. Evol. 25, 1307-1320), WAG and JTT, respectively. This three-matrix model is significantly better than LG for 34 alignments (among 57), and significantly worse for 1 alignment only. Moreover

  13. Many-core algorithms for statistical phylogenetics

    PubMed Central

    Suchard, Marc A.; Rambaut, Andrew

    2009-01-01

    Motivation: Statistical phylogenetics is computationally intensive, resulting in considerable attention meted on techniques for parallelization. Codon-based models allow for independent rates of synonymous and replacement substitutions and have the potential to more adequately model the process of protein-coding sequence evolution with a resulting increase in phylogenetic accuracy. Unfortunately, due to the high number of codon states, computational burden has largely thwarted phylogenetic reconstruction under codon models, particularly at the genomic-scale. Here, we describe novel algorithms and methods for evaluating phylogenies under arbitrary molecular evolutionary models on graphics processing units (GPUs), making use of the large number of processing cores to efficiently parallelize calculations even for large state-size models. Results: We implement the approach in an existing Bayesian framework and apply the algorithms to estimating the phylogeny of 62 complete mitochondrial genomes of carnivores under a 60-state codon model. We see a near 90-fold speed increase over an optimized CPU-based computation and a >140-fold increase over the currently available implementation, making this the first practical use of codon models for phylogenetic inference over whole mitochondrial or microorganism genomes. Availability and implementation: Source code provided in BEAGLE: Broad-platform Evolutionary Analysis General Likelihood Evaluator, a cross-platform/processor library for phylogenetic likelihood computation (http://beagle-lib.googlecode.com/). We employ a BEAGLE-implementation using the Bayesian phylogenetics framework BEAST (http://beast.bio.ed.ac.uk/). Contact: msuchard@ucla.edu; a.rambaut@ed.ac.uk PMID:19369496

  14. Inferring Stabilizing Mutations from Protein Phylogenies: Application to Influenza Hemagglutinin

    PubMed Central

    Bloom, Jesse D.; Glassman, Matthew J.

    2009-01-01

    One selection pressure shaping sequence evolution is the requirement that a protein fold with sufficient stability to perform its biological functions. We present a conceptual framework that explains how this requirement causes the probability that a particular amino acid mutation is fixed during evolution to depend on its effect on protein stability. We mathematically formalize this framework to develop a Bayesian approach for inferring the stability effects of individual mutations from homologous protein sequences of known phylogeny. This approach is able to predict published experimentally measured mutational stability effects (ΔΔG values) with an accuracy that exceeds both a state-of-the-art physicochemical modeling program and the sequence-based consensus approach. As a further test, we use our phylogenetic inference approach to predict stabilizing mutations to influenza hemagglutinin. We introduce these mutations into a temperature-sensitive influenza virus with a defect in its hemagglutinin gene and experimentally demonstrate that some of the mutations allow the virus to grow at higher temperatures. Our work therefore describes a powerful new approach for predicting stabilizing mutations that can be successfully applied even to large, complex proteins such as hemagglutinin. This approach also makes a mathematical link between phylogenetics and experimentally measurable protein properties, potentially paving the way for more accurate analyses of molecular evolution. PMID:19381264

  15. Maximum Parsimony on Phylogenetic networks

    PubMed Central

    2012-01-01

    Background Phylogenetic networks are generalizations of phylogenetic trees, that are used to model evolutionary events in various contexts. Several different methods and criteria have been introduced for reconstructing phylogenetic trees. Maximum Parsimony is a character-based approach that infers a phylogenetic tree by minimizing the total number of evolutionary steps required to explain a given set of data assigned on the leaves. Exact solutions for optimizing parsimony scores on phylogenetic trees have been introduced in the past. Results In this paper, we define the parsimony score on networks as the sum of the substitution costs along all the edges of the network; and show that certain well-known algorithms that calculate the optimum parsimony score on trees, such as Sankoff and Fitch algorithms extend naturally for networks, barring conflicting assignments at the reticulate vertices. We provide heuristics for finding the optimum parsimony scores on networks. Our algorithms can be applied for any cost matrix that may contain unequal substitution costs of transforming between different characters along different edges of the network. We analyzed this for experimental data on 10 leaves or fewer with at most 2 reticulations and found that for almost all networks, the bounds returned by the heuristics matched with the exhaustively determined optimum parsimony scores. Conclusion The parsimony score we define here does not directly reflect the cost of the best tree in the network that displays the evolution of the character. However, when searching for the most parsimonious network that describes a collection of characters, it becomes necessary to add additional cost considerations to prefer simpler structures, such as trees over networks. The parsimony score on a network that we describe here takes into account the substitution costs along the additional edges incident on each reticulate vertex, in addition to the substitution costs along the other edges which are

  16. Phylogenetic approaches for studying diversification.

    PubMed

    Morlon, Hélène

    2014-04-01

    Estimating rates of speciation and extinction, and understanding how and why they vary over evolutionary time, geographical space and species groups, is a key to understanding how ecological and evolutionary processes generate biological diversity. Such inferences will increasingly benefit from phylogenetic approaches given the ever-accelerating rates of genetic sequencing. In the last few years, models designed to understand diversification from phylogenetic data have advanced significantly. Here, I review these approaches and what they have revealed about diversification in the natural world. I focus on key distinctions between different models, and I clarify the conclusions that can be drawn from each model. I identify promising areas for future research. A major challenge ahead is to develop models that more explicitly take into account ecology, in particular the interaction of species with each other and with their environment. This will not only improve our understanding of diversification; it will also present a new perspective to the use of phylogenies in community ecology, the science of interaction networks and conservation biology, and might shift the current focus in ecology on equilibrium biodiversity theories to non-equilibrium theories recognising the crucial role of history.

  17. Inferring Bounded Evolution in Phenotypic Characters from Phylogenetic Comparative Data.

    PubMed

    Boucher, Florian C; Démery, Vincent

    2016-07-01

    Our understanding of phenotypic evolution over macroevolutionary timescales largely relies on the use of stochastic models for the evolution of continuous traits over phylogenies. The two most widely used models, Brownian motion and the Ornstein-Uhlenbeck (OU) process, differ in that the latter includes constraints on the variance that a trait can attain in a clade. The OU model explicitly models adaptive evolution toward a trait optimum and has thus been widely used to demonstrate the existence of stabilizing selection on a trait. Here we introduce a new model for the evolution of continuous characters on phylogenies: Brownian motion between two reflective bounds, or Bounded Brownian Motion (BBM). This process also models evolutionary constraints, but of a very different kind. We provide analytical expressions for the likelihood of BBM and present a method to calculate the likelihood numerically, as well as the associated R code. Numerical simulations show that BBM achieves good performance: parameter estimation is generally accurate but more importantly BBM can be very easily discriminated from both BM and OU. We then analyze climatic niche evolution in diprotodonts and find that BBM best fits this empirical data set, suggesting that the climatic niches of diprotodonts are bounded by the climate available in Australia and the neighboring islands but probably evolved with little additional constraints. We conclude that BBM is a valuable addition to the macroevolutionary toolbox, which should enable researchers to elucidate whether the phenotypic traits they study are evolving under hard constraints between bounds. PMID:26865274

  18. Inverse Ising inference with correlated samples

    NASA Astrophysics Data System (ADS)

    Obermayer, Benedikt; Levine, Erel

    2014-12-01

    Correlations between two variables of a high-dimensional system can be indicative of an underlying interaction, but can also result from indirect effects. Inverse Ising inference is a method to distinguish one from the other. Essentially, the parameters of the least constrained statistical model are learned from the observed correlations such that direct interactions can be separated from indirect correlations. Among many other applications, this approach has been helpful for protein structure prediction, because residues which interact in the 3D structure often show correlated substitutions in a multiple sequence alignment. In this context, samples used for inference are not independent but share an evolutionary history on a phylogenetic tree. Here, we discuss the effects of correlations between samples on global inference. Such correlations could arise due to phylogeny but also via other slow dynamical processes. We present a simple analytical model to address the resulting inference biases, and develop an exact method accounting for background correlations in alignment data by combining phylogenetic modeling with an adaptive cluster expansion algorithm. We find that popular reweighting schemes are only marginally effective at removing phylogenetic bias, suggest a rescaling strategy that yields better results, and provide evidence that our conclusions carry over to the frequently used mean-field approach to the inverse Ising problem.

  19. Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies.

    PubMed

    Leaché, Adam D; Banbury, Barbara L; Felsenstein, Joseph; de Oca, Adrián Nieto-Montes; Stamatakis, Alexandros

    2015-11-01

    Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the

  20. Phylogenetically Distant Viruses Use the Same BH3-Only Protein Puma to Trigger Bax/Bak-Dependent Apoptosis of Infected Mouse and Human Cells.

    PubMed

    Papaianni, Emanuela; El Maadidi, Souhayla; Schejtman, Andrea; Neumann, Simon; Maurer, Ulrich; Marino-Merlo, Francesca; Mastino, Antonio; Borner, Christoph

    2015-01-01

    Viruses can trigger apoptosis of infected host cells if not counteracted by cellular or viral anti-apoptotic proteins. These protective proteins either inhibit the activation of caspases or they act as Bcl-2 homologs to prevent Bax/Bak-mediated outer mitochondrial membrane permeabilization (MOMP). The exact mechanism by which viruses trigger MOMP has however remained enigmatic. Here we use two distinct types of viruses, a double stranded DNA virus, herpes simplex virus-1 (HSV-1) and a positive sense, single stranded RNA virus, Semliki Forest virus (SFV) to show that the BH3-only protein Puma is the major mediator of virus-induced Bax/Bak activation and MOMP induction. Indeed, when Puma was genetically deleted or downregulated by shRNA, mouse embryonic fibroblasts and IL-3-dependent monocytes as well as human colon carcinoma cells were as resistant to virus-induced apoptosis as their Bax/Bak double deficient counterparts (Bax/Bak-/-). Puma protein expression started to augment after 2 h postinfection with both viruses. Puma mRNA levels increased as well, but this occurred after apoptosis initiation (MOMP) because it was blocked in cells lacking Bax/Bak or overexpressing Bcl-xL. Moreover, none of the classical Puma transcription factors such as p53, p73 or p65 NFκB were involved in HSV-1-induced apoptosis. Our data suggest that viruses use a Puma protein-dependent mechanism to trigger MOMP and apoptosis in host cells.

  1. Phylogenetically Distant Viruses Use the Same BH3-Only Protein Puma to Trigger Bax/Bak-Dependent Apoptosis of Infected Mouse and Human Cells

    PubMed Central

    Papaianni, Emanuela; El Maadidi, Souhayla; Schejtman, Andrea; Neumann, Simon; Maurer, Ulrich; Marino-Merlo, Francesca; Mastino, Antonio; Borner, Christoph

    2015-01-01

    Viruses can trigger apoptosis of infected host cells if not counteracted by cellular or viral anti-apoptotic proteins. These protective proteins either inhibit the activation of caspases or they act as Bcl-2 homologs to prevent Bax/Bak-mediated outer mitochondrial membrane permeabilization (MOMP). The exact mechanism by which viruses trigger MOMP has however remained enigmatic. Here we use two distinct types of viruses, a double stranded DNA virus, herpes simplex virus-1 (HSV-1) and a positive sense, single stranded RNA virus, Semliki Forest virus (SFV) to show that the BH3-only protein Puma is the major mediator of virus-induced Bax/Bak activation and MOMP induction. Indeed, when Puma was genetically deleted or downregulated by shRNA, mouse embryonic fibroblasts and IL-3-dependent monocytes as well as human colon carcinoma cells were as resistant to virus-induced apoptosis as their Bax/Bak double deficient counterparts (Bax/Bak-/-). Puma protein expression started to augment after 2 h postinfection with both viruses. Puma mRNA levels increased as well, but this occurred after apoptosis initiation (MOMP) because it was blocked in cells lacking Bax/Bak or overexpressing Bcl-xL. Moreover, none of the classical Puma transcription factors such as p53, p73 or p65 NFκB were involved in HSV-1-induced apoptosis. Our data suggest that viruses use a Puma protein-dependent mechanism to trigger MOMP and apoptosis in host cells. PMID:26030884

  2. LSHPlace: fast phylogenetic placement using locality-sensitive hashing.

    PubMed

    Brown, Daniel G; Truszkowski, Jakub

    2013-01-01

    We consider the problem of phylogenetic placement, in which large numbers of sequences (often next-generation sequencing reads) are placed onto an existing phylogenetic tree. We adapt our recent work on phylogenetic tree inference, which uses ancestral sequence reconstruction and locality-sensitive hashing, to this domain. With these ideas, new sequences can be placed onto trees with high fidelity in strikingly fast runtimes. Our results are two orders of magnitude faster than existing programs for this domain, and show a modest accuracy tradeoff. Our results offer the possibility of analyzing many more reads in a next-generation sequencing project than is currently possible.

  3. Disentangling the phylogenetic and ecological components of spider phenotypic variation.

    PubMed

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure.

  4. Why the phylogenetic species concept?-Elementary.

    PubMed

    Wheeler, Q D

    1999-06-01

    Although species play a number of unique and necessary roles in biology, none are more important than as the elements of phylogeny, nomenclature, and biodiversity study. Species are not divisible into any smaller units among which shared derived characters can be recognized with fidelity. Biodiversity inventory, assessment, and conservation are dependent upon a uniformly applicable species concept. Species are the fundamental units in formal Linnaean classification and zoological nomenclature. The Biological Species Concept, long given nominal support by most zoologists, forced an essentialy taxonomic problem (what are species?) into a population genetics framework (why are there species?). Early efforts at a phylogenetic species concept focused on correcting problems in the Biological Species Concept associated with ancestral populations, then applying phylogenetic logic to species themselves. Subsequently, Eldredge and Cracraft, and Nelson and Platnick, each proposed essentially identical and truly phylogenetic species concepts that permitted the rigorous recognition of species prior to and for the purposes of phylogenetic analysis, yet maintained the integrity of the Phylogenetic Species Concept outside of cladistic analysis. Such phylogenetic elements have many benefits, including giving to biology a unit species concept applicable across all kinds of living things including sexual and asexual forms. This is possible because the Phylogenetic Species Concept is based on patterns of character distributions and is therefore consistent with the full range of possible evolutionary processes that contribute to species formation, including both biotic and abiotic (even random) factors. PMID:19270883

  5. Phylogenetic effective sample size.

    PubMed

    Bartoszek, Krzysztof

    2016-10-21

    In this paper I address the question-how large is a phylogenetic sample? I propose a definition of a phylogenetic effective sample size for Brownian motion and Ornstein-Uhlenbeck processes-the regression effective sample size. I discuss how mutual information can be used to define an effective sample size in the non-normal process case and compare these two definitions to an already present concept of effective sample size (the mean effective sample size). Through a simulation study I find that the AICc is robust if one corrects for the number of species or effective number of species. Lastly I discuss how the concept of the phylogenetic effective sample size can be useful for biodiversity quantification, identification of interesting clades and deciding on the importance of phylogenetic correlations. PMID:27343033

  6. Phylogenetic lineages in Entomophthoromycota

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Entomophthoromycota Humber is one of five major phylogenetic lineages among the former phylum Zygomycota. These early terrestrial fungi share evolutionarily ancestral characters such as coenocytic mycelium and gametangiogamy as a sexual process resulting in zygospore formation. Previous molecular st...

  7. Maximum likelihood inference of reticulate evolutionary histories.

    PubMed

    Yu, Yun; Dong, Jianrong; Liu, Kevin J; Nakhleh, Luay

    2014-11-18

    Hybridization plays an important role in the evolution of certain groups of organisms, adaptation to their environments, and diversification of their genomes. The evolutionary histories of such groups are reticulate, and methods for reconstructing them are still in their infancy and have limited applicability. We present a maximum likelihood method for inferring reticulate evolutionary histories while accounting simultaneously for incomplete lineage sorting. Additionally, we propose methods for assessing confidence in the amount of reticulation and the topology of the inferred evolutionary history. Our method obtains accurate estimates of reticulate evolutionary histories on simulated datasets. Furthermore, our method provides support for a hypothesis of a reticulate evolutionary history inferred from a set of house mouse (Mus musculus) genomes. As evidence of hybridization in eukaryotic groups accumulates, it is essential to have methods that infer reticulate evolutionary histories. The work we present here allows for such inference and provides a significant step toward putting phylogenetic networks on par with phylogenetic trees as a model of capturing evolutionary relationships. PMID:25368173

  8. Mitochondrial phylogenetics and evolution of mysticete whales.

    PubMed

    Sasaki, Takeshi; Nikaido, Masato; Hamilton, Healy; Goto, Mutsuo; Kato, Hidehiro; Kanda, Naohisa; Pastene, Luis; Cao, Ying; Fordyce, R; Hasegawa, Masami; Okada, Norihiro

    2005-02-01

    The phylogenetic relationships among baleen whales (Order: Cetacea) remain uncertain despite extensive research in cetacean molecular phylogenetics and a potential morphological sample size of over 2 million animals harvested. Questions remain regarding the number of species and the monophyly of genera, as well as higher order relationships. Here, we approach mysticete phylogeny with complete mitochondrial genome sequence analysis. We determined complete mtDNA sequences of 10 extant Mysticeti species, inferred their phylogenetic relationships, and estimated node divergence times. The mtDNA sequence analysis concurs with previous molecular studies in the ordering of the principal branches, with Balaenidae (right whales) as sister to all other mysticetes base, followed by Neobalaenidae (pygmy right whale), Eschrichtiidae (gray whale), and finally Balaenopteridae (rorquals + humpback whale). The mtDNA analysis further suggests that four lineages exist within the clade of Eschrichtiidae + Balaenopteridae, including a sister relationship between the humpback and fin whales, and a monophyletic group formed by the blue, sei, and Bryde's whales, each of which represents a newly recognized phylogenetic relationship in Mysticeti. We also estimated the divergence times of all extant mysticete species, accounting for evolutionary rate heterogeneity among lineages. When the mtDNA divergence estimates are compared with the mysticete fossil record, several lineages have molecular divergence estimates strikingly older than indicated by paleontological data. We suggest this discrepancy reflects both a large amount of ancestral polymorphism and long generation times of ancestral baleen whale populations.

  9. A taxonomic and phylogenetic re-appraisal of the genus Curvularia

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...

  10. Understanding phylogenetic incongruence: lessons from phyllostomid bats

    PubMed Central

    Dávalos, Liliana M; Cirranello, Andrea L; Geisler, Jonathan H; Simmons, Nancy B

    2012-01-01

    All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive

  11. Moment inference from tomograms

    USGS Publications Warehouse

    Day-Lewis, F. D.; Chen, Y.; Singha, K.

    2007-01-01

    Time-lapse geophysical tomography can provide valuable qualitative insights into hydrologic transport phenomena associated with aquifer dynamics, tracer experiments, and engineered remediation. Increasingly, tomograms are used to infer the spatial and/or temporal moments of solute plumes; these moments provide quantitative information about transport processes (e.g., advection, dispersion, and rate-limited mass transfer) and controlling parameters (e.g., permeability, dispersivity, and rate coefficients). The reliability of moments calculated from tomograms is, however, poorly understood because classic approaches to image appraisal (e.g., the model resolution matrix) are not directly applicable to moment inference. Here, we present a semi-analytical approach to construct a moment resolution matrix based on (1) the classic model resolution matrix and (2) image reconstruction from orthogonal moments. Numerical results for radar and electrical-resistivity imaging of solute plumes demonstrate that moment values calculated from tomograms depend strongly on plume location within the tomogram, survey geometry, regularization criteria, and measurement error. Copyright 2007 by the American Geophysical Union.

  12. Evolutionary inference via the Poisson Indel Process.

    PubMed

    Bouchard-Côté, Alexandre; Jordan, Michael I

    2013-01-22

    We address the problem of the joint statistical inference of phylogenetic trees and multiple sequence alignments from unaligned molecular sequences. This problem is generally formulated in terms of string-valued evolutionary processes along the branches of a phylogenetic tree. The classic evolutionary process, the TKF91 model [Thorne JL, Kishino H, Felsenstein J (1991) J Mol Evol 33(2):114-124] is a continuous-time Markov chain model composed of insertion, deletion, and substitution events. Unfortunately, this model gives rise to an intractable computational problem: The computation of the marginal likelihood under the TKF91 model is exponential in the number of taxa. In this work, we present a stochastic process, the Poisson Indel Process (PIP), in which the complexity of this computation is reduced to linear. The Poisson Indel Process is closely related to the TKF91 model, differing only in its treatment of insertions, but it has a global characterization as a Poisson process on the phylogeny. Standard results for Poisson processes allow key computations to be decoupled, which yields the favorable computational profile of inference under the PIP model. We present illustrative experiments in which Bayesian inference under the PIP model is compared with separate inference of phylogenies and alignments.

  13. A Universal Phylogenetic Tree.

    ERIC Educational Resources Information Center

    Offner, Susan

    2001-01-01

    Presents a universal phylogenetic tree suitable for use in high school and college-level biology classrooms. Illustrates the antiquity of life and that all life is related, even if it dates back 3.5 billion years. Reflects important evolutionary relationships and provides an exciting way to learn about the history of life. (SAH)

  14. Cnidarian phylogenetic relationships as revealed by mitogenomics

    PubMed Central

    2013-01-01

    the shared morphological characters in these groups are plesiomorphies, originated in the branch leading to Medusozoa. The expansion of mitogenomic data along with improvements in phylogenetic inference methods and use of additional nuclear markers will further enhance our understanding of the phylogenetic relationships and character evolution within Cnidaria. PMID:23302374

  15. Phylogenetic signal in molar dental shape of extant and fossil catarrhine primates.

    PubMed

    Gamarra, Beatriz; Nova Delgado, Mónica; Romero, Alejandro; Galbany, Jordi; Pérez-Pérez, Alejandro

    2016-05-01

    Morphology has been widely used for inferring the phylogenies of numerous taxonomic groups. Recent molecular studies performed on extant non-human primates, however, have cast doubt on the reliability of cranial and postcranial characters for characterizing evolutionary affinities. Because molecular evidence is often not available for fossil specimens, detecting phylogenetic signals in anatomical features is of great relevance. Here we have analyzed molar (M1 and M2) crown shape by means of geometric morphometrics in a large sample of both extant and fossil Miocene catarrhine primates to detect the phylogenetic signal in molar morphometry. Results support that molar shape carries a strong phylogenetic signal, mostly at the superfamily level but also to some extent at the family level. Dietary factors, however, appear to have less influence, especially for M2. The Miocene Pliopithecoidea, Cercopithecoidea, and Hominoidea superfamilies clearly grouped according to the expected taxonomic affinities with the extant groups, although some discrepancies were found depending on the tooth considered. Our findings suggest that although molar crown shape can be used as a reliable proxy for establishing taxonomic affinities of catarrhine fossil primates with extant groups, a significant amount of interspecific variation exists, indicative of derived adaptations at the genus or species level.

  16. Student interpretations of phylogenetic trees in an introductory biology course.

    PubMed

    Dees, Jonathan; Momsen, Jennifer L; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa relatedness on phylogenetic trees, to measure the prevalence of correct taxa-relatedness interpretations, and to determine how student reasoning and correctness change in response to instruction and over time. Counting synapomorphies and nodes between taxa were the most common forms of incorrect reasoning, which presents a pedagogical dilemma concerning labeled synapomorphies on phylogenetic trees. Students also independently generated an alternative form of correct reasoning using monophyletic groups, the use of which decreased in popularity over time. Approximately half of all students were able to correctly interpret taxa relatedness on phylogenetic trees, and many memorized correct reasoning without understanding its application. Broad initial instruction that allowed students to generate inferences on their own contributed very little to phylogenetic tree understanding, while targeted instruction on evolutionary relationships improved understanding to some extent. Phylogenetic trees, which can directly affect student understanding of evolution, appear to offer introductory biology instructors a formidable pedagogical challenge.

  17. Investigating the performance of AIC in selecting phylogenetic models.

    PubMed

    Jhwueng, Dwueng-Chwuan; Huzurbazar, Snehalata; O'Meara, Brian C; Liu, Liang

    2014-08-01

    The popular likelihood-based model selection criterion, Akaike's Information Criterion (AIC), is a breakthrough mathematical result derived from information theory. AIC is an approximation to Kullback-Leibler (KL) divergence with the derivation relying on the assumption that the likelihood function has finite second derivatives. However, for phylogenetic estimation, given that tree space is discrete with respect to tree topology, the assumption of a continuous likelihood function with finite second derivatives is violated. In this paper, we investigate the relationship between the expected log likelihood of a candidate model, and the expected KL divergence in the context of phylogenetic tree estimation. We find that given the tree topology, AIC is an unbiased estimator of the expected KL divergence. However, when the tree topology is unknown, AIC tends to underestimate the expected KL divergence for phylogenetic models. Simulation results suggest that the degree of underestimation varies across phylogenetic models so that even for large sample sizes, the bias of AIC can result in selecting a wrong model. As the choice of phylogenetic models is essential for statistical phylogenetic inference, it is important to improve the accuracy of model selection criteria in the context of phylogenetics. PMID:24867284

  18. Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

    PubMed Central

    Dees, Jonathan; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa relatedness on phylogenetic trees, to measure the prevalence of correct taxa-relatedness interpretations, and to determine how student reasoning and correctness change in response to instruction and over time. Counting synapomorphies and nodes between taxa were the most common forms of incorrect reasoning, which presents a pedagogical dilemma concerning labeled synapomorphies on phylogenetic trees. Students also independently generated an alternative form of correct reasoning using monophyletic groups, the use of which decreased in popularity over time. Approximately half of all students were able to correctly interpret taxa relatedness on phylogenetic trees, and many memorized correct reasoning without understanding its application. Broad initial instruction that allowed students to generate inferences on their own contributed very little to phylogenetic tree understanding, while targeted instruction on evolutionary relationships improved understanding to some extent. Phylogenetic trees, which can directly affect student understanding of evolution, appear to offer introductory biology instructors a formidable pedagogical challenge. PMID:25452489

  19. The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation

    PubMed Central

    Roger, Andrew J; Hug, Laura A

    2006-01-01

    Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of a multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. Furthermore, estimated dates of divergence vary hugely depending on the methods

  20. PAML 4: phylogenetic analysis by maximum likelihood.

    PubMed

    Yang, Ziheng

    2007-08-01

    PAML, currently in version 4, is a package of programs for phylogenetic analyses of DNA and protein sequences using maximum likelihood (ML). The programs may be used to compare and test phylogenetic trees, but their main strengths lie in the rich repertoire of evolutionary models implemented, which can be used to estimate parameters in models of sequence evolution and to test interesting biological hypotheses. Uses of the programs include estimation of synonymous and nonsynonymous rates (d(N) and d(S)) between two protein-coding DNA sequences, inference of positive Darwinian selection through phylogenetic comparison of protein-coding genes, reconstruction of ancestral genes and proteins for molecular restoration studies of extinct life forms, combined analysis of heterogeneous data sets from multiple gene loci, and estimation of species divergence times incorporating uncertainties in fossil calibrations. This note discusses some of the major applications of the package, which includes example data sets to demonstrate their use. The package is written in ANSI C, and runs under Windows, Mac OSX, and UNIX systems. It is available at -- (http://abacus.gene.ucl.ac.uk/software/paml.html).

  1. A phylogenetic approach to disentangling the role of competition and habitat filtering in community assembly of Neotropical forest birds.

    PubMed

    Gómez, Juan Pablo; Bravo, Gustavo A; Brumfield, Robb T; Tello, José G; Cadena, Carlos Daniel

    2010-11-01

    1. Methods that assess patterns of phylogenetic relatedness, as well as character distribution and evolution, allow one to infer the ecological processes involved in community assembly. Assuming niche conservatism, assemblages should shift from phylogenetic clustering to evenness with decreasing geographic scale because the relative importance of mechanisms that shape assemblages is hypothesized to be scale-dependent. Whereas habitat filtering is more likely to act at regional scales because of increased habitat heterogeneity that allows sorting of ecologically similar species in contrasting environments, competition is more likely to act at local scales because low habitat heterogeneity provides few opportunities for niche partitioning. 2. We used species lists to assess assemblage composition, data on ecologically-relevant traits, and a molecular phylogeny, to examine the phylogenetic structure of antbird (Thamnophilidae) assemblages at three different geographical scales: regional (ecoregions), intermediate (100-ha plots) and local (mixed-flocks). In addition, we used patterns of phylogenetic beta diversity and beta diversity to separate the factors that structure antbird assemblages at regional scales. 3. Contrary to previous findings, we found a shift from phylogenetic evenness to clustering with decreasing geographical scale. We argue that this does not reject the hypothesis that habitat filtering is the predominant force in regional community assembly, because analyses of trait evolution and structure indicated a lack of niche conservatism in antbirds. 4. In some cases, phylogenetic evenness at regional scales can be an effect of historical biogeographic processes instead of niche-based processes. However, regional patterns of beta diversity and phylogenetic beta diversity suggested that phylogenetic structure in our study cannot be explained by the history of speciation and dispersal of antbirds, further supporting the habitat-filtering hypothesis. 5. Our

  2. Quantifying Age-dependent Extinction from Species Phylogenies.

    PubMed

    Alexander, Helen K; Lambert, Amaury; Stadler, Tanja

    2016-01-01

    Several ecological factors that could play into species extinction are expected to correlate with species age, i.e., time elapsed since the species arose by speciation. To date, however, statistical tools to incorporate species age into likelihood-based phylogenetic inference have been lacking. We present here a computational framework to quantify age-dependent extinction through maximum likelihood parameter estimation based on phylogenetic trees, assuming species lifetimes are gamma distributed. Testing on simulated trees shows that neglecting age dependence can lead to biased estimates of key macroevolutionary parameters. We then apply this method to two real data sets, namely a complete phylogeny of birds (class Aves) and a clade of self-compatible and -incompatible nightshades (Solanaceae), gaining initial insights into the extent to which age-dependent extinction may help explain macroevolutionary patterns. Our methods have been added to the R package TreePar. PMID:26405218

  3. Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent

    PubMed Central

    Wen, Dingqiao; Yu, Yun; Nakhleh, Luay

    2016-01-01

    The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation. PMID:27144273

  4. Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent.

    PubMed

    Wen, Dingqiao; Yu, Yun; Nakhleh, Luay

    2016-05-01

    The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation. PMID:27144273

  5. Disentangling the Phylogenetic and Ecological Components of Spider Phenotypic Variation

    PubMed Central

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure. PMID:24651264

  6. Disentangling the phylogenetic and ecological components of spider phenotypic variation.

    PubMed

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure. PMID:24651264

  7. Bayesian phylogenetic estimation of fossil ages

    PubMed Central

    Drummond, Alexei J.; Stadler, Tanja

    2016-01-01

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth–death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the ‘morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue ‘Dating species divergences

  8. Bayesian phylogenetic estimation of fossil ages.

    PubMed

    Drummond, Alexei J; Stadler, Tanja

    2016-07-19

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses.This article is part of the themed issue 'Dating species divergences using

  9. Bayesian phylogenetic estimation of fossil ages.

    PubMed

    Drummond, Alexei J; Stadler, Tanja

    2016-07-19

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses.This article is part of the themed issue 'Dating species divergences using

  10. Phylogenetic placement of Trichonympha.

    PubMed

    Dacks, J B; Redfield, R J

    1998-01-01

    Flagellated protists of the Class Hypermastigida have previously been classified on morphology alone, since no molecular sequences have been available. We have isolated DNA from 350 cells of the hypermastigote Trichonympha, manually collected from the hindgut of Zootermopsis angusticollis, and used this DNA as template for polymerase chain reaction amplification of the small-subunit ribosomal RNA gene. The DNA sequence of the amplified fragment is closely related to that of a previously-unidentified gut symbiont from the termite Reticulitermes flavipes, and phylogenetic analysis places both sequences as a sister group to the known trichomonads, in agreement with the morphological classification.

  11. Phylogenetic Comparative Assembly

    NASA Astrophysics Data System (ADS)

    Husemann, Peter; Stoye, Jens

    Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several contigs remain which cannot be assembled any further. It is still costly and time consuming to close all the gaps in order to acquire the whole genomic sequence. Here we propose an algorithm that takes several related genomes and their phylogenetic relationships into account to create a contig adjacency graph. From this a layout graph can be computed which indicates putative adjacencies of the contigs in order to aid biologists in finishing the complete genomic sequence.

  12. Chloroplast Phylogenomic Inference of Green Algae Relationships

    PubMed Central

    Sun, Linhua; Fang, Ling; Zhang, Zhenhua; Chang, Xin; Penny, David; Zhong, Bojian

    2016-01-01

    The green algal phylum Chlorophyta has six diverse classes, but the phylogenetic relationship of the classes within Chlorophyta remains uncertain. In order to better understand the ancient Chlorophyta evolution, we have applied a site pattern sorting method to study compositional heterogeneity and the model fit in the green algal chloroplast genomic data. We show that the fastest-evolving sites are significantly correlated with among-site compositional heterogeneity, and these sites have a much poorer fit to the evolutionary model. Our phylogenomic analyses suggest that the class Chlorophyceae is a monophyletic group, and the classes Ulvophyceae, Trebouxiophyceae and Prasinophyceae are non-monophyletic groups. Our proposed phylogenetic tree of Chlorophyta will offer new insights to investigate ancient green algae evolution, and our analytical framework will provide a useful approach for evaluating and mitigating the potential errors of phylogenomic inferences. PMID:26846729

  13. Phylogenetic conservatism of functional traits in microorganisms.

    PubMed

    Martiny, Adam C; Treseder, Kathleen; Pusch, Gordon

    2013-04-01

    A central question in biology is how biodiversity influences ecosystem functioning. Underlying this is the relationship between organismal phylogeny and the presence of specific functional traits. The relationship is complicated by gene loss and convergent evolution, resulting in the polyphyletic distribution of many traits. In microorganisms, lateral gene transfer can further distort the linkage between phylogeny and the presence of specific functional traits. To identify the phylogenetic conservation of specific traits in microorganisms, we developed a new phylogenetic metric-consenTRAIT-to estimate the clade depth where organisms share a trait. We then analyzed the distribution of 89 functional traits across a broad range of Bacteria and Archaea using genotypic and phenotypic data. A total of 93% of the traits were significantly non-randomly distributed, which suggested that vertical inheritance was generally important for the phylogenetic dispersion of functional traits in microorganisms. Further, traits in microbes were associated with a continuum of trait depths (τD), ranging from a few deep to many shallow clades (average τD: 0.101-0.0011 rRNA sequence dissimilarity). Next, we demonstrated that the dispersion and the depth of clades that contain a trait is correlated with the trait's complexity. Specifically, complex traits encoded by many genes like photosynthesis and methanogenesis were found in a few deep clusters, whereas the ability to use simple carbon substrates was highly phylogenetically dispersed. On the basis of these results, we propose a framework for predicting the phylogenetic conservatism of functional traits depending on the complexity of the trait. This framework enables predicting how variation in microbial composition may affect microbially-mediated ecosystem processes as well as linking phylogenetic and trait-based patterns of biogeography.

  14. Entanglement, Invariants, and Phylogenetics

    NASA Astrophysics Data System (ADS)

    Sumner, J. G.

    2007-10-01

    This thesis develops and expands upon known techniques of mathematical physics relevant to the analysis of the popular Markov model of phylogenetic trees required in biology to reconstruct the evolutionary relationships of taxonomic units from biomolecular sequence data. The techniques of mathematical physics are plethora and have been developed for some time. The Markov model of phylogenetics and its analysis is a relatively new technique where most progress to date has been achieved by using discrete mathematics. This thesis takes a group theoretical approach to the problem by beginning with a remarkable mathematical parallel to the process of scattering in particle physics. This is shown to equate to branching events in the evolutionary history of molecular units. The major technical result of this thesis is the derivation of existence proofs and computational techniques for calculating polynomial group invariant functions on a multi-linear space where the group action is that relevant to a Markovian time evolution. The practical results of this thesis are an extended analysis of the use of invariant functions in distance based methods and the presentation of a new reconstruction technique for quartet trees which is consistent with the most general Markov model of sequence evolution.

  15. Phylogenetic measures of biodiversity and neo- and paleo-endemism in Australian Acacia.

    PubMed

    Mishler, Brent D; Knerr, Nunzio; González-Orozco, Carlos E; Thornhill, Andrew H; Laffan, Shawn W; Miller, Joseph T

    2014-01-01

    Understanding spatial patterns of biodiversity is critical for conservation planning, particularly given rapid habitat loss and human-induced climatic change. Diversity and endemism are typically assessed by comparing species ranges across regions. However, investigation of patterns of species diversity alone misses out on the full richness of patterns that can be inferred using a phylogenetic approach. Here, using Australian Acacia as an example, we show that the application of phylogenetic methods, particularly two new measures, relative phylogenetic diversity and relative phylogenetic endemism, greatly enhances our knowledge of biodiversity across both space and time. We found that areas of high species richness and species endemism are not necessarily areas of high phylogenetic diversity or phylogenetic endemism. We propose a new method called categorical analysis of neo- and paleo-endemism (CANAPE) that allows, for the first time, a clear, quantitative distinction between centres of neo- and paleo-endemism, useful to the conservation decision-making process.

  16. Reconstructible Phylogenetic Networks: Do Not Distinguish the Indistinguishable

    PubMed Central

    Pardi, Fabio; Scornavacca, Celine

    2015-01-01

    Phylogenetic networks represent the evolution of organisms that have undergone reticulate events, such as recombination, hybrid speciation or lateral gene transfer. An important way to interpret a phylogenetic network is in terms of the trees it displays, which represent all the possible histories of the characters carried by the organisms in the network. Interestingly, however, different networks may display exactly the same set of trees, an observation that poses a problem for network reconstruction: from the perspective of many inference methods such networks are indistinguishable. This is true for all methods that evaluate a phylogenetic network solely on the basis of how well the displayed trees fit the available data, including all methods based on input data consisting of clades, triples, quartets, or trees with any number of taxa, and also sequence-based approaches such as popular formalisations of maximum parsimony and maximum likelihood for networks. This identifiability problem is partially solved by accounting for branch lengths, although this merely reduces the frequency of the problem. Here we propose that network inference methods should only attempt to reconstruct what they can uniquely identify. To this end, we introduce a novel definition of what constitutes a uniquely reconstructible network. For any given set of indistinguishable networks, we define a canonical network that, under mild assumptions, is unique and thus representative of the entire set. Given data that underwent reticulate evolution, only the canonical form of the underlying phylogenetic network can be uniquely reconstructed. While on the methodological side this will imply a drastic reduction of the solution space in network inference, for the study of reticulate evolution this is a fundamental limitation that will require an important change of perspective when interpreting phylogenetic networks. PMID:25849429

  17. Reconstructible phylogenetic networks: do not distinguish the indistinguishable.

    PubMed

    Pardi, Fabio; Scornavacca, Celine

    2015-04-01

    Phylogenetic networks represent the evolution of organisms that have undergone reticulate events, such as recombination, hybrid speciation or lateral gene transfer. An important way to interpret a phylogenetic network is in terms of the trees it displays, which represent all the possible histories of the characters carried by the organisms in the network. Interestingly, however, different networks may display exactly the same set of trees, an observation that poses a problem for network reconstruction: from the perspective of many inference methods such networks are "indistinguishable". This is true for all methods that evaluate a phylogenetic network solely on the basis of how well the displayed trees fit the available data, including all methods based on input data consisting of clades, triples, quartets, or trees with any number of taxa, and also sequence-based approaches such as popular formalisations of maximum parsimony and maximum likelihood for networks. This identifiability problem is partially solved by accounting for branch lengths, although this merely reduces the frequency of the problem. Here we propose that network inference methods should only attempt to reconstruct what they can uniquely identify. To this end, we introduce a novel definition of what constitutes a uniquely reconstructible network. For any given set of indistinguishable networks, we define a canonical network that, under mild assumptions, is unique and thus representative of the entire set. Given data that underwent reticulate evolution, only the canonical form of the underlying phylogenetic network can be uniquely reconstructed. While on the methodological side this will imply a drastic reduction of the solution space in network inference, for the study of reticulate evolution this is a fundamental limitation that will require an important change of perspective when interpreting phylogenetic networks.

  18. Reconstructible phylogenetic networks: do not distinguish the indistinguishable.

    PubMed

    Pardi, Fabio; Scornavacca, Celine

    2015-04-01

    Phylogenetic networks represent the evolution of organisms that have undergone reticulate events, such as recombination, hybrid speciation or lateral gene transfer. An important way to interpret a phylogenetic network is in terms of the trees it displays, which represent all the possible histories of the characters carried by the organisms in the network. Interestingly, however, different networks may display exactly the same set of trees, an observation that poses a problem for network reconstruction: from the perspective of many inference methods such networks are "indistinguishable". This is true for all methods that evaluate a phylogenetic network solely on the basis of how well the displayed trees fit the available data, including all methods based on input data consisting of clades, triples, quartets, or trees with any number of taxa, and also sequence-based approaches such as popular formalisations of maximum parsimony and maximum likelihood for networks. This identifiability problem is partially solved by accounting for branch lengths, although this merely reduces the frequency of the problem. Here we propose that network inference methods should only attempt to reconstruct what they can uniquely identify. To this end, we introduce a novel definition of what constitutes a uniquely reconstructible network. For any given set of indistinguishable networks, we define a canonical network that, under mild assumptions, is unique and thus representative of the entire set. Given data that underwent reticulate evolution, only the canonical form of the underlying phylogenetic network can be uniquely reconstructed. While on the methodological side this will imply a drastic reduction of the solution space in network inference, for the study of reticulate evolution this is a fundamental limitation that will require an important change of perspective when interpreting phylogenetic networks. PMID:25849429

  19. A molecular phylogenetic hypothesis for the manakins (Aves: Pipridae).

    PubMed

    McKay, Bailey D; Barker, F Keith; Mays, Herman L; Doucet, Stéphanie M; Hill, Geoffrey E

    2010-05-01

    Phylogenetic relationships among the 14 manakin genera were inferred from DNA sequence data obtained from both mitochondrial and nuclear DNA loci. Phylogenetic analysis resulted in a well-supported hypothesis that corroborates a sister relationship between tyrant-manakins and the "core" manakins (Antilophia, Chiroxiphia, Corapipo, Dixiphia, Heterocercus, Ilicura, Lepidothrix, Manacus, Masius, Machaeropterus, Pipra, and Xenopipo). Our data strongly support these core manakin genera as a monophyletic group. Consistent with previous work, we find two major clades within the core manakins, although the placement of the genus Xenopipo with regards to these two clades is ambiguous. Generic relationships within these clades are generally well resolved. Although we find some concordance between our study and a previous manakin phylogeny based on syringeal characters, we note several fundamental differences between the phylogenies. Thus, we offer a new phylogenetic hypothesis for Pipridae. PMID:20188842

  20. One Tree to Link Them All: A Phylogenetic Dataset for the European Tetrapoda

    PubMed Central

    Roquet, Cristina; Lavergne, Sébastien; Thuiller, Wilfried

    2014-01-01

    Since the ever-increasing availability of phylogenetic informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies are still lacking for many large groups and regions, which are necessary for comprehensive large-scale eco-phylogenetic analyses. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of supermatrix applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available on the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data was available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty. PMID:25685620

  1. Reconstruction of phylogenetic trees of prokaryotes using maximal common intervals.

    PubMed

    Heydari, Mahdi; Marashi, Sayed-Amir; Tusserkani, Ruzbeh; Sadeghi, Mehdi

    2014-10-01

    One of the fundamental problems in bioinformatics is phylogenetic tree reconstruction, which can be used for classifying living organisms into different taxonomic clades. The classical approach to this problem is based on a marker such as 16S ribosomal RNA. Since evolutionary events like genomic rearrangements are not included in reconstructions of phylogenetic trees based on single genes, much effort has been made to find other characteristics for phylogenetic reconstruction in recent years. With the increasing availability of completely sequenced genomes, gene order can be considered as a new solution for this problem. In the present work, we applied maximal common intervals (MCIs) in two or more genomes to infer their distance and to reconstruct their evolutionary relationship. Additionally, measures based on uncommon segments (UCS's), i.e., those genomic segments which are not detected as part of any of the MCIs, are also used for phylogenetic tree reconstruction. We applied these two types of measures for reconstructing the phylogenetic tree of 63 prokaryotes with known COG (clusters of orthologous groups) families. Similarity between the MCI-based (resp. UCS-based) reconstructed phylogenetic trees and the phylogenetic tree obtained from NCBI taxonomy browser is as high as 93.1% (resp. 94.9%). We show that in the case of this diverse dataset of prokaryotes, tree reconstruction based on MCI and UCS outperforms most of the currently available methods based on gene orders, including breakpoint distance and DCJ. We additionally tested our new measures on a dataset of 13 closely-related bacteria from the genus Prochlorococcus. In this case, distances like rearrangement distance, breakpoint distance and DCJ proved to be useful, while our new measures are still appropriate for phylogenetic reconstruction.

  2. Exploration of phylogenetic data using a global sequence analysis method

    PubMed Central

    Chapus, Charles; Dufraigne, Christine; Edwards, Scott; Giron, Alain; Fertil, Bernard; Deschavanne, Patrick

    2005-01-01

    Background Molecular phylogenetic methods are based on alignments of nucleic or peptidic sequences. The tremendous increase in molecular data permits phylogenetic analyses of very long sequences and of many species, but also requires methods to help manage large datasets. Results Here we explore the phylogenetic signal present in molecular data by genomic signatures, defined as the set of frequencies of short oligonucleotides present in DNA sequences. Although violating many of the standard assumptions of traditional phylogenetic analyses – in particular explicit statements of homology inherent in character matrices – the use of the signature does permit the analysis of very long sequences, even those that are unalignable, and is therefore most useful in cases where alignment is questionable. We compare the results obtained by traditional phylogenetic methods to those inferred by the signature method for two genes: RAG1, which is easily alignable, and 18S RNA, where alignments are often ambiguous for some regions. We also apply this method to a multigene data set of 33 genes for 9 bacteria and one archea species as well as to the whole genome of a set of 16 γ-proteobacteria. In addition to delivering phylogenetic results comparable to traditional methods, the comparison of signatures for the sequences involved in the bacterial example identified putative candidates for horizontal gene transfers. Conclusion The signature method is therefore a fast tool for exploring phylogenetic data, providing not only a pretreatment for discovering new sequence relationships, but also for identifying cases of sequence evolution that could confound traditional phylogenetic analysis. PMID:16280081

  3. PANDIT: an evolution-centric database of protein and associated nucleotide domains with inferred trees.

    PubMed

    Whelan, Simon; de Bakker, Paul I W; Quevillon, Emmanuel; Rodriguez, Nicolas; Goldman, Nick

    2006-01-01

    PANDIT is a database of homologous sequence alignments accompanied by estimates of their corresponding phylogenetic trees. It provides a valuable resource to those studying phylogenetic methodology and the evolution of coding-DNA and protein sequences. Currently in version 17.0, PANDIT comprises 7738 families of homologous protein domains; for each family, DNA and corresponding amino acid sequence multiple alignments are available together with high quality phylogenetic tree estimates. Recent improvements include expanded methods for phylogenetic tree inference, assessment of alignment quality and a redesigned web interface, available at the URL http://www.ebi.ac.uk/goldman-srv/pandit.

  4. Phylogenetic analysis of ancient DNA using BEAST.

    PubMed

    Ho, Simon Y W

    2012-01-01

    Under exceptional circumstances, it is possible to obtain DNA sequences from samples that are up to hundreds of thousands of years old. These data provide an opportunity to look directly at past genetic diversity, to trace the evolutionary process through time, and to infer demographic and phylogeographic trends. Ancient DNA (aDNA) data sets have some degree of intrinsic temporal structure because the sequences have been obtained from samples of different ages. When analyzing these data sets, it is usually necessary to take the sampling times into account. A number of phylogenetic methods have been designed with this purpose in mind. Here I describe the steps involved in Bayesian phylogenetic analysis of aDNA data. I outline a procedure that can be used to co-estimate the genealogical relationships, mutation rate, evolutionary timescale, and demographic history of the study species in a single analytical framework. A number of modifications to the methodology can be made in order to deal with complicating factors such as postmortem damage, sequences from undated samples, and data sets with low information content.

  5. Phylogenetic analysis of ancient DNA using BEAST.

    PubMed

    Ho, Simon Y W

    2012-01-01

    Under exceptional circumstances, it is possible to obtain DNA sequences from samples that are up to hundreds of thousands of years old. These data provide an opportunity to look directly at past genetic diversity, to trace the evolutionary process through time, and to infer demographic and phylogeographic trends. Ancient DNA (aDNA) data sets have some degree of intrinsic temporal structure because the sequences have been obtained from samples of different ages. When analyzing these data sets, it is usually necessary to take the sampling times into account. A number of phylogenetic methods have been designed with this purpose in mind. Here I describe the steps involved in Bayesian phylogenetic analysis of aDNA data. I outline a procedure that can be used to co-estimate the genealogical relationships, mutation rate, evolutionary timescale, and demographic history of the study species in a single analytical framework. A number of modifications to the methodology can be made in order to deal with complicating factors such as postmortem damage, sequences from undated samples, and data sets with low information content. PMID:22237538

  6. A phylogenetic blueprint for a modern whale.

    PubMed

    Gatesy, John; Geisler, Jonathan H; Chang, Joseph; Buell, Carl; Berta, Annalisa; Meredith, Robert W; Springer, Mark S; McGowen, Michael R

    2013-02-01

    The emergence of Cetacea in the Paleogene represents one of the most profound macroevolutionary transitions within Mammalia. The move from a terrestrial habitat to a committed aquatic lifestyle engendered wholesale changes in anatomy, physiology, and behavior. The results of this remarkable transformation are extant whales that include the largest, biggest brained, fastest swimming, loudest, deepest diving mammals, some of which can detect prey with a sophisticated echolocation system (Odontoceti - toothed whales), and others that batch feed using racks of baleen (Mysticeti - baleen whales). A broad-scale reconstruction of the evolutionary remodeling that culminated in extant cetaceans has not yet been based on integration of genomic and paleontological information. Here, we first place Cetacea relative to extant mammalian diversity, and assess the distribution of support among molecular datasets for relationships within Artiodactyla (even-toed ungulates, including Cetacea). We then merge trees derived from three large concatenations of molecular and fossil data to yield a composite hypothesis that encompasses many critical events in the evolutionary history of Cetacea. By combining diverse evidence, we infer a phylogenetic blueprint that outlines the stepwise evolutionary development of modern whales. This hypothesis represents a starting point for more detailed, comprehensive phylogenetic reconstructions in the future, and also highlights the synergistic interaction between modern (genomic) and traditional (morphological+paleontological) approaches that ultimately must be exploited to provide a rich understanding of evolutionary history across the entire tree of Life.

  7. Many hexapod groups originated earlier and withstood extinction events better than previously realized: inferences from supertrees.

    PubMed

    Davis, Robert B; Baldauf, Sandra L; Mayhew, Peter J

    2010-05-22

    Comprising over half of all described species, the hexapods are central to understanding the evolution of global biodiversity. Direct fossil evidence suggests that new hexapod orders continued to originate from the Jurassic onwards, and diversity is presently higher than ever. Previous studies also suggest that several shifts in net diversification rate have occurred at higher taxonomic levels. However, their inferred timing is phylogeny dependent. We re-examine these issues using the supertree approach to provide, to our knowledge, the first composite estimates of hexapod order-level phylogeny. The Purvis matrix representation with parsimony method provides the most optimal supertree, but alternative methods are considered. Inferring ghost ranges shows richness of terminal lineages in the order-level phylogeny to peak just before the end-Permian extinction, rather than the present day, indicating that at least 11 more lineages survived this extinction than implied by fossils alone. The major upshift in diversification is associated with the origin of wings/wing folding and for the first time, to our knowledge, significant downshifts are shown associated with the origin of species-poor taxa (e.g. Neuropterida, Zoraptera). Polyneopteran phylogeny, especially the position of Zoraptera, remains important resolve because this influences findings regarding shifts in diversification. Our study shows how combining fossil with phylogenetic information can improve macroevolutionary inferences. PMID:20129983

  8. Inferring epidemiological dynamics with Bayesian coalescent inference: the merits of deterministic and stochastic models.

    PubMed

    Popinga, Alex; Vaughan, Tim; Stadler, Tanja; Drummond, Alexei J

    2015-02-01

    Estimation of epidemiological and population parameters from molecular sequence data has become central to the understanding of infectious disease dynamics. Various models have been proposed to infer details of the dynamics that describe epidemic progression. These include inference approaches derived from Kingman's coalescent theory. Here, we use recently described coalescent theory for epidemic dynamics to develop stochastic and deterministic coalescent susceptible-infected-removed (SIR) tree priors. We implement these in a Bayesian phylogenetic inference framework to permit joint estimation of SIR epidemic parameters and the sample genealogy. We assess the performance of the two coalescent models and also juxtapose results obtained with a recently published birth-death-sampling model for epidemic inference. Comparisons are made by analyzing sets of genealogies simulated under precisely known epidemiological parameters. Additionally, we analyze influenza A (H1N1) sequence data sampled in the Canterbury region of New Zealand and HIV-1 sequence data obtained from known United Kingdom infection clusters. We show that both coalescent SIR models are effective at estimating epidemiological parameters from data with large fundamental reproductive number [Formula: see text] and large population size [Formula: see text]. Furthermore, we find that the stochastic variant generally outperforms its deterministic counterpart in terms of error, bias, and highest posterior density coverage, particularly for smaller [Formula: see text] and [Formula: see text]. However, each of these inference models is shown to have undesirable properties in certain circumstances, especially for epidemic outbreaks with [Formula: see text] close to one or with small effective susceptible populations. PMID:25527289

  9. Quartets and unrooted phylogenetic networks.

    PubMed

    Gambette, Philippe; Berry, Vincent; Paul, Christophe

    2012-08-01

    Phylogenetic networks were introduced to describe evolution in the presence of exchanges of genetic material between coexisting species or individuals. Split networks in particular were introduced as a special kind of abstract network to visualize conflicts between phylogenetic trees which may correspond to such exchanges. More recently, methods were designed to reconstruct explicit phylogenetic networks (whose vertices can be interpreted as biological events) from triplet data. In this article, we link abstract and explicit networks through their combinatorial properties, by introducing the unrooted analog of level-k networks. In particular, we give an equivalence theorem between circular split systems and unrooted level-1 networks. We also show how to adapt to quartets some existing results on triplets, in order to reconstruct unrooted level-k phylogenetic networks. These results give an interesting perspective on the combinatorics of phylogenetic networks and also raise algorithmic and combinatorial questions.

  10. Quartets and unrooted phylogenetic networks.

    PubMed

    Gambette, Philippe; Berry, Vincent; Paul, Christophe

    2012-08-01

    Phylogenetic networks were introduced to describe evolution in the presence of exchanges of genetic material between coexisting species or individuals. Split networks in particular were introduced as a special kind of abstract network to visualize conflicts between phylogenetic trees which may correspond to such exchanges. More recently, methods were designed to reconstruct explicit phylogenetic networks (whose vertices can be interpreted as biological events) from triplet data. In this article, we link abstract and explicit networks through their combinatorial properties, by introducing the unrooted analog of level-k networks. In particular, we give an equivalence theorem between circular split systems and unrooted level-1 networks. We also show how to adapt to quartets some existing results on triplets, in order to reconstruct unrooted level-k phylogenetic networks. These results give an interesting perspective on the combinatorics of phylogenetic networks and also raise algorithmic and combinatorial questions. PMID:22809417

  11. Distribution, phylogenetic diversity and physiological characteristics of epsilon-Proteobacteria in a deep-sea hydrothermal field.

    PubMed

    Nakagawa, Satoshi; Takai, Ken; Inagaki, Fumio; Hirayama, Hisako; Nunoura, Takuro; Horikoshi, Koki; Sako, Yoshihiko

    2005-10-01

    Epsilon-Proteobacteria is increasingly recognized as an ecologically significant group of bacteria, particularly in deep-sea hydrothermal environments. In this study, we studied the spatial distribution, diversity and physiological characteristics of the epsilon-Proteobacteria in various microbial habitats in the vicinity of a deep-sea hydrothermal vent occurring in the Iheya North field in the Mid-Okinawa Trough, by using culture-dependent and -independent approaches. The habitats studied were inside and outside hydrothermal plume, and annelid polychaete tubes. In addition, we deployed colonization devices near the vent emission. The polychaete tubes harboured physiologically and phylogenetically diverse microbial community. The in situ samplers were predominantly colonized by epsilon-Proteobacteria. Energy metabolism of epsilon-Proteobacteria isolates was highly versatile. Tree topology generated from the metabolic traits was significantly different (P = 0.000) from that of 16S rRNA tree, indicating current 16S rRNA gene-based analyses do not provide sufficient information to infer the physiological characteristics of epsilon-Proteobacteria. Nevertheless, culturability of epsilon-Proteobacteria in various microbial habitats differed among the phylogenetic subgroups. Members of Sulfurimonas were characterized by the robust culturability, and the other phylogenetic subgroups appeared to lose culturability in seawater, probably because of the sensitivity to oxygen. These results provide new insight into the ecophysiological characteristics of the deep-sea hydrothermal vent epsilon-Proteobacteria, which has never been assessed by comparative analysis of the 16S rRNA genes.

  12. Neogastropod phylogenetic relationships based on entire mitochondrial genomes

    PubMed Central

    Cunha, Regina L; Grande, Cristina; Zardoya, Rafael

    2009-01-01

    Background The Neogastropoda is a highly diversified group of predatory marine snails (Gastropoda: Caenogastropoda). Traditionally, its monophyly has been widely accepted based on several morphological synapomorphies mostly related with the digestive system. However, recent molecular phylogenetic studies challenged the monophyly of Neogastropoda due to the inclusion of representatives of other caenogastropod lineages (e.g. Littorinimorpha) within the group. Neogastropoda has been classified into up to six superfamilies including Buccinoidea, Muricoidea, Olivoidea, Pseudolivoidea, Conoidea, and Cancellarioidea. Phylogenetic relationships among neogastropod superfamilies remain unresolved. Results The complete mitochondrial (mt) genomes of seven Neogastropoda (Bolinus brandaris, Cancellaria cancellata, Conus borgesi, Cymbium olla, Fusiturris similis, Nassarius reticulatus, and Terebra dimidiata) and of the tonnoidean Cymatium parthenopeum (Littorinimorpha), a putative sister group to Neogastropoda, were sequenced. In addition, the partial sequence of the mitochondrial genome of the calyptraeoidean Calyptraea chinensis (Littorinimorpha) was also determined. All sequenced neogastropod mt genomes shared a highly conserved gene order with only two instances of tRNA gene translocation. Phylogenetic relationships of Neogastropoda were inferred based on the 13 mt protein coding genes (both at the amino acid and nucleotide level) of all available caenogastropod mitochondrial genomes. Maximum likelihood (ML) and Bayesian inference (BI) phylogenetic analyses failed to recover the monophyly of Neogastropoda due to the inclusion of the tonnoidean Cymatium parthenopeum within the group. At the superfamily level, all phylogenetic analyses questioned the taxonomic validity of Muricoidea, whereas the monophyly of Conoidea was supported by most phylogenetic analyses, albeit weakly. All analyzed families were recovered as monophyletic except Turridae due to the inclusion of Terebridae

  13. Testing and quantifying phylogenetic signals and homoplasy in morphometric data.

    PubMed

    Klingenberg, Christian Peter; Gidaszewski, Nelly A

    2010-05-01

    The relationship between morphometrics and phylogenetic analysis has long been controversial. Here we propose an approach that is based on mapping morphometric traits onto phylogenies derived from other data and thus avoids the pitfalls encountered by previous studies. This method treats shape as a single, multidimensional character. We propose a test for the presence of a phylogenetic signal in morphometric data, which simulates the null hypothesis of the complete absence of phylogenetic structure by permutation of the shape data among the terminal taxa. We also propose 2 measures of the fit of morphometric data to the phylogeny that are direct extensions of the consistency index and retention index used in traditional cladistics. We apply these methods to a small study of the evolution of wing shape in the Drosophila melanogaster subgroup, for which a very strongly supported phylogeny is available. This case study reveals a significant phylogenetic signal and a relatively low degree of homoplasy. Despite the low homoplasy, the shortest tree computed from landmark data on wing shape is inconsistent with the well-supported phylogenetic tree from molecular data, underscoring that morphometric data may not provide reliable information for inferring phylogeny. PMID:20525633

  14. Towards an integrated phylogenetic classification of the Tremellomycetes

    PubMed Central

    Liu, X.-Z.; Wang, Q.-M.; Göker, M.; Groenewald, M.; Kachalkin, A.V.; Lumbsch, H.T.; Millanes, A.M.; Wedin, M.; Yurkov, A.M.; Boekhout, T.; Bai, F.-Y.

    2016-01-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained. PMID:26955199

  15. Global patterns and drivers of phylogenetic structure in island floras.

    PubMed

    Weigelt, Patrick; Kissling, W Daniel; Kisel, Yael; Fritz, Susanne A; Karger, Dirk Nikolaus; Kessler, Michael; Lehtonen, Samuli; Svenning, Jens-Christian; Kreft, Holger

    2015-01-01

    Islands are ideal for investigating processes that shape species assemblages because they are isolated and have discrete boundaries. Quantifying phylogenetic assemblage structure allows inferences about these processes, in particular dispersal, environmental filtering and in-situ speciation. Here, we link phylogenetic assemblage structure to island characteristics across 393 islands worldwide and 37,041 vascular plant species (representing angiosperms overall, palms and ferns). Physical and bioclimatic factors, especially those impeding colonization and promoting speciation, explained more variation in phylogenetic structure of angiosperms overall (49%) and palms (52%) than of ferns (18%). The relationships showed different or contrasting trends among these major plant groups, consistent with their dispersal- and speciation-related traits and climatic adaptations. Phylogenetic diversity was negatively related to isolation for palms, but unexpectedly it was positively related to isolation for angiosperms overall. This indicates strong dispersal filtering for the predominantly large-seeded, animal-dispersed palm family whereas colonization from biogeographically distinct source pools on remote islands likely drives the phylogenetic structure of angiosperm floras. We show that signatures of dispersal limitation, environmental filtering and in-situ speciation differ markedly among taxonomic groups on islands, which sheds light on the origin of insular plant diversity. PMID:26198002

  16. Global patterns and drivers of phylogenetic structure in island floras

    PubMed Central

    Weigelt, Patrick; Daniel Kissling, W.; Kisel, Yael; Fritz, Susanne A.; Karger, Dirk Nikolaus; Kessler, Michael; Lehtonen, Samuli; Svenning, Jens-Christian; Kreft, Holger

    2015-01-01

    Islands are ideal for investigating processes that shape species assemblages because they are isolated and have discrete boundaries. Quantifying phylogenetic assemblage structure allows inferences about these processes, in particular dispersal, environmental filtering and in-situ speciation. Here, we link phylogenetic assemblage structure to island characteristics across 393 islands worldwide and 37,041 vascular plant species (representing angiosperms overall, palms and ferns). Physical and bioclimatic factors, especially those impeding colonization and promoting speciation, explained more variation in phylogenetic structure of angiosperms overall (49%) and palms (52%) than of ferns (18%). The relationships showed different or contrasting trends among these major plant groups, consistent with their dispersal- and speciation-related traits and climatic adaptations. Phylogenetic diversity was negatively related to isolation for palms, but unexpectedly it was positively related to isolation for angiosperms overall. This indicates strong dispersal filtering for the predominantly large-seeded, animal-dispersed palm family whereas colonization from biogeographically distinct source pools on remote islands likely drives the phylogenetic structure of angiosperm floras. We show that signatures of dispersal limitation, environmental filtering and in-situ speciation differ markedly among taxonomic groups on islands, which sheds light on the origin of insular plant diversity. PMID:26198002

  17. Towards an integrated phylogenetic classification of the Tremellomycetes.

    PubMed

    Liu, X-Z; Wang, Q-M; Göker, M; Groenewald, M; Kachalkin, A V; Lumbsch, H T; Millanes, A M; Wedin, M; Yurkov, A M; Boekhout, T; Bai, F-Y

    2015-06-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained.

  18. Minimizing phylogenetic number to find good evolutionary trees

    SciTech Connect

    Goldberg, L.A.; Goldberg, P.W.; Phillips, C.A.; Sweedyk, E.; Warnow, T.

    1995-05-01

    Inferring phylogenetic trees is a fundamental problem in computational-biology. We present a new objective criterion, the phylogenetic number, for evaluating evolutionary trees for species defined by biomolecular sequences or other qualitative characters. The phylogenetic number of a tree T is the maximum number of times that any given character state arises in T. By contrast, the classical parsimony criterion measures the total number of times that different character states arise in T. We consider the following related problems: finding the tree with minimum phylogenetic number, and computing the phylogenetic number of a given topology in which only the leaves are labeled by species. When the number of states is bounded (as is the case for biomolecular sequence characters), we can solve the second problem in polynomial time. We can also compute a fixed-topology 2-phylogeny (when one exists) for an arbitrary number of states. This algorithm can be used to further distinguish trees that are equal under parsimony. We also consider a number of other related problems.

  19. Towards an integrated phylogenetic classification of the Tremellomycetes.

    PubMed

    Liu, X-Z; Wang, Q-M; Göker, M; Groenewald, M; Kachalkin, A V; Lumbsch, H T; Millanes, A M; Wedin, M; Yurkov, A M; Boekhout, T; Bai, F-Y

    2015-06-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained. PMID:26955199

  20. PTreeRec: Phylogenetic Tree Reconstruction based on genome BLAST distance.

    PubMed

    Deng, Riqiang; Huang, Mingsong; Wang, Jinwen; Huang, Yuansen; Yang, Jie; Feng, Jinghua; Wang, Xunzhang

    2006-08-01

    Phylogenetic Tree Reconstruction (PTreeRec) is a web-based tool for automatic phylogeny inferences from whole-genome sequences, which accepts files of DNA sequences in the FASTA format and allows users to save the output tree file, and displays the inferred tree through an applet in a web browser. PTreeRec involves three basic steps. First, regions of maximal segment pairs (MSPs) based on an all-against-all pairwise comparison of genomes are located. Second, a distance matrix is calculated from MSP scores or coverage. Finally, a phylogenetic tree is reconstructed by the neighbor-joining method.

  1. Phylogenetics by likelihood: evolutionary modeling as a tool for understanding the genome.

    PubMed

    Kosiol, Carolin; Bofkin, Lee; Whelan, Simon

    2006-02-01

    Molecular evolutionary studies provide a means of investigating how cells function and how organisms adapt to their environment. The products of evolutionary studies provide medically important insights to the source of major diseases, such as HIV, and hold the key to understand the developing immunity of pathogenic bacteria to antibiotics. They have also helped mankind understand its place in nature, casting light on the selective forces and environmental conditions that resulted in modern humans. The use of likelihood as a framework for statistical modeling in phylogenetics has played a fundamental role in studying molecular evolution, enabling rigorous and robust conclusions to be drawn from sequence data. The first half of this article is a general introduction to the likelihood method for inferring phylogenies, the properties of the models used, and how it can be used for statistical testing. The latter half of the article focuses on the emerging new generation of phylogenetic models that describe heterogeneity in the evolutionary process along sequences, including the recoding of protein coding sequence data to amino acids and codons, and various approaches for describing dependencies between sites in a sequence. We conclude with a detailed case study examining how modern modeling approaches have been successfully employed to identify adaptive evolution in proteins.

  2. ImOSM: intermittent evolution and robustness of phylogenetic methods.

    PubMed

    Thi Nguyen, Minh Anh; Gesell, Tanja; von Haeseler, Arndt

    2012-02-01

    Among the criteria to evaluate the performance of a phylogenetic method, robustness to model violation is of particular practical importance as complete a priori knowledge of evolutionary processes is typically unavailable. For studies of robustness in phylogenetic inference, a utility to add well-defined model violations to the simulated data would be helpful. We therefore introduce ImOSM, a tool to imbed intermittent evolution as model violation into an alignment. Intermittent evolution refers to extra substitutions occurring randomly on branches of a tree, thus changing alignment site patterns. This means that the extra substitutions are placed on the tree after the typical process of sequence evolution is completed. We then study the robustness of widely used phylogenetic methods: maximum likelihood (ML), maximum parsimony (MP), and a distance-based method (BIONJ) to various scenarios of model violation. Violation of rates across sites (RaS) heterogeneity and simultaneous violation of RaS and the transition/transversion ratio on two nonadjacent external branches hinder all the methods recovery of the true topology for a four-taxon tree. For an eight-taxon balanced tree, the violations cause each of the three methods to infer a different topology. Both ML and MP fail, whereas BIONJ, which calculates the distances based on the ML estimated parameters, reconstructs the true tree. Finally, we report that a test of model homogeneity and goodness of fit tests have enough power to detect such model violations. The outcome of the tests can help to actually gain confidence in the inferred trees. Therefore, we recommend using these tests in practical phylogenetic analyses.

  3. Multigene Phylogenetics Reveals Temporal Diversification of Major African Malaria Vectors

    PubMed Central

    Kamali, Maryam; Marek, Paul E.; Peery, Ashley; Antonio-Nkondjio, Christophe; Ndo, Cyrille; Tu, Zhijian; Simard, Frederic; Sharakhov, Igor V.

    2014-01-01

    The major vectors of malaria in sub-Saharan Africa belong to subgenus Cellia. Yet, phylogenetic relationships and temporal diversification among African mosquito species have not been unambiguously determined. Knowledge about vector evolutionary history is crucial for correct interpretation of genetic changes identified through comparative genomics analyses. In this study, we estimated a molecular phylogeny using 49 gene sequences for the African malaria vectors An. gambiae, An. funestus, An. nili, the Asian malaria mosquito An. stephensi, and the outgroup species Culex quinquefasciatus and Aedes aegypti. To infer the phylogeny, we identified orthologous sequences uniformly distributed approximately every 5 Mb in the five chromosomal arms. The sequences were aligned and the phylogenetic trees were inferred using maximum likelihood and neighbor-joining methods. Bayesian molecular dating using a relaxed log normal model was used to infer divergence times. Trees from individual genes agreed with each other, placing An. nili as a basal clade that diversified from the studied malaria mosquito species 47.6 million years ago (mya). Other African malaria vectors originated more recently, and independently acquired traits related to vectorial capacity. The lineage leading to An. gambiae diverged 30.4 mya, while the African vector An. funestus and the Asian vector An. stephensi were the most closely related sister taxa that split 20.8 mya. These results were supported by consistently high bootstrap values in concatenated phylogenetic trees generated individually for each chromosomal arm. Genome-wide multigene phylogenetic analysis is a useful approach for discerning historic relationships among malaria vectors, providing a framework for the correct interpretation of genomic changes across species, and comprehending the evolutionary origins of this ubiquitous and deadly insect-borne disease. PMID:24705448

  4. [Phylogeny of protostome moulting animals (Ecdysozoa) inferred from 18 and 28S rRNA gene sequences].

    PubMed

    Petrov, N B; Vladychenskaia, N S

    2005-01-01

    Reliability of reconstruction of phylogenetic relationships within a group of protostome moulting animals was evaluated by means of comparison of 18 and 28S rRNA gene sequences sets both taken separately and combined. Reliability of reconstructions was evaluated by values of the bootstrap support of major phylogenetic tree nodes and by degree of congruence of phylogenetic trees inferred by various methods. By both criteria, phylogenetic trees reconstructed from the combined 18 and 28S rRNA gene sequences were better than those inferred from 18 and 28S sequences taken separately. Results obtained are consistent with phylogenetic hypothesis separating protostome animals into two major clades, moulting Ecdysozoa (Priapulida + Kinorhyncha, Nematoda + Nematomorpha, Onychophora + Tardigrada, Myriapoda + Chelicerata, Crustacea + Hexapoda) and unmoulting Lophotrochozoa (Plathelminthes, Nemertini, Annelida, Mollusca, Echiura, Sipuncula). Clade Cephalorhyncha does not include nematomorphs (Nematomorpha). Conclusion was taken that it is necessary to use combined 18 and 28S data in phylogenetic studies. PMID:16083008

  5. Species names in phylogenetic nomenclature.

    PubMed

    Cantino, P D; Bryant, H N; de Queiroz, K; Donoghue, M J; Eriksson, T; Hillis, D M; Lee, M S

    1999-12-01

    Linnaean binomial nomenclature is logically incompatible with the phylogenetic nomenclature of de Queiroz and Gauthier (1992, Annu. Rev. Ecol. Syst. 23:449-480): The former is based on the concept of genus, thus making this rank mandatory, while the latter is based on phylogenetic definitions and requires the abandonment of mandatory ranks. Thus, if species are to receive names under phylogenetic nomenclature, a different method must be devised to name them. Here, 13 methods for naming species in the context of phylogenetic nomenclature are contrasted with each other and with Linnaean binomials. A fundamental dichotomy among the proposed methods distinguishes those that retain the entire binomial of a preexisting species name from those that retain only the specific epithet. Other relevant issues include the stability, uniqueness, and ease of pronunciation of species names; their capacity to convey phylogenetic information; and the distinguishability of species names that are governed by a code of phylogenetic nomenclature both from clade names and from species names governed by the current codes. No method is ideal. Each has advantages and drawbacks, and preference for one option over another will be influenced by one's evaluation of the relative importance of the pros and cons for each. Moreover, sometimes the same feature is viewed as an advantage by some and a drawback by others. Nevertheless, all of the proposed methods for naming species in the context of phylogenetic nomenclature provide names that are more stable than Linnaean binomials. PMID:12066299

  6. Phylogeny and species delimitations in European Dicranum (Dicranaceae, Bryophyta) inferred from nuclear and plastid DNA.

    PubMed

    Lang, Annick S; Bocksberger, Gaëlle; Stech, Michael

    2015-11-01

    DNA sequences have been widely used for taxonomy, inferring phylogenetic relationships and identifying species boundaries. Several specific methods to define species delimitations based on molecular phylogenies have appeared recently, with the generalized mixed Yule coalescent (GMYC) method being most popular. However, only few studies on land plants have been published so far and GMYC analyses of bryophytes are missing. Dicranum is a large genus of mosses whose (morpho-)species are partly ill-defined and frequently confused. To infer molecular species delimitations, we reconstructed phylogenetic trees based on five chloroplast markers and nuclear ribosomal ITS sequences from 27 out of 30 species occurring in Europe. We applied the species delimitation methods GMYC and Poisson tree processes (PTP) in order to compare their discriminatory power with species boundaries inferred from the molecular phylogenetic reconstructions and with the morphological species concept. Phylogenetic circumscriptions were congruent with the morphological concept for 19 species, while eight species were molecularly not well delimited, mostly forming closely related species pairs. The automated species delimitation methods achieved similar results but tended to overestimate the number of potential species and exposed several incongruences between the morphological concept and inference from molecular phylogenetic reconstructions. It is concluded that GMYC and PTP methods potentially provide a useful and objective way of delimiting bryophyte species, but studies on further bryophyte data sets are necessary to infer whether incongruences might ensue from evolutionary processes and to test the suitability of these approaches. PMID:26149758

  7. Phylogeny and species delimitations in European Dicranum (Dicranaceae, Bryophyta) inferred from nuclear and plastid DNA.

    PubMed

    Lang, Annick S; Bocksberger, Gaëlle; Stech, Michael

    2015-11-01

    DNA sequences have been widely used for taxonomy, inferring phylogenetic relationships and identifying species boundaries. Several specific methods to define species delimitations based on molecular phylogenies have appeared recently, with the generalized mixed Yule coalescent (GMYC) method being most popular. However, only few studies on land plants have been published so far and GMYC analyses of bryophytes are missing. Dicranum is a large genus of mosses whose (morpho-)species are partly ill-defined and frequently confused. To infer molecular species delimitations, we reconstructed phylogenetic trees based on five chloroplast markers and nuclear ribosomal ITS sequences from 27 out of 30 species occurring in Europe. We applied the species delimitation methods GMYC and Poisson tree processes (PTP) in order to compare their discriminatory power with species boundaries inferred from the molecular phylogenetic reconstructions and with the morphological species concept. Phylogenetic circumscriptions were congruent with the morphological concept for 19 species, while eight species were molecularly not well delimited, mostly forming closely related species pairs. The automated species delimitation methods achieved similar results but tended to overestimate the number of potential species and exposed several incongruences between the morphological concept and inference from molecular phylogenetic reconstructions. It is concluded that GMYC and PTP methods potentially provide a useful and objective way of delimiting bryophyte species, but studies on further bryophyte data sets are necessary to infer whether incongruences might ensue from evolutionary processes and to test the suitability of these approaches.

  8. Phylogenetic lineages in Pseudocercospora

    PubMed Central

    Crous, P.W.; Braun, U.; Hunter, G.C.; Wingfield, M.J.; Verkley, G.J.M.; Shin, H.-D.; Nakashima, C.; Groenewald, J.Z.

    2013-01-01

    Pseudocercospora is a large cosmopolitan genus of plant pathogenic fungi that are commonly associated with leaf and fruit spots as well as blights on a wide range of plant hosts. They occur in arid as well as wet environments and in a wide range of climates including cool temperate, sub-tropical and tropical regions. Pseudocercospora is now treated as a genus in its own right, although formerly recognised as either an anamorphic state of Mycosphaerella or having mycosphaerella-like teleomorphs. The aim of this study was to sequence the partial 28S nuclear ribosomal RNA gene of a selected set of isolates to resolve phylogenetic generic limits within the Pseudocercospora complex. From these data, 14 clades are recognised, six of which cluster in Mycosphaerellaceae. Pseudocercospora s. str. represents a distinct clade, sister to Passalora eucalypti, and a clade representing the genera Scolecostigmina, Trochophora and Pallidocercospora gen. nov., taxa formerly accommodated in the Mycosphaerella heimii complex and characterised by smooth, pale brown conidia, as well as the formation of red crystals in agar media. Other clades in Mycosphaerellaceae include Sonderhenia, Microcyclosporella, and Paracercospora. Pseudocercosporella resides in a large clade along with Phloeospora, Miuraea, Cercospora and Septoria. Additional clades represent Dissoconiaceae, Teratosphaeriaceae, Cladosporiaceae, and the genera Xenostigmina, Strelitziana, Cyphellophora and Thedgonia. The genus Phaeomycocentrospora is introduced to accommodate Mycocentrospora cantuariensis, primarily distinguished from Pseudocercospora based on its hyaline hyphae, broad conidiogenous loci and hila. Host specificity was considered for 146 species of Pseudocercospora occurring on 115 host genera from 33 countries. Partial nucleotide sequence data for three gene loci, ITS, EF-1α, and ACT suggest that the majority of these species are host specific. Species identified on the basis of host, symptomatology and general

  9. Phylogenetics and the Human Microbiome

    PubMed Central

    Matsen, Frederick A.

    2015-01-01

    The human microbiome is the ensemble of genes in the microbes that live inside and on the surface of humans. Because microbial sequencing information is now much easier to come by than phenotypic information, there has been an explosion of sequencing and genetic analysis of microbiome samples. Much of the analytical work for these sequences involves phylogenetics, at least indirectly, but methodology has developed in a somewhat different direction than for other applications of phylogenetics. In this article, I review the field and its methods from the perspective of a phylogeneticist, as well as describing current challenges for phylogenetics coming from this type of work. PMID:25102857

  10. [Foundations of the new phylogenetics].

    PubMed

    Pavlinov, I Ia

    2004-01-01

    Evolutionary idea is the core of the modern biology. Due to this, phylogenetics dealing with historical reconstructions in biology takes a priority position among biological disciplines. The second half of the 20th century witnessed growth of a great interest to phylogenetic reconstructions at macrotaxonomic level which replaced microevolutionary studies dominating during the 30s-60s. This meant shift from population thinking to phylogenetic one but it was not revival of the classical phylogenetics; rather, a new approach emerged that was baptized The New Phylogenetics. It arose as a result of merging of three disciplines which were developing independently during 60s-70s, namely cladistics, numerical phyletics, and molecular phylogenetics (now basically genophyletics). Thus, the new phylogenetics could be defined as a branch of evolutionary biology aimed at elaboration of "parsimonious" cladistic hypotheses by means of numerical methods on the basis of mostly molecular data. Classical phylogenetics, as a historical predecessor of the new one, emerged on the basis of the naturphilosophical worldview which included a superorganismal idea of biota. Accordingly to that view, historical development (the phylogeny) was thought an analogy of individual one (the ontogeny) so its most basical features were progressive parallel developments of "parts" (taxa), supplemented with Darwinian concept of monophyly. Two predominating traditions were diverged within classical phylogenetics according to a particular interpretation of relation between these concepts. One of them (Cope, Severtzow) belittled monophyly and paid most attention to progressive parallel developments of morphological traits. Such an attitude turned this kind of phylogenetics to be rather the semogenetics dealing primarily with evolution of structures and not of taxa. Another tradition (Haeckel) considered both monophyletic and parallel origins of taxa jointly: in the middle of 20th century it was split into

  11. Complete mitochondrial genome of Cervus elaphus songaricus (Cetartiodactyla: Cervinae) and a phylogenetic analysis with related species.

    PubMed

    Li, Yiqing; Ba, Hengxing; Yang, Fuhe

    2016-01-01

    Complete mitochondrial genome of Tianshan wapiti, Cervus elaphus songaricus, is 16,419 bp in length and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and 1 control region. The phylogenetic trees were reconstructed with the concatenated nucleotide sequences of the 13 protein-coding genes using maximum parsimony (MP) and Bayesian inference (BI) methods. MP and BI phylogenetic trees here showed an identical tree topology. The monopoly of red deer, wapiti and sika deer was well supported, and wapiti was found to share a closer relationship with sika deer. Tianshan wapiti shared a closer relationship with xanthopygus than yarkandensis. Rusa unicolor and Rucervus eldi were given a basal phylogenetic position. Our phylogenetic analysis provided a robust phylogenetic resolution spanning the entire evolutionary relationship of the subfamily Cervinae. PMID:24725059

  12. Phylogenetic clustering among aggressive competitors: evidence from odonate assemblages along a riverine gradient.

    PubMed

    Saito, Victor S; Valente-Neto, Francisco; Rodrigues, Marciel Elio; de Oliveira Roque, Fabio; Siqueira, Tadeu

    2016-09-01

    Studies on phylogenetic community ecology usually infer habitat filtering when communities are phylogenetically clustered or competitive exclusion when communities are overdispersed. This logic is based on strong competition and niche similarity among closely related species-a less common phenomenon than previously expected. Dragonflies and damselflies are good models for testing predictions based on this logic because they behave aggressively towards related species due to mistaken identification of conspecifics. This behavior may drive communities toward phylogenetic overdispersion if closely related species frequently exclude each other. However, phylogenetically clustered communities could also be observed if habitat filtering and/or competitive asymmetry among distantly related species are major drivers of community assembling. We investigated the phylogenetic structure of odonate assemblages in central Brazil in a watershed characterized by variations in stream width, vegetation cover, aquatic vegetation, and luminosity. We observed general clustering in communities according to two indices of phylogenetic structure. Phylogenetic beta diversity coupled with Mantel tests and RLQ analysis evidenced a correlation between the riverine gradient and phylogenetic structure. Larger rivers with aquatic vegetation were characterized by anisopterans, while most zygopterans stayed in small and shaded streams. These results indicate niche conservatism in Odonata habitat occupancy, and that the environment is a major influence on the phylogenetic structure of these communities. We suggest that this is due to clade-specific ecophysiological requirements, and because closely related species may also have competitive advantages and dominate certain preferred habitats. PMID:27160426

  13. Phylogenetic clustering among aggressive competitors: evidence from odonate assemblages along a riverine gradient.

    PubMed

    Saito, Victor S; Valente-Neto, Francisco; Rodrigues, Marciel Elio; de Oliveira Roque, Fabio; Siqueira, Tadeu

    2016-09-01

    Studies on phylogenetic community ecology usually infer habitat filtering when communities are phylogenetically clustered or competitive exclusion when communities are overdispersed. This logic is based on strong competition and niche similarity among closely related species-a less common phenomenon than previously expected. Dragonflies and damselflies are good models for testing predictions based on this logic because they behave aggressively towards related species due to mistaken identification of conspecifics. This behavior may drive communities toward phylogenetic overdispersion if closely related species frequently exclude each other. However, phylogenetically clustered communities could also be observed if habitat filtering and/or competitive asymmetry among distantly related species are major drivers of community assembling. We investigated the phylogenetic structure of odonate assemblages in central Brazil in a watershed characterized by variations in stream width, vegetation cover, aquatic vegetation, and luminosity. We observed general clustering in communities according to two indices of phylogenetic structure. Phylogenetic beta diversity coupled with Mantel tests and RLQ analysis evidenced a correlation between the riverine gradient and phylogenetic structure. Larger rivers with aquatic vegetation were characterized by anisopterans, while most zygopterans stayed in small and shaded streams. These results indicate niche conservatism in Odonata habitat occupancy, and that the environment is a major influence on the phylogenetic structure of these communities. We suggest that this is due to clade-specific ecophysiological requirements, and because closely related species may also have competitive advantages and dominate certain preferred habitats.

  14. Symbiotic spirochetes in the termite hindgut: phylogenetic identification of ectosymbiotic spirochetes of oxymonad protists.

    PubMed

    Iida; Ohkuma; Ohtoko; Kudo

    2000-10-01

    Some species of protists inhabiting the hindgut of lower-termites have a large number of ectosymbiotic spirochetes on the cell surface. The phylogenetic positions of the ectosymbiotic spirochetes of three oxymonad protists, Dinenympha porteri in the gut of Reticulitermes speratus, and Pyrsonympha sp. and Dinenympha sp. in Hodotermopsis sjoestedti, were investigated without cultivation of these organisms. Protist fractions carefully collected with a micromanipulator were used as templates for the amplification of small subunit ribosomal RNA genes (SSU rDNA). The phylogenetic tree inferred from the nucleotide sequences of the SSU rDNA showed that they were affiliated with the Treponema cluster of spirochetes and they were divided into two clusters. One was grouped together with the spirochetal sequences reported previously from the gut of termites and the other was related to the Treponema bryantii subgroup of treponemes (denoted as termite Treponema clusters I and II, respectively). Whole-cell in situ hybridization using a fluorescent-labeled oligonucleotide probe specific for the group of sequences in cluster II identified most of the ectosymbiotic spirochetes of the oxymonad protists in the gut of R. speratus and H. sjoestedti. However, not all of the ectosymbiotic spirochetes could be detected by means of this cluster II group-specific probe and the population of ectosymbiotic spirochetes of cluster II was different among the oxymonad species. In the case of D. porteri, an oligonucleotide probe specific for one member of cluster II recognized a portion of the ectosymbiotic spirochetes of cluster II, and their population was also different depending on the cell-type of D. porteri in terms of the attachment of ectosymbiotic spirochetes. The results indicate that the spirochetes of cluster II and probably those of a part of cluster I can be assigned to ectosymbiotic species of oxymonad protists and that the population of ectosymbiotic spirochetes associated with a

  15. Accounting for solvent accessibility and secondary structure in protein phylogenetics is clearly beneficial.

    PubMed

    Le, Si Quang; Gascuel, Olivier

    2010-05-01

    Amino acid substitution models are essential to most methods to infer phylogenies from protein data. These models represent the ways in which proteins evolve and substitutions accumulate along the course of time. It is widely accepted that the substitution processes vary depending on the structural configuration of the protein residues. However, this information is very rarely used in phylogenetic studies, though the 3-dimensional structure of dozens of thousands of proteins has been elucidated. Here, we reinvestigate the question in order to fill this gap. We use an improved estimation methodology and a very large database comprising 1471 nonredundant globular protein alignments with structural annotations to estimate new amino acid substitution models accounting for the secondary structure and solvent accessibility of the residues. These models incorporate a confidence coefficient that is estimated from the data and reflects the reliability and usefulness of structural annotations in the analyzed sequences. Our results with 300 independent test alignments show an impressive likelihood gain compared with standard models such as JTT or WAG. Moreover, the use of these models induces significant topological changes in the inferred trees, which should be of primary interest to phylogeneticists. Our data, models, and software are available for download from http://atgc.lirmm.fr/phyml-structure/.

  16. Phylogenetics: bats united, microbats divided.

    PubMed

    Springer, Mark S

    2013-11-18

    Phylogenetic analyses on four new bat genomes provide convincing support for the placement of bats relative to other placental mammals, suggest that microbats are an unnatural group, and have important implications for understanding the evolution of echolocation.

  17. The Bayes Inference Engine

    SciTech Connect

    Hanson, K.M.; Cunningham, G.S.

    1996-04-01

    The authors are developing a computer application, called the Bayes Inference Engine, to provide the means to make inferences about models of physical reality within a Bayesian framework. The construction of complex nonlinear models is achieved by a fully object-oriented design. The models are represented by a data-flow diagram that may be manipulated by the analyst through a graphical programming environment. Maximum a posteriori solutions are achieved using a general, gradient-based optimization algorithm. The application incorporates a new technique of estimating and visualizing the uncertainties in specific aspects of the model.

  18. Combinatorics of distance-based tree inference.

    PubMed

    Pardi, Fabio; Gascuel, Olivier

    2012-10-01

    Several popular methods for phylogenetic inference (or hierarchical clustering) are based on a matrix of pairwise distances between taxa (or any kind of objects): The objective is to construct a tree with branch lengths so that the distances between the leaves in that tree are as close as possible to the input distances. If we hold the structure (topology) of the tree fixed, in some relevant cases (e.g., ordinary least squares) the optimal values for the branch lengths can be expressed using simple combinatorial formulae. Here we define a general form for these formulae and show that they all have two desirable properties: First, the common tree reconstruction approaches (least squares, minimum evolution), when used in combination with these formulae, are guaranteed to infer the correct tree when given enough data (consistency); second, the branch lengths of all the simple (nearest neighbor interchange) rearrangements of a tree can be calculated, optimally, in quadratic time in the size of the tree, thus allowing the efficient application of hill climbing heuristics. The study presented here is a continuation of that by Mihaescu and Pachter on branch length estimation [Mihaescu R, Pachter L (2008) Proc Natl Acad Sci USA 105:13206-13211]. The focus here is on the inference of the tree itself and on providing a basis for novel algorithms to reconstruct trees from distances.

  19. Combinatorics of distance-based tree inference

    PubMed Central

    Pardi, Fabio; Gascuel, Olivier

    2012-01-01

    Several popular methods for phylogenetic inference (or hierarchical clustering) are based on a matrix of pairwise distances between taxa (or any kind of objects): The objective is to construct a tree with branch lengths so that the distances between the leaves in that tree are as close as possible to the input distances. If we hold the structure (topology) of the tree fixed, in some relevant cases (e.g., ordinary least squares) the optimal values for the branch lengths can be expressed using simple combinatorial formulae. Here we define a general form for these formulae and show that they all have two desirable properties: First, the common tree reconstruction approaches (least squares, minimum evolution), when used in combination with these formulae, are guaranteed to infer the correct tree when given enough data (consistency); second, the branch lengths of all the simple (nearest neighbor interchange) rearrangements of a tree can be calculated, optimally, in quadratic time in the size of the tree, thus allowing the efficient application of hill climbing heuristics. The study presented here is a continuation of that by Mihaescu and Pachter on branch length estimation [Mihaescu R, Pachter L (2008) Proc Natl Acad Sci USA 105:13206–13211]. The focus here is on the inference of the tree itself and on providing a basis for novel algorithms to reconstruct trees from distances. PMID:23012403

  20. Combinatorics of distance-based tree inference.

    PubMed

    Pardi, Fabio; Gascuel, Olivier

    2012-10-01

    Several popular methods for phylogenetic inference (or hierarchical clustering) are based on a matrix of pairwise distances between taxa (or any kind of objects): The objective is to construct a tree with branch lengths so that the distances between the leaves in that tree are as close as possible to the input distances. If we hold the structure (topology) of the tree fixed, in some relevant cases (e.g., ordinary least squares) the optimal values for the branch lengths can be expressed using simple combinatorial formulae. Here we define a general form for these formulae and show that they all have two desirable properties: First, the common tree reconstruction approaches (least squares, minimum evolution), when used in combination with these formulae, are guaranteed to infer the correct tree when given enough data (consistency); second, the branch lengths of all the simple (nearest neighbor interchange) rearrangements of a tree can be calculated, optimally, in quadratic time in the size of the tree, thus allowing the efficient application of hill climbing heuristics. The study presented here is a continuation of that by Mihaescu and Pachter on branch length estimation [Mihaescu R, Pachter L (2008) Proc Natl Acad Sci USA 105:13206-13211]. The focus here is on the inference of the tree itself and on providing a basis for novel algorithms to reconstruct trees from distances. PMID:23012403

  1. Inferring ancestral sequences in taxon-rich phylogenies.

    PubMed

    Gascuel, Olivier; Steel, Mike

    2010-10-01

    Statistical consistency in phylogenetics has traditionally referred to the accuracy of estimating phylogenetic parameters for a fixed number of species as we increase the number of characters. However, it is also useful to consider a dual type of statistical consistency where we increase the number of species, rather than characters. This raises some basic questions: what can we learn about the evolutionary process as we increase the number of species? In particular, does having more species allow us to infer the ancestral state of characters accurately? This question is particularly important when sequence evolution varies in a complex way from character to character, as methods applicable for i.i.d. models may no longer be valid. In this paper, we assemble a collection of results to analyse various approaches for inferring ancestral information with increasing accuracy as the number of taxa increases.

  2. Relating phylogenetic trees to transmission trees of infectious disease outbreaks.

    PubMed

    Ypma, Rolf J F; van Ballegooijen, W Marijn; Wallinga, Jacco

    2013-11-01

    Transmission events are the fundamental building blocks of the dynamics of any infectious disease. Much about the epidemiology of a disease can be learned when these individual transmission events are known or can be estimated. Such estimations are difficult and generally feasible only when detailed epidemiological data are available. The genealogy estimated from genetic sequences of sampled pathogens is another rich source of information on transmission history. Optimal inference of transmission events calls for the combination of genetic data and epidemiological data into one joint analysis. A key difficulty is that the transmission tree, which describes the transmission events between infected hosts, differs from the phylogenetic tree, which describes the ancestral relationships between pathogens sampled from these hosts. The trees differ both in timing of the internal nodes and in topology. These differences become more pronounced when a higher fraction of infected hosts is sampled. We show how the phylogenetic tree of sampled pathogens is related to the transmission tree of an outbreak of an infectious disease, by the within-host dynamics of pathogens. We provide a statistical framework to infer key epidemiological and mutational parameters by simultaneously estimating the phylogenetic tree and the transmission tree. We test the approach using simulations and illustrate its use on an outbreak of foot-and-mouth disease. The approach unifies existing methods in the emerging field of phylodynamics with transmission tree reconstruction methods that are used in infectious disease epidemiology.

  3. Phylogenetic lineages in Pseudocercospora

    PubMed Central

    Crous, P.W.; Braun, U.; Hunter, G.C.; Wingfield, M.J.; Verkley, G.J.M.; Shin, H.-D.; Nakashima, C.; Groenewald, J.Z.

    2013-01-01

    Pseudocercospora is a large cosmopolitan genus of plant pathogenic fungi that are commonly associated with leaf and fruit spots as well as blights on a wide range of plant hosts. They occur in arid as well as wet environments and in a wide range of climates including cool temperate, sub-tropical and tropical regions. Pseudocercospora is now treated as a genus in its own right, although formerly recognised as either an anamorphic state of Mycosphaerella or having mycosphaerella-like teleomorphs. The aim of this study was to sequence the partial 28S nuclear ribosomal RNA gene of a selected set of isolates to resolve phylogenetic generic limits within the Pseudocercospora complex. From these data, 14 clades are recognised, six of which cluster in Mycosphaerellaceae. Pseudocercospora s. str. represents a distinct clade, sister to Passalora eucalypti, and a clade representing the genera Scolecostigmina, Trochophora and Pallidocercospora gen. nov., taxa formerly accommodated in the Mycosphaerella heimii complex and characterised by smooth, pale brown conidia, as well as the formation of red crystals in agar media. Other clades in Mycosphaerellaceae include Sonderhenia, Microcyclosporella, and Paracercospora. Pseudocercosporella resides in a large clade along with Phloeospora, Miuraea, Cercospora and Septoria. Additional clades represent Dissoconiaceae, Teratosphaeriaceae, Cladosporiaceae, and the genera Xenostigmina, Strelitziana, Cyphellophora and Thedgonia. The genus Phaeomycocentrospora is introduced to accommodate Mycocentrospora cantuariensis, primarily distinguished from Pseudocercospora based on its hyaline hyphae, broad conidiogenous loci and hila. Host specificity was considered for 146 species of Pseudocercospora occurring on 115 host genera from 33 countries. Partial nucleotide sequence data for three gene loci, ITS, EF-1α, and ACT suggest that the majority of these species are host specific. Species identified on the basis of host, symptomatology and general

  4. Quantitative developmental data in a phylogenetic framework.

    PubMed

    Giannini, Norberto Pedro

    2014-12-01

    Following the embryonic period of organogenesis, most development is allometric growth, which is thought to produce most of the evolutionary morphological divergence between related species. Bivariate or multivariate coefficients of allometry are used to describe quantitative developmental data and are comparable across taxa; as such, these coefficients are amenable to direct treatment in a phylogenetic framework. Mapping of actual allometric coefficients onto phylogenetic trees is supported on the basis of the evolving nature of growth programs and the type of character (continuous) that they represent. This procedure depicts evolutionary allometry accurately and allows for the generation of reliable reconstructions of ancestral allometry, as shown here with a previously published case study on rodent cranial ontogeny. Results reconstructed the signature allometric patterns of rodents to the root of the phylogeny, which could be traced back into a (minimum) Paleocene age. Both character and statistical dependence need to be addressed, so this approach can be integrated with phylogenetic comparative methods that deal with those issues. It is shown that, in this particular sample of rodents, common ancestry explains little allometric variation given the level of divergence present within, and convergence between, major rodent lineages. Furthermore, all that variation is independent of body mass. Thus, from an evolutionary perspective, allometry appears to have a strong functional and likely adaptive basis. PMID:25130201

  5. Phylogenetic estimation of timescales using ancient DNA: the effects of temporal sampling scheme and uncertainty in sample ages.

    PubMed

    Molak, Martyna; Lorenzen, Eline D; Shapiro, Beth; Ho, Simon Y W

    2013-02-01

    In recent years, ancient DNA has increasingly been used for estimating molecular timescales, particularly in studies of substitution rates and demographic histories. Molecular clocks can be calibrated using temporal information from ancient DNA sequences. This information comes from the ages of the ancient samples, which can be estimated by radiocarbon dating the source material or by dating the layers in which the material was deposited. Both methods involve sources of uncertainty. The performance of bayesian phylogenetic inference depends on the information content of the data set, which includes variation in the DNA sequences and the structure of the sample ages. Various sources of estimation error can reduce our ability to estimate rates and timescales accurately and precisely. We investigated the impact of sample-dating uncertainties on the estimation of evolutionary timescale parameters using the software BEAST. Our analyses involved 11 published data sets and focused on estimates of substitution rate and root age. We show that, provided that samples have been accurately dated and have a broad temporal span, it might be unnecessary to account for sample-dating uncertainty in Bayesian phylogenetic analyses of ancient DNA. We also investigated the sample size and temporal span of the ancient DNA sequences needed to estimate phylogenetic timescales reliably. Our results show that the range of sample ages plays a crucial role in determining the quality of the results but that accurate and precise phylogenetic estimates of timescales can be made even with only a few ancient sequences. These findings have important practical consequences for studies of molecular rates, timescales, and population dynamics.

  6. Phylogenetic relationships among subsurface microorganisms

    SciTech Connect

    Nierzwicki-Bauer, S.A.

    1991-01-01

    This report summarizes the progress made from 6/90--3/91 toward completion of our project, Phylogenetic Relationships among subsurface microorganisms. 16S rRNA was sequenced, and based on the sequence the SMCC isolates were assigned to preliminary groups. Microorganisms were obtained at various depths at the Savannah River Site, including the Surface (0 m), Congaree (91 m), and Middendorf (244 m, 259 m, 265 m). Sequence data from four isolates from the Congaree formation indicate these microorganisms can be divided into Pseudomonas spp. or Acinetobacter spp. Three 16S rRNA probes were synthesized based on sequence data. The synthesized probes were tested through in situ hybridization. Optimal conditions for in situ hybridization were determined. Because stability of RNA-DNA hybrids is dependent on hybridization stringency, related organisms can be differentiated using a single probe under different strigencies. The results of these hybridizations agree with results obtained by Balkwill and Reeves using restriction fragment length polymorphism analysis. The RNA content of a cell reflects its metabolic state. Cells which are starved for four days are not detectable with the homologous 16S rRNA probe. However, within 15 minutes of refeeding, detectable rRNA appeared. This suggests that organisms which are undetectable in environmental samples due to starvation may be detectable after addition of nutrients. Stepwise addition of specific nutrients could indicate which nutrients are rate limiting for growth. Preliminary experiments with soil samples from the Hanford Site indicate indigenous microorganisms can be detected by oligionucleotide probes. Further, using multiple probes based on universal sequences increases the number of organisms detected. Double label experiments, using a rhodamine-labelled oligionucleotide probe with free coumarin succinimidyl ester will allow simultaneous detection of total bacteria and specific 16S rRNA containing bacteria. 4 tabs. (MHB)

  7. [Inferences and verbal comprehension in children with developmental language disorders].

    PubMed

    Monfort, Isabelle; Monfort, Marc

    2013-02-22

    We review the concept of inference in language comprehension -both oral and written- recalling the different proposals of classification. We analyze the type of difficulties that children might encounter in their application of the inferences, depending on the type of language or development pathology. Finally, we describe the proposals for intervention that have been made to enhance the ability to apply inferences in language comprehension. PMID:23446716

  8. A critical appraisal of the use of microRNA data in phylogenetics

    PubMed Central

    Thomson, Robert C.; Plachetzki, David C.; Mahler, D. Luke; Moore, Brian R.

    2014-01-01

    Recent progress in resolving the tree of life continues to expose relationships that resist resolution, which drives the search for novel sources of information to solve these difficult phylogenetic problems. A recent example, the presence and absence of microRNA families, has been vigorously promoted as an ideal source of phylogenetic data and has been applied to several perennial phylogenetic problems. The utility of such data for phylogenetic inference hinges critically both on developing stochastic models that provide a reasonable description of the process that give rise to these data, and also on the careful validation of those models in real inference scenarios. Remarkably, however, the statistical behavior and phylogenetic utility of microRNA data have not yet been rigorously characterized. Here we explore the behavior and performance of microRNA presence/absence data under a variety of evolutionary models and reexamine datasets from several previous studies. We find that highly heterogeneous rates of microRNA gain and loss, pervasive secondary loss, and sampling error collectively render microRNA-based inference of phylogeny difficult. Moreover, our reanalyses fundamentally alter the conclusions for four of the five studies that we reexamined. Our results indicate that the capacity of miRNA data to resolve the tree of life has been overstated, and we urge caution in their application and interpretation. PMID:25071211

  9. Molecular phylogenetic study on the origin and evolution of Mustelidae.

    PubMed

    Yonezawa, Takahiro; Nikaido, Masato; Kohno, Naoki; Fukumoto, Yukio; Okada, Norihiro; Hasegawa, Masami

    2007-07-01

    The family Mustelidae, which consists of Mustelinae, Lutrinae, Melinae, and Taxidiinae, is the largest family among Carnivora and is a highly diverse group. Recent molecular phylogenetic studies have clarified the phylogenetic relations among Mustelidae, but there remain several unresolved problems, particularly concerning the deep branchings. Whereas many studies support the monophyly of Mustelidae+Procyonidae among Musteloidea, the relations between Mustelidae+Procyonidae, Ailuridae, and Miphitidae are still unclear. To address these problems, we inferred a tree on the basis of the sequences of mitochondrial genomes and of multiple nuclear genes using the maximum likelihood method. Our results strongly support the hypothesis that the Taxidiinae branched at first, followed by the branching of the Melinae. After that, Mustelinae diversified, and Lutrinae evolved within Mustelinae. With respect to the deep branchings in Musteloidea, the Ailuridae/Mephitidae monophyly tree and the Mephitidae-basal tree are indistinguishable in log-likelihood score, and this problem remains unresolved.

  10. An explosive innovation: Phylogenetic relationships of Solanum section Gonatotrichum (Solanaceae)

    PubMed Central

    Stern, Stephen; Bohs, Lynn

    2012-01-01

    Abstract Solanum is one of the largest plant genera and exhibits a wide range of morphological diversity. Solanum section Gonatotrichum, the focus of this study, is unique within the genus because of its fruits that swell with turgor pressure and explosively dehisce to disperse the seeds. We infer phylogenetic relationships within section Gonatotrichum using DNA sequence data from two nuclear regions (ITS and the granule-bound starch synthase gene [GBSSI or waxy]) and the chloroplast region trnT-F. The resulting phylogenetic trees support the monophyly of the section with the inclusion of Solanum lignescens, a species not previously thought to belong to the group due to the presence of stellate hairs. This inclusion of this species in section Gonatotrichum suggests that the simple, often geniculate hairs of species in the group may represent reduced stellate hairs. The presence of heterantherous flowers appears to be derived in the section, but this character is largely lost in Solanum parcistrigosum. PMID:22287931

  11. Culture observation and molecular phylogenetic analysis on the blooming green alga Chaetomorpha valida (Cladophorales, Chlorophyta) from China

    NASA Astrophysics Data System (ADS)

    Deng, Yunyan; Tang, Xiaorong; Zhan, Zifeng; Teng, Linhong; Ding, Lanping; Huang, Bingxin

    2013-05-01

    The marine green alga Chaetomorpha valida fouls aquaculture ponds along the coastal cities of Dalian and Rongcheng, China. Unialgal cultures were observed under a microscope to determine the developmental morphological characters of C. valida. Results reveal that gametophytic filaments often produce lateral branches under laboratory culture conditions, suggesting an atypical heteromorphic life cycle of C. valida between unbranched sporophytes and branched gametophytes, which differs from typical isomorphic alternation of Chaetomorpha species. The shape of the basal attachment cell, an important taxonomic character within the genus, was found variable depending on environmental conditions. The 18S rDNA and 28S rDNA regions were used to explore the phylogenetic affinity of the taxa. Inferred trees from 18S rDNA sequences revealed a close relationship between C. valida and Chaetomorpha moniligera. These results would enrich information in general biology and morphological plasticity of C. valida and provided a basis for future identification of green tide forming algae.

  12. Trandimensional Inference in the Geosciences

    NASA Astrophysics Data System (ADS)

    Bodin, Thomas

    2016-04-01

    An inverse problem is the task often occurring in many branches of Earth sciences, where the values of some model parameters describing the Earth must be obtained given noisy observations made at the surface. In all applications of inversion, assumptions are made about the nature of the model parametrisation and data noise characteristics, and results can significantly depend on those assumptions. These quantities are often manually `tuned' by means of subjective trial-and-error procedures, and this prevents to accurately quantify uncertainties in the solution. A Bayesian approach allows these assumptions to be relaxed by incorporating relevant parameters as unknowns in the inference problem. Rather than being forced to make decisions on parametrisation, the level of data noise and the weights between data types in advance, as is often the case in an optimization framework, the choice can be informed by the data themselves. Probabilistic sampling techniques such as transdimensional Markov chain Monte Carlo, allow sampling over complex posterior probability density functions, thus providing information on constraint, trade-offs and uncertainty in the unknowns. This presentation will present a review of transdimensional inference, and its application to different problems, ranging from Geochemistry to Solid Earth Geophysics.

  13. Quantum Inference on Bayesian Networks

    NASA Astrophysics Data System (ADS)

    Yoder, Theodore; Low, Guang Hao; Chuang, Isaac

    2014-03-01

    Because quantum physics is naturally probabilistic, it seems reasonable to expect physical systems to describe probabilities and their evolution in a natural fashion. Here, we use quantum computation to speedup sampling from a graphical probability model, the Bayesian network. A specialization of this sampling problem is approximate Bayesian inference, where the distribution on query variables is sampled given the values e of evidence variables. Inference is a key part of modern machine learning and artificial intelligence tasks, but is known to be NP-hard. Classically, a single unbiased sample is obtained from a Bayesian network on n variables with at most m parents per node in time (nmP(e) - 1 / 2) , depending critically on P(e) , the probability the evidence might occur in the first place. However, by implementing a quantum version of rejection sampling, we obtain a square-root speedup, taking (n2m P(e) -1/2) time per sample. The speedup is the result of amplitude amplification, which is proving to be broadly applicable in sampling and machine learning tasks. In particular, we provide an explicit and efficient circuit construction that implements the algorithm without the need for oracle access.

  14. Inference is bliss: using evolutionary relationship to guide categorical inferences.

    PubMed

    Novick, Laura R; Catley, Kefyn M; Funk, Daniel J

    2011-01-01

    Three experiments, adopting an evolutionary biology perspective, investigated subjects' inferences about living things. Subjects were told that different enzymes help regulate cell function in two taxa and asked which enzyme a third taxon most likely uses. Experiment 1 and its follow-up, with college students, used triads involving amphibians, reptiles, and mammals (reptiles and mammals are most closely related evolutionarily) and plants, fungi, and animals (fungi are more closely related to animals than to plants). Experiment 2, with 10th graders, also included triads involving mammals, birds, and snakes/crocodilians (birds and snakes/crocodilians are most closely related). Some subjects received cladograms (hierarchical diagrams) depicting the evolutionary relationships among the taxa. The effect of providing cladograms depended on students' background in biology. The results illuminate students' misconceptions concerning common taxa and constraints on their willingness to override faulty knowledge when given appropriate evolutionary evidence. Implications for introducing tree thinking into biology curricula are discussed. PMID:21463358

  15. Evaluating phylogenetic informativeness and data-type usage for new protein-coding genes across Vertebrata.

    PubMed

    Fong, Jonathan J; Fujita, Matthew K

    2011-11-01

    As a resource for vertebrate phylogenetics, we developed 75 new protein-coding genes using a combination of expressed sequence tags (ESTs) available in Genbank, and targeted amplification of complementary DNA (cDNA). In addition, we performed three additional analyses in order to assess the utility of our approach. First, we profiled the phylogenetic informativeness of these new markers using the online program PhyDesign. Next, we compared the utility of four different data-types used in phylogenetics: nucleotides (NUCL), amino acids (AA), 1st and 2nd codon positions only (N12), and modified sequences to account for codon degeneracy (DEGEN1; Regier et al., 2010). Lastly, we use these new markers to construct a vertebrate phylogeny and address the uncertain relationship between higher-level mammal groups: monotremes, marsupials, and placentals. Our results show that phylogenetic informativeness of the 75 new markers varies, both in the amount of phylogenetic signal and optimal timescale. When comparing the four data-types, we find that the NUCL data-type, due to the high level of phylogenetic signal, performs the best across all divergence times. The remaining three data-types (AA, N12, DEGEN1) are less subject to homoplasy, but have greatly reduced levels of phylogenetic signal relative to NUCL. Our phylogenetic inference supports the Theria hypothesis of mammalian relationships, with marsupials and placentals being sister groups.

  16. The complete mitochondrial genome of Flustra foliacea (Ectoprocta, Cheilostomata) - compositional bias affects phylogenetic analyses of lophotrochozoan relationships

    PubMed Central

    2011-01-01

    Background The phylogenetic relationships of the lophophorate lineages, ectoprocts, brachiopods and phoronids, within Lophotrochozoa are still controversial. We sequenced an additional mitochondrial genome of the most species-rich lophophorate lineage, the ectoprocts. Although it is known that there are large differences in the nucleotide composition of mitochondrial sequences of different lineages as well as in the amino acid composition of the encoded proteins, this bias is often not considered in phylogenetic analyses. We applied several approaches for reducing compositional bias and saturation in the phylogenetic analyses of the mitochondrial sequences. Results The complete mitochondrial genome (16,089 bp) of Flustra foliacea (Ectoprocta, Gymnolaemata, Cheilostomata) was sequenced. All protein-encoding, rRNA and tRNA genes are transcribed from the same strand. Flustra shares long intergenic sequences with the cheilostomate ectoproct Bugula, which might be a synapomorphy of these taxa. Further synapomorphies might be the loss of the DHU arm of the tRNA L(UUR), the loss of the DHU arm of the tRNA S(UCN) and the unique anticodon sequence GAG of the tRNA L(CUN). The gene order of the mitochondrial genome of Flustra differs strongly from that of the other known ectoprocts. Phylogenetic analyses of mitochondrial nucleotide and amino acid data sets show that the lophophorate lineages are more closely related to trochozoan phyla than to deuterostomes or ecdysozoans confirming the Lophotrochozoa hypothesis. Furthermore, they support the monophyly of Cheilostomata and Ectoprocta. However, the relationships of the lophophorate lineages within Lophotrochozoa differ strongly depending on the data set and the used method. Different approaches for reducing heterogeneity in nucleotide and amino acid data sets and saturation did not result in a more robust resolution of lophotrochozoan relationships. Conclusion The contradictory and usually weakly supported phylogenetic

  17. [Phylogenetic analysis of Pleurotus species].

    PubMed

    Shnyreva, A A; Shnyreva, A V

    2015-02-01

    We performed phylogenetic analysis for ten Pleurotus species, based on internal transcribed spacer (ITS) sequences of rDNA. A phylogenetic tree was constructed on the basis of 31 oyster fungi strains of different origin and 10 reference sequences from GenBank. Our analysis demonstrates that the tested Pleurotus species are of monophyletic origin. We evaluated the evolutionary distances between these species. Classic genetic analysis of sexual compatibility based on monocaryon (mon)-mon crosses showed no reproductive barriers within the P. cornucopiae-P. euosmus species complex. Thus, despite the divergence (subclustering) between commercial strains and natural isolates of P. ostreatus revealed by phylogenetic analysis, there is no reproductive isolation between these groups. A common allele of the matB locus was identified for the commercial strains Sommer and L/4, supporting the common origin of these strains. PMID:25966583

  18. Geometric morphometric character suites as phylogenetic data: extracting phylogenetic signal from gastropod shells.

    PubMed

    Smith, Ursula E; Hendricks, Jonathan R

    2013-05-01

    . In 33-45% of the test cases (depending upon the approach used for measuring success), it was possible to place the pseudofossil taxon into the correct regions of the phylogeny using only the morphometric characters. This suggests that the incorporation of extinct Conus taxa into phylogenetic hypotheses will be possible, permitting a wide range of macroevolutionary questions to be addressed within this genus. This methodology also has potential to contribute to phylogenetic reconstructions for other major components of the fossil record that lack numerous discrete characters. PMID:23325808

  19. Interpreting the universal phylogenetic tree

    NASA Technical Reports Server (NTRS)

    Woese, C. R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist.

  20. Data supporting phylogenetic reconstructions of the Neotropical clade Gymnotiformes

    PubMed Central

    Tagliacollo, Victor A.; Bernt, Maxwell J.; Craig, Jack M.; Oliveira, Claudio; Albert, James S.

    2016-01-01

    Data is presented in support of model-based total evidence (MBTE) phylogenetic reconstructions of the Neotropical clade of Gymnotiformes “Model-based total evidence phylogeny of Neotropical electric knifefishes (Teleostei, Gymnotiformes)” (Tagliacollo et al., 2016) [1]). The MBTE phylogenies were inferred using a comprehensive dataset comprised of six genes (5277 bp) and 223 morphological characters for an ingroup taxon sample of 120 of 218 valid species and 33 of the 34 extant genera. The data in this article include primer sequences for gene amplification and sequencing, voucher information and GenBank accession numbers, descriptions of morphological characters, morphological synapomorphies for the recognized clades of Gymnotiformes, a supermatrix comprised of concatenated molecular and morphological data, and computer scripts to replicate MBTE inferences. We also included here Maximum-likelihood and Bayesian topologies, which support two main gymnotiform clades: Gymnotidae and Sternopygoidei, the latter comprised of Rhamphichthyoidea (Rhamphichthyidae+Hypopomidae) and Sinusoidea (Sternopygidae+Apteronotidae). PMID:26955648

  1. Data on phylogenetic analyses of gazelles (genus Gazella) based on mitochondrial and nuclear intron markers

    PubMed Central

    Lerp, Hannes; Klaus, Sebastian; Allgöwer, Stefanie; Wronski, Torsten; Pfenninger, Markus; Plath, Martin

    2016-01-01

    The data provided is related to the article “Phylogenetic analyses of gazelles reveal repeated transitions of key ecological traits and provide novel insights into the origin of the genus Gazella” [1]. The data is based on 48 tissue samples of all nine extant species of the genus Gazella, namely Gazella gazella, Gazella arabica, Gazella bennettii, Gazella cuvieri, Gazella dorcas, Gazella leptoceros, Gazella marica, Gazella spekei, and Gazella subgutturosa and four related taxa (Saiga tatarica, Antidorcas marsupialis, Antilope cervicapra and Eudorcas rufifrons). It comprises alignments of sequences of a cytochrome b data set and of six nuclear intron markers. For the latter new primers were designed based on cattle and sheep genomes. Based on these alignments phylogenetic trees were inferred using Bayesian Inference and Maximum Likelihood methods. Furthermore, ancestral character states (inferred with BayesTraits 1.0) and ancestral ranges based on a Dispersal-Extinction-Cladogenesis model were estimated and results׳ files were stored within this article. PMID:27054158

  2. Data on phylogenetic analyses of gazelles (genus Gazella) based on mitochondrial and nuclear intron markers.

    PubMed

    Lerp, Hannes; Klaus, Sebastian; Allgöwer, Stefanie; Wronski, Torsten; Pfenninger, Markus; Plath, Martin

    2016-06-01

    The data provided is related to the article "Phylogenetic analyses of gazelles reveal repeated transitions of key ecological traits and provide novel insights into the origin of the genus Gazella" [1]. The data is based on 48 tissue samples of all nine extant species of the genus Gazella, namely Gazella gazella, Gazella arabica, Gazella bennettii, Gazella cuvieri, Gazella dorcas, Gazella leptoceros, Gazella marica, Gazella spekei, and Gazella subgutturosa and four related taxa (Saiga tatarica, Antidorcas marsupialis, Antilope cervicapra and Eudorcas rufifrons). It comprises alignments of sequences of a cytochrome b data set and of six nuclear intron markers. For the latter new primers were designed based on cattle and sheep genomes. Based on these alignments phylogenetic trees were inferred using Bayesian Inference and Maximum Likelihood methods. Furthermore, ancestral character states (inferred with BayesTraits 1.0) and ancestral ranges based on a Dispersal-Extinction-Cladogenesis model were estimated and results׳ files were stored within this article. PMID:27054158

  3. Darwinian selection for sites of Asn-linked glycosylation in phylogenetically disparate eukaryotes and viruses.

    PubMed

    Cui, Jike; Smith, Temple; Robbins, Phillips W; Samuelson, John

    2009-08-11

    Numerous protists and rare fungi have truncated Asn-linked glycan precursors and lack N-glycan-dependent quality control (QC) systems for glycoprotein folding in the endoplasmic reticulum. Here, we show that the abundance of sequons (NXT or NXS), which are sites for N-glycosylation of secreted and membrane proteins, varies by more than a factor of 4 among phylogenetically diverse eukaryotes, based on a few variables. There is positive correlation between the density of sequons and the AT content of coding regions, although no causality can be inferred. In contrast, there appears to be Darwinian selection for sequons containing Thr, but not Ser, in eukaryotes that have N-glycan-dependent QC systems. Selection for sequons with Thr, which nearly doubles the sequon density in human secreted and membrane proteins, occurs by an increased conditional probability that Asn and Thr are present in sequons rather than elsewhere. Increasing sequon densities of the hemagglutinin (HA) of influenza viruses A/H3N2 and A/H1N1 during the past few decades of human infection also result from an increased conditional probability that Asn, Thr, and Ser are present in sequons rather than elsewhere. In contrast, there is no selection on sequons by this mechanism in HA of A/H5N1 or 2009 A/H1N1 (Swine flu). Very strong selection for sequons with both Thr and Ser in glycoprotein of M(r) 120,000 (gp120) of HIV and related retroviruses results from this same mechanism, as well as amino acid composition bias and increases in AT content. We conclude that there is Darwinian selection for sequons in phylogenetically disparate eukaryotes and viruses.

  4. Towards Context Sensitive Information Inference.

    ERIC Educational Resources Information Center

    Song, D.; Bruza, P. D.

    2003-01-01

    Discusses information inference from a psychologistic stance and proposes an information inference mechanism that makes inferences via computations of information flow through an approximation of a conceptual space. Highlights include cognitive economics of information processing; context sensitivity; and query models for information retrieval.…

  5. Multimodel inference and adaptive management

    USGS Publications Warehouse

    Rehme, S.E.; Powell, L.A.; Allen, C.R.

    2011-01-01

    Ecology is an inherently complex science coping with correlated variables, nonlinear interactions and multiple scales of pattern and process, making it difficult for experiments to result in clear, strong inference. Natural resource managers, policy makers, and stakeholders rely on science to provide timely and accurate management recommendations. However, the time necessary to untangle the complexities of interactions within ecosystems is often far greater than the time available to make management decisions. One method of coping with this problem is multimodel inference. Multimodel inference assesses uncertainty by calculating likelihoods among multiple competing hypotheses, but multimodel inference results are often equivocal. Despite this, there may be pressure for ecologists to provide management recommendations regardless of the strength of their study’s inference. We reviewed papers in the Journal of Wildlife Management (JWM) and the journal Conservation Biology (CB) to quantify the prevalence of multimodel inference approaches, the resulting inference (weak versus strong), and how authors dealt with the uncertainty. Thirty-eight percent and 14%, respectively, of articles in the JWM and CB used multimodel inference approaches. Strong inference was rarely observed, with only 7% of JWM and 20% of CB articles resulting in strong inference. We found the majority of weak inference papers in both journals (59%) gave specific management recommendations. Model selection uncertainty was ignored in most recommendations for management. We suggest that adaptive management is an ideal method to resolve uncertainty when research results in weak inference.

  6. Declarative memory, awareness, and transitive inference.

    PubMed

    Smith, Christine; Squire, Larry R

    2005-11-01

    A characteristic usually attributed to declarative memory is that what is learned is accessible to awareness. Recently, the relationship between awareness and declarative (hippocampus-dependent) memory has been questioned on the basis of findings from transitive inference tasks. In transitive inference, participants are first trained on overlapping pairs of items (e.g., A+B-, B+C-, C+D-, and D+E-, where + and - indicate correct and incorrect choices). Later, participants who choose B over D when presented with the novel pair BD are said to demonstrate transitive inference. The ability to exhibit transitive inference is thought to depend on the fact that participants have represented the stimulus elements hierarchically (i.e., A>B>C>D>E). We found that performance on five-item and six-item transitive inference tasks was closely related to awareness of the hierarchical relationship among the elements of the training pairs. Participants who were aware of the hierarchy performed near 100% correct on all tests of transitivity, but participants who were unaware of the hierarchy performed poorly (e.g., on transitive pair BD in the five-item problem; on transitive pairs BD, BE, and CE in the six-item problem). When the five-item task was administered to memory-impaired patients with damage thought to be limited to the hippocampal region, the patients were impaired at learning the training pairs. All patients were unaware of the hierarchy and, like unaware controls, performed poorly on the BD pair. The findings indicate that awareness is critical for robust performance on tests of transitive inference and support the view that awareness of what is learned is a fundamental characteristic of declarative memory.

  7. Inferring learners' knowledge from their actions.

    PubMed

    Rafferty, Anna N; LaMar, Michelle M; Griffiths, Thomas L

    2015-04-01

    Watching another person take actions to complete a goal and making inferences about that person's knowledge is a relatively natural task for people. This ability can be especially important in educational settings, where the inferences can be used for assessment, diagnosing misconceptions, and providing informative feedback. In this paper, we develop a general framework for automatically making such inferences based on observed actions; this framework is particularly relevant for inferring student knowledge in educational games and other interactive virtual environments. Our approach relies on modeling action planning: We formalize the problem as a Markov decision process in which one must choose what actions to take to complete a goal, where choices will be dependent on one's beliefs about how actions affect the environment. We use a variation of inverse reinforcement learning to infer these beliefs. Through two lab experiments, we show that this model can recover people's beliefs in a simple environment, with accuracy comparable to that of human observers. We then demonstrate that the model can be used to provide real-time feedback and to model data from an existing educational game.

  8. Inferring learners' knowledge from their actions.

    PubMed

    Rafferty, Anna N; LaMar, Michelle M; Griffiths, Thomas L

    2015-04-01

    Watching another person take actions to complete a goal and making inferences about that person's knowledge is a relatively natural task for people. This ability can be especially important in educational settings, where the inferences can be used for assessment, diagnosing misconceptions, and providing informative feedback. In this paper, we develop a general framework for automatically making such inferences based on observed actions; this framework is particularly relevant for inferring student knowledge in educational games and other interactive virtual environments. Our approach relies on modeling action planning: We formalize the problem as a Markov decision process in which one must choose what actions to take to complete a goal, where choices will be dependent on one's beliefs about how actions affect the environment. We use a variation of inverse reinforcement learning to infer these beliefs. Through two lab experiments, we show that this model can recover people's beliefs in a simple environment, with accuracy comparable to that of human observers. We then demonstrate that the model can be used to provide real-time feedback and to model data from an existing educational game. PMID:25155381

  9. Identifiability of large phylogenetic mixture models.

    PubMed

    Rhodes, John A; Sullivant, Seth

    2012-01-01

    Phylogenetic mixture models are statistical models of character evolution allowing for heterogeneity. Each of the classes in some unknown partition of the characters may evolve by different processes, or even along different trees. Such models are of increasing interest for data analysis, as they can capture the variety of evolutionary processes that may be occurring across long sequences of DNA or proteins. The fundamental question of whether parameters of such a model are identifiable is difficult to address, due to the complexity of the parameterization. Identifiability is, however, essential to their use for statistical inference.We analyze mixture models on large trees, with many mixture components, showing that both numerical and tree parameters are indeed identifiable in these models when all trees are the same. This provides a theoretical justification for some current empirical studies, and indicates that extensions to even more mixture components should be theoretically well behaved. We also extend our results to certain mixtures on different trees, using the same algebraic techniques.

  10. Visual Inference Programming

    NASA Technical Reports Server (NTRS)

    Wheeler, Kevin; Timucin, Dogan; Rabbette, Maura; Curry, Charles; Allan, Mark; Lvov, Nikolay; Clanton, Sam; Pilewskie, Peter

    2002-01-01

    The goal of visual inference programming is to develop a software framework data analysis and to provide machine learning algorithms for inter-active data exploration and visualization. The topics include: 1) Intelligent Data Understanding (IDU) framework; 2) Challenge problems; 3) What's new here; 4) Framework features; 5) Wiring diagram; 6) Generated script; 7) Results of script; 8) Initial algorithms; 9) Independent Component Analysis for instrument diagnosis; 10) Output sensory mapping virtual joystick; 11) Output sensory mapping typing; 12) Closed-loop feedback mu-rhythm control; 13) Closed-loop training; 14) Data sources; and 15) Algorithms. This paper is in viewgraph form.

  11. Graphical inference for Infovis.

    PubMed

    Wickham, Hadley; Cook, Dianne; Hofmann, Heike; Buja, Andreas

    2010-01-01

    How do we know if what we see is really there? When visualizing data, how do we avoid falling into the trap of apophenia where we see patterns in random noise? Traditionally, infovis has been concerned with discovering new relationships, and statistics with preventing spurious relationships from being reported. We pull these opposing poles closer with two new techniques for rigorous statistical inference of visual discoveries. The "Rorschach" helps the analyst calibrate their understanding of uncertainty and "line-up" provides a protocol for assessing the significance of visual discoveries, protecting against the discovery of spurious structure.

  12. Inferring tumor progression from genomic heterogeneity

    PubMed Central

    Navin, Nicholas; Krasnitz, Alexander; Rodgers, Linda; Cook, Kerry; Meth, Jennifer; Kendall, Jude; Riggs, Michael; Eberling, Yvonne; Troge, Jennifer; Grubor, Vladimir; Levy, Dan; Lundin, Pär; Månér, Susanne; Zetterberg, Anders; Hicks, James; Wigler, Michael

    2010-01-01

    Cancer progression in humans is difficult to infer because we do not routinely sample patients at multiple stages of their disease. However, heterogeneous breast tumors provide a unique opportunity to study human tumor progression because they still contain evidence of early and intermediate subpopulations in the form of the phylogenetic relationships. We have developed a method we call Sector-Ploidy-Profiling (SPP) to study the clonal composition of breast tumors. SPP involves macro-dissecting tumors, flow-sorting genomic subpopulations by DNA content, and profiling genomes using comparative genomic hybridization (CGH). Breast carcinomas display two classes of genomic structural variation: (1) monogenomic and (2) polygenomic. Monogenomic tumors appear to contain a single major clonal subpopulation with a highly stable chromosome structure. Polygenomic tumors contain multiple clonal tumor subpopulations, which may occupy the same sectors, or separate anatomic locations. In polygenomic tumors, we show that heterogeneity can be ascribed to a few clonal subpopulations, rather than a series of gradual intermediates. By comparing multiple subpopulations from different anatomic locations, we have inferred pathways of cancer progression and the organization of tumor growth. PMID:19903760

  13. Inferring tumor progression from genomic heterogeneity.

    PubMed

    Navin, Nicholas; Krasnitz, Alexander; Rodgers, Linda; Cook, Kerry; Meth, Jennifer; Kendall, Jude; Riggs, Michael; Eberling, Yvonne; Troge, Jennifer; Grubor, Vladimir; Levy, Dan; Lundin, Pär; Månér, Susanne; Zetterberg, Anders; Hicks, James; Wigler, Michael

    2010-01-01

    Cancer progression in humans is difficult to infer because we do not routinely sample patients at multiple stages of their disease. However, heterogeneous breast tumors provide a unique opportunity to study human tumor progression because they still contain evidence of early and intermediate subpopulations in the form of the phylogenetic relationships. We have developed a method we call Sector-Ploidy-Profiling (SPP) to study the clonal composition of breast tumors. SPP involves macro-dissecting tumors, flow-sorting genomic subpopulations by DNA content, and profiling genomes using comparative genomic hybridization (CGH). Breast carcinomas display two classes of genomic structural variation: (1) monogenomic and (2) polygenomic. Monogenomic tumors appear to contain a single major clonal subpopulation with a highly stable chromosome structure. Polygenomic tumors contain multiple clonal tumor subpopulations, which may occupy the same sectors, or separate anatomic locations. In polygenomic tumors, we show that heterogeneity can be ascribed to a few clonal subpopulations, rather than a series of gradual intermediates. By comparing multiple subpopulations from different anatomic locations, we have inferred pathways of cancer progression and the organization of tumor growth.

  14. Circular inferences in schizophrenia.

    PubMed

    Jardri, Renaud; Denève, Sophie

    2013-11-01

    A considerable number of recent experimental and computational studies suggest that subtle impairments of excitatory to inhibitory balance or regulation are involved in many neurological and psychiatric conditions. The current paper aims to relate, specifically and quantitatively, excitatory to inhibitory imbalance with psychotic symptoms in schizophrenia. Considering that the brain constructs hierarchical causal models of the external world, we show that the failure to maintain the excitatory to inhibitory balance results in hallucinations as well as in the formation and subsequent consolidation of delusional beliefs. Indeed, the consequence of excitatory to inhibitory imbalance in a hierarchical neural network is equated to a pathological form of causal inference called 'circular belief propagation'. In circular belief propagation, bottom-up sensory information and top-down predictions are reverberated, i.e. prior beliefs are misinterpreted as sensory observations and vice versa. As a result, these predictions are counted multiple times. Circular inference explains the emergence of erroneous percepts, the patient's overconfidence when facing probabilistic choices, the learning of 'unshakable' causal relationships between unrelated events and a paradoxical immunity to perceptual illusions, which are all known to be associated with schizophrenia. PMID:24065721

  15. Circular inferences in schizophrenia.

    PubMed

    Jardri, Renaud; Denève, Sophie

    2013-11-01

    A considerable number of recent experimental and computational studies suggest that subtle impairments of excitatory to inhibitory balance or regulation are involved in many neurological and psychiatric conditions. The current paper aims to relate, specifically and quantitatively, excitatory to inhibitory imbalance with psychotic symptoms in schizophrenia. Considering that the brain constructs hierarchical causal models of the external world, we show that the failure to maintain the excitatory to inhibitory balance results in hallucinations as well as in the formation and subsequent consolidation of delusional beliefs. Indeed, the consequence of excitatory to inhibitory imbalance in a hierarchical neural network is equated to a pathological form of causal inference called 'circular belief propagation'. In circular belief propagation, bottom-up sensory information and top-down predictions are reverberated, i.e. prior beliefs are misinterpreted as sensory observations and vice versa. As a result, these predictions are counted multiple times. Circular inference explains the emergence of erroneous percepts, the patient's overconfidence when facing probabilistic choices, the learning of 'unshakable' causal relationships between unrelated events and a paradoxical immunity to perceptual illusions, which are all known to be associated with schizophrenia.

  16. Disentangling niche and neutral influences on community assembly: assessing the performance of community phylogenetic structure tests.

    PubMed

    Kembel, Steven W

    2009-09-01

    Patterns of phylogenetic relatedness within communities have been widely used to infer the importance of different ecological and evolutionary processes during community assembly, but little is known about the relative ability of community phylogenetics methods and null models to detect the signature of processes such as dispersal, competition and filtering under different models of trait evolution. Using a metacommunity simulation incorporating quantitative models of trait evolution and community assembly, I assessed the performance of different tests that have been used to measure community phylogenetic structure. All tests were sensitive to the relative phylogenetic signal in species metacommunity abundances and traits; methods that were most sensitive to the effects of niche-based processes on community structure were also more likely to find non-random patterns of community phylogenetic structure under dispersal assembly. When used with a null model that maintained species occurrence frequency in random communities, several metrics could detect niche-based assembly when there was strong phylogenetic signal in species traits, when multiple traits were involved in community assembly, and in the presence of environmental heterogeneity. Interpretations of the causes of community phylogenetic structure should be modified to account for the influence of dispersal.

  17. Phylogenetic relationships of American willows (Salix L., Salicaceae).

    PubMed

    Lauron-Moreau, Aurélien; Pitre, Frédéric E; Argus, George W; Labrecque, Michel; Brouillet, Luc

    2015-01-01

    Salix L. is the largest genus in the family Salicaceae (450 species). Several classifications have been published, but taxonomic subdivision has been under continuous revision. Our goal is to establish the phylogenetic structure of the genus using molecular data on all American willows, using three DNA markers. This complete phylogeny of American willows allows us to propose a biogeographic framework for the evolution of the genus. Material was obtained for the 122 native and introduced willow species of America. Sequences were obtained from the ITS (ribosomal nuclear DNA) and two plastid regions, matK and rbcL. Phylogenetic analyses (parsimony, maximum likelihood, Bayesian inference) were performed on the data. Geographic distribution was mapped onto the tree. The species tree provides strong support for a division of the genus into two subgenera, Salix and Vetrix. Subgenus Salix comprises temperate species from the Americas and Asia, and their disjunction may result from Tertiary events. Subgenus Vetrix is composed of boreo-arctic species of the Northern Hemisphere and their radiation may coincide with the Quaternary glaciations. Sixteen species have ambiguous positions; genetic diversity is lower in subg. Vetrix. A molecular phylogeny of all species of American willows has been inferred. It needs to be tested and further resolved using other molecular data. Nonetheless, the genus clearly has two clades that have distinct biogeographic patterns.

  18. Phylogenetic relationships of American willows (Salix L., Salicaceae).

    PubMed

    Lauron-Moreau, Aurélien; Pitre, Frédéric E; Argus, George W; Labrecque, Michel; Brouillet, Luc

    2015-01-01

    Salix L. is the largest genus in the family Salicaceae (450 species). Several classifications have been published, but taxonomic subdivision has been under continuous revision. Our goal is to establish the phylogenetic structure of the genus using molecular data on all American willows, using three DNA markers. This complete phylogeny of American willows allows us to propose a biogeographic framework for the evolution of the genus. Material was obtained for the 122 native and introduced willow species of America. Sequences were obtained from the ITS (ribosomal nuclear DNA) and two plastid regions, matK and rbcL. Phylogenetic analyses (parsimony, maximum likelihood, Bayesian inference) were performed on the data. Geographic distribution was mapped onto the tree. The species tree provides strong support for a division of the genus into two subgenera, Salix and Vetrix. Subgenus Salix comprises temperate species from the Americas and Asia, and their disjunction may result from Tertiary events. Subgenus Vetrix is composed of boreo-arctic species of the Northern Hemisphere and their radiation may coincide with the Quaternary glaciations. Sixteen species have ambiguous positions; genetic diversity is lower in subg. Vetrix. A molecular phylogeny of all species of American willows has been inferred. It needs to be tested and further resolved using other molecular data. Nonetheless, the genus clearly has two clades that have distinct biogeographic patterns. PMID:25880993

  19. Phylogenetic Relationships of American Willows (Salix L., Salicaceae)

    PubMed Central

    Lauron-Moreau, Aurélien; Pitre, Frédéric E.; Argus, George W.; Labrecque, Michel; Brouillet, Luc

    2015-01-01

    Salix L. is the largest genus in the family Salicaceae (450 species). Several classifications have been published, but taxonomic subdivision has been under continuous revision. Our goal is to establish the phylogenetic structure of the genus using molecular data on all American willows, using three DNA markers. This complete phylogeny of American willows allows us to propose a biogeographic framework for the evolution of the genus. Material was obtained for the 122 native and introduced willow species of America. Sequences were obtained from the ITS (ribosomal nuclear DNA) and two plastid regions, matK and rbcL. Phylogenetic analyses (parsimony, maximum likelihood, Bayesian inference) were performed on the data. Geographic distribution was mapped onto the tree. The species tree provides strong support for a division of the genus into two subgenera, Salix and Vetrix. Subgenus Salix comprises temperate species from the Americas and Asia, and their disjunction may result from Tertiary events. Subgenus Vetrix is composed of boreo-arctic species of the Northern Hemisphere and their radiation may coincide with the Quaternary glaciations. Sixteen species have ambiguous positions; genetic diversity is lower in subg. Vetrix. A molecular phylogeny of all species of American willows has been inferred. It needs to be tested and further resolved using other molecular data. Nonetheless, the genus clearly has two clades that have distinct biogeographic patterns. PMID:25880993

  20. Phylogeny, historical biogeography, and patterns of diversification for Pinus (Pinaceae): phylogenetic tests of fossil-based hypotheses.

    PubMed

    Eckert, Andrew J; Hall, Benjamin D

    2006-07-01

    Pines comprise one of the largest coniferous genera, are distributed throughout the Northern Hemisphere, and have an abundant fossil record. Distributions of fossils have been used to derive a three-step hypothesis of early pine evolution, which postulates a Mesozoic origin for the genus, east-west expansions across Laurasia, and retraction into Eocene refugia. Here, we present phylogenetic tests of this hypothesis using chloroplast sequence data from four loci for 83 pine species. We used the fossil-based hypothesis to derive null expectations concerning monophyly of taxonomic groups, dates of cladogenesis, and patterns of diversification. Phylogenetic analyses using several algorithms subsequently provided rigorous tests of these expectations. Our inferred phylogenies illustrated broad congruence with taxonomic groups, but highlighted consistent problems within subgenus Strobus. Estimated minimum dates of divergence derived from relaxed clock methods were largely consistent with the fossil record and yielded a date for the ingroup node of Pinus of 128+/-4 mya, depending upon the calibration used for subgenus Pinus. Ancestral area reconstructions showed Pinus to have most likely originated in Eurasia. Major clades differed in biogeographic patterns, but were consistent with the fossil-based hypothesis. We found weak support, however, for a change in diversification rate in the Eocene as interpretations of fossil distributions would have predicted.

  1. Transdimensional inference in the geosciences.

    PubMed

    Sambridge, M; Bodin, T; Gallagher, K; Tkalcic, H

    2013-02-13

    Seismologists construct images of the Earth's interior structure using observations, derived from seismograms, collected at the surface. A common approach to such inverse problems is to build a single 'best' Earth model, in some sense. This is despite the fact that the observations by themselves often do not require, or even allow, a single best-fit Earth model to exist. Interpretation of optimal models can be fraught with difficulties, particularly when formal uncertainty estimates become heavily dependent on the regularization imposed. Similar issues occur across the physical sciences with model construction in ill-posed problems. An alternative approach is to embrace the non-uniqueness directly and employ an inference process based on parameter space sampling. Instead of seeking a best model within an optimization framework, one seeks an ensemble of solutions and derives properties of that ensemble for inspection. While this idea has itself been employed for more than 30 years, it is now receiving increasing attention in the geosciences. Recently, it has been shown that transdimensional and hierarchical sampling methods have some considerable benefits for problems involving multiple parameter types, uncertain data errors and/or uncertain model parametrizations, as are common in seismology. Rather than being forced to make decisions on parametrization, the level of data noise and the weights between data types in advance, as is often the case in an optimization framework, the choice can be informed by the data themselves. Despite the relatively high computational burden involved, the number of areas where sampling methods are now feasible is growing rapidly. The intention of this article is to introduce concepts of transdimensional inference to a general readership and illustrate with particular seismological examples. A growing body of references provide necessary detail. PMID:23277604

  2. The phylogenetic structure of plant-pollinator networks increases with habitat size and isolation.

    PubMed

    Aizen, Marcelo A; Gleiser, Gabriela; Sabatino, Malena; Gilarranz, Luis J; Bascompte, Jordi; Verdú, Miguel

    2016-01-01

    Similarity among species in traits related to ecological interactions is frequently associated with common ancestry. Thus, closely related species usually interact with ecologically similar partners, which can be reinforced by diverse co-evolutionary processes. The effect of habitat fragmentation on the phylogenetic signal in interspecific interactions and correspondence between plant and animal phylogenies is, however, unknown. Here, we address to what extent phylogenetic signal and co-phylogenetic congruence of plant-animal interactions depend on habitat size and isolation by analysing the phylogenetic structure of 12 pollination webs from isolated Pampean hills. Phylogenetic signal in interspecific interactions differed among webs, being stronger for flower-visiting insects than plants. Phylogenetic signal and overall co-phylogenetic congruence increased independently with hill size and isolation. We propose that habitat fragmentation would erode the phylogenetic structure of interaction webs. A decrease in phylogenetic signal and co-phylogenetic correspondence in plant-pollinator interactions could be associated with less reliable mutualism and erratic co-evolutionary change.

  3. Social engagement and attachment: a phylogenetic perspective.

    PubMed

    Porges, Stephen W

    2003-12-01

    This article focuses on the importance of social engagement and the behavioral and neurophysiological mechanisms that allow individuals to reduce psychological and physical distance. A model of social engagement derived from the Polyvagal Theory is presented. The model emphasizes phylogeny as an organizing principle and includes the following points: (1) there are well-defined neural circuits to support social engagement behaviors and the defensive strategies of fight, flight, and freeze; (2) these neural circuits form a phylogenetically organized hierarchy; (3) without being dependent on conscious awareness, the nervous system evaluates risk in the environment and regulates the expression of adaptive behavior to match the neuroception of a safe, dangerous, or life-threatening environment; (4) social engagement behaviors and the benefits of the physiological states associated with social support require a neuroception of safety; (5) social behaviors associated with nursing, reproduction, and the formation of strong pair bonds require immobilization without fear; and (6) immobilization without fear is mediated by a co-opting of the neural circuit regulating defensive freezing behaviors through the involvement of oxytocin, a neuropeptide in mammals involved in the formation of social bonds. The model provides a phylogenetic interpretation of the neural mechanisms mediating the behavioral and physiological features associated with stress and several psychiatric disorders.

  4. Terraces in phylogenetic tree space.

    PubMed

    Sanderson, Michael J; McMahon, Michelle M; Steel, Mike

    2011-07-22

    A key step in assembling the tree of life is the construction of species-rich phylogenies from multilocus--but often incomplete--sequence data sets. We describe previously unknown structure in the landscape of solutions to the tree reconstruction problem, comprising sometimes vast "terraces" of trees with identical quality, arranged on islands of phylogenetically similar trees. Phylogenetic ambiguity within a terrace can be characterized efficiently and then ameliorated by new algorithms for obtaining a terrace's maximum-agreement subtree or by identifying the smallest set of new targets for additional sequencing. Algorithms to find optimal trees or estimate Bayesian posterior tree distributions may need to navigate strategically in the neighborhood of large terraces in tree space.

  5. Genotyping-by-sequencing as a tool to infer phylogeny and ancestral hybridization: a case study in Carex (Cyperaceae).

    PubMed

    Escudero, Marcial; Eaton, Deren A R; Hahn, Marlene; Hipp, Andrew L

    2014-10-01

    Determining phylogenetic relationships among very closely related species has remained a challenge for evolutionary biologists due to interlocus phylogenetic discordance and the difficulty of obtaining variable markers. Here, we used a Genotyping-by-Sequencing (GBS) approach to sample a reduced representation genomic data set and infer the phylogeny of seven closely related species in the genus Carex (Cyperaceae). Past attempts to reconstruct phylogenetic relationships among these species produced conflicting and poorly-supported results. We inferred a robust phylogeny based on >3000 GBS loci and >1300 SNPs (with a minimum sequence depth within individuals of 10) using maximum likelihood and Bayesian inference. We also tested for historical introgression using the D-statistic test. We compared these analyses with partitioned RAD analysis, which is designed to identify suboptimal trees reflecting secondary phylogenetic signal that may be obscured by the dominant signal in the data. Phylogenetic analyses yielded fully resolved trees with high support. We found two main clades, one grouping Carex scoparia populations and C. waponahkikensis, and a second clade grouping C. longii, C. vexans, C. suberecta and C. albolutescens. We detected marginally significant signals of introgression between C. scoparia and C. suberecta or C. albolutescens, and we rejected a hybrid origin hypothesis for C. waponahkikensis. Our results demonstrate the power of NGS data sets for resolving some of the most difficult phylogenetic challenges where traditional phylogenetic markers have failed. PMID:25010772

  6. Genotyping-by-sequencing as a tool to infer phylogeny and ancestral hybridization: a case study in Carex (Cyperaceae).

    PubMed

    Escudero, Marcial; Eaton, Deren A R; Hahn, Marlene; Hipp, Andrew L

    2014-10-01

    Determining phylogenetic relationships among very closely related species has remained a challenge for evolutionary biologists due to interlocus phylogenetic discordance and the difficulty of obtaining variable markers. Here, we used a Genotyping-by-Sequencing (GBS) approach to sample a reduced representation genomic data set and infer the phylogeny of seven closely related species in the genus Carex (Cyperaceae). Past attempts to reconstruct phylogenetic relationships among these species produced conflicting and poorly-supported results. We inferred a robust phylogeny based on >3000 GBS loci and >1300 SNPs (with a minimum sequence depth within individuals of 10) using maximum likelihood and Bayesian inference. We also tested for historical introgression using the D-statistic test. We compared these analyses with partitioned RAD analysis, which is designed to identify suboptimal trees reflecting secondary phylogenetic signal that may be obscured by the dominant signal in the data. Phylogenetic analyses yielded fully resolved trees with high support. We found two main clades, one grouping Carex scoparia populations and C. waponahkikensis, and a second clade grouping C. longii, C. vexans, C. suberecta and C. albolutescens. We detected marginally significant signals of introgression between C. scoparia and C. suberecta or C. albolutescens, and we rejected a hybrid origin hypothesis for C. waponahkikensis. Our results demonstrate the power of NGS data sets for resolving some of the most difficult phylogenetic challenges where traditional phylogenetic markers have failed.

  7. morePhyML: improving the phylogenetic tree space exploration with PhyML 3.

    PubMed

    Criscuolo, Alexis

    2011-12-01

    PhyML is a widely used Maximum Likelihood (ML) phylogenetic tree inference software based on a standard hill-climbing method. Starting from an initial tree, the version 3 of PhyML explores the tree space by using "Nearest Neighbor Interchange" (NNI) or "Subtree Pruning and Regrafting" (SPR) tree swapping techniques in order to find the ML phylogenetic tree. NNI-based local searches are fast but can often get trapped in local optima, whereas it is expected that the larger (but slower to cover) SPR-based neighborhoods will lead to trees with higher likelihood. Here, I verify that PhyML infers more likely trees with SPRs than with NNIs in almost all cases. However, I also show that the SPR-based local search of PhyML often does not succeed at locating the ML tree. To improve the tree space exploration, I deliver a script, named morePhyML, which allows escaping from local optima by performing character reweighting. This ML tree search strategy, named ratchet, often leads to higher likelihood estimates. Based on the analysis of a large number of amino acid and nucleotide data, I show that morePhyML allows inferring more accurate phylogenetic trees than several other recently developed ML tree inference softwares in many cases.

  8. Phylogeographic ancestral inference using the coalescent model on haplotype trees.

    PubMed

    Manolopoulou, Ioanna; Emerson, Brent C

    2012-06-01

    Phylogeographic ancestral inference is issue frequently arising in population ecology that aims to understand the geographical roots and structure of species. Here, we specifically address relatively small scale mtDNA datasets (typically less than 500 sequences with fewer than 1000 nucleotides), focusing on ancestral location inference. Our approach uses a coalescent modelling framework projected onto haplotype trees in order to reduce computational complexity, at the same time adhering to complex evolutionary processes. Statistical innovations of the last few years have allowed for computationally feasible yet accurate inferences in phylogenetic frameworks. We implement our methods on a set of synthetic datasets and show how, despite high uncertainty in terms of identifying the root haplotype, estimation of the ancestral location naturally encompasses lower uncertainty, allowing us to pinpoint the Maximum A Posteriori estimates for ancestral locations. We exemplify our methods on a set of synthetic datasets and then combine our inference methods with the phylogeographic clustering approach presented in Manolopoulou et al. (2011) on a real dataset from weevils in the Iberian peninsula in order to infer ancestral locations as well as population substructure.

  9. Phylogenetic methods in drug discovery.

    PubMed

    Ashton, John C

    2013-12-01

    In recent decades, growth of computing power has facilitated powerful techniques for reconstructing evolutionary relationships from online genetic and proteomic databases. These methods are useful tools for pharmacologists for analyzing relationships between receptors and associated enzymes. Phylogenetic analysis can help generate hypotheses and leads for experimentation. Reconstruction of molecular phylogenies for the nonspecialist is described in this article using the example of the orphaned g protein coupled receptor GPR18.

  10. Detection of Horizontal Gene Transfers from Phylogenetic Comparisons

    PubMed Central

    Pylro, Victor Satler; Vespoli, Luciano de Souza; Duarte, Gabriela Frois; Yotoko, Karla Suemy Clemente

    2012-01-01

    Bacterial phylogenies have become one of the most important challenges for microbial ecology. This field started in the mid-1970s with the aim of using the sequence of the small subunit ribosomal RNA (16S) tool to infer bacterial phylogenies. Phylogenetic hypotheses based on other sequences usually give conflicting topologies that reveal different evolutionary histories, which in some cases may be the result of horizontal gene transfer events. Currently, one of the major goals of molecular biology is to understand the role that horizontal gene transfer plays in species adaptation and evolution. In this work, we compared the phylogenetic tree based on 16S with the tree based on dszC, a gene involved in the cleavage of carbon-sulfur bonds. Bacteria of several genera perform this survival task when living in environments lacking free mineral sulfur. The biochemical pathway of the desulphurization process was extensively studied due to its economic importance, since this step is expensive and indispensable in fuel production. Our results clearly show that horizontal gene transfer events could be detected using common phylogenetic methods with gene sequences obtained from public sequence databases. PMID:22675653

  11. Phylogenetic analysis of Maverick/Polinton giant transposons across organisms.

    PubMed

    Haapa-Paananen, Saija; Wahlberg, Niklas; Savilahti, Harri

    2014-09-01

    Polintons are a recently discovered group of large transposable elements (<40Kb in size) encoding up to 10 different proteins. The increasing number of genome sequencing projects has led to the discovery of these elements in genomes of protists, fungi, and animals, but not in plants. The RepBase database of eukaryotic repetitive elements currently contains consensus sequences and information of 70 Polinton elements from 28 organisms. Previous phylogenetic analyses have shown the relationship of Polintons to linear plasmids, bacteriophages, and retroviruses. However, a comprehensive phylogenetic analysis of all known Polintons has been lacking. We retrieved the Polinton consensus sequences from the most recent version of RepBase, and compiled amino acid sequences for the two most common Polinton-specific genes, the DNA polymerase-B and retroviral-like integrase. Open reading frame predictions and homology comparisons revealed partial or full sequences for 54 polymerases and 55 Polinton integrases. Multiple sequence alignments portrayed conservation in several functional motifs of these proteins. Phylogenetic analyses based on Bayesian inference using single- and combined-gene datasets revealed seven distinct lineages of Polintons that broadly follow the tree of life. Two of the seven lineages are found within the same species, indicating that ancient divergences have been retained to this day.

  12. Phylogenetic analysis of otospiralin protein

    PubMed Central

    Torktaz, Ibrahim; Behjati, Mohaddeseh; Rostami, Amin

    2016-01-01

    Background: Fibrocyte-specific protein, otospiralin, is a small protein, widely expressed in the central nervous system as neuronal cell bodies and glia. The increased expression of otospiralin in reactive astrocytes implicates its role in signaling pathways and reparative mechanisms subsequent to injury. Indeed, otospiralin is considered to be essential for the survival of fibrocytes of the mesenchymal nonsensory regions of the cochlea. It seems that other functions of this protein are not yet completely understood. Materials and Methods: Amino acid sequences of otospiralin from 12 vertebrates were derived from National Center for Biotechnology Information database. Phylogenetic analysis and phylogeny estimation were performed using MEGA 5.0.5 program, and neighbor-joining tree was constructed by this software. Results: In this computational study, the phylogenetic tree of otospiralin has been investigated. Therefore, dendrograms of otospiralin were depicted. Alignment performed in MUSCLE method by UPGMB algorithm. Also, entropy plot determined for a better illustration of amino acid variations in this protein. Conclusion: In the present study, we used otospiralin sequence of 12 different species and by constructing phylogenetic tree, we suggested out group for some related species. PMID:27099854

  13. Phylogenetic analysis of trophic associations.

    PubMed

    Ives, A R; Godfray, H C J

    2006-07-01

    Ecologists frequently collect data on the patterns of association between adjacent trophic levels in the form of binary or quantitative food webs. Here, we develop statistical methods to estimate the roles of consumer and resource phylogenies in explaining patterns of consumer-resource association. We use these methods to ask whether closely related consumer species are more likely to attack the same resource species and whether closely related resource species are more likely to be attacked by the same consumer species. We then show how to use estimates of phylogenetic signals to predict novel consumer-resource associations solely from the phylogenetic position of species for which no other (or only partial) data are available. Finally, we show how to combine phylogenetic information with information about species' ecological characteristics and life-history traits to estimate the effects of species traits on consumer-resource associations while accounting for phylogenies. We illustrate these techniques using a food web comprising species of parasitoids, leaf-mining moths, and their host plants.

  14. Bayesian inference in geomagnetism

    NASA Technical Reports Server (NTRS)

    Backus, George E.

    1988-01-01

    The inverse problem in empirical geomagnetic modeling is investigated, with critical examination of recently published studies. Particular attention is given to the use of Bayesian inference (BI) to select the damping parameter lambda in the uniqueness portion of the inverse problem. The mathematical bases of BI and stochastic inversion are explored, with consideration of bound-softening problems and resolution in linear Gaussian BI. The problem of estimating the radial magnetic field B(r) at the earth core-mantle boundary from surface and satellite measurements is then analyzed in detail, with specific attention to the selection of lambda in the studies of Gubbins (1983) and Gubbins and Bloxham (1985). It is argued that the selection method is inappropriate and leads to lambda values much larger than those that would result if a reasonable bound on the heat flow at the CMB were assumed.

  15. BIE: Bayesian Inference Engine

    NASA Astrophysics Data System (ADS)

    Weinberg, Martin D.

    2013-12-01

    The Bayesian Inference Engine (BIE) is an object-oriented library of tools written in C++ designed explicitly to enable Bayesian update and model comparison for astronomical problems. To facilitate "what if" exploration, BIE provides a command line interface (written with Bison and Flex) to run input scripts. The output of the code is a simulation of the Bayesian posterior distribution from which summary statistics e.g. by taking moments, or determine confidence intervals and so forth, can be determined. All of these quantities are fundamentally integrals and the Markov Chain approach produces variates heta distributed according to P( heta|D) so moments are trivially obtained by summing of the ensemble of variates.

  16. Nonparametric causal inference for bivariate time series.

    PubMed

    McCracken, James M; Weigel, Robert S

    2016-02-01

    We introduce new quantities for exploratory causal inference between bivariate time series. The quantities, called penchants and leanings, are computationally straightforward to apply, follow directly from assumptions of probabilistic causality, do not depend on any assumed models for the time series generating process, and do not rely on any embedding procedures; these features may provide a clearer interpretation of the results than those from existing time series causality tools. The penchant and leaning are computed based on a structured method for computing probabilities.

  17. Nonparametric causal inference for bivariate time series

    NASA Astrophysics Data System (ADS)

    McCracken, James M.; Weigel, Robert S.

    2016-02-01

    We introduce new quantities for exploratory causal inference between bivariate time series. The quantities, called penchants and leanings, are computationally straightforward to apply, follow directly from assumptions of probabilistic causality, do not depend on any assumed models for the time series generating process, and do not rely on any embedding procedures; these features may provide a clearer interpretation of the results than those from existing time series causality tools. The penchant and leaning are computed based on a structured method for computing probabilities.

  18. Semiparametric Bayesian Inference for Multilevel Repeated Measurement Data

    PubMed Central

    Müller, Peter; Quintana, Fernando A.; Rosner, Gary L.

    2011-01-01

    Summary We discuss inference for data with repeated measurements at multiple levels. The motivating example is data with blood counts from cancer patients undergoing multiple cycles of chemotherapy, with days nested within cycles. Some inference questions relate to repeated measurements over days within cycle, while other questions are concerned with the dependence across cycles. When the desired inference relates to both levels of repetition, it becomes important to reflect the data structure in the model. We develop a semiparametric Bayesian modeling approach, restricting attention to two levels of repeated measurements. For the top-level longitudinal sampling model we use random effects to introduce the desired dependence across repeated measurements. We use a nonparametric prior for the random effects distribution. Inference about dependence across second-level repetition is implemented by the clustering implied in the nonparametric random effects model. Practical use of the model requires that the posterior distribution on the latent random effects be reasonably precise. PMID:17447954

  19. The evolution of respiratory O2/NO reductases: an out-of-the-phylogenetic-box perspective

    PubMed Central

    Ducluzeau, Anne-Lise; Schoepp-Cothenet, Barbara; van Lis, Robert; Baymann, Frauke; Russell, Michael J.; Nitschke, Wolfgang

    2014-01-01

    Complex life on our planet crucially depends on strong redox disequilibria afforded by the almost ubiquitous presence of highly oxidizing molecular oxygen. However, the history of O2-levels in the atmosphere is complex and prior to the Great Oxidation Event some 2.3 billion years ago, the amount of O2 in the biosphere is considered to have been extremely low as compared with present-day values. Therefore the evolutionary histories of life and of O2-levels are likely intricately intertwined. The obvious biological proxy for inferring the impact of changing O2-levels on life is the evolutionary history of the enzyme allowing organisms to tap into the redox power of molecular oxygen, i.e. the bioenergetic O2 reductases, alias the cytochrome and quinol oxidases. Consequently, molecular phylogenies reconstructed for this enzyme superfamily have been exploited over the last two decades in attempts to elucidate the interlocking between O2 levels in the environment and the evolution of respiratory bioenergetic processes. Although based on strictly identical datasets, these phylogenetic approaches have led to diametrically opposite scenarios with respect to the history of both the enzyme superfamily and molecular oxygen on the Earth. In an effort to overcome the deadlock of molecular phylogeny, we here review presently available structural, functional, palaeogeochemical and thermodynamic information pertinent to the evolution of the superfamily (which notably also encompasses the subfamily of nitric oxide reductases). The scenario which, in our eyes, most closely fits the ensemble of these non-phylogenetic data, sees the low O2-affinity SoxM- (or A-) type enzymes as the most recent evolutionary innovation and the high-affinity O2 reductases (SoxB or B and cbb3 or C) as arising independently from NO-reducing precursor enzymes. PMID:24968694

  20. The evolution of respiratory O2/NO reductases: an out-of-the-phylogenetic-box perspective.

    PubMed

    Ducluzeau, Anne-Lise; Schoepp-Cothenet, Barbara; van Lis, Robert; Baymann, Frauke; Russell, Michael J; Nitschke, Wolfgang

    2014-09-01

    Complex life on our planet crucially depends on strong redox disequilibria afforded by the almost ubiquitous presence of highly oxidizing molecular oxygen. However, the history of O2-levels in the atmosphere is complex and prior to the Great Oxidation Event some 2.3 billion years ago, the amount of O2 in the biosphere is considered to have been extremely low as compared with present-day values. Therefore the evolutionary histories of life and of O2-levels are likely intricately intertwined. The obvious biological proxy for inferring the impact of changing O2-levels on life is the evolutionary history of the enzyme allowing organisms to tap into the redox power of molecular oxygen, i.e. the bioenergetic O2 reductases, alias the cytochrome and quinol oxidases. Consequently, molecular phylogenies reconstructed for this enzyme superfamily have been exploited over the last two decades in attempts to elucidate the interlocking between O2 levels in the environment and the evolution of respiratory bioenergetic processes. Although based on strictly identical datasets, these phylogenetic approaches have led to diametrically opposite scenarios with respect to the history of both the enzyme superfamily and molecular oxygen on the Earth. In an effort to overcome the deadlock of molecular phylogeny, we here review presently available structural, functional, palaeogeochemical and thermodynamic information pertinent to the evolution of the superfamily (which notably also encompasses the subfamily of nitric oxide reductases). The scenario which, in our eyes, most closely fits the ensemble of these non-phylogenetic data, sees the low O2-affinity SoxM- (or A-) type enzymes as the most recent evolutionary innovation and the high-affinity O2 reductases (SoxB or B and cbb3 or C) as arising independently from NO-reducing precursor enzymes.

  1. The Effects of an Inference-Making Strategy Taught with and without Goal Setting

    ERIC Educational Resources Information Center

    Reed, Deborah K.; Lynn, Devon

    2016-01-01

    This quasi-experimental study examined the effects of a strategy for making text-dependent inferences--with and without embedded self-regulation skills--on the reading comprehension of 24 middle-grade students with disabilities. Classes were randomly assigned to receive the inference intervention only (IO), inference + individual goal setting…

  2. Bayes factors and multimodel inference

    USGS Publications Warehouse

    Link, W.A.; Barker, R.J.; Thomson, David L.; Cooch, Evan G.; Conroy, Michael J.

    2009-01-01

    Multimodel inference has two main themes: model selection, and model averaging. Model averaging is a means of making inference conditional on a model set, rather than on a selected model, allowing formal recognition of the uncertainty associated with model choice. The Bayesian paradigm provides a natural framework for model averaging, and provides a context for evaluation of the commonly used AIC weights. We review Bayesian multimodel inference, noting the importance of Bayes factors. Noting the sensitivity of Bayes factors to the choice of priors on parameters, we define and propose nonpreferential priors as offering a reasonable standard for objective multimodel inference.

  3. POWER: PhylOgenetic WEb Repeater--an integrated and user-optimized framework for biomolecular phylogenetic analysis.

    PubMed

    Lin, Chung-Yen; Lin, Fan-Kai; Lin, Chieh Hua; Lai, Li-Wei; Hsu, Hsiu-Jun; Chen, Shu-Hwa; Hsiung, Chao A

    2005-07-01

    POWER, the PhylOgenetic WEb Repeater, is a web-based service designed to perform user-friendly pipeline phylogenetic analysis. POWER uses an open-source LAMP structure and infers genetic distances and phylogenetic relationships using well-established algorithms (ClustalW and PHYLIP). POWER incorporates a novel tree builder based on the GD library to generate a high-quality tree topology according to the calculated result. POWER accepts either raw sequences in FASTA format or user-uploaded alignment output files. Through a user-friendly web interface, users can sketch a tree effortlessly in multiple steps. After a tree has been generated, users can freely set and modify parameters, select tree building algorithms, refine sequence alignments or edit the tree topology. All the information related to input sequences and the processing history is logged and downloadable for the user's reference. Furthermore, iterative tree construction can be performed by adding sequences to, or removing them from, a previously submitted job. POWER is accessible at http://power.nhri.org.tw.

  4. Double jeopardy in inferring cognitive processes.

    PubMed

    Fific, Mario

    2014-01-01

    Inferences we make about underlying cognitive processes can be jeopardized in two ways due to problematic forms of aggregation. First, averaging across individuals is typically considered a very useful tool for removing random variability. The threat is that averaging across subjects leads to averaging across different cognitive strategies, thus harming our inferences. The second threat comes from the construction of inadequate research designs possessing a low diagnostic accuracy of cognitive processes. For that reason we introduced the systems factorial technology (SFT), which has primarily been designed to make inferences about underlying processing order (serial, parallel, coactive), stopping rule (terminating, exhaustive), and process dependency. SFT proposes that the minimal research design complexity to learn about n number of cognitive processes should be equal to 2 (n) . In addition, SFT proposes that (a) each cognitive process should be controlled by a separate experimental factor, and (b) The saliency levels of all factors should be combined in a full factorial design. In the current study, the author cross combined the levels of jeopardies in a 2 × 2 analysis, leading to four different analysis conditions. The results indicate a decline in the diagnostic accuracy of inferences made about cognitive processes due to the presence of each jeopardy in isolation and when combined. The results warrant the development of more individual subject analyses and the utilization of full-factorial (SFT) experimental designs.

  5. Double jeopardy in inferring cognitive processes

    PubMed Central

    Fific, Mario

    2014-01-01

    Inferences we make about underlying cognitive processes can be jeopardized in two ways due to problematic forms of aggregation. First, averaging across individuals is typically considered a very useful tool for removing random variability. The threat is that averaging across subjects leads to averaging across different cognitive strategies, thus harming our inferences. The second threat comes from the construction of inadequate research designs possessing a low diagnostic accuracy of cognitive processes. For that reason we introduced the systems factorial technology (SFT), which has primarily been designed to make inferences about underlying processing order (serial, parallel, coactive), stopping rule (terminating, exhaustive), and process dependency. SFT proposes that the minimal research design complexity to learn about n number of cognitive processes should be equal to 2n. In addition, SFT proposes that (a) each cognitive process should be controlled by a separate experimental factor, and (b) The saliency levels of all factors should be combined in a full factorial design. In the current study, the author cross combined the levels of jeopardies in a 2 × 2 analysis, leading to four different analysis conditions. The results indicate a decline in the diagnostic accuracy of inferences made about cognitive processes due to the presence of each jeopardy in isolation and when combined. The results warrant the development of more individual subject analyses and the utilization of full-factorial (SFT) experimental designs. PMID:25374545

  6. Continental scale patterns and predictors of fern richness and phylogenetic diversity.

    PubMed

    Nagalingum, Nathalie S; Knerr, Nunzio; Laffan, Shawn W; González-Orozco, Carlos E; Thornhill, Andrew H; Miller, Joseph T; Mishler, Brent D

    2015-01-01

    Because ferns have a wide range of habitat preferences and are widely distributed, they are an ideal group for understanding how diversity is distributed. Here we examine fern diversity on a broad-scale using standard and corrected richness measures as well as phylogenetic indices; in addition we determine the environmental predictors of each diversity metric. Using the combined records of Australian herbaria, a dataset of over 60,000 records was obtained for 89 genera to infer richness. A molecular phylogeny of all the genera was constructed and combined with the herbarium records to obtain phylogenetic diversity patterns. A hotspot of both taxic and phylogenetic diversity occurs in the Wet Tropics of northeastern Australia. Although considerable diversity is distributed along the eastern coast, some important regions of diversity are identified only after sample-standardization of richness and through the phylogenetic metric. Of all of the metrics, annual precipitation was identified as the most explanatory variable, in part, in agreement with global and regional fern studies. However, precipitation was combined with a different variable for each different metric. For corrected richness, precipitation was combined with temperature seasonality, while correlation of phylogenetic diversity to precipitation plus radiation indicated support for the species-energy hypothesis. Significantly high and significantly low phylogenetic diversity were found in geographically separate areas. These separate areas correlated with different climatic conditions such as seasonality in precipitation. The phylogenetic metrics identified additional areas of significant diversity, some of which have not been revealed using traditional taxonomic analyses, suggesting that different ecological and evolutionary processes have operated over the continent. Our study demonstrates that it is possible and vital to incorporate evolutionary metrics when inferring biodiversity hotspots from large

  7. Continental scale patterns and predictors of fern richness and phylogenetic diversity

    PubMed Central

    Nagalingum, Nathalie S.; Knerr, Nunzio; Laffan, Shawn W.; González-Orozco, Carlos E.; Thornhill, Andrew H.; Miller, Joseph T.; Mishler, Brent D.

    2015-01-01

    Because ferns have a wide range of habitat preferences and are widely distributed, they are an ideal group for understanding how diversity is distributed. Here we examine fern diversity on a broad-scale using standard and corrected richness measures as well as phylogenetic indices; in addition we determine the environmental predictors of each diversity metric. Using the combined records of Australian herbaria, a dataset of over 60,000 records was obtained for 89 genera to infer richness. A molecular phylogeny of all the genera was constructed and combined with the herbarium records to obtain phylogenetic diversity patterns. A hotspot of both taxic and phylogenetic diversity occurs in the Wet Tropics of northeastern Australia. Although considerable diversity is distributed along the eastern coast, some important regions of diversity are identified only after sample-standardization of richness and through the phylogenetic metric. Of all of the metrics, annual precipitation was identified as the most explanatory variable, in part, in agreement with global and regional fern studies. However, precipitation was combined with a different variable for each different metric. For corrected richness, precipitation was combined with temperature seasonality, while correlation of phylogenetic diversity to precipitation plus radiation indicated support for the species-energy hypothesis. Significantly high and significantly low phylogenetic diversity were found in geographically separate areas. These separate areas correlated with different climatic conditions such as seasonality in precipitation. The phylogenetic metrics identified additional areas of significant diversity, some of which have not been revealed using traditional taxonomic analyses, suggesting that different ecological and evolutionary processes have operated over the continent. Our study demonstrates that it is possible and vital to incorporate evolutionary metrics when inferring biodiversity hotspots from large

  8. Transforming phylogenetic networks: Moving beyond tree space.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

    2016-09-01

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks.

  9. Seasonal and genotypic changes in escherichia coli phylogenetic groups in the Yeongsan River basin of South Korea.

    PubMed

    Jang, Jeonghwan; Di, Doris Y W; Lee, Anna; Unno, Tatsuya; Sadowsky, Michael J; Hur, Hor-Gil

    2014-01-01

    With 3,480 E. coli strains isolated from the Yeongsan River basin, South Korea, correlations between phylogenetic groups and horizontal fluorophore enhanced rep-PCR (HFERP) genotypes were examined, and environmental factors affecting E. coli phylogenetic groups in the river water were determined. Interestingly, multidimentional scaling (MDS) analyses based on HFERP DNA fingerprint data indicated that E. coli in phylogenetic groups A and B1 were uniquely clustered. Results of self-organized maps (SOMs) analyses also indicated that E. coli phylogenetic groups were seasonally affected by water temperature, with greater occurrences of phylogenetic groups A and B1 in low and high temperature seasons, respectively. The presence of E. coli in phylogenetic groups A and B1 were inversely related. Furthermore, redundancy analysis (RDA) revealed that phylogenetic group B1 correlated positively with temperature, strain diversity, and biochemical oxygen demand (BOD) but negatively with phylogenetic group A. Results of this study indicated that while E. coli strains could be clustered based on their genotypes and environment conditions, their phylogenetic groups did not change in relation to the same conditions. The distributional differences of phylogenetic groups among E. coli populations in different environments may be caused by different genomic adaptability and plasticity of E. coli strains belonging to each phylogenetic group. Although several previous studies have reported different E. coli ecological structures depending on their origins, this study is a first description of the specific environmental factors affecting E. coli phylogenetic groups in river water. PMID:24999864

  10. Seasonal and Genotypic Changes in Escherichia coli Phylogenetic Groups in the Yeongsan River Basin of South Korea

    PubMed Central

    Jang, Jeonghwan; Di, Doris Y. W.; Lee, Anna; Unno, Tatsuya; Sadowsky, Michael J.; Hur, Hor-Gil

    2014-01-01

    With 3,480 E. coli strains isolated from the Yeongsan River basin, South Korea, correlations between phylogenetic groups and horizontal fluorophore enhanced rep-PCR (HFERP) genotypes were examined, and environmental factors affecting E. coli phylogenetic groups in the river water were determined. Interestingly, multidimentional scaling (MDS) analyses based on HFERP DNA fingerprint data indicated that E. coli in phylogenetic groups A and B1 were uniquely clustered. Results of self-organized maps (SOMs) analyses also indicated that E. coli phylogenetic groups were seasonally affected by water temperature, with greater occurrences of phylogenetic groups A and B1 in low and high temperature seasons, respectively. The presence of E. coli in phylogenetic groups A and B1 were inversely related. Furthermore, redundancy analysis (RDA) revealed that phylogenetic group B1 correlated positively with temperature, strain diversity, and biochemical oxygen demand (BOD) but negatively with phylogenetic group A. Results of this study indicated that while E. coli strains could be clustered based on their genotypes and environment conditions, their phylogenetic groups did not change in relation to the same conditions. The distributional differences of phylogenetic groups among E. coli populations in different environments may be caused by different genomic adaptability and plasticity of E. coli strains belonging to each phylogenetic group. Although several previous studies have reported different E. coli ecological structures depending on their origins, this study is a first description of the specific environmental factors affecting E. coli phylogenetic groups in river water. PMID:24999864

  11. Learning to Observe "and" Infer

    ERIC Educational Resources Information Center

    Hanuscin, Deborah L.; Park Rogers, Meredith A.

    2008-01-01

    Researchers describe the need for students to have multiple opportunities and social interaction to learn about the differences between observation and inference and their role in developing scientific explanations (Harlen 2001; Simpson 2000). Helping children develop their skills of observation and inference in science while emphasizing the…

  12. Feature Inference Learning and Eyetracking

    ERIC Educational Resources Information Center

    Rehder, Bob; Colner, Robert M.; Hoffman, Aaron B.

    2009-01-01

    Besides traditional supervised classification learning, people can learn categories by inferring the missing features of category members. It has been proposed that feature inference learning promotes learning a category's internal structure (e.g., its typical features and interfeature correlations) whereas classification promotes the learning of…

  13. Improving Inferences from Multiple Methods.

    ERIC Educational Resources Information Center

    Shotland, R. Lance; Mark, Melvin M.

    1987-01-01

    Multiple evaluation methods (MEMs) can cause an inferential challenge, although there are strategies to strengthen inferences. Practical and theoretical issues involved in the use by social scientists of MEMs, three potential problems in drawing inferences from MEMs, and short- and long-term strategies for alleviating these problems are outlined.…

  14. Causal Inference in Retrospective Studies.

    ERIC Educational Resources Information Center

    Holland, Paul W.; Rubin, Donald B.

    1988-01-01

    The problem of drawing causal inferences from retrospective case-controlled studies is considered. A model for causal inference in prospective studies is applied to retrospective studies. Limitations of case-controlled studies are formulated concerning relevant parameters that can be estimated in such studies. A coffee-drinking/myocardial…

  15. Causal Inference and Developmental Psychology

    ERIC Educational Resources Information Center

    Foster, E. Michael

    2010-01-01

    Causal inference is of central importance to developmental psychology. Many key questions in the field revolve around improving the lives of children and their families. These include identifying risk factors that if manipulated in some way would foster child development. Such a task inherently involves causal inference: One wants to know whether…

  16. Phylogenetic Analysis of Poliovirus Sequences.

    PubMed

    Jorba, Jaume

    2016-01-01

    Comparative genomic sequencing is a major surveillance tool in the Polio Laboratory Network. Due to the rapid evolution of polioviruses (~1 % per year), pathways of virus transmission can be reconstructed from the pathways of genomic evolution. Here, we describe three main phylogenetic methods; estimation of genetic distances, reconstruction of a maximum-likelihood (ML) tree, and estimation of substitution rates using Bayesian Markov chain Monte Carlo (MCMC). The data set used consists of complete capsid sequences from a survey of poliovirus sequences available in GenBank. PMID:26983737

  17. Phylogenetic placement of the Spirosomaceae

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Maloy, S.; Mandelco, L.; Raj, H. D.

    1990-01-01

    Comparative analysis of 16S rRNA sequences shows that the family Spirosomaceae belongs within the eubacterial phylum defined by the flavobacteria and bacteriodes. Its constituent genera, Spirosoma, Flectobacillus, and Runella form a monophyletic grouping therein. The phylogenetic assignment is based not only upon evolutionary distance analysis, but also upon sequence signatures and higher order structural synapomorphies in 16S rRNA. Another genus peripherally associated with the Spirosomaceae, Ancylobacter ("Microcyclus"), does not cluster with the flavobacteria and their relatives, but rather belongs to the alpha subdivision of the purple bacteria.

  18. Mistaking geography for biology: inferring processes from species distributions.

    PubMed

    Warren, Dan L; Cardillo, Marcel; Rosauer, Dan F; Bolnick, Daniel I

    2014-10-01

    Over the past few decades, there has been a rapid proliferation of statistical methods that infer evolutionary and ecological processes from data on species distributions. These methods have led to considerable new insights, but they often fail to account for the effects of historical biogeography on present-day species distributions. Because the geography of speciation can lead to patterns of spatial and temporal autocorrelation in the distributions of species within a clade, this can result in misleading inferences about the importance of deterministic processes in generating spatial patterns of biodiversity. In this opinion article, we discuss ways in which patterns of species distributions driven by historical biogeography are often interpreted as evidence of particular evolutionary or ecological processes. We focus on three areas that are especially prone to such misinterpretations: community phylogenetics, environmental niche modelling, and analyses of beta diversity (compositional turnover of biodiversity).

  19. Phylogenetic Reconstruction as a Broadly Applicable Teaching Tool in the Biology Classroom: The Value of Data in Estimating Likely Answers

    ERIC Educational Resources Information Center

    Julius, Matthew L.; Schoenfuss, Heiko L.

    2006-01-01

    This laboratory exercise introduces students to a fundamental tool in evolutionary biology--phylogenetic inference. Students are required to create a data set via observation and through mining preexisting data sets. These student data sets are then used to develop and compare competing hypotheses of vertebrate phylogeny. The exercise uses readily…

  20. Mammalian phylogenetic diversity-area relationships at a continental scale

    PubMed Central

    Mazel, Florent; Renaud, Julien; Guilhaumon, François; Mouillot, David; Gravel, Dominique; Thuiller, Wilfried

    2015-01-01

    In analogy to the species-area relationship (SAR), one of the few laws in Ecology, the phylogenetic diversity-area relationship (PDAR) describes the tendency of phylogenetic diversity (PD) to increase with area. Although investigating PDAR has the potential to unravel the underlying processes shaping assemblages across spatial scales and to predict PD loss through habitat reduction, it has been little investigated so far. Focusing on PD has noticeable advantages compared to species richness (SR) since PD also gives insights on processes such as speciation/extinction, assembly rules and ecosystem functioning. Here we investigate the universality and pervasiveness of the PDAR at continental scale using terrestrial mammals as study case. We define the relative robustness of PD (compared to SR) to habitat loss as the area between the standardized PDAR and standardized SAR (i.e. standardized by the diversity of the largest spatial window) divided by the area under the standardized SAR only. This metric quantifies the relative increase of PD robustness compared to SR robustness. We show that PD robustness is higher than SR robustness but that it varies among continents. We further use a null model approach to disentangle the relative effect of phylogenetic tree shape and non random spatial distribution of evolutionary history on the PDAR. We find that for most spatial scales and for all continents except Eurasia, PDARs are not different from expected by a model using only the observed SAR and the shape of the phylogenetic tree at continental scale. Interestingly, we detect a strong phylogenetic structure of the Eurasian PDAR that can be predicted by a model that specifically account for a finer biogeographical delineation of this continent. In conclusion, the relative robustness of PD to habitat loss compared to species richness is determined by the phylogenetic tree shape but also depends on the spatial structure of PD. PMID:26649401

  1. Estimating Bayesian Phylogenetic Information Content

    PubMed Central

    Lewis, Paul O.; Chen, Ming-Hui; Kuo, Lynn; Lewis, Louise A.; Fučíková, Karolina; Neupane, Suman; Wang, Yu-Bo; Shi, Daoyuan

    2016-01-01

    Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.] PMID:27155008

  2. Phylogenetic Origins of Brain Organisers

    PubMed Central

    Robertshaw, Ellen; Kiecker, Clemens

    2012-01-01

    The regionalisation of the nervous system begins early in embryogenesis, concomitant with the establishment of the anteroposterior (AP) and dorsoventral (DV) body axes. The molecular mechanisms that drive axis induction appear to be conserved throughout the animal kingdom and may be phylogenetically older than the emergence of bilateral symmetry. As a result of this process, groups of patterning genes that are equally well conserved are expressed at specific AP and DV coordinates of the embryo. In the emerging nervous system of vertebrate embryos, this initial pattern is refined by local signalling centres, secondary organisers, that regulate patterning, proliferation, and axonal pathfinding in adjacent neuroepithelium. The main secondary organisers for the AP neuraxis are the midbrain-hindbrain boundary, zona limitans intrathalamica, and anterior neural ridge and for the DV neuraxis the notochord, floor plate, and roof plate. A search for homologous secondary organisers in nonvertebrate lineages has led to controversy over their phylogenetic origins. Based on a recent study in hemichordates, it has been suggested that the AP secondary organisers evolved at the base of the deuterostome superphylum, earlier than previously thought. According to this view, the lack of signalling centres in some deuterostome lineages is likely to reflect a secondary loss due to adaptive processes. We propose that the relative evolutionary flexibility of secondary organisers has contributed to a broader morphological complexity of nervous systems in different clades. PMID:24278699

  3. Phylogenetic Conservatism in Plant Phenology

    NASA Technical Reports Server (NTRS)

    Davies, T. Jonathan; Wolkovich, Elizabeth M.; Kraft, Nathan J. B.; Salamin, Nicolas; Allen, Jenica M.; Ault, Toby R.; Betancourt, Julio L.; Bolmgren, Kjell; Cleland, Elsa E.; Cook, Benjamin I.; Crimmins, Theresa M.; Mazer, Susan J.; McCabe, Gregory J.; Pau, Stephanie; Regetz, Jim; Schwartz, Mark D.; Travers, Steven E.

    2013-01-01

    Phenological events defined points in the life cycle of a plant or animal have been regarded as highly plastic traits, reflecting flexible responses to various environmental cues. The ability of a species to track, via shifts in phenological events, the abiotic environment through time might dictate its vulnerability to future climate change. Understanding the predictors and drivers of phenological change is therefore critical. Here, we evaluated evidence for phylogenetic conservatism the tendency for closely related species to share similar ecological and biological attributes in phenological traits across flowering plants. We aggregated published and unpublished data on timing of first flower and first leaf, encompassing 4000 species at 23 sites across the Northern Hemisphere. We reconstructed the phylogeny for the set of included species, first, using the software program Phylomatic, and second, from DNA data. We then quantified phylogenetic conservatism in plant phenology within and across sites. We show that more closely related species tend to flower and leaf at similar times. By contrasting mean flowering times within and across sites, however, we illustrate that it is not the time of year that is conserved, but rather the phenological responses to a common set of abiotic cues. Our findings suggest that species cannot be treated as statistically independent when modelling phenological responses.Closely related species tend to resemble each other in the timing of their life-history events, a likely product of evolutionarily conserved responses to environmental cues. The search for the underlying drivers of phenology must therefore account for species' shared evolutionary histories.

  4. Nuclear and mitochondrial genes for inferring Trichuris phylogeny.

    PubMed

    Callejón, Rocío; Cutillas, Cristina; Nadler, Steven A

    2015-12-01

    Nucleotide sequences of the triose phosphate isomerase (TPI) gene (624 bp) and mitochondrial cytochrome b (cob) gene (520 bp) were obtained by PCR and evaluated for utility in inferring the phylogenetic relationships among Trichuris species. Published sequences of one other nuclear gene (18S or SSU rRNA, 1816-1846 bp) and one additional mitochondrial (mtDNA) gene (cytochrome oxidase 1, cox1, 342 bp) were also analyzed. Maximum likelihood and Bayesian inference methods were used to infer phylogenies for each gene separately but also for the combined mitochondrial data (two genes), the combined nuclear data (two genes), and the total evidence (four gene) dataset. Few Trichuris clades were uniformly resolved across separate analyses of individual genes. For the mtDNA, the cob gene trees had greater phylogenetic resolution and tended to have higher support values than the cox1 analyses. For nuclear genes, the SSU gene trees had slightly greater resolution and support values than the TPI analyses, but TPI was the only gene with reliable support for the deepest nodes in the tree. Combined analyses of genes yielded strongly supported clades in most cases, with the exception of the relationship among Trichuris clades 1, 2, and 3, which showed conflicting results between nuclear and mitochondrial genes. Both the TPI and cob genes proved valuable for inferring Trichuris relationships, with greatest resolution and support values achieved through combined analysis of multiple genes. Based on the phylogeny of the combined analysis of nuclear and mitochondrial genes, parsimony mapping of definitive host utilization depicts artiodactyls as the ancestral hosts for these Trichuris, with host-shifts into primates, rodents, and Carnivora.

  5. The more, the better: the use of multiple landmark configurations to solve the phylogenetic relationships in musteloids.

    PubMed

    Catalano, Santiago A; Ercoli, Marcos D; Prevosti, Francisco J

    2015-03-01

    Although the use of landmark data to study shape changes along a phylogenetic tree has become a common practice in evolutionary studies, the role of this sort of data for the inference of phylogenetic relationships remains under debate. Theoretical issues aside, the very existence of historical information in landmark data has been challenged, since phylogenetic analyses have often shown little congruence with alternative sources of evidence. However, most analyses conducted in the past were based upon a single landmark configuration, leaving it unsettled whether the incorporation of multiple configurations may improve the rather poor performance of this data source in most previous phylogenetic analyses. In the present study, we present a phylogenetic analysis of landmark data that combines information derived from several skeletal structures to derive a phylogenetic tree for musteloids. The analysis includes nine configurations representing different skeletal structures for 24 species. The resulting tree presents several notable concordances with phylogenetic hypotheses derived from molecular data. In particular, Mephitidae, Procyonidae, and Lutrinae plus the genera Martes, Mustela, Galictis, and Procyon were retrieved as monophyletic. In addition, other groupings were in agreement with molecular phylogenies or presented only minor discordances. Complementary analyses have also indicated that the results improve substantially when an increasing number of landmark configurations are included in the analysis. The results presented here thus highlight the importance of combining information from multiple structures to derive phylogenetic hypotheses from landmark data. PMID:25516268

  6. Phylogenetic analysis of the spirochetes.

    PubMed Central

    Paster, B J; Dewhirst, F E; Weisburg, W G; Tordoff, L A; Fraser, G J; Hespell, R B; Stanton, T B; Zablen, L; Mandelco, L; Woese, C R

    1991-01-01

    The 16S rRNA sequences were determined for species of Spirochaeta, Treponema, Borrelia, Leptospira, Leptonema, and Serpula, using a modified Sanger method of direct RNA sequencing. Analysis of aligned 16S rRNA sequences indicated that the spirochetes form a coherent taxon composed of six major clusters or groups. The first group, termed the treponemes, was divided into two subgroups. The first treponeme subgroup consisted of Treponema pallidum, Treponema phagedenis, Treponema denticola, a thermophilic spirochete strain, and two species of Spirochaeta, Spirochaeta zuelzerae and Spirochaeta stenostrepta, with an average interspecies similarity of 89.9%. The second treponeme subgroup contained Treponema bryantii, Treponema pectinovorum, Treponema saccharophilum, Treponema succinifaciens, and rumen strain CA, with an average interspecies similarity of 86.2%. The average interspecies similarity between the two treponeme subgroups was 84.2%. The division of the treponemes into two subgroups was verified by single-base signature analysis. The second spirochete group contained Spirochaeta aurantia, Spirochaeta halophila, Spirochaeta bajacaliforniensis, Spirochaeta litoralis, and Spirochaeta isovalerica, with an average similarity of 87.4%. The Spirochaeta group was related to the treponeme group, with an average similarity of 81.9%. The third spirochete group contained borrelias, including Borrelia burgdorferi, Borrelia anserina, Borrelia hermsii, and a rabbit tick strain. The borrelias formed a tight phylogenetic cluster, with average similarity of 97%. THe borrelia group shared a common branch with the Spirochaeta group and was closer to this group than to the treponemes. A single spirochete strain isolated fromt the shew constituted the fourth group. The fifth group was composed of strains of Serpula (Treponema) hyodysenteriae and Serpula (Treponema) innocens. The two species of this group were closely related, with a similarity of greater than 99%. Leptonema illini

  7. Phylogenetic Analysis of Pelecaniformes (Aves) Based on Osteological Data: Implications for Waterbird Phylogeny and Fossil Calibration Studies

    PubMed Central

    Smith, Nathan D.

    2010-01-01

    molecular phylogenies than previously assumed, though notable conflicts remain. The phylogenetic position of the Plotopteridae implies that wing-propelled diving evolved independently in plotopterids and penguins, representing a remarkable case of convergent evolution. Despite robust support for the placement of fossil taxa representing key calibration points, the successive outgroup relationships of several “stem fossil + crown family” clades are variable and poorly supported across recent studies of avian phylogeny. Thus, the impact these fossils have on inferred patterns of temporal diversification depends heavily on the resolution of deep nodes in avian phylogeny. PMID:20976229

  8. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    PubMed

    Kelly, Steven; Maini, Philip K

    2013-01-01

    The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  9. Phylogenetic organization of bacterial activity.

    PubMed

    Morrissey, Ember M; Mau, Rebecca L; Schwartz, Egbert; Caporaso, J Gregory; Dijkstra, Paul; van Gestel, Natasja; Koch, Benjamin J; Liu, Cindy M; Hayer, Michaela; McHugh, Theresa A; Marks, Jane C; Price, Lance B; Hungate, Bruce A

    2016-09-01

    Phylogeny is an ecologically meaningful way to classify plants and animals, as closely related taxa frequently have similar ecological characteristics, functional traits and effects on ecosystem processes. For bacteria, however, phylogeny has been argued to be an unreliable indicator of an organism's ecology owing to evolutionary processes more common to microbes such as gene loss and lateral gene transfer, as well as convergent evolution. Here we use advanced stable isotope probing with (13)C and (18)O to show that evolutionary history has ecological significance for in situ bacterial activity. Phylogenetic organization in the activity of bacteria sets the stage for characterizing the functional attributes of bacterial taxonomic groups. Connecting identity with function in this way will allow scientists to begin building a mechanistic understanding of how bacterial community composition regulates critical ecosystem functions.

  10. Phylogenetic organization of bacterial activity

    PubMed Central

    Morrissey, Ember M; Mau, Rebecca L; Schwartz, Egbert; Caporaso, J Gregory; Dijkstra, Paul; van Gestel, Natasja; Koch, Benjamin J; Liu, Cindy M; Hayer, Michaela; McHugh, Theresa A; Marks, Jane C; Price, Lance B; Hungate, Bruce A

    2016-01-01

    Phylogeny is an ecologically meaningful way to classify plants and animals, as closely related taxa frequently have similar ecological characteristics, functional traits and effects on ecosystem processes. For bacteria, however, phylogeny has been argued to be an unreliable indicator of an organism's ecology owing to evolutionary processes more common to microbes such as gene loss and lateral gene transfer, as well as convergent evolution. Here we use advanced stable isotope probing with 13C and 18O to show that evolutionary history has ecological significance for in situ bacterial activity. Phylogenetic organization in the activity of bacteria sets the stage for characterizing the functional attributes of bacterial taxonomic groups. Connecting identity with function in this way will allow scientists to begin building a mechanistic understanding of how bacterial community composition regulates critical ecosystem functions. PMID:26943624

  11. Assigning protein functions by comparative genome analysis protein phylogenetic profiles

    DOEpatents

    Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

    2003-05-13

    A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

  12. Visual exploration of parameter influence on phylogenetic trees.

    PubMed

    Hess, Martin; Bremm, Sebastian; Weissgraeber, Stephanie; Hamacher, Kay; Goesele, Michael; Wiemeyer, Josef; von Landesberger, Tatiana

    2014-01-01

    Evolutionary relationships between organisms are frequently derived as phylogenetic trees inferred from multiple sequence alignments (MSAs). The MSA parameter space is exponentially large, so tens of thousands of potential trees can emerge for each dataset. A proposed visual-analytics approach can reveal the parameters' impact on the trees. Given input trees created with different parameter settings, it hierarchically clusters the trees according to their structural similarity. The most important clusters of similar trees are shown together with their parameters. This view offers interactive parameter exploration and automatic identification of relevant parameters. Biologists applied this approach to real data of 16S ribosomal RNA and protein sequences of ion channels. It revealed which parameters affected the tree structures. This led to a more reliable selection of the best trees.

  13. High-level phylogeny of the Coleoptera inferred with mitochondrial genome sequences.

    PubMed

    Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Guo, Zhong-Long; Liu, Yong-Jian; Shen, Yu-Ying; Shao, Renfu

    2016-11-01

    The Coleoptera (beetles) exhibits tremendous morphological, ecological, and behavioral diversity. To better understand the phylogenetics and evolution of beetles, we sequenced three complete mitogenomes from two families (Cleridae and Meloidae), which share conserved mitogenomic features with other completely sequenced beetles. We assessed the influence of six datasets and three inference methods on topology and nodal support within the Coleoptera. We found that both Bayesian inference and maximum likelihood with homogeneous-site models were greatly affected by nucleotide compositional heterogeneity, while the heterogeneous-site mixture model in PhyloBayes could provide better phylogenetic signals for the Coleoptera. The amino acid dataset generated more reliable tree topology at the higher taxonomic levels (i.e. suborders and series), where the inclusion of rRNA genes and the third positions of protein-coding genes improved phylogenetic inference at the superfamily level, especially under a heterogeneous-site model. We recovered the suborder relationships as (Archostemata+Adephaga)+(Myxophaga+Polyphaga). The series relationships within Polyphaga were recovered as (Scirtiformia+(Elateriformia+((Bostrichiformia+Scarabaeiformia+Staphyliniformia)+Cucujiformia))). All superfamilies within Cucujiformia were recovered as monophyletic. We obtained a cucujiform phylogeny of (Cleroidea+(Coccinelloidea+((Lymexyloidea+Tenebrionoidea)+(Cucujoidea+(Chrysomeloidea+Curculionoidea))))). This study showed that although tree topologies were sensitive to data types and inference methods, mitogenomic data could provide useful information for resolving the Coleoptera phylogeny at various taxonomic levels by using suitable datasets and heterogeneous-site models.

  14. High-level phylogeny of the Coleoptera inferred with mitochondrial genome sequences.

    PubMed

    Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Guo, Zhong-Long; Liu, Yong-Jian; Shen, Yu-Ying; Shao, Renfu

    2016-11-01

    The Coleoptera (beetles) exhibits tremendous morphological, ecological, and behavioral diversity. To better understand the phylogenetics and evolution of beetles, we sequenced three complete mitogenomes from two families (Cleridae and Meloidae), which share conserved mitogenomic features with other completely sequenced beetles. We assessed the influence of six datasets and three inference methods on topology and nodal support within the Coleoptera. We found that both Bayesian inference and maximum likelihood with homogeneous-site models were greatly affected by nucleotide compositional heterogeneity, while the heterogeneous-site mixture model in PhyloBayes could provide better phylogenetic signals for the Coleoptera. The amino acid dataset generated more reliable tree topology at the higher taxonomic levels (i.e. suborders and series), where the inclusion of rRNA genes and the third positions of protein-coding genes improved phylogenetic inference at the superfamily level, especially under a heterogeneous-site model. We recovered the suborder relationships as (Archostemata+Adephaga)+(Myxophaga+Polyphaga). The series relationships within Polyphaga were recovered as (Scirtiformia+(Elateriformia+((Bostrichiformia+Scarabaeiformia+Staphyliniformia)+Cucujiformia))). All superfamilies within Cucujiformia were recovered as monophyletic. We obtained a cucujiform phylogeny of (Cleroidea+(Coccinelloidea+((Lymexyloidea+Tenebrionoidea)+(Cucujoidea+(Chrysomeloidea+Curculionoidea))))). This study showed that although tree topologies were sensitive to data types and inference methods, mitogenomic data could provide useful information for resolving the Coleoptera phylogeny at various taxonomic levels by using suitable datasets and heterogeneous-site models. PMID:27497607

  15. Bayesian Inference: with ecological applications

    USGS Publications Warehouse

    Link, William A.; Barker, Richard J.

    2010-01-01

    This text provides a mathematically rigorous yet accessible and engaging introduction to Bayesian inference with relevant examples that will be of interest to biologists working in the fields of ecology, wildlife management and environmental studies as well as students in advanced undergraduate statistics.. This text opens the door to Bayesian inference, taking advantage of modern computational efficiencies and easily accessible software to evaluate complex hierarchical models.

  16. Phylogeny and Divergence Times of Lemurs Inferred with Recent and Ancient Fossils in the Tree.

    PubMed

    Herrera, James P; Dávalos, Liliana M

    2016-09-01

    Paleontological and neontological systematics seek to answer evolutionary questions with different data sets. Phylogenies inferred for combined extant and extinct taxa provide novel insights into the evolutionary history of life. Primates have an extensive, diverse fossil record and molecular data for living and extinct taxa are rapidly becoming available. We used two models to infer the phylogeny and divergence times for living and fossil primates, the tip-dating (TD) and fossilized birth-death process (FBD). We collected new morphological data, especially on the living and extinct endemic lemurs of Madagascar. We combined the morphological data with published DNA sequences to infer near-complete (88% of lemurs) time-calibrated phylogenies. The results suggest that primates originated around the Cretaceous-Tertiary boundary, slightly earlier than indicated by the fossil record and later than previously inferred from molecular data alone. We infer novel relationships among extinct lemurs, and strong support for relationships that were previously unresolved. Dates inferred with TD were significantly older than those inferred with FBD, most likely related to an assumption of a uniform branching process in the TD compared with a birth-death process assumed in the FBD. This is the first study to combine morphological and DNA sequence data from extinct and extant primates to infer evolutionary relationships and divergence times, and our results shed new light on the tempo of lemur evolution and the efficacy of combined phylogenetic analyses. PMID:27113475

  17. Phylogenetic relationships within the Alcidae (Charadriiformes: Aves) inferred from total molecular evidence.

    PubMed

    Friesen, V L; Baker, A J; Piatt, J F

    1996-02-01

    The Alcidae is a unique assemblage of Northern Hemisphere seabirds that forage by "flying" underwater. Despite obvious affinities among the species, their evolutionary relationships are unclear. We analyzed nucleotide sequences of 1,045 base pairs of the mitochondrial cytochrome b gene and allelic profiles for 37 allozyme loci in all 22 extant species. Trees were constructed on independent and combined data sets using maximum parsimony and distance methods that correct for superimposed changes. Alternative methods of analysis produced only minor differences in relationships that were supported strongly by bootstrapping or standard error tests. Combining sequence and allozyme data into a single analysis provided the greatest number of relationships receiving strong support. Addition of published morphological and ecological data did not improve support for any additional relationship. All analyses grouped species into six distinct lineages: (1) the dovekie (Alle alle) and auks, (2) guillemots, (3) brachyramphine murrelets, (4) synthliboramphine murrelets, (5) true auklets, and (6) the rhinoceros auklet (Cerorhinca monocerata) and puffins. The two murres (genus Uria) were sister taxa, and the black guillemot (Cepphus grylle) was basal to the other guillemots. The Asian subspecies of the marbled murrelet (Brachyramphus marmoratus perdix) was the most divergent brachyramphine murrelet, and two distinct lineages occurred within the synthliboramphine murrelets. Cassin's auklet (Ptychoramphus aleuticus) and the rhinoceros auklet were basal to the other auklets and puffins, respectively, and the Atlantic (Fratercula arctica) and horned (Fratercula corniculata) puffins were sister taxa. Several relationships among tribes, among the dovekie and auks, and among the auklets could not be resolved but resembled "star" phylogenies indicative of adaptive radiations at different depths within the trees. PMID:8587501

  18. Data set of phylogenetic analysis inferred based on the complete genomes of the family Nodaviridae.

    PubMed

    Low, Chen-Fei; Bunawan, Hamidun

    2016-09-01

    In this article, nine complete genomes of viruses from the genus Alphanodavirus and Betanodavirus (Family Nodaviridae) were comparatively analyzed and the data of their evolutionary origins and relatedness are reported. The nucleotide sequence alignment of the complete genomes from all species and their deduced evolutionary relationships are presented. High sequence similarity within the genus Betanodavirus compared to the genus Alphanodavirus was revealed in multiple sequence alignment of the Nodaviridae genomes. The amino acid sequence similarity for both RNA1 and RNA2 ORF is more conserved in Betanodavirus, compared to Alphanodavirus. The conserved and variable regions within the virus genome that were defined based on the multiple sequence alignments are presented in this dataset. PMID:27617282

  19. Phylogenetics and diversification of Cotyledon (Crassulaceae) inferred from nuclear and chloroplast DNA sequence data.

    PubMed

    Mort, Mark E; Levsen, Nicholas; Randle, Christopher P; Van Jaarsveld, Ernst; Palmer, Annie

    2005-07-01

    Crassulaceae includes approximately 35 genera and 1500 species of leaf and stem succulent flowering plants. The family is nearly cosmopolitan in distribution, but is particularly diverse in southern Africa, where five genera comprising approximately 325 species are found. One of these genera, Cotyledon, includes 10 species that are largely confined to South Africa, where they are commonly found on rocky hillsides, coastal flats, and cliff faces. Species of Cotyledon are characterized by five-parted, pendulous, sympetalous flowers, but the genus is highly diverse in growth form, flower color and size, and leaf morphology. One particularly variable species, C. orbiculata, has been divided into five varieties based on leaf morphology and biogeography; however, the monophyly of this species as well as the relationships among the varieties have not previously been investigated. Parsimony analyses of a combined data set of DNA sequences from chloroplast and nuclear genome provided the first estimate of phylogeny for Cotyledon, and resulted in two minimum-length trees and a fully resolved phylogeny for the genus. Results indicate that C. orbiculata is not monophyletic and suggest the need for additional studies and a revised classification within the genus. PMID:21646139

  20. Phylogenetic relationships among onychophora from Australasia inferred from the mitochondrial cytochrome oxidase subunit I gene.

    PubMed

    Gleeson, D M; Rowell, D M; Tait, N N; Briscoe, D A; Higgins, A V

    1998-10-01

    Nucleotide sequence variation in a region of the mitochondrial cytochrome oxidase subunit I (COI) gene (456 bp) was examined for 26 onychophorans representing 15 genera of the family Peripatopsidae from Australasia. Sequence analysis revealed high intergeneric COI sequence divergence (up to 20.6% corrected) but low amino acid substitution rates, with high levels of transitional saturation evident. Among unambiguously alignable sequences, parsimony and distance analyses revealed a broadly congruent tree topology, robust to various algorithms and statistical analysis. There are two major groupings. One, largely unresolved, consists entirely of Australian mainland taxa. The other, for which there is convincing support, includes all of the New Zealand and Tasmanian taxa together with one mainland Australian species. In respect of the two major groupings, this topology is consistent with previous morphologically based phylogenies and provides further evidence for an ancient radiation within the mainland Australian Onychophora. The biogeographic implications of the close affinities revealed between the Tasmanian and New Zealand taxa are discussed.

  1. The phylogenetic relationships of the extant pelicans inferred from DNA sequence data.

    PubMed

    Kennedy, Martyn; Taylor, Scott A; Nádvorník, Petr; Spencer, Hamish G

    2013-01-01

    The pelicans are a charismatic group of large water birds, whose evolutionary relationships have been long debated. Here we use DNA sequence data from both mitochondrial and nuclear genes to derive a robust phylogeny of all the extant species. Our data rejects the widespread notion that pelicans can be divided into white- and brown-plumaged groups. Instead, we find that, in contrast to all previous evolutionary hypotheses, the species fall into three well-supported clades: an Old World clade of the Dalmatian, Spot-billed, Pink-backed and Australian Pelicans, a New World clade of the American White, Brown and Peruvian Pelicans, and monospecific clade consisting solely of the Great White Pelican, weakly grouped with the Old World clade. We discuss possible evolutionary scenarios giving rise to this diversity. PMID:23059726

  2. Late Proterozoic rise in atmospheric oxygen concentration inferred from phylogenetic and sulphur-isotope studies

    NASA Technical Reports Server (NTRS)

    Canfield, D. E.; Teske, A.

    1996-01-01

    The evolution of non-photosynthetic sulphide-oxidizing bacteria was contemporaneous with a large shift in the isotopic composition of biogenic sedimentary sulphides between 0.64 and 1.05 billion years ago. Both events were probably driven by a rise in atmospheric oxygen concentrations to greater than 5-18% of present levels--a change that may also have triggered the evolution of animals.

  3. A phylogenetic view on species radiation in Apodemus inferred from variation of nuclear and mitochondrial genes.

    PubMed

    Serizawa, K; Suzuki, H; Tsuchiya, K

    2000-02-01

    Species of field mice (genus Apodemus) are the most common rodents inhabiting woodlands and forests of the Palaearctic region. We examined the cytochrome b (cyt b) gene in mitochondrial DNA (1140 bp) and the interphotoreceptor retinoid binding protein (IRBP) gene in nuclear DNA (1152 bp) in nine species of Apodemus. Based on the genetic variation, the nine species were grouped into four lineages: (1) Agrarius group (A. agrarius, A. peninsulae, A. semotus, and A. speciosus), (2) Argenteus group (A. argenteus), (3) Gurkha group (A. gurkha), and (4) Sylvaticus group (A. alpicola, A. flavicollis, and A. sylvaticus). It was shown that these four lineages diverged within a short period of evolutionary time, suggestive of a radiation event. Soon after the radiation, the Agrarius group was likely to have differentiated again into the species lineages simultaneously. In contrast, the European clade, the Sylvaticus group, radiated rather recently. The relative ratio of the extent of sequence divergence among the four main lineages to that among the members of the subfamily Murinae (including Mus and Rattus) was calculated to be 72.4% in the cyt b gene with transversional substitutions, and 58.5% in the IRBP gene with all substitutions, using the Kimura two-parameter method. The value for the three European lineages was 27.6% in the cyt b gene and 12.3% in the IRBP gene. These results may have a correlation with the notion that deciduous broadleaf forests remained in Central East Asia through the late Tertiary to the present, while those in Europe to a large extent had disappeared by the Pliocene.

  4. Active inference, communication and hermeneutics.

    PubMed

    Friston, Karl J; Frith, Christopher D

    2015-07-01

    Hermeneutics refers to interpretation and translation of text (typically ancient scriptures) but also applies to verbal and non-verbal communication. In a psychological setting it nicely frames the problem of inferring the intended content of a communication. In this paper, we offer a solution to the problem of neural hermeneutics based upon active inference. In active inference, action fulfils predictions about how we will behave (e.g., predicting we will speak). Crucially, these predictions can be used to predict both self and others--during speaking and listening respectively. Active inference mandates the suppression of prediction errors by updating an internal model that generates predictions--both at fast timescales (through perceptual inference) and slower timescales (through perceptual learning). If two agents adopt the same model, then--in principle--they can predict each other and minimise their mutual prediction errors. Heuristically, this ensures they are singing from the same hymn sheet. This paper builds upon recent work on active inference and communication to illustrate perceptual learning using simulated birdsongs. Our focus here is the neural hermeneutics implicit in learning, where communication facilitates long-term changes in generative models that are trying to predict each other. In other words, communication induces perceptual learning and enables others to (literally) change our minds and vice versa.

  5. Active inference, communication and hermeneutics☆

    PubMed Central

    Friston, Karl J.; Frith, Christopher D.

    2015-01-01

    Hermeneutics refers to interpretation and translation of text (typically ancient scriptures) but also applies to verbal and non-verbal communication. In a psychological setting it nicely frames the problem of inferring the intended content of a communication. In this paper, we offer a solution to the problem of neural hermeneutics based upon active inference. In active inference, action fulfils predictions about how we will behave (e.g., predicting we will speak). Crucially, these predictions can be used to predict both self and others – during speaking and listening respectively. Active inference mandates the suppression of prediction errors by updating an internal model that generates predictions – both at fast timescales (through perceptual inference) and slower timescales (through perceptual learning). If two agents adopt the same model, then – in principle – they can predict each other and minimise their mutual prediction errors. Heuristically, this ensures they are singing from the same hymn sheet. This paper builds upon recent work on active inference and communication to illustrate perceptual learning using simulated birdsongs. Our focus here is the neural hermeneutics implicit in learning, where communication facilitates long-term changes in generative models that are trying to predict each other. In other words, communication induces perceptual learning and enables others to (literally) change our minds and vice versa. PMID:25957007

  6. Active inference, communication and hermeneutics.

    PubMed

    Friston, Karl J; Frith, Christopher D

    2015-07-01

    Hermeneutics refers to interpretation and translation of text (typically ancient scriptures) but also applies to verbal and non-verbal communication. In a psychological setting it nicely frames the problem of inferring the intended content of a communication. In this paper, we offer a solution to the problem of neural hermeneutics based upon active inference. In active inference, action fulfils predictions about how we will behave (e.g., predicting we will speak). Crucially, these predictions can be used to predict both self and others--during speaking and listening respectively. Active inference mandates the suppression of prediction errors by updating an internal model that generates predictions--both at fast timescales (through perceptual inference) and slower timescales (through perceptual learning). If two agents adopt the same model, then--in principle--they can predict each other and minimise their mutual prediction errors. Heuristically, this ensures they are singing from the same hymn sheet. This paper builds upon recent work on active inference and communication to illustrate perceptual learning using simulated birdsongs. Our focus here is the neural hermeneutics implicit in learning, where communication facilitates long-term changes in generative models that are trying to predict each other. In other words, communication induces perceptual learning and enables others to (literally) change our minds and vice versa. PMID:25957007

  7. Cyclopeptide toxins of lethal amanitas: Compositions, distribution and phylogenetic implication.

    PubMed

    Tang, Shanshan; Zhou, Qian; He, Zhengmi; Luo, Tao; Zhang, Ping; Cai, Qing; Yang, Zhuliang; Chen, Jia; Chen, Zuohong

    2016-09-15

    Lethal amanitas (Amanita sect. Phalloideae) are responsible for 90% of all fatal mushroom poisonings. Since 2000, more than ten new lethal Amanita species have been discovered and some of them had caused severe mushroom poisonings in China. However, the contents and distribution of cyclopeptides in these lethal mushrooms remain poorly known. In this study, the diversity of major cyclopeptide toxins in seven Amanita species from Eastern Asia and three species from Europe and North America were systematically analyzed, and a new approach to inferring phylogenetic relationships using cyclopeptide profile was evaluated for the first time. The results showed that there were diversities of the cyclopeptides among lethal Amanita species, and cyclopeptides from Amanita rimosa and Amanita fuligineoides were reported for the first time. The amounts of amatoxins in East Asian Amanita species were significantly higher than those in European and North American species. The analysis of distribution of amatoxins and phallotoxins in various Amanita species demonstrated that the content of phallotoxins was higher than that of amatoxins in Amanita phalloides and Amanita virosa. In contrast, the content of phallotoxins was significantly lower than that of amatoxins in all East Asian lethal Amanita species tested. However, the distribution of amatoxins and phallotoxins in different tissues showed the same tendency. Eight cyclopeptides and three unknown compounds were identified using cyclopeptide standards and high-resolution MS. Based on the cyclopeptide profiles, phylogenetic relationships of lethal amanitas were inferred through a dendrogram generated by UPGMA method. The results showed high similarity to the phylogeny established previously based on the multi-locus DNA sequences. PMID:27476461

  8. The phylogenetic distribution of ultraviolet sensitivity in birds

    PubMed Central

    2013-01-01

    Background Colour vision in birds can be categorized into two classes, the ultraviolet (UVS) and violet sensitive (VS). Their phylogenetic distributions have traditionally been regarded as highly conserved. However, the complicated nature of acquiring spectral sensitivities from cone photoreceptors meant that until recently, only a few species had actually been studied. Whether birds are UVS or VS can nowadays be inferred from a wide range of species via genomic sequencing of the UV/violet SWS1 cone opsin gene. Results We present genomic sequencing results of the SWS1 gene from 21 avian orders. Amino acid residues signifying UV sensitivity are found in the two most important spectral tuning sites 86 and 90 of Pteroclidiformes and Coraciiformes, in addition to the major clades, Palaeognathae, Charadriiformes, Trogoniformes, Psittaciformes and Passeriformes, where they where previously known to occur. We confirm that the presumed UVS-conferring amino acid combination F86, C90 and M93 is common to Palaeognathae and unique to this clade, despite available spectrometric evidence showing the ostrich retina to be VS. Conclusions By mapping our results together with data from previous studies on a molecular phylogeny we show that avian colour vision shifted between VS and UVS at least 14 times. Single nucleotide substitutions can explain all these shifts. The common ancestor of birds most likely had a VS phenotype. However, the ancestral state of the avian SWS1 opsin’s spectral tuning sites cannot be resolved, since the Palaeognathae are F86, C90 while the Neognathae are ancestrally S86, S90. The phylogenetic distribution of UVS and VS colour vision in birds is so complex that inferences of spectral sensitivities from closely related taxa should be used with caution. PMID:23394614

  9. Causal inference in biology networks with integrated belief propagation.

    PubMed

    Chang, Rui; Karr, Jonathan R; Schadt, Eric E

    2015-01-01

    Inferring causal relationships among molecular and higher order phenotypes is a critical step in elucidating the complexity of living systems. Here we propose a novel method for inferring causality that is no longer constrained by the conditional dependency arguments that limit the ability of statistical causal inference methods to resolve causal relationships within sets of graphical models that are Markov equivalent. Our method utilizes Bayesian belief propagation to infer the responses of perturbation events on molecular traits given a hypothesized graph structure. A distance measure between the inferred response distribution and the observed data is defined to assess the 'fitness' of the hypothesized causal relationships. To test our algorithm, we infer causal relationships within equivalence classes of gene networks in which the form of the functional interactions that are possible are assumed to be nonlinear, given synthetic microarray and RNA sequencing data. We also apply our method to infer causality in real metabolic network with v-structure and feedback loop. We show that our method can recapitulate the causal structure and recover the feedback loop only from steady-state data which conventional method cannot. PMID:25592596

  10. Integrating fossils with molecular phylogenies improves inference of trait evolution.

    PubMed

    Slater, Graham J; Harmon, Luke J; Alfaro, Michael E

    2012-12-01

    Comparative biologists often attempt to draw inferences about tempo and mode in evolution by comparing the fit of evolutionary models to phylogenetic comparative data consisting of a molecular phylogeny with branch lengths and trait measurements from extant taxa. These kinds of approaches ignore historical evidence for evolutionary pattern and process contained in the fossil record. In this article, we show through simulation that incorporation of fossil information dramatically improves our ability to distinguish among models of quantitative trait evolution using comparative data. We further suggest a novel Bayesian approach that allows fossil information to be integrated even when explicit phylogenetic hypotheses are lacking for extinct representatives of extant clades. By applying this approach to a comparative dataset comprising body sizes for caniform carnivorans, we show that incorporation of fossil information not only improves ancestral state estimates relative to those derived from extant taxa alone, but also results in preference of a model of evolution with trend toward large body size over alternative models such as Brownian motion or Ornstein-Uhlenbeck processes. Our approach highlights the importance of considering fossil information when making macroevolutionary inference, and provides a way to integrate the kind of sparse fossil information that is available to most evolutionary biologists.

  11. Palaeotemperature trend for Precambrian life inferred from resurrected proteins.

    PubMed

    Gaucher, Eric A; Govindarajan, Sridhar; Ganesh, Omjoy K

    2008-02-01

    Biosignatures and structures in the geological record indicate that microbial life has inhabited Earth for the past 3.5 billion years or so. Research in the physical sciences has been able to generate statements about the ancient environment that hosted this life. These include the chemical compositions and temperatures of the early ocean and atmosphere. Only recently have the natural sciences been able to provide experimental results describing the environments of ancient life. Our previous work with resurrected proteins indicated that ancient life lived in a hot environment. Here we expand the timescale of resurrected proteins to provide a palaeotemperature trend of the environments that hosted life from 3.5 to 0.5 billion years ago. The thermostability of more than 25 phylogenetically dispersed ancestral elongation factors suggest that the environment supporting ancient life cooled progressively by 30 degrees C during that period. Here we show that our results are robust to potential statistical bias associated with the posterior distribution of inferred character states, phylogenetic ambiguity, and uncertainties in the amino-acid equilibrium frequencies used by evolutionary models. Our results are further supported by a nearly identical cooling trend for the ancient ocean as inferred from the deposition of oxygen isotopes. The convergence of results from natural and physical sciences suggest that ancient life has continually adapted to changes in environmental temperatures throughout its evolutionary history.

  12. Palaeotemperature trend for Precambrian life inferred from resurrected proteins.

    PubMed

    Gaucher, Eric A; Govindarajan, Sridhar; Ganesh, Omjoy K

    2008-02-01

    Biosignatures and structures in the geological record indicate that microbial life has inhabited Earth for the past 3.5 billion years or so. Research in the physical sciences has been able to generate statements about the ancient environment that hosted this life. These include the chemical compositions and temperatures of the early ocean and atmosphere. Only recently have the natural sciences been able to provide experimental results describing the environments of ancient life. Our previous work with resurrected proteins indicated that ancient life lived in a hot environment. Here we expand the timescale of resurrected proteins to provide a palaeotemperature trend of the environments that hosted life from 3.5 to 0.5 billion years ago. The thermostability of more than 25 phylogenetically dispersed ancestral elongation factors suggest that the environment supporting ancient life cooled progressively by 30 degrees C during that period. Here we show that our results are robust to potential statistical bias associated with the posterior distribution of inferred character states, phylogenetic ambiguity, and uncertainties in the amino-acid equilibrium frequencies used by evolutionary models. Our results are further supported by a nearly identical cooling trend for the ancient ocean as inferred from the deposition of oxygen isotopes. The convergence of results from natural and physical sciences suggest that ancient life has continually adapted to changes in environmental temperatures throughout its evolutionary history. PMID:18256669

  13. Mixed integer linear programming for maximum-parsimony phylogeny inference.

    PubMed

    Sridhar, Srinath; Lam, Fumei; Blelloch, Guy E; Ravi, R; Schwartz, Russell

    2008-01-01

    Reconstruction of phylogenetic trees is a fundamental problem in computational biology. While excellent heuristic methods are available for many variants of this problem, new advances in phylogeny inference will be required if we are to be able to continue to make effective use of the rapidly growing stores of variation data now being gathered. In this paper, we present two integer linear programming (ILP) formulations to find the most parsimonious phylogenetic tree from a set of binary variation data. One method uses a flow-based formulation that can produce exponential numbers of variables and constraints in the worst case. The method has, however, proven extremely efficient in practice on datasets that are well beyond the reach of the available provably efficient methods, solving several large mtDNA and Y-chromosome instances within a few seconds and giving provably optimal results in times competitive with fast heuristics than cannot guarantee optimality. An alternative formulation establishes that the problem can be solved with a polynomial-sized ILP. We further present a web server developed based on the exponential-sized ILP that performs fast maximum parsimony inferences and serves as a front end to a database of precomputed phylogenies spanning the human genome.

  14. Optimal inference with suboptimal models: Addiction and active Bayesian inference

    PubMed Central

    Schwartenbeck, Philipp; FitzGerald, Thomas H.B.; Mathys, Christoph; Dolan, Ray; Wurst, Friedrich; Kronbichler, Martin; Friston, Karl

    2015-01-01

    When casting behaviour as active (Bayesian) inference, optimal inference is defined with respect to an agent’s beliefs – based on its generative model of the world. This contrasts with normative accounts of choice behaviour, in which optimal actions are considered in relation to the true structure of the environment – as opposed to the agent’s beliefs about worldly states (or the task). This distinction shifts an understanding of suboptimal or pathological behaviour away from aberrant inference as such, to understanding the prior beliefs of a subject that cause them to behave less ‘optimally’ than our prior beliefs suggest they should behave. Put simply, suboptimal or pathological behaviour does not speak against understanding behaviour in terms of (Bayes optimal) inference, but rather calls for a more refined understanding of the subject’s generative model upon which their (optimal) Bayesian inference is based. Here, we discuss this fundamental distinction and its implications for understanding optimality, bounded rationality and pathological (choice) behaviour. We illustrate our argument using addictive choice behaviour in a recently described ‘limited offer’ task. Our simulations of pathological choices and addictive behaviour also generate some clear hypotheses, which we hope to pursue in ongoing empirical work. PMID:25561321

  15. A molecular phylogenetic analysis of strombid gastropod morphological diversity.

    PubMed

    Latiolais, Jared M; Taylor, Michael S; Roy, Kaustuv; Hellberg, Michael E

    2006-11-01

    The shells of strombid gastropods show a wide variety of forms, ranging from small and fusiform to large and elaborately ornamented with a strongly flared outer lip. Here, we present the first species-level molecular phylogeny for strombids and use the resulting phylogenetic framework to explore relationships between species richness and morphological diversity. We use portions of one nuclear (325 bp of histone H3) and one mitochondrial (640 bp of cytochrome oxidase I, COI) gene to infer relationships within the two most species-rich genera in the Strombidae: Strombus and Lambis. We include 32 species of Strombus, representing 10 of 11 extant subgenera, and 3 of the 9 species of Lambis, representing 2 of 3 extant subgenera. Maximum likelihood and Bayesian analyses of COI and of H3 and COI combined suggest Lambis is nested within a paraphyletic Strombus. Eastern Pacific and western Atlantic species of Strombus form a relatively recent monophyletic radiation within an older, paraphyletic Indo-West Pacific grade. Morphological diversity of subclades scales positively with species richness but does not show evidence of strong phylogenetic constraints. PMID:16839783

  16. Neural substrates of cognitive biases during probabilistic inference.

    PubMed

    Soltani, Alireza; Khorsand, Peyman; Guo, Clara; Farashahi, Shiva; Liu, Janet

    2016-01-01

    Decision making often requires simultaneously learning about and combining evidence from various sources of information. However, when making inferences from these sources, humans show systematic biases that are often attributed to heuristics or limitations in cognitive processes. Here we use a combination of experimental and modelling approaches to reveal neural substrates of probabilistic inference and corresponding biases. We find systematic deviations from normative accounts of inference when alternative options are not equally rewarding; subjects' choice behaviour is biased towards the more rewarding option, whereas their inferences about individual cues show the opposite bias. Moreover, inference bias about combinations of cues depends on the number of cues. Using a biophysically plausible model, we link these biases to synaptic plasticity mechanisms modulated by reward expectation and attention. We demonstrate that inference relies on direct estimation of posteriors, not on combination of likelihoods and prior. Our work reveals novel mechanisms underlying cognitive biases and contributions of interactions between reward-dependent learning, decision making and attention to high-level reasoning. PMID:27116102

  17. Neural substrates of cognitive biases during probabilistic inference

    PubMed Central

    Soltani, Alireza; Khorsand, Peyman; Guo, Clara; Farashahi, Shiva; Liu, Janet

    2016-01-01

    Decision making often requires simultaneously learning about and combining evidence from various sources of information. However, when making inferences from these sources, humans show systematic biases that are often attributed to heuristics or limitations in cognitive processes. Here we use a combination of experimental and modelling approaches to reveal neural substrates of probabilistic inference and corresponding biases. We find systematic deviations from normative accounts of inference when alternative options are not equally rewarding; subjects' choice behaviour is biased towards the more rewarding option, whereas their inferences about individual cues show the opposite bias. Moreover, inference bias about combinations of cues depends on the number of cues. Using a biophysically plausible model, we link these biases to synaptic plasticity mechanisms modulated by reward expectation and attention. We demonstrate that inference relies on direct estimation of posteriors, not on combination of likelihoods and prior. Our work reveals novel mechanisms underlying cognitive biases and contributions of interactions between reward-dependent learning, decision making and attention to high-level reasoning. PMID:27116102

  18. Predictive Inference Using Latent Variables with Covariates*

    PubMed Central

    Schofield, Lynne Steuerle; Junker, Brian; Taylor, Lowell J.; Black, Dan A.

    2014-01-01

    Plausible Values (PVs) are a standard multiple imputation tool for analysis of large education survey data that measures latent proficiency variables. When latent proficiency is the dependent variable, we reconsider the standard institutionally-generated PV methodology and find it applies with greater generality than shown previously. When latent proficiency is an independent variable, we show that the standard institutional PV methodology produces biased inference because the institutional conditioning model places restrictions on the form of the secondary analysts’ model. We offer an alternative approach that avoids these biases based on the mixed effects structural equations (MESE) model of Schofield (2008). PMID:25231627

  19. Phylogenetic Comparisons of Bacterial Communities from Serpentine and Nonserpentine Soils▿

    PubMed Central

    Oline, David K.

    2006-01-01

    I present the results of a culture-independent survey of soil bacterial communities from serpentine soils and adjacent nonserpentine comparator soils using a variety of newly developed phylogenetically based statistical tools. The study design included site-based replication of the serpentine-to-nonserpentine community comparison over a regional scale (∼100 km) in Northern California and Southern Oregon by producing 16S rRNA clone libraries from pairs of samples taken on either side of the serepentine-nonserpentine edaphic boundary at three geographical sites. At the division level, the serpentine and nonserpentine communities were similar to each other and to previous data from forest soils. Comparisons of both richness and Shannon diversity produced no significant differences between any of the libraries, but the vast majority of phylogenetically based tests were significant, even with only 50 sequences per library. These results suggest that most samples were distinct, consisting of a collection of lineages generally not found in other samples. The pattern of results showed that serpentine communities tended to be more similar to each other than they were to nonserpentine communities, and these differences were at a lower taxonomic scale. Comparisons of two nonserpentine communities generally showed differences, and some results suggest that the geographical site may control community composition as well. These results show the power of phylogenetic tests to discern differences between 16S rRNA libraries compared to tests that discard DNA data to bin sequences into operational taxonomic units, and they stress the importance of replication at larger scales for inferences regarding microbial biogeography. PMID:16950906

  20. Phylogenetic informativeness reconciles ray-finned fish molecular divergence times

    PubMed Central

    2014-01-01

    Background Discordance among individual molecular age estimates, or between molecular age estimates and the fossil record, is observed in many clades across the Tree of Life. This discordance is attributed to a variety of variables including calibration age uncertainty, calibration placement, nucleotide substitution rate heterogeneity, or the specified molecular clock model. However, the impact of changes in phylogenetic informativeness of individual genes over time on phylogenetic inferences is rarely analyzed. Using nuclear and mitochondrial sequence data for ray-finned fishes (Actinopterygii) as an example, we extend the utility of phylogenetic informativeness profiles to predict the time intervals when nucleotide substitution saturation results in discordance among molecular ages estimated. Results We demonstrate that even with identical calibration regimes and molecular clock methods, mitochondrial based molecular age estimates are systematically older than those estimated from nuclear sequences. This discordance is most severe for highly nested nodes corresponding to more recent (i.e., Jurassic-Recent) divergences. By removing data deemed saturated, we reconcile the competing age estimates and highlight that the older mtDNA based ages were driven by nucleotide saturation. Conclusions Homoplasious site patterns in a DNA sequence alignment can systematically bias molecular divergence time estimates. Our study demonstrates that PI profiles can provide a non-arbitrary criterion for data exclusion to mitigate the influence of homoplasy on time calibrated branch length estimates. Analyses of actinopterygian molecular clocks demonstrate that scrutiny of the time scale on which sequence data is informative is a fundamental, but generally overlooked, step in molecular divergence time estimation. PMID:25103329

  1. Space versus phylogeny: disentangling phylogenetic and spatial signals in comparative data.

    PubMed

    Freckleton, Robert P; Jetz, Walter

    2009-01-01

    Variation in traits across species or populations is the outcome of both environmental and historical factors. Trait variation is therefore a function of both the phylogenetic and spatial context of species. Here we introduce a method that, within a single framework, estimates the relative roles of spatial and phylogenetic variations in comparative data. The approach requires traits measured across phylogenetic units, e.g. species, the spatial occurrences of those units and a phylogeny connecting them. The method modifies the expected variance of phylogenetically independent contrasts to include both spatial and phylogenetic effects. We illustrate this approach by analysing cross-species variation in body mass, geographical range size and species-typical environmental temperature in three orders of mammals (carnivores, artiodactyls and primates). These species attributes contain highly disparate levels of phylogenetic and spatial signals, with the strongest phylogenetic autocorrelation in body size and spatial dependence in environmental temperatures and geographical range size showing mixed effects. The proposed method successfully captures these differences and in its simplest form estimates a single parameter that quantifies the relative effects of space and phylogeny. We discuss how the method may be extended to explore a range of models of evolution and spatial dependence.

  2. The ethnobotany of psychoactive plant use: a phylogenetic perspective

    PubMed Central

    2016-01-01

    Psychoactive plants contain chemicals that presumably evolved as allelochemicals but target certain neuronal receptors when consumed by humans, altering perception, emotion and cognition. These plants have been used since ancient times as medicines and in the context of religious rituals for their various psychoactive effects (e.g., as hallucinogens, stimulants, sedatives). The ubiquity of psychoactive plants in various cultures motivates investigation of the commonalities among these plants, in which a phylogenetic framework may be insightful. A phylogeny of culturally diverse psychoactive plant taxa was constructed with their psychotropic effects and affected neurotransmitter systems mapped on the phylogeny. The phylogenetic distribution shows multiple evolutionary origins of psychoactive families. The plant families Myristicaceae (e.g., nutmeg), Papaveraceae (opium poppy), Cactaceae (peyote), Convolvulaceae (morning glory), Solanaceae (tobacco), Lamiaceae (mints), Apocynaceae (dogbane) have a disproportionate number of psychoactive genera with various indigenous groups using geographically disparate members of these plant families for the same psychoactive effect, an example of cultural convergence. Pharmacological traits related to hallucinogenic and sedative potential are phylogenetically conserved within families. Unrelated families that exert similar psychoactive effects also modulate similar neurotransmitter systems (i.e., mechanistic convergence). However, pharmacological mechanisms for stimulant effects were varied even within families suggesting that stimulant chemicals may be more evolutionarily labile than those associated with hallucinogenic and sedative effects. Chemically similar psychoactive chemicals may also exist in phylogenetically unrelated lineages, suggesting convergent evolution or differential gene regulation of a common metabolic pathway. Our study has shown that phylogenetic analysis of traditionally used psychoactive plants suggests

  3. Bayesian inference of sampled ancestor trees for epidemiology and fossil calibration.

    PubMed

    Gavryushkina, Alexandra; Welch, David; Stadler, Tanja; Drummond, Alexei J

    2014-12-01

    Phylogenetic analyses which include fossils or molecular sequences that are sampled through time require models that allow one sample to be a direct ancestor of another sample. As previously available phylogenetic inference tools assume that all samples are tips, they do not allow for this possibility. We have developed and implemented a Bayesian Markov Chain Monte Carlo (MCMC) algorithm to infer what we call sampled ancestor trees, that is, trees in which sampled individuals can be direct ancestors of other sampled individuals. We use a family of birth-death models where individuals may remain in the tree process after sampling, in particular we extend the birth-death skyline model [Stadler et al., 2013] to sampled ancestor trees. This method allows the detection of sampled ancestors as well as estimation of the probability that an individual will be removed from the process when it is sampled. We show that even if sampled ancestors are not of specific interest in an analysis, failing to account for them leads to significant bias in parameter estimates. We also show that sampled ancestor birth-death models where every sample comes from a different time point are non-identifiable and thus require one parameter to be known in order to infer other parameters. We apply our phylogenetic inference accounting for sampled ancestors to epidemiological data, where the possibility of sampled ancestors enables us to identify individuals that infected other individuals after being sampled and to infer fundamental epidemiological parameters. We also apply the method to infer divergence times and diversification rates when fossils are included along with extant species samples, so that fossilisation events are modelled as a part of the tree branching process. Such modelling has many advantages as argued in the literature. The sampler is available as an open-source BEAST2 package (https://github.com/CompEvol/sampled-ancestors). PMID:25474353

  4. Bayesian Inference of Sampled Ancestor Trees for Epidemiology and Fossil Calibration

    PubMed Central

    Gavryushkina, Alexandra; Welch, David; Stadler, Tanja; Drummond, Alexei J.

    2014-01-01

    Phylogenetic analyses which include fossils or molecular sequences that are sampled through time require models that allow one sample to be a direct ancestor of another sample. As previously available phylogenetic inference tools assume that all samples are tips, they do not allow for this possibility. We have developed and implemented a Bayesian Markov Chain Monte Carlo (MCMC) algorithm to infer what we call sampled ancestor trees, that is, trees in which sampled individuals can be direct ancestors of other sampled individuals. We use a family of birth-death models where individuals may remain in the tree process after sampling, in particular we extend the birth-death skyline model [Stadler et al., 2013] to sampled ancestor trees. This method allows the detection of sampled ancestors as well as estimation of the probability that an individual will be removed from the process when it is sampled. We show that even if sampled ancestors are not of specific interest in an analysis, failing to account for them leads to significant bias in parameter estimates. We also show that sampled ancestor birth-death models where every sample comes from a different time point are non-identifiable and thus require one parameter to be known in order to infer other parameters. We apply our phylogenetic inference accounting for sampled ancestors to epidemiological data, where the possibility of sampled ancestors enables us to identify individuals that infected other individuals after being sampled and to infer fundamental epidemiological parameters. We also apply the method to infer divergence times and diversification rates when fossils are included along with extant species samples, so that fossilisation events are modelled as a part of the tree branching process. Such modelling has many advantages as argued in the literature. The sampler is available as an open-source BEAST2 package (https://github.com/CompEvol/sampled-ancestors). PMID:25474353

  5. Short-wavelength sensitive opsin (SWS1) as a new marker for vertebrate phylogenetics

    PubMed Central

    van Hazel, Ilke; Santini, Francesco; Müller, Johannes; Chang, Belinda SW

    2006-01-01

    Background Vertebrate SWS1 visual pigments mediate visual transduction in response to light at short wavelengths. Due to their importance in vision, SWS1 genes have been isolated from a surprisingly wide range of vertebrates, including lampreys, teleosts, amphibians, reptiles, birds, and mammals. The SWS1 genes exhibit many of the characteristics of genes typically targeted for phylogenetic analyses. This study investigates both the utility of SWS1 as a marker for inferring vertebrate phylogenetic relationships, and the characteristics of the gene that contribute to its phylogenetic utility. Results Phylogenetic analyses of vertebrate SWS1 genes produced topologies that were remarkably congruent with generally accepted hypotheses of vertebrate evolution at both higher and lower taxonomic levels. The few exceptions were generally associated with areas of poor taxonomic sampling, or relationships that have been difficult to resolve using other molecular markers. The SWS1 data set was characterized by a substantial amount of among-site rate variation, and a relatively unskewed substitution rate matrix, even when the data were partitioned into different codon sites and individual taxonomic groups. Although there were nucleotide biases in some groups at third positions, these biases were not convergent across different taxonomic groups. Conclusion Our results suggest that SWS1 may be a good marker for vertebrate phylogenetics due to the variable yet consistent patterns of sequence evolution exhibited across fairly wide taxonomic groups. This may result from constraints imposed by the functional role of SWS1 pigments in visual transduction. PMID:17107620

  6. [Conditionally neutral phylogenetic markers of major taxa: a new aspect of the evolution of macromolecules].

    PubMed

    Petrov, N B; Aleshin, V V

    2002-08-01

    The current phase of molecular phylogenetics can be named the 18S rRNA gene era, which is now approaching the end. To date, almost all phyla of metazoans and many taxa of protists are represented in databases of 18S rRNA gene sequences. The elements of the phylogenetic tree of Metazoa inferred from 18S rRNA genes are characterized by unequal validity: some of them seem to be well grounded; others are not adequately supported, and probably will be revised later. The validity of phylogenetic reconstruction is influenced by two main factors: (1) erroneous grouping of long branches that occur because of abnormally high evolution rate; (2) deficit of phylogenetically informative characters. A method for overcoming these difficulties is suggested in addition to known tools: using phylogenetic markers that are stable within individual taxa and evolve by punctuated equilibrium. These markers are least influenced by the convergence caused by a high evolution rate of the entire gene. The nature of these markers of ancient taxa, paradoxical from the perspective of neutral evolution, is discussed, as well as their importance for establishing monophyly of both new large-scale taxonomic groups of invertebrates (Bilateria + Rhombozoa + Orthonectida + Myxozoa + Cnidaria + Placozoa and Echinodermata + Hemichordata) and some major taxa of Nematoda. PMID:12244690

  7. A phylogenetic perspective on the individual species-area relationship in temperate and tropical tree communities.

    PubMed

    Yang, Jie; Swenson, Nathan G; Cao, Min; Chuyong, George B; Ewango, Corneille E N; Howe, Robert; Kenfack, David; Thomas, Duncan; Wolf, Amy; Lin, Luxiang

    2013-01-01

    Ecologists have historically used species-area relationships (SARs) as a tool to understand the spatial distribution of species. Recent work has extended SARs to focus on individual-level distributions to generate individual species area relationships (ISARs). The ISAR approach quantifies whether individuals of a species tend have more or less species richness surrounding them than expected by chance. By identifying richness 'accumulators' and 'repellers', respectively, the ISAR approach has been used to infer the relative importance of abiotic and biotic interactions and neutrality. A clear limitation of the SAR and ISAR approaches is that all species are treated as evolutionarily independent and that a large amount of work has now shown that local tree neighborhoods exhibit non-random phylogenetic structure given the species richness. Here, we use nine tropical and temperate forest dynamics plots to ask: (i) do ISARs change predictably across latitude?; (ii) is the phylogenetic diversity in the neighborhood of species accumulators and repellers higher or lower than that expected given the observed species richness?; and (iii) do species accumulators, repellers distributed non-randomly on the community phylogenetic tree? The results indicate no clear trend in ISARs from the temperate zone to the tropics and that the phylogenetic diversity surrounding the individuals of species is generally only non-random on very local scales. Interestingly the distribution of species accumulators and repellers was non-random on the community phylogenies suggesting the presence of phylogenetic signal in the ISAR across latitude.

  8. Incorporating clade identity in analyses of phylogenetic community structure: an example with hummingbirds.

    PubMed

    Parra, Juan L; McGuire, Jimmy A; Graham, Catherine H

    2010-11-01

    An important challenge in community ecology is to determine how processes occurring at multiple spatial, temporal, and phylogenetic scales influence the structure of local communities. While indexes of phylogenetic structure, which measure how related species are in a community, provide insight into the processes that shape species coexistence, they fail to pinpoint the phylogenetic scales at which those processes occur. Here, we explore a framework to identify the species and clades responsible for the inferred patterns of phylogenetic structure within a given community. Further, we evaluate how communities that share the nonrandom representation of species from a given clade in the phylogeny are distributed across geography and environmental gradients. Using Ecuadorian hummingbird communities, we found that multiple patterns of phylogenetic structure often occur within a local assemblage. We also identified four geographic regions where species from certain clades exhibit nonrandom representation: the eastern Amazonian lowlands, the western dry lowlands, the Andes at middle elevations, and the Andes at high elevations. The environmental gradients along which changes in the local coexistence of species occurred were mainly elevation, annual precipitation, and seasonality in both temperature and precipitation. Finally, we show how these patterns can be used to generate hypotheses about the processes that allow species coexistence.

  9. [Conditionally neutral phylogenetic markers of major taxa: a new aspect of the evolution of macromolecules].

    PubMed

    Petrov, N B; Aleshin, V V

    2002-08-01

    The current phase of molecular phylogenetics can be named the 18S rRNA gene era, which is now approaching the end. To date, almost all phyla of metazoans and many taxa of protists are represented in databases of 18S rRNA gene sequences. The elements of the phylogenetic tree of Metazoa inferred from 18S rRNA genes are characterized by unequal validity: some of them seem to be well grounded; others are not adequately supported, and probably will be revised later. The validity of phylogenetic reconstruction is influenced by two main factors: (1) erroneous grouping of long branches that occur because of abnormally high evolution rate; (2) deficit of phylogenetically informative characters. A method for overcoming these difficulties is suggested in addition to known tools: using phylogenetic markers that are stable within individual taxa and evolve by punctuated equilibrium. These markers are least influenced by the convergence caused by a high evolution rate of the entire gene. The nature of these markers of ancient taxa, paradoxical from the perspective of neutral evolution, is discussed, as well as their importance for establishing monophyly of both new large-scale taxonomic groups of invertebrates (Bilateria + Rhombozoa + Orthonectida + Myxozoa + Cnidaria + Placozoa and Echinodermata + Hemichordata) and some major taxa of Nematoda.

  10. Decisive Data Sets in Phylogenomics: Lessons from Studies on the Phylogenetic Relationships of Primarily Wingless Insects

    PubMed Central

    Meusemann, Karen; Meyer, Benjamin; Borner, Janus; Petersen, Malte; Aberer, Andre J.; Stamatakis, Alexandros; Walzl, Manfred G.; Minh, Bui Quang; von Haeseler, Arndt; Ebersberger, Ingo; Pass, Günther; Misof, Bernhard

    2014-01-01

    Phylogenetic relationships of the primarily wingless insects are still considered unresolved. Even the most comprehensive phylogenomic studies that addressed this question did not yield congruent results. To get a grip on these problems, we here analyzed the sources of incongruence in these phylogenomic studies by using an extended transcriptome data set. Our analyses showed that unevenly distributed missing data can be severely misleading by inflating node support despite the absence of phylogenetic signal. In consequence, only decisive data sets should be used which exclusively comprise data blocks containing all taxa whose relationships are addressed. Additionally, we used Four-cluster Likelihood Mapping (FcLM) to measure the degree of congruence among genes of a data set, as a measure of support alternative to bootstrap. FcLM showed incongruent signal among genes, which in our case is correlated neither with functional class assignment of these genes nor with model misspecification due to unpartitioned analyses. The herein analyzed data set is the currently largest data set covering primarily wingless insects, but failed to elucidate their interordinal phylogenetic relationships. Although this is unsatisfying from a phylogenetic perspective, we try to show that the analyses of structure and signal within phylogenomic data can protect us from biased phylogenetic inferences due to analytical artifacts. PMID:24140757

  11. Tracing the roots of syntax with Bayesian phylogenetics

    PubMed Central

    Maurits, Luke; Griffiths, Thomas L.

    2014-01-01

    The ordering of subject, verb, and object is one of the fundamental components of the syntax of natural languages. The distribution of basic word orders across the world’s languages is highly nonuniform, with the majority of languages being either subject-object-verb (SOV) or subject-verb-object (SVO). Explaining this fact using psychological accounts of language acquisition or processing requires understanding how the present distribution has resulted from ancestral distributions and the rates of change between orders. We show that Bayesian phylogenetics can provide quantitative answers to three important questions: how word orders are likely to change over time, which word orders were dominant historically, and whether strong inferences about the origins of syntax can be drawn from modern languages. We find that SOV to SVO change is more common than the reverse and VSO to SVO change is more common than VSO to SOV, and that if the seven language families we consider share a common ancestor then that common ancestor likely had SOV word order, but also that there are limits on how confidently we can make inferences about ancestral word order based on modern-day observations. These results shed new light on old questions from historical linguistics and provide clear targets for psychological explanations of word-order distributions. PMID:25192934

  12. Tracing the roots of syntax with Bayesian phylogenetics.

    PubMed

    Maurits, Luke; Griffiths, Thomas L

    2014-09-16

    The ordering of subject, verb, and object is one of the fundamental components of the syntax of natural languages. The distribution of basic word orders across the world's languages is highly nonuniform, with the majority of languages being either subject-object-verb (SOV) or subject-verb-object (SVO). Explaining this fact using psychological accounts of language acquisition or processing requires understanding how the present distribution has resulted from ancestral distributions and the rates of change between orders. We show that Bayesian phylogenetics can provide quantitative answers to three important questions: how word orders are likely to change over time, which word orders were dominant historically, and whether strong inferences about the origins of syntax can be drawn from modern languages. We find that SOV to SVO change is more common than the reverse and VSO to SVO change is more common than VSO to SOV, and that if the seven language families we consider share a common ancestor then that common ancestor likely had SOV word order, but also that there are limits on how confidently we can make inferences about ancestral word order based on modern-day observations. These results shed new light on old questions from historical linguistics and provide clear targets for psychological explanations of word-order distributions.

  13. Phylogenetic mapping of bacterial morphology

    NASA Technical Reports Server (NTRS)

    Siefert, J. L.; Fox, G. E.

    1998-01-01

    The availability of a meaningful molecular phylogeny for bacteria provides a context for examining the historical significance of various developments in bacterial evolution. Herein, the classical morphological descriptions of selected members of the domain Bacteria are mapped upon the genealogical ancestry deduced from comparison of small-subunit rRNA sequences. For the species examined in this study, a distinct pattern emerges which indicates that the coccus shape has arisen and accumulated independently multiple times in separate lineages and typically survived as a persistent end-state morphology. At least two other morphologies persist but have evolved only once. This study demonstrates that although bacterial morphology is not useful in defining bacterial phylogeny, it is remarkably consistent with that phylogeny once it is known. An examination of the experimental evidence available for morphogenesis as well as microbial fossil evidence corroborates these findings. It is proposed that the accumulation of persistent morphologies is a result of the biophysical properties of peptidoglycan and their genetic control, and that an evolved body-plan strategy based on peptidoglycan may have been a fate-sealing step in the evolution of Bacteria. More generally, this study illustrates that significant evolutionary insights can be obtained by examining biological and biochemical data in the context of a reliable phylogenetic structure.

  14. Phylogenetic insights into the history and diversification of fishes on reefs

    NASA Astrophysics Data System (ADS)

    Price, S. A.; Claverie, T.; Near, T. J.; Wainwright, P. C.

    2015-12-01

    Studies of the phylogenetic history of fishes on reefs and the impact of reefs on fish diversification have, to date, been limited to relatively small clades. We take advantage of a recent multi-locus, time-calibrated phylogeny of acanthomorph fishes and a broad-scale morphological dataset of body shape in reef acanthomorphs to explore the history and diversif