Vrancken, Bram; Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Longdon, Ben; Günthard, Huldrych F.; Suchard, Marc A.
2014-01-01
Phylogenetic signal quantifies the degree to which resemblance in continuously-valued traits reflects phylogenetic relatedness. Measures of phylogenetic signal are widely used in ecological and evolutionary research, and have recently been gaining traction in viral evolutionary studies. Standard estimators of phylogenetic signal frequently condition on data summary statistics of the repeated trait observations and fixed phylogenetic trees, resulting in information loss and potential bias. To incorporate the observation process and phylogenetic uncertainty in a model-based approach, we develop a novel Bayesian inference method to simultaneously estimate the evolutionary history and phylogenetic signal from molecular sequence data and repeated multivariate traits. Our approach builds upon a phylogenetic diffusion framework that models continuous trait evolution as a Brownian motion process and incorporates Pagel’s λ transformation parameter to estimate dependence among traits. We provide a computationally efficient inference implementation in the BEAST software package. We evaluate the synthetic performance of the Bayesian estimator of phylogenetic signal against standard estimators, and demonstrate the use of our coherent framework to address several virus-host evolutionary questions, including virulence heritability for HIV, antigenic evolution in influenza and HIV, and Drosophila sensitivity to sigma virus infection. Finally, we discuss model extensions that will make useful contributions to our flexible framework for simultaneously studying sequence and trait evolution. PMID:25780554
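As an illustration of the λ transformation mentioned above, here is a minimal sketch, not the BEAST implementation. It assumes the usual textbook form of Pagel's λ, in which the off-diagonal entries of the Brownian-motion covariance matrix (shared root-to-ancestor path lengths) are multiplied by λ while the diagonal is left unchanged. The function name, the restriction of λ to [0, 1], and the three-taxon tree are illustrative assumptions.

```python
def pagel_lambda_transform(cov, lam):
    """Scale the off-diagonal entries of a tip covariance matrix by lambda.

    cov: square list of lists; cov[i][j] is the shared path length from the
         root to the most recent common ancestor of tips i and j.
    lam: Pagel's lambda, restricted here to [0, 1] (a simplifying assumption).
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lambda is restricted to [0, 1] in this sketch")
    n = len(cov)
    return [
        [cov[i][j] if i == j else lam * cov[i][j] for j in range(n)]
        for i in range(n)
    ]


# Hypothetical ultrametric tree ((A:1,B:1):1,C:2): A and B share one unit of history.
shared = [
    [2.0, 1.0, 0.0],
    [1.0, 2.0, 0.0],
    [0.0, 0.0, 2.0],
]

no_signal = pagel_lambda_transform(shared, 0.0)    # tips behave as independent
half_signal = pagel_lambda_transform(shared, 0.5)  # covariances halved
full_signal = pagel_lambda_transform(shared, 1.0)  # unchanged Brownian motion
```

With λ = 1 the matrix is the plain Brownian-motion covariance. With λ = 0 all covariances vanish, so trait values carry no phylogenetic signal.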
Chao, Anne; Chiu, Chun-Huo; Colwell, Robert K; Magnago, Luiz Fernando S; Chazdon, Robin L; Gotelli, Nicholas J
2017-11-01
Estimating the species, phylogenetic, and functional diversity of a community is challenging because rare species are often undetected, even with intensive sampling. The Good-Turing frequency formula, originally developed for cryptography, estimates in an ecological context the true frequencies of rare species in a single assemblage based on an incomplete sample of individuals. Until now, this formula has never been used to estimate undetected species, phylogenetic, and functional diversity. Here, we first generalize the Good-Turing formula to incomplete sampling of two assemblages. The original formula and its two-assemblage generalization provide a novel and unified approach to notation, terminology, and estimation of undetected biological diversity. For species richness, the Good-Turing framework offers an intuitive way to derive the non-parametric estimators of the undetected species richness in a single assemblage, and of the undetected species shared between two assemblages. For phylogenetic diversity, the unified approach leads to an estimator of the undetected Faith's phylogenetic diversity (PD, the total length of undetected branches of a phylogenetic tree connecting all species), as well as a new estimator of undetected PD shared between two phylogenetic trees. For functional diversity based on species traits, the unified approach yields a new estimator of undetected Walker et al.'s functional attribute diversity (FAD, the total species-pairwise functional distance) in a single assemblage, as well as a new estimator of undetected FAD shared between two assemblages. Although some of the resulting estimators have been previously published (but derived with traditional mathematical inequalities), all taxonomic, phylogenetic, and functional diversity estimators are now derived under the same framework. 
All the derived estimators are theoretically lower bounds of the corresponding undetected diversities; our approach reveals the sufficient conditions under which the estimators are nearly unbiased, thus offering new insights. Simulation results are reported to numerically verify the performance of the derived estimators. We illustrate all estimators and assess their sampling uncertainty with an empirical dataset for Brazilian rain forest trees. These estimators should be widely applicable to many current problems in ecology, such as the effects of climate change on spatial and temporal beta diversity and the contribution of trait diversity to ecosystem multi-functionality. © 2017 by the Ecological Society of America.
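To make the idea of a lower-bound estimator of undetected richness concrete, the following sketch computes the classic Chao1 estimate of undetected species and Turing's sample-coverage estimate from abundance counts. Both are standard published formulas. Treating them as representative of the single-assemblage estimators in this abstract is an assumption, and the function names and sample data are hypothetical.

```python
from collections import Counter


def chao1_undetected(abundances):
    """Chao1 lower bound on the number of undetected species.

    Uses singletons (f1) and doubletons (f2):
        f0_hat = ((n - 1) / n) * f1**2 / (2 * f2)      if f2 > 0
        f0_hat = ((n - 1) / n) * f1 * (f1 - 1) / 2     if f2 == 0
    """
    counts = [a for a in abundances if a > 0]
    n = sum(counts)
    if n == 0:
        return 0.0
    freq = Counter(counts)
    f1, f2 = freq.get(1, 0), freq.get(2, 0)
    scale = (n - 1) / n
    if f2 > 0:
        return scale * f1 * f1 / (2 * f2)
    return scale * f1 * (f1 - 1) / 2


def turing_coverage(abundances):
    """Turing's estimate of sample coverage: 1 - f1 / n."""
    counts = [a for a in abundances if a > 0]
    n = sum(counts)
    f1 = sum(1 for a in counts if a == 1)
    return 1.0 - f1 / n


# Hypothetical sample: 8 observed species, 23 individuals, 4 singletons, 2 doubletons.
sample = [10, 5, 2, 2, 1, 1, 1, 1]
undetected = chao1_undetected(sample)
estimated_richness = len(sample) + undetected
coverage = turing_coverage(sample)
```

The estimate depends only on the rarest classes (f1 and f2), which is why it is a lower bound rather than an unbiased estimate in general.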
Krajewski, C; Fain, M G; Buckley, L; King, D G
1999-11-01
Debates over whether molecular sequence data should be partitioned for phylogenetic analysis often confound two types of heterogeneity among partitions. We distinguish historical heterogeneity (i.e., different partitions have different evolutionary relationships) from dynamic heterogeneity (i.e., different partitions show different patterns of sequence evolution) and explore the impact of the latter on phylogenetic accuracy and precision with a two-gene, mitochondrial data set for cranes. The well-established phylogeny of cranes allows us to contrast tree-based estimates of relevant parameter values with estimates based on pairwise comparisons and to ascertain the effects of incorporating different amounts of process information into phylogenetic estimates. We show that codon positions in the cytochrome b and NADH dehydrogenase subunit 6 genes are dynamically heterogeneous under both Poisson and invariable-sites + gamma-rates versions of the F84 model and that heterogeneity includes variation in base composition and transition bias as well as substitution rate. Estimates of transition-bias and relative-rate parameters from pairwise sequence comparisons were comparable to those obtained as tree-based maximum likelihood estimates. Neither rate-category nor mixed-model partitioning strategies resulted in a loss of phylogenetic precision relative to unpartitioned analyses. We suggest that weighted-average distances provide a computationally feasible alternative to direct maximum likelihood estimates of phylogeny for mixed-model analyses of large, dynamically heterogeneous data sets. Copyright 1999 Academic Press.
Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.
Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M
2012-02-01
The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).
Tang, Cuong Q; Humphreys, Aelys M; Fontaneto, Diego; Barraclough, Timothy G; Paradis, Emmanuel
2014-01-01
Coalescent-based species delimitation methods combine population genetic and phylogenetic theory to provide an objective means for delineating evolutionarily significant units of diversity. The generalised mixed Yule coalescent (GMYC) and the Poisson tree process (PTP) are methods that use ultrametric (GMYC or PTP) or non-ultrametric (PTP) gene trees as input, intended for use mostly with single-locus data such as DNA barcodes. Here, we assess how robust the GMYC and PTP are to different phylogenetic reconstruction and branch smoothing methods. We reconstruct over 400 ultrametric trees using up to 30 different combinations of phylogenetic and smoothing methods and perform over 2000 separate species delimitation analyses across 16 empirical data sets. We then assess how variable diversity estimates are, in terms of richness and identity, with respect to species delimitation, phylogenetic and smoothing methods. The PTP method generally generates diversity estimates that are more robust to different phylogenetic methods. The GMYC is more sensitive, but provides consistent estimates for BEAST trees. The lower consistency of GMYC estimates is likely a result of differences among gene trees introduced by the smoothing step. Unresolved nodes (real anomalies or methodological artefacts) affect both GMYC and PTP estimates, but have a greater effect on GMYC estimates. Branch smoothing is a difficult step and perhaps an underappreciated source of bias that may be widespread among studies of diversity and diversification. Nevertheless, careful choice of phylogenetic method does produce equivalent PTP and GMYC diversity estimates. We recommend simultaneous use of the PTP model with any model-based gene tree (e.g. RAxML) and GMYC approaches with BEAST trees for obtaining species hypotheses. PMID:25821577
A synthetic phylogeny of freshwater crayfish: insights for conservation.
Owen, Christopher L; Bracken-Grissom, Heather; Stern, David; Crandall, Keith A
2015-02-19
Phylogenetic systematics is heading for a renaissance where we shift from considering our phylogenetic estimates as a static image in a published paper and taxonomies as a hardcopy checklist to treating both the phylogenetic estimate and dynamic taxonomies as metadata for further analyses. The Open Tree of Life project (opentreeoflife.org) is developing synthesis tools for harnessing the power of phylogenetic inference and robust taxonomy to develop a synthetic tree of life. We capitalize on this approach to estimate a synthesis tree for the freshwater crayfish. The crayfish make an exceptional group to demonstrate the utility of the synthesis approach, as there recently have been a number of phylogenetic studies on the crayfishes along with a robust underlying taxonomic framework. Importantly, the crayfish have also been extensively assessed by an IUCN Red List team and therefore have accurate and up-to-date area and conservation status data available for analysis within a phylogenetic context. Here, we develop a synthesis phylogeny for the world's freshwater crayfish and examine the phylogenetic distribution of threat. We also estimate a molecular phylogeny based on all available GenBank crayfish sequences and use this tree to estimate divergence times and test for divergence rate variation. Finally, we conduct EDGE and HEDGE analyses and identify a number of species of freshwater crayfish of highest priority in conservation efforts. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
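The abstract does not state how EDGE scores are computed. The sketch below assumes the commonly cited definition, EDGE = ln(1 + ED) + GE × ln(2), with ED obtained by "fair proportion" (each branch length divided equally among the tips it subtends) and GE coded from 0 (Least Concern) to 4 (Critically Endangered). The tree, tip names, and function names are hypothetical, and HEDGE is not covered.

```python
import math

# Red List categories mapped to GE weights (0 = Least Concern ... 4 = Critically Endangered).
GE_WEIGHT = {"LC": 0, "NT": 1, "VU": 2, "EN": 3, "CR": 4}


def fair_proportion_ed(branches):
    """Evolutionary distinctiveness per tip.

    branches: iterable of (branch_length, tips_descending_from_that_branch).
    Each branch length is split equally among the tips below it.
    """
    ed = {}
    for length, tips in branches:
        share = length / len(tips)
        for tip in tips:
            ed[tip] = ed.get(tip, 0.0) + share
    return ed


def edge_score(ed, category):
    """EDGE = ln(1 + ED) + GE * ln(2)."""
    return math.log(1.0 + ed) + GE_WEIGHT[category] * math.log(2.0)


# Hypothetical tree ((A:1,B:1):1,C:2) written as a list of branches.
branches = [
    (1.0, ("A",)),
    (1.0, ("B",)),
    (1.0, ("A", "B")),
    (2.0, ("C",)),
]
ed = fair_proportion_ed(branches)
status = {"A": "CR", "B": "LC", "C": "VU"}  # made-up threat categories
ranking = sorted(ed, key=lambda tip: edge_score(ed[tip], status[tip]), reverse=True)
```

Because each step up in threat category adds ln(2), a one-category increase in extinction risk counts the same as a doubling of (1 + ED).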
A synthetic phylogeny of freshwater crayfish: insights for conservation
Owen, Christopher L.; Bracken-Grissom, Heather; Stern, David; Crandall, Keith A.
2015-01-01
Phylogenetic systematics is heading for a renaissance where we shift from considering our phylogenetic estimates as a static image in a published paper and taxonomies as a hardcopy checklist to treating both the phylogenetic estimate and dynamic taxonomies as metadata for further analyses. The Open Tree of Life project (opentreeoflife.org) is developing synthesis tools for harnessing the power of phylogenetic inference and robust taxonomy to develop a synthetic tree of life. We capitalize on this approach to estimate a synthesis tree for the freshwater crayfish. The crayfish make an exceptional group to demonstrate the utility of the synthesis approach, as there recently have been a number of phylogenetic studies on the crayfishes along with a robust underlying taxonomic framework. Importantly, the crayfish have also been extensively assessed by an IUCN Red List team and therefore have accurate and up-to-date area and conservation status data available for analysis within a phylogenetic context. Here, we develop a synthesis phylogeny for the world's freshwater crayfish and examine the phylogenetic distribution of threat. We also estimate a molecular phylogeny based on all available GenBank crayfish sequences and use this tree to estimate divergence times and test for divergence rate variation. Finally, we conduct EDGE and HEDGE analyses and identify a number of species of freshwater crayfish of highest priority in conservation efforts. PMID:25561670
PoMo: An Allele Frequency-Based Approach for Species Tree Estimation
De Maio, Nicola; Schrempf, Dominik; Kosiol, Carolin
2015-01-01
Incomplete lineage sorting can cause incongruencies of the overall species-level phylogenetic tree with the phylogenetic trees for individual genes or genomic segments. If these incongruencies are not accounted for, it is possible to incur several biases in species tree estimation. Here, we present a simple maximum likelihood approach that accounts for ancestral variation and incomplete lineage sorting. We use a POlymorphisms-aware phylogenetic MOdel (PoMo) that we have recently shown to efficiently estimate mutation rates and fixation biases from within and between-species variation data. We extend this model to perform efficient estimation of species trees. We test the performance of PoMo in several different scenarios of incomplete lineage sorting using simulations and compare it with existing methods both in accuracy and computational speed. In contrast to other approaches, our model does not use coalescent theory but is allele frequency based. We show that PoMo is well suited for genome-wide species tree estimation and that on such data it is more accurate than previous approaches. PMID:26209413
2010-01-01
Background Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. Results This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Conclusions Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service. PMID:21034504
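A rough sketch of how edge-by-edge placement uncertainty can be summarized follows. It is not pplacer's code. It assumes that per-edge log-likelihoods for one query are already available, normalizes them into weights (a likelihood weight ratio rather than the posterior probability the paper describes, which would also integrate over branch lengths), and then takes the weight-averaged distance between candidate placements. All names and numbers are hypothetical.

```python
import math


def likelihood_weights(log_likelihoods):
    """Normalize per-edge log-likelihoods into weights that sum to 1.

    Subtracting the maximum first keeps exp() from underflowing.
    """
    top = max(log_likelihoods)
    raw = [math.exp(ll - top) for ll in log_likelihoods]
    total = sum(raw)
    return [r / total for r in raw]


def expected_placement_distance(weights, distances):
    """Weight-averaged tree distance between all pairs of candidate placements.

    distances[i][j] is the path length between placement i and placement j.
    """
    n = len(weights)
    return sum(
        weights[i] * weights[j] * distances[i][j]
        for i in range(n)
        for j in range(n)
    )


# Hypothetical query with two equally likely edges that are 2.0 units apart.
log_likes = [-1000.0, -1000.0]
dist = [[0.0, 2.0], [2.0, 0.0]]
weights = likelihood_weights(log_likes)
spread = expected_placement_distance(weights, dist)
```

A query whose weight is split between neighbouring edges gives a small spread, while one split between distant parts of the tree gives a large spread even if the weights are identical.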
Phylogenetic overdispersion of plant species in southern Brazilian savannas.
Silva, I A; Batalha, M A
2009-08-01
Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutionary processes. Based on these assumptions, we may infer mechanisms of contemporary coexistence from the phylogenetic relationships of the species in a community. We studied the phylogenetic structure of plant communities in four cerrado sites, in southeastern Brazil. We calculated two raw phylogenetic distances among the species sampled. We estimated the phylogenetic structure by comparing the observed phylogenetic distances to the distribution of phylogenetic distances in null communities. We obtained null communities by randomizing the phylogenetic relationships of the regional pool of species. We found a phylogenetic overdispersion of the cerrado species. Phylogenetic overdispersion has several explanations, depending on the phylogenetic history of traits and contemporary ecological interactions. However, based on coexistence models between grasses and trees, density-dependent ecological forces, and the evolutionary history of the cerrado flora, we argue that the phylogenetic overdispersion of cerrado species is predominantly due to competitive interactions, herbivores and pathogen attacks, and ecological speciation. Future studies will need to include information on the phylogenetic history of plant traits.
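The comparison of observed distances with null communities can be sketched as a standardized effect size. The code below assumes mean pairwise distance (MPD) as the metric and a null model that shuffles species labels across the regional pool. The abstract does not name its two distance metrics, so this choice, the function names, and the toy distance matrix are illustrative assumptions.

```python
import random
from itertools import combinations
from statistics import mean, pstdev


def mpd(dist, species):
    """Mean pairwise phylogenetic distance among the species of one community."""
    return mean(dist[a][b] for a, b in combinations(species, 2))


def ses_mpd(dist, community, pool, n_null=999, seed=0):
    """(observed - mean(null)) / sd(null), with labels shuffled across the pool.

    Positive values suggest overdispersion, negative values suggest clustering.
    """
    rng = random.Random(seed)
    observed = mpd(dist, community)
    null_values = []
    for _ in range(n_null):
        shuffled = pool[:]
        rng.shuffle(shuffled)
        relabel = dict(zip(pool, shuffled))
        null_values.append(mpd(dist, [relabel[s] for s in community]))
    spread = pstdev(null_values)
    if spread == 0:
        return 0.0
    return (observed - mean(null_values)) / spread


# Hypothetical pool: two clades; distance 2 within a clade, 10 between clades.
pool = ["A1", "A2", "A3", "B1", "B2", "B3"]
dist = {
    a: {b: 0.0 if a == b else (2.0 if a[0] == b[0] else 10.0) for b in pool}
    for a in pool
}
overdispersed = ses_mpd(dist, ["A1", "B1"], pool)
clustered = ses_mpd(dist, ["A1", "A2"], pool)
```

Fixing the seed makes the null distribution reproducible. In practice the number of randomizations and the choice of null model both influence the result.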
Dor, Roi; Carling, Matthew D; Lovette, Irby J; Sheldon, Frederick H; Winkler, David W
2012-10-01
The New World swallow genus Tachycineta comprises nine species that collectively have a wide geographic distribution and remarkable variation both within- and among-species in ecologically important traits. Existing phylogenetic hypotheses for Tachycineta are based on mitochondrial DNA sequences, thus they provide estimates of a single gene tree. In this study we sequenced multiple individuals from each species at 16 nuclear intron loci. We used gene concatenated approaches (Bayesian and maximum likelihood) as well as coalescent-based species tree inference to reconstruct phylogenetic relationships of the genus. We examined the concordance and conflict between the nuclear and mitochondrial trees and between concatenated and coalescent-based inferences. Our results provide an alternative phylogenetic hypothesis to the existing mitochondrial DNA estimate of phylogeny. This new hypothesis provides a more accurate framework in which to explore trait evolution and examine the evolution of the mitochondrial genome in this group. Copyright © 2012 Elsevier Inc. All rights reserved.
Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.
2011-01-01
The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275
Garamszegi, László Zsolt
2011-02-01
Plasmodium parasites, the causative agents of malaria, are generally considered as harmful parasites, but many of them cause mild symptoms. Little is known about the evolutionary history and phylogenetic constraints that generate this interspecific variation in virulence due to uncertainties about the phylogenetic associations of parasites. Here, to account for such phylogenetic uncertainty, phylogenetic methods based on Bayesian statistics were followed in combination with sequence data from five genes to estimate the ancestral state of virulence in primate Plasmodium parasites. When recent parasites were categorised according to the damage caused to the host, Bayesian estimates of ancestral states indicated that the acquisition of a harmful host exploitation strategy is more likely to be a recent evolutionary event than a result of an ancient change in a character state altering virulence. On the contrary, there was more evidence for moderate host exploitation having a deep origin along the phylogenetic tree. Moreover, the evolution of host severity is determined by the phylogenetic relationships of parasites, as severity gains did not appear randomly on the evolutionary tree. Such phylogenetic constraints can be mediated by the acquisition of virulence genes. As the impact of a parasite on a host is the result of both the parasite's investment in reproduction and host sensitivity, virulence was also estimated by calculating peak parasitemia after eliminating host effects. A directional random-walk evolutionary model showed that the ancestral primate malarias reproduced at very low parasitemia in their hosts. Consequently, the extreme variation in the outcome of malaria infection in different host species can be better understood in light of the phylogeny of parasites. Copyright © 2010 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Dornburg, Alex; Friedman, Matt; Near, Thomas J
2015-08-01
Elopomorpha is one of the three main clades of living teleost fishes and includes a range of disparate lineages including eels, tarpons, bonefishes, and halosaurs. Elopomorphs were among the first groups of fishes investigated using Hennigian phylogenetic methods and continue to be the object of intense phylogenetic scrutiny due to their economic significance, diversity, and crucial evolutionary status as the sister group of all other teleosts. While portions of the phylogenetic backbone for Elopomorpha are consistent between studies, the relationships among Albula, Pterothrissus, Notacanthiformes, and Anguilliformes remain contentious and difficult to evaluate. This lack of phylogenetic resolution is problematic as fossil lineages are often described and placed taxonomically based on an assumed sister group relationship between Albula and Pterothrissus. In addition, phylogenetic studies using morphological data that sample elopomorph fossil lineages often do not include notacanthiform or anguilliform lineages, potentially introducing a bias toward interpreting fossils as members of the common stem of Pterothrissus and Albula. Here we provide a phylogenetic analysis of DNA sequences sampled from multiple nuclear genes that include representative taxa from Albula, Pterothrissus, Notacanthiformes and Anguilliformes. We integrate our molecular dataset with a morphological character matrix that spans both living and fossil elopomorph lineages. Our results reveal substantial uncertainty in the placement of Pterothrissus as well as all sampled fossil lineages, questioning the stability of the taxonomy of fossil Elopomorpha. However, despite topological uncertainty, our integration of fossil lineages into a Bayesian time calibrated framework provides divergence time estimates for the clade that are consistent with previously published age estimates based on the elopomorph fossil record and molecular estimates resulting from traditional node-dating methods. 
Copyright © 2015 Elsevier Inc. All rights reserved.
Gravuer, Kelly; Eskelinen, Anu
2017-01-01
Microbial traits related to ecological responses and functions could provide a common currency facilitating synthesis and prediction; however, such traits are difficult to measure directly for all taxa in environmental samples. Past efforts to estimate trait values based on phylogenetic relationships have not always distinguished between traits with high and low phylogenetic conservatism, limiting reliability, especially in poorly known environments, such as soil. Using updated reference trees and phylogenetic relationships, we estimated two phylogenetically conserved traits hypothesized to be ecologically important from DNA sequences of the 16S rRNA gene from soil bacterial and archaeal communities. We sampled these communities from an environmental change experiment in California grassland applying factorial addition of late-season precipitation and soil nutrients to multiple soil types for 3 years prior to sampling. Estimated traits were rRNA gene copy number, which contributes to how rapidly a microbe can respond to an increase in resources and may be related to its maximum growth rate, and genome size, which suggests the breadth of environmental and substrate conditions in which a microbe can thrive. Nutrient addition increased community-weighted mean estimated rRNA gene copy number and marginally increased estimated genome size, whereas precipitation addition decreased these community means for both estimated traits. The effects of both treatments on both traits were associated with soil properties, such as ammonium, available phosphorus, and pH. Estimated trait responses within several phyla were opposite to the community mean response, indicating that microbial responses, although largely consistent among soil types, were not uniform across the tree of life. Our results show that phylogenetic estimation of microbial traits can provide insight into how microbial ecological strategies interact with environmental changes. 
The method could easily be applied to any of the thousands of existing 16S rRNA sequence data sets and offers potential to improve our understanding of how microbial communities mediate ecosystem function responses to global changes.
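The community-weighted mean reported above is, in its simplest form, an abundance-weighted average of per-taxon trait estimates. A minimal sketch follows, assuming taxa without a trait estimate are simply dropped. The taxon names, abundances, and copy numbers are made up for illustration.

```python
def community_weighted_mean(abundances, trait):
    """Abundance-weighted mean of a trait across the taxa in one sample.

    abundances: dict mapping taxon -> read count or relative abundance.
    trait:      dict mapping taxon -> estimated trait value.
    Taxa missing from `trait` are ignored (an assumption of this sketch).
    """
    shared = [taxon for taxon in abundances if taxon in trait]
    total = sum(abundances[taxon] for taxon in shared)
    if total == 0:
        raise ValueError("no abundance for taxa with trait estimates")
    return sum(abundances[taxon] * trait[taxon] for taxon in shared) / total


# Hypothetical sample: estimated 16S rRNA gene copy number per taxon.
reads = {"taxon_a": 30, "taxon_b": 10, "taxon_unknown": 5}
copy_number = {"taxon_a": 2, "taxon_b": 6}
cwm_copy_number = community_weighted_mean(reads, copy_number)
```

Comparing this value between treatment and control samples is what allows a statement such as "nutrient addition increased community-weighted mean copy number".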
The Independent Evolution Method Is Not a Viable Phylogenetic Comparative Method
2015-01-01
Phylogenetic comparative methods (PCMs) use data on species traits and phylogenetic relationships to shed light on evolutionary questions. Recently, Smaers and Vinicius suggested a new PCM, Independent Evolution (IE), which purportedly employs a novel model of evolution based on Felsenstein’s Adaptive Peak Model. The authors found that IE improves upon previous PCMs by producing more accurate estimates of ancestral states, as well as separate estimates of evolutionary rates for each branch of a phylogenetic tree. Here, we document substantial theoretical and computational issues with IE. When data are simulated under a simple Brownian motion model of evolution, IE produces severely biased estimates of ancestral states and changes along individual branches. We show that these branch-specific changes are essentially ancestor-descendant or “directional” contrasts, and draw parallels between IE and previous PCMs such as “minimum evolution”. Additionally, while comparisons of branch-specific changes between variables have been interpreted as reflecting the relative strength of selection on those traits, we demonstrate through simulations that regressing IE estimated branch-specific changes against one another gives a biased estimate of the scaling relationship between these variables, and provides no advantages or insights beyond established PCMs such as phylogenetically independent contrasts. In light of our findings, we discuss the results of previous papers that employed IE. We conclude that Independent Evolution is not a viable PCM, and should not be used in comparative analyses. PMID:26683838
A phylogenetic analysis of the megadiverse Chalcidoidea (Hymenoptera)
USDA-ARS's Scientific Manuscript database
Chalcidoidea (Hymenoptera) are extremely diverse with an estimated 500,000 species. We present the first phylogenetic analysis of the superfamily based on a cladistic analysis of both morphological and molecular data. A total of 233 morphological characters were scored for 300 taxa and 265 genera, a...
Phylesystem: a git-based data store for community-curated phylogenetic estimates.
McTavish, Emily Jane; Hinchliff, Cody E; Allman, James F; Brown, Joseph W; Cranston, Karen A; Holder, Mark T; Rees, Jonathan A; Smith, Stephen A
2015-09-01
Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data files often require some editing (e.g. rerooting) to improve the accuracy and reusability of the phylogenetic statements. Furthermore, establishing the mapping between tip labels used in a tree and taxa in a single common taxonomy dramatically improves the ability of other researchers to reuse phylogenetic estimates. As the process of curating a published phylogenetic estimate is not error-free, retaining a full record of the provenance of edits to a tree is crucial for openness, allowing editors to receive credit for their work and making errors introduced during curation easier to correct. Here, we report the development of software infrastructure to support the open curation of phylogenetic data by the community of biologists. The backend of the system provides an interface for the standard database operations of creating, reading, updating and deleting records by making commits to a git repository. The record of the history of edits to a tree is preserved by git's version control features. Hosting this data store on GitHub (http://github.com/) provides open access to the data store using tools familiar to many developers. We have deployed a server running the 'phylesystem-api', which wraps the interactions with git and GitHub. The Open Tree of Life project has also developed and deployed a JavaScript application that uses the phylesystem-api and other web services to enable input and curation of published phylogenetic statements. Source code for the web service layer is available at https://github.com/OpenTreeOfLife/phylesystem-api. The data store can be cloned from: https://github.com/OpenTreeOfLife/phylesystem. 
A web application that uses the phylesystem web services is deployed at http://tree.opentreeoflife.org/curator. Code for that tool is available from https://github.com/OpenTreeOfLife/opentree. mtholder@gmail.com. © The Author 2015. Published by Oxford University Press.
Blom, Mozes P K; Bragg, Jason G; Potter, Sally; Moritz, Craig
2017-05-01
Accurate gene tree inference is an important aspect of species tree estimation in a summary-coalescent framework. Yet, in empirical studies, inferred gene trees differ in accuracy due to stochastic variation in phylogenetic signal between targeted loci. Empiricists should, therefore, examine the consistency of species tree inference, while accounting for the observed heterogeneity in gene tree resolution of phylogenomic data sets. Here, we assess the impact of gene tree estimation error on summary-coalescent species tree inference by screening ~2000 exonic loci based on gene tree resolution prior to phylogenetic inference. We focus on a phylogenetically challenging radiation of Australian lizards (genus Cryptoblepharus, Scincidae) and explore effects on topology and support. We identify a well-supported topology based on all loci and find that a relatively small number of high-resolution gene trees can be sufficient to converge on the same topology. Adding gene trees with decreasing resolution produced a generally consistent topology, and increased confidence for specific bipartitions that were poorly supported when using a small number of informative loci. This corroborates coalescent-based simulation studies that have highlighted the need for a large number of loci to confidently resolve challenging relationships and refutes the notion that low-resolution gene trees introduce phylogenetic noise. Further, our study also highlights the value of quantifying changes in nodal support across locus subsets of increasing size (but decreasing gene tree resolution). Such detailed analyses can reveal anomalous fluctuations in support at some nodes, suggesting the possibility of model violation. By characterizing the heterogeneity in phylogenetic signal among loci, we can account for uncertainty in gene tree estimation and assess its effect on the consistency of the species tree estimate.
We suggest that the evaluation of gene tree resolution should be incorporated in the analysis of empirical phylogenomic data sets. This will ultimately increase our confidence in species tree estimation using summary-coalescent methods and enable us to exploit genomic data for phylogenetic inference. [Coalescence; concatenation; Cryptoblepharus; exon capture; gene tree; phylogenomics; species tree.]. © The authors 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please e-mail: journals.permission@oup.com.
Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John
2014-01-01
Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. 
Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723
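The MPD and MNTD metrics named in this abstract can be illustrated with a minimal sketch. It assumes presence/absence data (no abundance weighting) and a precomputed matrix of patristic distances; the function names, taxon labels, and distance values are hypothetical and are not taken from the study.

```python
from itertools import combinations


def mpd(dist, taxa):
    """Mean pairwise phylogenetic distance among the taxa of one community."""
    pairs = list(combinations(taxa, 2))
    return sum(dist[a][b] for a, b in pairs) / len(pairs)


def mntd(dist, taxa):
    """Mean distance from each taxon to its nearest relative in the community."""
    nearest = [min(dist[a][b] for b in taxa if b != a) for a in taxa]
    return sum(nearest) / len(nearest)


# Hypothetical patristic distances (summed branch lengths between tips).
dist = {
    "A": {"A": 0.0, "B": 2.0, "C": 6.0, "D": 6.0},
    "B": {"A": 2.0, "B": 0.0, "C": 6.0, "D": 6.0},
    "C": {"A": 6.0, "B": 6.0, "C": 0.0, "D": 4.0},
    "D": {"A": 6.0, "B": 6.0, "C": 4.0, "D": 0.0},
}

# A community is simply the subset of taxa recorded in one plot.
community = ["A", "B", "C"]
community_mpd = mpd(dist, community)
community_mntd = mntd(dist, community)
print(community_mpd, community_mntd)
```

In practice these raw values are usually compared against a null distribution to judge whether a community is more closely related than expected by chance; that step is omitted here.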
Xi, Zhenxiang; Liu, Liang; Davis, Charles C
2015-11-01
The development and application of coalescent methods are undergoing rapid changes. One little explored area that bears on the application of gene-tree-based coalescent methods to species tree estimation is gene informativeness. Here, we investigate the accuracy of these coalescent methods when genes have minimal phylogenetic information, including the implementation of the multilocus bootstrap approach. Using simulated DNA sequences, we demonstrate that genes with minimal phylogenetic information can produce unreliable gene trees (i.e., high error in gene tree estimation), which may in turn reduce the accuracy of species tree estimation using gene-tree-based coalescent methods. We demonstrate that this problem can be alleviated by sampling more genes, as is commonly done in large-scale phylogenomic analyses. This applies even when these genes are minimally informative. If gene tree estimation is biased, however, gene-tree-based coalescent analyses will produce inconsistent results, which cannot be remedied by increasing the number of genes. In this case, it is not the gene-tree-based coalescent methods that are flawed, but rather the input data (i.e., estimated gene trees). Along these lines, the commonly used program PhyML has a tendency to infer one particular bifurcating topology even though it is best represented as a polytomy. We additionally corroborate these findings by analyzing the 183-locus mammal data set assembled by McCormack et al. (2012) using ultra-conserved elements (UCEs) and flanking DNA. Lastly, we demonstrate that when employing the multilocus bootstrap approach on this 183-locus data set, there is no strong conflict between species trees estimated from concatenation and gene-tree-based coalescent analyses, as has been previously suggested by Gatesy and Springer (2014). Copyright © 2015 Elsevier Inc. All rights reserved.
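The multilocus bootstrap mentioned in this abstract is commonly described as a two-stage resampling: loci are resampled with replacement, then alignment columns are resampled within each drawn locus. The sketch below assumes that two-stage form; the data layout, function name, and toy sequences are hypothetical, and the gene tree and species tree estimation that would follow each replicate is left out.

```python
import random


def multilocus_bootstrap(loci, rng):
    """Return one bootstrap replicate from {locus: {taxon: aligned sequence}}."""
    names = list(loci)
    replicate = []
    # Stage 1: resample loci with replacement.
    for name in rng.choices(names, k=len(names)):
        alignment = loci[name]
        n_sites = len(next(iter(alignment.values())))
        # Stage 2: resample alignment columns with replacement within the locus.
        columns = rng.choices(range(n_sites), k=n_sites)
        resampled = {
            taxon: "".join(seq[c] for c in columns)
            for taxon, seq in alignment.items()
        }
        replicate.append((name, resampled))
    return replicate


# Toy alignments; real loci would be far longer.
loci = {
    "locus1": {"sp1": "ACGTAC", "sp2": "ACGTTC", "sp3": "ACGAAC"},
    "locus2": {"sp1": "GGTA", "sp2": "GGTC", "sp3": "GATA"},
}
rng = random.Random(1)  # fixed seed so the sketch is repeatable
replicate = multilocus_bootstrap(loci, rng)
```

Each replicate keeps the number of loci and the length of every alignment, so the same gene tree and species tree pipeline can be run on it unchanged.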
Adams, Dean C
2014-09-01
Phylogenetic signal is the tendency for closely related species to display similar trait values due to their common ancestry. Several methods have been developed for quantifying phylogenetic signal in univariate traits and for sets of traits treated simultaneously, and the statistical properties of these approaches have been extensively studied. However, methods for assessing phylogenetic signal in high-dimensional multivariate traits like shape are less well developed, and their statistical performance is not well characterized. In this article, I describe a generalization of the K statistic of Blomberg et al. that is useful for quantifying and evaluating phylogenetic signal in highly dimensional multivariate data. The method (K(mult)) is found from the equivalency between statistical methods based on covariance matrices and those based on distance matrices. Using computer simulations based on Brownian motion, I demonstrate that the expected value of K(mult) remains at 1.0 as trait variation among species is increased or decreased, and as the number of trait dimensions is increased. By contrast, estimates of phylogenetic signal found with a squared-change parsimony procedure for multivariate data change with increasing trait variation among species and with increasing numbers of trait dimensions, confounding biological interpretations. I also evaluate the statistical performance of hypothesis testing procedures based on K(mult) and find that the method displays appropriate Type I error and high statistical power for detecting phylogenetic signal in high-dimensional data. Statistical properties of K(mult) were consistent for simulations using bifurcating and random phylogenies, for simulations using different numbers of species, for simulations that varied the number of trait dimensions, and for different underlying models of trait covariance structure. 
Overall these findings demonstrate that K(mult) provides a useful means of evaluating phylogenetic signal in high-dimensional multivariate traits. Finally, I illustrate the utility of the new approach by evaluating the strength of phylogenetic signal for head shape in a lineage of Plethodon salamanders. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Genomic Repeat Abundances Contain Phylogenetic Signal
Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.
2015-01-01
A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464
Using Genotype Abundance to Improve Phylogenetic Inference
Mesin, Luka; Victora, Gabriel D; Minin, Vladimir N; Matsen, Frederick A
2018-01-01
Modern biological techniques enable very dense genetic sampling of unfolding evolutionary histories, and thus frequently sample some genotypes multiple times. This motivates strategies to incorporate genotype abundance information in phylogenetic inference. In this article, we synthesize a stochastic process model with standard sequence-based phylogenetic optimality, and show that tree estimation is substantially improved by doing so. Our method is validated with extensive simulations and an experimental single-cell lineage tracing study of germinal center B cell receptor affinity maturation. PMID:29474671
Liang, Li-Jung; Weiss, Robert E; Redelings, Benjamin; Suchard, Marc A
2009-10-01
Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest. We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.
Faith, Daniel P
2008-12-01
New species conservation strategies, including the EDGE of Existence (EDGE) program, have expanded threatened species assessments by integrating information about species' phylogenetic distinctiveness. Distinctiveness has been measured through simple scores that assign shared credit among species for evolutionary heritage represented by the deeper phylogenetic branches. A species with a high score combined with a high extinction probability receives high priority for conservation efforts. Simple hypothetical scenarios for phylogenetic trees and extinction probabilities demonstrate how such scoring approaches can provide inefficient priorities for conservation. An existing probabilistic framework derived from the phylogenetic diversity measure (PD) properly captures the idea of shared responsibility for the persistence of evolutionary history. It avoids static scores, takes into account the status of close relatives through their extinction probabilities, and allows for the necessary updating of priorities in light of changes in species threat status. A hypothetical phylogenetic tree illustrates how changes in extinction probabilities of one or more species translate into changes in expected PD. The probabilistic PD framework provided a range of strategies that moved beyond expected PD to better consider worst-case PD losses. In another example, risk aversion gave higher priority to a conservation program that provided a smaller, but less risky, gain in expected PD. The EDGE program could continue to promote a list of top species conservation priorities through application of probabilistic PD and simple estimates of current extinction probability. The list might be a dynamic one, with all the priority scores updated as extinction probabilities change. Results of recent studies suggest that estimation of extinction probabilities derived from the red list criteria linked to changes in species range sizes may provide estimated probabilities for many different species. 
Probabilistic PD provides a framework for single-species assessment that is well-integrated with a broader measurement of impacts on PD owing to climate change and other factors.
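The idea of expected PD described in this abstract can be sketched as follows, assuming extinctions are independent across species: each branch contributes its length multiplied by the probability that at least one of its descendant species survives. The tree encoding, function name, and probabilities are hypothetical illustrations, not values from the paper.

```python
from math import prod


def expected_pd(branches, p_ext):
    """Sum of branch lengths, each weighted by P(at least one descendant survives)."""
    total = 0.0
    for length, descendants in branches:
        p_all_lost = prod(p_ext[s] for s in descendants)
        total += length * (1.0 - p_all_lost)
    return total


# Hypothetical tree ((A:1,B:1):2,C:3), stored as (branch length, descendant tips).
branches = [
    (1.0, ["A"]),
    (1.0, ["B"]),
    (2.0, ["A", "B"]),
    (3.0, ["C"]),
]
p_ext = {"A": 0.5, "B": 0.5, "C": 0.1}

before = expected_pd(branches, p_ext)
# Suppose a conservation action lowers the extinction probability of A.
after = expected_pd(branches, {**p_ext, "A": 0.1})
gain = after - before
print(before, after, gain)
```

Because the shared branch above A and B is already partly secured by B, the gain from protecting A depends on the status of its close relative, which is the behaviour that static scores cannot capture.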
Tetrapods on the EDGE: Overcoming data limitations to identify phylogenetic conservation priorities
Gray, Claudia L.; Wearn, Oliver R.; Owen, Nisha R.
2018-01-01
The scale of the ongoing biodiversity crisis requires both effective conservation prioritisation and urgent action. As extinction is non-random across the tree of life, it is important to prioritise threatened species which represent large amounts of evolutionary history. The EDGE metric prioritises species based on their Evolutionary Distinctiveness (ED), which measures the relative contribution of a species to the total evolutionary history of their taxonomic group, and Global Endangerment (GE), or extinction risk. EDGE prioritisations rely on adequate phylogenetic and extinction risk data to generate meaningful priorities for conservation. However, comprehensive phylogenetic trees of large taxonomic groups are extremely rare and, even when available, become quickly out-of-date due to the rapid rate of species descriptions and taxonomic revisions. Thus, it is important that conservationists can use the available data to incorporate evolutionary history into conservation prioritisation. We compared published and new methods to estimate missing ED scores for species absent from a phylogenetic tree whilst simultaneously correcting the ED scores of their close taxonomic relatives. We found that following artificial removal of species from a phylogenetic tree, the new method provided the closest estimates of their “true” ED score, differing from the true ED score by an average of less than 1%, compared to the 31% and 38% difference of the previous methods. The previous methods also substantially under- and over-estimated scores as more species were artificially removed from a phylogenetic tree. We therefore used the new method to estimate ED scores for all tetrapods. From these scores we updated EDGE prioritisation rankings for all tetrapod species with IUCN Red List assessments, including the first EDGE prioritisation for reptiles. 
Further, we identified criteria to identify robust priority species in an effort to further inform conservation action whilst limiting uncertainty and anticipating future phylogenetic advances. PMID:29641585
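For orientation, the sketch below computes ED by the fair-proportion rule (each branch length is shared equally among its descendant species) and combines it with extinction risk using the commonly cited form EDGE = ln(1 + ED) + GE × ln(2), with GE running from 0 (Least Concern) to 4 (Critically Endangered). The abstract does not state this formula, so treat it as an assumption; the tree, names, and categories are hypothetical, and the paper's method for estimating ED of species missing from the tree is not reproduced.

```python
from math import log

# Red List categories mapped to Global Endangerment weights (assumed convention).
GE_WEIGHT = {"LC": 0, "NT": 1, "VU": 2, "EN": 3, "CR": 4}


def fair_proportion_ed(branches):
    """Share each branch length equally among the tips descending from it."""
    ed = {}
    for length, descendants in branches:
        share = length / len(descendants)
        for species in descendants:
            ed[species] = ed.get(species, 0.0) + share
    return ed


def edge_score(ed_value, category):
    """EDGE = ln(1 + ED) + GE * ln(2)."""
    return log(1.0 + ed_value) + GE_WEIGHT[category] * log(2.0)


# Hypothetical tree ((A:1,B:1):2,C:3), stored as (branch length, descendant tips).
branches = [
    (1.0, ["A"]),
    (1.0, ["B"]),
    (2.0, ["A", "B"]),
    (3.0, ["C"]),
]
ed = fair_proportion_ed(branches)
score_c = edge_score(ed["C"], "EN")
print(ed, score_c)
```

Under this form, each step up in threat category adds ln(2) to the score, corresponding to a doubling of the assumed extinction risk.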
Phylogenetic analysis of the envelope protein (domain III) of dengue 4 viruses
Mota, Javier; Ramos-Castañeda, José; Rico-Hesse, Rebeca; Ramos, Celso
2011-01-01
Objective To evaluate the genetic variability of domain III of the envelope (E) protein and to estimate phylogenetic relationships of dengue 4 (Den-4) viruses isolated in Mexico and in other endemic areas of the world. Material and Methods A phylogenetic study of domain III of the envelope (E) protein of Den-4 viruses was conducted in 1998 using virus strains from Mexico and other parts of the world, isolated in different years. Specific primers were used to amplify domain III by RT-PCR and to obtain its nucleotide sequence. Based on the nucleotide and deduced amino acid sequences, genetic variability was estimated and a phylogenetic tree was generated. To simplify genetic analysis of the domain III region, a Restriction Fragment Length Polymorphism (RFLP) assay was performed using six restriction enzymes. Results Nucleotide and amino acid sequence analyses of domain III gave results similar to those reported for the complete E protein gene. Based on the RFLP analysis of domain III using the restriction enzymes Nla III, Dde I and Cfo I, the Den-4 viruses included in this study clustered into the previously reported genotypes 1 and 2. Conclusions Study results suggest that domain III may be used as a genetic marker for phylogenetic and molecular epidemiology studies of dengue viruses. The English version of this paper is also available at: http://www.insp.mx/salud/index.html PMID:12132320
Inferring Phylogenetic Networks Using PhyloNet.
Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay
2018-07-01
PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyses and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.
Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees.
Martínez-Aquino, Andrés
2016-08-01
Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host-parasite. In the past 2 decades, algorithms have been developed for cophylogenetic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetic inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, advances in phylogenetic biogeography that apply methods based on parametric process models and combined Bayesian approaches can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages for cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a "compass" when "walking" through jungles of tangled phylogenetic trees. PMID:29491928
Zhou, Xuming; Xu, Shixia; Xu, Junxiao; Chen, Bingyao; Zhou, Kaiya; Yang, Guang
2012-01-01
Although great progress has been made in resolving the relationships of placental mammals, the position of several clades in Laurasiatheria remains controversial. In this study, we performed a phylogenetic analysis of 97 orthologs (46,152 bp) for 15 taxa, representing all laurasiatherian orders. Additionally, phylogenetic trees of laurasiatherian mammals with draft genome sequences were reconstructed based on 1608 exons (2,175,102 bp). Our reconstructions resolve the interordinal relationships within Laurasiatheria and corroborate the clades Scrotifera, Fereuungulata, and Cetartiodactyla. Furthermore, we tested alternative topologies within Laurasiatheria, and among alternatives for the phylogenetic position of Perissodactyla, a sister-group relationship with Cetartiodactyla receives the highest support. Thus, Pegasoferae (Perissodactyla + Carnivora + Pholidota + Chiroptera) does not appear to be a natural group. Divergence time estimates from these genes were compared with published estimates for splits within Laurasiatheria. Our estimates were similar to those of several studies and suggest that the divergences among these orders occurred within just a few million years. PMID:21900649
Admir J. Giachini; Kentaro Hosaka; Eduardo Nouhra; Joseph Spatafora; James M. Trappe
2010-01-01
Phylogenetic relationships among Geastrales, Gomphales, Hysterangiales, and Phallales were estimated via combined sequences: nuclear large subunit ribosomal DNA (nuc-25S-rDNA), mitochondrial small subunit ribosomal DNA (mit-12S-rDNA), and mitochondrial atp6 DNA (mit-atp6-DNA). Eighty-one taxa comprising 19 genera and 58 species...
Breinholt, Jesse W.; Porter, Megan L.; Crandall, Keith A.
2012-01-01
Background The genus Cambarus is one of the three most species-rich crayfish genera in the Northern Hemisphere. The genus has its center of diversity in the Southern Appalachians of the United States and has been divided into 12 subgenera. Using Cambarus, we test the correspondence of subgeneric designations based on morphology used in traditional crayfish taxonomy to the underlying evolutionary history for these crayfish. We further test for significant correlation and explanatory power of geographic distance, a taxonomic model, and a habitat model to estimated phylogenetic distance with multiple variable regression. Methodology/Principal Findings We use three mitochondrial and one nuclear gene regions to estimate the phylogenetic relationships for species within the genus Cambarus and test evolutionary hypotheses of relationships and associated morphological and biogeographical hypotheses. Our resulting phylogeny indicates that the genus Cambarus is polyphyletic; however, we fail to reject the monophyly of Cambarus with a topology test. The majority of the Cambarus subgenera are rejected as monophyletic, suggesting the morphological characters used to define those taxa are subject to convergent evolution. While we found incongruence between taxonomy and estimated phylogenetic relationships, a multiple model regression analysis indicates that taxonomy had more explanatory power for genetic relationships than either habitat or geographic distance. Conclusions We find convergent evolution has impacted the morphological features used to delimit Cambarus subgenera. Studies of the crayfish genus Orconectes have shown that gonopod morphology used to delimit subgenera is also affected by convergent evolution. This suggests that morphological diagnoses based on traditional crayfish taxonomy might be confounded by convergent evolution across the cambarids and have little utility in diagnosing relationships or defining natural groups. 
We further suggest that convergent morphological evolution appears to be a common occurrence in invertebrates suggesting the need for careful phylogenetically based interpretations of morphological evolution in invertebrate systematics. PMID:23049950
Poon, Art F. Y.; Joy, Jeffrey B.; Woods, Conan K.; Shurgold, Susan; Colley, Guillaume; Brumme, Chanson J.; Hogg, Robert S.; Montaner, Julio S. G.; Harrigan, P. Richard
2015-01-01
Background. The diversification of human immunodeficiency virus (HIV) is shaped by its transmission history. We therefore used a population-based, province-wide HIV drug resistance database in British Columbia (BC), Canada, to evaluate the impact of clinical, demographic, and behavioral factors on rates of HIV transmission. Methods. We reconstructed molecular phylogenies from 27 296 anonymized bulk HIV pol sequences representing 7747 individuals in BC, about half the estimated HIV prevalence in BC. Infections were grouped into clusters based on phylogenetic distances, as a proxy for variation in transmission rates. Rates of cluster expansion were reconstructed from estimated dates of HIV seroconversion. Results. Our criteria grouped 4431 individuals into 744 clusters largely separated with respect to risk factors, including large established clusters predominated by injection drug users and more-recently emerging clusters comprising men who have sex with men. The mean log10 viral load of an individual's phylogenetic neighborhood (composed of 5 other individuals with shortest phylogenetic distances) increased their odds of appearing in a cluster by >2-fold per log10 viruses per milliliter. Conclusions. Hotspots of ongoing HIV transmission can be characterized in near real time by the secondary analysis of HIV resistance genotypes, providing an important potential resource for targeting public health initiatives for HIV prevention. PMID:25312037
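One simple way to group infections by phylogenetic distance is single-linkage clustering under a distance cutoff, sketched below with a union-find structure. The abstract does not state the study's actual clustering criteria, so the cutoff, the function name, and the toy distances are assumptions made purely for illustration.

```python
def cluster_by_distance(pairwise, cutoff):
    """Single-linkage clusters: link any two individuals whose distance <= cutoff."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for (a, b), distance in pairwise.items():
        root_a, root_b = find(a), find(b)
        if distance <= cutoff and root_a != root_b:
            parent[root_a] = root_b

    groups = {}
    for individual in list(parent):
        groups.setdefault(find(individual), set()).add(individual)
    # Individuals linked to nobody are not reported as clusters.
    return [members for members in groups.values() if len(members) > 1]


# Hypothetical pairwise tip-to-tip distances (substitutions per site).
pairwise = {
    ("p1", "p2"): 0.010,
    ("p1", "p3"): 0.030,
    ("p1", "p4"): 0.200,
    ("p2", "p3"): 0.015,
    ("p2", "p4"): 0.210,
    ("p3", "p4"): 0.190,
}
clusters = cluster_by_distance(pairwise, cutoff=0.02)
print(clusters)
```

Note that p1 and p3 end up in the same cluster even though their direct distance exceeds the cutoff, because both are linked through p2; stricter criteria would avoid this chaining.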
Effects of 16S rDNA sampling on estimates of the number of endosymbiont lineages in sucking lice
Burleigh, J. Gordon; Light, Jessica E.; Reed, David L.
2016-01-01
Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthiraptera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain. PMID:27547523
Estimation of rates-across-sites distributions in phylogenetic substitution models.
Susko, Edward; Field, Chris; Blouin, Christian; Roger, Andrew J
2003-10-01
Previous work has shown that it is often essential to account for the variation in rates at different sites in phylogenetic models in order to avoid phylogenetic artifacts such as long branch attraction. In most current models, the gamma distribution is used for the rates-across-sites distributions and is implemented as an equal-probability discrete gamma. In this article, we introduce discrete distribution estimates with large numbers of equally spaced rate categories allowing us to investigate the appropriateness of the gamma model. With large numbers of rate categories, these discrete estimates are flexible enough to approximate the shape of almost any distribution. Likelihood ratio statistical tests and a nonparametric bootstrap confidence-bound estimation procedure based on the discrete estimates are presented that can be used to test the fit of a parametric family. We applied the methodology to several different protein data sets, and found that although the gamma model often provides a good parametric model for this type of data, rate estimates from an equal-probability discrete gamma model with a small number of categories will tend to underestimate the largest rates. In cases when the gamma model assumption is in doubt, rate estimates coming from the discrete rate distribution estimate with a large number of rate categories provide a robust alternative to gamma estimates. An alternative implementation of the gamma distribution is proposed that, for equal numbers of rate categories, is computationally more efficient during optimization than the standard gamma implementation and can provide more accurate estimates of site rates.
Poe, Steven; Nieto-Montes de Oca, Adrián; Torres-Carvajal, Omar; De Queiroz, Kevin; Velasco, Julián A; Truett, Brad; Gray, Levi N; Ryan, Mason J; Köhler, Gunther; Ayala-Varela, Fernando; Latella, Ian
2017-09-01
Anolis lizards (anoles) are textbook study organisms in evolution and ecology. Although several topics in evolutionary biology have been elucidated by the study of anoles, progress in some areas has been hampered by limited phylogenetic information on this group. Here, we present a phylogenetic analysis of all 379 extant species of Anolis, with new phylogenetic data for 139 species including new DNA data for 101 species. We use the resulting estimates as a basis for defining anole clade names under the principles of phylogenetic nomenclature and to examine the biogeographic history of anoles. Our new taxonomic treatment achieves the supposed advantages of recent subdivisions of anoles that employed ranked Linnaean-based nomenclature while avoiding the pitfalls of those approaches regarding artificial constraints imposed by ranks. Our biogeographic analyses demonstrate complexity in the dispersal history of anoles, including multiple crossings of the Isthmus of Panama, two invasions of the Caribbean, single invasions to Jamaica and Cuba, and a single evolutionary dispersal from the Caribbean to the mainland that resulted in substantial anole diversity. Our comprehensive phylogenetic estimate of anoles should prove useful for rigorous testing of many comparative evolutionary hypotheses. [Anoles; biogeography; lizards; Neotropics; phylogeny; taxonomy]. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Fossils matter: improved estimates of divergence times in Pinus reveal older diversification.
Saladin, Bianca; Leslie, Andrew B; Wüest, Rafael O; Litsios, Glenn; Conti, Elena; Salamin, Nicolas; Zimmermann, Niklaus E
2017-04-04
The taxonomy of pines (genus Pinus) is widely accepted and a robust gene tree based on entire plastome sequences exists. However, there is a large discrepancy in estimated divergence times of major pine clades among existing studies, mainly due to differences in fossil placement and dating methods used. We currently lack a dated molecular phylogeny that makes use of the rich pine fossil record, and this study is the first to estimate the divergence dates of pines based on a large number of fossils (21) evenly distributed across all major clades, in combination with applying both node and tip dating methods. We present a range of molecular phylogenetic trees of Pinus generated within a Bayesian framework. We find the origin of crown Pinus is likely up to 30 Myr older (Early Cretaceous) than inferred in most previous studies (Late Cretaceous) and propose generally older divergence times for major clades within Pinus than previously thought. Our age estimates vary significantly between the different dating approaches, but the results generally agree on older divergence times. We present a revised list of 21 fossils that are suitable to use in dating or comparative analyses of pines. Reliable estimates of divergence times in pines are essential if we are to link diversification processes and functional adaptation of this genus to geological events or to changing climates. In addition to older divergence times in Pinus, our results also indicate that node age estimates in pines depend on dating approaches and the specific fossil sets used, reflecting inherent differences in various dating approaches. The sets of dated phylogenetic trees of pines presented here provide a way to account for uncertainties in age estimations when applying comparative phylogenetic methods.
da Cruz, Marcos de O R; Weksler, Marcelo
2018-02-01
The use of genetic data and tree-based algorithms to delimit evolutionary lineages is becoming an important practice in taxonomic identification, especially in morphologically cryptic groups. The effects of different phylogenetic and/or coalescent models in the analyses of species delimitation, however, are not clear. In this paper, we assess the impact of different evolutionary priors in phylogenetic estimation, species delimitation, and molecular dating of the genus Oligoryzomys (Mammalia: Rodentia), a group with complex taxonomy and morphologically cryptic species. Phylogenetic and coalescent analyses included 20 of the 24 recognized species of the genus, comprising 416 Cytochrome b sequences, 26 Cytochrome c oxidase I sequences, and 27 Beta-Fibrinogen Intron 7 sequences. For species delimitation, we employed the General Mixed Yule Coalescent (GMYC) and Bayesian Poisson tree processes (bPTP) analyses, and contrasted 4 genealogical and phylogenetic models: Pure-birth (Yule), Constant Population Size Coalescent, Multiple Species Coalescent, and a mixed Yule-Coalescent model. GMYC analyses of trees from different genealogical models resulted in similar species delimitation and phylogenetic relationships, with incongruence restricted to areas of poor nodal support. bPTP results, however, significantly differed from GMYC for 5 taxa. Oligoryzomys early diversification was estimated to have occurred in the Early Pleistocene, between 0.7 and 2.6 MYA. The mixed Yule-Coalescent model, however, recovered younger dating estimates for Oligoryzomys diversification, and for the threshold for the speciation-coalescent horizon in GMYC. Eight of the 20 included Oligoryzomys species were identified as having two or more independent evolutionary units, indicating that current taxonomy of Oligoryzomys is still unsettled. Copyright © 2017 Elsevier Inc. All rights reserved.
Lischer, Heidi E L; Excoffier, Laurent; Heckel, Gerald
2014-04-01
Phylogenetic reconstruction of the evolutionary history of closely related organisms may be difficult because of the presence of unsorted lineages and of a relatively high proportion of heterozygous sites that are usually not handled well by phylogenetic programs. Genomic data may provide enough fixed polymorphisms to resolve phylogenetic trees, but the diploid nature of sequence data remains analytically challenging. Here, we performed a phylogenomic reconstruction of the evolutionary history of the common vole (Microtus arvalis) with a focus on the influence of heterozygosity on the estimation of intraspecific divergence times. We used genome-wide sequence information from 15 voles distributed across the European range. We provide a novel approach to integrate heterozygous information in existing phylogenetic programs by repeated random haplotype sampling from sequences with multiple unphased heterozygous sites. We evaluated the impact of the use of full, partial, or no heterozygous information for tree reconstructions on divergence time estimates. All results consistently showed four deep and strongly supported evolutionary lineages in the vole data. These lineages undergoing divergence processes split only at the end or after the last glacial maximum based on calibration with radiocarbon-dated paleontological material. However, the incorporation of information from heterozygous sites had a significant impact on absolute and relative branch length estimations. Ignoring heterozygous information led to an overestimation of divergence times between the evolutionary lineages of M. arvalis. We conclude that the exclusion of heterozygous sites from evolutionary analyses may cause biased and misleading divergence time estimates in closely related taxa.
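The repeated random haplotype sampling described in the abstract above can be illustrated with a minimal sketch. It assumes, purely for illustration, that unphased heterozygous sites are encoded as two-base IUPAC ambiguity codes and that each site is resolved independently with equal probability; the function names and the example genotype are hypothetical, and the authors' actual procedure may differ.

```python
import random

# IUPAC codes for two-base ambiguities, read here as heterozygous diploid sites.
IUPAC_HET = {
    "R": "AG", "Y": "CT", "S": "CG",
    "W": "AT", "K": "GT", "M": "AC",
}

def sample_haplotype_pair(genotype, rng):
    """Resolve every heterozygous site at random into two complementary haplotypes."""
    hap1, hap2 = [], []
    for base in genotype.upper():
        if base in IUPAC_HET:
            first, second = IUPAC_HET[base]
            # Phase is unknown, so either allele may land on either haplotype.
            if rng.random() < 0.5:
                first, second = second, first
            hap1.append(first)
            hap2.append(second)
        else:
            # Homozygous (or otherwise unhandled) symbols are copied unchanged.
            hap1.append(base)
            hap2.append(base)
    return "".join(hap1), "".join(hap2)

def sample_replicates(genotype, n_replicates, seed=0):
    """Draw several independent haplotype pairs from one unphased genotype."""
    rng = random.Random(seed)
    return [sample_haplotype_pair(genotype, rng) for _ in range(n_replicates)]

# Hypothetical genotype with three heterozygous sites (R, Y, W).
replicates = sample_replicates("ACRTYGGW", n_replicates=5, seed=1)
```

Each replicate could then be passed to a phylogenetic program that expects haploid sequences, and the spread of results across replicates gives a rough sense of how much the unknown phase matters.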
Poon, Art F Y; Joy, Jeffrey B; Woods, Conan K; Shurgold, Susan; Colley, Guillaume; Brumme, Chanson J; Hogg, Robert S; Montaner, Julio S G; Harrigan, P Richard
2015-03-15
The diversification of human immunodeficiency virus (HIV) is shaped by its transmission history. We therefore used a population-based, province-wide HIV drug resistance database in British Columbia (BC), Canada, to evaluate the impact of clinical, demographic, and behavioral factors on rates of HIV transmission. We reconstructed molecular phylogenies from 27,296 anonymized bulk HIV pol sequences representing 7747 individuals in BC, about half the estimated HIV prevalence in BC. Infections were grouped into clusters based on phylogenetic distances, as a proxy for variation in transmission rates. Rates of cluster expansion were reconstructed from estimated dates of HIV seroconversion. Our criteria grouped 4431 individuals into 744 clusters largely separated with respect to risk factors, including large established clusters predominated by injection drug users and more-recently emerging clusters comprising men who have sex with men. The mean log10 viral load of an individual's phylogenetic neighborhood (composed of 5 other individuals with shortest phylogenetic distances) increased their odds of appearing in a cluster by >2-fold per log10 viruses per milliliter. Hotspots of ongoing HIV transmission can be characterized in near real time by the secondary analysis of HIV resistance genotypes, providing an important potential resource for targeting public health initiatives for HIV prevention. © The Author 2014. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Mendes, Joana; Harris, D James; Carranza, Salvador; Salvi, Daniele
2016-07-01
Estimating the phylogeny of lacertid lizards, and particularly the tribe Lacertini has been challenging, possibly due to the fast radiation of this group resulting in a hard polytomy. However this is still an open question, as concatenated data primarily from mitochondrial markers have been used so far whereas in a recent phylogeny based on a compilation of these data within a squamate supermatrix the basal polytomy seems to be resolved. In this study, we estimate phylogenetic relationships between all Lacertini genera using for the first time DNA sequences from five fast evolving nuclear genes (acm4, mc1r, pdc, βfib and reln) and two mitochondrial genes (nd4 and 12S). We generated a total of 529 sequences from 88 species and used Maximum Likelihood and Bayesian Inference methods based on concatenated multilocus dataset as well as a coalescent-based species tree approach with the aim of (i) shedding light on the basal relationships of Lacertini (ii) assessing the monophyly of genera which were previously questioned, and (iii) discussing differences between estimates from this and previous studies based on different markers, and phylogenetic methods. Results uncovered (i) a new phylogenetic clade formed by the monotypic genera Archaeolacerta, Zootoca, Teira and Scelarcis; and (ii) support for the monophyly of the Algyroides clade, with two sister species pairs represented by western (A. marchi and A. fitzingeri) and eastern (A. nigropunctatus and A. moreoticus) lineages. In both cases the members of these groups show peculiar morphology and very different geographical distributions, suggesting that they are relictual groups that were once diverse and widespread. They probably originated about 11-13 million years ago during early events of speciation in the tribe, and the split between their members is estimated to be only slightly older. 
This scenario may explain why mitochondrial markers (possibly saturated at higher divergence levels) or slower nuclear markers used in previous studies (likely lacking enough phylogenetic signal) failed to recover these relationships. Finally, the phylogenetic position of most remaining genera was unresolved, corroborating the hypothesis of a hard polytomy in the Lacertini phylogeny due to a fast radiation. This is in agreement with all previous studies but in sharp contrast with a recent squamate megaphylogeny. We show that the supermatrix approach may provide high support for incorrect nodes that are not supported either by original sequence data or by new data from this study. This finding suggests caution when using megaphylogenies to integrate inter-generic relationships in comparative ecological and evolutionary studies. Copyright © 2016 Elsevier Inc. All rights reserved.
A phylogeny of robber flies (Diptera: Asilidae) at the subfamilial level: molecular evidence.
Bybee, Seth M; Taylor, Sean D; Riley Nelson, C; Whiting, Michael F
2004-03-01
We present the first formal analysis of phylogenetic relationships among the Asilidae, based on four genes: 16S rDNA, 18S rDNA, 28S rDNA, and cytochrome oxidase II. Twenty-six ingroup taxa representing 11 of the 12 described subfamilies were selected to produce a phylogenetic estimate of asilid subfamilial relationships via optimization alignment, parsimony, and maximum likelihood techniques. Phylogenetic analyses support the monophyly of Asilidae with Leptogastrinae as the most basal robber fly lineage. Apocleinae+(Asilinae+Ommatiinae) is supported as monophyletic. The laphriinae-group (Laphriinae+Laphystiinae) and the dasypogoninae-group (Dasypogoninae+Stenopogoninae+Stichopogoninae+ Trigonomiminae) are paraphyletic. These results suggest that current subfamilial classification only partially reflects robber fly phylogeny, indicating the need for further phylogenetic investigation of this group.
Bayesian phylogenetic estimation of fossil ages.
Drummond, Alexei J; Stadler, Tanja
2016-07-19
Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. 
We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue 'Dating species divergences using rocks and clocks'. © 2016 The Authors. PMID:27325827
Quantifying Transmission Heterogeneity Using Both Pathogen Phylogenies and Incidence Time Series
Li, Lucy M.; Grassly, Nicholas C.; Fraser, Christophe
2017-01-01
Heterogeneity in individual-level transmissibility can be quantified by the dispersion parameter k of the offspring distribution. Quantifying heterogeneity is important as it affects other parameter estimates, it modulates the degree of unpredictability of an epidemic, and it needs to be accounted for in models of infection control. Aggregated data such as incidence time series are often not sufficiently informative to estimate k. Incorporating phylogenetic analysis can help to estimate k concurrently with other epidemiological parameters. We have developed an inference framework that uses particle Markov Chain Monte Carlo to estimate k and other epidemiological parameters using both incidence time series and the pathogen phylogeny. Using the framework to fit a modified compartmental transmission model that includes the parameter k to simulated data, we found that more accurate and less biased estimates of the reproductive number were obtained by combining epidemiological and phylogenetic analyses. However, k was most accurately estimated using pathogen phylogeny alone. Accurately estimating k was necessary for unbiased estimates of the reproductive number, but it did not affect the accuracy of reporting probability and epidemic start date estimates. We further demonstrated that inference was possible in the presence of phylogenetic uncertainty by sampling from the posterior distribution of phylogenies. Finally, we used the inference framework to estimate transmission parameters from epidemiological and genetic data collected during a poliovirus outbreak. Despite the large degree of phylogenetic uncertainty, we demonstrated that incorporating phylogenetic data in parameter inference improved the accuracy and precision of estimates. PMID:28981709
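To make the role of the dispersion parameter k concrete, the sketch below simulates a negative binomial offspring distribution as a gamma-Poisson mixture and recovers k with a simple method-of-moments estimate. This is not the particle MCMC framework of the abstract; the values R = 2.0 and k = 0.5, the function names, and the moment estimator are all assumptions chosen for illustration.

```python
import math
import random

def poisson(lam, rng):
    """Draw a Poisson variate by Knuth's multiplication method (adequate for modest lam)."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count

def offspring(R, k, rng):
    """Secondary cases caused by one infected individual.

    The individual reproductive number is gamma distributed with mean R and
    shape k, so the resulting counts are negative binomial with dispersion k.
    """
    individual_r = rng.gammavariate(k, R / k)  # shape k, scale R / k
    return poisson(individual_r, rng)

def moment_estimate_k(counts):
    """Method-of-moments estimate of k, using variance = mean + mean**2 / k."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / (n - 1)
    if var <= mean:
        return math.inf  # no overdispersion detected
    return mean ** 2 / (var - mean)

rng = random.Random(42)
counts = [offspring(2.0, 0.5, rng) for _ in range(20000)]
mean_offspring = sum(counts) / len(counts)
k_hat = moment_estimate_k(counts)
```

Small k corresponds to strong heterogeneity: most individuals infect nobody while a few infect many, which is why the variance of the counts exceeds their mean.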
Castel, Guillaume; Tordo, Noël; Plyusnin, Alexander
2017-04-02
Because of the great variability of their reservoir hosts, hantaviruses are excellent models to evaluate the dynamics of virus-host co-evolution. Intriguing questions remain about the timescale of the diversification events that influenced this evolution. In this paper we attempted to provide the first estimate of the timing of hantavirus diversification, based on thirty-five available complete genomes representing five major groups of hantaviruses and the assumption of co-speciation of hantaviruses with their respective mammal hosts. Phylogenetic analyses were used to estimate the main diversification points during hantavirus evolution in mammals while host diversification was mostly estimated from independent calibrators taken from fossil records. Our results support an earlier developed hypothesis of co-speciation of known hantaviruses with their respective mammal hosts and hence a common ancestor for all hantaviruses carried by placental mammals. Copyright © 2017 Elsevier B.V. All rights reserved.
Functional & phylogenetic diversity of copepod communities
NASA Astrophysics Data System (ADS)
Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.
2016-02-01
The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.
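The split of beta diversity into a replacement component and a richness difference component mentioned in the abstract above can be sketched for a single pair of sites. The sketch assumes the additive partition of Jaccard dissimilarity on presence/absence data (replacement plus richness difference); the abstract does not say which index family was used, so this choice, the function name, and the placeholder species labels are all assumptions.

```python
def partition_beta(site1, site2):
    """Partition Jaccard dissimilarity between two species lists.

    a = shared species, b and c = species unique to each site.
    dissimilarity = (b + c) / (a + b + c)
                  = replacement + richness_difference
    """
    s1, s2 = set(site1), set(site2)
    a = len(s1 & s2)
    b = len(s1 - s2)
    c = len(s2 - s1)
    total = a + b + c
    if total == 0:
        raise ValueError("both sites are empty")
    return {
        "jaccard_dissimilarity": (b + c) / total,
        # Species substituted one-for-one between the sites (turnover).
        "replacement": 2 * min(b, c) / total,
        # Surplus of unique species at the richer site.
        "richness_difference": abs(b - c) / total,
    }

# Placeholder species labels, not taken from the study.
site_a = ["sp1", "sp2", "sp3", "sp4"]
site_b = ["sp1", "sp2", "sp5"]
result = partition_beta(site_a, site_b)
```

In this example two species are shared, two are unique to the first site and one to the second, so one species is "replaced" and the first site holds one extra species.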
Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty
Baele, Guy; Lemey, Philippe; Suchard, Marc A.
2016-01-01
Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse, but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of “working distributions” to facilitate—or shorten—the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this purpose, we introduce a “working” distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different “working” distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses. PMID:26526428
Unrealistic phylogenetic trees may improve phylogenetic footprinting.
Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo
2017-06-01
The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. Contact: martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A
2018-05-01
Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.
Pandey, Ravi S; Saxena, Garima; Bhattacharya, Debashish; Qiu, Huan; Azad, Rajeev K
2017-02-01
Identification of horizontal gene transfers (HGTs) has primarily relied on phylogenetic tree based methods, which require a rich sampling of sequenced genomes to ensure a reliable inference. Because the success of phylogenetic approaches depends on the breadth and depth of the database, researchers usually apply stringent filters to detect only the most likely gene transfers in the genomes of interest. One such study focused on a highly conservative estimate of trans-domain gene transfers in the extremophile eukaryote, Galdieria sulphuraria (Galdieri) Merola (Rhodophyta), by applying multiple filters in their phylogenetic pipeline. This led to the identification of 75 inter-domain acquisitions from Bacteria or Archaea. Because of the evolutionary, ecological, and potential biotechnological significance of foreign genes in algae, alternative approaches and pipelines complementing phylogenetics are needed for a more comprehensive assessment of HGT. We present here a novel pipeline that uncovered 17 novel foreign genes of prokaryotic origin in G. sulphuraria, results that are supported by multiple lines of evidence including composition-based, comparative data, and phylogenetics. These genes encode a variety of potentially adaptive functions, from metabolite transport to DNA repair. © 2016 Phycological Society of America.
Faith, Daniel P.
2015-01-01
The phylogenetic diversity measure (‘PD’) measures the relative feature diversity of different subsets of taxa from a phylogeny. At the level of feature diversity, PD supports the broad goal of biodiversity conservation to maintain living variation and option values. PD calculations at the level of lineages and features include those integrating probabilities of extinction, providing estimates of expected PD. This approach has known advantages over the evolutionarily distinct and globally endangered (EDGE) methods. Expected PD methods also have limitations. An alternative notion of expected diversity, expected functional trait diversity, relies on an alternative non-phylogenetic model and allows inferences of diversity at the level of functional traits. Expected PD also faces challenges in helping to address phylogenetic tipping points and worst-case PD losses. Expected PD may not choose conservation options that best avoid worst-case losses of long branches from the tree of life. We can expand the range of useful calculations based on expected PD, including methods for identifying phylogenetic key biodiversity areas. PMID:25561672
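PD and expected PD, as described in the abstract above, reduce to simple sums over branches, which a small sketch can show. It assumes a rooted tree stored as a child-to-parent map, counts every branch on the path from a tip to the root, and treats tip extinctions as independent; the toy tree, its branch lengths, and the extinction probabilities are hypothetical.

```python
from math import prod

# Hypothetical rooted tree: node -> (parent, length of the branch above the node).
TREE = {
    "A": ("n1", 1.0),
    "B": ("n1", 1.0),
    "n1": ("root", 2.0),
    "C": ("root", 3.0),
}

def branches_to_root(tip, tree):
    """Return the nodes whose subtending branches lie between a tip and the root."""
    path, node = [], tip
    while node in tree:
        path.append(node)
        node = tree[node][0]
    return path

def phylogenetic_diversity(tips, tree):
    """PD of a set of tips: total length of the branches connecting them to the root."""
    spanned = set()
    for tip in tips:
        spanned.update(branches_to_root(tip, tree))
    return sum(tree[node][1] for node in spanned)

def expected_pd(extinction_prob, tree):
    """Expected PD: each branch is weighted by the chance that at least one
    tip below it survives, assuming independent extinctions."""
    below = {}
    for tip, p in extinction_prob.items():
        for node in branches_to_root(tip, tree):
            below.setdefault(node, []).append(p)
    return sum(tree[node][1] * (1.0 - prod(ps)) for node, ps in below.items())

pd_ab = phylogenetic_diversity(["A", "B"], TREE)
pd_all = phylogenetic_diversity(["A", "B", "C"], TREE)
epd = expected_pd({"A": 0.5, "B": 0.5, "C": 0.1}, TREE)
```

The internal branch above A and B is lost only if both tips go extinct, which is why shared deep branches contribute more to expected PD than the individual extinction probabilities alone would suggest.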
NASA Astrophysics Data System (ADS)
Eder, Wolfgang; Ives Torres-Silva, Ana; Hohenegger, Johann
2017-04-01
Phylogenetic analysis and trees based on molecular data are broadly applied and used to infer genetic and biogeographic relationships in recent larger foraminifera. Molecular phylogenetics is intensively used within recent nummulitids; however, for fossil representatives these trees are only of minor informational value. Hence, within paleontological studies a phylogenetic approach through morphometric analysis is of much higher value. To tackle phylogenetic relationships within the nummulitid family, a much higher number of morphological characters must be measured than are commonly used in biometric studies, where mostly parameters describing embryonic size (e.g., proloculus diameter, deuteroloculus diameter) and/or the marginal spiral (e.g., spiral diagrams, spiral indices) are studied. For this purpose, 11 growth-independent and/or growth-invariant characters have been used to describe the morphological variability of equatorial thin sections of seven Caribbean nummulitid taxa (Nummulites striatoreticulatus, N. macgillavry, Palaeonummulites willcoxi, P. floridensis, P. soldadensis, P. trinitatensis and P. ocalanus) and one outgroup taxon (Ranikothalia bermudezi). Using these characters, phylogenetic trees were calculated using a restricted maximum likelihood algorithm (REML), and results were cross-checked by ordination and cluster analysis. The squared-change parsimony method was run to reconstruct ancestral states, as well as to simulate the evolution of the chosen characters along the calculated phylogenetic tree, and independent-contrast analysis was used to estimate confidence intervals. Based on these simulations, phylogenetic tendencies of certain characters proposed for nummulitids (e.g., Cope's rule or nepionic acceleration) can be tested to determine whether these tendencies are valid for the whole family or only for certain clades. 
At least within the Caribbean nummulitids, phylogenetic trends along some growth-independent characters of the embryo (e.g., first chamber length and P/D ratio) and some growth-invariant characters of the chamber sequence (e.g., backbend angle, initial chamber base length and chamber length increase) are evident.
Kutschera, Verena E.; Bidon, Tobias; Hailer, Frank; Rodi, Julia L.; Fain, Steven R.; Janke, Axel
2014-01-01
Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. PMID:24903145
2017-01-01
North America’s Great Basin has long been of interest to biologists due to its high level of organismal endemicity throughout its endorheic watersheds. One example of such a group is the subfamily Empetrichthyinae. In this paper, we analyzed the relationships of the Empetrichthyinae and assessed the validity of the subspecies designations given by Williams and Wilde within the group using concatenated phylogenetic tree estimation and species tree estimation. Samples from 19 populations were included covering the entire distribution of the three extant species of Empetrichthyinae: Crenichthys nevadae, Crenichthys baileyi and Empetrichthys latos. Three nuclear introns (S8 intron 4, S7 intron 1, and P0 intron 1) and one mitochondrial gene (Cytb) were sequenced for phylogenetic analysis. Using these sequences, we generated two separate hypotheses of the evolutionary relationships of Empetrichthyinae using Bayesian phylogenetics: one based on the mitochondrial data and one based on the nuclear data. Haplotype networks were also generated to look at the relationships of the populations within Empetrichthyinae. After comparing the two phylogenetic hypotheses, species trees were generated using *BEAST with the nuclear data to further test the validity of the subspecies within Empetrichthyinae. The mitochondrial analyses supported four lineages within C. baileyi and two within C. nevadae. The concatenated nuclear tree was more conserved, supporting one clade and an unresolved polytomy in both species. The species tree analysis supported the presence of two species within both C. baileyi and C. nevadae. Based on the results of these analyses, the subspecies designations of Williams and Wilde are not valid; rather, a conservative approach suggests there are two species within C. nevadae and two species within C. baileyi. No structure was found for E. latos or the populations of Empetrichthyinae. 
This study represents one of many demonstrating the invalidity of subspecies and their detriment to species identification, conservation, and understanding. PMID:29077708
Fast algorithms for computing phylogenetic divergence time.
Crosby, Ralph W; Williams, Tiffani L
2017-12-06
The inference of species divergence time is a key step in most phylogenetic studies. Methods have been available for the last ten years to perform the inference, but the performance of the methods does not yet scale well to studies with hundreds of taxa and thousands of DNA base pairs. For example, a study of 349 primate taxa was estimated to require over 9 months of processing time. In this work, we present a new algorithm, AncestralAge, that significantly improves the performance of the divergence time process. As part of AncestralAge, we demonstrate a new method for the computation of phylogenetic likelihood, and our experiments show a 90% improvement in likelihood computation time on the aforementioned dataset of 349 primate taxa with over 60,000 DNA base pairs. Additionally, we show that our new method for the computation of the Bayesian prior on node ages reduces the running time for this computation on the 349-taxon dataset by 99%. Through the use of these new algorithms we open up the ability to perform divergence time inference on large phylogenetic studies.
Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W
2018-05-25
Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.
Si, Xingfeng; Cadotte, Marc W; Zhao, Yuhao; Zhou, Haonan; Zeng, Di; Li, Jiaqi; Jin, Tinghao; Ren, Peng; Wang, Yanping; Ding, Ping; Tingley, Morgan W
2018-06-26
Incorporating imperfect detection when estimating species richness has become commonplace in the past decade. However, the question of how imperfect detection of species affects estimates of functional and phylogenetic community structure remains untested. We used long-term counts of breeding bird species that were detected at least once on islands in a land-bridge island system, and employed multi-species occupancy models to assess the effects of imperfect detection of species on estimates of bird diversity and community structure by incorporating species traits and phylogenies. Our results showed that taxonomic, functional, and phylogenetic diversity were all underestimated significantly as a result of species' imperfect detection, with taxonomic diversity showing the greatest bias. The functional and phylogenetic structure calculated from observed communities were both more clustered than those from the detection-corrected communities due to missed distinct species. The discrepancy between observed and estimated diversity differed according to the measure of biodiversity employed. Our study demonstrates the importance of accounting for species' imperfect detection in biodiversity studies, especially for functional and phylogenetic community ecology, and when attempting to infer community assembly processes. With datasets that allow for detection-corrected community structure, we can better estimate diversity and infer the underlying mechanisms that structure community assembly, and thus make reliable management decisions for the conservation of biodiversity. This article is protected by copyright. All rights reserved.
Inferring species trees from incongruent multi-copy gene trees using the Robinson-Foulds distance
2013-01-01
Background: Constructing species trees from multi-copy gene trees remains a challenging problem in phylogenetics. One difficulty is that the underlying genes can be incongruent due to evolutionary processes such as gene duplication and loss, deep coalescence, or lateral gene transfer. Gene tree estimation errors may further exacerbate the difficulties of species tree estimation. Results: We present a new approach for inferring species trees from incongruent multi-copy gene trees that is based on a generalization of the Robinson-Foulds (RF) distance measure to multi-labeled trees (mul-trees). We prove that it is NP-hard to compute the RF distance between two mul-trees; however, it is easy to calculate this distance between a mul-tree and a singly-labeled species tree. Motivated by this, we formulate the RF problem for mul-trees (MulRF) as follows: Given a collection of multi-copy gene trees, find a singly-labeled species tree that minimizes the total RF distance from the input mul-trees. We develop and implement a fast SPR-based heuristic algorithm for the NP-hard MulRF problem. We compare the performance of the MulRF method (available at http://genome.cs.iastate.edu/CBL/MulRF/) with several gene tree parsimony approaches using gene tree simulations that incorporate gene tree error, gene duplications and losses, and/or lateral transfer. The MulRF method produces more accurate species trees than gene tree parsimony approaches. We also demonstrate that the MulRF method infers in minutes a credible plant species tree from a collection of nearly 2,000 gene trees. Conclusions: Our new phylogenetic inference method, based on a generalized RF distance, makes it possible to quickly estimate species trees from large genomic data sets. 
Since the MulRF method, unlike gene tree parsimony, is based on a generic tree distance measure, it is appealing for analyses of genomic data sets, in which many processes such as deep coalescence, recombination, gene duplication and losses as well as phylogenetic error may contribute to gene tree discord. In experiments, the MulRF method estimated species trees accurately and quickly, demonstrating MulRF as an efficient alternative approach for phylogenetic inference from large-scale genomic data sets. PMID:24180377
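For orientation, the ordinary Robinson-Foulds distance that MulRF generalizes can be sketched as follows. This is only the standard singly-labeled, rooted (clade-based) variant, assuming both trees carry the same leaf labels and are written as nested tuples of strings; it does not implement the mul-tree generalization or the SPR heuristic described in the abstract, and the function names are hypothetical.

```python
def clades(tree):
    """Collect the leaf set below every internal node of a nested-tuple tree."""
    found = set()

    def walk(node):
        if isinstance(node, str):          # a leaf is a plain label
            return frozenset([node])
        leaves = frozenset().union(*(walk(child) for child in node))
        found.add(leaves)
        return leaves

    walk(tree)
    return found


def rf_distance(tree_a, tree_b):
    """Rooted RF distance: number of clades present in exactly one of the two trees."""
    return len(clades(tree_a) ^ clades(tree_b))


balanced = (("A", "B"), ("C", "D"))
ladder = ((("A", "B"), "C"), "D")
print(rf_distance(balanced, ladder))   # 2: clade {C, D} versus clade {A, B, C}
```

A species-tree search in the spirit of the abstract would sum such distances over all input gene trees and look for the candidate tree with the smallest total.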
Knowles, Lacey L; Klimov, Pavel B
2011-11-01
With the increased availability of multilocus sequence data, the lack of concordance of gene trees estimated for independent loci has focused attention on both the biological processes producing the discord and the methodologies used to estimate phylogenetic relationships. What has emerged is a suite of new analytical tools for phylogenetic inference: species tree approaches. In contrast to traditional phylogenetic methods that are stymied by the idiosyncrasies of gene trees, approaches for estimating species trees explicitly take into account the cause of discord among loci and, in the process, provide a direct estimate of phylogenetic history (i.e. the history of species divergence, not divergence of specific loci). We illustrate the utility of species tree estimates with an analysis of a diverse group of feather mites, the pinnatus species group (genus Proctophyllodes). Discord among four sequenced nuclear loci is consistent with theoretical expectations, given the short time separating speciation events (as evidenced by short internodes relative to terminal branch lengths in the trees). Nevertheless, many of the relationships are well resolved in a Bayesian estimate of the species tree; the analysis also highlights ambiguous aspects of the phylogeny that require additional loci. The broad utility of species tree approaches is discussed, and specifically, their application to groups with high speciation rates, a history of diversification with particular prevalence in host/parasite systems where species interactions can drive rapid diversification.
A phylogeny and revised classification of Squamata, including 4161 species of lizards and snakes
2013-01-01
Background: The extant squamates (>9400 known species of lizards and snakes) are one of the most diverse and conspicuous radiations of terrestrial vertebrates, but no studies have attempted to reconstruct a phylogeny for the group with large-scale taxon sampling. Such an estimate is invaluable for comparative evolutionary studies, and to address their classification. Here, we present the first large-scale phylogenetic estimate for Squamata. Results: The estimated phylogeny contains 4161 species, representing all currently recognized families and subfamilies. The analysis is based on up to 12896 base pairs of sequence data per species (average = 2497 bp) from 12 genes, including seven nuclear loci (BDNF, c-mos, NT3, PDC, R35, RAG-1, and RAG-2), and five mitochondrial genes (12S, 16S, cytochrome b, ND2, and ND4). The tree provides important confirmation for recent estimates of higher-level squamate phylogeny based on molecular data (but with more limited taxon sampling), estimates that are very different from previous morphology-based hypotheses. The tree also includes many relationships that differ from previous molecular estimates and many that differ from traditional taxonomy. Conclusions: We present a new large-scale phylogeny of squamate reptiles that should be a valuable resource for future comparative studies. We also present a revised classification of squamates at the family and subfamily level to bring the taxonomy more in line with the new phylogenetic hypothesis. This classification includes new, resurrected, and modified subfamilies within gymnophthalmid and scincid lizards, and boid, colubrid, and lamprophiid snakes. PMID:23627680
Roger, Andrew J; Hug, Laura A
2006-01-01
Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. 
Furthermore, estimated dates of divergence vary hugely depending on the methods used and their assumptions. Accurate dating of divergence times among the major eukaryote lineages will require a robust tree of eukaryotes, a much richer Proterozoic fossil record of microbial eukaryotes assignable to extant groups for calibration, more sophisticated relaxed molecular clock methods and many more genes sampled from the full diversity of microbial eukaryotes. PMID:16754613
Clavel, Julien; Aristide, Leandro; Morlon, Hélène
2018-06-19
Working with high-dimensional phylogenetic comparative datasets is challenging because likelihood-based multivariate methods suffer from low statistical performance as the number of traits p approaches the number of species n and because some computational complications occur when p exceeds n. Alternative phylogenetic comparative methods have recently been proposed to deal with the large p small n scenario but their use and performance are limited. Here we develop a penalized likelihood framework to deal with high-dimensional comparative datasets. We propose various penalizations and methods for selecting the intensity of the penalties. We apply this general framework to the estimation of parameters (the evolutionary trait covariance matrix and parameters of the evolutionary model) and model comparison for the high-dimensional multivariate Brownian (BM), Early-burst (EB), Ornstein-Uhlenbeck (OU) and Pagel's lambda models. We show using simulations that our penalized likelihood approach dramatically improves the estimation of evolutionary trait covariance matrices and model parameters when p approaches n, and allows for their accurate estimation when p equals or exceeds n. In addition, we show that penalized likelihood models can be efficiently compared using the Generalized Information Criterion (GIC). We implement these methods, as well as the related estimation of ancestral states and the computation of phylogenetic PCA, in the R packages RPANDA and mvMORPH. Finally, we illustrate the utility of the new proposed framework by evaluating evolutionary model fit, analyzing integration patterns, and reconstructing evolutionary trajectories for a high-dimensional 3-D dataset of brain shape in the New World monkeys. We find clear support for an Early-burst model, suggesting an early diversification of brain morphology during the ecological radiation of the clade. Penalized likelihood offers an efficient way to deal with high-dimensional multivariate comparative data.
Phylogenetic relationships among anuran trypanosomes as revealed by riboprinting.
Clark, C G; Martin, D S; Diamond, L S
1995-01-01
Twenty trypanosome isolates from Anura (frogs and toads) assigned to several species were characterized by riboprinting, i.e., restriction enzyme digestion of polymerase chain reaction-amplified small subunit ribosomal RNA genes. Restriction site polymorphisms allowed distinction of all the recognized species and no intraspecific variation in riboprint patterns was detected. Phylogenetic reconstruction using parsimony and distance estimates based on restriction fragment comigration showed Trypanosoma chattoni to be only distantly related to the other species, while T. ranarum and T. fallisi appear to be sister taxa despite showing non-overlapping host specificities.
Cross-validation to select Bayesian hierarchical models in phylogenetics.
Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C
2016-05-26
Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.
Nauheimer, Lars; Schley, Rowan J; Clements, Mark A; Micheneau, Claire; Nargar, Katharina
2018-06-02
Australia harbours a rich and highly endemic orchid flora, with c. 90 % of species endemic to the country. Despite that, the biogeographic history of Australasian orchid lineages is only poorly understood. Here we examined evolutionary relationships and the spatio-temporal evolution of the sun orchids (Thelymitra, 119 species), which display disjunct distribution patterns frequently found in Australasian orchid lineages. Phylogenetic analyses were conducted based on one nuclear (ITS) and three plastid markers (matK, psbJ-petA, ycf1) using Maximum Likelihood and Bayesian inference. Divergence time estimations were carried out with a relaxed molecular clock in a Bayesian framework. Ancestral ranges were estimated using the dispersal-extinction-cladogenesis model and an area coding based on major disjunctions. The phylogenetic analyses clarified intergeneric relationships within Thelymitrinae, with Epiblema being sister to Thelymitra plus Calochilus, both of which were well-supported. Within Thelymitra, eight major and several minor clades were retrieved in the nuclear and plastid phylogenetic reconstructions. Five major clades corresponded to species complexes previously recognized based on morphological characters, whereas other previously recognized species groups were found to be paraphyletic. Conflicting signals between the nuclear and plastid phylogenetic reconstructions provided support for hybridization and plastid capture events both in the deeper evolutionary history of the genus and more recently. Divergence time estimation placed the origin of Thelymitra in the late Miocene (c. 10.8 Ma) and the origin of the majority of the main clades within Thelymitra during the late Pliocene and early Pleistocene, with the majority of extant species arising during the Pleistocene. 
Ancestral range reconstruction revealed that the early diversification of the genus in the late Miocene and Pliocene took place predominantly in southwest Australia, where most species with highly restricted distributional ranges occur. Several long-distance dispersal events eastwards across the Nullarbor Plain were inferred, recurrently resulting in lineage divergence within the genus. The predominant eastwards direction of long-distance dispersal events in Thelymitra highlights the importance of the West Wind Drift for the present-day distribution of the genus, giving rise to the Thelymitra floras of Tasmania, New Zealand and New Caledonia, which were inferred to be of comparatively recent origin. Copyright © 2018. Published by Elsevier Inc.
Anchoring quartet-based phylogenetic distances and applications to species tree reconstruction.
Sayyari, Erfan; Mirarab, Siavash
2016-11-11
Inferring species trees from gene trees using the coalescent-based summary methods has been the subject of much attention, yet new scalable and accurate methods are needed. We introduce DISTIQUE, a new statistically consistent summary method for inferring species trees from gene trees under the coalescent model. We generalize our results to arbitrary phylogenetic inference problems; we show that two arbitrarily chosen leaves, called anchors, can be used to estimate relative distances between all other pairs of leaves by inferring relevant quartet trees. This results in a family of distance-based tree inference methods, with running times ranging from quadratic to quartic in the number of leaves. We show in simulated studies that DISTIQUE has comparable accuracy to leading coalescent-based summary methods and reduced running times.
Roux, C Z
2009-05-01
Short phylogenetic distances between taxa occur, for example, in studies on ribosomal RNA-genes with slow substitution rates. For consistently short distances, it is proved that in the completely singular limit of the covariance matrix ordinary least squares (OLS) estimates are minimum variance or best linear unbiased (BLU) estimates of phylogenetic tree branch lengths. Although OLS estimates are in this situation equal to generalized least squares (GLS) estimates, the GLS chi-square likelihood ratio test will be inapplicable as it is associated with zero degrees of freedom. Consequently, an OLS normal distribution test or an analogous bootstrap approach will provide optimal branch length tests of significance for consistently short phylogenetic distances. As the asymptotic covariances between branch lengths will be equal to zero, it follows that the product rule can be used in tree evaluation to calculate an approximate simultaneous confidence probability that all interior branches are positive.
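The OLS estimate of branch lengths mentioned in this abstract can be sketched for a single quartet. The design matrix, the example distances, and the function names below are illustrative assumptions (an unrooted tree ((A,B),(C,D)) with four terminal branches and one interior branch); the sketch solves the normal equations directly and does not reproduce the paper's proofs or significance tests.

```python
def solve(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [row[:] + [rhs[k]] for k, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def ols_branch_lengths(design, distances):
    """OLS estimate b = (X'X)^-1 X'd for pairwise distances d = X b + error."""
    cols = range(len(design[0]))
    xtx = [[sum(row[i] * row[j] for row in design) for j in cols] for i in cols]
    xtd = [sum(row[i] * d for row, d in zip(design, distances)) for i in cols]
    return solve(xtx, xtd)


# Columns: terminal branches a, b, c, d and the interior branch i.
# Rows: which branches lie on the path for pairs AB, AC, AD, BC, BD, CD.
DESIGN = [
    [1, 1, 0, 0, 0],
    [1, 0, 1, 0, 1],
    [1, 0, 0, 1, 1],
    [0, 1, 1, 0, 1],
    [0, 1, 0, 1, 1],
    [0, 0, 1, 1, 0],
]
# Hypothetical short distances, additive on a=0.01, b=0.02, c=0.03, d=0.04, i=0.005.
DISTANCES = [0.03, 0.045, 0.055, 0.055, 0.065, 0.07]

print(ols_branch_lengths(DESIGN, DISTANCES))
```

Because the example distances are exactly additive on the tree, the estimates recover the generating branch lengths up to floating-point error; with noisy distances the same call returns the least-squares fit.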
Attigala, Lakshmi; Wysocki, William P; Duvall, Melvin R; Clark, Lynn G
2016-08-01
We explored phylogenetic relationships among the twelve lineages of the temperate woody bamboo clade (tribe Arundinarieae) based on plastid genome (plastome) sequence data. A representative sample of 28 taxa was used and maximum parsimony, maximum likelihood and Bayesian inference analyses were conducted to estimate the Arundinarieae phylogeny. All the previously recognized clades of Arundinarieae were supported, with Ampelocalamus calcareus (Clade XI) as sister to the rest of the temperate woody bamboos. Well supported sister relationships between Bergbambos tessellata (Clade I) and Thamnocalamus spathiflorus (Clade VII) and between Kuruna (Clade XII) and Chimonocalamus (Clade III) were revealed by the current study. The plastome topology was tested by taxon removal experiments and alternative hypothesis testing and the results supported the current plastome phylogeny as robust. Neighbor-net analyses showed few phylogenetic signal conflicts, but suggested some potentially complex relationships among these taxa. Analyses of morphological character evolution of rhizomes and reproductive structures revealed that pachymorph rhizomes were most likely the ancestral state in Arundinarieae. In contrast, leptomorph rhizomes either evolved once with reversions to the pachymorph condition or multiple times in Arundinarieae. Further, pseudospikelets evolved independently at least twice in the Arundinarieae, but the ancestral state is ambiguous. Copyright © 2016 Elsevier Inc. All rights reserved.
Genetic Diversity and Population Structure of Cowpea (Vigna unguiculata L. Walp)
Xiong, Haizheng; Shi, Ainong; Mou, Beiquan; Qin, Jun; Motes, Dennis; Lu, Weiguo; Ma, Jianbing; Weng, Yuejin; Yang, Wei; Wu, Dianxing
2016-01-01
The genetic diversity of cowpea was analyzed, and the population structure was estimated in a diverse set of 768 cultivated cowpea genotypes from the USDA GRIN cowpea collection, originally collected from 56 countries. Genotyping by sequencing was used to discover single nucleotide polymorphisms (SNPs) in cowpea, and the identified SNP alleles were used to estimate the level of genetic diversity, population structure, and phylogenetic relationships. The aim of this study was to detect the gene pool structure of cowpea and to determine its relationship between different regions and countries. Based on the model-based ancestry analysis, the phylogenetic tree, and the principal component analysis, three well-differentiated genetic populations were postulated from 768 worldwide cowpea genotypes. According to the phylogenetic analyses between each individual, region, and country, we may trace accessions from outside the areas of origin back to the two candidate areas of origin (West and East Africa) to predict the migration and domestication history during cowpea dispersal and development. To our knowledge, this is the first report of the analysis of the genetic variation and relationship between globally cultivated cowpea genotypes. The results will help curators, researchers, and breeders to understand, utilize, conserve, and manage the collection for more efficient contribution to international cowpea research. PMID:27509049
Kutschera, Verena E; Bidon, Tobias; Hailer, Frank; Rodi, Julia L; Fain, Steven R; Janke, Axel
2014-08-01
Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Faith, Daniel P
2015-02-19
The phylogenetic diversity measure ('PD') quantifies the relative feature diversity of different subsets of taxa from a phylogeny. At the level of feature diversity, PD supports the broad goal of biodiversity conservation to maintain living variation and option values. PD calculations at the level of lineages and features include those integrating probabilities of extinction, providing estimates of expected PD. This approach has known advantages over the evolutionarily distinct and globally endangered (EDGE) methods. Expected PD methods also have limitations. An alternative notion of expected diversity, expected functional trait diversity, relies on an alternative non-phylogenetic model and allows inferences of diversity at the level of functional traits. Expected PD also faces challenges in helping to address phylogenetic tipping points and worst-case PD losses. Expected PD may not choose conservation options that best avoid worst-case losses of long branches from the tree of life. We can expand the range of useful calculations based on expected PD, including methods for identifying phylogenetic key biodiversity areas. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
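As a rough illustration of how extinction probabilities can be folded into a PD calculation, the sketch below uses the commonly cited form of expected PD: each branch contributes its length times the probability that at least one descendant tip survives. The toy tree, the probabilities, and the names `BRANCHES`, `P_EXTINCT` and `expected_pd` are hypothetical, and independent extinctions are assumed; this is not taken from the abstract itself.

```python
from math import prod

# Hypothetical toy tree ((A:1,B:1):2,C:3), stored as
# (branch length, tips descending from that branch).
BRANCHES = [
    (1.0, ("A",)),
    (1.0, ("B",)),
    (2.0, ("A", "B")),
    (3.0, ("C",)),
]

# Assumed extinction probabilities, treated as independent across tips.
P_EXTINCT = {"A": 0.5, "B": 0.5, "C": 0.1}


def expected_pd(branches, p_extinct):
    """Sum branch lengths weighted by the probability that each branch persists."""
    total = 0.0
    for length, tips in branches:
        # A branch is lost only if every tip below it goes extinct.
        p_branch_lost = prod(p_extinct[t] for t in tips)
        total += length * (1.0 - p_branch_lost)
    return total


if __name__ == "__main__":
    print(expected_pd(BRANCHES, P_EXTINCT))
```

Setting every probability to zero recovers the ordinary PD of the full tree (the sum of all branch lengths), which is a quick sanity check on the weighting.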
Pagès, Marie; Chevret, Pascale; Gros-Balthazard, Muriel; Hughes, Sandrine; Alcover, Josep Antoni; Hutterer, Rainer; Rando, Juan Carlos; Michaux, Jacques; Hänni, Catherine
2012-01-01
The lava mouse, Malpaisomys insularis, was endemic to the Eastern Canary islands and became extinct at the beginning of the 14th century when the Europeans reached the archipelago. Studies to determine Malpaisomys' phylogenetic affinities, based on morphological characters, remained inconclusive because morphological changes experienced by this insular rodent make phylogenetic investigations a real challenge. Over 20 years since its first description, Malpaisomys' phylogenetic position remains enigmatic. In this study, we resolved this issue using molecular characters. Mitochondrial and nuclear markers were successfully amplified from subfossils of three lava mouse samples. Molecular phylogenetic reconstructions revealed, without any ambiguity, unsuspected relationships between Malpaisomys and extant mice (genus Mus, Murinae). Moreover, through molecular dating we estimated the origin of the Malpaisomys/mouse clade at 6.9 Ma, corresponding to the maximal age at which the archipelago was colonised by the Malpaisomys ancestor via natural rafting. This study reconsiders the derived morphological characters of Malpaisomys in light of this unexpected molecular finding. To reconcile molecular and morphological data, we propose to consider Malpaisomys insularis as an insular lineage of mouse.
Tavera, Jose; Acero P, Arturo; Wainwright, Peter C
2018-04-01
We present a phylogenetic analysis with divergence time estimates, and an ecomorphological assessment of the role of the benthic-to-pelagic axis of diversification in the history of haemulid fishes. Phylogenetic analyses were performed on 97 grunt species based on sequence data collected from seven loci. Divergence time estimation indicates that Haemulidae originated during the mid Eocene (54.7-42.3 Ma) but that the major lineages were formed during the mid-Oligocene (30-25 Ma). We propose a new classification that reflects the phylogenetic history of grunts. Overall, the pattern of morphological and functional diversification in grunts appears to be strongly linked with feeding ecology. Feeding traits and the first principal component of body shape strongly separate species that feed in benthic and pelagic habitats. The benthic-to-pelagic axis has been the major axis of ecomorphological diversification in this important group of tropical shoreline fishes, with about 13 transitions between feeding habitats that have had major consequences for head and body morphology. Copyright © 2017 Elsevier Inc. All rights reserved.
Lentendu, Guillaume; Mahé, Frédéric; Bass, David; Rueckert, Sonja; Stoeck, Thorsten; Dunthorn, Micah
2018-05-30
Tropical animals and plants are known to have high alpha diversity within forests, but low beta diversity between forests. By contrast, it is unknown whether microbes inhabiting the same ecosystems exhibit similar biogeographic patterns. To evaluate the biogeographies of tropical protists, we used metabarcoding data of species sampled in the soils of three lowland Neotropical rainforests. Taxa-area and distance-decay relationships for three of the dominant protist taxa and their subtaxa were estimated at both the OTU and phylogenetic levels, with presence-absence and abundance-based measures. These estimates were compared to null models. High local alpha and low regional beta diversity patterns were consistently found for both the parasitic Apicomplexa and the largely free-living Cercozoa and Ciliophora. Similar to animals and plants, the protists showed spatial structures between forests at the OTU and phylogenetic levels, and only at the phylogenetic level within forests. These results suggest that the biogeographies of macro- and micro-organismal eukaryotes in lowland Neotropical rainforests are partially structured by the same general processes. However, and unlike the animals and plants, the protist OTUs did not exhibit spatial structures within forests, which hinders our ability to estimate the local and regional diversity of protists in tropical forests. © 2018 John Wiley & Sons Ltd.
Li, Xinnian; Duke, Norman C; Yang, Yuchen; Huang, Lishi; Zhu, Yuxiang; Zhang, Zhang; Zhou, Renchao; Zhong, Cairong; Huang, Yelin; Shi, Suhua
2016-01-01
Avicennia L. (Avicenniaceae), one of the most diverse mangrove genera, is distributed widely in tropical and subtropical intertidal zones worldwide. Five species of Avicennia in the Indo-West Pacific region have been previously described. However, their phylogenetic relationships were determined based on morphological and allozyme data. To enhance our understanding of evolutionary patterns in the clade, we carried out a molecular phylogenetic study using wide sampling and multiple loci. Our results support two monophyletic clades across all species worldwide in Avicennia: an Atlantic-East Pacific (AEP) lineage and an Indo-West Pacific (IWP) lineage. This split is in line with the biogeographic distribution of the clade. Focusing on the IWP branch, we reconstructed a detailed phylogenetic tree based on sequences from 25 nuclear genes. The results identified three distinct subclades, (1) A. rumphiana and A. alba, (2) A. officinalis and A. integra, and (3) the A. marina complex, with high bootstrap support. The results strongly corresponded to two morphological traits in floral structure: stigma position in relation to the anthers and style length. Using Bayesian dating methods, we dated the diversification of the IWP lineage to the late Miocene (c. 6.0 million years ago); it may have been driven largely by the fluctuating sea levels since that time. PMID:27716800
Bayesian models for comparative analysis integrating phylogenetic uncertainty.
de Villemereuil, Pierre; Wells, Jessie A; Edwards, Robert D; Blomberg, Simon P
2012-06-28
Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. 
We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language. PMID:22741602
Bahmani, Zahed; Rastegar-Pouyani, Eskandar; Rastegar-Pouyani, Nasrullah
2017-09-08
The taxonomic status of species included in the genus Heremites in Iran and Iraq is uncertain. Three of these species have been assigned to the genus based on morphology: Heremites auratus transcaucasica, H. vittatus, and H. septemtaeniatus. We examined the phylogenetic relationships and taxonomic status of the Iranian and Iraqi species of Heremites by performing phylogenetic analyses using mitochondrial DNA sequences (cytochrome b and 16S rRNA). Phylogenetic relationships and estimated genetic distances indicated that the Heremites populations of the area (Iran and Iraq) form five distinct clades. Three of these clades are found only in Iran, specifically in: (1) Fars and Hormozgan provinces; (2) northeastern Khuzestan; and (3) Khorasan and Isfahan provinces. The fourth clade (H. septemtaeniatus) is found in western Iran and in Mahshahr (Iran) as well as in eastern and northern parts of Iraq. The fifth clade, Heremites vittatus, is found in Iran and Iraq. We also confirm the absence of H. auratus in Iran and Iraq. Our analyses further indicated that H. vittatus is the sister taxon to the other groups, and estimated the divergence of this clade in the Middle Miocene (15.9 Mya). The clade containing the Fars-Hormozgan and Khuzestan populations diverged at the end of the Miocene (8.5 Mya). The Isfahan and Khorasan populations separated in the Pliocene (4.2 Mya) from the western Iranian group, the group in Mahshahr, Iran, and the groups in northern and eastern Iraq.
Santos-Neto, Guilherme da Cruz; Beasley, Colin Robert; Schneider, Horacio; Pimpão, Daniel Mansur; Hoeh, Walter Randolph; Simone, Luiz Ricardo Lopes de; Tagliaro, Claudia Helena
2016-07-01
The current phylogenetic framework for the South American Hyriidae is solely based on morphological data. However, freshwater bivalve morphology is highly variable due to both genetic and environmental factors. The present study used both mitochondrial (COI and 16S) and nuclear (18S-ITS1) sequences in molecular phylogenetic analyses of nine Neotropical species of Hyriidae, collected from 15 South American rivers, and sequences of hyriids from Australia and New Zealand obtained from GenBank. The present molecular findings support traditional taxonomic proposals, based on morphology, for the South American subfamily Hyriinae, currently divided into three tribes: Hyriini, Castaliini and Rhipidodontini. Phylogenetic trees based on COI nucleotide sequences revealed at least four geographical groups of Castalia ambigua: northeast Amazon (Piriá, Tocantins and Caeté rivers), central Amazon, including C. quadrata (Amazon and Aripuanã rivers), north (Trombetas river), and C. ambigua from Peru. Genetic distances suggest that some specimens may be cryptic species. Among the Hyriini, a total evidence data set generated phylogenetic trees indicating that Paxyodon syrmatophorus and Prisodon obliquus are more closely related, followed by Triplodon corrugatus. The molecular clock, based on COI, agreed with the fossil record of Neotropical hyriids. The ancestor of both Australasian and Neotropical Hyriidae is estimated to have lived around 225 million years ago. Copyright © 2016 Elsevier Inc. All rights reserved.
Zhang, Peng
2012-01-01
Background Universal nuclear protein-coding locus (NPCL) markers that are applicable across diverse taxa and show good phylogenetic discrimination have broad applications in molecular phylogenetic studies. For example, RAG1, a representative NPCL marker, has been successfully used to make phylogenetic inferences within all major osteichthyan groups. However, such markers with broad working range and high phylogenetic performance are still scarce. It is necessary to develop more universal NPCL markers comparable to RAG1 for osteichthyan phylogenetics. Methodology/Principal Findings We developed three long universal NPCL markers (>1.6 kb each) based on single-copy nuclear genes (KIAA1239, SACS and TTN) that possess large exons and exhibit the appropriate evolutionary rates. We then compared their phylogenetic utilities with that of the reference marker RAG1 in 47 jawed vertebrate species. In comparison with RAG1, each of the three long universal markers yielded similar topologies and branch supports, all in congruence with the currently accepted osteichthyan phylogeny. To compare their phylogenetic performance visually, we also estimated the phylogenetic informativeness (PI) profile for each of the four long universal NPCL markers. The PI curves indicated that SACS performed best over the whole timescale, while RAG1, KIAA1239 and TTN exhibited similar phylogenetic performances. In addition, we compared the success of nested PCR and standard PCR when amplifying NPCL marker fragments. The amplification success rate and efficiency of the nested PCR were overwhelmingly higher than those of standard PCR. Conclusions/Significance Our work clearly demonstrates the superiority of nested PCR over the conventional PCR in phylogenetic studies and develops three long universal NPCL markers (KIAA1239, SACS and TTN) with the nested PCR strategy. 
The three markers exhibit high phylogenetic utilities in osteichthyan phylogenetics and can be widely used as pilot genes for phylogenetic questions of osteichthyans at different taxonomic levels. PMID:22720083
Cisneros, Laura M; Fagan, Matthew E; Willig, Michael R
2016-01-01
Assembly of species into communities following human disturbance (e.g., deforestation, fragmentation) may be governed by spatial (e.g., dispersal) or environmental (e.g., niche partitioning) mechanisms. Variation partitioning has been used to broadly disentangle spatial and environmental mechanisms, and approaches utilizing functional and phylogenetic characteristics of communities have been implemented to determine the relative importance of particular environmental (or niche-based) mechanisms. Nonetheless, few studies have integrated these quantitative approaches to comprehensively assess the relative importance of particular structuring processes. We employed a novel variation partitioning approach to evaluate the relative importance of particular spatial and environmental drivers of taxonomic, functional, and phylogenetic aspects of bat communities in a human-modified landscape in Costa Rica. Specifically, we estimated the amount of variation in species composition (taxonomic structure) and in two aspects of functional and phylogenetic structure (i.e., composition and dispersion) along a forest loss and fragmentation gradient that are uniquely explained by landscape characteristics (i.e., environment) or space to assess the importance of competing mechanisms. The unique effects of space on taxonomic, functional and phylogenetic structure were consistently small. In contrast, landscape characteristics (i.e., environment) played an appreciable role in structuring bat communities. Spatially-structured landscape characteristics explained 84% of the variation in functional or phylogenetic dispersion, and the unique effects of landscape characteristics significantly explained 14% of the variation in species composition. Furthermore, variation in bat community structure was primarily due to differences in dispersion of species within functional or phylogenetic space along the gradient, rather than due to differences in functional or phylogenetic composition. 
Variation among bat communities was related to environmental mechanisms, especially niche-based (i.e., environmental) processes, rather than spatial mechanisms. High variation in functional or phylogenetic dispersion, as opposed to functional or phylogenetic composition, suggests that loss or gain of niche space is driving the progressive loss or gain of species with particular traits from communities along the human-modified gradient. Thus, environmental characteristics associated with landscape structure influence functional or phylogenetic aspects of bat communities by effectively altering the ways in which species partition niche space. PMID:27761338
Perry, Jonathan M G; Cooke, Siobhán B; Runestad Connour, Jacqueline A; Burgess, M Loring; Ruff, Christopher B
2018-02-01
Body mass is an important component of any paleobiological reconstruction. Reliable skeletal dimensions for making estimates are desirable but extant primate reference samples with known body masses are rare. We estimated body mass in a sample of extinct platyrrhines and Fayum anthropoids based on four measurements of the articular surfaces of the humerus and femur. Estimates were based on a large extant reference sample of wild-collected individuals with associated body masses, including previously published and new data from extant platyrrhines, cercopithecoids, and hominoids. In general, scaling of joint dimensions is positively allometric relative to expectations of geometric isometry, but negatively allometric relative to expectations of maintaining equivalent joint surface areas. Body mass prediction equations based on articular breadths are reasonably precise, with %SEEs of 17-25%. The breadth of the distal femoral articulation yields the most reliable estimates of body mass because it scales similarly in all major anthropoid taxa. Other joints scale differently in different taxa; therefore, locomotor style and phylogenetic affinity must be considered when calculating body mass estimates from the proximal femur, proximal humerus, and distal humerus. The body mass prediction equations were applied to 36 Old World and New World fossil anthropoid specimens representing 11 taxa, plus two Haitian specimens of uncertain taxonomic affinity. Among the extinct platyrrhines studied, only Cebupithecia is similar to large, extant platyrrhines in having large humeral (especially distal) joints. Our body mass estimates differ from each other and from published estimates based on teeth in ways that reflect known differences in relative sizes of the joints and teeth. We prefer body mass estimators that are biomechanically linked to weight-bearing, and especially those that are relatively insensitive to differences in locomotor style and phylogenetic history. 
Whenever possible, extant reference samples should be chosen to match target fossils in joint proportionality. Copyright © 2017 Elsevier Ltd. All rights reserved.
Independent contrasts and PGLS regression estimators are equivalent.
Blomberg, Simon P; Lefevre, James G; Wells, Jessie A; Waterhouse, Mary
2012-05-01
We prove that the slope parameter of the ordinary least squares regression of phylogenetically independent contrasts (PICs) conducted through the origin is identical to the slope parameter of the method of generalized least squares (GLS) regression under a Brownian motion model of evolution. This equivalence has several implications: 1. Understanding the structure of the linear model for GLS regression provides insight into when and why phylogeny is important in comparative studies. 2. The limitations of the PIC regression analysis are the same as the limitations of the GLS model. In particular, phylogenetic covariance applies only to the response variable in the regression and the explanatory variable should be regarded as fixed. Calculation of PICs for explanatory variables should be treated as a mathematical idiosyncrasy of the PIC regression algorithm. 3. Since the GLS estimator is the best linear unbiased estimator (BLUE), the slope parameter estimated using PICs is also BLUE. 4. If the slope is estimated using different branch lengths for the explanatory and response variables in the PIC algorithm, the estimator is no longer the BLUE, so this is not recommended. Finally, we discuss whether and how to accommodate phylogenetic covariance in regression analyses, particularly in relation to the problem of phylogenetic uncertainty. This discussion is from both frequentist and Bayesian perspectives.
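The equivalence stated in this abstract can be checked numerically on a small example. The sketch below is only an illustration under stated assumptions: the four-tip tree, the trait values, and all function names (`contrasts`, `bm_covariance`, `pic_slope`, `gls_slope`) are hypothetical and not from the paper. It computes standardized contrasts with Felsenstein's pruning recursion, builds the Brownian-motion covariance matrix from shared branch lengths, and compares the through-origin PIC slope with the GLS slope.

```python
import numpy as np

# A tip is (name, branch_length); an internal node is
# (left_child, right_child, branch_length). The root has branch length 0.
TREE = ((("A", 1.0), ("B", 1.0), 1.0), (("C", 0.5), ("D", 0.5), 1.5), 0.0)
NAMES = ["A", "B", "C", "D"]
X = {"A": 1.0, "B": 2.0, "C": 4.0, "D": 3.5}   # hypothetical explanatory trait
Y = {"A": 0.3, "B": 1.1, "C": 2.9, "D": 2.0}   # hypothetical response trait


def is_tip(node):
    return len(node) == 2


def contrasts(node, trait, out):
    """Return (node value, adjusted branch length); append standardized contrasts to out."""
    if is_tip(node):
        name, bl = node
        return trait[name], bl
    left, right, bl = node
    xl, vl = contrasts(left, trait, out)
    xr, vr = contrasts(right, trait, out)
    out.append((xl - xr) / np.sqrt(vl + vr))
    value = (xl / vl + xr / vr) / (1.0 / vl + 1.0 / vr)
    return value, bl + vl * vr / (vl + vr)


def bm_covariance(node, names, C):
    """Add each branch length to C for every pair of tips that descend from it."""
    if is_tip(node):
        tips, bl = [names.index(node[0])], node[1]
    else:
        left, right, bl = node
        tips = bm_covariance(left, names, C) + bm_covariance(right, names, C)
    for i in tips:
        for j in tips:
            C[i, j] += bl
    return tips


def pic_slope(tree, x, y):
    """OLS slope of y-contrasts on x-contrasts, forced through the origin."""
    cx, cy = [], []
    contrasts(tree, x, cx)
    contrasts(tree, y, cy)
    cx, cy = np.array(cx), np.array(cy)
    return (cx @ cy) / (cx @ cx)


def gls_slope(tree, names, x, y):
    """GLS slope (with intercept) under the Brownian-motion covariance of the tree."""
    C = np.zeros((len(names), len(names)))
    bm_covariance(tree, names, C)
    design = np.column_stack([np.ones(len(names)), [x[n] for n in names]])
    resp = np.array([y[n] for n in names])
    Ci = np.linalg.inv(C)
    beta = np.linalg.solve(design.T @ Ci @ design, design.T @ Ci @ resp)
    return beta[1]


if __name__ == "__main__":
    print(pic_slope(TREE, X, Y), gls_slope(TREE, NAMES, X, Y))
```

Both functions use the same branch lengths for the two traits, which matches the condition in point 4 of the abstract under which the PIC slope remains the BLUE.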
Adaptive MCMC in Bayesian phylogenetics: an application to analyzing partitioned data in BEAST.
Baele, Guy; Lemey, Philippe; Rambaut, Andrew; Suchard, Marc A
2017-06-15
Advances in sequencing technology continue to deliver increasingly large molecular sequence datasets that are often heavily partitioned in order to accurately model the underlying evolutionary processes. In phylogenetic analyses, partitioning strategies involve estimating conditionally independent models of molecular evolution for different genes and different positions within those genes, requiring a large number of evolutionary parameters that have to be estimated, leading to an increased computational burden for such analyses. The past two decades have also seen the rise of multi-core processors, both in the central processing unit (CPU) and graphics processing unit (GPU) markets, enabling massively parallel computations that are not yet fully exploited by many software packages for multipartite analyses. We here propose a Markov chain Monte Carlo (MCMC) approach using an adaptive multivariate transition kernel to estimate in parallel a large number of parameters, split across partitioned data, by exploiting multi-core processing. Across several real-world examples, we demonstrate that our approach enables the estimation of these multipartite parameters more efficiently than standard approaches that typically use a mixture of univariate transition kernels. In one case, when estimating the relative rate parameter of the non-coding partition in a heterochronous dataset, MCMC integration efficiency improves by > 14-fold. Our implementation is part of the BEAST code base, a widely used open source software package to perform Bayesian phylogenetic inference. © The Author 2017. Published by Oxford University Press. All rights reserved.
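To make the idea of an adaptive multivariate transition kernel concrete, here is a minimal sketch of a generic adaptive Metropolis sampler in the style of Haario et al., where the proposal covariance is learned from the chain's own history. It is not the BEAST implementation and does no parallel evaluation; the bivariate normal target, the tuning constants, and the names `log_target` and `adaptive_metropolis` are assumptions made for illustration.

```python
import numpy as np


def log_target(theta):
    """Hypothetical target: zero-mean bivariate normal with correlation 0.8."""
    cov = np.array([[1.0, 0.8], [0.8, 1.0]])
    return -0.5 * theta @ np.linalg.solve(cov, theta)


def adaptive_metropolis(log_p, theta0, n_iter=10000, adapt_start=500, seed=1):
    """Random-walk Metropolis whose joint proposal covariance tracks the chain history."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    d = theta.size
    scale = 2.38 ** 2 / d          # commonly used scaling for Gaussian proposals
    jitter = 1e-6 * np.eye(d)      # keeps the proposal covariance non-singular
    lp = log_p(theta)
    mean = theta.copy()            # running mean of visited states
    cov = np.eye(d)                # running covariance, initialised at the identity
    samples = np.empty((n_iter, d))
    for t in range(n_iter):
        # Fixed small proposal during warm-up, adapted joint proposal afterwards.
        if t >= adapt_start:
            prop_cov = scale * cov + jitter
        else:
            prop_cov = 0.1 * np.eye(d)
        proposal = rng.multivariate_normal(theta, prop_cov)
        lp_new = log_p(proposal)
        # Symmetric proposal, so the acceptance ratio is the ratio of target densities.
        if np.log(rng.uniform()) < lp_new - lp:
            theta, lp = proposal, lp_new
        samples[t] = theta
        # Welford-style recursive update of the running mean and covariance.
        n = t + 2
        delta = theta - mean
        mean = mean + delta / n
        cov = cov + (np.outer(delta, theta - mean) - cov) / n
    return samples


if __name__ == "__main__":
    draws = adaptive_metropolis(log_target, [3.0, -3.0])
    print(draws[2000:].mean(axis=0))
    print(np.corrcoef(draws[2000:].T)[0, 1])
```

Updating all parameters jointly with a learned covariance is what distinguishes this from a mixture of univariate kernels: correlated parameters are moved together, which tends to improve mixing when the posterior is strongly correlated.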
Phylogenetic relationships of Hemiptera inferred from mitochondrial and nuclear genes.
Song, Nan; Li, Hu; Cai, Wanzhi; Yan, Fengming; Wang, Jianyun; Song, Fan
2016-11-01
Here, we reconstructed the Hemiptera phylogeny based on the expanded mitochondrial protein-coding genes and the nuclear 18S rRNA gene, separately. Differential rates of change across lineages may be associated with the long-branch attraction (LBA) effect and result in conflicting estimates of phylogeny from different types of data. To reduce the potential effects of systematic biases on inferences of topology, various data coding schemes, a site removal method, and different algorithms were used in phylogenetic reconstruction. We show that the outgroups Phthiraptera and Thysanoptera and the ingroup Sternorrhyncha share similar base composition, and exhibit "long branches" relative to other hemipterans. Thus, long-branch attraction between these groups is suspected to cause the failure to recover Hemiptera under the homogeneous model. In contrast, a monophyletic Hemiptera is supported when a heterogeneous model is used in the analysis. Although higher-level phylogenetic relationships within Hemiptera remain to be resolved, analyses are beginning to converge on a stable phylogeny.
Lorén, J. Gaspar; Farfán, Maribel; Fusté, M. Carmen
2014-01-01
Several approaches have been developed to estimate both the relative and absolute rates of speciation and extinction within clades based on molecular phylogenetic reconstructions of evolutionary relationships, according to an underlying model of diversification. However, the macroevolutionary models established for eukaryotes have scarcely been used with prokaryotes. We have investigated the rate and pattern of cladogenesis in the genus Aeromonas (γ-Proteobacteria, Proteobacteria, Bacteria) using the sequences of five housekeeping genes and an uncorrelated relaxed-clock approach. To our knowledge, until now this analysis has never been applied to all the species described in a bacterial genus and thus opens up the possibility of establishing models of speciation from sequence data commonly used in phylogenetic studies of prokaryotes. Our results suggest that the genus Aeromonas began to diverge between 248 and 266 million years ago, exhibiting a constant divergence rate through the Phanerozoic, which could be described as a pure birth process. PMID:24586399
Winterton, Shaun L; Wiegmann, Brian M; Schlinger, Evert I
2007-06-01
The first formal analysis of phylogenetic relationships among small-headed flies (Acroceridae) is presented based on DNA sequence data from two ribosomal (16S and 28S) and two protein-encoding genes: carbamoylphosphate synthase (CPS) domain of CAD (i.e., rudimentary locus) and cytochrome oxidase I (COI). DNA sequences from 40 species in 22 genera of Acroceridae (representing all three subfamilies) were compared with outgroup exemplars from Nemestrinidae, Stratiomyidae, Tabanidae, and Xylophagidae. Parsimony and Bayesian simultaneous analyses of the full data set recover a well-resolved and strongly supported hypothesis of phylogenetic relationships for major lineages within the family. Molecular evidence supports the monophyly of traditionally recognised subfamilies Philopotinae and Panopinae, but Acrocerinae are polyphyletic. Panopinae, sometimes considered "primitive" based on morphology and host-use, are always placed in a more derived position in the current study. Furthermore, these data support emerging morphological evidence that the type genus Acrocera Meigen, and its sister genus Sphaerops, are atypical acrocerids, comprising a sister lineage to all other Acroceridae. Based on the phylogeny generated in the simultaneous analysis, historical divergence times were estimated using Bayesian methodology constrained with fossil data. These estimates indicate Acroceridae likely evolved during the late Triassic but did not diversify greatly until the Cretaceous.
An ordination of life histories using morphological proxies: capital vs. income breeding in insects.
Davis, Robert B; Javoiš, Juhan; Kaasik, Ants; Õunap, Erki; Tammaru, Toomas
2016-08-01
Predictive classifications of life histories are essential for evolutionary ecology. While attempts to apply a single approach to all organisms may be overambitious, recent advances suggest that more narrow ordination schemes can be useful. However, these schemes mostly lack easily observable proxies of the position of a species on respective axes. It has been proposed that, in insects, the degree of capital (vs. income) breeding, reflecting the importance of adult feeding for reproduction, correlates with various ecological traits at the level of among-species comparison. We sought to prove these ideas via rigorous phylogenetic comparative analyses. We used experimentally derived life-history data for 57 species of European Geometridae (Lepidoptera), and an original phylogenetic reconstruction. The degree of capital breeding was estimated based on morphological proxies, including relative abdomen size of females. Applying Brownian-motion-based comparative analyses (with an original update to include error estimates), we demonstrated the associations between the degree of capital breeding and larval diet breadth, sexual size dimorphism, and reproductive season. Ornstein-Uhlenbeck model based phylogenetic analysis suggested a causal relationship between the degree of capital breeding and diet breadth. Our study indicates that the gradation from capital to income breeding is an informative axis to ordinate life-history strategies in flying insects which are affected by the fecundity vs. mobility trade off, with the availability of easy to record proxies contributing to its predictive power in practical contexts. © 2016 by the Ecological Society of America.
A Framework Phylogeny of the American Oak Clade Based on Sequenced RAD Data
Hipp, Andrew L.; Eaton, Deren A. R.; Cavender-Bares, Jeannine; Fitzek, Elisabeth; Nipper, Rick; Manos, Paul S.
2014-01-01
Previous phylogenetic studies in oaks (Quercus, Fagaceae) have failed to resolve the backbone topology of the genus with strong support. Here, we utilize next-generation sequencing of restriction-site associated DNA (RAD-Seq) to resolve a framework phylogeny of a predominantly American clade of oaks whose crown age is estimated at 23–33 million years old. Using a recently developed analytical pipeline for RAD-Seq phylogenetics, we created a concatenated matrix of 1.40 E06 aligned nucleotides, constituting 27,727 sequence clusters. RAD-Seq data were readily combined across runs, with no difference in phylogenetic placement between technical replicates, which overlapped by only 43–64% in locus coverage. 17% (4,715) of the loci we analyzed could be mapped with high confidence to one or more expressed sequence tags in NCBI Genbank. A concatenated matrix of the loci that BLAST to at least one EST sequence provides approximately half as many variable or parsimony-informative characters as equal-sized datasets from the non-EST loci. The EST-associated matrix is more complete (fewer missing loci) and has slightly lower homoplasy than non-EST subsampled matrices of the same size, but there is no difference in phylogenetic support or relative attribution of base substitutions to internal versus terminal branches of the phylogeny. We introduce a partitioned RAD visualization method (implemented in the R package RADami; http://cran.r-project.org/web/packages/RADami) to investigate the possibility that suboptimal topologies supported by large numbers of loci—due, for example, to reticulate evolution or lineage sorting—are masked by the globally optimal tree. We find no evidence for strongly-supported alternative topologies in our study, suggesting that the phylogeny we recover is a robust estimate of large-scale phylogenetic patterns in the American oak clade. 
Our study is one of the first to demonstrate the utility of RAD-Seq data for inferring phylogeny in a 23–33 million year-old clade. PMID:24705617
Fast and accurate estimation of the covariance between pairwise maximum likelihood distances.
Gil, Manuel
2014-01-01
Pairwise evolutionary distances are a model-based summary statistic for a set of molecular sequences. They represent the leaf-to-leaf path lengths of the underlying phylogenetic tree. Estimates of pairwise distances with overlapping paths covary because of shared mutation events. It is desirable to take this covariance structure into account to increase precision in any process that compares or combines distances. This paper introduces a fast estimator for the covariance of two pairwise maximum likelihood distances, estimated under general Markov models. The estimator is based on a conjecture (going back to Nei & Jin, 1989) which links the covariance to path lengths. It is proven here under a simple symmetric substitution model. A simulation shows that the estimator outperforms previously published ones in terms of the mean squared error. PMID:25279263
Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong
2013-07-04
Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we examined the phylogenetic relationships among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR amplification of the dnaA, murC and pyrG genes, whose sequences were determined to assess their suitability as phylogenetic markers. We then estimated genetic distances and constructed phylogenetic trees from the 16S rRNA gene and the three housekeeping genes, combined with the corresponding published sequences. The topologies of the three housekeeping gene trees were consistent with that of the 16S rRNA gene tree. Sequence homology among closely related Leu. citreum species differed between genes, ranging from 75.5% to 97.2% for dnaA, 50.2% to 99.7% for murC, 65.0% to 99.8% for pyrG and 98.5% to 100% for 16S rRNA. The phylogenetic relationships inferred from the three housekeeping genes were highly consistent with those from the 16S rRNA gene, while the genetic distances for the housekeeping genes were much higher than for the 16S rRNA gene. Consequently, the dnaA, murC and pyrG genes are suitable for the classification and identification of closely related Leu. citreum species.
Posada, David; Buckley, Thomas R
2004-10-01
Model selection is a topic of special relevance in molecular phylogenetics that affects many, if not all, stages of phylogenetic inference. Here we discuss some fundamental concepts and techniques of model selection in the context of phylogenetics. We start by reviewing different aspects of the selection of substitution models in phylogenetics from a theoretical, philosophical and practical point of view, and summarize this comparison in table format. We argue that the most commonly implemented model selection approach, the hierarchical likelihood ratio test, is not the optimal strategy for model selection in phylogenetics, and that approaches like the Akaike Information Criterion (AIC) and Bayesian methods offer important advantages. In particular, the latter two methods are able to simultaneously compare multiple nested or nonnested models, assess model selection uncertainty, and allow for the estimation of phylogenies and model parameters using all available models (model-averaged inference or multimodel inference). We also describe how the relative importance of the different parameters included in substitution models can be depicted. To illustrate some of these points, we have applied AIC-based model averaging to 37 mitochondrial DNA sequences from the subgenus Ohomopterus (genus Carabus) ground beetles described by Sota and Vogler (2001).
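The AIC comparison and model averaging described above can be illustrated with a minimal Python sketch. It uses the standard definitions AIC = 2k - 2 ln L and Akaike weights proportional to exp(-Δ/2). The model names are common substitution models, but the log-likelihoods, parameter counts and per-model estimates below are hypothetical values invented for illustration, not results from the paper.

```python
import math

def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 ln L (lower is better)."""
    return 2.0 * n_params - 2.0 * log_likelihood

def akaike_weights(aic_scores):
    """Turn AIC scores into weights that sum to 1 across the candidate models."""
    best = min(aic_scores.values())
    rel = {m: math.exp(-0.5 * (s - best)) for m, s in aic_scores.items()}
    total = sum(rel.values())
    return {m: r / total for m, r in rel.items()}

def model_average(estimates, weights):
    """Weighted mean of a parameter estimated under every candidate model."""
    return sum(weights[m] * estimates[m] for m in estimates)

# Hypothetical fits: model -> (maximized log-likelihood, free parameters).
fits = {"JC69": (-2100.0, 0), "HKY85": (-2050.0, 4), "GTR": (-2048.0, 8)}
scores = {m: aic(lnl, k) for m, (lnl, k) in fits.items()}
weights = akaike_weights(scores)

# Hypothetical per-model estimates of some shared quantity (e.g. tree length).
tree_length = {"JC69": 0.80, "HKY85": 0.95, "GTR": 0.97}
averaged = model_average(tree_length, weights)
```

Because the weights are computed for all candidates at once, the same code handles nested and nonnested models, and the spread of the weights gives a rough picture of model selection uncertainty.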
Harrison, Luke B; Larsson, Hans C E
2015-03-01
Likelihood-based methods are commonplace in phylogenetic systematics. Although much effort has been directed toward likelihood-based models for molecular data, comparatively less work has addressed models for discrete morphological character (DMC) data. Among-character rate variation (ACRV) may confound phylogenetic analysis, but there have been few analyses of the magnitude and distribution of rate heterogeneity among DMCs. Using 76 data sets covering a range of plants, invertebrate, and vertebrate animals, we used a modified version of MrBayes to test equal, gamma-distributed and lognormally distributed models of ACRV, integrating across phylogenetic uncertainty using Bayesian model selection. We found that in approximately 80% of data sets, unequal-rates models outperformed equal-rates models, especially among larger data sets. Moreover, although most data sets were equivocal, more data sets favored the lognormal rate distribution relative to the gamma rate distribution, lending some support for more complex character correlations than in molecular data. Parsimony estimation of the underlying rate distributions in several data sets suggests that the lognormal distribution is preferred when there are many slowly evolving characters and fewer quickly evolving characters. The commonly adopted four rate category discrete approximation used for molecular data was found to be sufficient to approximate a gamma rate distribution with discrete characters. However, among the two data sets tested that favored a lognormal rate distribution, the continuous distribution was better approximated with at least eight discrete rate categories. Although the effect of rate model on the estimation of topology was difficult to assess across all data sets, it appeared relatively minor between the unequal-rates models for the one data set examined carefully. 
As in molecular analyses, we argue that researchers should test and adopt the most appropriate model of rate variation for the data set in question. As discrete characters are increasingly used in more sophisticated likelihood-based phylogenetic analyses, it is important that these studies be built on the most appropriate and carefully selected underlying models of evolution. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
MASTtreedist: visualization of tree space based on maximum agreement subtree.
Huang, Hong; Li, Yongji
2013-01-01
The phylogenetic tree construction process might produce many candidate trees as the "best estimates." As the number of constructed phylogenetic trees grows, the need to efficiently compare their topological or physical structures arises. One tree-comparison software tool, Mesquite's Tree Set Viz module, allows the rapid and efficient visualization of tree comparison distances using multidimensional scaling (MDS). Tree-distance measures, such as Robinson-Foulds (RF), for the topological distance among different trees have been implemented in Tree Set Viz. New and sophisticated measures such as Maximum Agreement Subtree (MAST) can be continuously built upon Tree Set Viz. MAST can detect the common substructures among trees and provide more precise information on the similarity of the trees, but it is NP-hard and difficult to implement. In this article, we present a practical tree-distance metric: MASTtreedist, a MAST-based comparison metric in Mesquite's Tree Set Viz module. In this metric, efficient optimizations for the maximum weight clique problem are applied. The results suggest that the proposed method can efficiently compute the MAST distances among trees, and such tree topological differences can be translated as a scatter of points in two-dimensional (2D) space. We also provide a statistical evaluation of the provided measures with respect to RF, using experimental data sets. This new comparison module provides a new tree-tree pairwise comparison metric based on the differences in the number of MAST leaves among constructed phylogenetic trees. Such a new phylogenetic tree comparison metric improves the visualization of taxa differences by discriminating small divergences of subtree structures for phylogenetic tree reconstruction.
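For readers unfamiliar with the Robinson-Foulds baseline mentioned above, the following Python sketch computes an unrooted RF distance as the number of non-trivial bipartitions found in one tree but not the other. The nested-tuple tree encoding and all function names are assumptions made for this example. It is not the Tree Set Viz or MASTtreedist implementation, and it does not attempt the MAST computation itself.

```python
def leaf_sets(tree, out):
    """Return the leaves under `tree`; append each internal node's leaf set to `out`."""
    if isinstance(tree, str):
        return frozenset([tree])
    leaves = frozenset().union(*(leaf_sets(child, out) for child in tree))
    out.append(leaves)
    return leaves

def splits(tree):
    """Non-trivial bipartitions, each stored as the side that lacks a fixed anchor leaf."""
    clusters = []
    all_leaves = leaf_sets(tree, clusters)
    anchor = min(all_leaves)
    result = set()
    for cluster in clusters:
        side = cluster if anchor not in cluster else all_leaves - cluster
        # Keep only splits with at least two leaves on each side.
        if 2 <= len(side) <= len(all_leaves) - 2:
            result.add(side)
    return result

def robinson_foulds(t1, t2):
    """Size of the symmetric difference of the two split sets (trees share one leaf set)."""
    return len(splits(t1) ^ splits(t2))

# Two hypothetical five-taxon trees that differ in the placement of B and C.
tree_a = ((("A", "B"), "C"), ("D", "E"))
tree_b = ((("A", "C"), "B"), ("D", "E"))
distance = robinson_foulds(tree_a, tree_b)
```

A matrix of such pairwise distances is the kind of input that an MDS step would project into two dimensions.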
FPGA Acceleration of the phylogenetic likelihood function for Bayesian MCMC inference methods.
Zierke, Stephanie; Bakos, Jason D
2010-04-12
Maximum likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations. As such it contains a high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA's on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resultant co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors. We use the MrBayes 3 tool as a framework for designing our co-processor. For large datasets, we estimate that our accelerated MrBayes, if run on a current-generation FPGA, achieves a 10x speedup relative to software running on a state-of-the-art server-class microprocessor. The FPGA-based implementation achieves its performance by deeply pipelining the likelihood computations, performing multiple floating-point operations in parallel, and through a natural log approximation that is chosen specifically to leverage a deeply pipelined custom architecture. Heterogeneous computing, which combines general-purpose processors with special-purpose co-processors such as FPGAs and GPUs, is a promising approach for high-performance phylogeny inference as shown by the growing body of literature in this field. FPGAs in particular are well-suited for this task because of their low power consumption as compared to many-core processors and Graphics Processor Units (GPUs).
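To make the structure of the PLF concrete, here is a small Python sketch of the per-site conditional likelihood step for one parent node with two children. The abstract does not say which substitution model is used, so the Jukes-Cantor (JC69) model, the two-tip tree and every function name are assumptions of this example. It is not the MrBayes kernel or the FPGA design, only an indication of why the per-site iterations are independent of one another.

```python
import math

STATES = "ACGT"

def jc69_matrix(t):
    """JC69 transition probabilities for a branch of length t (substitutions per site)."""
    e = math.exp(-4.0 * t / 3.0)
    same, diff = 0.25 + 0.75 * e, 0.25 - 0.25 * e
    return [[same if i == j else diff for j in range(4)] for i in range(4)]

def tip_vector(base):
    """Conditional likelihoods at a tip: 1 for the observed base, 0 otherwise."""
    return [1.0 if s == base else 0.0 for s in STATES]

def combine(left, p_left, right, p_right):
    """Parent conditional likelihoods from two child vectors at a single site."""
    parent = []
    for i in range(4):
        from_left = sum(p_left[i][j] * left[j] for j in range(4))
        from_right = sum(p_right[i][j] * right[j] for j in range(4))
        parent.append(from_left * from_right)
    return parent

def site_likelihoods(seq_a, seq_b, t_a, t_b):
    """Likelihood of each aligned site for two tips joined at a root."""
    p_a, p_b = jc69_matrix(t_a), jc69_matrix(t_b)
    result = []
    for a, b in zip(seq_a, seq_b):  # each site is computed independently
        cond = combine(tip_vector(a), p_a, tip_vector(b), p_b)
        result.append(sum(0.25 * x for x in cond))  # uniform root frequencies
    return result

likes = site_likelihoods("ACGT", "ACGA", 0.1, 0.2)
```

Since no site depends on the result for another site, the outer loop is the part that a pipelined or parallel implementation can spread across hardware units.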
Gros-Balthazard, Muriel; Hughes, Sandrine; Alcover, Josep Antoni; Hutterer, Rainer; Rando, Juan Carlos; Michaux, Jacques; Hänni, Catherine
2012-01-01
Background: The lava mouse, Malpaisomys insularis, was endemic to the Eastern Canary islands and became extinct at the beginning of the 14th century when the Europeans reached the archipelago. Studies to determine Malpaisomys' phylogenetic affinities, based on morphological characters, remained inconclusive because morphological changes experienced by this insular rodent make phylogenetic investigations a real challenge. Over 20 years since its first description, Malpaisomys' phylogenetic position remains enigmatic. Methodology/Principal Findings: In this study, we resolved this issue using molecular characters. Mitochondrial and nuclear markers were successfully amplified from subfossils of three lava mouse samples. Molecular phylogenetic reconstructions revealed, without any ambiguity, unsuspected relationships between Malpaisomys and extant mice (genus Mus, Murinae). Moreover, through molecular dating we estimated the origin of the Malpaisomys/mouse clade at 6.9 Ma, corresponding to the maximal age at which the archipelago was colonised by the Malpaisomys ancestor via natural rafting. Conclusion/Significance: This study reconsiders the derived morphological characters of Malpaisomys in light of this unexpected molecular finding. To reconcile molecular and morphological data, we propose to consider Malpaisomys insularis as an insular lineage of mouse. PMID:22363563
Species Divergence and Phylogenetic Variation of Ecophysiological Traits in Lianas and Trees
Rios, Rodrigo S.; Salgado-Luarte, Cristian; Gianoli, Ernesto
2014-01-01
The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [Amax], dark respiration rate [Rd], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that Rd evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for Rd, while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. Rd followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for Amax. Rd may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades. PMID:24914958
Palaeohistological Evidence for Ancestral High Metabolic Rate in Archosaurs.
Legendre, Lucas J; Guénard, Guillaume; Botha-Brink, Jennifer; Cubo, Jorge
2016-11-01
Metabolic heat production in archosaurs has played an important role in their evolutionary radiation during the Mesozoic, and their ancestral metabolic condition has long been a matter of debate in systematics and palaeontology. The study of fossil bone histology provides crucial information on bone growth rate, which has been used to indirectly investigate the evolution of thermometabolism in archosaurs. However, no quantitative estimation of metabolic rate has ever been performed on fossils using bone histological features. Moreover, to date, no inference model has included phylogenetic information in the form of predictive variables. Here we performed statistical predictive modeling using the new method of phylogenetic eigenvector maps on a set of bone histological features for a sample of extant and extinct vertebrates, to estimate metabolic rates of fossil archosauromorphs. This modeling procedure serves as a case study for eigenvector-based predictive modeling in a phylogenetic context, as well as an investigation of the poorly known evolutionary patterns of metabolic rate in archosaurs. Our results show that Mesozoic theropod dinosaurs exhibit metabolic rates very close to those found in modern birds, that archosaurs share a higher ancestral metabolic rate than that of extant ectotherms, and that this derived high metabolic rate was acquired at a much more inclusive level of the phylogenetic tree, among non-archosaurian archosauromorphs. These results also highlight the difficulties of assigning a given heat production strategy (i.e., endothermy, ectothermy) to an estimated metabolic rate value, and confirm findings of previous studies that the definition of the endotherm/ectotherm dichotomy may be ambiguous. © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Understanding phylogenetic incongruence: lessons from phyllostomid bats
Dávalos, Liliana M; Cirranello, Andrea L; Geisler, Jonathan H; Simmons, Nancy B
2012-01-01
All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. 
The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive morphological convergence among nectar-feeding lineages, and incongruent gene trees. Applying methods to account for nucleotide sequence saturation reduces, but does not completely eliminate, phylogenetic conflict. We ruled out paralogy, lateral gene transfer, and poor taxon sampling and outgroup choices among the processes leading to incongruent gene trees in phyllostomid bats. Uncovering and countering the possible effects of introgression and lineage sorting of ancestral polymorphism on gene trees will require great leaps in genomic and allelic sequencing in this species-rich mammalian family. We also found evidence for adaptive molecular evolution leading to convergence in mitochondrial proteins among nectar-feeding lineages. In conclusion, the biological processes that generate phylogenetic conflict are ubiquitous, and overcoming incongruence requires better models and more data than have been collected even in well-studied organisms such as phyllostomid bats. PMID:22891620
The age and phylogeny of wood boring weevils and the origin of subsociality.
Jordal, Bjarte H; Sequeira, Andrea S; Cognato, Anthony I
2011-06-01
A large proportion of the hyperdiverse weevils are wood boring and many of these taxa have subsocial family structures. The origin and relationship between certain wood boring weevil taxa has been problematic to solve and hypotheses on their phylogenies change substantially between different studies. We aimed at testing the phylogenetic position and monophyly of the most prominent wood boring taxa Scolytinae, Platypodinae and Cossoninae, including a range of weevil outgroups with either the herbivorous or wood boring habit. Many putatively intergrading taxa were included in a broad phylogenetic analysis for the first time in this study, such as Schedlarius, Mecopelmus, Coptonotus, Dactylipalpus, Coptocorynus and allied Araucariini taxa, Dobionus, Psepholax, Amorphocerus-Porthetes, and some peculiar wood boring Conoderini with bark beetle behaviour. Data analyses were based on 128 morphological characters, rDNA nucleotides from the D2-D3 segment of 28S, and nucleotides and amino acids from the protein encoding gene fragments of CAD, ArgK, EF-1α and COI. Although the results varied for some of the groups between various data sets and analyses, one may conclude the following from this study: Scolytinae and Platypodinae are likely sister lineages most closely related to Coptonotus; Cossoninae is monophyletic (including Araucariini) and more distantly related to Scolytinae; Amorphocerini is not part of Cossoninae and Psepholax may belong to Cryptorhynchini. Likelihood estimation of ancestral state reconstruction of subsociality indicated five or six origins as a conservative estimate. Overall the phylogenetic results were quite dependent on morphological data and we conclude that more genetic loci must be sampled to improve phylogenetic resolution. However, some results such as the derived position of Scolytinae were consistent between morphological and molecular data. 
Revised time estimates for the origin of Curculionidae and various subfamily groups were made using the recently updated fossil age of Scolytinae (100 Ma), which had a significant influence on node age estimates. Copyright © 2011 Elsevier Inc. All rights reserved.
Duchêne, David; Duchêne, Sebastian; Ho, Simon Y W
2015-07-01
Phylogenetic estimation of evolutionary timescales has become routine in biology, forming the basis of a wide range of evolutionary and ecological studies. However, there are various sources of bias that can affect these estimates. We investigated whether tree imbalance, a property that is commonly observed in phylogenetic trees, can lead to reduced accuracy or precision of phylogenetic timescale estimates. We analysed simulated data sets with calibrations at internal nodes and at the tips, taking into consideration different calibration schemes and levels of tree imbalance. We also investigated the effect of tree imbalance on two empirical data sets: mitogenomes from primates and serial samples of the African swine fever virus. In analyses calibrated using dated, heterochronous tips, we found that tree imbalance had a detrimental impact on precision and produced a bias in which the overall timescale was underestimated. A pronounced effect was observed in analyses with shallow calibrations. The greatest decreases in accuracy usually occurred in the age estimates for medium and deep nodes of the tree. In contrast, analyses calibrated at internal nodes did not display a reduction in estimation accuracy or precision due to tree imbalance. Our results suggest that molecular-clock analyses can be improved by increasing taxon sampling, with the specific aims of including deeper calibrations, breaking up long branches and reducing tree imbalance. © 2014 John Wiley & Sons Ltd.
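The abstract does not state which statistic was used to quantify tree imbalance. As one common choice, the Colless index sums, over all internal nodes of a binary tree, the absolute difference in tip counts between the two daughter clades. The Python sketch below assumes that index and a nested-tuple tree encoding, both chosen for illustration only.

```python
def colless(tree):
    """Colless imbalance of a rooted binary tree given as nested 2-tuples of tip names."""
    total = 0

    def count_tips(node):
        nonlocal total
        if isinstance(node, str):
            return 1
        left, right = node  # assumes strictly bifurcating trees
        n_left, n_right = count_tips(left), count_tips(right)
        total += abs(n_left - n_right)
        return n_left + n_right

    count_tips(tree)
    return total

# Hypothetical four-tip trees: fully balanced versus fully pectinate (ladder-like).
balanced = (("A", "B"), ("C", "D"))
ladder = ((("A", "B"), "C"), "D")
scores = (colless(balanced), colless(ladder))
```

Higher values indicate more imbalanced trees, so the ladder-like tree scores above the balanced one for the same number of tips.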
Coalescent methods for estimating phylogenetic trees.
Liu, Liang; Yu, Lili; Kubatko, Laura; Pearl, Dennis K; Edwards, Scott V
2009-10-01
We review recent models to estimate phylogenetic trees under the multispecies coalescent. Although the distinction between gene trees and species trees has come to the fore of phylogenetics, only recently have methods been developed that explicitly estimate species trees. Of the several factors that can cause gene tree heterogeneity and discordance with the species tree, deep coalescence due to random genetic drift in branches of the species tree has been modeled most thoroughly. Bayesian approaches to estimating species trees utilize two likelihood functions, one of which has been widely used in traditional phylogenetics and involves the model of nucleotide substitution, and the second of which is less familiar to phylogeneticists and involves the probability distribution of gene trees given a species tree. Other recent parametric and nonparametric methods for estimating species trees involve parsimony criteria, summary statistics, supertree and consensus methods. Species tree approaches are an appropriate goal for systematics, appear to work well in some cases where concatenation can be misleading, and suggest that sampling many independent loci will be paramount. Such methods can also be challenging to implement because of the complexity of the models and computational time. In addition, further elaboration of the simplest of coalescent models will be required to incorporate commonly known issues such as deviation from the molecular clock, gene flow and other genetic forces.
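The "probability distribution of gene trees given a species tree" can be written in closed form for the simplest case. For a three-species tree ((A,B),C) with one sampled lineage per species and an internal branch of length T in coalescent units, the standard result is that the gene tree matches the species tree with probability 1 - (2/3)e^(-T), and each of the two discordant topologies has probability (1/3)e^(-T). The Python sketch below encodes only this textbook case. The function name and dictionary keys are illustrative assumptions, and this is not the likelihood machinery of any particular species-tree program.

```python
import math

def three_taxon_gene_tree_probs(t):
    """Gene tree topology probabilities for species tree ((A,B),C).

    `t` is the internal branch length in coalescent units.
    """
    if t < 0:
        raise ValueError("branch length must be non-negative")
    no_coalescence = math.exp(-t)  # A and B lineages fail to coalesce on the branch
    return {
        "((A,B),C)": 1.0 - (2.0 / 3.0) * no_coalescence,  # concordant
        "((A,C),B)": no_coalescence / 3.0,                # discordant
        "((B,C),A)": no_coalescence / 3.0,                # discordant
    }

short_branch = three_taxon_gene_tree_probs(0.1)
long_branch = three_taxon_gene_tree_probs(5.0)
```

With a short internal branch the three topologies are nearly equally likely, which is the deep-coalescence discordance that motivates sampling many independent loci.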
Spinks, Phillip Q; Thomson, Robert C; Zhang, YaPing; Che, Jing; Wu, Yonghua; Shaffer, H Bradley
2012-06-01
Turtles are currently the most endangered major clade of vertebrates on earth, and Asian box turtles (Cuora) are in catastrophic decline. Effective management of this diverse turtle clade has been hampered by human-mediated, and perhaps natural hybridization, resulting in discordance between mitochondrial and nuclear markers and confusion regarding species boundaries and phylogenetic relationships among hypothesized species of Cuora. Here, we present analyses of mitochondrial and nuclear DNA data for all 12 currently hypothesized species to resolve both species boundaries and phylogenetic relationships. Our 15-gene, 40-individual nuclear data set was frequently in conflict with our mitochondrial data set; based on its general concordance with published morphological analyses and the strength of 15 independent estimates of evolutionary history, we interpret the nuclear data as representing the most reliable estimate of species boundaries and phylogeny of Cuora. Our results strongly reiterate the necessity of using multiple nuclear markers for phylogeny and species delimitation in these animals, including any form of DNA "barcoding", and point to Cuora as an important case study where reliance on mitochondrial DNA can lead to incorrect species identification. Copyright © 2012 Elsevier Inc. All rights reserved.
Molecular systematics and global phylogeography of angel sharks (genus Squatina).
Stelbrink, Björn; von Rintelen, Thomas; Cliff, Geremy; Kriwet, Jürgen
2010-02-01
Angel sharks of the genus Squatina represent a group comprising 22 extant benthic species inhabiting continental shelves and upper slopes. In the present study, a comprehensive phylogenetic reconstruction of 17 Squatina species based on two mitochondrial markers (COI and 16S rRNA) is provided. The phylogenetic reconstructions are used to test biogeographic patterns. In addition, a molecular clock analysis is conducted to estimate divergence times of the emerged clades. All analyses show Squatina to be monophyletic. Four geographic clades are recognized, of which the Europe-North Africa-Asia clade is probably a result of the Tethys Sea closure. A second sister group relationship emerged in the analyses, including S. californica (eastern North Pacific) and S. dumeril (western North Atlantic), probably related to the rise of the Panamanian isthmus. The molecular clock analysis shows that both lineage divergences coincide with the estimated time of these two geological events. Copyright (c) 2009. Published by Elsevier Inc.
Zeng, Xu; Yuan, Zhengrong; Tong, Xin; Li, Qiushi; Gao, Weiwei; Qin, Minjian; Liu, Zhihua
2012-05-01
Oryzoideae (Poaceae) plants have economic and ecological value. However, the phylogenetic position of some plants is not clear, such as Hygroryza aristata (Retz.) Nees. and Porteresia coarctata (Roxb.) Tateoka (syn. Oryza coarctata). Comprehensive molecular phylogenetic studies have been carried out on many genera in the Poaceae. Different DNA sequences, including nuclear and chloroplast sequences, have been extensively employed to determine relationships at both higher and lower taxonomic levels in the Poaceae. The chloroplast ndhF gene and atpB-rbcL spacer were used to construct phylogenetic trees and estimate the divergence times of Oryzoideae, Bambusoideae, Panicoideae, Pooideae and other subfamilies. Complete sequences of atpB-rbcL and ndhF were generated for 17 species representing six species of the Oryzoideae and related subfamilies. Nicotiana tabacum L. was the outgroup species. The two DNA datasets were analyzed using Maximum Parsimony and Bayesian analysis methods. The molecular phylogeny revealed that H. aristata (Retz.) Nees was the sister to Chikusichloa aquatica Koidz. Moreover, P. coarctata (Roxb.) Tateoka was in the genus Oryza. Furthermore, the evolutionary analysis, which was based on the ndhF marker, indicated that Oryzoideae might have originated 31 million years ago.
Duchêne, Sebastián; Geoghegan, Jemma L; Holmes, Edward C; Ho, Simon Y W
2016-11-15
In rapidly evolving pathogens, including viruses and some bacteria, genetic change can accumulate over short time-frames. Accordingly, their sampling times can be used to calibrate molecular clocks, allowing estimation of evolutionary rates. Methods for estimating rates from time-structured data vary in how they treat phylogenetic uncertainty and rate variation among lineages. We compiled 81 virus data sets and estimated nucleotide substitution rates using root-to-tip regression, least-squares dating and Bayesian inference. Although estimates from these three methods were often congruent, this largely relied on the choice of clock model. In particular, relaxed-clock models tended to produce higher rate estimates than methods that assume constant rates. Discrepancies in rate estimates were also associated with high among-lineage rate variation, and phylogenetic and temporal clustering. These results provide insights into the factors that affect the reliability of rate estimates from time-structured sequence data, emphasizing the importance of clock-model testing. Contact: sduchene@unimelb.edu.au or garzonsebastian@hotmail.com. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
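Of the three approaches compared in the abstract above, root-to-tip regression is the simplest to illustrate: root-to-tip distances are regressed on sampling times, the slope estimates the substitution rate, and the x-intercept estimates the age of the root. The following is a minimal ordinary-least-squares sketch under a strict-clock assumption; the function name and inputs are hypothetical and do not reproduce the study's pipeline.

```python
def root_to_tip_regression(sampling_times, root_to_tip_distances):
    """Ordinary least-squares fit of root-to-tip distance on sampling time.

    Returns (rate, intercept, root_time). The slope is the rate in
    substitutions/site per time unit; root_time is the x-intercept and is
    None when the slope is not positive (no usable temporal signal).
    """
    n = len(sampling_times)
    if n != len(root_to_tip_distances) or n < 2:
        raise ValueError("need two equally long lists with at least 2 tips")
    mean_t = sum(sampling_times) / n
    mean_d = sum(root_to_tip_distances) / n
    sxx = sum((t - mean_t) ** 2 for t in sampling_times)
    if sxx == 0:
        raise ValueError("all tips share one sampling time")
    sxy = sum(
        (t - mean_t) * (d - mean_d)
        for t, d in zip(sampling_times, root_to_tip_distances)
    )
    rate = sxy / sxx
    intercept = mean_d - rate * mean_t
    root_time = -intercept / rate if rate > 0 else None
    return rate, intercept, root_time
```

Because tips share branches, the points are not independent, so a fit of this kind is usually treated as a diagnostic of temporal signal rather than as a formal estimator.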
Chase, Mark W.; Kim, Joo-Hwan
2013-01-01
Phylogenetic analysis aims to produce a bifurcating tree, which disregards conflicting signals and displays only those that are present in a large proportion of the data. However, any character (or tree) conflict in a dataset allows the exploration of support for various evolutionary hypotheses. Although data-display network approaches exist, biologists cannot easily and routinely use them to compute rooted phylogenetic networks on real datasets containing hundreds of taxa. Here, we constructed an original neighbour-net for a large dataset of Asparagales to highlight the aspects of the resulting network that will be important for interpreting phylogeny. The analyses were largely conducted with new data collected for the same loci as in previous studies, but from different species accessions and greater sampling in many cases than in published analyses. The network tree summarised the majority data pattern in the characters of plastid sequences before tree building, which largely confirmed the currently recognised phylogenetic relationships. Most conflicting signals are at the base of each group along the Asparagales backbone, which helps us to establish the expectancy and advance our understanding of some difficult taxa relationships and their phylogeny. The network method should play a greater role in phylogenetic analyses than it has in the past. To advance the understanding of evolutionary history of the largest order of monocots Asparagales, absolute diversification times were estimated for family-level clades using relaxed molecular clock analyses. PMID:23544071
Tucker, Derek B; Hedges, Stephen Blair; Colli, Guarino R; Pyron, Robert Alexander; Sites, Jack W
2017-09-01
The phylogenetic relationships and biogeographic history of Caribbean island ameivas (Pholidoscelis) are not well-known because of incomplete sampling, conflicting datasets, and poor support for many clades. Here, we use phylogenomic and mitochondrial DNA datasets to reconstruct a well-supported phylogeny and assess historical colonization patterns in the group. We obtained sequence data from 316 nuclear loci and one mitochondrial marker for 16 of 19 extant species of the Caribbean endemic genus Pholidoscelis. Phylogenetic analyses were carried out using both concatenation and species tree approaches. To estimate divergence times, we used fossil teiids to calibrate a timetree which was used to elucidate the historical biogeography of these lizards. All phylogenetic analyses recovered four well-supported species groups (clades) recognized previously and supported novel relationships of those groups, including a (P. auberi + P. lineolatus) clade (western + central Caribbean), and a (P. exsul + P. plei) clade (eastern Caribbean). Divergence between Pholidoscelis and its sister clade was estimated to have occurred ~25 Ma, with subsequent diversification on Caribbean islands occurring over the last 11 Myr. Of the six models compared in the biogeographic analyses, the scenario which considered the distance among islands and allowed dispersal in all directions best fit the data. These reconstructions suggest that the ancestor of this group colonized either Hispaniola or Puerto Rico from Middle America. We provide a well-supported phylogeny of Pholidoscelis with novel relationships not reported in previous studies that were based on significantly smaller datasets. We propose that Pholidoscelis colonized the eastern Greater Antilles from Middle America based on our biogeographic analysis, phylogeny, and divergence time estimates. The closing of the Central American Seaway and subsequent formation of the modern Atlantic meridional overturning circulation may have promoted dispersal in this group.
Burbrink, Frank T.; Lorch, Jeffrey M.; Lips, Karen R.
2017-01-01
Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occurring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptibility is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen. PMID:29291245
Burbrink, Frank T; Lorch, Jeffrey M; Lips, Karen R
2017-12-01
Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occurring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptibility is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen.
Nonbinary Tree-Based Phylogenetic Networks.
Jetten, Laura; van Iersel, Leo
2018-01-01
Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.
Liu, Jun; Li, Qi; Kong, Lingfeng; Yu, Hong; Zheng, Xiaodong
2011-09-01
Oysters (family Ostreidae), with high levels of phenotypic plasticity and wide geographic distribution, are a challenging group for taxonomists and phylogeneticists. As a useful tool for molecular species identification, DNA barcoding might offer significant potential for oyster identification and taxonomy. This study used two mitochondrial fragments, cytochrome c oxidase I (COI) and the large ribosomal subunit (16S rDNA), to assess whether oyster species could be identified by phylogeny and distance-based DNA barcoding techniques. Relationships among species were estimated by the phylogenetic analyses of both genes, and then pairwise inter- and intraspecific genetic divergences were assessed. Species forming well-differentiated clades in the molecular phylogenies were identical for both genes even when the closely related species were included. Intraspecific variability of 16S rDNA overlapped with interspecific divergence. However, average intra- and interspecific genetic divergences for COI were 0-1.4% (maximum 2.2%) and 2.6-32.2% (minimum 2.2%), respectively, indicating the existence of a barcoding gap. These results confirm the efficacy of species identification in oysters via DNA barcodes and phylogenetic analysis. © 2011 Blackwell Publishing Ltd.
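The distance-based test described above compares the largest intraspecific divergence with the smallest interspecific divergence; a barcoding gap exists when the latter exceeds the former. The sketch below assumes pre-aligned sequences and uses the uncorrected p-distance for simplicity. The abstract does not state which distance model the study used, and the function names are illustrative.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') or 'N' are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = 0
    diffs = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        if x != y:
            diffs += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return diffs / compared


def barcoding_gap(records):
    """records: list of (species_label, aligned_sequence) pairs.

    Returns (max_intraspecific, min_interspecific, gap_exists).
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        d = p_distance(s1, s2)
        (intra if sp1 == sp2 else inter).append(d)
    if not intra or not inter:
        raise ValueError("need both within- and between-species pairs")
    max_intra = max(intra)
    min_inter = min(inter)
    return max_intra, min_inter, min_inter > max_intra
```

Comparing the extremes rather than the averages is the stricter check, since overlapping tails can hide behind well-separated means.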
Jensen, Anders; Scholz, Christian F P; Kilian, Mogens
2016-11-01
The Mitis group of the genus Streptococcus currently comprises 20 species with validly published names, including the pathogen S. pneumoniae. They have been the subject of much taxonomic confusion, due to phenotypic overlap and genetic heterogeneity, which has hampered a full appreciation of their clinical significance. The purpose of this study was to critically re-examine the taxonomy of the Mitis group using 195 publicly available genomes, including designated type strains for phylogenetic analyses based on core genomes, multilocus sequences and 16S rRNA gene sequences, combined with estimates of average nucleotide identity (ANI) and in silico and in vitro analyses of specific phenotypic characteristics. Our core genomic phylogenetic analyses revealed distinct clades that, to some extent, and from the clustering of type strains represent known species. However, many of the genomes have been incorrectly identified, adding to the current confusion. Furthermore, our data show that 16S rRNA gene sequences and ANI are unsuitable for identifying and circumscribing new species of the Mitis group of the genus Streptococcus. Based on the clustering patterns resulting from core genome phylogenetic analysis, we conclude that S. oligofermentans is a later synonym of S. cristatus. The recently described strains of the species Streptococcus dentisani include one previously referred to as 'S. mitis biovar 2'. Together with S. oralis, S. dentisani and S. tigurinus form subclusters within a coherent phylogenetic clade. We propose that the species S. oralis consists of three subspecies: S. oralis subsp. oralis subsp. nov., S. oralis subsp. tigurinus comb. nov., and S. oralis subsp. dentisani comb. nov.
Molecular phylogeny and evolutionary timescale for the family of mammalian herpesviruses.
McGeoch, D J; Cook, S; Dolan, A; Jamieson, F E; Telford, E A
1995-03-31
A detailed phylogenetic analysis for mammalian members of the family Herpesviridae, based on molecular sequences is reported. Sets of encoded amino acid sequences were collected for eight well conserved genes that are common to mammalian herpesviruses. Phylogenetic trees were inferred from alignments of these sequence sets using both maximum parsimony and distance methods, and evaluated by bootstrap analysis. In all cases the three recognised subfamilies (Alpha-, Beta- and Gammaherpesvirinae), and major sublineages in each subfamily, were clearly distinguished, but within sublineages some finer details of branching were incompletely resolved. Multiple-gene sets were assembled to give a broadly based tree. The root position of the tree was estimated by assuming a constant molecular clock and also by analysis of one herpesviral gene set (that encoding uracil-DNA glycosylase) using cellular homologues as outgroups. Both procedures placed the root between the Alphaherpesvirinae and the other two subfamilies. Substitution rates were calculated for the combined gene sets based on a previous estimate for alphaherpesviral UL27 genes, where the time base had been obtained according to the hypothesis of cospeciation of virus and host lineages. Assuming a constant molecular clock, it was then estimated that the three subfamilies arose approximately 180 to 220 million years ago, that major sublineages within subfamilies were probably generated before the mammalian radiation of 80 to 60 million years ago, and that speciations within sublineages took place in the last 80 million years, probably with a major component of cospeciation with host lineages.
Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses
Lanfear, Robert; Hua, Xia; Warren, Dan L.
2016-01-01
Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC because sequential samples from the chain can be non-independent due to autocorrelation. Typically, phylogeneticists use a rule of thumb that the ESS of all parameters should be greater than 200. However, we have no method to calculate an ESS of tree topology samples, despite the fact that the tree topology is often the parameter of primary interest and is almost always central to the estimation of other parameters. That is, we lack a method to determine whether we have adequately sampled one of the most important parameters in our analyses. In this study, we address this problem by developing methods to estimate the ESS for tree topologies. We combine these methods with two new diagnostic plots for assessing posterior samples of tree topologies, and compare their performance on simulated and empirical data sets. Combined, the methods we present provide new ways to assess the mixing and convergence of phylogenetic tree topologies in Bayesian MCMC analyses. PMID:27435794
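For a scalar parameter, the ESS described above is commonly computed from the autocorrelation of the chain as N / (1 + 2 * sum of autocorrelations). The sketch below illustrates that scalar case only; it does not implement the topology methods developed in the study. Truncating the sum at the first non-positive autocorrelation is a simplifying assumption made here, and other tools use different cut-off rules.

```python
def effective_sample_size(samples):
    """Autocorrelation-based ESS for a scalar MCMC trace.

    Uses ESS = N / (1 + 2 * sum_k rho_k), summing lag-k autocorrelations
    until the first non-positive value (an assumed truncation rule).
    Runs in O(N^2) in the worst case, which is fine for a sketch.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean = sum(samples) / n
    centered = [x - mean for x in samples]
    variance = sum(c * c for c in centered) / n
    if variance == 0:
        raise ValueError("trace is constant")
    rho_sum = 0.0
    for lag in range(1, n):
        autocov = sum(centered[i] * centered[i + lag] for i in range(n - lag)) / n
        rho = autocov / variance
        if rho <= 0:
            break
        rho_sum += rho
    return n / (1.0 + 2.0 * rho_sum)
```

A trace that sits in one state for half the run and in another for the rest yields an ESS of only a few samples, however many draws were recorded, which is the situation the rule of thumb of 200 is meant to catch.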
Liu, Kevin; Warnow, Tandy J; Holder, Mark T; Nelesen, Serita M; Yu, Jiaye; Stamatakis, Alexandros P; Linder, C Randal
2012-01-01
Highly accurate estimation of phylogenetic trees for large data sets is difficult, in part because multiple sequence alignments must be accurate for phylogeny estimation methods to be accurate. Coestimation of alignments and trees has been attempted but currently only SATé estimates reasonably accurate trees and alignments for large data sets in practical time frames (Liu K., Raghavan S., Nelesen S., Linder C.R., Warnow T. 2009b. Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees. Science. 324:1561-1564). Here, we present a modification to the original SATé algorithm that improves upon SATé (which we now call SATé-I) in terms of speed and of phylogenetic and alignment accuracy. SATé-II uses a different divide-and-conquer strategy than SATé-I and so produces smaller more closely related subsets than SATé-I; as a result, SATé-II produces more accurate alignments and trees, can analyze larger data sets, and runs more efficiently than SATé-I. Generally, SATé is a metamethod that takes an existing multiple sequence alignment method as an input parameter and boosts the quality of that alignment method. SATé-II-boosted alignment methods are significantly more accurate than their unboosted versions, and trees based upon these improved alignments are more accurate than trees based upon the original alignments. Because SATé-I used maximum likelihood (ML) methods that treat gaps as missing data to estimate trees and because we found a correlation between the quality of tree/alignment pairs and ML scores, we explored the degree to which SATé's performance depends on using ML with gaps treated as missing data to determine the best tree/alignment pair. We present two lines of evidence that using ML with gaps treated as missing data to optimize the alignment and tree produces very poor results. 
First, we show that the optimization problem where a set of unaligned DNA sequences is given and the output is the tree and alignment of those sequences that maximize likelihood under the Jukes-Cantor model is uninformative in the worst possible sense. For all inputs, all trees optimize the likelihood score. Second, we show that a greedy heuristic that uses GTR+Gamma ML to optimize the alignment and the tree can produce very poor alignments and trees. Therefore, the excellent performance of SATé-II and SATé-I is not because ML is used as an optimization criterion for choosing the best tree/alignment pair but rather due to the particular divide-and-conquer realignment techniques employed.
The evolutionary rate dynamically tracks changes in HIV-1 epidemics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maljkovic-berry, Irina; Athreya, Gayathri; Daniels, Marcus
Large-sequence datasets provide an opportunity to investigate the dynamics of pathogen epidemics. Thus, a fast method to estimate the evolutionary rate from large and numerous phylogenetic trees becomes necessary. Based on minimizing tip height variances, we optimize the root in a given phylogenetic tree to estimate the most homogenous evolutionary rate between samples from at least two different time points. Simulations showed that the method had no bias in the estimation of evolutionary rates and that it was robust to tree rooting and topological errors. We show that the evolutionary rates of HIV-1 subtype B and C epidemics have changed over time, with the rate of evolution inversely correlated to the rate of virus spread. For subtype B, the evolutionary rate slowed down and tracked the start of the HAART era in 1996. Subtype C in Ethiopia showed an increase in the evolutionary rate when the prevalence increase markedly slowed down in 1995. Thus, we show that the evolutionary rate of HIV-1 on the population level dynamically tracks epidemic events.
Naumann, Julia; Salomo, Karsten; Der, Joshua P.; Wafula, Eric K.; Bolin, Jay F.; Maass, Erika; Frenzke, Lena; Samain, Marie-Stéphanie; Neinhuis, Christoph
2013-01-01
Extreme haustorial parasites have long captured the interest of naturalists and scientists with their greatly reduced and highly specialized morphology. Along with the reduction or loss of photosynthesis, the plastid genome often decays as photosynthetic genes are released from selective constraint. This makes it challenging to use traditional plastid genes for parasitic plant phylogenetics, and has driven the search for alternative phylogenetic and molecular evolutionary markers. Thus, evolutionary studies, such as molecular clock-based age estimates, are not yet available for all parasitic lineages. In the present study, we extracted 14 nuclear single copy genes (nSCG) from Illumina transcriptome data from one of the “strangest plants in the world”, Hydnora visseri (Hydnoraceae). A ∼15,000 character molecular dataset, based on all three genomic compartments, shows the utility of nSCG for reconstructing phylogenetic relationships in parasitic lineages. A relaxed molecular clock approach with the same multi-locus dataset, revealed an ancient age of ∼91 MYA for Hydnoraceae. We then estimated the stem ages of all independently originated parasitic angiosperm lineages using a published dataset, which also revealed a Cretaceous origin for Balanophoraceae, Cynomoriaceae and Apodanthaceae. With the exception of Santalales, older parasite lineages tend to be more specialized with respect to trophic level and have lower species diversity. We thus propose the “temporal specialization hypothesis” (TSH) implementing multiple independent specialization processes over time during parasitic angiosperm evolution. PMID:24265760
Effective Online Bayesian Phylogenetics via Sequential Monte Carlo with Guided Proposals
Fourment, Mathieu; Claywell, Brian C; Dinh, Vu; McCoy, Connor; Matsen IV, Frederick A; Darling, Aaron E
2018-01-01
Modern infectious disease outbreak surveillance produces continuous streams of sequence data which require phylogenetic analysis as data arrives. Current software packages for Bayesian phylogenetic inference are unable to quickly incorporate new sequences as they become available, making them less useful for dynamically unfolding evolutionary stories. This limitation can be addressed by applying a class of Bayesian statistical inference algorithms called sequential Monte Carlo (SMC) to conduct online inference, wherein new data can be continuously incorporated to update the estimate of the posterior probability distribution. In this article, we describe and evaluate several different online phylogenetic sequential Monte Carlo (OPSMC) algorithms. We show that proposing new phylogenies with a density similar to the Bayesian prior suffers from poor performance, and we develop “guided” proposals that better match the proposal density to the posterior. Furthermore, we show that the simplest guided proposals can exhibit pathological behavior in some situations, leading to poor results, and that the situation can be resolved by heating the proposal density. The results demonstrate that relative to the widely used MCMC-based algorithm implemented in MrBayes, the total time required to compute a series of phylogenetic posteriors as sequences arrive can be significantly reduced by the use of OPSMC, without incurring a significant loss in accuracy. PMID:29186587
Phylogeny and species traits predict bird detectability
Solymos, Peter; Matsuoka, Steven M.; Stralberg, Diana; Barker, Nicole K. S.; Bayne, Erin M.
2018-01-01
Avian acoustic communication has resulted from evolutionary pressures and ecological constraints. We therefore expect that auditory detectability in birds might be predictable by species traits and phylogenetic relatedness. We evaluated the relationship between phylogeny, species traits, and field‐based estimates of the two processes that determine species detectability (singing rate and detection distance) for 141 bird species breeding in boreal North America. We used phylogenetic mixed models and cross‐validation to compare the relative merits of using trait data only, phylogeny only, or the combination of both to predict detectability. We found a strong phylogenetic signal in both singing rates and detection distances; however the strength of phylogenetic effects was less than expected under Brownian motion evolution. The evolution of behavioural traits that determine singing rates was found to be more labile, leaving more room for species to evolve independently, whereas detection distance was mostly determined by anatomy (i.e. body size) and thus the laws of physics. Our findings can help in disentangling how complex ecological and evolutionary mechanisms have shaped different aspects of detectability in boreal birds. Such information can greatly inform single‐ and multi‐species models but more work is required to better understand how to best correct possible biases in phylogenetic diversity and other community metrics.
Phylogenetic lineages in the Botryosphaeriales: a systematic and evolutionary framework
Slippers, B.; Boissin, E.; Phillips, A.J.L.; Groenewald, J.Z.; Lombard, L.; Wingfield, M.J.; Postma, A.; Burgess, T.; Crous, P.W.
2013-01-01
The order Botryosphaeriales represents several ecologically diverse fungal families that are commonly isolated as endophytes or pathogens from various woody hosts. The taxonomy of members of this order has been strongly influenced by sequence-based phylogenetics, and the abandonment of dual nomenclature. In this study, the phylogenetic relationships of the genera known from culture are evaluated based on DNA sequence data for six loci (SSU, LSU, ITS, EF1, BT, mtSSU). The results make it possible to recognise a total of six families. Other than the Botryosphaeriaceae (17 genera), Phyllostictaceae (Phyllosticta) and Planistromellaceae (Kellermania), newly introduced families include Aplosporellaceae (Aplosporella and Bagnisiella), Melanopsaceae (Melanops), and Saccharataceae (Saccharata). Furthermore, the evolution of morphological characters in the Botryosphaeriaceae was investigated via analysis of phylogeny-trait association. None of the traits presented a significant phylogenetic signal, suggesting that conidial and ascospore pigmentation, septation and appendages evolved more than once in the family. Molecular clock dating on radiations within the Botryosphaeriales, based on estimated mutation rates of the rDNA SSU locus, suggests that the order originated in the Cretaceous period around 103 (45-188) mya, with most of the diversification in the Tertiary period. This coincides with important periods of radiation and spread of the main group of plants that these fungi infect, namely woody Angiosperms. The resulting host-associations and distribution could have influenced the diversification of these fungi. Taxonomic novelties: New families - Aplosporellaceae Slippers, Boissin & Crous, Melanopsaceae Phillips, Slippers, Boissin & Crous, Saccharataceae Slippers, Boissin & Crous. PMID:24302789
Escobedo, Víctor M.; Rios, Rodrigo S.; Salgado-Luarte, Cristian; Stotz, Gisela C.
2017-01-01
Abstract Background and Aims Disturbance often drives plant invasion and may modify community assembly. However, little is known about how these modifications of community patterns occur in terms of taxonomic, functional and phylogenetic structure. This study evaluated in an arid shrubland the influence of disturbance by an endemic rodent on community functional divergence and phylogenetic structure as well as on plant invasion. It was expected that disturbance would operate as a habitat filter favouring exotic species with short life cycles. Methods Sixteen plots were sampled along a disturbance gradient caused by the endemic fossorial rodent Spalacopus cyanus, measuring community parameters and estimating functional divergence for life history traits (functional dispersion index) and the relative contribution to functional divergence of exotic and native species. The phylogenetic signal (Pagel’s lambda) and phylogenetic community structure (mean phylogenetic distance and mean nearest taxon phylogenetic distance) were also estimated. The use of a continuous approach to the disturbance gradient allowed the identification of non-linear relationships between disturbance and community parameters. Key Results The relationship between disturbance and both species richness and abundance was positive for exotic species and negative for native species. Disturbance modified community composition, and exotic species were associated with more disturbed sites. Disturbance increased trait convergence, which resulted in phylogenetic clustering because traits showed a significant phylogenetic signal. The relative contribution of exotic species to functional divergence increased, while that of natives decreased, with disturbance. Exotic and native species were not phylogenetically distinct. 
Conclusions Disturbance by rodents in this arid shrubland constitutes a habitat filter over phylogeny-dependent life history traits, leading to phylogenetic clustering, and drives invasion by favouring species with short life cycles. Results can be explained by high phenotypic and phylogenetic resemblance between exotic and native species. The use of continuous gradients when studying the effects of disturbance on community assembly is advocated. PMID:28087661
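The two community-structure metrics named in the Methods above, mean phylogenetic distance (MPD) and mean nearest taxon distance (MNTD), can be illustrated with a minimal sketch. The distance table and species names below are hypothetical and are not data from the study; the sketch assumes an unweighted (presence/absence) community and leaves out the null-model standardisation that such analyses normally add.

```python
from itertools import combinations

def mean_pairwise_distance(dist, taxa):
    """MPD: average phylogenetic distance over all pairs of co-occurring taxa."""
    pairs = list(combinations(taxa, 2))
    return sum(dist[a][b] for a, b in pairs) / len(pairs)

def mean_nearest_taxon_distance(dist, taxa):
    """MNTD: average distance from each taxon to its closest co-occurring relative."""
    nearest = [min(dist[a][b] for b in taxa if b != a) for a in taxa]
    return sum(nearest) / len(nearest)

# Hypothetical patristic distances among four species (illustrative only).
dist = {
    "sp1": {"sp1": 0, "sp2": 2, "sp3": 10, "sp4": 10},
    "sp2": {"sp1": 2, "sp2": 0, "sp3": 10, "sp4": 10},
    "sp3": {"sp1": 10, "sp2": 10, "sp3": 0, "sp4": 4},
    "sp4": {"sp1": 10, "sp2": 10, "sp3": 4, "sp4": 0},
}

clustered_plot = ["sp1", "sp2"]   # close relatives only
dispersed_plot = ["sp1", "sp3"]   # distant relatives only

mpd_clustered = mean_pairwise_distance(dist, clustered_plot)
mpd_dispersed = mean_pairwise_distance(dist, dispersed_plot)
mntd_all = mean_nearest_taxon_distance(dist, list(dist))
```

A plot holding only close relatives yields a lower MPD than one holding distant relatives, which is the pattern described above as phylogenetic clustering.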
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction.
Mai, Uyen; Sayyari, Erfan; Mirarab, Siavash
2017-01-01
Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods. PMID:28800608
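The rooting criterion described above, choosing the root position that minimizes the variance of root-to-tip distances, can be sketched by brute force. This is not the authors' linear-time algorithm: it re-traverses the tree for every edge (quadratic time) and solves the one-dimensional quadratic for the best point on each edge. The tree, node names and function names are hypothetical.

```python
def tip_distances(tree, start, blocked):
    """Distances from `start` to every leaf reachable without passing through `blocked`."""
    out, stack = {}, [(start, blocked, 0.0)]
    while stack:
        node, parent, d = stack.pop()
        children = [(n, w) for n, w in tree[node].items() if n != parent]
        if not children:          # a leaf (or `start` itself when it is a tip)
            out[node] = d
        for n, w in children:
            stack.append((n, node, d + w))
    return out

def min_variance_root(tree):
    """Return (variance, u, v, x): root lies on edge u-v at distance x from u."""
    best = None
    for u in tree:
        for v, length in tree[u].items():
            if u >= v:            # visit each undirected edge once
                continue
            # Tips on u's side are at c + x; tips on v's side are at c - x.
            c = list(tip_distances(tree, u, v).values())
            s = [1.0] * len(c)
            far = [d + length for d in tip_distances(tree, v, u).values()]
            c += far
            s += [-1.0] * len(far)
            n = len(c)
            c_bar, s_bar = sum(c) / n, sum(s) / n
            denom = sum((si - s_bar) ** 2 for si in s)
            x = -sum((ci - c_bar) * (si - s_bar) for ci, si in zip(c, s)) / denom
            x = min(max(x, 0.0), length)      # keep the root on this edge
            d = [ci + si * x for ci, si in zip(c, s)]
            mean = sum(d) / n
            var = sum((di - mean) ** 2 for di in d) / n
            if best is None or var < best[0]:
                best = (var, u, v, x)
    return best

# Hypothetical unrooted tree: tips A-D, internal nodes X and Y.
tree = {
    "A": {"X": 1.0}, "B": {"X": 1.0},
    "C": {"Y": 2.0}, "D": {"Y": 2.0},
    "X": {"A": 1.0, "B": 1.0, "Y": 3.0},
    "Y": {"C": 2.0, "D": 2.0, "X": 3.0},
}
variance, u, v, x = min_variance_root(tree)
```

In this clock-like toy tree the root falls on the internal edge X-Y, two units from X, where all four tips are equally distant from the root.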
Roos, Jonas; Aggarwal, Ramesh K; Janke, Axel
2007-11-01
The mitochondrial genomes of the dwarf crocodile, Osteolaemus tetraspis, and two species of dwarf caimans, the smooth-fronted caiman, Paleosuchus trigonatus, and Cuvier's dwarf caiman, Paleosuchus palpebrosus, were sequenced and included in a mitogenomic phylogenetic study. The phylogenetic analyses, which included a total of ten crocodylian species, yielded strong support to a basal split between Crocodylidae and Alligatoridae. Osteolaemus fell within the Crocodylidae as the sister group to Crocodylus. Gavialis and Tomistoma, which joined on a common branch, constituted a sister group to Crocodylus/Osteolaemus. This suggests that extant crocodylians are organized in two families: Alligatoridae and Crocodylidae. Within the Alligatoridae there was a basal split between Alligator and a branch that contained Paleosuchus and Caiman. The analyses also provided molecular estimates of various divergences applying recently established crocodylian and outgroup fossil calibration points. Molecular estimates based on amino acid data placed the divergence between Crocodylidae and Alligatoridae at 97-103 million years ago and that between Alligator and Caiman/Paleosuchus at 65-72 million years ago. Other crocodilian divergences were placed after the Cretaceous-Tertiary boundary. Thus, according to the molecular estimates, three extant crocodylian lineages have their roots in the Cretaceous. Considering the crocodylian diversification in the Cretaceous the molecular datings suggest that the extinction of the dinosaurs was also to some extent paralleled in the crocodylian evolution. However, for whatever reason, some crocodylian lineages survived into the Tertiary.
Phylogenetic patterns of climatic, habitat and trophic niches in a European avian assemblage
Pearman, Peter B; Lavergne, Sébastien; Roquet, Cristina; Wüest, Rafael; Zimmermann, Niklaus E; Thuiller, Wilfried
2014-01-01
Aim The origins of ecological diversity in continental species assemblages have long intrigued biogeographers. We apply phylogenetic comparative analyses to disentangle the evolutionary patterns of ecological niches in an assemblage of European birds. We compare phylogenetic patterns in trophic, habitat and climatic niche components. Location Europe. Methods From polygon range maps and handbook data we inferred the realized climatic, habitat and trophic niches of 405 species of breeding birds in Europe. We fitted Pagel's lambda and kappa statistics, and conducted analyses of disparity through time to compare temporal patterns of ecological diversification on all niche axes together. All observed patterns were compared with expectations based on neutral (Brownian) models of niche divergence. Results In this assemblage, patterns of phylogenetic signal (lambda) suggest that related species resemble each other less in regard to their climatic and habitat niches than they do in their trophic niche. Kappa estimates show that ecological divergence does not gradually increase with divergence time, and that this punctualism is stronger in climatic niches than in habitat and trophic niches. Observed niche disparity markedly exceeds levels expected from a Brownian model of ecological diversification, thus providing no evidence for past phylogenetic niche conservatism in these multivariate niches. Levels of multivariate disparity are greatest for the climatic niche, followed by disparity of the habitat and the trophic niches. Main conclusions Phylogenetic patterns in the three niche components differ within this avian assemblage. Variation in evolutionary rates (degree of gradualism, constancy through the tree) and/or non-random macroecological sampling probably lead here to differences in the phylogenetic structure of niche components. 
Testing hypotheses on the origin of these patterns requires more complete phylogenetic trees of the birds, and extended ecological data on different niche components for all bird species. PMID:24790525
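For readers unfamiliar with the lambda statistic used above, the following sketch shows only the transformation itself: lambda rescales the off-diagonal entries of the phylogenetic variance-covariance matrix (the shared branch lengths) and leaves the diagonal unchanged. The matrix is hypothetical, and the likelihood fit that the study would have used to estimate lambda is omitted.

```python
def pagel_lambda_transform(vcv, lam):
    """Scale shared-history covariances by lambda; keep tip variances as they are."""
    n = len(vcv)
    return [[vcv[i][j] if i == j else lam * vcv[i][j] for j in range(n)]
            for i in range(n)]

# Hypothetical covariance matrix for three taxa under Brownian motion:
# diagonal = root-to-tip path length, off-diagonal = shared path length.
vcv = [
    [3.0, 2.0, 0.0],
    [2.0, 3.0, 0.0],
    [0.0, 0.0, 3.0],
]

brownian = pagel_lambda_transform(vcv, 1.0)   # unchanged: full phylogenetic signal
star = pagel_lambda_transform(vcv, 0.0)       # diagonal only: no phylogenetic signal
partial = pagel_lambda_transform(vcv, 0.5)    # intermediate signal
```

Values of lambda near 1 correspond to trait covariance matching Brownian expectations, and values near 0 to traits that are independent of the phylogeny.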
Stratification of co-evolving genomic groups using ranked phylogenetic profiles
Freilich, Shiri; Goldovsky, Leon; Gottlieb, Assaf; Blanc, Eric; Tsoka, Sophia; Ouzounis, Christos A
2009-01-01
Background Previous methods of detecting the taxonomic origins of arbitrary sequence collections, with a significant impact to genome analysis and in particular metagenomics, have primarily focused on compositional features of genomes. The evolutionary patterns of phylogenetic distribution of genes or proteins, represented by phylogenetic profiles, provide an alternative approach for the detection of taxonomic origins, but typically suffer from low accuracy. Herein, we present rank-BLAST, a novel approach for the assignment of protein sequences into genomic groups of the same taxonomic origin, based on the ranking order of phylogenetic profiles of target genes or proteins across the reference database. Results The rank-BLAST approach is validated by computing the phylogenetic profiles of all sequences for five distinct microbial species of varying degrees of phylogenetic proximity, against a reference database of 243 fully sequenced genomes. The approach - a combination of sequence searches, statistical estimation and clustering - analyses the degree of sequence divergence between sets of protein sequences and allows the classification of protein sequences according to the species of origin with high accuracy, allowing taxonomic classification of 64% of the proteins studied. In most cases, a main cluster is detected, representing the corresponding species. Secondary, functionally distinct and species-specific clusters exhibit different patterns of phylogenetic distribution, thus flagging gene groups of interest. Detailed analyses of such cases are provided as examples. Conclusion Our results indicate that the rank-BLAST approach can capture the taxonomic origins of sequence collections in an accurate and efficient manner. The approach can be useful both for the analysis of genome evolution and the detection of species groups in metagenomics samples. PMID:19860884
Daniel L. Lindner; Mark T. Banik
2011-01-01
Regions of rDNA are commonly used to infer phylogenetic relationships among fungal species and as DNA barcodes for identification. These regions occur in large tandem arrays, and concerted evolution is believed to reduce intragenomic variation among copies within these arrays, although some variation still might exist. Phylogenetic studies typically use consensus...
Diversification of Rosaceae since the Late Cretaceous based on plastid phylogenomics.
Zhang, Shu-Dong; Jin, Jian-Jun; Chen, Si-Yun; Chase, Mark W; Soltis, Douglas E; Li, Hong-Tao; Yang, Jun-Bo; Li, De-Zhu; Yi, Ting-Shuang
2017-05-01
Phylogenetic relationships in Rosaceae have long been problematic because of frequent hybridisation, apomixis and presumed rapid radiation, and their historical diversification has not been clarified. With 87 genera representing all subfamilies and tribes of Rosaceae and six of the other eight families of Rosales (outgroups), we analysed 130 newly sequenced plastomes together with 12 from GenBank in an attempt to reconstruct deep relationships and reveal temporal diversification of this family. Our results highlight the importance of improving sequence alignment and the use of appropriate substitution models in plastid phylogenomics. Three subfamilies and 16 tribes (as previously delimited) were strongly supported as monophyletic, and their relationships were fully resolved and strongly supported at most nodes. Rosaceae were estimated to have originated during the Late Cretaceous with evidence for rapid diversification events during several geological periods. The major lineages rapidly diversified in warm and wet habitats during the Late Cretaceous, and the rapid diversification of genera from the early Oligocene onwards occurred in colder and drier environments. Plastid phylogenomics offers new and important insights into deep phylogenetic relationships and the diversification history of Rosaceae. The robust phylogenetic backbone and time estimates we provide establish a framework for future comparative studies on rosaceous evolution. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
The performance of the Congruence Among Distance Matrices (CADM) test in phylogenetic analysis
2011-01-01
Background CADM is a statistical test used to estimate the level of Congruence Among Distance Matrices. It has been shown in previous studies to have a correct rate of type I error and good power when applied to dissimilarity matrices and to ultrametric distance matrices. Contrary to most other tests of incongruence used in phylogenetic analysis, the null hypothesis of the CADM test assumes complete incongruence of the phylogenetic trees instead of congruence. In this study, we performed computer simulations to assess the type I error rate and power of the test. It was applied to additive distance matrices representing phylogenies and to genetic distance matrices obtained from nucleotide sequences of different lengths that were simulated on randomly generated trees of varying sizes, and under different evolutionary conditions. Results Our results showed that the test has an accurate type I error rate and good power. As expected, power increased with the number of objects (i.e., taxa), the number of partially or completely congruent matrices and the level of congruence among distance matrices. Conclusions Based on our results, we suggest that CADM is an excellent candidate to test for congruence and, when present, to estimate its level in phylogenomic studies where numerous genes are analysed simultaneously. PMID:21388552
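The abstract does not state the test statistic. CADM is usually described as resting on Kendall's coefficient of concordance (W) computed across the unfolded distance matrices, and the sketch below takes that as its assumption. It computes W only, without a tie correction, and omits the permutation step that turns W into a test. The matrices are hypothetical.

```python
def upper_triangle(m):
    """Unfold the strict upper triangle of a square distance matrix into a list."""
    n = len(m)
    return [m[i][j] for i in range(n) for j in range(i + 1, n)]

def ranks(values):
    """1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            r[order[k]] = avg
        i = j + 1
    return r

def kendall_w(matrices):
    """Kendall's W among distance matrices (no tie correction): 1 = full concordance."""
    ranked = [ranks(upper_triangle(m)) for m in matrices]
    p, n = len(ranked), len(ranked[0])
    totals = [sum(r[k] for r in ranked) for k in range(n)]
    mean_total = sum(totals) / n
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (p ** 2 * (n ** 3 - n))

# Hypothetical distance matrices for three taxa from two genes.
gene1 = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
gene2 = [[0, 2, 4], [2, 0, 6], [4, 6, 0]]      # same ordering of distances
gene3 = [[0, 3, 2], [3, 0, 1], [2, 1, 0]]      # reversed ordering

w_congruent = kendall_w([gene1, gene2])
w_incongruent = kendall_w([gene1, gene3])
```

Under the null hypothesis of complete incongruence described above, a permutation test would compare the observed W with values obtained after randomly permuting the taxa of the matrices.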
Phylogenetic incongruence in the Drosophila melanogaster species group
Wong, Alex; Jensen, Jeffrey D.; Pool, John E.; Aquadro, Charles F.
2007-01-01
Drosophila melanogaster and its close relatives are used extensively in comparative biology. Despite the importance of phylogenetic information for such studies, relationships between some melanogaster species group members are unclear due to conflicting phylogenetic signals at different loci. In this study, we use twelve nuclear loci (eleven coding and one non-coding) to assess the degree of phylogenetic incongruence in this model system. We focus on two nodes: (1) The node joining the D. erecta-D. orena, D. melanogaster-D. simulans, and D. yakuba-D. teissieri lineages, and (2) The node joining the lineages leading to the melanogaster, takahashii, and eugracilis subgroups. We find limited evidence for incongruence at the first node; our data, as well as those of several previous studies, strongly support monophyly of a clade consisting of D. erecta-D. orena and D. yakuba-D. teissieri. By contrast, using likelihood based tests of congruence, we find robust evidence for topological incongruence at the second node. Different loci support different relationships among the melanogaster, takahashii and eugracilis subgroups, and the observed incongruence is not easily attributable to homoplasy, non-equilibrium base composition, or positive selection on a subset of loci. We argue that lineage sorting in the common ancestor of these three subgroups is the most plausible explanation for our observations. Such lineage sorting may lead to biased estimation of tree topology and evolutionary rates, and may confound inferences of positive selection. PMID:17071113
Phylogenetic Analysis and Epidemic History of Hepatitis C Virus Genotype 2 in Tunisia, North Africa
Rajhi, Mouna; Ghedira, Kais; Chouikha, Anissa; Djebbi, Ahlem; Cheikh, Imed; Ben Yahia, Ahlem; Sadraoui, Amel; Hammami, Walid; Azouz, Msaddek; Ben Mami, Nabil; Triki, Henda
2016-01-01
HCV genotype 2 (HCV-2) has a worldwide distribution with prevalence rates that vary from country to country. High genetic diversity and long-term endemicity were suggested in West African countries. A global dispersal of HCV-2 would have occurred during the 20th century, especially in European countries. In Tunisia, genotype 2 was the second prevalent genotype after genotype 1 and most isolates belong to subtypes 2c and 2k. In this study, phylogenetic analyses based on the NS5B genomic sequences of 113 Tunisian HCV isolates from subtypes 2c and 2k were carried out. A Bayesian coalescent-based framework was used to estimate the origin and the spread of these subtypes circulating in Tunisia. Phylogenetic analyses of HCV-2c sequences suggest the absence of country-specific or time-specific variants. In contrast, the phylogenetic grouping of HCV-2k sequences shows the existence of two major genetic clusters that may represent two distinct circulating variants. Coalescent analysis indicated a most recent common ancestor (tMRCA) of Tunisian HCV-2c around 1886 (1869–1902) before the introduction of HCV-2k in 1901 (1867–1931). Our findings suggest that the introduction of HCV-2c in Tunisia is possibly a result of population movements between Tunisia and European population following the French colonization. PMID:27100294
Nasr Esfahani, Bahram; Moghim, Sharareh; Ghasemian Safaei, Hajieh; Moghoofei, Mohsen; Sedighi, Mansour; Hadifar, Shima
2016-01-01
Background Taxonomic and phylogenetic studies of Mycobacterium species have been based around the 16S rRNA gene for many years. However, due to the high strain similarity between species in the Mycobacterium genus (94.3% - 100%), defining a valid phylogenetic tree is difficult; consequently, its use in estimating the boundaries between species is limited. The sequence of the rpoB gene makes it an appropriate gene for phylogenetic analysis, especially in bacteria with limited variation. Objectives In the present study, a 360 bp sequence of rpoB was used for precise classification of Mycobacterium strains isolated in Isfahan, Iran. Materials and Methods From February to October 2013, 57 clinical and environmental isolates were collected, subcultured, and identified by phenotypic methods. After DNA extraction, a 360 bp fragment was PCR-amplified and sequenced. The phylogenetic tree was constructed based on consensus sequence data, using MEGA5 software. Results Slow and fast-growing groups of the Mycobacterium strains were clearly differentiated based on the constructed tree of 56 common Mycobacterium isolates. Each species with a unique title in the tree was identified; in total, 13 nodes with a bootstrap value of over 50% were supported. Among the slow-growing group was Mycobacterium kansasii, with M. tuberculosis in a cluster with a bootstrap value of 98% and M. gordonae in another cluster with a bootstrap value of 90%. In the fast-growing group, one cluster with a bootstrap value of 89% was defined, including all fast-growing members present in this study. Conclusions The results suggest that the rpoB gene sequence alone is sufficient for taxonomic categorization and definition of a new Mycobacterium species, due to its high resolution power and proper variation in its sequence (85% - 100%); the resulting tree has high validity. PMID:27284397
Variance to mean ratio, R(t), for poisson processes on phylogenetic trees.
Goldman, N
1994-09-01
The ratio of expected variance to mean, R(t), of numbers of DNA base substitutions for contemporary sequences related by a "star" phylogeny is widely seen as a measure of the adherence of the sequences' evolution to a Poisson process with a molecular clock, as predicted by the "neutral theory" of molecular evolution under certain conditions. A number of estimators of R(t) have been proposed, all predicted to have mean 1 and distributions based on the χ² distribution. Various genes have previously been analyzed and found to have values of R(t) far in excess of 1, calling into question important aspects of the neutral theory. In this paper, I use Monte Carlo simulation to show that the previously suggested means and distributions of estimators of R(t) are highly inaccurate. The analysis is applied to star phylogenies and to general phylogenetic trees, and well-known gene sequences are reanalyzed. For star phylogenies the results show that Kimura's estimators ("The Neutral Theory of Molecular Evolution," Cambridge Univ. Press, Cambridge, 1983) are unsatisfactory for statistical testing of R(t), but confirm the accuracy of Bulmer's correction factor (Genetics 123: 615-619, 1989). For all three nonstar phylogenies studied, attained values of all three estimators of R(t), although larger than 1, are within their true confidence limits under simple Poisson process models. This shows that lineage effects can be responsible for high estimates of R(t), restoring some limited confidence in the molecular clock and showing that the distinction between lineage and molecular clock effects is vital. (ABSTRACT TRUNCATED AT 250 WORDS)
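As a rough illustration of the quantity discussed above, the sketch below computes a plain sample variance-to-mean ratio of substitution counts and simulates counts on a star phylogeny under a Poisson process. It is the naive index of dispersion, not Kimura's estimators or Bulmer's corrected version, and the rate, lineage count and seed are arbitrary assumptions.

```python
import math
import random

def dispersion_index(counts):
    """Sample variance divided by sample mean of per-lineage substitution counts."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / (n - 1)
    return var / mean

def poisson(rng, lam):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1

def simulate_star(rng, lineages, expected_substitutions):
    """Substitution counts on a star phylogeny with equal branch lengths (strict clock)."""
    return [poisson(rng, expected_substitutions) for _ in range(lineages)]

# Deterministic toy counts: mean 4, sample variance 4, so the ratio is exactly 1.
r_toy = dispersion_index([2, 4, 6])

# One simulated replicate; with few lineages the ratio scatters widely around 1.
rng = random.Random(1)
r_sim = dispersion_index(simulate_star(rng, lineages=6, expected_substitutions=20.0))
```

Repeating the simulation many times gives an empirical null distribution for the ratio, which is the Monte Carlo idea the abstract relies on.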
Lambert, Amaury; Alexander, Helen K; Stadler, Tanja
2014-07-07
The reconstruction of phylogenetic trees based on viral genetic sequence data sequentially sampled from an epidemic provides estimates of the past transmission dynamics, by fitting epidemiological models to these trees. To our knowledge, none of the epidemiological models currently used in phylogenetics can account for recovery rates and sampling rates dependent on the time elapsed since transmission, i.e. age of infection. Here we introduce an epidemiological model where infectives leave the epidemic, by either recovery or sampling, after some random time which may follow an arbitrary distribution. We derive an expression for the likelihood of the phylogenetic tree of sampled infectives under our general epidemiological model. The analytic concept developed in this paper will facilitate inference of past epidemiological dynamics and provide an analytical framework for performing very efficient simulations of phylogenetic trees under our model. The main idea of our analytic study is that the non-Markovian epidemiological model giving rise to phylogenetic trees growing vertically as time goes by can be represented by a Markovian "coalescent point process" growing horizontally by the sequential addition of pairs of coalescence and sampling times. As examples, we discuss two special cases of our general model, described in terms of influenza and HIV epidemics. Though phrased in epidemiological terms, our framework can also be used for instance to fit macroevolutionary models to phylogenies of extant and extinct species, accounting for general species lifetime distributions. Copyright © 2014 Elsevier Ltd. All rights reserved.
Tosh, J.; Dessein, S.; Buerki, S.; Groeninckx, I.; Mouly, A.; Bremer, B.; Smets, E. F.; De Block, P.
2013-01-01
Background and Aims Previous work on the pantropical genus Ixora has revealed an Afro-Madagascan clade, but as yet no study has focused in detail on the evolutionary history and morphological trends in this group. Here the evolutionary history of Afro-Madagascan Ixora spp. (a clade of approx. 80 taxa) is investigated and the phylogenetic trees compared with several key morphological traits in taxa occurring in Madagascar. Methods Phylogenetic relationships of Afro-Madagascan Ixora are assessed using sequence data from four plastid regions (petD, rps16, rpoB-trnC and trnL-trnF) and nuclear ribosomal external transcribed spacer (ETS) and internal transcribed spacer (ITS) regions. The phylogenetic distribution of key morphological characters is assessed. Bayesian inference (implemented in BEAST) is used to estimate the temporal origin of Ixora based on fossil evidence. Key Results Two separate lineages of Madagascan taxa are recovered, one of which is nested in a group of East African taxa. Divergence in Ixora is estimated to have commenced during the mid Miocene, with extensive cladogenesis occurring in the Afro-Madagascan clade during the Pliocene onwards. Conclusions Both lineages of Madagascan Ixora exhibit morphological innovations that are rare throughout the rest of the genus, including a trend towards pauciflorous inflorescences and a trend towards extreme corolla tube length, suggesting that the same ecological and selective pressures are acting upon taxa from both Madagascan lineages. Novel ecological opportunities resulting from climate-induced habitat fragmentation and corolla tube length diversification are likely to have facilitated species radiation on Madagascar. PMID:24142919
Hill, Kathy B R; Marshall, David C; Moulds, Maxwell S; Simon, Chris
2015-07-10
North America has a diverse cicada fauna with multiple genera from all three Cicadidae subfamilies, yet molecular phylogenetic analyses have been completed only for the well-studied periodical cicadas (Magicicada Davis). The genus Tibicen Latreille, a large group of charismatic species, is in need of such work because morphological patterns suggest multiple groups with complicated relationships to other genera in the tribe Cryptotympanini. In this paper we present a molecular phylogenetic analysis, based on mitochondrial and nuclear DNA, of 35 of the 38 extant USA species and subspecies of the genus Tibicen together with their North American tribal allies (Cornuplura Davis, Cacama Davis), selected Tibicen species from Eurasia, and representatives of other Eurasian and Pacific cryptotympanine genera. This tree shows that Tibicen contains several well-supported clades, one predominating in eastern and central North America and related to Cryptotympana Stål and Raiateana Boulard, another in western North America related to Cacama and Cornuplura, and at least two clades in Eurasia. We also present a morphological cladistic analysis of Tibicen and its close allies based on 27 characters. Character states identified in the cladistic analysis define three new genera, two for North American taxa (Hadoa gen. n. and Neotibicen gen. n.) including several Mexican species, and one for Asian species (Subsolanus gen. n.). Using relaxed molecular clocks and literature-derived mtDNA rate estimates, we estimate the timeframe of diversification of Tibicen clades and find that intergeneric divergence has occurred since the late Eocene, with most extant species within the former Tibicen originating after the mid-Miocene. We review patterns of ecology, behavior, and geography among Tibicen clades in light of the phylogenetic results and note that the study of these insects is still in its early stages. 
Some Mexican species formerly placed in Tibicen are here transferred to Diceroprocta, following refinement of the definition of that genus.
Phylogenetic Tools for Generalized HIV-1 Epidemics: Findings from the PANGEA-HIV Methods Comparison
Ratmann, Oliver; Hodcroft, Emma B.; Pickles, Michael; Cori, Anne; Hall, Matthew; Lycett, Samantha; Colijn, Caroline; Dearlove, Bethany; Didelot, Xavier; Frost, Simon; Hossain, A.S. Md Mukarram; Joy, Jeffrey B.; Kendall, Michelle; Kühnert, Denise; Leventhal, Gabriel E.; Liang, Richard; Plazzotta, Giacomo; Poon, Art F.Y.; Rasmussen, David A.; Stadler, Tanja; Volz, Erik; Weis, Caroline; Leigh Brown, Andrew J.; Fraser, Christophe
2017-01-01
Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods’ development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention. PMID:28053012
Verdant: automated annotation, alignment and phylogenetic analysis of whole chloroplast genomes.
McKain, Michael R; Hartsock, Ryan H; Wohl, Molly M; Kellogg, Elizabeth A
2017-01-01
Chloroplast genomes are now produced in the hundreds for angiosperm phylogenetics projects, but current methods for annotation, alignment and tree estimation still require some manual intervention, reducing throughput and increasing analysis time for large chloroplast systematics projects. Verdant is a web-based software suite and database built to take advantage of a novel annotation program, annoBTD. Using annoBTD, Verdant provides accurate annotation of chloroplast genomes without manual intervention. Subsequent alignment and tree estimation can incorporate newly annotated and publicly available plastomes and can accommodate a large number of taxa. Verdant sharply reduces the time required for analysis of assembled chloroplast genomes and removes the need for pipelines and software on personal hardware. Verdant is available at: http://verdant.iplantcollaborative.org/plastidDB/ It is implemented in PHP, Perl, MySQL, Javascript, HTML and CSS with all major browsers supported. Contact: mrmckain@gmail.com. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Combinatorics of least-squares trees.
Mihaescu, Radu; Pachter, Lior
2008-09-09
A recurring theme in the least-squares approach to phylogenetics has been the discovery of elegant combinatorial formulas for the least-squares estimates of edge lengths. These formulas have proved useful for the development of efficient algorithms, and have also been important for understanding connections among popular phylogeny algorithms. For example, the selection criterion of the neighbor-joining algorithm is now understood in terms of the combinatorial formulas of Pauplin for estimating tree length. We highlight a phylogenetically desirable property that weighted least-squares methods should satisfy, and provide a complete characterization of methods that satisfy the property. The necessary and sufficient condition is a multiplicative four-point condition that the variance matrix needs to satisfy. The proof is based on the observation that the Lagrange multipliers in the proof of the Gauss-Markov theorem are tree-additive. Our results generalize and complete previous work on ordinary least squares, balanced minimum evolution, and the taxon-weighted variance model. They also provide a time-optimal algorithm for computation.
NASA Astrophysics Data System (ADS)
Cucchi, T.; Mohaseb, A.; Peigné, S.; Debue, K.; Orlando, L.; Mashkour, M.
2017-04-01
The Plio-Pleistocene evolution of Equus and the subsequent domestication of horses and donkeys remains poorly understood, due to the lack of phenotypic markers capable of tracing this evolutionary process in the palaeontological/archaeological record. Using images from 345 specimens, encompassing 15 extant taxa of equids, we quantified the occlusal enamel folding pattern in four mandibular cheek teeth with a single geometric morphometric protocol. We initially investigated the protocol accuracy by assigning each tooth to its correct anatomical position and taxonomic group. We then contrasted the phylogenetic signal present in each tooth shape with an exome-wide phylogeny from 10 extant equine species. We estimated the strength of the phylogenetic signal using a Brownian motion model of evolution with multivariate K statistic, and mapped the dental shape along the molecular phylogeny using an approach based on squared-change parsimony. We found clear evidence for the relevance of dental phenotypes to accurately discriminate all modern members of the genus Equus and capture their phylogenetic relationships. These results are valuable for both palaeontologists and zooarchaeologists exploring the spatial and temporal dynamics of the evolutionary history of the horse family, up to the latest domestication trajectories of horses and donkeys.
Mohaseb, A.; Peigné, S.; Debue, K.; Orlando, L.; Mashkour, M.
2017-01-01
The Plio–Pleistocene evolution of Equus and the subsequent domestication of horses and donkeys remains poorly understood, due to the lack of phenotypic markers capable of tracing this evolutionary process in the palaeontological/archaeological record. Using images from 345 specimens, encompassing 15 extant taxa of equids, we quantified the occlusal enamel folding pattern in four mandibular cheek teeth with a single geometric morphometric protocol. We initially investigated the protocol accuracy by assigning each tooth to its correct anatomical position and taxonomic group. We then contrasted the phylogenetic signal present in each tooth shape with an exome-wide phylogeny from 10 extant equine species. We estimated the strength of the phylogenetic signal using a Brownian motion model of evolution with multivariate K statistic, and mapped the dental shape along the molecular phylogeny using an approach based on squared-change parsimony. We found clear evidence for the relevance of dental phenotypes to accurately discriminate all modern members of the genus Equus and capture their phylogenetic relationships. These results are valuable for both palaeontologists and zooarchaeologists exploring the spatial and temporal dynamics of the evolutionary history of the horse family, up to the latest domestication trajectories of horses and donkeys. PMID:28484618
On the relationship between phylogenetic diversity and trait diversity.
Tucker, Caroline M; Davies, T Jonathan; Cadotte, Marc W; Pearse, William D
2018-05-21
Niche differences are key to understanding the distribution and structure of biodiversity. To examine niche differences, we must first characterize how species occupy niche space, and two approaches are commonly used in the ecological literature. The first uses species traits to estimate multivariate trait space (so-called functional trait diversity, FD); the second quantifies the amount of time or evolutionary history captured by a group of species (phylogenetic diversity, PD). It is often, but controversially, assumed that these putative measures of niche space are at a minimum correlated and perhaps redundant, since more evolutionary time allows for greater accumulation of trait changes. This theoretical expectation remains surprisingly poorly evaluated, particularly in the context of multivariate measures of trait diversity. We evaluated the relationship between phylogenetic diversity and trait diversity using analytical and simulation-based methods across common models of trait evolution. We show that PD correlates with FD increasingly strongly as more traits are included in the FD measure. Our results indicate that phylogenetic diversity can be a useful surrogate for high-dimensional trait diversity, but we also show that the correlation weakens when the underlying process of trait evolution includes variation in rate and optima. © 2018 by the Ecological Society of America.
Pan-genome and phylogeny of Bacillus cereus sensu lato.
Bazinet, Adam L
2017-08-02
Bacillus cereus sensu lato (s. l.) is an ecologically diverse bacterial group of medical and agricultural significance. In this study, I use publicly available genomes and novel bioinformatic workflows to characterize the B. cereus s. l. pan-genome and perform the largest phylogenetic and population genetic analyses of this group to date in terms of the number of genes and taxa included. With these fundamental data in hand, I identify genes associated with particular phenotypic traits (i.e., "pan-GWAS" analysis), and quantify the degree to which taxa sharing common attributes are phylogenetically clustered. A rapid k-mer based approach (Mash) was used to create reduced representations of selected Bacillus genomes, and a fast distance-based phylogenetic analysis of this data (FastME) was performed to determine which species should be included in B. cereus s. l. The complete genomes of eight B. cereus s. l. species were annotated de novo with Prokka, and these annotations were used by Roary to produce the B. cereus s. l. pan-genome. Scoary was used to associate gene presence and absence patterns with various phenotypes. The orthologous protein sequence clusters produced by Roary were filtered and used to build HaMStR databases of gene models that were used in turn to construct phylogenetic data matrices. Phylogenetic analyses used RAxML, DendroPy, ClonalFrameML, PAUP*, and SplitsTree. Bayesian model-based population genetic analysis assigned taxa to clusters using hierBAPS. The genealogical sorting index was used to quantify the phylogenetic clustering of taxa sharing common attributes. The B. cereus s. l. pan-genome currently consists of ≈60,000 genes, ≈600 of which are "core" (common to at least 99% of taxa sampled). Pan-GWAS analysis revealed genes associated with phenotypes such as isolation source, oxygen requirement, and ability to cause diseases such as anthrax or food poisoning. 
Extensive phylogenetic analyses using an unprecedented amount of data produced phylogenies that were largely concordant with each other and with previous studies. Phylogenetic support as measured by bootstrap probabilities increased markedly when all suitable pan-genome data was included in phylogenetic analyses, as opposed to when only core genes were used. Bayesian population genetic analysis recommended subdividing the three major clades of B. cereus s. l. into nine clusters. Taxa sharing common traits and species designations exhibited varying degrees of phylogenetic clustering. All phylogenetic analyses recapitulated two previously used classification systems, and taxa were consistently assigned to the same major clade and group. By including accessory genes from the pan-genome in the phylogenetic analyses, I produced an exceptionally well-supported phylogeny of 114 complete B. cereus s. l. genomes. The best-performing methods were used to produce a phylogeny of all 498 publicly available B. cereus s. l. genomes, which was in turn used to compare three different classification systems and to test the monophyly status of various B. cereus s. l. species. The majority of the methodology used in this study is generic and could be leveraged to produce pan-genome estimates and similarly robust phylogenetic hypotheses for other bacterial groups.
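The abstract defines "core" genes as those common to at least 99% of the taxa sampled. A minimal sketch of that classification step is given below, assuming a simple gene-to-taxa presence table. The function name, gene names and genome labels are hypothetical and do not reflect the output format of Roary or any other tool named above.

```python
def classify_pan_genome(presence, n_taxa, core_percent=99):
    """Split genes into core and accessory lists.

    presence maps a gene name to the set of taxa in which the gene was found.
    A gene is core when it occurs in at least core_percent percent of taxa.
    """
    core, accessory = [], []
    for gene, taxa in sorted(presence.items()):
        # Integer comparison avoids floating-point edge cases at the threshold.
        if len(taxa) * 100 >= core_percent * n_taxa:
            core.append(gene)
        else:
            accessory.append(gene)
    return core, accessory


# Hypothetical toy data: 100 genomes and four genes.
taxa = [f"genome_{i}" for i in range(100)]
presence = {
    "gene_everywhere": set(taxa),
    "gene_missing_one": set(taxa[:99]),
    "gene_missing_two": set(taxa[:98]),
    "gene_rare": set(taxa[:3]),
}

core, accessory = classify_pan_genome(presence, n_taxa=len(taxa))
print("core:", core)
print("accessory:", accessory)
```

With 100 genomes, a gene missing from one genome still meets the 99% threshold, while a gene missing from two does not.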
Relating phylogenetic trees to transmission trees of infectious disease outbreaks.
Ypma, Rolf J F; van Ballegooijen, W Marijn; Wallinga, Jacco
2013-11-01
Transmission events are the fundamental building blocks of the dynamics of any infectious disease. Much about the epidemiology of a disease can be learned when these individual transmission events are known or can be estimated. Such estimations are difficult and generally feasible only when detailed epidemiological data are available. The genealogy estimated from genetic sequences of sampled pathogens is another rich source of information on transmission history. Optimal inference of transmission events calls for the combination of genetic data and epidemiological data into one joint analysis. A key difficulty is that the transmission tree, which describes the transmission events between infected hosts, differs from the phylogenetic tree, which describes the ancestral relationships between pathogens sampled from these hosts. The trees differ both in timing of the internal nodes and in topology. These differences become more pronounced when a higher fraction of infected hosts is sampled. We show how the phylogenetic tree of sampled pathogens is related to the transmission tree of an outbreak of an infectious disease, by the within-host dynamics of pathogens. We provide a statistical framework to infer key epidemiological and mutational parameters by simultaneously estimating the phylogenetic tree and the transmission tree. We test the approach using simulations and illustrate its use on an outbreak of foot-and-mouth disease. The approach unifies existing methods in the emerging field of phylodynamics with transmission tree reconstruction methods that are used in infectious disease epidemiology.
Peña, Carlos; Espeland, Marianne
2015-01-01
The species rich butterfly family Nymphalidae has been used to study evolutionary interactions between plants and insects. Theories of insect-hostplant dynamics predict accelerated diversification due to key innovations. In evolutionary biology, analysis of maximum credibility trees in the software MEDUSA (modelling evolutionary diversity using stepwise AIC) is a popular method for estimation of shifts in diversification rates. We investigated whether phylogenetic uncertainty can produce different results by extending the method across a random sample of trees from the posterior distribution of a Bayesian run. Using the MultiMEDUSA approach, we found that phylogenetic uncertainty greatly affects diversification rate estimates. Different trees produced diversification rates ranging from high values to almost zero for the same clade, and both significant rate increase and decrease in some clades. Only four out of 18 significant shifts found on the maximum clade credibility tree were consistent across most of the sampled trees. Among these, we found accelerated diversification for Ithomiini butterflies. We used the binary speciation and extinction model (BiSSE) and found that a hostplant shift to Solanaceae is correlated with increased net diversification rates in Ithomiini, congruent with the diffuse cospeciation hypothesis. Our results show that taking phylogenetic uncertainty into account when estimating net diversification rate shifts is of great importance, as very different results can be obtained when using the maximum clade credibility tree and other trees from the posterior distribution. PMID:25830910
Wilcox, Thomas P; Zwickl, Derrick J; Heath, Tracy A; Hillis, David M
2002-11-01
Four New World genera of dwarf boas (Exiliboa, Trachyboa, Tropidophis, and Ungaliophis) have been placed by many systematists in a single group (traditionally called Tropidophiidae). However, the monophyly of this group has been questioned in several studies. Moreover, the overall relationships among basal snake lineages, including the placement of the dwarf boas, are poorly understood. We obtained mtDNA sequence data for 12S, 16S, and intervening tRNA-val genes from 23 species of snakes representing most major snake lineages, including all four genera of New World dwarf boas. We then examined the phylogenetic position of these species by estimating the phylogeny of the basal snakes. Our phylogenetic analysis suggests that New World dwarf boas are not monophyletic. Instead, we find Exiliboa and Ungaliophis to be most closely related to sand boas (Erycinae), boas (Boinae), and advanced snakes (Caenophidea), whereas Tropidophis and Trachyboa form an independent clade that separated relatively early in snake radiation. Our estimate of snake phylogeny differs significantly in other ways from some previous estimates of snake phylogeny. For instance, pythons do not cluster with boas and sand boas, but instead show a strong relationship with Loxocemus and Xenopeltis. Additionally, uropeltids cluster strongly with Cylindrophis, and together are embedded in what has previously been considered the macrostomatan radiation. These relationships are supported by both bootstrapping (parametric and nonparametric approaches) and Bayesian analysis, although Bayesian support values are consistently higher than those obtained from nonparametric bootstrapping. Simulations show that Bayesian support values represent much better estimates of phylogenetic accuracy than do nonparametric bootstrap support values, at least under the conditions of our study. Copyright 2002 Elsevier Science (USA)
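Nonparametric bootstrapping, one of the support measures compared in this abstract, resamples alignment columns with replacement to build pseudo-replicate data sets, each of which is then analysed like the original. The sketch below shows only the resampling step, under the assumption that the alignment is held as a name-to-sequence mapping. The taxon labels and sequences are hypothetical, and tree estimation on each replicate is left out.

```python
import random


def bootstrap_alignment(alignment, rng):
    """Return one bootstrap pseudo-replicate of an alignment.

    alignment maps a taxon name to its aligned sequence (all of equal length).
    Columns are drawn with replacement, and the same draw is applied to every
    taxon so that each resampled site remains an intact alignment column.
    """
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all sequences must have the same aligned length")
    length = lengths.pop()
    columns = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(seq[i] for i in columns) for name, seq in alignment.items()}


# Hypothetical toy alignment of three taxa and eight sites.
toy = {
    "taxon_1": "ACGTACGT",
    "taxon_2": "ACGTTCGT",
    "taxon_3": "ACCTTCGA",
}

rng = random.Random(1)  # fixed seed so the example is repeatable
replicate = bootstrap_alignment(toy, rng)
for name in sorted(replicate):
    print(name, replicate[name])
```

In practice this step is repeated many times, and the bootstrap support of a clade is the fraction of replicate trees that contain it.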
Escobedo, Víctor M; Rios, Rodrigo S; Salgado-Luarte, Cristian; Stotz, Gisela C; Gianoli, Ernesto
2017-03-01
Disturbance often drives plant invasion and may modify community assembly. However, little is known about how these modifications of community patterns occur in terms of taxonomic, functional and phylogenetic structure. This study evaluated in an arid shrubland the influence of disturbance by an endemic rodent on community functional divergence and phylogenetic structure as well as on plant invasion. It was expected that disturbance would operate as a habitat filter favouring exotic species with short life cycles. Sixteen plots were sampled along a disturbance gradient caused by the endemic fossorial rodent Spalacopus cyanus, measuring community parameters and estimating functional divergence for life history traits (functional dispersion index) and the relative contribution to functional divergence of exotic and native species. The phylogenetic signal (Pagel's lambda) and phylogenetic community structure (mean phylogenetic distance and mean nearest taxon phylogenetic distance) were also estimated. The use of a continuous approach to the disturbance gradient allowed the identification of non-linear relationships between disturbance and community parameters. The relationship between disturbance and both species richness and abundance was positive for exotic species and negative for native species. Disturbance modified community composition, and exotic species were associated with more disturbed sites. Disturbance increased trait convergence, which resulted in phylogenetic clustering because traits showed a significant phylogenetic signal. The relative contribution of exotic species to functional divergence increased, while that of natives decreased, with disturbance. Exotic and native species were not phylogenetically distinct. Disturbance by rodents in this arid shrubland constitutes a habitat filter over phylogeny-dependent life history traits, leading to phylogenetic clustering, and drives invasion by favouring species with short life cycles. 
Results can be explained by high phenotypic and phylogenetic resemblance between exotic and native species. The use of continuous gradients when studying the effects of disturbance on community assembly is advocated. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com
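The two community-structure measures named in this abstract, mean phylogenetic distance and mean nearest taxon phylogenetic distance, can both be computed from a table of pairwise phylogenetic distances. The sketch below gives the unweighted (presence/absence) versions under that assumption. The species labels and distances are hypothetical, and the comparison against null communities that is normally used to judge clustering is omitted.

```python
from itertools import combinations


def mean_pairwise_distance(species, dist):
    """Mean phylogenetic distance over all pairs of species in a community."""
    pairs = list(combinations(species, 2))
    return sum(dist[frozenset(p)] for p in pairs) / len(pairs)


def mean_nearest_taxon_distance(species, dist):
    """Mean, over species, of the distance to the closest other species present."""
    nearest = [
        min(dist[frozenset((s, t))] for t in species if t != s)
        for s in species
    ]
    return sum(nearest) / len(nearest)


# Hypothetical pairwise phylogenetic distances among three co-occurring species.
dist = {
    frozenset(("sp_A", "sp_B")): 2.0,
    frozenset(("sp_A", "sp_C")): 6.0,
    frozenset(("sp_B", "sp_C")): 6.0,
}
community = ["sp_A", "sp_B", "sp_C"]

print(round(mean_pairwise_distance(community, dist), 3))       # (2 + 6 + 6) / 3
print(round(mean_nearest_taxon_distance(community, dist), 3))  # (2 + 2 + 6) / 3
```

Lower values of either measure than expected under a null model indicate phylogenetic clustering, the pattern reported here for disturbed plots.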
Mansion, Guilhem; Parolly, Gerald; Crowl, Andrew A.; Mavrodiev, Evgeny; Cellinese, Nico; Oganesian, Marine; Fraunhofer, Katharina; Kamari, Georgia; Phitos, Dimitrios; Haberle, Rosemarie; Akaydin, Galip; Ikinci, Nursel; Raus, Thomas; Borsch, Thomas
2012-01-01
Background: Speciose clades usually harbor species with a broad spectrum of adaptive strategies and complex distribution patterns, and thus constitute ideal systems to disentangle biotic and abiotic causes underlying species diversification. The delimitation of such study systems to test evolutionary hypotheses is difficult because they often rely on artificial genus concepts as starting points. One of the most prominent examples is the bellflower genus Campanula with some 420 species, but up to 600 species when including all lineages to which Campanula is paraphyletic. We generated a large alignment of petD group II intron sequences to include more than 70% of described species as a reference. By comparison with partial data sets we could then assess the impact of selective taxon sampling strategies on phylogenetic reconstruction and subsequent evolutionary conclusions. Methodology/Principal Findings: Phylogenetic analyses based on maximum parsimony (PAUP, PRAP), Bayesian inference (MrBayes), and maximum likelihood (RAxML) were first carried out on the large reference data set (D680). Parameters including tree topology, branch support, and age estimates, were then compared to those obtained from smaller data sets resulting from “classification-guided” (D088) and “phylogeny-guided sampling” (D101). Analyses of D088 failed to fully recover the phylogenetic diversity in Campanula, whereas D101 inferred significantly different branch support and age estimates. Conclusions/Significance: A short genomic region with high phylogenetic utility allowed us to easily generate a comprehensive phylogenetic framework for the speciose Campanula clade. Our approach recovered 17 well-supported and circumscribed sub-lineages. 
Knowing these will be instrumental for developing more specific evolutionary hypotheses and guiding future research. We highlight the predictive value of a mass taxon-sampling strategy as a first essential step towards illuminating the detailed evolutionary history of diverse clades. PMID:23209646
Phylogenetic trees in bioinformatics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Burr, Tom L
2008-01-01
Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use of probabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length; and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that finding the best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.
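The "huge number of possible trees" for even a modest number of OTUs can be made concrete with the standard counting result for fully resolved (binary) trees with labelled tips: there are (2n-5)!! unrooted topologies for n >= 3 OTUs and (2n-3)!! rooted ones, where !! denotes the product of odd numbers. The sketch below is an illustration of that count, not code from the reviewed paper, and the function names are hypothetical.

```python
def count_unrooted_trees(n_otus):
    """Number of unrooted binary tree topologies on n labelled OTUs: (2n - 5)!!."""
    if n_otus < 3:
        raise ValueError("need at least 3 OTUs for an unrooted binary tree")
    count = 1
    for k in range(3, 2 * n_otus - 5 + 1, 2):  # product of the odd numbers 3 .. (2n - 5)
        count *= k
    return count


def count_rooted_trees(n_otus):
    """Number of rooted binary tree topologies on n labelled OTUs: (2n - 3)!!.

    Rooting is equivalent to adding one extra leaf that marks the root position.
    """
    if n_otus < 2:
        raise ValueError("need at least 2 OTUs for a rooted binary tree")
    return count_unrooted_trees(n_otus + 1)


for n in (4, 10, 20, 50):
    print(n, count_unrooted_trees(n))
```

Ten OTUs already allow about two million unrooted topologies, which is why exhaustive search is replaced by heuristics for all but the smallest data sets.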
What is the danger of the anomaly zone for empirical phylogenetics?
Huang, Huateng; Knowles, L Lacey
2009-10-01
The increasing number of observations of gene trees with discordant topologies in phylogenetic studies has raised awareness about the problems of incongruence between species trees and gene trees. Moreover, theoretical treatments focusing on the impact of coalescent variance on phylogenetic study have also identified situations where the most probable gene trees are ones that do not match the underlying species tree (i.e., anomalous gene trees [AGTs]). However, although the theoretical proof of the existence of AGTs is alarming, the actual risk that AGTs pose to empirical phylogenetic study is far from clear. Establishing the conditions (i.e., the branch lengths in a species tree) for which AGTs are possible does not address the critical issue of how prevalent they might be. Furthermore, theoretical characterization of the species trees for which AGTs may pose a problem (i.e., the anomaly zone or the species histories for which AGTs are theoretically possible) is based on consideration of just one source of variance that contributes to species tree and gene tree discord: gene lineage coalescence. Yet, empirical data contain another important stochastic component: mutational variance. Estimated gene trees will differ from the underlying gene trees (i.e., the actual genealogy) because of the random process of mutation. Here, we take a simulation approach to investigate the prevalence of AGTs, among estimated gene trees, thereby characterizing the boundaries of the anomaly zone taking into account both coalescent and mutational variances. We also determine the frequency of realized AGTs, which is critical to putting the theoretical work on AGTs into a realistic biological context. Two salient results emerge from this investigation. First, our results show that mutational variance can indeed expand the parameter space (i.e., the relative branch lengths in a species tree) where AGTs might be observed in empirical data. 
By exploring the underlying cause for the expanded anomaly zone, we identify aspects of empirical data relevant to avoiding the problems that AGTs pose for species tree inference from multilocus data. Second, for the empirical species histories where AGTs are possible, unresolved trees, not AGTs, predominate the pool of estimated gene trees. This result suggests that the risk of AGTs, while they exist in theory, may rarely be realized in practice. By considering the biological realities of both mutational and coalescent variances, the study has refined, and redefined, what the actual challenges are for empirical phylogenetic study of recently diverged taxa that have speciated rapidly: AGTs themselves are unlikely to pose a significant danger to empirical phylogenetic study.
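Coalescent variance, the first source of discord discussed in this abstract, has a simple closed form in the three-taxon case: with an internal species-tree branch of length T in coalescent units, a gene tree matches the species tree with probability 1 - (2/3)exp(-T), and each of the two discordant topologies has probability (1/3)exp(-T). The sketch below evaluates that standard result. It is only an illustration of coalescent discord and not the simulation design of the study, and with three taxa the matching topology is always the most probable, so there is no anomaly zone (AGTs require four or more taxa). The function name is hypothetical.

```python
import math


def three_taxon_gene_tree_probs(internal_branch):
    """Gene tree topology probabilities for a three-taxon species tree.

    internal_branch is the internal branch length in coalescent units.
    Returns (probability of the concordant topology,
             probability of each of the two discordant topologies).
    """
    if internal_branch < 0:
        raise ValueError("branch length must be non-negative")
    p_no_coalescence = math.exp(-internal_branch)  # lineages fail to coalesce on the branch
    concordant = 1.0 - (2.0 / 3.0) * p_no_coalescence
    each_discordant = p_no_coalescence / 3.0
    return concordant, each_discordant


for t in (0.0, 0.1, 0.5, 1.0, 3.0):
    concordant, discordant = three_taxon_gene_tree_probs(t)
    print(f"T={t:<4} concordant={concordant:.3f} each discordant={discordant:.3f}")
```

Short internal branches push all three topologies towards equal probability, which is the regime of rapid speciation that the abstract is concerned with.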
Likelihood of Tree Topologies with Fossils and Diversification Rate Estimation.
Didier, Gilles; Fau, Marine; Laurin, Michel
2017-11-01
Since the diversification process cannot be directly observed at the human scale, it has to be studied from the information available, namely the extant taxa and the fossil record. In this sense, phylogenetic trees including both extant taxa and fossils are the most complete representations of the diversification process that one can get. Such phylogenetic trees can be reconstructed from molecular and morphological data, to some extent. Among the temporal information of such phylogenetic trees, fossil ages are by far the most precisely known (divergence times are inferences calibrated mostly with fossils). We propose here a method to compute the likelihood of a phylogenetic tree with fossils in which the only considered time information is the fossil ages, and apply it to the estimation of the diversification rates from such data. Since it is required in our computation, we provide a method for determining the probability of a tree topology under the standard diversification model. Testing our approach on simulated data shows that the maximum likelihood rate estimates from the phylogenetic tree topology and the fossil dates are almost as accurate as those obtained by taking into account all the data, including the divergence times. Moreover, they are substantially more accurate than the estimates obtained only from the exact divergence times (without taking into account the fossil record). We also provide an empirical example composed of 50 Permo-Carboniferous eupelycosaur (early synapsid) taxa ranging in age from about 315 Ma (Late Carboniferous) to 270 Ma (shortly after the end of the Early Permian). Our analyses suggest a speciation (cladogenesis, or birth) rate of about 0.1 per lineage and per myr, a marginally lower extinction rate, and a considerable hidden paleobiodiversity of early synapsids. [Extinction rate; fossil ages; maximum likelihood estimation; speciation rate.]. © The Author(s) 2017. 
Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Kämpfer, Peter; Falsen, Enevold; Busse, Hans-Jürgen
2008-01-01
Pseudomonas mephitica CCUG 2513(T) has been reinvestigated to clarify its taxonomic position. 16S rRNA gene sequence comparisons demonstrated that this strain clusters phylogenetically closely with Janthinobacterium lividum (99.8% sequence similarity to the type strain). Investigation of fatty acid patterns, polar lipid profiles, polyamine patterns and quinone systems supported this delineation. Substrate utilization profiles and biochemical characteristics displayed no differences from the type strain of J. lividum, CCUG 2344(T). Therefore, the reclassification of Pseudomonas mephitica as a later heterotypic synonym of Janthinobacterium lividum is proposed, based upon the estimated phylogenetic position derived from 16S rRNA gene sequence data and chemotaxonomic and biochemical data.
A review of criticisms of phylogenetic nomenclature: is taxonomic freedom the fundamental issue?
Bryant, Harold N; Cantino, Philip D
2002-02-01
The proposal to implement a phylogenetic nomenclatural system governed by the PhyloCode, in which taxon names are defined by explicit reference to common descent, has met with strong criticism from some proponents of phylogenetic taxonomy (taxonomy based on the principle of common descent in which only clades and species are recognized). We examine these criticisms and find that some of the perceived problems with phylogenetic nomenclature are based on misconceptions, some are equally true of the current rank-based nomenclatural system, and some will be eliminated by implementation of the PhyloCode. Most of the criticisms are related to an overriding concern that, because the meanings of names are associated with phylogenetic pattern which is subject to change, the adoption of phylogenetic nomenclature will lead to increased instability in the content of taxa. This concern is associated with the fact that, despite the widespread adoption of the view that taxa are historical entities that are conceptualized based on ancestry, many taxonomists also conceptualize taxa based on their content. As a result, critics of phylogenetic nomenclature have argued that taxonomists should be free to emend the content of taxa without constraints imposed by nomenclatural decisions. However, in phylogenetic nomenclature the contents of taxa are determined, not by the taxonomist, but by the combination of the phylogenetic definition of the name and a phylogenetic hypothesis. Because the contents of taxa, once their names are defined, can no longer be freely modified by taxonomists, phylogenetic nomenclature is perceived as limiting taxonomic freedom. We argue that the form of taxonomic freedom inherent to phylogenetic nomenclature is appropriate to phylogenetic taxonomy in which taxa are considered historical entities that are discovered through phylogenetic analysis and are not human constructs.
Phylogeny and temporal diversification of darters (Percidae: Etheostomatinae).
Near, Thomas J; Bossu, Christen M; Bradburd, Gideon S; Carlson, Rose L; Harrington, Richard C; Hollingsworth, Phillip R; Keck, Benjamin P; Etnier, David A
2011-10-01
Discussions aimed at resolution of the Tree of Life are most often focused on the interrelationships of major organismal lineages. In this study, we focus on the resolution of some of the most apical branches in the Tree of Life through exploration of the phylogenetic relationships of darters, a species-rich clade of North American freshwater fishes. With a near-complete taxon sampling of close to 250 species, we aim to investigate strategies for efficient multilocus data sampling and the estimation of divergence times using relaxed-clock methods when a clade lacks a fossil record. Our phylogenetic data set comprises a single mitochondrial DNA (mtDNA) gene and two nuclear genes sampled from 245 of the 248 darter species. This dense sampling allows us to determine if a modest amount of nuclear DNA sequence data can resolve relationships among closely related animal species. Darters lack a fossil record to provide age calibration priors in relaxed-clock analyses. Therefore, we use a near-complete species-sampled phylogeny of the perciform clade Centrarchidae, which has a rich fossil record, to assess two distinct strategies of external calibration in relaxed-clock divergence time estimates of darters: using ages inferred from the fossil record and molecular evolutionary rate estimates. Comparison of Bayesian phylogenies inferred from mtDNA and nuclear genes reveals that heterospecific mtDNA is present in approximately 12.5% of all darter species. 
We identify three patterns of mtDNA introgression in darters: proximal mtDNA transfer, which involves the transfer of mtDNA among extant and sympatric darter species, indeterminate introgression, which involves the transfer of mtDNA from a lineage that cannot be confidently identified because the introgressed haplotypes are not clearly referable to mtDNA haplotypes in any recognized species, and deep introgression, which is characterized by species diversification within a recipient clade subsequent to the transfer of heterospecific mtDNA. The results of our analyses indicate that DNA sequences sampled from single-copy nuclear genes can provide appreciable phylogenetic resolution for closely related animal species. A well-resolved near-complete species-sampled phylogeny of darters was estimated with Bayesian methods using a concatenated mtDNA and nuclear gene data set with all identified heterospecific mtDNA haplotypes treated as missing data. The relaxed-clock analyses resulted in very similar posterior age estimates across the three sampled genes and methods of calibration and therefore offer a viable strategy for estimating divergence times for clades that lack a fossil record. In addition, an informative rank-free clade-based classification of darters that preserves the rich history of nomenclature in the group and provides formal taxonomic communication of darter clades was constructed using the mtDNA and nuclear gene phylogeny. On the whole, the appeal of mtDNA for phylogeny inference among closely related animal species is diminished by the observations of extensive mtDNA introgression and by finding appreciable phylogenetic signal in a modest sampling of nuclear genes in our phylogenetic analyses of darters.
Lanier, Hayley C; Knowles, L Lacey
2015-02-01
Coalescent-based methods for species-tree estimation are becoming a dominant approach for reconstructing species histories from multi-locus data, with most of the studies examining these methodologies focused on recently diverged species. However, deeper phylogenies, such as the datasets that comprise many Tree of Life (ToL) studies, also exhibit gene-tree discordance. This discord may also arise from the stochastic sorting of gene lineages during the speciation process (i.e., reflecting the random coalescence of gene lineages in ancestral populations). It remains unknown whether guidelines regarding methodologies and numbers of loci established by simulation studies at shallow tree depths translate into accurate species relationships for deeper phylogenetic histories. We address this knowledge gap and specifically identify the challenges and limitations of species-tree methods that account for coalescent variance for deeper phylogenies. Using simulated data with characteristics informed by empirical studies, we evaluate both the accuracy of estimated species trees and the characteristics associated with recalcitrant nodes, with a specific focus on whether coalescent variance is generally responsible for the lack of resolution. By determining the proportion of coalescent genealogies that support a particular node, we demonstrate that (1) species-tree methods account for coalescent variance at deep nodes and (2) mutational variance - not gene-tree discord arising from the coalescent - posed the primary challenge for accurate reconstruction across the tree. For example, many nodes were accurately resolved despite predicted discord from the random coalescence of gene lineages and nodes with poor support were distributed across a range of depths (i.e., they were not restricted to particularly recent divergences). 
Given their broad taxonomic scope and large sampling of taxa, deep level phylogenies pose several potential methodological complications including difficulties with MCMC convergence and estimation of requisite population genetic parameters for coalescent-based approaches. Despite these difficulties, the findings generally support the utility of species-tree analyses for the estimation of species relationships throughout the ToL. We discuss strategies for successful application of species-tree approaches to deep phylogenies. Copyright © 2014 Elsevier Inc. All rights reserved.
Phylogenetic Tools for Generalized HIV-1 Epidemics: Findings from the PANGEA-HIV Methods Comparison.
Ratmann, Oliver; Hodcroft, Emma B; Pickles, Michael; Cori, Anne; Hall, Matthew; Lycett, Samantha; Colijn, Caroline; Dearlove, Bethany; Didelot, Xavier; Frost, Simon; Hossain, A S Md Mukarram; Joy, Jeffrey B; Kendall, Michelle; Kühnert, Denise; Leventhal, Gabriel E; Liang, Richard; Plazzotta, Giacomo; Poon, Art F Y; Rasmussen, David A; Stadler, Tanja; Volz, Erik; Weis, Caroline; Leigh Brown, Andrew J; Fraser, Christophe
2017-01-01
Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods' development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gaubert, Philippe; Veron, Géraldine
2003-01-01
Although molecular studies have helped to clarify the phylogeny of the problematic family Viverridae, a recent phylogenetic investigation based on cytochrome b (cyt b) has excluded the Asiatic linsangs (genus Prionodon) from the family. To assess the phylogenetic position of the Asiatic linsangs within the Feliformia, we analysed an exhaustive taxonomic sample set with cyt b and newly produced transthyretin intron I sequences (TR-I-I). TR-I-I alone and cyt b +TR-I-I combined (maximum-likelihood analysis) highly support the position of Asiatic linsangs as sister-group of the Felidae. The estimation of minimum divergence dates from molecular data suggests a splitting event ca. 33.3 million years (Myr) ago, which lends support to historical assertions that the Asiatic linsangs are "living fossils" that share a plesiomorphic morphotype with the Oligocene feliform Paleoprionodon. The African linsang is estimated to appear more than 20 Myr later and represents the sister-group of the genus Genetta. Our phylogenetic results illustrate numerous morphological convergences of "diagnostic" characters among Feliformia that might be problematic for the identification of fossil taxa. The morphotype reappearance from the Asiatic to the African linsangs suggests that the genome of the Feliformia conserved its potential ability of expression for a peculiar adaptive phenotype throughout evolution, in this case arboreality and hypercarnivory in tropical forest. PMID:14667345
Jonniaux, Pierre; Kumazawa, Yoshinori
2008-01-15
Mitochondrial DNA sequences of approximately 2.3 kbp including the complete NADH dehydrogenase subunit 2 gene and its flanking genes, as well as parts of 12S and 16S rRNA genes were determined from major species of the eyelid gecko family Eublepharidae sensu [Kluge, A.G. 1987. Cladistic relationships in the Gekkonoidea (Squamata, Sauria). Misc. Publ. Mus. Zool. Univ. Michigan 173, 1-54.]. In contrast to previous morphological studies, phylogenetic analyses based on these sequences supported that Eublepharidae and Gekkonidae form a sister group with Pygopodidae, raising the possibility of homoplasious character change in some key features of geckos, such as reduction of movable eyelids and innovation of climbing toe pads. The phylogenetic analyses also provided a well-resolved tree for relationships between the eublepharid species. The Bayesian estimation of divergence times without assuming the molecular clock suggested the Jurassic divergence of Eublepharidae from Gekkonidae and radiations of most eublepharid genera around the Cretaceous. These dating results appeared to be robust against some conditional changes for time estimation, such as gene regions used, taxon representation, and data partitioning. Taken together with geological evidence, these results support the vicariant divergence of Eublepharidae and Gekkonidae by the breakup of Pangea into Laurasia and Gondwanaland, and recent dispersal of two African eublepharid genera from Eurasia to Africa after these landmasses were connected in the Early Miocene.
Wan, Yizhen; Schwaninger, Heidi R; Baldo, Angela M; Labate, Joanne A; Zhong, Gan-Yuan; Simon, Charles J
2013-07-05
Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species, varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 - 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 - 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use.
Steinke, Dirk; Salzburger, Walter; Meyer, Axel
2006-06-01
The power of comparative phylogenomic analyses also depends on the amount of data that are included in such studies. We used expressed sequence tags (ESTs) from fish model species as a proof of principle approach in order to test the reliability of using ESTs for phylogenetic inference. As expected, the robustness increases with the amount of sequences. Although some progress has been made in the elucidation of the phylogeny of teleosts, relationships among the main lineages of the derived fish (Euteleostei) remain poorly defined and are still debated. We performed a phylogenomic analysis of a set of 42 orthologous genes from 10 available fish model systems from seven different orders (Salmoniformes, Siluriformes, Cypriniformes, Tetraodontiformes, Cyprinodontiformes, Beloniformes, and Perciformes) of euteleostean fish to estimate divergence times and evolutionary relationships among those lineages. All 10 fish species serve as models for developmental, aquaculture, genomic, and comparative genetic studies. The phylogenetic signal and the strength of the contribution of each of the 42 orthologous genes were estimated with randomly chosen data subsets. Our study revealed a molecular phylogeny of higher-level relationships of derived teleosts, which indicates that the use of multiple genes produces robust phylogenies, a finding that is expected to apply to other phylogenetic issues among distantly related taxa. Our phylogenomic analyses confirm that the euteleostean superorders Ostariophysi and Acanthopterygii are monophyletic and the Protacanthopterygii and Ostariophysi are sister clades. In addition, and contrary to the traditional phylogenetic hypothesis, our analyses determine that killifish (Cyprinodontiformes), medaka (Beloniformes), and cichlids (Perciformes) appear to be more closely related to each other than either of them is to pufferfish (Tetraodontiformes). 
All 10 lineages split before or during the fragmentation of the supercontinent Pangea in the Jurassic.
USDA-ARS's Scientific Manuscript database
The phylogenetic diversity of true morels (Morchella) in China was estimated by initially analyzing nuclear ribosomal internal transcribed spacer (ITS) rDNA sequences from 361 specimens collected in 21 provinces during the 2003-2011 growing seasons, together with six collections obtained on loan fro...
Marcussen, Thomas; Heier, Lise; Brysting, Anne K.; Oxelman, Bengt; Jakobsen, Kjetill S.
2015-01-01
Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batch.). The genus contains ca 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the “correct” network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies (“ghost subgenome lineages”) significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. 
Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage—between 67% and 88%—of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network, have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hampers network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events. PMID:25281848
How Accurate and Robust Are the Phylogenetic Estimates of Austronesian Language Relationships?
Greenhill, Simon J.; Drummond, Alexei J.; Gray, Russell D.
2010-01-01
We recently used computational phylogenetic methods on lexical data to test between two scenarios for the peopling of the Pacific. Our analyses of lexical data supported a pulse-pause scenario of Pacific settlement in which the Austronesian speakers originated in Taiwan around 5,200 years ago and rapidly spread through the Pacific in a series of expansion pulses and settlement pauses. We claimed that there was high congruence between traditional language subgroups and those observed in the language phylogenies, and that the estimated age of the Austronesian expansion at 5,200 years ago was consistent with the archaeological evidence. However, the congruence between the language phylogenies and the evidence from historical linguistics was not quantitatively assessed using tree comparison metrics. The robustness of the divergence time estimates to different calibration points was also not investigated exhaustively. Here we address these limitations by using a systematic tree comparison metric to calculate the similarity between the Bayesian phylogenetic trees and the subgroups proposed by historical linguistics, and by re-estimating the age of the Austronesian expansion using only the most robust calibrations. The results show that the Austronesian language phylogenies are highly congruent with the traditional subgroupings, and the date estimates are robust even when calculated using a restricted set of historical calibrations. PMID:20224774
Phylogenetic analysis of the GST family in Anopheles (Nyssorhynchus) darlingi.
Azevedo-Júnior, Gilson Martins de; Guimarães-Marques, Giselle Moura; Cegatti Bridi, Leticia; Christine Ohse, Ketlen; Vicentini, Renato; Tadei, Wanderli; Rafael, Míriam Silva
2014-08-01
Anopheles darlingi Root, 1926 and Anopheles gambiae (Diptera: Culicidae) are the most important human malaria vectors in South America and Africa, respectively. The two species are estimated to have diverged 100 million years ago. Studies on the phylogenetics and evolution of gene sequences, such as glutathione S-transferase (GST), in disease-transmitting mosquitoes are scarce. The sigma class GST (KC890767) from the transcriptome of An. darlingi captured in the Brazilian Amazon was studied by in silico hybridization, and mapped to chromosome 3 of An. gambiae. The sigma class GST of An. darlingi was used for phylogenetic analyses to understand the GST base composition of the most recent common ancestor of An. darlingi, An. gambiae, Aedes aegypti and Culex quinquefasciatus. The GST (KC890767) of An. darlingi was studied to generate the main divergence branches using Neighbor-Joining and bootstrapping approaches to confirm confidence levels on the tree nodes that separate An. darlingi and the other mosquito species. The results showed divergence between An. gambiae, Ae. aegypti, Cx. quinquefasciatus, and Phlebotomus papatasi as outgroup, and the homology relationship between sigma class GST of An. darlingi and GSTS1_1 gene of An. gambiae was valuable for phylogenetic and evolutionary studies. Copyright © 2014 Elsevier B.V. All rights reserved.
Johnson, Leigh A; Chan, Lauren M; Weese, Terri L; Busby, Lisa D; McMurry, Samuel
2008-09-01
Members of the phlox family (Polemoniaceae) serve as useful models for studying various evolutionary and biological processes. Despite its biological importance, no family-wide phylogenetic estimate based on multiple DNA regions with complete generic sampling is available. Here, we analyze one nuclear and five chloroplast DNA sequence regions (nuclear ITS, chloroplast matK, trnL intron plus trnL-trnF intergenic spacer, and the trnS-trnG, trnD-trnT, and psbM-trnD intergenic spacers) using parsimony and Bayesian methods, as well as assessments of congruence and long branch attraction, to explore phylogenetic relationships among 84 ingroup species representing all currently recognized Polemoniaceae genera. Relationships inferred from the ITS and concatenated chloroplast regions are similar overall. A combined analysis provides strong support for the monophyly of Polemoniaceae and subfamilies Acanthogilioideae, Cobaeoideae, and Polemonioideae. Relationships among subfamilies, and thus for the precise root of Polemoniaceae, remain poorly supported. Within the largest subfamily, Polemonioideae, four clades corresponding to tribes Polemonieae, Phlocideae, Gilieae, and Loeselieae receive strong support. The monogeneric Polemonieae appears sister to Phlocideae. Relationships within Polemonieae, Phlocideae, and Gilieae are mostly consistent between analyses and data permutations. Many relationships within Loeselieae remain uncertain. Overall, inferred phylogenetic relationships support a higher-level classification for Polemoniaceae proposed in 2000.
DeChaine, Eric G.; Anderson, Stacy A.; McNew, Jennifer M.; Wendling, Barry M.
2013-01-01
Arctic-alpine plants in the genus Saxifraga L. (Saxifragaceae Juss.) provide an excellent system for investigating the process of diversification in northern regions. Yet, sect. Trachyphyllum (Gaud.) Koch, which comprises about 8 to 26 species, has still not been explored by molecular systematists even though taxonomists concur that the section needs to be thoroughly re-examined. Our goals were to use chloroplast trnL-F and nuclear ITS DNA sequence data to circumscribe the section phylogenetically, test models of geographically-based population divergence, and assess the utility of morphological characters in estimating evolutionary relationships. To do so, we sequenced both genetic markers for 19 taxa within the section. The phylogenetic inferences of sect. Trachyphyllum using maximum likelihood and Bayesian analyses showed that the section is polyphyletic, with S. aspera L. and S. bryoides L. falling outside the main clade. In addition, the analyses supported several taxonomic re-classifications to prior names. We used two approaches to test biogeographic hypotheses: i) a coalescent approach in Mesquite to test the fit of our reconstructed gene trees to geographically-based models of population divergence and ii) a maximum likelihood inference in Lagrange. These tests uncovered strong support for an origin of the clade in the Southern Rocky Mountains of North America followed by dispersal and divergence episodes across refugia. Finally, we adopted a stochastic character mapping approach in SIMMAP to investigate the utility of morphological characters in estimating evolutionary relationships among taxa. We found that few morphological characters were phylogenetically informative and many were misleading. Our molecular analyses provide a foundation for the diversity and evolutionary relationships within sect. Trachyphyllum and hypotheses for better understanding the patterns and processes of divergence in this section, other saxifrages, and plants inhabiting the North Pacific Rim. PMID:23922810
Sass, Chodon; Iles, William J D; Barrett, Craig F; Smith, Selena Y; Specht, Chelsea D
2016-01-01
The Zingiberales are an iconic order of monocotyledonous plants comprising eight families with distinctive and diverse floral morphologies and representing an important ecological element of tropical and subtropical forests. While the eight families are demonstrated to be monophyletic, phylogenetic relationships among these families remain unresolved. Neither combined morphological and molecular studies nor recent attempts to resolve family relationships using sequence data from whole plastomes has resulted in a well-supported, family-level phylogenetic hypothesis of relationships. Here we approach this challenge by leveraging the complete genome of one member of the order, Musa acuminata, together with transcriptome information from each of the other seven families to design a set of nuclear loci that can be enriched from highly divergent taxa with a single array-based capture of indexed genomic DNA. A total of 494 exons from 418 nuclear genes were captured for 53 ingroup taxa. The entire plastid genome was also captured for the same 53 taxa. Of the total genes captured, 308 nuclear and 68 plastid genes were used for phylogenetic estimation. The concatenated plastid and nuclear dataset supports the position of Musaceae as sister to the remaining seven families. Moreover, the combined dataset recovers known intra- and inter-family phylogenetic relationships with generally high bootstrap support. This is a flexible and cost effective method that gives the broader plant biology community a tool for generating phylogenomic scale sequence data in non-model systems at varying evolutionary depths. PMID:26819846
A taxonomic wish-list for community ecology.
Gotelli, Nicholas J
2004-01-01
Community ecology seeks to explain the number and relative abundance of coexisting species. Four research frontiers in community ecology are closely tied to research in systematics and taxonomy: the statistics of species richness estimators, global patterns of biodiversity, the influence of global climate change on community structure, and phylogenetic influences on community structure. The most pressing needs for taxonomic information in community ecology research are usable taxonomic keys, current nomenclature, species occurrence records and resolved phylogenies. These products can best be obtained from Internet-based phylogenetic and taxonomic resources, but the lack of trained professional systematists and taxonomists threatens this effort. Community ecologists will benefit most directly from research in systematics and taxonomy by making better use of resources in museums and herbaria, and by actively seeking training, information and collaborations with taxonomic specialists. PMID:15253346
Qian, Hong; Chen, Shengbin; Zhang, Jin-Long
2017-07-17
Niche-based and neutrality-based theories are two major classes of theories explaining the assembly mechanisms of local communities. Both theories have been frequently used to explain species diversity and composition in local communities but their relative importance remains unclear. Here, we analyzed 57 assemblages of angiosperm trees in 0.1-ha forest plots across China to examine the effects of environmental heterogeneity (relevant to niche-based processes) and spatial contingency (relevant to neutrality-based processes) on phylogenetic structure of angiosperm tree assemblages distributed across a wide range of environment and space. Phylogenetic structure was quantified with six phylogenetic metrics (i.e., phylogenetic diversity, mean pairwise distance, mean nearest taxon distance, and the standardized effect sizes of these three metrics), which emphasize different depths of evolutionary histories and account for different degrees of species richness effects. Our results showed that the variation in phylogenetic metrics explained independently by environmental variables was on average much greater than that explained independently by spatial structure, and the vast majority of the variation in phylogenetic metrics was explained by spatially structured environmental variables. We conclude that niche-based processes have played a more important role than neutrality-based processes in driving phylogenetic structure of angiosperm tree species in forest communities in China.
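As an illustration of two of the metrics named in the abstract above, the following minimal Python sketch computes mean pairwise distance (MPD), mean nearest taxon distance (MNTD), and a standardized effect size (SES) from a table of pairwise phylogenetic distances. It is not the authors' code: the taxon names, the distance values, the helper names (`make_dist`, `mpd`, `mntd`, `ses`), and the null model (random draws of the same number of species from the pool) are all assumptions made for the example.

```python
import random
from itertools import combinations
from statistics import mean, stdev


def make_dist(pairs):
    """Build a symmetric nested dict from {(a, b): distance} entries."""
    dist = {}
    for (a, b), d in pairs.items():
        dist.setdefault(a, {})[b] = d
        dist.setdefault(b, {})[a] = d
    return dist


def mpd(dist, taxa):
    """Mean pairwise distance: average over all distinct pairs of taxa."""
    return mean(dist[a][b] for a, b in combinations(taxa, 2))


def mntd(dist, taxa):
    """Mean nearest taxon distance: average distance to the closest other taxon."""
    return mean(min(dist[a][b] for b in taxa if b != a) for a in taxa)


def ses(metric, dist, taxa, pool, n_null=999, seed=0):
    """Standardized effect size: (observed - mean(null)) / sd(null).

    The null model here (an assumption) draws len(taxa) species at random
    from the pool. A ZeroDivisionError is raised if all null values are equal.
    """
    rng = random.Random(seed)
    observed = metric(dist, taxa)
    null = [metric(dist, rng.sample(pool, len(taxa))) for _ in range(n_null)]
    return (observed - mean(null)) / stdev(null)


# Hypothetical distances for four taxa on a tree shaped (((A,B),C),D).
DIST = make_dist({
    ("A", "B"): 2.0, ("A", "C"): 6.0, ("B", "C"): 6.0,
    ("A", "D"): 8.0, ("B", "D"): 8.0, ("C", "D"): 8.0,
})
POOL = sorted(DIST)          # ["A", "B", "C", "D"]
COMMUNITY = ["A", "B", "C"]  # hypothetical assemblage

if __name__ == "__main__":
    print("MPD:", mpd(DIST, COMMUNITY))
    print("MNTD:", mntd(DIST, COMMUNITY))
    print("SES(MPD):", ses(mpd, DIST, COMMUNITY, POOL))
```

MPD averages over every pair and so weights deep splits, whereas MNTD looks only at each species' closest relative and so reflects structure near the tips; this is the sense in which the metrics emphasize different depths of evolutionary history.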
Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.
Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier
2014-01-01
Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, the sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) are the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene that has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationship using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.
On Tree-Based Phylogenetic Networks.
Zhang, Louxin
2016-07-01
A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posed by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.
Del Latte, Laura; Bortolin, Francesca; Rota-Stabelli, Omar; Fusco, Giuseppe; Bonato, Lucio
2015-01-01
Stenotaenia is one of the largest and most widespread genera of geophilid centipedes in the Western Palearctic, with a very uniform morphology and about fifteen species provisionally recognized. For a better understanding of Stenotaenia species-level taxonomy, we have explored the possibility of using molecular data. As a preliminary assay, we sampled twelve populations, mainly from the Italian region, and analyzed partial sequences of the two genes COI and 28S. We employed a DNA-barcoding approach, complemented by a phylogenetic analysis coupled with divergence time estimation. Assuming a barcoding gap of 10–16% K2P pairwise distances, we found evidence for the presence of at least six Stenotaenia species in the Italian region, which started diverging about 50 million years ago, only partially matching with previously recognized species. We found that small-sized oligopodous species belong to a single clade that originated about 33 million years ago, and obtained some preliminary evidence of the related genus Tuoba being nested within Stenotaenia. PMID:26257533
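The K2P pairwise distances mentioned in the abstract above refer to the Kimura two-parameter model, in which the distance between two aligned sequences is d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], with P the proportion of sites differing by a transition and Q the proportion differing by a transversion. The Python sketch below is a minimal illustration of that formula, not the authors' pipeline; the function name `k2p_distance`, the decision to skip sites with gaps or ambiguous bases, and the toy sequences are assumptions made for the example.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped
    (an assumption of this sketch). math.log raises ValueError when the
    sequences are too divergent for the formula to be defined.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")

    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in BASES or b not in BASES:
            continue
        compared += 1
        if a == b:
            continue
        # A <-> G or C <-> T is a transition; any other change is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1.0 - 2.0 * p - q) * math.sqrt(1.0 - 2.0 * q))


if __name__ == "__main__":
    # Toy alignment: one transition (A -> G) in ten sites.
    print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

Under a barcoding-gap criterion such as the one assumed in the abstract, pairs of specimens whose distance exceeds the chosen threshold would be treated as candidates for different species.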
Kimura, Yuri; Hawkins, Melissa T R; McDonough, Molly M; Jacobs, Louis L; Flynn, Lawrence J
2015-09-28
Time calibration derived from the fossil record is essential for molecular phylogenetic and evolutionary studies. Fossil mice and rats, discovered in the Siwalik Group of Pakistan, have served as one of the best-known fossil calibration points in molecular phylogenetic studies. Although these fossils have been widely used as the 12 Ma date for the Mus/Rattus split or a more basal split, conclusive paleontological evidence for the nodal assignments has been absent. This study analyzes newly recognized characters that demonstrate lineage separation in the fossil record of Siwalik murines and examines the most reasonable nodal placement of the diverging lineages in a molecular phylogenetic tree by ancestral state reconstruction. Our specimen-based approach strongly indicates that Siwalik murines of the Karnimata clade are fossil members of the Arvicanthini-Otomyini-Millardini clade, which excludes Rattus and its relatives. Combining the new interpretation with the widely accepted hypothesis that the Progonomys clade includes Mus, the lineage separation event in the Siwalik fossil record represents the Mus/Arvicanthis split. Our test analysis on Bayesian age estimates shows that this new calibration point provides more accurate estimates of murine divergence than previous applications. Thus, we define this fossil calibration point and refine two other fossil-based points for molecular dating. PMID:26411391
Bracken-Grissom, Heather D; Ahyong, Shane T; Wilkinson, Richard D; Feldmann, Rodney M; Schweitzer, Carrie E; Breinholt, Jesse W; Bendall, Matthew; Palero, Ferran; Chan, Tin-Yam; Felder, Darryl L; Robles, Rafael; Chu, Ka-Hou; Tsang, Ling-Ming; Kim, Dohyup; Martin, Joel W; Crandall, Keith A
2014-07-01
Lobsters are a ubiquitous and economically important group of decapod crustaceans that include the infraorders Polychelida, Glypheidea, Astacidea and Achelata. They include familiar forms such as the spiny, slipper, clawed lobsters and crayfish and unfamiliar forms such as the deep-sea and "living fossil" species. The high degree of morphological diversity among these infraorders has led to a dynamic classification and conflicting hypotheses of evolutionary relationships. In this study, we estimated phylogenetic relationships among the major groups of all lobster families and 94% of the genera using six genes (mitochondrial and nuclear) and 195 morphological characters across 173 species of lobsters for the most comprehensive sampling to date. Lobsters were recovered as a non-monophyletic assemblage in the combined (molecular + morphology) analysis. All families were monophyletic, with the exception of Cambaridae, and 7 of 79 genera were recovered as poly- or paraphyletic. A rich fossil history coupled with dense taxon coverage allowed us to estimate and compare divergence times and origins of major lineages using two drastically different approaches. Age priors were constructed and/or included based on fossil age information or fossil discovery, age, and extant species count data. Results from the two approaches were largely congruent across deep to shallow taxonomic divergences across major lineages. The origin of the first lobster-like decapod (Polychelida) was estimated in the Devonian (∼409-372 Ma) with all infraorders present in the Carboniferous (∼353-318 Ma). Fossil calibration subsampling studies examined the influence of sampling density (number of fossils) and placement (deep, middle, and shallow) on divergence time estimates. Results from our study suggest including at least 1 fossil per 10 operational taxonomic units (OTUs) in divergence dating analyses. [Dating; decapods; divergence; lobsters; molecular; morphology; phylogenetics.]. 
Kress, W John; Erickson, David L; Swenson, Nathan G; Thompson, Jill; Uriarte, Maria; Zimmerman, Jess K
2010-11-09
Species number, functional traits, and phylogenetic history all contribute to characterizing the biological diversity in plant communities. The phylogenetic component of diversity has been particularly difficult to quantify in species-rich tropical tree assemblages. The compilation of previously published (and often incomplete) data on evolutionary relationships of species into a composite phylogeny of the taxa in a forest, through such programs as Phylomatic, has proven useful in building community phylogenies, although often of limited resolution. Recently, DNA barcodes have been used to construct a robust community phylogeny for nearly 300 tree species in a forest dynamics plot in Panama using a supermatrix method. In that study, sequence data from three barcode loci were used to generate a well-resolved species-level phylogeny. Here we expand upon this earlier investigation and present results on the use of a phylogenetic constraint tree to generate a community phylogeny for a diverse, tropical forest dynamics plot in Puerto Rico. This enhanced method of phylogenetic reconstruction ensures the congruence of the barcode phylogeny with broadly accepted hypotheses on the phylogeny of flowering plants (i.e., APG III) regardless of the number and taxonomic breadth of the taxa sampled. We also compare maximum parsimony versus maximum likelihood estimates of community phylogenetic relationships as well as evaluate the effectiveness of one- versus two- versus three-gene barcodes in resolving community evolutionary history. As first demonstrated in the Panamanian forest dynamics plot, the results for the Puerto Rican plot illustrate that highly resolved phylogenies derived from DNA barcode sequence data combined with a constraint tree based on APG III are particularly useful in comparative analysis of phylogenetic diversity and will enhance research on the interface between community ecology and evolution.
Kumar, Girish; Kocour, Martin; Kunal, Swaraj Priyaranjan
2016-05-01
In order to assess the DNA sequence variation and phylogenetic relationships among five tuna species (Auxis thazard, Euthynnus affinis, Katsuwonus pelamis, Thunnus tonggol, and T. albacares) representing all four tuna genera, partial sequences of the mitochondrial DNA (mtDNA) D-loop region were analyzed. The estimate of intra-specific sequence variation in the studied species was low, ranging from 0.027 to 0.080 [Kimura's two-parameter distance (K2P)], whereas values of inter-specific variation ranged from 0.049 to 0.491. The longtail tuna (T. tonggol) and yellowfin tuna (T. albacares) were found to share a close relationship (K2P = 0.049), while skipjack tuna (K. pelamis) was the most divergent species studied. Phylogenetic analysis using Maximum-Likelihood (ML) and Neighbor-Joining (NJ) methods supported the monophyletic origin of Thunnus species. Similarly, the phylogeny of Auxis and Euthynnus species substantiates their monophyly. However, results showed a distinct origin of K. pelamis from genus Thunnus as well as Auxis and Euthynnus. Thus, the mtDNA D-loop region sequence data support the polyphyletic origin of tuna species.
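The K2P values quoted in the abstract above come from Kimura's two-parameter model, which corrects the observed proportions of transitions (P) and transversions (Q) between two aligned sequences as d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The following is a minimal sketch of that calculation, not the authors' pipeline. The function name `k2p_distance` and the choice to skip any site containing a gap or ambiguity code are assumptions made for illustration.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences.

    Hypothetical helper: sites where either sequence has a gap or an
    ambiguity code are skipped (an assumption, not from the source).
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; everything else is a transversion
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared    # proportion of transitions
    q = transversions / compared  # proportion of transversions
    # math.log raises ValueError if the sequences are too divergent
    # (saturated) for the correction to be defined.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))
```

A single transition in ten sites gives P = 0.1 and Q = 0, so `k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")` evaluates -0.5 ln(0.8), slightly more than the uncorrected difference of 0.1.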
Schweizer, Manuel; Ayé, Raffael; Kashkarov, Roman; Roth, Tobias
2014-01-01
Although phylogenetic diversity has been suggested to be relevant from a conservation point of view, its role is still limited in applied nature conservation. Recently, the practice of investing conservation resources based on threatened species was identified as a reason for the slow integration of phylogenetic diversity in nature conservation planning. One of the main arguments is based on the observation that threatened species are not evenly distributed over the phylogenetic tree. However this argument seems to dismiss the fact that conservation action is a spatially explicit process, and even if threatened species are not evenly distributed over the phylogenetic tree, the occurrence of threatened species could still indicate areas with above average phylogenetic diversity and consequently could protect phylogenetic diversity. Here we aim to study the selection of important bird areas in Central Asia, which were nominated largely based on the presence of threatened bird species. We show that although threatened species occurring in Central Asia do not capture phylogenetically more distinct species than expected by chance, the current spatially explicit conservation approach of selecting important bird areas covers above average taxonomic and phylogenetic diversity of breeding and wintering birds. We conclude that the spatially explicit processes of conservation actions need to be considered in the current discussion of whether new prioritization methods are needed to complement conservation action based on threatened species. PMID:25337861
Phylogenetic Analyses: A Toolbox Expanding towards Bayesian Methods
Aris-Brosou, Stéphane; Xia, Xuhua
2008-01-01
The reconstruction of phylogenies is becoming an increasingly simple activity. This is mainly due to two reasons: the democratization of computing power and the increased availability of sophisticated yet user-friendly software. This review describes some of the latest additions to the phylogenetic toolbox, along with some of their theoretical and practical limitations. It is shown that Bayesian methods are under heavy development, as they offer the possibility to solve a number of long-standing issues and to integrate several steps of the phylogenetic analyses into a single framework. Specific topics include not only phylogenetic reconstruction, but also the comparison of phylogenies, the detection of adaptive evolution, and the estimation of divergence times between species. PMID:18483574
Le Vu, Stéphane; Ratmann, Oliver; Delpech, Valerie; Brown, Alison E; Gill, O Noel; Tostevin, Anna; Fraser, Christophe; Volz, Erik M
2018-06-01
Phylogenetic clustering of HIV sequences from a random sample of patients can reveal epidemiological transmission patterns, but interpretation is hampered by limited theoretical support, and the statistical properties of clustering analysis remain poorly understood. Alternatively, source attribution methods allow fitting of HIV transmission models and thereby quantify aspects of disease transmission. A simulation study was conducted to assess error rates of clustering methods for detecting transmission risk factors. We modeled HIV epidemics among men having sex with men and generated phylogenies comparable to those that can be obtained from HIV surveillance data in the UK. Clustering and source attribution approaches were applied to evaluate their ability to identify patient attributes as transmission risk factors. We find that commonly used methods show a misleading association between cluster size or odds of clustering and covariates that are correlated with time since infection, regardless of their influence on transmission. Clustering methods usually have higher error rates and lower sensitivity than source attribution methods for identifying transmission risk factors. However, neither method provides robust estimates of transmission risk ratios. Source attribution methods can alleviate drawbacks of phylogenetic clustering, but formal population genetic modeling may be required to estimate quantitative transmission risk factors.
Devitt, Thomas J
2006-12-01
The Western Lyresnake (Trimorphodon biscutatus) is a widespread, polytypic taxon inhabiting arid regions from the warm deserts of the southwestern United States southward along the Pacific versant of Mexico to the tropical deciduous forests of Mesoamerica. This broadly distributed species provides a unique opportunity to evaluate a priori biogeographical hypotheses spanning two major distinct biogeographical realms (the Nearctic and Neotropical) that are usually treated separately in phylogeographical analyses. I investigated the phylogeography of T. biscutatus using maximum likelihood and Bayesian phylogenetic analysis of mitochondrial DNA (mtDNA) from across this species' range. Phylogenetic analyses recovered five well-supported clades whose boundaries are concordant with existing geographical barriers, a pattern consistent with a model of vicariant allopatric divergence. Assuming a vicariance model, divergence times between mitochondrial lineages were estimated using Bayesian relaxed molecular clock methods calibrated using geological information from putative vicariant events. Divergence time point estimates were bounded by broad confidence intervals, and thus these highly conservative estimates should be considered tentative hypotheses at best. Comparison of mtDNA lineages and taxa traditionally recognized as subspecies based on morphology suggests that this taxon comprises multiple independent lineages at various stages of divergence, ranging from putative secondary contact and hybridization to sympatry of 'subspecies'.
Resolving Recent Plant Radiations: Power and Robustness of Genotyping-by-Sequencing.
Fernández-Mazuecos, Mario; Mellers, Greg; Vigalondo, Beatriz; Sáez, Llorenç; Vargas, Pablo; Glover, Beverley J
2018-03-01
Disentangling species boundaries and phylogenetic relationships within recent evolutionary radiations is a challenge due to the poor morphological differentiation and low genetic divergence between species, frequently accompanied by phenotypic convergence, interspecific gene flow and incomplete lineage sorting. Here we employed a genotyping-by-sequencing (GBS) approach, in combination with morphometric analyses, to investigate a small western Mediterranean clade in the flowering plant genus Linaria that radiated in the Quaternary. After confirming the morphological and genetic distinctness of eight species, we evaluated the relative performances of concatenation and coalescent methods to resolve phylogenetic relationships. Specifically, we focused on assessing the robustness of both approaches to variations in the parameter used to estimate sequence homology (clustering threshold). Concatenation analyses suffered from strong systematic bias, as revealed by the high statistical support for multiple alternative topologies depending on clustering threshold values. By contrast, topologies produced by two coalescent-based methods (NJst, SVDquartets) were robust to variations in the clustering threshold. Reticulate evolution may partly explain incongruences between NJst, SVDquartets and concatenated trees. Integration of morphometric and coalescent-based phylogenetic results revealed (i) extensive morphological divergence associated with recent splits between geographically close or sympatric sister species and (ii) morphological convergence in geographically disjunct species. These patterns are particularly true for floral traits related to pollinator specialization, including nectar spur length, tube width and corolla color, suggesting pollinator-driven diversification. Given its relatively simple and inexpensive implementation, GBS is a promising technique for the phylogenetic and systematic study of recent radiations, but care must be taken to evaluate the robustness of results to variation of data assembly parameters.
Ron, Santiago R; Mueses-Cisneros, Jonh Jairo; Gutiérrez-Cárdenas, Paul David Alfonso; Rojas-Rivera, Alejandra; Lynch, Ryan L; Rocha, Carlos F Duarte; Galarza, Gabriela
2015-04-16
Bufonidae is one of the most diverse amphibian families. Its large-scale phylogenetic relationships are relatively well understood, with the exception of a few Neotropical genera that may have diverged early in the evolution of the family. One of those genera is Andinophryne, a poorly known group of three toad species distributed on the western slopes of the Andes of northern Ecuador and southern Colombia. Their phylogenetic position is unknown due to a lack of genetic data. We estimated a new phylogeny (over 200 species) of the family Bufonidae based on DNA sequences of mitochondrial and nuclear genes to assess the phylogenetic position of Andinophryne based on recently collected specimens of A. colomai and A. olallai from Ecuador and Colombia. We also examined external and internal morphology of Andinophryne to explore its congruence with the new phylogeny. The mtDNA and nuclear phylogenies show that Andinophryne is embedded within Rhaebo, a genus that belongs to a large clade characterized by the presence of parotoid glands. Morphological characters confirmed the affinity of Andinophryne to Rhaebo and a close relationship between Andinophryne colomai and Andinophryne olallai. Rhaebo was paraphyletic relative to Andinophryne, and to solve this problem we synonymize Andinophryne under Rhaebo. We discuss putative morphological synapomorphies for Rhaebo including Andinophryne. We provide species accounts for R. atelopoides new comb., R. colomai new comb. and R. olallai new comb., including assessments of their conservation status. We suggest that the three species are Critically Endangered. Their altitudinal distribution and association with streams are characteristic of endangered Andean amphibians.
Hernández-León, Sergio; Gernandt, David S.; Pérez de la Rosa, Jorge A.; Jardón-Barbolla, Lev
2013-01-01
Recent diversification followed by secondary contact and hybridization may explain complex patterns of intra- and interspecific morphological and genetic variation in the North American hard pines (Pinus section Trifoliae), a group of approximately 49 tree species distributed in North and Central America and the Caribbean islands. We concatenated five plastid DNA markers for an average of 3.9 individuals per putative species and assessed the suitability of the five regions as DNA bar codes for species identification, species delimitation, and phylogenetic reconstruction. The ycf1 gene accounted for the greatest proportion of the alignment (46.9%), the greatest proportion of variable sites (74.9%), and the most unique sequences (75 haplotypes). Phylogenetic analysis recovered clades corresponding to subsections Australes, Contortae, and Ponderosae. Sequences for 23 of the 49 species were monophyletic and sequences for another 9 species were paraphyletic. Morphologically similar species within subsections usually grouped together, but there were exceptions consistent with incomplete lineage sorting or introgression. Bayesian relaxed molecular clock analyses indicated that all three subsections diversified relatively recently during the Miocene. The general mixed Yule-coalescent method gave a mixed model estimate of only 22 or 23 evolutionary entities for the plastid sequences, which corresponds to less than half the 49 species recognized based on morphological species assignments. Including more unique haplotypes per species may result in higher estimates, but low mutation rates, recent diversification, and large effective population sizes may limit the effectiveness of this method to detect evolutionary entities. PMID:23936218
A global perspective on Campanulaceae: Biogeographic, genomic, and floral evolution.
Crowl, Andrew A; Miles, Nicholas W; Visger, Clayton J; Hansen, Kimberly; Ayers, Tina; Haberle, Rosemarie; Cellinese, Nico
2016-02-01
The Campanulaceae are a diverse clade of flowering plants encompassing more than 2300 species in myriad habitats from tropical rainforests to arctic tundra. A robust, multigene phylogeny, including all major lineages, is presented to provide a broad, evolutionary perspective of this cosmopolitan clade. We used a phylogenetic framework, in combination with divergence dating, ancestral range estimation, chromosome modeling, and morphological character reconstruction analyses to infer phylogenetic placement and timing of major biogeographic, genomic, and morphological changes in the history of the group and provide insights into the diversification of this clade across six continents. Ancestral range estimation supports an out-of-Africa diversification following the Cretaceous-Tertiary extinction event. Chromosomal modeling, with corroboration from the distribution of synonymous substitutions among gene duplicates, provides evidence for as many as 20 genome-wide duplication events before large radiations. Morphological reconstructions support the hypothesis that switches in floral symmetry and anther dehiscence were important in the evolution of secondary pollen presentation mechanisms. This study provides a broad, phylogenetic perspective on the evolution of the Campanulaceae clade. The remarkable habitat diversity and cosmopolitan distribution of this lineage appear to be the result of a complex history of genome duplications and numerous long-distance dispersal events. We failed to find evidence for an ancestral polyploidy event for this clade, and our analyses indicate an ancestral base number of nine for the group. This study will serve as a framework for future studies in diverse areas of research in Campanulaceae.
Time and Origin of Cichlid Colonization of the Lower Congo Rapids
Schwarzer, Julia; Misof, Bernhard; Ifuta, Seraphin N.; Schliewen, Ulrich K.
2011-01-01
Most freshwater diversity is arguably located in networks of rivers and streams, but, in contrast to lacustrine systems, riverine radiations are largely understudied. The extensive rapids of the lower Congo River are one of the few river stretches inhabited by a locally endemic cichlid species flock as well as several species pairs, for which we provide evidence that they have radiated in situ. We use more than 2,000 AFLP markers as well as multilocus sequence datasets to reconstruct the origin, phylogenetic history, and timing of colonization and speciation of two Lower Congo cichlid genera, Steatocranus and Nanochromis. Based on a representative taxon sampling and well-resolved phylogenetic hypotheses, we demonstrate that a high level of riverine diversity originated in the lower Congo within roughly the last 5 million years, which is concordant with age estimates for the hydrological origin of the modern lower Congo River. A spatial genetic structure is present in all widely distributed lineages, corresponding to a trisection of the lower Congo River into major biogeographic areas, each with locally endemic species assemblages. With the present study, we provide a phylogenetic framework for a complex system that may serve as a link between African riverine cichlid diversity and the megadiverse cichlid radiations of the East African lakes. Beyond this, we give for the first time a biologically estimated age for the origin of the lower Congo River rapids, one of the most extreme freshwater habitats on earth. PMID:21799840
The problem and promise of scale dependency in community phylogenetics.
Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K
2006-10-01
The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.
Estimating Bayesian Phylogenetic Information Content
Lewis, Paul O.; Chen, Ming-Hui; Kuo, Lynn; Lewis, Louise A.; Fučíková, Karolina; Neupane, Suman; Wang, Yu-Bo; Shi, Daoyuan
2016-01-01
Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.] PMID:27155008
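The entropy comparison described in the abstract above can be illustrated with a small calculation: information is the prior entropy minus the posterior entropy over tree topologies. This is a hedged sketch only. It assumes a discrete uniform prior over unrooted binary topologies, of which there are (2n - 5)!! for n taxa, and it takes posterior topology probabilities as given (for example, sample frequencies from an MCMC run). The function names are hypothetical, and the sketch does not use the conditional clade distribution that the authors report as more accurate and efficient.

```python
import math

def num_unrooted_topologies(n_taxa):
    """Number of unrooted binary topologies, (2n - 5)!!, for n >= 3 taxa."""
    if n_taxa < 3:
        raise ValueError("need at least 3 taxa")
    count = 1
    for k in range(3, 2 * n_taxa - 4, 2):  # 3 * 5 * ... * (2n - 5)
        count *= k
    return count

def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def topology_information(posterior_probs, n_taxa):
    """Return (information in nats, fraction of the maximum possible).

    Assumes a uniform prior over topologies (an illustrative assumption),
    so the prior entropy is log of the number of topologies.
    """
    if not math.isclose(sum(posterior_probs), 1.0, rel_tol=1e-9):
        raise ValueError("posterior probabilities must sum to 1")
    prior_entropy = math.log(num_unrooted_topologies(n_taxa))
    info = prior_entropy - entropy(posterior_probs)
    if prior_entropy == 0:
        return info, 0.0  # only one possible topology, nothing to learn
    return info, info / prior_entropy
```

With four taxa there are three possible topologies. A posterior that matches the flat prior yields zero information, while a posterior that puts all its mass on one topology yields the maximum, ln 3 nats.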
On the quirks of maximum parsimony and likelihood on phylogenetic networks.
Bryant, Christopher; Fischer, Mareike; Linz, Simone; Semple, Charles
2017-03-21
Maximum parsimony is one of the most frequently discussed tree reconstruction methods in phylogenetic estimation. However, in recent years it has become increasingly apparent that phylogenetic trees are often not sufficient to describe evolution accurately. For instance, processes like hybridization or lateral gene transfer that are commonplace in many groups of organisms and result in mosaic patterns of relationships cannot be represented by a single phylogenetic tree. This is why phylogenetic networks, which can display such events, are of growing interest in phylogenetic research. It is therefore necessary to extend concepts like maximum parsimony from phylogenetic trees to networks. Several suggestions for possible extensions can be found in recent literature, for instance the softwired and the hardwired parsimony concepts. In this paper, we analyze the so-called big parsimony problem under these two concepts, i.e. we investigate maximum parsimonious networks and analyze their properties. In particular, we show that finding a softwired maximum parsimony network is possible in polynomial time. We also show that the set of maximum parsimony networks for the hardwired definition always contains at least one phylogenetic tree. Lastly, we investigate some parallels of parsimony to different likelihood concepts on phylogenetic networks.
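For readers unfamiliar with the tree concept that the abstract above extends to networks, the parsimony score of a single character on a fixed rooted binary tree can be computed with Fitch's algorithm. The sketch below covers only this tree case. It does not implement the softwired or hardwired network definitions discussed in the paper, and the nested-tuple tree encoding and the name `fitch_score` are illustrative assumptions.

```python
def fitch_score(tree, states):
    """Minimum number of state changes for one character on a binary tree.

    tree   -- leaf name (str) or a 2-tuple of subtrees (assumed encoding)
    states -- dict mapping each leaf name to its observed character state
    """
    def visit(node):
        if isinstance(node, str):
            return {states[node]}, 0  # a leaf: its state, no changes yet
        left, right = node
        left_set, left_cost = visit(left)
        right_set, right_cost = visit(right)
        shared = left_set & right_set
        if shared:
            # Children agree on at least one state: no extra change needed
            return shared, left_cost + right_cost
        # Children disagree: take the union and count one change
        return left_set | right_set, left_cost + right_cost + 1

    return visit(tree)[1]
```

For leaf states A = 0, B = 0, C = 1, D = 1, the tree `(("A", "B"), ("C", "D"))` needs one change, whereas `(("A", "C"), ("B", "D"))` needs two, so parsimony prefers the first topology for this character.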
Li, Min; Tian, Ying; Zhao, Ying; Bu, Wenjun
2012-01-01
Heteroptera, or true bugs, are the largest group of insects with incomplete metamorphosis, and are morphologically diverse and economically important. However, the phylogenetic relationships within Heteroptera are still in dispute, and most of the previous studies were based on morphological characters or on a single gene (partial or whole 18S rDNA). Moreover, divergence time estimates for Heteroptera have so far relied entirely on the fossil record, while no studies have been performed on molecular divergence rates. Here, for the first time, we used maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) with multiple genes (18S rDNA, 28S rDNA, 16S rDNA and COI) to estimate phylogenetic relationships among the infraorders, and the Penalized Likelihood (r8s) and Bayesian (BEAST) molecular dating methods were employed to estimate divergence times of higher taxa of this suborder. Major results of the present study included: Nepomorpha was placed as the most basal clade in all six trees (MP trees, ML trees and Bayesian trees of nuclear gene data and four-gene combined data, respectively) with full support values. The sister-group relationship of Cimicomorpha and Pentatomomorpha was also strongly supported. Nepomorpha originated in the early Triassic and the other six infraorders originated in a very short period of time in the middle Triassic. Cimicomorpha and Pentatomomorpha underwent a radiation at family level in the Cretaceous, paralleling the proliferation of the flowering plants. Our results indicated that the higher-group radiations within hemimetabolous Heteroptera were simultaneous with those of holometabolous Coleoptera and Diptera, which took place in the Triassic. While the aquatic habitat was colonized by Nepomorpha already in the Triassic, the Gerromorpha independently adapted to the semi-aquatic habitat in the Early Jurassic. PMID:22384163
Fossil butterflies, calibration points and the molecular clock (Lepidoptera: Papilionoidea).
Jong, Rienk DE
2017-05-25
Fossil butterflies are extremely rare. Yet, they are the only direct evidence of the first appearance of particular characters and as such, they are crucial for calibrating a molecular clock, from which divergence ages are estimated. In turn, these estimates, in combination with paleogeographic information, are most important in paleobiogeographic considerations. The key issue here is the correct allocation of fossils on the phylogenetic tree from which the molecular clock is calibrated. The allocation of a fossil on a tree should be based on an apomorphic character found in a tree based on extant species, similar to the allocation of a new extant species. In practice, the latter is not done, at least not explicitly, on the basis of apomorphy, but rather on overall similarity or on a phylogenetic analysis, which is not possible for most butterfly fossils since they usually are very fragmentary. Characters most often preserved are in the venation of the wings. Therefore, special attention is given to possible apomorphies in venational characters in extant butterflies. For estimation of divergence times, not only the correct allocation of the fossil on the tree is important, but also the tree itself influences the outcome as well as the correct determination of the age of the fossil. These three aspects are discussed. All known butterfly fossils, consisting of 49 taxa, are critically reviewed and their relationship to extant taxa is discussed as an aid for correctly calibrating a molecular clock for papilionoid Lepidoptera. In this context some aspects of age estimation and biogeographic conclusions are briefly mentioned in review. Specific information has been summarized in four appendices.
Zhao, Ying; Bu, Wenjun
2012-01-01
Heteroptera, or true bugs, are the largest, morphologically diverse and economically important group of insects with incomplete metamorphosis. However, the phylogenetic relationships within Heteroptera are still in dispute and most of the previous studies were based on morphological characters or with single gene (partial or whole 18S rDNA). Besides, so far, divergence time estimates for Heteroptera totally rely on the fossil record, while no studies have been performed on molecular divergence rates. Here, for the first time, we used maximum parsimony (MP), maximum likelihood (ML) and Bayesian inference (BI) with multiple genes (18S rDNA, 28S rDNA, 16S rDNA and COI) to estimate phylogenetic relationships among the infraorders, and meanwhile, the Penalized Likelihood (r8s) and Bayesian (BEAST) molecular dating methods were employed to estimate divergence time of higher taxa of this suborder. Major results of the present study included: Nepomorpha was placed as the most basal clade in all six trees (MP trees, ML trees and Bayesian trees of nuclear gene data and four-gene combined data, respectively) with full support values. The sister-group relationship of Cimicomorpha and Pentatomomorpha was also strongly supported. Nepomorpha originated in early Triassic and the other six infraorders originated in a very short period of time in middle Triassic. Cimicomorpha and Pentatomomorpha underwent a radiation at family level in Cretaceous, paralleling the proliferation of the flowering plants. Our results indicated that the higher-group radiations within hemimetabolous Heteroptera were simultaneous with those of holometabolous Coleoptera and Diptera which took place in the Triassic. While the aquatic habitat was colonized by Nepomorpha already in the Triassic, the Gerromorpha independently adapted to the semi-aquatic habitat in the Early Jurassic. PMID:22384163
Betancur-R, Ricardo; Ortí, Guillermo; Pyron, Robert Alexander
2015-05-01
The marine-freshwater boundary is a major biodiversity gradient and few groups have colonised both systems successfully. Fishes have transitioned between habitats repeatedly, diversifying in rivers, lakes and oceans over evolutionary time. However, their history of habitat colonisation and diversification is unclear based on available fossil and phylogenetic data. We estimate ancestral habitats and diversification and transition rates using a large-scale phylogeny of extant fish taxa and one containing a massive number of extinct species. Extant-only phylogenetic analyses indicate freshwater ancestry, but inclusion of fossils reveal strong evidence of marine ancestry in lineages now restricted to freshwaters. Diversification and colonisation dynamics vary asymmetrically between habitats, as marine lineages colonise and flourish in rivers more frequently than the reverse. Our study highlights the importance of including fossils in comparative analyses, showing that freshwaters have played a role as refuges for ancient fish lineages, a signal erased by extinction in extant-only phylogenies. © 2015 John Wiley & Sons Ltd/CNRS.
SpreaD3: Interactive Visualization of Spatiotemporal History and Trait Evolutionary Processes.
Bielejec, Filip; Baele, Guy; Vrancken, Bram; Suchard, Marc A; Rambaut, Andrew; Lemey, Philippe
2016-08-01
Model-based phylogenetic reconstructions increasingly consider spatial or phenotypic traits in conjunction with sequence data to study evolutionary processes. Alongside parameter estimation, visualization of ancestral reconstructions represents an integral part of these analyses. Here, we present a complete overhaul of the spatial phylogenetic reconstruction of evolutionary dynamics software, now called SpreaD3 to emphasize the use of data-driven documents, as an analysis and visualization package that primarily complements Bayesian inference in BEAST (http://beast.bio.ed.ac.uk, last accessed 9 May 2016). The integration of JavaScript D3 libraries (www.d3.org, last accessed 9 May 2016) offers novel interactive web-based visualization capacities that are not restricted to spatial traits and extend to any discrete or continuously valued trait for any organism of interest. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Turk, Teja; Bachmann, Nadine; Kadelka, Claus; Böni, Jürg; Yerly, Sabine; Aubert, Vincent; Klimkait, Thomas; Battegay, Manuel; Bernasconi, Enos; Calmy, Alexandra; Cavassini, Matthias; Furrer, Hansjakob; Hoffmann, Matthias; Aubert, V; Battegay, M; Bernasconi, E; Böni, J; Braun, DL; Bucher, HC; Calmy, A; Cavassini, M; Ciuffi, A; Dollenmaier, G; Egger, M; Elzi, L; Fehr, J; Fellay, J; Furrer, H; Fux, CA; Günthard, HF; Haerry, D; Hasse, B; Hirsch, HH; Hoffmann, M; Hösli, I; Kahlert, C; Kaiser, L; Keiser, O; Klimkait, T; Kouyos, RD; Kovari, H; Ledergerber, B; Martinetti, G; Martinez de Tejada, B; Marzolini, C; Metzner, KJ; Müller, N; Nicca, D; Pantaleo, G; Paioni, P; Rauch, A; Rudin, C; Scherrer, AU; Schmid, P; Speck, R; Stöckle, M; Tarr, P; Trkola, A; Vernazza, P; Wandeler, G; Weber, R; Yerly, S
2017-01-01
Assessing the danger of transition of HIV transmission from a concentrated to a generalized epidemic is of major importance for public health. In this study, we develop a phylogeny-based statistical approach to address this question. As a case study, we use this to investigate the trends and determinants of HIV transmission among Swiss heterosexuals. We extract the corresponding transmission clusters from a phylogenetic tree. To capture the incomplete sampling, the delayed introduction of imported infections to Switzerland, and potential factors associated with basic reproductive number R0, we extend the branching process model to infer transmission parameters. Overall, the R0 is estimated to be 0.44 (95%-confidence interval 0.42–0.46) and it is decreasing by 11% per 10 years (4%–17%). Our findings indicate rather diminishing HIV transmission among Swiss heterosexuals far below the epidemic threshold. Generally, our approach allows the danger of self-sustained epidemics to be assessed from any viral sequence data. PMID:28895527
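To make the link between a reproductive number below one and small transmission clusters concrete, here is a minimal sketch of a plain subcritical branching process. It assumes Poisson-distributed secondary infections and a single index case per cluster; the extensions described in the abstract (incomplete sampling, delayed introduction of imported infections, covariates on R0) are not reproduced. All function names are hypothetical. For such a process the expected total cluster size is 1 / (1 - R0).

```python
import math
import random


def expected_cluster_size(r0):
    """Mean total size of a cluster started by one case, valid for 0 <= r0 < 1."""
    if not 0 <= r0 < 1:
        raise ValueError("formula holds only for subcritical processes")
    return 1.0 / (1.0 - r0)


def poisson(rng, lam):
    """Draw a Poisson variate by multiplying uniforms (Knuth's method)."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def simulate_cluster_size(rng, r0):
    """Simulate one cluster, generation by generation, until it dies out."""
    size, active = 1, 1
    while active:
        new_cases = sum(poisson(rng, r0) for _ in range(active))
        size += new_cases
        active = new_cases
    return size


rng = random.Random(1)
sizes = [simulate_cluster_size(rng, 0.44) for _ in range(20000)]
print(expected_cluster_size(0.44), sum(sizes) / len(sizes))
```

With the reported estimate of 0.44, the formula gives an average of fewer than two cases per introduction, which is the sense in which transmission is far below the epidemic threshold.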
Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis
2016-09-02
Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, along with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals.
These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal and could be useful in guiding the choice of phylogenetic markers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Phylogenomic analysis of Apoidea sheds new light on the sister group of bees.
Sann, Manuela; Niehuis, Oliver; Peters, Ralph S; Mayer, Christoph; Kozlov, Alexey; Podsiadlowski, Lars; Bank, Sarah; Meusemann, Karen; Misof, Bernhard; Bleidorn, Christoph; Ohl, Michael
2018-05-18
Apoid wasps and bees (Apoidea) are an ecologically and morphologically diverse group of Hymenoptera, with some species of bees having evolved eusocial societies. Major problems for our understanding of the evolutionary history of Apoidea have been the difficulty to trace the phylogenetic origin and to reliably estimate the geological age of bees. To address these issues, we compiled a comprehensive phylogenomic dataset by simultaneously analyzing target DNA enrichment and transcriptomic sequence data, comprising 195 single-copy protein-coding genes and covering all major lineages of apoid wasps and bee families. Our compiled data matrix comprised 284,607 nucleotide sites that we phylogenetically analyzed by applying a combination of domain- and codon-based partitioning schemes. The inferred results confirm the polyphyletic status of the former family "Crabronidae", which comprises nine major monophyletic lineages. We found the former subfamily Pemphredoninae to be polyphyletic, comprising three distantly related clades. One of them, Ammoplanina, constituted the sister group of bees in all our analyses. We estimate the origin of bees to be in the Early Cretaceous (ca. 128 million years ago), a time period during which angiosperms rapidly radiated. Finally, our phylogenetic analyses revealed that within the Apoidea, (eu)social societies evolved exclusively in a single clade that comprises pemphredonine and philanthine wasps as well as bees. By combining transcriptomic sequences with those obtained via target DNA enrichment, we were able to include an unprecedentedly large number of apoid wasps in a phylogenetic study for tracing the phylogenetic origin of bees. Our results confirm the polyphyletic nature of the former wasp family Crabronidae, which we here suggest splitting into eight families. Of these, the family Ammoplanidae possibly represents the extant sister lineage of bees.
Species of Ammoplanidae are known to hunt thrips, of which some aggregate on flowers and feed on pollen. The specific biology of Ammoplanidae as predators indicates how the transition from a predatory to pollen-collecting life style could have taken place in the evolution of bees. This insight plus the finding that (eu)social societies evolved exclusively in a single subordinated lineage of apoid wasps provides new perspectives for future comparative studies.
Open Reading Frame Phylogenetic Analysis on the Cloud
2013-01-01
Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843
Herbei, Radu; Kubatko, Laura
2013-03-26
Markov chains are widely used for modeling in many areas of molecular biology and genetics. As the complexity of such models advances, it becomes increasingly important to assess the rate at which a Markov chain converges to its stationary distribution in order to carry out accurate inference. A common measure of convergence to the stationary distribution is the total variation distance, but this measure can be difficult to compute when the state space of the chain is large. We propose a Monte Carlo method to estimate the total variation distance that can be applied in this situation, and we demonstrate how the method can be efficiently implemented by taking advantage of GPU computing techniques. We apply the method to two Markov chains on the space of phylogenetic trees, and discuss the implications of our findings for the development of algorithms for phylogenetic inference.
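The idea of estimating total variation distance by sampling can be illustrated on a small discrete state space. The sketch below uses the identity TV(P, Q) = E_P[max(0, 1 - q(X)/p(X))], which follows from summing p(x) - q(x) over the states where p exceeds q. It assumes both probability mass functions can be evaluated exactly, which may not match the estimator proposed in the paper; the function names and the dictionary representation are hypothetical, and no GPU acceleration is attempted.

```python
import random


def tv_exact(p, q):
    """Exact total variation distance between two discrete distributions."""
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(x, 0.0) - q.get(x, 0.0)) for x in support)


def tv_monte_carlo(p, q, n_samples, seed=0):
    """Estimate TV(P, Q) by averaging max(0, 1 - q(x)/p(x)) over draws from P."""
    rng = random.Random(seed)
    states = list(p)
    weights = [p[x] for x in states]
    draws = rng.choices(states, weights=weights, k=n_samples)
    # States with p(x) = 0 are never drawn, so the division is safe.
    total = sum(max(0.0, 1.0 - q.get(x, 0.0) / p[x]) for x in draws)
    return total / n_samples


p = {"tree_1": 0.5, "tree_2": 0.5}
q = {"tree_1": 0.9, "tree_2": 0.1}
print(tv_exact(p, q), tv_monte_carlo(p, q, 20000))
```

The exact sum needs one term per state, which is infeasible when the states are tree topologies on many taxa, whereas the sampling estimate only needs draws from P and pointwise evaluations.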
Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D
2009-05-13
The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins, EF-1 and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3-35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7-13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5-26.1 Mya).
Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84-113 million years for the divergence of all butterfly families. These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation.
Reconstruction of phylogenetic trees of prokaryotes using maximal common intervals.
Heydari, Mahdi; Marashi, Sayed-Amir; Tusserkani, Ruzbeh; Sadeghi, Mehdi
2014-10-01
One of the fundamental problems in bioinformatics is phylogenetic tree reconstruction, which can be used for classifying living organisms into different taxonomic clades. The classical approach to this problem is based on a marker such as 16S ribosomal RNA. Since evolutionary events like genomic rearrangements are not included in reconstructions of phylogenetic trees based on single genes, much effort has been made to find other characteristics for phylogenetic reconstruction in recent years. With the increasing availability of completely sequenced genomes, gene order can be considered as a new solution for this problem. In the present work, we applied maximal common intervals (MCIs) in two or more genomes to infer their distance and to reconstruct their evolutionary relationship. Additionally, measures based on uncommon segments (UCS's), i.e., those genomic segments which are not detected as part of any of the MCIs, are also used for phylogenetic tree reconstruction. We applied these two types of measures for reconstructing the phylogenetic tree of 63 prokaryotes with known COG (clusters of orthologous groups) families. Similarity between the MCI-based (resp. UCS-based) reconstructed phylogenetic trees and the phylogenetic tree obtained from NCBI taxonomy browser is as high as 93.1% (resp. 94.9%). We show that in the case of this diverse dataset of prokaryotes, tree reconstruction based on MCI and UCS outperforms most of the currently available methods based on gene orders, including breakpoint distance and DCJ. We additionally tested our new measures on a dataset of 13 closely-related bacteria from the genus Prochlorococcus. In this case, distances like rearrangement distance, breakpoint distance and DCJ proved to be useful, while our new measures are still appropriate for phylogenetic reconstruction. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Rocha, Amanda V.; Rivera, Luis O.; Martinez, Jaime; Prestes, Nêmora P.; Caparroz, Renato
2014-01-01
Coalescent theory provides powerful models for population genetic inference and is now increasingly important in estimates of divergence times and speciation research. We use molecular data and methods based on coalescent theory to investigate whether genetic evidence supports the hypothesis of A. pretrei and A. tucumana as separate species and whether genetic data allow us to assess which allopatric model seems to better explain the diversification process in these taxa. We sampled 13 A. tucumana from two provinces in northern Argentina and 28 A. pretrei from nine localities of Rio Grande do Sul, Brazil. A 491 bp segment of the mitochondrial gene cytochrome c oxidase I was evaluated using the haplotype network and phylogenetic methods. The divergence time and other demographic quantities were estimated using the isolation and migration model based on coalescent theory. The network and phylogenetic reconstructions showed similar results, supporting reciprocal monophyly for these two taxa. The divergence time of lineage separation was estimated to be approximately 1.3 million years ago, which corresponds to the lower Pleistocene. Our results reinforce the current taxonomic status for these two Amazon species. They also support that A. pretrei and A. tucumana diverged with little or no gene flow approximately 1.3 million years ago, most likely after the establishment of a small population in the Southern Yungas forest by dispersion of a few founders from the A. pretrei ancestral population. This process may have been favored by habitat corridors formed in hot and humid periods of the Quaternary. Considering that these two species are considered threatened, the results were evaluated for their implications for the conservation of these two species. PMID:25251765
Fast and accurate phylogeny reconstruction using filtered spaced-word matches
Sohrabi-Jahromi, Salma; Morgenstern, Burkhard
2017-01-01
Motivation: Word-based or ‘alignment-free’ algorithms are increasingly used for phylogeny reconstruction and genome comparison, since they are much faster than traditional approaches that are based on full sequence alignments. Existing alignment-free programs, however, are less accurate than alignment-based methods. Results: We propose Filtered Spaced Word Matches (FSWM), a fast alignment-free approach to estimate phylogenetic distances between large genomic sequences. For a pre-defined binary pattern of match and don’t-care positions, FSWM rapidly identifies spaced word-matches between input sequences, i.e. gap-free local alignments with matching nucleotides at the match positions and with mismatches allowed at the don’t-care positions. We then estimate the number of nucleotide substitutions per site by considering the nucleotides aligned at the don’t-care positions of the identified spaced-word matches. To reduce the noise from spurious random matches, we use a filtering procedure where we discard all spaced-word matches for which the overall similarity between the aligned segments is below a threshold. We show that our approach can accurately estimate substitution frequencies even for distantly related sequences that cannot be analyzed with existing alignment-free methods; phylogenetic trees constructed with FSWM distances are of high quality. A program run on a pair of eukaryotic genomes of a few hundred Mb each takes a few minutes. Availability and Implementation: The program source code for FSWM including a documentation, as well as the software that we used to generate artificial genome sequences are freely available at http://fswm.gobics.de/ Contact: chris.leimeister@stud.uni-goettingen.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28073754
Fast and accurate phylogeny reconstruction using filtered spaced-word matches.
Leimeister, Chris-André; Sohrabi-Jahromi, Salma; Morgenstern, Burkhard
2017-04-01
Word-based or 'alignment-free' algorithms are increasingly used for phylogeny reconstruction and genome comparison, since they are much faster than traditional approaches that are based on full sequence alignments. Existing alignment-free programs, however, are less accurate than alignment-based methods. We propose Filtered Spaced Word Matches (FSWM), a fast alignment-free approach to estimate phylogenetic distances between large genomic sequences. For a pre-defined binary pattern of match and don't-care positions, FSWM rapidly identifies spaced word-matches between input sequences, i.e. gap-free local alignments with matching nucleotides at the match positions and with mismatches allowed at the don't-care positions. We then estimate the number of nucleotide substitutions per site by considering the nucleotides aligned at the don't-care positions of the identified spaced-word matches. To reduce the noise from spurious random matches, we use a filtering procedure where we discard all spaced-word matches for which the overall similarity between the aligned segments is below a threshold. We show that our approach can accurately estimate substitution frequencies even for distantly related sequences that cannot be analyzed with existing alignment-free methods; phylogenetic trees constructed with FSWM distances are of high quality. A program run on a pair of eukaryotic genomes of a few hundred Mb each takes a few minutes. The program source code for FSWM including a documentation, as well as the software that we used to generate artificial genome sequences are freely available at http://fswm.gobics.de/. chris.leimeister@stud.uni-goettingen.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
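The procedure described in this abstract can be sketched in a few lines. The code below is an illustration only and is not the FSWM program: the +1/-1 similarity score, the `min_score` threshold, the Jukes-Cantor correction used to turn the mismatch proportion into substitutions per site, and all function names are assumptions made for this example. The pattern is a string in which `1` marks a match position and `0` a don't-care position.

```python
import math
from collections import defaultdict


def spaced_word_matches(seq_a, seq_b, pattern):
    """Yield (i, j) where seq_a at i and seq_b at j agree on all match positions."""
    match_pos = [k for k, c in enumerate(pattern) if c == "1"]
    span = len(pattern)
    index = defaultdict(list)
    for i in range(len(seq_a) - span + 1):
        index["".join(seq_a[i + k] for k in match_pos)].append(i)
    for j in range(len(seq_b) - span + 1):
        key = "".join(seq_b[j + k] for k in match_pos)
        for i in index.get(key, ()):
            yield i, j


def estimate_distance(seq_a, seq_b, pattern, min_score=0):
    """Estimate substitutions per site from don't-care positions of kept matches."""
    dont_care = [k for k, c in enumerate(pattern) if c == "0"]
    mismatches = compared = 0
    for i, j in spaced_word_matches(seq_a, seq_b, pattern):
        diffs = sum(seq_a[i + k] != seq_b[j + k] for k in dont_care)
        # Hypothetical score: +1 per identical position, -1 per mismatch.
        score = len(pattern) - 2 * diffs
        if score < min_score:
            continue  # filter out likely spurious matches
        mismatches += diffs
        compared += len(dont_care)
    if compared == 0:
        return None  # no match survived the filter
    p = mismatches / compared
    if p >= 0.75:
        return float("inf")  # Jukes-Cantor correction is undefined here
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


print(estimate_distance("ACGTA", "AGGTA", "10001"))
```

Indexing the words of one sequence in a dictionary keeps the search roughly linear in sequence length for typical inputs, which reflects the speed argument made for word-based methods in general.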
Montgelard, Claudine; Forty, Ellen; Arnal, Véronique; Matthee, Conrad A
2008-11-26
The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using approximately 7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics. Taxonomic sampling included multiple representatives of all five rodent suborders described. Fast-evolving positions for each dataset were identified individually using a discrete gamma rate category and sites belonging to the most rapidly evolving eighth gamma category were removed. Phylogenetic tree reconstructions were performed on individual and combined datasets using Parsimony, Bayesian, and partitioned Maximum Likelihood criteria. Removal of fast-evolving positions enhanced the phylogenetic signal to noise ratio but the improvement in resolution was not consistent across different data types. The results suggested that elimination of fastest sites only improved the support for nodes moderately affected by homoplasy (the deepest nodes for introns and more recent nodes for exons and mitochondrial genes). The present study based on eight DNA fragments supports a fully resolved higher level rodent phylogeny with moderate to significant nodal support. Two inter-suprafamilial associations emerged. The first comprised a monophyletic assemblage containing the Anomaluromorpha (Anomaluridae + Pedetidae) + Myomorpha (Muridae + Dipodidae) as sister clade to the Castorimorpha (Castoridae + Geomyoidea). 
The second suprafamilial clustering identified a novel association between the Sciuromorpha (Gliridae + (Sciuridae + Aplodontidae)) and the Hystricomorpha (Ctenodactylidae + Hystricognathi) which together represents the earliest dichotomy among Rodentia. Molecular time estimates using a relaxed Bayesian molecular clock date the appearance of the five suborders nearly contemporaneously at the KT boundary and this is congruent with suggestions of an early explosion of rodent diversity. Based on these newly proposed phylogenetic relationships, the evolution of the zygomasseteric pattern that has been used for a long time in rodent systematics is evaluated.
2008-01-01
Background: The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using ~7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics. Results: Taxonomic sampling included multiple representatives of all five rodent suborders described. Fast-evolving positions for each dataset were identified individually using a discrete gamma rate category and sites belonging to the most rapidly evolving eighth gamma category were removed. Phylogenetic tree reconstructions were performed on individual and combined datasets using Parsimony, Bayesian, and partitioned Maximum Likelihood criteria. Removal of fast-evolving positions enhanced the phylogenetic signal to noise ratio but the improvement in resolution was not consistent across different data types. The results suggested that elimination of fastest sites only improved the support for nodes moderately affected by homoplasy (the deepest nodes for introns and more recent nodes for exons and mitochondrial genes). Conclusion: The present study based on eight DNA fragments supports a fully resolved higher level rodent phylogeny with moderate to significant nodal support. Two inter-suprafamilial associations emerged.
The first comprised a monophyletic assemblage containing the Anomaluromorpha (Anomaluridae + Pedetidae) + Myomorpha (Muridae + Dipodidae) as sister clade to the Castorimorpha (Castoridae + Geomyoidea). The second suprafamilial clustering identified a novel association between the Sciuromorpha (Gliridae + (Sciuridae + Aplodontidae)) and the Hystricomorpha (Ctenodactylidae + Hystricognathi) which together represents the earliest dichotomy among Rodentia. Molecular time estimates using a relaxed Bayesian molecular clock date the appearance of the five suborders nearly contemporaneously at the KT boundary and this is congruent with suggestions of an early explosion of rodent diversity. Based on these newly proposed phylogenetic relationships, the evolution of the zygomasseteric pattern that has been used for a long time in rodent systematics is evaluated. PMID:19036132
Prychitko, T M; Moore, W S
1997-10-01
Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press
Streptococcus himalayensis sp. nov., isolated from the respiratory tract of Marmota himalayana.
Niu, Lina; Lu, Shan; Lai, Xin-He; Hu, Shoukui; Chen, Cuixia; Zhang, Gui; Yang, Jing; Jin, Dong; Wang, Yi; Lan, Ruiting; Lu, Gang; Xie, Yingping; Ye, Changyun; Xu, Jianguo
2017-02-01
Five strains of Gram-positive-staining, catalase-negative, coccus-shaped, chain-forming organisms isolated separately from the respiratory tracts of five Marmota himalayana animals in the Qinghai-Tibet Plateau of China were subjected to phenotypic and molecular taxonomic analyses. Comparative analysis of the 16S rRNA gene indicated that these singular organisms represent a new member of the genus Streptococcus, being phylogenetically closest to Streptococcus marmotae DSM 101995T (98.4 % similarity). The groEL, sodA and rpoB sequence analysis showed interspecies similarity values between HTS2T and Streptococcus marmotae DSM 101995T, its closest phylogenetic relative based on 16S rRNA gene sequences, of 98.2, 78.8 and 93.7 %, respectively. A whole-genome phylogenetic tree built from 82 core genes of genomes from 16 species of the genus Streptococcus validated that HTS2T forms a distinct subline and exhibits specific phylogenetic affinity with S. marmotae. In silico DNA-DNA hybridization of HTS2T showed an estimated DNA reassociation value of 40.5 % with Streptococcus marmotae DSM 101995T. On the basis of their phenotypic characteristics and phylogenetic findings, it is proposed that the five isolates be classified as representatives of a novel species of the genus Streptococcus, Streptococcus himalayensis sp. nov. The type strain is HTS2T (=DSM 101997T=CGMCC 1.15533T). The genome of Streptococcus himalayensis sp. nov. strain HTS2T contains 2195 genes with a size of 2 275 471 bp and a mean DNA G+C content of 41.3 mol%.
2013-01-01
Background Mitochondrial genomic (mitogenomic) reorganizations are rarely found in closely-related animals, yet drastic reorganizations have been found in the Ranoides frogs. The phylogenetic relationships of the three major ranoid taxa (Natatanura, Microhylidae, and Afrobatrachia) have been problematic, and mitogenomic information for afrobatrachians has not been available. Several molecular models for mitochondrial (mt) gene rearrangements have been proposed, but observational evidence has been insufficient to evaluate them. Furthermore, evolutionary trends in rearranged mt genes have not been well understood. To gain molecular and phylogenetic insights into these issues, we analyzed the mt genomes of four afrobatrachian species (Breviceps adspersus, Hemisus marmoratus, Hyperolius marmoratus, and Trichobatrachus robustus) and performed molecular phylogenetic analyses. Furthermore we searched for two evolutionary patterns expected in the rearranged mt genes of ranoids. Results Extensively reorganized mt genomes having many duplicated and rearranged genes were found in three of the four afrobatrachians analyzed. In fact, Breviceps has the largest known mt genome among vertebrates. Although the kinds of duplicated and rearranged genes differed among these species, a remarkable gene rearrangement pattern of non-tandemly copied genes situated within tandemly-copied regions was commonly found. Furthermore, the existence of concerted evolution was observed between non-neighboring copies of triplicated 12S and 16S ribosomal RNA regions. Conclusions Phylogenetic analyses based on mitogenomic data support a close relationship between Afrobatrachia and Microhylidae, with their estimated divergence 100 million years ago consistent with present-day endemism of afrobatrachians on the African continent. 
The afrobatrachian mt data supported the first tandem and second non-tandem duplication model for mt gene rearrangements and the recombination-based model for concerted evolution of duplicated mt regions. We also showed that specific nucleotide substitution and compositional patterns expected in duplicated and rearranged mt genes did not occur, suggesting no disadvantage in employing these genes for phylogenetic inference. PMID:24053406
Phylogenetic signal in the acoustic parameters of the advertisement calls of four clades of anurans.
Gingras, Bruno; Mohandesan, Elmira; Boko, Drasko; Fitch, W Tecumseh
2013-07-01
Anuran vocalizations, especially their advertisement calls, are largely species-specific and can be used to identify taxonomic affiliations. Because anurans are not vocal learners, their vocalizations are generally assumed to have a strong genetic component. This suggests that the degree of similarity between advertisement calls may be related to large-scale phylogenetic relationships. To test this hypothesis, advertisement calls from 90 species belonging to four large clades (Bufo, Hylinae, Leptodactylus, and Rana) were analyzed. Phylogenetic distances were estimated based on the DNA sequences of the 12S mitochondrial ribosomal RNA gene, and, for a subset of 49 species, on the rhodopsin gene. Mean values for five acoustic parameters (coefficient of variation of root-mean-square amplitude, dominant frequency, spectral flux, spectral irregularity, and spectral flatness) were computed for each species. We then tested for phylogenetic signal on the body-size-corrected residuals of these five parameters, using three statistical tests (Moran's I, Mantel, and Blomberg's K) and three models of genetic distance (pairwise distances, Abouheif's proximities, and the variance-covariance matrix derived from the phylogenetic tree). A significant phylogenetic signal was detected for most acoustic parameters on the 12S dataset, across statistical tests and genetic distance models, both for the entire sample of 90 species and within clades in several cases. A further analysis on a subset of 49 species using genetic distances derived from rhodopsin and from 12S broadly confirmed the results obtained on the larger sample, indicating that the phylogenetic signals observed in these acoustic parameters can be detected using a variety of genetic distance models derived either from a variable mitochondrial sequence or from a conserved nuclear gene. 
We found a robust relationship, in a large number of species, between anuran phylogenetic relatedness and acoustic similarity in the advertisement calls in a taxon with no evidence for vocal learning, even after correcting for the effect of body size. This finding, covering a broad sample of species whose vocalizations are fairly diverse, indicates that the intense selection on certain call characteristics observed in many anurans does not eliminate all acoustic indicators of relatedness. Our approach could potentially be applied to other vocal taxa.
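One of the three tests named in the abstract above, the Mantel test, can be sketched in a few lines. This is a minimal illustration of the general technique only, not the authors' pipeline: the function names (`pearson`, `upper_triangle`, `mantel`), the toy matrices, and the choice of 999 permutations are all assumptions made for the example, and the body-size correction described in the abstract is not reproduced.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def upper_triangle(matrix, order=None):
    """Flatten the upper triangle, optionally relabelling taxa via `order`."""
    n = len(matrix)
    idx = order if order is not None else list(range(n))
    return [matrix[idx[i]][idx[j]] for i in range(n) for j in range(i + 1, n)]


def mantel(genetic, trait, permutations=999, seed=0):
    """One-sided Mantel test: correlation of two distance matrices,
    with a p-value from permuting the taxon labels of `trait`."""
    rng = random.Random(seed)
    x = upper_triangle(genetic)
    observed = pearson(x, upper_triangle(trait))
    order = list(range(len(genetic)))
    hits = 1  # the observed arrangement counts as one permutation
    for _ in range(permutations):
        rng.shuffle(order)
        if pearson(x, upper_triangle(trait, order)) >= observed:
            hits += 1
    return observed, hits / (permutations + 1)


# Toy data (hypothetical): five species placed on a line, with a trait
# that roughly tracks their position.
position = [0, 1, 2, 10, 11]
call_trait = [0.1, 0.9, 2.2, 9.5, 11.3]
genetic_dist = [[abs(a - b) for b in position] for a in position]
trait_dist = [[abs(a - b) for b in call_trait] for a in call_trait]

r, p = mantel(genetic_dist, trait_dist)
print(f"Mantel r = {r:.3f}, permutation p = {p:.3f}")
```

Permuting whole taxon labels, rather than individual matrix cells, is what keeps the test valid when the entries of a distance matrix are not independent of one another.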
Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen
2016-01-01
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335
2017-01-01
The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts of data are analyzed. In the recent decade, alignment-free approaches have enabled genome-scale phylogenetic inference. Here we evaluate three alignment-free methods: ACS, CVTree, and Kr for phylogenetic inference with 16S rRNA gene data. We use a taxonomic gold standard to compare the accuracy of alignment-free phylogenetic inference with that of common microbiome-wide phylogenetic inference pipelines based on PyNAST and MUSCLE alignments with FastTree and RAxML. We re-simulate fecal communities from Human Microbiome Project data to evaluate the performance of the methods on datasets with properties of real data. Our comparisons show that alignment-free methods are not inferior to alignment-based methods in giving accurate and robust phylogenetic trees. Moreover, consensus ensembles of alignment-free phylogenies are superior to those built from alignment-based methods in their ability to highlight community differences in low power settings. In addition, the overall running times of alignment-based and alignment-free phylogenetic inference are comparable. Taken together, our empirical results suggest that alignment-free methods provide a viable approach for microbiome-wide phylogenetic inference. PMID:29136663
Multi-locus phylogenetic analysis reveals the pattern and tempo of bony fish evolution
Broughton, Richard E.; Betancur-R., Ricardo; Li, Chenhong; Arratia, Gloria; Ortí, Guillermo
2013-01-01
Over half of all vertebrates are “fishes”, which exhibit enormous diversity in morphology, physiology, behavior, reproductive biology, and ecology. Investigation of fundamental areas of vertebrate biology depends critically on a robust phylogeny of fishes, yet evolutionary relationships among the major actinopterygian and sarcopterygian lineages have not been conclusively resolved. Although a consensus phylogeny of teleosts has been emerging recently, it has been based on analyses of various subsets of actinopterygian taxa, but not on a full sample of all bony fishes. Here we conducted a comprehensive phylogenetic study on a broad taxonomic sample of 61 actinopterygian and sarcopterygian lineages (with a chondrichthyan outgroup) using a molecular data set of 21 independent loci. These data yielded a resolved phylogenetic hypothesis for extant Osteichthyes, including 1) reciprocally monophyletic Sarcopterygii and Actinopterygii, as currently understood, with polypteriforms as the first diverging lineage within Actinopterygii; 2) a monophyletic group containing gars and bowfin (= Holostei) as sister group to teleosts; and 3) the earliest diverging lineage among teleosts being Elopomorpha, rather than Osteoglossomorpha. Relaxed-clock dating analysis employing a set of 24 newly applied fossil calibrations reveals divergence times that are more consistent with paleontological estimates than previous studies. Establishing a new phylogenetic pattern with accurate divergence dates for bony fishes illustrates several areas where the fossil record is incomplete and provides critical new insights on diversification of this important vertebrate group. PMID:23788273
ESTimating plant phylogeny: lessons from partitioning
de la Torre, Jose EB; Egan, Mary G; Katari, Manpreet S; Brenner, Eric D; Stevenson, Dennis W; Coruzzi, Gloria M; DeSalle, Rob
2006-01-01
Background While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID, generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. Results A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (i.e., chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. Conclusion Based on the amount of character support contributed by EST data, which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. 
In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products. PMID:16776834
Building a Phylogenetic Tree of the Human and Ape Superfamily Using DNA-DNA Hybridization Data
ERIC Educational Resources Information Center
Maier, Caroline Alexander
2004-01-01
The study describes the process of DNA-DNA hybridization and the history of its use by Sibley and Ahlquist in simple, straightforward, and interesting language that students easily understand to create their own phylogenetic tree of the hominoid superfamily. They calibrate the DNA clock and use it to estimate the divergence dates of the various…
ERIC Educational Resources Information Center
Julius, Matthew L.; Schoenfuss, Heiko L.
2006-01-01
This laboratory exercise introduces students to a fundamental tool in evolutionary biology--phylogenetic inference. Students are required to create a data set via observation and through mining preexisting data sets. These student data sets are then used to develop and compare competing hypotheses of vertebrate phylogeny. The exercise uses readily…
A Practical Guide to Estimating the Heritability of Pathogen Traits.
Mitov, Venelin; Stadler, Tanja
2018-01-09
Pathogen traits, such as the virulence of an infection, can vary significantly between patients. A major challenge is to measure the extent to which genetic differences between infecting strains explain the observed variation of the trait. This is quantified by the trait's broad-sense heritability, H2. A recent discrepancy between estimates of the heritability of HIV-virulence has opened a debate on the estimators' accuracy. Here, we show that the discrepancy originates from model limitations and important lifecycle differences between sexually reproducing organisms and transmittable pathogens. In particular, current quantitative genetics methods, such as donor-recipient regression (DR) of surveyed serodiscordant couples and the phylogenetic mixed model (PMM), are prone to underestimate H2, because they neglect or do not fit to the loss of resemblance between transmission partners caused by within-host evolution. In a phylogenetic analysis of 8,483 HIV patients from the UK, we show that the phenotypic correlation between transmission partners decays with the amount of within-host evolution of the virus. We reproduce this pattern in toy-model simulations and show that a phylogenetic Ornstein-Uhlenbeck model (POUMM) outperforms the PMM in capturing this correlation pattern and in quantifying H2. In particular, we show that POUMM outperforms PMM even in simulations without selection - as it captures the mentioned correlation pattern - which has not been appreciated until now. By cross-validating the POUMM estimates with ANOVA on closest phylogenetic pairs (ANOVA-CPP), we obtain H2≈0.2, meaning about 20% of the variation in HIV-virulence is explained by the virus genome both for European and African data. © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
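The idea of estimating heritability from closely related pairs can be illustrated with a one-way ANOVA intraclass correlation. This is a hedged sketch under stated assumptions, not the authors' ANOVA-CPP implementation: the function name `pair_icc`, the toy trait values, and the use of the textbook intraclass-correlation estimator for groups of size two are all choices made for the example.

```python
def pair_icc(pairs):
    """Intraclass correlation from a one-way ANOVA on pairs.

    Each pair holds the trait values of two closely related cases.
    With groups of size k = 2 the estimator is
    (MS_between - MS_within) / (MS_between + (k - 1) * MS_within).
    """
    n = len(pairs)
    grand_mean = sum(a + b for a, b in pairs) / (2 * n)
    # Between-pair mean square: 2 observations per pair, n - 1 degrees of freedom.
    ms_between = sum(2 * ((a + b) / 2 - grand_mean) ** 2 for a, b in pairs) / (n - 1)
    # Within-pair mean square: (a - b)^2 / 2 per pair, n degrees of freedom.
    ms_within = sum((a - b) ** 2 / 2 for a, b in pairs) / n
    return (ms_between - ms_within) / (ms_between + ms_within)


# Hypothetical trait values for four pairs (illustration only, not real data).
toy_pairs = [(4.1, 4.6), (5.0, 4.2), (3.8, 4.4), (5.3, 5.1)]
print(f"intraclass correlation = {pair_icc(toy_pairs):.2f}")
```

A value near 1 means that members of a pair resemble each other far more than unrelated cases do, while a value near 0 means that pairing explains little of the trait variation.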
Marcussen, Thomas; Heier, Lise; Brysting, Anne K; Oxelman, Bengt; Jakobsen, Kjetill S
2015-01-01
Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batch.). The genus contains ca 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the "correct" network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies ("ghost subgenome lineages") significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. 
Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage (between 67% and 88%) of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network, have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hampers network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Mitochondrial genomes of two Australian fishflies with an evolutionary timescale of Chauliodinae.
Yang, Fan; Jiang, Yunlan; Yang, Ding; Liu, Xingyue
2017-06-30
Fishflies (Corydalidae: Chauliodinae) with a total of ca. 130 extant species are one of the major groups of the holometabolous insect order Megaloptera. As a group which originated during the Mesozoic, the phylogeny and historical biogeography of fishflies are of high interest. The previous hypothesis on the evolutionary history of fishflies was based primarily on morphological data. To further test the existing phylogenetic relationships and to understand the divergence pattern of fishflies, we conducted a molecule-based study. We determined the complete mitochondrial (mt) genomes of two Australian fishfly species, Archichauliodes deceptor Kimmins, 1954 and Protochauliodes biconicus Kimmins, 1954, both members of a major subgroup of Chauliodinae with high phylogenetic significance. A phylogenomic analysis was carried out based on 13 mt protein coding genes (PCGs) and two rRNA genes from the megalopteran species with determined mt genomes. Both maximum likelihood and Bayesian inference analyses recovered the Dysmicohermes clade as the sister group of the Archichauliodes clade + the Protochauliodes clade, which is consistent with the previous morphology-based hypothesis. The divergence time estimation suggested that the divergence among the three major subgroups of fishflies occurred during the Late Jurassic and Early Cretaceous when the supercontinent Pangaea was undergoing sequential breakup.
Kittel, Rebecca N; Austin, Andrew D; Klopfstein, Seraina
2016-08-01
Parasitoid wasps of the subfamily Cheloninae are both species rich and poorly known. Although the taxonomy of Cheloninae appears to be relatively stable, there is no clear understanding of relationships among higher-level taxa. We here applied molecular phylogenetic analyses using three markers (COI, EF1α, 28S) and 37 morphological characters to elucidate the evolution and systematics of these wasps. Analyses were based on 83 specimens representing 13 genera. All genera except Ascogaster, Phanerotoma, and Pseudophanerotoma formed monophyletic groups; Furcidentia (stat. rev.) is raised to generic rank. Neither Chelonus (Chelonus) nor Chelonus (Microchelonus) were recovered as monophyletic, but together formed a monophyletic lineage. The tribes Chelonini and Odontosphaeropygini formed monophyletic groups, but the Phanerotomini sensu Zettel and Pseudophanerotomini were retrieved as either para- or polyphyletic. The genera comprising the former subfamily Adeliinae were confirmed as being nested within the Cheloninae. To estimate the age of the subfamily, we used 16 fossil taxa. Three approaches were compared: fixed-rate dating, node dating, and total-evidence dating, with age estimates differing greatly between the three methods. Shortcomings of each approach in relation to our dataset are discussed, and none of the age estimates is deemed sufficiently reliable. Given that most dating studies use a single method only, in most cases without presenting analyses on the sensitivity to priors, it is likely that numerous age estimates in the literature suffer from a similar lack of robustness. We argue for a more rigorous approach to dating analyses and for a faithful presentation of uncertainties in divergence time estimates. Given the results of the phylogenetic analysis the following taxonomic changes are proposed: Furcidentia Zettel (stat. rev.), previously treated as a subgenus of Pseudophanerotoma Zettel is raised to generic rank; Microchelonus Szépligeti (syn. 
nov.), variously treated by previous authors, is proposed as a junior synonym of Chelonus Jurine; the following subgenera of Microchelonus - Baculonus Braet & van Achterberg (syn. nov.), Carinichelonus Tobias (syn. nov.) and Scabrichelonus He, Chen & van Achterberg (syn. nov.), are proposed as junior synonyms of Chelonus; a number of new species names are proposed due to homonyms resulting from the above changes and these are listed in the paper. Copyright © 2016 Elsevier Inc. All rights reserved.
2013-01-01
Background Grapes are one of the most economically important fruit crops. There are about 60 species in the genus Vitis. The phylogenetic relationships among these species are of keen interest for the conservation and use of this germplasm. We selected 309 accessions from 48 Vitis species, varieties, and outgroups, examined ~11 kb (~3.4 Mb total) of aligned nuclear DNA sequences from 27 unlinked genes in a phylogenetic context, and estimated divergence times based on fossil calibrations. Results Vitis formed a strongly supported clade. There was substantial support for species and less for the higher-level groupings (series). As estimated from extant taxa, the crown age of Vitis was 28 Ma and the divergence of subgenera (Vitis and Muscadinia) occurred at ~18 Ma. Higher clades in subgenus Vitis diverged 16 – 5 Ma with overlapping confidence intervals, and ongoing divergence formed extant species at 12 – 1.3 Ma. Several species had species-specific SNPs. NeighborNet analysis showed extensive reticulation at the core of subgenus Vitis representing the deeper nodes, with extensive reticulation radiating outward. Fitch Parsimony identified North America as the origin of the most recent common ancestor of extant Vitis species. Conclusions Phylogenetic patterns suggested origination of the genus in North America, fragmentation of an ancestral range during the Miocene, formation of extant species in the late Miocene-Pleistocene, and differentiation of species in the context of Pliocene-Quaternary tectonic and climatic change. Nuclear SNPs effectively resolved relationships at and below the species level in grapes and rectified several misclassifications of accessions in the repositories. Our results challenge current higher-level classifications, reveal the abundance of genetic diversity in the genus that is potentially available for crop improvement, and provide a valuable resource for species delineation, germplasm conservation and use. PMID:23826735
Shiino, Teiichiro; Hattori, Junko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru
2014-01-01
Background One major circulating HIV-1 subtype in Southeast Asian countries is CRF01_AE, but little is known about its epidemiology in Japan. We conducted a molecular phylodynamic study of patients newly diagnosed with CRF01_AE from 2003 to 2010. Methods Plasma samples from patients registered in Japanese Drug Resistance HIV-1 Surveillance Network were analyzed for protease-reverse transcriptase sequences; all sequences underwent subtyping and phylogenetic analysis using distance-matrix-based, maximum likelihood and Bayesian coalescent Markov Chain Monte Carlo (MCMC) phylogenetic inferences. Transmission clusters were identified using interior branch test and depth-first searches for sub-tree partitions. Times of most recent common ancestor (tMRCAs) of significant clusters were estimated using Bayesian MCMC analysis. Results Among 3618 patients registered in our network, 243 were infected with CRF01_AE. The majority of individuals with CRF01_AE were Japanese, predominantly male, and reported heterosexual contact as their risk factor. We found 5 large clusters with ≥5 members and 25 small clusters consisting of pairs of individuals with highly related CRF01_AE strains. The earliest cluster showed a tMRCA of 1996, and consisted of individuals with their known risk as heterosexual contacts. The other four large clusters showed later tMRCAs between 2000 and 2002 with members including intravenous drug users (IVDU) and non-Japanese, but not men who have sex with men (MSM). In contrast, small clusters included a high frequency of individuals reporting MSM risk factors. Phylogenetic analysis also showed that some individuals were infected with HIV strains that had spread in East and South-eastern Asian countries. Conclusions Introduction of CRF01_AE viruses into Japan is estimated to have occurred in the 1990s. CRF01_AE spread via heterosexual behavior, then among persons connected with non-Japanese, IVDU, and MSM. 
Phylogenetic analysis demonstrated that some viral variants are largely restricted to Japan, while others have a broad geographic distribution. PMID:25025900
How does cognition evolve? Phylogenetic comparative psychology
Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria
2014-01-01
Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850
Toyama, Hironori; Kajisa, Tsuyoshi; Tagane, Shuichiro; Mase, Keiko; Chhang, Phourin; Samreth, Vanna; Ma, Vuthy; Sokh, Heng; Ichihashi, Ryuji; Onoda, Yusuke; Mizoue, Nobuya; Yahara, Tetsukazu
2015-01-01
Ecological communities including tropical rainforest are rapidly changing under various disturbances caused by increasing human activities. Recently in Cambodia, illegal logging and clear-felling for agriculture have been increasing. Here, we study the effects of logging, mortality and recruitment of plot trees on phylogenetic community structure in 32 plots in Kampong Thom, Cambodia. Each plot was 0.25 ha; 28 plots were established in primary evergreen forests and four were established in secondary dry deciduous forests. Measurements were made in 1998, 2000, 2004 and 2010, and logging, recruitment and mortality of each tree were recorded. We estimated phylogeny using rbcL and matK gene sequences and quantified phylogenetic α and β diversity. Within communities, logging decreased phylogenetic diversity, and increased overall phylogenetic clustering and terminal phylogenetic evenness. Between communities, logging increased phylogenetic similarity between evergreen and deciduous plots. On the other hand, recruitment had opposite effects both within and between communities. The observed patterns can be explained by environmental homogenization under logging. Logging is biased to particular species and larger diameter at breast height, and forest patrol has been effective in decreasing logging. PMID:25561669
Toyama, Hironori; Kajisa, Tsuyoshi; Tagane, Shuichiro; Mase, Keiko; Chhang, Phourin; Samreth, Vanna; Ma, Vuthy; Sokh, Heng; Ichihashi, Ryuji; Onoda, Yusuke; Mizoue, Nobuya; Yahara, Tetsukazu
2015-02-19
Ecological communities including tropical rainforest are rapidly changing under various disturbances caused by increasing human activities. Recently in Cambodia, illegal logging and clear-felling for agriculture have been increasing. Here, we study the effects of logging, mortality and recruitment of plot trees on phylogenetic community structure in 32 plots in Kampong Thom, Cambodia. Each plot was 0.25 ha; 28 plots were established in primary evergreen forests and four were established in secondary dry deciduous forests. Measurements were made in 1998, 2000, 2004 and 2010, and logging, recruitment and mortality of each tree were recorded. We estimated phylogeny using rbcL and matK gene sequences and quantified phylogenetic α and β diversity. Within communities, logging decreased phylogenetic diversity, and increased overall phylogenetic clustering and terminal phylogenetic evenness. Between communities, logging increased phylogenetic similarity between evergreen and deciduous plots. On the other hand, recruitment had opposite effects both within and between communities. The observed patterns can be explained by environmental homogenization under logging. Logging is biased to particular species and larger diameter at breast height, and forest patrol has been effective in decreasing logging. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
How does cognition evolve? Phylogenetic comparative psychology.
MacLean, Evan L; Matthews, Luke J; Hare, Brian A; Nunn, Charles L; Anderson, Rindy C; Aureli, Filippo; Brannon, Elizabeth M; Call, Josep; Drea, Christine M; Emery, Nathan J; Haun, Daniel B M; Herrmann, Esther; Jacobs, Lucia F; Platt, Michael L; Rosati, Alexandra G; Sandel, Aaron A; Schroepfer, Kara K; Seed, Amanda M; Tan, Jingzhi; van Schaik, Carel P; Wobber, Victoria
2012-03-01
Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.
SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
2013-01-01
Background: Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results: In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion: We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408
SUNPLIN: simulation with uncertainty for phylogenetic investigations.
Martins, Wellington S; Carmo, Welton C; Longo, Humberto J; Rosa, Thierson C; Rangel, Thiago F
2013-11-15
Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.
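The two steps named in this abstract, randomly placing a missing species inside its clade and then computing pairwise phylogenetic distances, can be illustrated with a small Python sketch. This is not the SUNPLIN C++ code: the toy tree, the child-to-parent dictionary representation, and the function names `depths_to_root`, `patristic_matrix`, `descendants` and `insert_species` are all assumptions made for illustration, and the tip branch length of the inserted species is simply passed in by the caller.

```python
import random
from itertools import combinations

# Hypothetical toy tree: child -> (parent, branch length); the root has no entry.
TREE = {
    "A": ("n1", 1.0),
    "B": ("n1", 2.0),
    "n1": ("root", 0.5),
    "C": ("root", 3.0),
}

def depths_to_root(tree, node):
    """Return {ancestor: distance from node} for node and all of its ancestors."""
    dist, out = 0.0, {node: 0.0}
    while node in tree:
        parent, length = tree[node]
        dist += length
        out[parent] = dist
        node = parent
    return out

def patristic_matrix(tree, leaves):
    """Pairwise path-length distances between leaves, stored symmetrically."""
    paths = {leaf: depths_to_root(tree, leaf) for leaf in leaves}
    matrix = {}
    for a, b in combinations(leaves, 2):
        # The closest shared ancestor minimises the summed path length.
        d = min(paths[a][n] + paths[b][n] for n in paths[a] if n in paths[b])
        matrix[(a, b)] = matrix[(b, a)] = d
    return matrix

def descendants(tree, clade_root):
    """Nodes (other than clade_root) whose path to the root passes through clade_root."""
    return [n for n in tree if n != clade_root and clade_root in depths_to_root(tree, n)]

def insert_species(tree, clade_root, name, tip_length, rng):
    """Attach `name` at a random point on a random branch inside the clade."""
    child = rng.choice(sorted(descendants(tree, clade_root)))
    parent, length = tree[child]
    cut = rng.uniform(0.0, length)  # distance from parent to the new internal node
    new_node = f"ins_{name}"
    expanded = dict(tree)
    expanded[new_node] = (parent, cut)
    expanded[child] = (new_node, length - cut)
    expanded[name] = (new_node, tip_length)
    return expanded

if __name__ == "__main__":
    rng = random.Random(0)
    # One simulation replicate: expand the tree, then compute the distance matrix.
    expanded = insert_species(TREE, "n1", "D", 1.0, rng)
    print(patristic_matrix(expanded, ["A", "B", "C", "D"]))
```

Repeating the replicate many times with different random draws would give a distribution of distance matrices, which is the sense in which such a simulation reflects phylogenetic uncertainty. The quadratic pairwise loop here is deliberately naive; the efficiency gains the abstract reports are not reproduced by this sketch.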
Phylogenetic tree construction based on 2D graphical representation
NASA Astrophysics Data System (ADS)
Liao, Bo; Shan, Xinzhou; Zhu, Wen; Li, Renfa
2006-04-01
A new approach based on the two-dimensional (2D) graphical representation of the whole genome sequence [Bo Liao, Chem. Phys. Lett. 401 (2005) 196] is proposed to analyze the phylogenetic relationships of genomes. The evolutionary distances are obtained by measuring the differences among the 2D curves. Fuzzy theory is used to construct the phylogenetic tree. The phylogenetic relationships of H5N1 avian influenza virus illustrate the utility of our approach.
Humphreys-Pereira, Danny A; Elling, Axel A
2014-01-01
Root-knot nematodes (Meloidogyne spp.) are among the most important plant pathogens. In this study, the mitochondrial (mt) genomes of the root-knot nematodes M. chitwoodi and M. incognita were sequenced. PCR analyses suggest that both mt genomes are circular, with an estimated size of 19.7 and 18.6-19.1 kb, respectively. The mt genomes each contain a large non-coding region with tandem repeats and the control region. The mt gene arrangement of M. chitwoodi and M. incognita is unlike that of other nematodes. Sequence alignments of the two Meloidogyne mt genomes showed three translocations: two in transfer RNAs and one in cox2. Compared with other nematode mt genomes, the gene arrangement of M. chitwoodi and M. incognita was most similar to Pratylenchus vulnus. Phylogenetic analyses (Maximum Likelihood and Bayesian inference) were conducted using 78 complete mt genomes of diverse nematode species. Analyses based on nucleotides and amino acids of the 12 protein-coding mt genes showed strong support for the monophyly of class Chromadorea, but only amino acid-based analyses supported the monophyly of class Enoplea. The suborder Spirurina was not monophyletic in any of the phylogenetic analyses, contradicting the Clade III model, which groups Ascaridomorpha, Spiruromorpha and Oxyuridomorpha based on the small subunit ribosomal RNA gene. Importantly, comparisons of mt gene arrangement and tree-based methods placed Meloidogyne as the sister taxon of Pratylenchus, a migratory plant endoparasitic nematode, and not with the sedentary endoparasitic Heterodera. Thus, comparative analyses of mt genomes suggest that sedentary endoparasitism in Meloidogyne and Heterodera is based on convergent evolution. Copyright © 2014 Elsevier B.V. All rights reserved.
Cornillon, P A; Pontier, D; Rochet, M J
2000-02-21
Comparative methods are used to investigate the attributes of present species or higher taxa. Difficulties arise from the phylogenetic heritage: taxa are not independent and neglecting phylogenetic inertia can lead to inaccurate results. Within-species variations in life-history traits are also not negligible, but most comparative methods are not designed to take them into account. Taxa are generally described by a single value for each trait. We have developed a new model which permits the incorporation of both the phylogenetic relationships among populations and within-species variations. This is an extension of classical autoregressive models. This family of models was used to study the effect of fishing on six demographic traits measured on 77 populations of teleost fishes. Copyright 2000 Academic Press.
Salvi, Daniele; Macali, Armando; Mariottini, Paolo
2014-01-01
The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aided sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oysters and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663
Salvi, Daniele; Macali, Armando; Mariottini, Paolo
2014-01-01
The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aided sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oysters and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.
Modeling adaptive kernels from probabilistic phylogenetic trees.
Nicotra, Luca; Micheli, Alessio
2009-01-01
Modeling phylogenetic interactions is an open issue in many computational biology problems. In the context of gene function prediction we introduce a class of kernels for structured data that leverage a hierarchical probabilistic model of phylogeny among species. We derive three kernels belonging to this setting: a sufficient statistics kernel, a Fisher kernel, and a probability product kernel. The new kernels are used in the context of support vector machine learning. The kernels' adaptivity is obtained by estimating the parameters of a tree-structured model of evolution, using as observed data phylogenetic profiles that encode the presence or absence of specific genes in a set of fully sequenced genomes. We report results obtained in the prediction of the functional class of the proteins of the budding yeast Saccharomyces cerevisiae, which compare favorably to a standard vector-based kernel and to a non-adaptive tree kernel function. A further comparative analysis is performed in order to assess the impact of the different components of the proposed approach. We show that the key features of the proposed kernels are the adaptivity to the input domain and the ability to deal with structured data interpreted through a graphical model representation.
Liu, Jun; Liu, Helu; Zhang, Haibin
2018-04-22
The marine mussels (Mytilidae) are distributed in the oceans worldwide and occupy various habitats with diverse life styles. However, their taxonomy and phylogeny remain unclear from genus to family level due to equivocal morphological and anatomical characters among some taxa. In this study, we inferred the deep phylogenetic relationships among 42 mytiloid species, 19 genera, and five subfamilies of the extant marine mussels by using two mitochondrial (COI and 16S rRNA) and three nuclear (18S and 28S rRNA, and histone H3) genes. Phylogeny was reconstructed with a combination of five genes using Bayesian inference and maximum likelihood method, and divergence time was estimated for the major nodes using a relaxed clock model with three fossil calibrations. Phylogenetic trees revealed two major clades (Clades 1 and 2). In Clade 1, the deep-sea mussels (subfamily Bathymodiolinae) were sister to subfamily Modiolinae (represented by Modiolus), and then was clustered with Leiosolenus (subfamily Lithophaginae). Clade 2 comprised Lithophaga (Lithophaginae) and subfamily Mytilinae. Additionally, a Modiolus species and Musculus senhousia (subfamily Crenellinae) were positioned within the subfamily Mytilinae. The phylogenetic results strongly indicated monophyly of Mytilidae and Bathymodiolinae, polyphyly of Modiolinae and Lithophaginae, and paraphyly of Mytilinae. Divergence time estimation showed an ancient and gradual divergence in most mussel groups, whereas the deep-sea mussels originated recently and diverged rapidly during the Paleogene. The present study provides new insight into the evolutionary history of the marine mussels, and supports taxonomic revision for this important bivalve group. Copyright © 2018 Elsevier Inc. All rights reserved.
MaxAlign: maximizing usable data in an alignment.
Gouveia-Oliveira, Rodrigo; Sackett, Peter W; Pedersen, Anders G
2007-08-28
The presence of gaps in an alignment of nucleotide or protein sequences is often an inconvenience for bioinformatical studies. In phylogenetic and other analyses, for instance, gapped columns are often discarded entirely from the alignment. MaxAlign is a program that optimizes the alignment prior to such analyses. Specifically, it maximizes the number of nucleotide (or amino acid) symbols that are present in gap-free columns - the alignment area - by selecting the optimal subset of sequences to exclude from the alignment. MaxAlign can be used prior to phylogenetic and bioinformatical analyses as well as in other situations where this form of alignment improvement is useful. In this work we test MaxAlign's performance in these tasks and compare the accuracy of phylogenetic estimates including and excluding gapped columns from the analysis, with and without processing with MaxAlign. In this paper we also introduce a new simple measure of tree similarity, Normalized Symmetric Similarity (NSS) that we consider useful for comparing tree topologies. We demonstrate how MaxAlign is helpful in detecting misaligned or defective sequences without requiring manual inspection. We also show that it is not advisable to exclude gapped columns from phylogenetic analyses unless MaxAlign is used first. Finally, we find that the sequences removed by MaxAlign from an alignment tend to be those that would otherwise be associated with low phylogenetic accuracy, and that the presence of gaps in any given sequence does not seem to disturb the phylogenetic estimates of other sequences. The MaxAlign web-server is freely available online at http://www.cbs.dtu.dk/services/MaxAlign where supplementary information can also be found. The program is also freely available as a Perl stand-alone package.
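The quantity MaxAlign optimizes, the alignment area (symbols that sit in gap-free columns), is easy to state in code. The Python sketch below computes it and then applies a simple greedy exclusion loop. The greedy strategy, the function names `alignment_area` and `greedy_exclude`, and the toy sequences are assumptions for illustration only; the abstract does not describe MaxAlign's actual search procedure, and a greedy pass is not guaranteed to find the optimal subset.

```python
def alignment_area(seqs, gap="-"):
    """Symbols in gap-free columns: (number of gap-free columns) x (number of sequences)."""
    if not seqs:
        return 0
    rows = list(seqs.values())
    gap_free = sum(1 for col in zip(*rows) if gap not in col)
    return gap_free * len(rows)

def greedy_exclude(seqs, gap="-"):
    """Repeatedly drop the single sequence whose removal most increases the area."""
    kept = dict(seqs)
    removed = []
    while len(kept) > 1:
        best_name, best_area = None, alignment_area(kept, gap)
        for name in kept:
            trial = {k: v for k, v in kept.items() if k != name}
            area = alignment_area(trial, gap)
            if area > best_area:  # strict improvement only, so the loop terminates
                best_name, best_area = name, area
        if best_name is None:
            break
        removed.append(best_name)
        del kept[best_name]
    return kept, removed

# Hypothetical toy alignment: s3 carries gaps that spoil half of the columns.
ALIGNMENT = {
    "s1": "ACGTACGT",
    "s2": "ACGTACGA",
    "s3": "AC--AC--",
    "s4": "ACGTTCGT",
}

if __name__ == "__main__":
    kept, removed = greedy_exclude(ALIGNMENT)
    print(alignment_area(ALIGNMENT), alignment_area(kept), removed)
```

In the toy alignment, four sequences share only four gap-free columns (area 16), while dropping `s3` leaves three sequences with eight gap-free columns (area 24), which shows why excluding one sequence can increase the usable data.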
The dawn of open access to phylogenetic data.
Magee, Andrew F; May, Michael R; Moore, Brian R
2014-01-01
The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation--extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for [Formula: see text] of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.
Early evolution of the angiosperm clade Asteraceae in the Cretaceous of Antarctica.
Barreda, Viviana D; Palazzesi, Luis; Tellería, Maria C; Olivero, Eduardo B; Raine, J Ian; Forest, Félix
2015-09-01
The Asteraceae (sunflowers and daisies) are the most diverse family of flowering plants. Despite their prominent role in extant terrestrial ecosystems, the early evolutionary history of this family remains poorly understood. Here we report the discovery of a number of fossil pollen grains preserved in dinosaur-bearing deposits from the Late Cretaceous of Antarctica that drastically pushes back the timing of assumed origin of the family. Reliably dated to ∼76-66 Mya, these specimens are about 20 million years older than previously known records for the Asteraceae. Using a phylogenetic approach, we interpreted these fossil specimens as members of an extinct early diverging clade of the family, associated with subfamily Barnadesioideae. Based on a molecular phylogenetic tree calibrated using fossils, including the ones reported here, we estimated that the most recent common ancestor of the family lived at least 80 Mya in Gondwana, well before the thermal and biogeographical isolation of Antarctica. Most of the early diverging lineages of the family originated in a narrow time interval after the K/P boundary, 60-50 Mya, coinciding with a pronounced climatic warming during the Late Paleocene and Early Eocene, and the scene of a dramatic rise in flowering plant diversity. Our age estimates reduce earlier discrepancies between the age of the fossil record and previous molecular estimates for the origin of the family, bearing important implications in the evolution of flowering plants in general.
Bewick, Adam J; Chain, Frédéric J J; Heled, Joseph; Evans, Ben J
2012-12-01
The estimation of phylogenetic relationships is an essential component of understanding evolution. Accurate phylogenetic estimation is difficult, however, when internodes are short and old, when genealogical discordance is common due to large ancestral effective population sizes or ancestral population structure, and when homoplasy is prevalent. Inference of divergence times is also hampered by unknown and uneven rates of evolution, the incomplete fossil record, uncertainty in relationships between fossil and extant lineages, and uncertainty in the age of fossils. Ideally, these challenges can be overcome by developing large "phylogenomic" data sets and by analyzing them with methods that accommodate features of the evolutionary process, such as genealogical discordance, recurrent substitution, recombination, ancestral population structure, gene flow after speciation among sampled and unsampled taxa, and variation in evolutionary rates. In some phylogenetic problems, it is possible to use information that is independent of fossils, such as the geological record, to identify putative triggers for diversification whose associated estimated divergence times can then be compared a posteriori with estimated relationships and ages of fossils. The history of diversification of pipid frog genera Pipa, Hymenochirus, Silurana, and Xenopus, for instance, is characterized by many of these evolutionary and analytical challenges. These frogs diversified dozens of millions of years ago, they have a relatively rich fossil record, their distributions span continental plates with a well characterized geological record of ancient connectivity, and there is considerable disagreement across studies in estimated evolutionary relationships. We used high throughput sequencing and public databases to generate a large phylogenomic data set with which we estimated evolutionary relationships using multilocus coalescence methods. 
We collected sequence data from Pipa, Hymenochirus, Silurana, and Xenopus and the outgroup taxon Rhinophrynus dorsalis from coding sequence of 113 autosomal regions, averaging ∼300 bp in length (range: 102-1695 bp) and also a portion of the mitochondrial genome. Analysis of these data using multiple approaches recovers strong support for the ((Xenopus, Silurana)(Pipa, Hymenochirus)) topology, and geologically calibrated divergence time estimates that are consistent with estimated ages and phylogenetic affinities of many fossils. These results provide new insights into the biogeography and chronology of pipid diversification during the breakup of Gondwanaland and illustrate how phylogenomic data may be necessary to tackle tough problems in molecular systematics. [Coalescence; gene tree; high-throughput sequencing; lineage sorting; pipid; species tree; Xenopus.]
A methodological investigation of hominoid craniodental morphology and phylogenetics.
Bjarnason, Alexander; Chamberlain, Andrew T; Lockwood, Charles A
2011-01-01
The evolutionary relationships of extant great apes and humans have been largely resolved by molecular studies, yet morphology-based phylogenetic analyses continue to provide conflicting results. In order to further investigate this discrepancy we present bootstrap clade support of morphological data based on two quantitative datasets, one dataset consisting of linear measurements of the whole skull from 5 hominoid genera and the second dataset consisting of 3D landmark data from the temporal bone of 5 hominoid genera, including 11 sub-species. Using similar protocols for both datasets, we were able to 1) compare distance-based phylogenetic methods to cladistic parsimony of quantitative data converted into discrete character states, 2) vary outgroup choice to observe its effect on phylogenetic inference, and 3) analyse male and female data separately to observe the effect of sexual dimorphism on phylogenies. Phylogenetic analysis was sensitive to methodological decisions, particularly outgroup selection, where designation of Pongo as an outgroup and removal of Hylobates resulted in greater congruence with the proposed molecular phylogeny. The performance of distance-based methods also justifies their use in phylogenetic analysis of morphological data. It is clear from our analyses that hominoid phylogenetics ought not to be used as an example of conflict between the morphological and molecular, but as an example of how outgroup and methodological choices can affect the outcome of phylogenetic analysis. Copyright © 2010 Elsevier Ltd. All rights reserved.
Dessimoz, Christophe; Boeckmann, Brigitte; Roth, Alexander C J; Gonnet, Gaston H
2006-01-01
Correct orthology assignment is a critical prerequisite of numerous comparative genomics procedures, such as function prediction, construction of phylogenetic species trees and genome rearrangement analysis. We present an algorithm for the detection of non-orthologs that arise by mistake in current orthology classification methods based on genome-specific best hits, such as the COGs database. The algorithm works with pairwise distance estimates, rather than computationally expensive and error-prone tree-building methods. The accuracy of the algorithm is evaluated through verification of the distribution of predicted cases, case-by-case phylogenetic analysis and comparisons with predictions from other projects using independent methods. Our results show that a very significant fraction of the COG groups include non-orthologs: using conservative parameters, the algorithm detects non-orthology in a third of all COG groups. Consequently, sequence analysis sensitive to correct orthology assignments will greatly benefit from these findings.
Expected time-invariant effects of biological traits on mammal species duration.
Smits, Peter D
2015-10-20
Determining which biological traits influence differences in extinction risk is vital for understanding the differential diversification of life and for making predictions about species' vulnerability to anthropogenic impacts. Here I present a hierarchical Bayesian survival model of North American Cenozoic mammal species durations in relation to species-level ecological factors, time of origination, and phylogenetic relationships. I find support for the survival of the unspecialized as a time-invariant generalization of trait-based extinction risk. Furthermore, I find that phylogenetic and temporal effects are both substantial factors associated with differences in species durations. Finally, I find that the estimated effects of these factors are partially incongruous with how these factors are correlated with extinction risk of the extant species. These findings parallel previous observations that background extinction is a poor predictor of mass extinction events and suggest that attention should be focused on mass extinctions to gain insight into modern species loss.
Macrini, Thomas E; Flynn, John J; Ni, Xijun; Croft, Darin A; Wyss, André R
2013-01-01
The phylogenetic relationships of notoungulates, an extinct group of predominantly South American herbivores, remain poorly resolved with respect to both other placental mammals and among one another. Most previous phylogenetic analyses of notoungulates have not included characters of the internal cranium, not least because few such features, including the bony labyrinth, have been described for members of the group. Here we describe the inner ears of the notoungulates Altitypotherium chucalensis (Mesotheriidae), Pachyrukhos moyani (Hegetotheriidae) and Cochilius sp. (Interatheriidae) based on reconstructions of bony labyrinths obtained from computed tomography imagery. Comparisons of the bony labyrinths of these taxa with the basally diverging notoungulate Notostylops murinus (Notostylopidae), an isolated petrosal from Itaboraí, Brazil, referred to Notoungulata, and six therian outgroups, yielded an inner ear character matrix of 25 potentially phylogenetically informative characters, 14 of them novel to this study. Two equivocally optimized character states potentially support a pairing of Mesotheriidae and Hegetotheriidae, whereas four others may be diagnostic of Notoungulata. Three additional characters are potentially informative for diagnosing more inclusive clades: one for crown Placentalia; another for a clade containing Kulbeckia, Zalambdalestes, and Placentalia; and a third for Eutheria (crown Placentalia plus stem taxa). Several other characters are apomorphic for at least one notoungulate in our study and are of potential interest for broader taxonomic sampling within Notoungulata to clarify currently enigmatic interrelationships. Measures of the semicircular canals were used to infer agility (e.g. capable of quick movements vs. lethargic movements) of these taxa. Agility scores calculated from these data generally corroborate interpretations based on postcranial remains of these or closely related species. 
We provide estimates of the low-frequency hearing limits in notoungulates based on the ratio of radii of the apical and basal turns of the cochlea. These limits range from 15 Hz in Notostylops to 149 Hz in Pachyrukhos, values comparable to the Asian elephant (Elephas maximus) and the California sea lion (Zalophus californianus) when hearing in air, respectively. PMID:24102069
Byrne, Maria; Rowe, Frank; Uthicke, Sven
2010-09-01
The Stichopodidae comprise a diverse assemblage of holothuroids, most of which occur in the Indo-Pacific. Phylogenetic analyses of mitochondrial gene (COI, 16S rRNA) sequences for 111 individuals (7 genera, 17 species) clarified taxonomic uncertainties, species relationships, biogeography and evolution of the family. Monophyly of the genus Stichopus was supported, with the exception of Stichopus ellipes. Molecular analyses confirmed genus level taxonomy based on morphology. Most specimens harvested as S. horrens fell in the S. monotuberculatus clade, a morphologically variable assemblage with others from the S. naso clade. Taxonomic clarification of species fished as S. horrens will assist conservation measures. Evolutionary rates based on comparison of sequences from trans-isthmian Isostichopus species estimated that Stichopus and Isostichopus diverged ca. 5.5-10.7 Ma (Miocene). More recent splits were estimated to be younger than 1 Ma. Copyright 2010 Elsevier Inc. All rights reserved.
Ndhlovu, Andrew; Durand, Pierre M.; Hazelhurst, Scott
2015-01-01
The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. Database URL: http://www.bioinf.wits.ac.za/software/fire/evodb PMID:26140928
Ndhlovu, Andrew; Durand, Pierre M; Hazelhurst, Scott
2015-01-01
The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. © The Author(s) 2015. Published by Oxford University Press.
Cachera, Marie; Le Loc'h, François
2017-08-01
The relationships between diversity and ecosystem functioning have become a major focus of science. A crucial issue is to estimate functional diversity, as it is expected to affect ecosystem dynamics and stability. However, depending on the ecosystem, it may be challenging or even impossible to directly measure ecological functions and thus functional diversity. Phylogenetic diversity has recently been considered as a proxy for functional diversity. Phylogenetic diversity is indeed expected to match functional diversity if functions are traits conserved through evolution. However, in cases of adaptive radiation and/or evolutionary convergence, a mismatch may appear between species' phylogenetic and functional singularities. Using a highly threatened taxon, sharks, this study aimed to explore the relationships between phylogenetic and functional diversities and singularities. Different statistical computations were used to address both a methodological issue (phylogenetic reconstruction) and, more broadly, a theoretical question: the predictive power of phylogeny for functional diversity. Despite these several methodological approaches, a mismatch between phylogeny and function was highlighted. This mismatch revealed that (i) functions are apparently not conserved in shark species, and (ii) phylogenetic singularity is not a proxy for functional singularity. Functions appeared not to be conserved along the evolution of sharks, raising the conservation challenge of identifying and protecting both phylogenetically and functionally singular species. Given the current rate of species loss, it is indeed of major importance to target phylogenetically singular species to protect genetic diversity and also functionally singular species in order to maintain particular functions within ecosystems.
Kunstler, Georges; Lavergne, Sébastien; Courbaud, Benoît; Thuiller, Wilfried; Vieilledent, Ghislain; Zimmermann, Niklaus E; Kattge, Jens; Coomes, David A
2012-08-01
The relative importance of competition vs. environmental filtering in the assembly of communities is commonly inferred from their functional and phylogenetic structure, on the grounds that similar species compete most strongly for resources and are therefore less likely to coexist locally. This approach ignores the possibility that competitive effects can be determined by relative positions of species on a hierarchy of competitive ability. Using growth data, we estimated 275 interaction coefficients between tree species in the French mountains. We show that interaction strengths are mainly driven by trait hierarchy and not by functional or phylogenetic similarity. On the basis of this result, we propose that functional and phylogenetic convergence in local tree communities might be due to competition sorting species with different competitive abilities, and not only to environmental filtering as commonly assumed. We then show a functional and phylogenetic convergence of forest structure with increasing plot age, which supports this view. © 2012 Blackwell Publishing Ltd/CNRS.
Phylogenetic affinity of tree shrews to Glires is attributed to fast evolution rate.
Lin, Jiannan; Chen, Guangfeng; Gu, Liang; Shen, Yuefeng; Zheng, Meizhu; Zheng, Weisheng; Hu, Xinjie; Zhang, Xiaobai; Qiu, Yu; Liu, Xiaoqing; Jiang, Cizhong
2014-02-01
Previous phylogenetic analyses have led to incongruent evolutionary relationships between tree shrews and other suborders of Euarchontoglires. What caused the incongruence remains elusive. In this study, we identified 6845 orthologous genes among seventeen placental mammals. Tree shrews and Primates were monophyletic in the phylogenetic trees derived from the first and/or second codon positions, whereas tree shrews and Glires formed a monophyletic group in the trees derived from the third or all codon positions. The same topology was obtained in the phylogeny inference using the slowly and fast evolving genes, respectively. This incongruence is likely attributable to the fast substitution rate in tree shrews and Glires. Notably, sequence GC content alone was not informative for resolving the controversial phylogenetic relationships between tree shrews, Glires, and Primates. Finally, estimation of confidence in tree selection strongly supported the phylogenetic affiliation of tree shrews to Primates as a monophyletic group. Copyright © 2013 Elsevier Inc. All rights reserved.
Evolutionary process of deep-sea Bathymodiolus mussels.
Miyazaki, Jun-Ichi; de Oliveira Martins, Leonardo; Fujita, Yuko; Matsumoto, Hiroto; Fujiwara, Yoshihiro
2010-04-27
Since the discovery of deep-sea chemosynthesis-based communities, much work has been done to clarify their organismal and environmental aspects. However, major topics remain to be resolved, including when and how organisms invade and adapt to deep-sea environments; whether strategies for invasion and adaptation are shared by different taxa or unique to each taxon; how organisms extend their distribution and diversity; and how they become isolated to speciate in continuous waters. Deep-sea mussels are one of the dominant organisms in chemosynthesis-based communities; thus, investigations of their origin and evolution contribute to resolving questions about life in those communities. We investigated worldwide phylogenetic relationships of deep-sea Bathymodiolus mussels and their mytilid relatives by analyzing nucleotide sequences of the mitochondrial cytochrome c oxidase subunit I (COI) and NADH dehydrogenase subunit 4 (ND4) genes. Phylogenetic analysis of the concatenated sequence data showed that mussels of the subfamily Bathymodiolinae from vents and seeps were divided into four groups, and that mussels of the subfamily Modiolinae from sunken wood and whale carcasses assumed the outgroup position and shallow-water modioline mussels were positioned more distantly from the bathymodioline mussels. We provisionally hypothesized the evolutionary history of Bathymodiolus mussels by estimating evolutionary time under a relaxed molecular clock model. Diversification of bathymodioline mussels was initiated in the early Miocene, and subsequently diversification of the groups occurred in the early to middle Miocene. The phylogenetic relationships support the "Evolutionary stepping stone hypothesis," in which mytilid ancestors exploited sunken wood and whale carcasses in their progressive adaptation to deep-sea environments. 
This hypothesis is also supported by the evolutionary transition of symbiosis in that nutritional adaptation to the deep sea proceeded from extracellular to intracellular symbiotic states in whale carcasses. The estimated evolutionary time suggests that the mytilid ancestors were able to exploit whales during adaptation to the deep sea.
Wahlberg, Niklas; Weingartner, Elisabet; Warren, Andrew D; Nylin, Sören
2009-01-01
Background: Major conflict between mitochondrial and nuclear genes in estimating species relationships is an increasingly common finding in animals. Usually this is attributed to incomplete lineage sorting, but recently the possibility has been raised that hybridization is important in generating such phylogenetic patterns. Just how widespread ancient and/or recent hybridization is in animals and how it affects estimates of species relationships is still not well-known. Results: We investigate the species relationships and their evolutionary history over time in the genus Polygonia using DNA sequences from two mitochondrial gene regions (COI and ND1, total 1931 bp) and four nuclear gene regions (EF-1α, wingless, GAPDH and RpS5, total 2948 bp). We found clear, strongly supported conflict between mitochondrial and nuclear DNA sequences in estimating species relationships in the genus Polygonia. Nodes at which there was no conflict tended to have diverged at the same time when analyzed separately, while nodes at which conflict was present diverged at different times. We find that two species create most of the conflict, and attribute the conflict found in Polygonia satyrus to ancient hybridization and conflict found in Polygonia oreas to recent or ongoing hybridization. In both examples, the nuclear gene regions tended to give the phylogenetic relationships of the species supported by morphology and biology. Conclusion: Studies inferring species-level relationships using molecular data should never be based on a single locus. Here we show that the phylogenetic hypothesis generated using mitochondrial DNA gives a very different interpretation of the evolutionary history of Polygonia species compared to that generated from nuclear DNA. We show that possible cases of hybridization in Polygonia are not limited to sister species, but may be inferred further back in time. 
Furthermore, we provide more evidence that Haldane's effect might not be as strong a process in preventing hybridization in butterflies as has been previously thought. PMID:19422691
A Format for Phylogenetic Placements
Matsen, Frederick A.; Hoffman, Noah G.; Gallagher, Aaron; Stamatakis, Alexandros
2012-01-01
We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988
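To make the idea of a JSON-based placement record more concrete, the following is a minimal Python sketch that parses a toy document and reports the best-supported edge for each read. The key names (`tree`, `fields`, `placements`, `p`, `n`) and column names (`edge_num`, `like_weight_ratio`, and so on) are assumptions modeled on a jplace-style layout; they are not spelled out in the abstract, and all values are invented for illustration.

```python
import json

# Toy placement document. The keys and column names are assumptions
# modeled on a jplace-style layout; the values are made up for illustration.
JPLACE_TEXT = """
{
  "version": 3,
  "tree": "((A:0.2{0},B:0.1{1}):0.7{2},C:0.5{3}){4};",
  "fields": ["edge_num", "likelihood", "like_weight_ratio",
             "distal_length", "pendant_length"],
  "placements": [
    {"n": ["read_1"],
     "p": [[1, -1234.5, 0.8, 0.01, 0.02],
           [0, -1236.0, 0.2, 0.03, 0.02]]},
    {"n": ["read_2"],
     "p": [[3, -1100.0, 1.0, 0.10, 0.05]]}
  ],
  "metadata": {"invocation": "hypothetical example"}
}
"""


def best_placements(doc):
    """Map each read name to the edge with the highest like_weight_ratio."""
    columns = doc["fields"]
    edge_col = columns.index("edge_num")
    lwr_col = columns.index("like_weight_ratio")
    best = {}
    for placement in doc["placements"]:
        # Each row in "p" is one candidate location, ordered as in "fields".
        top = max(placement["p"], key=lambda row: row[lwr_col])
        for name in placement["n"]:
            best[name] = (top[edge_col], top[lwr_col])
    return best


doc = json.loads(JPLACE_TEXT)
result = best_placements(doc)
for name, (edge, lwr) in sorted(result.items()):
    print(f"{name}: edge {edge}, weight ratio {lwr}")
```

Looking columns up by name in `fields` rather than by fixed position keeps the sketch tolerant of files that list the columns in a different order.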
Molecular characterization of Hepatozoon species in reptiles from the Seychelles.
Harris, D James; Maia, João P M C; Perera, Ana
2011-02-01
Hepatozoon parasites were examined for the first time in reptiles from the Seychelles Islands. Although both prevalence and intensity were low, Hepatozoon species were detected in individuals from 2 endemic species, the lizard Mabuya wrightii and the snake Lycognathophis seychellensis. This was confirmed using visual identification and through sequencing part of the 18S rRNA gene. Phylogenetic analysis indicates that the Hepatozoon on the Seychelles form a monophyletic lineage, although more data are clearly needed to stabilize estimates of relationships based on this marker.
Dating Tips for Divergence-Time Estimation.
O'Reilly, Joseph E; Dos Reis, Mario; Donoghue, Philip C J
2015-11-01
The molecular clock is the only viable means of establishing an accurate timescale for Life on Earth, but it remains reliant on a capricious fossil record for calibration. 'Tip-dating' promises a conceptual advance, integrating fossil species among their living relatives using molecular/morphological datasets and evolutionary models. Fossil species of known age establish calibration directly, and their phylogenetic uncertainty is accommodated through the co-estimation of time and topology. However, challenges remain, including a dearth of effective models of morphological evolution, rate correlation, the non-random nature of missing characters in fossil data, and, most importantly, accommodating uncertainty in fossil age. We show uncertainty in fossil-dating propagates to divergence-time estimates, yielding estimates that are older and less precise than those based on traditional node calibration. Ultimately, node and tip calibrations are not mutually incompatible and may be integrated to achieve more accurate and precise evolutionary timescales. Copyright © 2015 Elsevier Ltd. All rights reserved.
Zhou, Xiaofan; Shen, Xing-Xing; Hittinger, Chris Todd
2018-01-01
The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs’ relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses. PMID:29177474
Multilocus inference of species trees and DNA barcoding.
Mallo, Diego; Posada, David
2016-09-05
The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary processes such as incomplete lineage sorting, gene duplication and loss and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree-gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or nonexistent. This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.
Blom, Mozes P K
2015-08-05
Recently developed molecular methods enable geneticists to target and sequence thousands of orthologous loci and infer evolutionary relationships across the tree of life. Large numbers of genetic markers benefit species tree inference but visual inspection of alignment quality, as traditionally conducted, is challenging with thousands of loci. Furthermore, due to the impracticality of repeated visual inspection with alternative filtering criteria, the potential consequences of using datasets with different degrees of missing data remain nominally explored in most empirical phylogenomic studies. In this short communication, I describe a flexible high-throughput pipeline designed to assess alignment quality and filter exonic sequence data for subsequent inference. The stringency criteria for alignment quality and missing data can be adapted based on the expected level of sequence divergence. Each alignment is automatically evaluated based on the stringency criteria specified, significantly reducing the number of alignments that require visual inspection. By developing a rapid method for alignment filtering and quality assessment, the consistency of phylogenetic estimation based on exonic sequence alignments can be further explored across distinct inference methods, while accounting for different degrees of missing data.
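The abstract does not detail the filtering rules, so the following Python sketch only illustrates the general idea of applying adjustable stringency criteria for missing data to an alignment. It is not the author's pipeline: the function names, the set of characters treated as missing, and both thresholds (`max_missing`, `min_taxa`) are hypothetical choices made for this example.

```python
# Characters counted as missing data (an assumption for this sketch).
MISSING = set("-N?n")


def missing_fraction(seq):
    """Fraction of sites in one aligned sequence that are gaps or ambiguous."""
    if not seq:
        return 1.0
    return sum(ch in MISSING for ch in seq) / len(seq)


def filter_alignment(alignment, max_missing=0.5, min_taxa=3):
    """Drop sequences with too much missing data; reject sparse alignments.

    alignment: dict mapping taxon name -> aligned sequence (equal lengths).
    Returns the filtered dict, or None if fewer than min_taxa remain.
    Both thresholds are hypothetical defaults, not values from the paper.
    """
    kept = {taxon: seq for taxon, seq in alignment.items()
            if missing_fraction(seq) <= max_missing}
    return kept if len(kept) >= min_taxa else None


# Invented exon alignment with varying amounts of missing data.
exon = {
    "taxon_a": "ATGGCCAAGT",
    "taxon_b": "ATGGCTAAGT",
    "taxon_c": "ATG---AAGT",   # 3 of 10 sites missing
    "taxon_d": "NNNNNN--GT",   # 8 of 10 sites missing
}

relaxed = filter_alignment(exon, max_missing=0.5, min_taxa=3)
strict = filter_alignment(exon, max_missing=0.2, min_taxa=3)
print(sorted(relaxed) if relaxed else None)
print(sorted(strict) if strict else None)
```

Running the same alignment through a relaxed and a strict setting mirrors the point made above: the amount of data retained, and hence the downstream inference, depends on the stringency chosen.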
Sumner, Jeremy G; Taylor, Amelia; Holland, Barbara R; Jarvis, Peter D
2017-12-01
Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transformation properties (in the case of Markov invariants). While both approaches have been valued for their intrinsic mathematical interest, it is not clear how they relate to each other, and to what extent they can be used as practical tools for inference of phylogenetic trees. In this paper, by focusing on the special case of binary sequence data and quartets of taxa, we are able to view these two different polynomial-based approaches within a common framework. To motivate the discussion, we present three desirable statistical properties that we argue any invariant-based phylogenetic method should satisfy: (1) sensible behaviour under reordering of input sequences; (2) stability as the taxa evolve independently according to a Markov process; and (3) explicit dependence on the assumption of a continuous-time process. Motivated by these statistical properties, we develop and explore several new phylogenetic inference methods. In particular, we develop a statistically bias-corrected version of the Markov invariants approach which satisfies all three properties. We also extend previous work by showing that the phylogenetic invariants can be implemented in such a way as to satisfy property (3). A simulation study shows that, in comparison to other methods, our new proposed approach based on bias-corrected Markov invariants is extremely powerful for phylogenetic inference. The binary case is of particular theoretical interest as, in this case only, the Markov invariants can be expressed as linear combinations of the phylogenetic invariants. 
A wider implication of this is that, for models with more than two states (for example, DNA sequence alignments with four-state models), we find that methods which rely on phylogenetic invariants are incapable of satisfying all three of the stated statistical properties. This is because in these cases the relevant Markov invariants belong to a class of polynomials independent from the phylogenetic invariants.
USDA-ARS's Scientific Manuscript database
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence d...
Pozzi, Luca; Hodgson, Jason A; Burrell, Andrew S; Sterner, Kirstin N; Raaum, Ryan L; Disotell, Todd R
2014-06-01
The origins and the divergence times of the most basal lineages within primates have been difficult to resolve mainly due to the incomplete sampling of early fossil taxa. The main source of contention is related to the discordance between molecular and fossil estimates: while there are no crown primate fossils older than 56Ma, most molecule-based estimates extend the origins of crown primates into the Cretaceous. Here we present a comprehensive mitogenomic study of primates. We assembled 87 mammalian mitochondrial genomes, including 62 primate species representing all the families of the order. We newly sequenced eleven mitochondrial genomes, including eight Old World monkeys and three strepsirrhines. Phylogenetic analyses support a strong topology, confirming the monophyly for all the major primate clades. In contrast to previous mitogenomic studies, the positions of tarsiers and colugos relative to strepsirrhines and anthropoids are well resolved. In order to improve our understanding of how fossil calibrations affect age estimates within primates, we explore the effect of seventeen fossil calibrations across primates and other mammalian groups and we select a subset of calibrations to date our mitogenomic tree. The divergence date estimates of the Strepsirrhine/Haplorhine split support an origin of crown primates in the Late Cretaceous, at around 74Ma. This result supports a short-fuse model of primate origins, whereby relatively little time passed between the origin of the order and the diversification of its major clades. It also suggests that the early primate fossil record is likely poorly sampled. Copyright © 2014 Elsevier Inc. All rights reserved.
One tree to link them all: a phylogenetic dataset for the European tetrapoda.
Roquet, Cristina; Lavergne, Sébastien; Thuiller, Wilfried
2014-08-08
Owing to the ever-increasing availability of phylogenetically informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies, which are necessary for comprehensive large-scale eco-phylogenetic analyses, are still lacking for many large groups and regions. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of a supermatrix, applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available in the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data were available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty.
Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions
Birtel, Julia; Walser, Jean-Claude; Pichon, Samuel; Bürgmann, Helmut; Matthews, Blake
2015-01-01
Methods to estimate microbial diversity have developed rapidly in an effort to understand the distribution and diversity of microorganisms in natural environments. For bacterial communities, the 16S rRNA gene is the phylogenetic marker gene of choice, but most studies select only a specific region of the 16S rRNA to estimate bacterial diversity. Whereas biases derived from DNA extraction, primer choice and PCR amplification are well documented, we here address how the choice of variable region can influence a wide range of standard ecological metrics, such as species richness, phylogenetic diversity, β-diversity and rank-abundance distributions. We have used Illumina paired-end sequencing to estimate the bacterial diversity of 20 natural lakes across Switzerland derived from three trimmed variable 16S rRNA regions (V3, V4, V5). Species richness, phylogenetic diversity, community composition, β-diversity, and rank-abundance distributions differed significantly between 16S rRNA regions. Overall, patterns of diversity quantified by the V3 and V5 regions were more similar to one another than those assessed by the V4 region. Similar results were obtained when analyzing the datasets with different sequence similarity thresholds used during sequence clustering and when the same analysis was used on a reference dataset of sequences from the Greengenes database. In addition, we also measured species richness from the same lake samples using ARISA Fingerprinting, but did not find a strong relationship between species richness estimated by Illumina and ARISA. We conclude that the selection of 16S rRNA region significantly influences the estimation of bacterial diversity and species distributions and that caution is warranted when comparing data from different variable regions as well as when using different sequencing techniques. PMID:25915756
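For readers unfamiliar with the metrics named above, the following Python sketch shows how three of them can be computed from a table of read counts per taxon. It is only an illustration: the counts are invented, the function names are hypothetical, and Bray-Curtis dissimilarity is used here as one common choice of β-diversity measure, which the abstract does not specify.

```python
def richness(counts):
    """Number of taxa (e.g. OTUs) with at least one read."""
    return sum(1 for c in counts.values() if c > 0)


def rank_abundance(counts):
    """Relative abundances sorted from most to least abundant taxon."""
    total = sum(counts.values())
    return sorted((c / total for c in counts.values() if c > 0), reverse=True)


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two count tables (0 = identical)."""
    taxa = set(a) | set(b)
    shared = sum(min(a.get(t, 0), b.get(t, 0)) for t in taxa)
    return 1.0 - 2.0 * shared / (sum(a.values()) + sum(b.values()))


# Made-up read counts for two lakes profiled with the same 16S region.
lake_1 = {"otu1": 50, "otu2": 30, "otu3": 20}
lake_2 = {"otu1": 60, "otu2": 40, "otu4": 0}

print(richness(lake_1), richness(lake_2))
print(rank_abundance(lake_1))
print(bray_curtis(lake_1, lake_2))
```

Repeating such calculations on count tables derived from different variable regions of the same samples is the kind of comparison the study describes.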
Duchêne, Sebastián; Duchêne, David; Holmes, Edward C; Ho, Simon Y W
2015-07-01
Rates and timescales of viral evolution can be estimated using phylogenetic analyses of time-structured molecular sequences. This involves the use of molecular-clock methods, calibrated by the sampling times of the viral sequences. However, the spread of these sampling times is not always sufficient to allow the substitution rate to be estimated accurately. We conducted Bayesian phylogenetic analyses of simulated virus data to evaluate the performance of the date-randomization test, which is sometimes used to investigate whether time-structured data sets have temporal signal. An estimate of the substitution rate passes this test if its mean does not fall within the 95% credible intervals of rate estimates obtained using replicate data sets in which the sampling times have been randomized. We find that the test sometimes fails to detect rate estimates from data with no temporal signal. This error can be minimized by using a more conservative criterion, whereby the 95% credible interval of the estimate with correct sampling times should not overlap with those obtained with randomized sampling times. We also investigated the behavior of the test when the sampling times are not uniformly distributed throughout the tree, which sometimes occurs in empirical data sets. The test performs poorly in these circumstances, such that a modification to the randomization scheme is needed. Finally, we illustrate the behavior of the test in analyses of nucleotide sequences of cereal yellow dwarf virus. Our results validate the use of the date-randomization test and allow us to propose guidelines for interpretation of its results. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
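The two pass criteria described above can be written down compactly. The Python sketch below assumes that the rate estimates and their 95% credible intervals have already been obtained from separate Bayesian analyses; the function names and the numerical values are hypothetical and serve only to show how the standard and the more conservative criterion can disagree.

```python
def passes_standard(real_mean, randomized_intervals):
    """Pass if the real mean rate lies outside every randomized 95% interval."""
    return all(not (lo <= real_mean <= hi) for lo, hi in randomized_intervals)


def passes_conservative(real_interval, randomized_intervals):
    """Pass if the real 95% interval overlaps none of the randomized ones."""
    real_lo, real_hi = real_interval
    return all(real_hi < lo or real_lo > hi for lo, hi in randomized_intervals)


# Invented substitution rates (subs/site/year) for illustration only.
real_mean = 1.0e-3
real_interval = (0.7e-3, 1.3e-3)
randomized = [
    (1.0e-5, 4.0e-4),
    (2.0e-5, 8.0e-4),   # overlaps the real interval but excludes the mean
    (1.0e-5, 5.0e-4),
]

standard_ok = passes_standard(real_mean, randomized)
conservative_ok = passes_conservative(real_interval, randomized)
print("standard criterion passed:", standard_ok)
print("conservative criterion passed:", conservative_ok)
```

In this toy case the mean of the real estimate lies outside all randomized intervals, yet one randomized interval overlaps the real interval, so the conservative criterion is the stricter of the two.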
Pohl, Nélida; Sison-Mangus, Marilou P; Yee, Emily N; Liswi, Saif W; Briscoe, Adriana D
2009-01-01
Background The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins, EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya).
Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84–113 million years for the divergence of all butterfly families. Conclusion These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation. PMID:19439087
Phylogenetic diversity measures based on Hill numbers.
Chao, Anne; Chiu, Chun-Huo; Jost, Lou
2010-11-27
We propose a parametric class of phylogenetic diversity (PD) measures that are sensitive to both species abundance and species taxonomic or phylogenetic distances. This work extends the conventional parametric species-neutral approach (based on 'effective number of species' or Hill numbers) to take into account species relatedness, and also generalizes the traditional phylogenetic approach (based on 'total phylogenetic length') to incorporate species abundances. The proposed measure quantifies 'the mean effective number of species' over any time interval of interest, or the 'effective number of maximally distinct lineages' over that time interval. The product of the measure and the interval length quantifies the 'branch diversity' of the phylogenetic tree during that interval. The new measures generalize and unify many existing measures and lead to a natural definition of taxonomic diversity as a special case. The replication principle (or doubling property), an important requirement for species-neutral diversity, is generalized to PD. The widely used Rao's quadratic entropy and the phylogenetic entropy do not satisfy this essential property, but a simple transformation converts each to our measures, which do satisfy the property. The proposed approach is applied to forest data for interpreting the effects of thinning.
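The abstract describes the measure in words only. As a rough sketch, and assuming the commonly cited form of the mean phylogenetic diversity of order q over an interval of length T, namely (Σ_i (L_i/T) a_i^q)^(1/(1−q)) with the q = 1 case taken as a limit, the calculation might look like the following. Here L_i is the length of branch i within the interval and a_i is the summed relative abundance of the species descending from it. The function names and the input layout are illustrative assumptions.

```python
import math
from typing import List, Tuple

# Each branch is (length within the interval, relative abundance descending from it).
Branch = Tuple[float, float]


def mean_phylo_diversity(branches: List[Branch], T: float, q: float) -> float:
    """Mean effective number of species (lineages) of order q over an interval
    of length T, under the assumed formula stated in the lead-in."""
    used = [(L, a) for L, a in branches if a > 0]
    if q == 1:
        # Limit as q -> 1: exponential of the length-weighted entropy.
        return math.exp(-sum((L / T) * a * math.log(a) for L, a in used))
    total = sum((L / T) * a ** q for L, a in used)
    return total ** (1.0 / (1.0 - q))


def branch_diversity(branches: List[Branch], T: float, q: float) -> float:
    """Product of the mean diversity and the interval length."""
    return T * mean_phylo_diversity(branches, T, q)


# Star tree: four equally abundant species, each on its own branch of length 2.
star = [(2.0, 0.25)] * 4
d0 = mean_phylo_diversity(star, T=2.0, q=0)
d1 = mean_phylo_diversity(star, T=2.0, q=1)
d2 = mean_phylo_diversity(star, T=2.0, q=2)
```

For this star tree of four maximally distinct, equally abundant lineages, every order q gives an effective number of 4. At q = 0 the branch diversity reduces to the total branch length (8 here), which matches the 'total phylogenetic length' approach mentioned in the abstract.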
Jeffery, Nicholas W; Gregory, T Ryan
2014-10-01
Crustaceans are enormously diverse both phylogenetically and ecologically, but they remain substantially underrepresented in the existing genome size database. An expansion of this dataset could be facilitated if it were possible to obtain genome size estimates from ethanol-preserved specimens. In this study, two tests were performed in order to assess the reliability of genome size data generated using preserved material. First, the results of estimates based on flash-frozen versus ethanol-preserved material were compared across 37 species of crustaceans that differ widely in genome size. Second, a comparison was made of specimens from a single species that had been stored in ethanol for 1-14 years. In both cases, the use of gill tissue in Feulgen image analysis densitometry proved to be a very viable approach. This finding is of direct relevance to both new studies of field-collected crustaceans as well as potential studies based on existing collections. © 2014 International Society for Advancement of Cytometry.
Buchwalter, David B.; Cain, Daniel J.; Martin, Caitrin A.; Xie, Lingtian; Luoma, Samuel N.; Garland, Theodore
2008-01-01
We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. PMID:18559853
Kuramae, Eiko E; Robert, Vincent; Echavarri-Erasun, Carlos; Boekhout, Teun
2007-01-01
Background The construction of robust and well resolved phylogenetic trees is important for our understanding of many, if not all, biological processes, including speciation and origin of higher taxa, genome evolution, metabolic diversification, multicellularity, origin of life styles, pathogenicity and so on. Many older phylogenies were not well supported due to insufficient phylogenetic signal present in the single or few genes used in phylogenetic reconstructions. Importantly, single gene phylogenies were not always found to be congruent. The phylogenetic signal may, therefore, be increased by enlarging the number of genes included in phylogenetic studies. Unfortunately, concatenation of many genes does not take into consideration the evolutionary history of each individual gene. Here, we describe an approach to select informative phylogenetic proteins to be used in the Tree of Life (TOL) and barcoding projects by comparing the cophenetic correlation coefficients (CCC) among individual protein distance matrices of proteins, using the fungi as an example. The method demonstrated that the quality and number of concatenated proteins is important for a reliable estimation of TOL. Approximately 40–45 concatenated proteins seem needed to resolve fungal TOL. Results In total 4852 orthologous proteins (KOGs) were assigned among 33 fungal genomes from the Asco- and Basidiomycota and 70 of these represented single copy proteins. The individual protein distance matrices based on 531 concatenated proteins that have been used for phylogeny reconstruction before [14] were compared one with another in order to select those with the highest CCC, which then was used as a reference. This reference distance matrix was compared with those of the 70 single copy proteins selected and their CCC values were calculated. Sixty four KOGs showed a CCC above 0.50 and these were further considered for their phylogenetic potential.
Proteins belonging to the cellular processes and signaling KOG category seem more informative than those belonging to the other three categories: information storage and processing; metabolism; and the poorly characterized category. After concatenation of 40 proteins the topology of the phylogenetic tree remained stable, but after concatenation of 60 or more proteins the bootstrap support values of some branches decreased, most likely due to the inclusion of proteins with lower CCC values. The selection of protein sequences to be used in various TOL projects remains a critical and important process. The method described in this paper will contribute to a more objective selection of phylogenetically informative protein sequences. Conclusion This study provides candidate protein sequences to be considered as phylogenetic markers in different branches of fungal TOL. The selection procedure described here will be useful to select informative protein sequences to resolve branches of TOL that contain few or no species with completely sequenced genomes. The robust phylogenetic trees resulting from this method may contribute to our understanding of organismal diversification processes. The method proposed can be extended easily to other branches of TOL. PMID:17688684
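The selection step in this abstract, which compares each protein's distance matrix with a reference matrix and keeps those above a CCC of 0.50, can be sketched as follows. The abstract does not say how the coefficient was computed, so this sketch assumes a Pearson correlation between the corresponding off-diagonal entries of two symmetric distance matrices. The function names and the toy matrices are hypothetical.

```python
import math
from typing import Dict, List

Matrix = List[List[float]]


def upper_triangle(matrix: Matrix) -> List[float]:
    """Flatten the entries above the diagonal of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def matrix_correlation(a: Matrix, b: Matrix) -> float:
    """Pearson correlation between the off-diagonal distances of a and b."""
    x, y = upper_triangle(a), upper_triangle(b)
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sx = math.sqrt(sum((xi - mx) ** 2 for xi in x))
    sy = math.sqrt(sum((yi - my) ** 2 for yi in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant distances")
    return cov / (sx * sy)


def select_informative(reference: Matrix, candidates: Dict[str, Matrix],
                       threshold: float = 0.50) -> Dict[str, float]:
    """Keep candidates whose correlation with the reference exceeds the threshold."""
    scores = {name: matrix_correlation(reference, m) for name, m in candidates.items()}
    return {name: r for name, r in scores.items() if r > threshold}


# Toy three-taxon matrices, for illustration only.
reference = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
candidates = {
    "protein_A": [[0, 2, 4], [2, 0, 6], [4, 6, 0]],  # same pattern, rescaled
    "protein_B": [[0, 3, 2], [3, 0, 1], [2, 1, 0]],  # reversed pattern
}
kept = select_informative(reference, candidates)
```

Only `protein_A` is retained in this toy case, since rescaling leaves the correlation at 1 while the reversed pattern gives −1.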
Platt, Roy N; Faircloth, Brant C; Sullivan, Kevin A M; Kieran, Troy J; Glenn, Travis C; Vandewege, Michael W; Lee, Thomas E; Baker, Robert J; Stevens, Richard D; Ray, David A
2018-03-01
The rapid diversification of Myotis bats into more than 100 species is one of the most extensive mammalian radiations available for study. Efforts to understand relationships within Myotis have primarily utilized mitochondrial markers and trees inferred from nuclear markers lacked resolution. Our current understanding of relationships within Myotis is therefore biased towards a set of phylogenetic markers that may not reflect the history of the nuclear genome. To resolve this, we sequenced the full mitochondrial genomes of 37 representative Myotis, primarily from the New World, in conjunction with targeted sequencing of 3648 ultraconserved elements (UCEs). We inferred the phylogeny and explored the effects of concatenation and summary phylogenetic methods, as well as combinations of markers based on informativeness or levels of missing data, on our results. Of the 294 phylogenies generated from the nuclear UCE data, all are significantly different from phylogenies inferred using mitochondrial genomes. Even within the nuclear data, quartet frequencies indicate that around half of all UCE loci conflict with the estimated species tree. Several factors can drive such conflict, including incomplete lineage sorting, introgressive hybridization, or even phylogenetic error. Despite the degree of discordance between nuclear UCE loci and the mitochondrial genome and among UCE loci themselves, the most common nuclear topology is recovered in one quarter of all analyses with strong nodal support. Based on these results, we re-examine the evolutionary history of Myotis to better understand the phenomena driving their unique nuclear, mitochondrial, and biogeographic histories.
Ely, John J; Dye, Brent; Frels, William I; Fritz, Jo; Gagneux, Pascal; Khun, Henry H; Switzer, William M; Lee, D Rick
2005-10-01
Chimpanzees are presently classified into three subspecies: Pan troglodytes verus from west Africa, P.t. troglodytes from central Africa, and P.t. schweinfurthii from east Africa. A fourth subspecies (P.t. vellerosus), from Cameroon and northern Nigeria, has been proposed. These taxonomic designations are based on geographical origins and are reflected in sequence variation in the first hypervariable region (HVR-I) of the mtDNA D-loop. Although advances have been made in our understanding of chimpanzee phylogenetics, little has been known regarding the subspecies composition of captive chimpanzees. We sequenced part of the mtDNA HVR-I region in 218 African-born population founders and performed a phylogenetic analysis with previously characterized African sequences of known provenance to infer subspecies affiliations. Most founders were P.t. verus (95.0%), distantly followed by the troglodytes schweinfurthii clade (4.6%), and a single P.t. vellerosus (0.4%). Pedigree-based estimates of genomic representation in the descendant population revealed that troglodytes schweinfurthii founder representation was reduced in captivity, vellerosus representation increased due to prolific breeding by a single male, and reproductive variance resulted in uneven representation among male P.t. verus founders. No increase in mortality was evident from between-subspecies interbreeding, indicating a lack of outbreeding depression. Knowledge of subspecies and their genomic representation can form the basis for phylogenetically informed genetic management of extant chimpanzees to preserve rare genetic variation for research, conservation, or possible future breeding. Copyright 2005 Wiley-Liss, Inc.
DNA-Based Identification of Forensically Important Blow Flies (Diptera: Calliphoridae) From India.
Bharti, Meenakshi; Singh, Baneshwar
2017-09-01
Correct species identification is the first and most important criterion in entomological evidence-based postmortem interval (PMI) estimation. Although morphological keys are available for species identification of adult blow flies, keys for immature stages are either lacking or incomplete. In this study, cytochrome oxidase subunit 1 (COI) reference data were developed from nine species (belonging to three subfamilies, namely, Calliphorinae, Luciliinae, and Chrysomyinae) of blow flies from India. Seven of the nine species included in this study were found suitable for DNA-based identification using the COI gene, because they showed nonoverlapping intra- (0.0-0.3%) and inter- (1.96-18.14%) specific diversity, and formed well-supported monophyletic clades in phylogenetic analysis. The remaining two species (i.e., Chrysomya megacephala (Fabricius) and Chrysomya chani Kurahashi) cannot be distinguished reliably using our database because they had a very low interspecific diversity (0.11%), and Ch. megacephala was paraphyletic with respect to Ch. chani in the phylogenetic analysis. We conclude that the COI gene is a useful marker for DNA-based identification of blow flies from India. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
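The suitability criterion in this abstract rests on whether the intraspecific and interspecific divergence ranges overlap. A minimal sketch of that check follows, using the percentages reported in the abstract. The function name is hypothetical, and comparing the 0.11% interspecific value against the 0.3% intraspecific maximum is this sketch's reading of the abstract rather than a stated procedure.

```python
def has_barcode_gap(intra_max: float, inter_min: float) -> bool:
    """True when the smallest between-species divergence exceeds the largest
    within-species divergence, i.e. the two ranges do not overlap."""
    return inter_min > intra_max


# Divergence values in percent, as reported in the abstract.
seven_species_gap = has_barcode_gap(intra_max=0.3, inter_min=1.96)
chrysomya_pair_gap = has_barcode_gap(intra_max=0.3, inter_min=0.11)
```

Under this reading, the seven species show a gap, while the Ch. megacephala and Ch. chani pair does not.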
A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.
Takahashi, Tetsumi; Moreno, Edmundo
2015-12-01
The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of the species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.
Evolution of exceptional species richness among lineages of fleshy-fruited Myrtaceae
Biffin, Ed; Lucas, Eve J.; Craven, Lyn A.; Ribeiro da Costa, Itayguara; Harrington, Mark G.; Crisp, Michael D.
2010-01-01
Background and Aims The angiosperm family Myrtaceae comprises 17 tribes with more than half of the estimated 5500 species being referred to the fleshy-fruited and predominantly rainforest associated Syzygieae and Myrteae. Previous studies suggest that fleshy fruits have evolved separately in these lineages, whereas, more generally, shifts in fruit morphology have been variously implicated in diversification rate shifts among angiosperms. A phylogenetic hypothesis and estimated divergence times for Myrtaceae are developed as a basis to explore the evidence for, and drivers of, elevated diversification rates among the fleshy-fruited tribes of Myrtaceae. Methods Bayesian phylogenetic analyses of plastid and nuclear DNA sequences were used to estimate intertribal relationships and lineage divergence times in Myrtaceae. Focusing on the fleshy-fruited tribes, a variety of statistical approaches were used to assess diversification rates and diversification rate shifts across the family. Key Results Analyses of the sequence data provide a strongly supported phylogenetic hypothesis for Myrtaceae. Relative to previous studies, substantially younger ages for many of the clades are reported, and it is argued that the use of flexible calibrations to incorporate fossil data provides more realistic divergence estimates than the use of errorless point calibrations. It is found that Syzygieae and Myrteae have experienced elevated diversification rates relative to other lineages of Myrtaceae. Positive shifts in diversification rate have occurred separately in each lineage, associated with a shift from dry to fleshy fruit. Conclusions Fleshy fruits have evolved independently in Syzygieae and Myrteae, and this is accompanied by exceptional diversification rate shifts in both instances, suggesting that the evolution of fleshy fruits is a key innovation for rainforest Myrtaceae.
Noting the scale dependency of this hypothesis, more complex explanations may be required to explain diversification rate shifts occurring within the fleshy-fruited tribes, and the suggested phylogenetic hypothesis provides an appropriate framework for this undertaking. PMID:20462850
Extending the BEAGLE library to a multi-FPGA platform.
Jin, Zheming; Bakos, Jason D
2013-01-19
Maximum Likelihood (ML)-based phylogenetic inference using Felsenstein's pruning algorithm is a standard method for estimating the evolutionary relationships amongst a set of species based on DNA sequence data, and is used in popular applications such as RAxML, PHYLIP, GARLI, BEAST, and MrBayes. The Phylogenetic Likelihood Function (PLF) and its associated scaling and normalization steps comprise the computational kernel for these tools. These computations are data intensive but contain fine grain parallelism that can be exploited by coprocessor architectures such as FPGAs and GPUs. A general purpose API called BEAGLE has recently been developed that includes optimized implementations of Felsenstein's pruning algorithm for various data parallel architectures. In this paper, we extend the BEAGLE API to a multiple Field Programmable Gate Array (FPGA)-based platform called the Convey HC-1. The core calculation of our implementation, which includes both the phylogenetic likelihood function (PLF) and the tree likelihood calculation, has an arithmetic intensity of 130 floating-point operations per 64 bytes of I/O, or 2.03 ops/byte. Its performance can thus be calculated as a function of the host platform's peak memory bandwidth and the implementation's memory efficiency, as 2.03 × peak bandwidth × memory efficiency. Our FPGA-based platform has a peak bandwidth of 76.8 GB/s and our implementation achieves a memory efficiency of approximately 50%, which gives an average throughput of 78 Gflops. This represents a ~40X speedup when compared with BEAGLE's CPU implementation on a dual Xeon 5520 and 3X speedup versus BEAGLE's GPU implementation on a Tesla T10 GPU for very large data sizes. The power consumption is 92 W, yielding a power efficiency of 1.7 Gflops per Watt. The use of data parallel architectures to achieve high performance for likelihood-based phylogenetic inference requires high memory bandwidth and a design methodology that emphasizes high memory efficiency. 
To achieve this objective, we integrated 32 pipelined processing elements (PEs) across four FPGAs. For the design of each PE, we developed a specialized synthesis tool to generate a floating-point pipeline with resource and throughput constraints to match the target platform. We have found that using low-latency floating-point operators can significantly reduce FPGA area and still meet the timing requirements on the target platform. We found that this design methodology can achieve performance that exceeds that of a GPU-based coprocessor.
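The throughput figure in this abstract follows from its stated model, in which performance equals arithmetic intensity × peak bandwidth × memory efficiency. The short sketch below reproduces that arithmetic from the reported numbers; the function names are illustrative and not part of BEAGLE or the authors' tooling.

```python
def arithmetic_intensity(flops: float, bytes_of_io: float) -> float:
    """Floating-point operations performed per byte moved to or from memory."""
    return flops / bytes_of_io


def sustained_gflops(intensity: float, peak_bandwidth_gb_s: float,
                     memory_efficiency: float) -> float:
    """Bandwidth-bound throughput: ops/byte x GB/s x fraction of peak achieved."""
    return intensity * peak_bandwidth_gb_s * memory_efficiency


# Values reported in the abstract: 130 flops per 64 bytes, 76.8 GB/s, about 50%.
intensity = arithmetic_intensity(130, 64)               # about 2.03 ops/byte
throughput = sustained_gflops(intensity, 76.8, 0.50)    # about 78 Gflops
```

Because the kernel is treated as memory-bound, the estimate scales linearly with both the platform bandwidth and the achieved memory efficiency.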
Jin, Haofei; Yonezawa, Takahiro; Zhong, Yang; Kishino, Hirohisa; Hasegawa, Masami
2017-03-17
The giant rhinoceros beetles (Dynastini, Scarabaeidae, Coleoptera) are distributed in tropical and temperate regions in Asia, America and Africa. Recent molecular phylogenetic studies have revealed that the giant rhinoceros beetles can be divided into three clades representing Asia, America and Africa. Although a correlation between their evolution and the continental drift during the Pangean breakup was suggested, there is no accurate divergence time estimation among the three clades based on molecular data. Moreover, there is a long chronological gap between the timing of the Pangean breakup (Cretaceous: 110-148 Ma) and the emergence of the oldest fossil record (Oligocene: 33 Ma). In this study, we estimated their divergence times based on molecular data, using several combinations of fossil calibration sets, and obtained robust estimates. The inter-continental divergence events among the clades were estimated to have occurred about 99 Ma (Asian clade and others) and 78 Ma (American clade and African clade), both of which are after the Pangean breakup. These estimates suggest their inter-continental divergences occurred by overseas sweepstakes dispersal, rather than by vicariances of the population caused by the Pangean breakup.
Hellmuth, Marc; Wieseke, Nicolas; Lechner, Marcus; Lenhof, Hans-Peter; Middendorf, Martin; Stadler, Peter F.
2015-01-01
Phylogenomics heavily relies on well-curated sequence data sets that comprise, for each gene, exclusively 1:1 orthologs. Paralogs are treated as a dangerous nuisance that has to be detected and removed. We show here that this severe restriction of the data sets is not necessary. Building upon recent advances in mathematical phylogenetics, we demonstrate that gene duplications convey meaningful phylogenetic information and allow the inference of plausible phylogenetic trees, provided orthologs and paralogs can be distinguished with a degree of certainty. Starting from tree-free estimates of orthology, cograph editing can sufficiently reduce the noise to find correct event-annotated gene trees. The information of gene trees can then directly be translated into constraints on the species trees. Although the resolution is very poor for individual gene families, we show that genome-wide data sets are sufficient to generate fully resolved phylogenetic trees, even in the presence of horizontal gene transfer. PMID:25646426
Silvestro, Daniele; Zizka, Alexander; Bacon, Christine D; Cascales-Miñana, Borja; Salamin, Nicolas; Antonelli, Alexandre
2016-04-05
Methods in historical biogeography have revolutionized our ability to infer the evolution of ancestral geographical ranges from phylogenies of extant taxa, the rates of dispersals, and biotic connectivity among areas. However, extant taxa are likely to provide limited and potentially biased information about past biogeographic processes, due to extinction, asymmetrical dispersals and variable connectivity among areas. Fossil data hold considerable information about past distribution of lineages, but suffer from largely incomplete sampling. Here we present a new dispersal-extinction-sampling (DES) model, which estimates biogeographic parameters using fossil occurrences instead of phylogenetic trees. The model estimates dispersal and extinction rates while explicitly accounting for the incompleteness of the fossil record. Rates can vary between areas and through time, thus providing the opportunity to assess complex scenarios of biogeographic evolution. We implement the DES model in a Bayesian framework and demonstrate through simulations that it can accurately infer all the relevant parameters. We demonstrate the use of our model by analysing the Cenozoic fossil record of land plants and inferring dispersal and extinction rates across Eurasia and North America. Our results show that biogeographic range evolution is not a time-homogeneous process, as assumed in most phylogenetic analyses, but varies through time and between areas. In our empirical assessment, this is shown by the striking predominance of plant dispersals from Eurasia into North America during the Eocene climatic cooling, followed by a shift in the opposite direction, and finally, a balance in biotic interchange since the middle Miocene. We conclude by discussing the potential of fossil-based analyses to test biogeographic hypotheses and improve phylogenetic methods in historical biogeography. © 2016 The Author(s).
Matsudaira, Kazunari; Hamada, Yuzuru; Bunlungsup, Srichan; Ishida, Takafumi; San, Aye Mi; Malaivijitnond, Suchinda
2018-05-11
Macaca fascicularis aurea (Burmese long-tailed macaque) is 1 of the 10 subspecies of Macaca fascicularis. Despite having few morphological differences from other subspecies, a recent phylogeographic study showed that M. f. aurea is clearly distinct genetically from Macaca fascicularis fascicularis (common long-tailed macaque) and suggests that M. f. aurea experienced a disparate evolutionary pathway versus other subspecies. To construct a detailed evolutionary history of M. f. aurea and its relationships with other macaque species, we performed phylogenetic analyses and divergence time estimation of whole mitochondrial genomes (2 M. f. aurea, 8 M. f. fascicularis, and 16 animals of 12 macaque species) and 2871 bp of the Y chromosome (1 M. f. aurea, 2 M. f. fascicularis, and 5 animals of 5 macaque species) and haplotype network analysis of 758 bp of the Y chromosome (1 M. f. aurea, 2 M. f. fascicularis, and 21 animals of 19 macaque species). Whereas the Y chromosome of M. f. aurea clustered with those of the fascicularis species group in the phylogenetic and haplotype network analyses, its mtDNA clustered within the clade of the sinica species group. Based on this phylogenetic incongruence and the estimated divergence times, we propose that proto-M. f. aurea underwent hybridization with a population of the sinica species group between 2.5 and 0.95 MYA after divergence from the common ancestor of M. fascicularis. Hybridization and introgression might have been central in the evolution of M. f. aurea, similar to what occurred in the evolution of other macaque species and subspecies.
Okamoto, Takuji; Maruyama, Akihiko; Imura, Satoshi; Takeyama, Haruko; Naganuma, Takeshi
2004-05-01
Halomonas variabilis and phylogenetically related organisms were isolated from various habitats such as Antarctic terrain and saline ponds, deep-sea sediment, deep-sea waters affected by hydrothermal plumes, and hydrothermal vent fluids. Ten strains were selected for physiological and phylogenetic characterization in detail. All of those strains were found to be piezotolerant and psychrotolerant, as well as euryhaline halophilic or halotolerant. Their stress tolerance may facilitate their wide occurrence, even in so-called extreme environments. The 16S rDNA-based phylogenetic relationship was complemented by analyses of the DNA gyrase subunit B gene (gyrB) and genes involved in the synthesis of the major compatible solute, ectoine: diaminobutyric acid aminotransferase gene (ectB) and ectoine synthase gene (ectC). The phylogenetic relationships of H. variabilis and related organisms were very similar in terms of 16S rDNA, gyrB, and ectB. The ectC-based tree was inconsistent with the other phylogenetic trees. For that reason, ectC was inferred to derive from horizontal transfer.
Molecular phylogeny and systematics of the Echinostomatoidea Looss, 1899 (Platyhelminthes: Digenea).
Tkach, Vasyl V; Kudlai, Olena; Kostadinova, Aneta
2016-03-01
The Echinostomatoidea is a large, cosmopolitan group of digeneans currently including nine families and 105 genera, the vast majority parasitic, as adults, in birds with relatively few taxa parasitising mammals, reptiles and, exceptionally, fish. Despite the complex structure, diverse content and substantial species richness of the group, almost no attempt has been made to elucidate its phylogenetic relationships at the suprageneric level based on molecules due to the lack of data. Herein, we evaluate the consistency of the present morphology-based classification system of the Echinostomatoidea with the phylogenetic relationships of its members based on partial sequences of the nuclear lsrRNA gene for a broad diversity of taxa (80 species, representing eight families and 40 genera), including representatives of five subfamilies of the Echinostomatidae, which currently exhibits the most complex taxonomic structure within the superfamily. This first comprehensive phylogeny for the Echinostomatoidea challenged the current systematic framework based on comparative morphology. A morphology-based evaluation of this new molecular framework resulted in a number of systematic and nomenclatural changes consistent with the phylogenetic estimates of the generic and suprageneric boundaries and a new phylogeny-based classification of the Echinostomatoidea. In the current systematic treatment: (i) the rank of two family level lineages, the former Himasthlinae and Echinochasminae, is elevated to full family status; (ii) Caballerotrema is distinguished at the family level; (iii) the content and diagnosis of the Echinostomatidae (sensu stricto) (s. str.) are revised to reflect its phylogeny, resulting in the abolition of the Nephrostominae and Chaunocephalinae as synonyms of the Echinostomatidae (s. str.); (iv) Artyfechinostomum, Cathaemasia, Rhopalias and Ribeiroia are re-allocated within the Echinostomatidae (s. 
str.), resulting in the abolition of the Cathaemasiidae, Rhopaliidae and Ribeiroiinae, which become synonyms of the Echinostomatidae (s. str.); and (v) refinements of the generic boundaries within the Echinostomatidae (s. str.), Psilostomidae and Fasciolidae are made. Copyright © 2015 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Time-calibrated molecular phylogeny of pteropods
Hörnlein, Christine; Janssen, Arie W.; Hughes, Martin; Bush, Stephanie L.; Marlétaz, Ferdinand; Gasca, Rebeca; Pierrot-Bults, Annelies C.; Michel, Ellinor; Todd, Jonathan A.; Young, Jeremy R.; Osborn, Karen J.; Menken, Steph B. J.
2017-01-01
Pteropods are a widespread group of holoplanktonic gastropod molluscs and are uniquely suitable for study of long-term evolutionary processes in the open ocean because they are the only living metazoan plankton with a good fossil record. Pteropods have been proposed as bioindicators to monitor the impacts of ocean acidification and in consequence have attracted considerable research interest; however, a robust evolutionary framework for the group is still lacking. Here we reconstruct their phylogenetic relationships and examine the evolutionary history of pteropods based on combined analyses of Cytochrome Oxidase I, 28S, and 18S ribosomal RNA sequences and a molecular clock calibrated using fossils and the estimated timing of the formation of the Isthmus of Panama. Euthecosomes with uncoiled shells were monophyletic with Creseis as the earliest diverging lineage, estimated at 41–38 million years ago (mya). The coiled euthecosomes (Limacina, Heliconoides, Thielea) were not monophyletic contrary to the accepted morphology-based taxonomy; however, due to their high rate heterogeneity no firm conclusions can be drawn. We found strong support for monophyly of most euthecosome genera, but Clio appeared as a polyphyletic group, and Diacavolinia grouped within Cavolinia, making the latter genus paraphyletic. The highest evolutionary rates were observed in Heliconoides inflatus and Limacina bulimoides for both 28S and 18S partitions. Using a fossil-calibrated phylogeny that sets the first occurrence of coiled euthecosomes at 79–66 mya, we estimate that uncoiled euthecosomes evolved 51–42 mya and that most extant uncoiled genera originated 40–15 mya. These findings are congruent with a molecular clock analysis using the Isthmus of Panama formation as an independent calibration. 
Although not all phylogenetic relationships could be resolved based on three molecular markers, this study provides a useful resource to study pteropod diversity and provides general insight into the processes that generate and maintain their diversity in the open ocean. PMID:28604805
Time-calibrated molecular phylogeny of pteropods.
Burridge, Alice K; Hörnlein, Christine; Janssen, Arie W; Hughes, Martin; Bush, Stephanie L; Marlétaz, Ferdinand; Gasca, Rebeca; Pierrot-Bults, Annelies C; Michel, Ellinor; Todd, Jonathan A; Young, Jeremy R; Osborn, Karen J; Menken, Steph B J; Peijnenburg, Katja T C A
2017-01-01
Pteropods are a widespread group of holoplanktonic gastropod molluscs and are uniquely suitable for study of long-term evolutionary processes in the open ocean because they are the only living metazoan plankton with a good fossil record. Pteropods have been proposed as bioindicators to monitor the impacts of ocean acidification and in consequence have attracted considerable research interest; however, a robust evolutionary framework for the group is still lacking. Here we reconstruct their phylogenetic relationships and examine the evolutionary history of pteropods based on combined analyses of Cytochrome Oxidase I, 28S, and 18S ribosomal RNA sequences and a molecular clock calibrated using fossils and the estimated timing of the formation of the Isthmus of Panama. Euthecosomes with uncoiled shells were monophyletic with Creseis as the earliest diverging lineage, estimated at 41-38 million years ago (mya). The coiled euthecosomes (Limacina, Heliconoides, Thielea) were not monophyletic contrary to the accepted morphology-based taxonomy; however, due to their high rate heterogeneity no firm conclusions can be drawn. We found strong support for monophyly of most euthecosome genera, but Clio appeared as a polyphyletic group, and Diacavolinia grouped within Cavolinia, making the latter genus paraphyletic. The highest evolutionary rates were observed in Heliconoides inflatus and Limacina bulimoides for both 28S and 18S partitions. Using a fossil-calibrated phylogeny that sets the first occurrence of coiled euthecosomes at 79-66 mya, we estimate that uncoiled euthecosomes evolved 51-42 mya and that most extant uncoiled genera originated 40-15 mya. These findings are congruent with a molecular clock analysis using the Isthmus of Panama formation as an independent calibration. 
Although not all phylogenetic relationships could be resolved based on three molecular markers, this study provides a useful resource to study pteropod diversity and provides general insight into the processes that generate and maintain their diversity in the open ocean.
Early evolution of the angiosperm clade Asteraceae in the Cretaceous of Antarctica
Barreda, Viviana D.; Palazzesi, Luis; Tellería, Maria C.; Olivero, Eduardo B.; Raine, J. Ian; Forest, Félix
2015-01-01
The Asteraceae (sunflowers and daisies) are the most diverse family of flowering plants. Despite their prominent role in extant terrestrial ecosystems, the early evolutionary history of this family remains poorly understood. Here we report the discovery of a number of fossil pollen grains preserved in dinosaur-bearing deposits from the Late Cretaceous of Antarctica that drastically pushes back the timing of assumed origin of the family. Reliably dated to ∼76–66 Mya, these specimens are about 20 million years older than previously known records for the Asteraceae. Using a phylogenetic approach, we interpreted these fossil specimens as members of an extinct early diverging clade of the family, associated with subfamily Barnadesioideae. Based on a molecular phylogenetic tree calibrated using fossils, including the ones reported here, we estimated that the most recent common ancestor of the family lived at least 80 Mya in Gondwana, well before the thermal and biogeographical isolation of Antarctica. Most of the early diverging lineages of the family originated in a narrow time interval after the K/P boundary, 60–50 Mya, coinciding with a pronounced climatic warming during the Late Paleocene and Early Eocene, and the scene of a dramatic rise in flowering plant diversity. Our age estimates reduce earlier discrepancies between the age of the fossil record and previous molecular estimates for the origin of the family, bearing important implications in the evolution of flowering plants in general. PMID:26261324
Phylogenetics links monster larva to deep-sea shrimp.
Bracken-Grissom, Heather D; Felder, Darryl L; Vollmer, Nicole L; Martin, Joel W; Crandall, Keith A
2012-10-01
Mid-water plankton collections commonly include bizarre and mysterious developmental stages that differ conspicuously from their adult counterparts in morphology and habitat. Unaware of the existence of planktonic larval stages, early zoologists often misidentified these unique morphologies as independent adult lineages. Many such mistakes have since been corrected by collecting larvae, raising them in the lab, and identifying the adult forms. However, challenges arise when the larva is remarkably rare in nature and relatively inaccessible due to its changing habitats over the course of ontogeny. The mid-water marine species Cerataspis monstrosa (Gray 1828) is an armored crustacean larva whose adult identity has remained a mystery for over 180 years. Our phylogenetic analyses, based in part on recent collections from the Gulf of Mexico, provide definitive evidence that the rare, yet broadly distributed larva, C. monstrosa, is an early developmental stage of the globally distributed deepwater aristeid shrimp, Plesiopenaeus armatus. Divergence estimates and phylogenetic relationships across five genes confirm the larva and adult are the same species. Our work demonstrates the diagnostic power of molecular systematics in instances where larval rearing seldom succeeds and morphology and habitat are not indicative of identity. Larval-adult linkages not only aid in our understanding of biodiversity, they provide insights into the life history, distribution, and ecology of an organism.
Phylogenetics links monster larva to deep-sea shrimp
Bracken-Grissom, Heather D; Felder, Darryl L; Vollmer, Nicole L; Martin, Joel W; Crandall, Keith A
2012-01-01
Mid-water plankton collections commonly include bizarre and mysterious developmental stages that differ conspicuously from their adult counterparts in morphology and habitat. Unaware of the existence of planktonic larval stages, early zoologists often misidentified these unique morphologies as independent adult lineages. Many such mistakes have since been corrected by collecting larvae, raising them in the lab, and identifying the adult forms. However, challenges arise when the larva is remarkably rare in nature and relatively inaccessible due to its changing habitats over the course of ontogeny. The mid-water marine species Cerataspis monstrosa (Gray 1828) is an armored crustacean larva whose adult identity has remained a mystery for over 180 years. Our phylogenetic analyses, based in part on recent collections from the Gulf of Mexico, provide definitive evidence that the rare, yet broadly distributed larva, C. monstrosa, is an early developmental stage of the globally distributed deepwater aristeid shrimp, Plesiopenaeus armatus. Divergence estimates and phylogenetic relationships across five genes confirm the larva and adult are the same species. Our work demonstrates the diagnostic power of molecular systematics in instances where larval rearing seldom succeeds and morphology and habitat are not indicative of identity. Larval–adult linkages not only aid in our understanding of biodiversity, they provide insights into the life history, distribution, and ecology of an organism. PMID:23145324
A Practical Guide to Estimating the Heritability of Pathogen Traits
Mitov, Venelin; Stadler, Tanja
2018-01-01
Pathogen traits, such as the virulence of an infection, can vary significantly between patients. A major challenge is to measure the extent to which genetic differences between infecting strains explain the observed variation of the trait. This is quantified by the trait’s broad-sense heritability, H². A recent discrepancy between estimates of the heritability of HIV-virulence has opened a debate on the estimators’ accuracy. Here, we show that the discrepancy originates from model limitations and important lifecycle differences between sexually reproducing organisms and transmittable pathogens. In particular, current quantitative genetics methods, such as donor–recipient regression of surveyed serodiscordant couples and the phylogenetic mixed model (PMM), are prone to underestimate H², because they neglect or do not fit to the loss of resemblance between transmission partners caused by within-host evolution. In a phylogenetic analysis of 8,483 HIV patients from the United Kingdom, we show that the phenotypic correlation between transmission partners decays with the amount of within-host evolution of the virus. We reproduce this pattern in toy-model simulations and show that a phylogenetic Ornstein–Uhlenbeck model (POUMM) outperforms the PMM in capturing this correlation pattern and in quantifying H². In particular, we show that POUMM outperforms PMM even in simulations without selection, because it captures this correlation pattern, which has not been appreciated until now. By cross-validating the POUMM estimates with ANOVA on closest phylogenetic pairs, we obtain H² ≈ 0.2, meaning ∼20% of the variation in HIV-virulence is explained by the virus genome both for European and African data. PMID:29329426
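The ANOVA cross-check on closest phylogenetic pairs mentioned in the abstract above can be illustrated with a one-way ANOVA intraclass correlation for groups of size two. This is a minimal sketch, not the authors' implementation: the function name `pair_anova_icc` and the trait values are made up for illustration, and the step of selecting closest phylogenetic pairs from a tree is assumed to have been done already.

```python
from statistics import mean

def pair_anova_icc(pairs):
    """Intraclass correlation from a one-way ANOVA with groups of size 2.

    `pairs` is a list of (trait_a, trait_b) tuples, one tuple per pair of
    closely related infections (hypothetical input format).
    """
    k = len(pairs)
    grand = mean(v for p in pairs for v in p)
    # Between-pair mean square: 2 observations per pair, k - 1 degrees of freedom.
    ms_between = 2 * sum((mean(p) - grand) ** 2 for p in pairs) / (k - 1)
    # Within-pair mean square: (a - b)^2 / 2 per pair, k degrees of freedom.
    ms_within = sum((a - b) ** 2 / 2 for a, b in pairs) / k
    # ICC = (MSB - MSW) / (MSB + (n - 1) * MSW) with group size n = 2.
    return (ms_between - ms_within) / (ms_between + ms_within)

# Made-up trait values for four pairs, for illustration only.
toy_pairs = [(4.1, 4.3), (5.0, 4.6), (3.2, 3.9), (4.8, 5.1)]
print(round(pair_anova_icc(toy_pairs), 3))
```

An intraclass correlation near 1 means pair members resemble each other far more than unrelated patients do. How that value maps onto heritability depends on assumptions that the abstract does not spell out.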
2012-01-01
Background: Although it has proven to be an important foundation for investigations of carnivoran ecology, biology and evolution, the complete species-level supertree for Carnivora of Bininda-Emonds et al. is showing its age. Additional, largely molecular sequence data are now available for many species and the advancement of computer technology means that many of the limitations of the original analysis can now be avoided. We therefore sought to provide an updated estimate of the phylogenetic relationships within all extant Carnivora, again using supertree analysis to be able to analyze as much of the global phylogenetic database for the group as possible. Results: In total, 188 source trees were combined, representing 114 trees from the literature together with 74 newly constructed gene trees derived from nearly 45,000 bp of sequence data from GenBank. The greater availability of sequence data means that the new supertree is almost completely resolved and also better reflects current phylogenetic opinion (for example, supporting a monophyletic Mephitidae, Eupleridae and Prionodontidae; placing Nandinia binotata as sister to the remaining Feliformia). Following an initial rapid radiation, diversification rate analyses indicate a downturn in the net speciation rate within the past three million years as well as a possible increase some 18.0 million years ago; numerous diversification rate shifts within the order were also identified. Conclusions: Together, the two carnivore supertrees remain the only complete phylogenetic estimates for all extant species and the new supertree, like the old one, will form a key tool in helping us to further understand the biology of this charismatic group of carnivores. PMID:22369503
A Bayesian framework to estimate diversification rates and their variation through time and space
2011-01-01
Background: Patterns of species diversity are the result of speciation and extinction processes, and molecular phylogenetic data can provide valuable information to derive their variability through time and across clades. Bayesian Markov chain Monte Carlo methods offer a promising framework to incorporate phylogenetic uncertainty when estimating rates of diversification. Results: We introduce a new approach to estimate diversification rates in a Bayesian framework over a distribution of trees under various constant and variable rate birth-death and pure-birth models, and test it on simulated phylogenies. Furthermore, speciation and extinction rates and their posterior credibility intervals can be estimated while accounting for non-random taxon sampling. The framework is particularly suitable for hypothesis testing using Bayes factors, as we demonstrate analyzing dated phylogenies of Chondrostoma (Cyprinidae) and Lupinus (Fabaceae). In addition, we develop a model that extends the rate estimation to a meta-analysis framework in which different data sets are combined in a single analysis to detect general temporal and spatial trends in diversification. Conclusions: Our approach provides a flexible framework for the estimation of diversification parameters and hypothesis testing while simultaneously accounting for uncertainties in the divergence times and incomplete taxon sampling. PMID:22013891
Al-Atiyat, R M; Aljumaah, R S
2014-08-27
This study aimed to estimate evolutionary distances and to reconstruct phylogenetic trees for different Awassi sheep populations. Thirty-two sheep from three different geographical areas of Jordan and the Kingdom of Saudi Arabia (KSA) were randomly sampled. DNA was extracted from the tissue samples and sequenced using the T7 promoter universal primer. Different phylogenetic trees were reconstructed from 0.64-kb DNA sequences using the MEGA software with the best-fitting general time reversible distance model. Three methods of distance estimation were then used. The maximum composite likelihood test was considered for reconstructing maximum likelihood, neighbor-joining and UPGMA trees. The maximum likelihood tree indicated three major clusters separated by cytosine (C) and thymine (T). The greatest distance was shown between the South sheep and North sheep. On the other hand, the KSA sheep as an outgroup showed a shorter evolutionary distance to the North sheep population than to the others. The neighbor-joining and UPGMA trees showed quite reliable clusters of evolutionary differentiation of Jordan sheep populations from the Saudi population. The overall results support geographical information and ecological types of the sheep populations studied. In summary, the resulting phylogenetic trees may add to the limited information about the genetic relatedness and phylogeny of Awassi sheep in nearby Arab countries.
Does History Repeat Itself? Wavelets and the Phylodynamics of Influenza A
Tom, Jennifer A.; Sinsheimer, Janet S.; Suchard, Marc A.
2012-01-01
Unprecedented global surveillance of viruses will result in massive sequence data sets that require new statistical methods. These data sets press the limits of Bayesian phylogenetics as the high-dimensional parameters that comprise a phylogenetic tree increase the already sizable computational burden of these techniques. This burden often results in partitioning the data set, for example, by gene, and inferring the evolutionary dynamics of each partition independently, a compromise that results in stratified analyses that depend only on data within a given partition. However, parameter estimates inferred from these stratified models are likely strongly correlated, considering they rely on data from a single data set. To overcome this shortfall, we exploit the existing Monte Carlo realizations from stratified Bayesian analyses to efficiently estimate a nonparametric hierarchical wavelet-based model and learn about the time-varying parameters of effective population size that reflect levels of genetic diversity across all partitions simultaneously. Our methods are applied to complete genome influenza A sequences that span 13 years. We find that broad peaks and trends, as opposed to seasonal spikes, in the effective population size history distinguish individual segments from the complete genome. We also address hypotheses regarding intersegment dynamics within a formal statistical framework that accounts for correlation between segment-specific parameters. PMID:22160768
Effects of Phylogenetic Tree Style on Student Comprehension
NASA Astrophysics Data System (ADS)
Dees, Jonathan Andrew
Phylogenetic trees are powerful tools of evolutionary biology that have become prominent across the life sciences. Consequently, learning to interpret and reason from phylogenetic trees is now an essential component of biology education. However, students often struggle to understand these diagrams, even after explicit instruction. One factor that has been observed to affect student understanding of phylogenetic trees is style (i.e., diagonal or bracket). The goal of this dissertation research was to systematically explore effects of style on student interpretations and construction of phylogenetic trees in the context of an introductory biology course. Before instruction, students were significantly more accurate with bracket phylogenetic trees for a variety of interpretation and construction tasks. Explicit instruction that balanced the use of diagonal and bracket phylogenetic trees mitigated some, but not all, style effects. After instruction, students were significantly more accurate for interpretation tasks involving taxa relatedness and construction exercises when using the bracket style. Based on this dissertation research and prior studies on style effects, I advocate for introductory biology instructors to use only the bracket style. Future research should examine causes of style effects and variables other than style to inform the development of research-based instruction that best supports student understanding of phylogenetic trees.
Buchwalter, D.B.; Cain, D.J.; Martin, C.A.; Xie, Lingtian; Luoma, S.N.; Garland, T.
2008-01-01
We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. © 2008 by The National Academy of Sciences of the USA.
Higher speciation and lower extinction rates influence mammal diversity gradients in Asia.
Tamma, Krishnapriya; Ramakrishnan, Uma
2015-02-04
Little is known about the patterns and correlates of mammal diversity gradients in Asia. In this study, we examine patterns of species distributions and phylogenetic diversity in Asia and investigate if the observed diversity patterns are associated with differences in diversification rates between the tropical and non-tropical regions. We used species distribution maps and phylogenetic trees to generate species and phylogenetic diversity measures for 1° × 1° cells across mainland Asia. We constructed lineage-through-time plots and estimated diversification shift-times to examine the temporal patterns of diversifications across orders. Finally, we tested if the observed gradients in Asia could be associated with geographical differences in diversification rates across the tropical and non-tropical biomes. We estimated speciation, extinction and dispersal rates across these two regions for mammals, both globally and for Asian mammals. Our results demonstrate strong latitudinal and longitudinal gradients of species and phylogenetic diversity with Southeast Asia and the Himalayas showing highest diversity. Importantly, our results demonstrate that differences in diversification (speciation, extinction and dispersal) rates between the tropical and the non-tropical biomes influence the observed diversity gradients globally and in Asia. For the first time, we demonstrate that Asian tropics act as both cradles and museums of mammalian diversity. Temporal and spatial variation in diversification rates across different lineages of mammals is an important correlate of species diversity gradients observed in Asia.
Differences in Performance Among Test Statistics for Assessing Phylogenomic Model Adequacy.
Duchêne, David A; Duchêne, Sebastian; Ho, Simon Y W
2018-05-18
Statistical phylogenetic analyses of genomic data depend on models of nucleotide or amino acid substitution. The adequacy of these substitution models can be assessed using a number of test statistics, allowing the model to be rejected when it is found to provide a poor description of the evolutionary process. A potentially valuable use of model-adequacy test statistics is to identify when data sets are likely to produce unreliable phylogenetic estimates, but their differences in performance are rarely explored. We performed a comprehensive simulation study to identify test statistics that are sensitive to some of the most commonly cited sources of phylogenetic estimation error. Our results show that, for many test statistics, traditional thresholds for assessing model adequacy can fail to reject the model when the phylogenetic inferences are inaccurate and imprecise. This is particularly problematic when analysing loci that have few variable informative sites. We propose new thresholds for assessing substitution model adequacy and demonstrate their effectiveness in analyses of three phylogenomic data sets. These thresholds lead to frequent rejection of the model for loci that yield topological inferences that are imprecise and are likely to be inaccurate. We also propose the use of a summary statistic that provides a practical assessment of overall model adequacy. Our approach offers a promising means of enhancing model choice in genome-scale data sets, potentially leading to improvements in the reliability of phylogenomic inference.
Stevenson, Pablo R.; Link, Andrés; González-Caro, Sebastian; Torres-Jiménez, María Fernanda
2015-01-01
Frugivory is a widespread mutualistic interaction in which frugivores obtain nutritional resources while favoring plant recruitment through their seed dispersal services. Nonetheless, how these complex interactions are organized in diverse communities, such as tropical forests, is not fully understood. In this study we evaluated the existence of plant-frugivore sub-assemblages and their phylogenetic organization in an undisturbed western Amazonian forest in Colombia. We also explored for potential keystone plants, based on network analyses and an estimate of the amount of fruit going from plants to frugivores. We carried out diurnal observations on 73 canopy plant species during a period of two years. During focal tree sampling, we recorded frugivore identity, the duration of each individual visit, and feeding rates. We did not find support for the existence of sub assemblages, such as specialized vs. generalized dispersal systems. Visitation rates on the vast majority of canopy species were associated with the relative abundance of frugivores, in which ateline monkeys (i.e. Lagothrix and Ateles) played the most important roles. All fruiting plants were visited by a variety of frugivores and the phylogenetic assemblage was random in more than 67% of the cases. In cases of aggregation, the plant species were consumed by only primates or only birds, and filters were associated with fruit protection and likely chemical content. Plants suggested as keystone species based on the amount of pulp going from plants to frugivores differ from those suggested based on network approaches. Our results suggest that in tropical forests most tree-frugivore interactions are generalized, and abundance should be taken into account when assessing the most important plants for frugivores. PMID:26492037
SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.
Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H
2014-02-26
Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.
Phylogeny of economically important insect pests that infesting several crops species in Malaysia
NASA Astrophysics Data System (ADS)
Ghazali, Siti Zafirah; Zain, Badrul Munir Md.; Yaakop, Salmah
2014-09-01
This paper reports molecular data on insect pests of commercial crops in Peninsular Malaysia. Fifteen insect pest species were sampled: eleven (Metisa plana, Calliteara horsefeldii, Cotesia vestalis, Bactrocera papayae, Bactrocera carambolae, Bactrocera latifrons, Conopomorpha cramella, Sesamia inferens, Chilo polychrysa, Rhynchophorus vulneratus, and Rhynchophorus ferrugineus) from nine crops (oil palm, coconut, paddy, cocoa, starfruit, angled loofah, guava, chili and mustard), plus four further species comprising a fern pest (Herpetogramma platycapna) and storage and rice pests (Tribolium castaneum, Oryzaephilus surinamensis and Cadra cautella). The presented phylogeny summarizes an initial phylogenetic hypothesis for these economically important insect pests. Phylogenetic relationships among 39 individuals of 15 species, belonging to 12 genera in three orders, were inferred from DNA sequences of a mitochondrial marker, cytochrome oxidase subunit I (COI), and a nuclear marker, the ribosomal DNA 28S D2 region. The phylogenies resulting from the analyses of the two genes are relatively similar, but differ in the sequence of evolution. Interestingly, Bayesian inference analysis of the COI sequences yielded a more resolved phylogeny than the 28S sequences, one that corroborates traditional hypotheses of holometabolan relationships and most recent molecular studies. These findings provide information on the relationships of pest species infesting several crops in Malaysia and an estimate of the relationships among holometabolan orders. With the support of the phylogenetic tree, the larval stages of insect pests can be identified accurately without waiting for the adults to emerge.
Metabolic Pathway Assignment of Plant Genes based on Phylogenetic Profiling–A Feasibility Study
Weißenborn, Sandra; Walther, Dirk
2017-01-01
Despite many developed experimental and computational approaches, functional gene annotation remains challenging. With the rapidly growing number of sequenced genomes, the concept of phylogenetic profiling, which predicts functional links between genes that share a common co-occurrence pattern across different genomes, has gained renewed attention as it promises to annotate gene functions based on presence/absence calls alone. We applied phylogenetic profiling to the problem of metabolic pathway assignments of plant genes with a particular focus on secondary metabolism pathways. We determined phylogenetic profiles for 40,960 metabolic pathway enzyme genes with assigned EC numbers from 24 plant species based on sequence and pathway annotation data from KEGG and Ensembl Plants. For gene sequence family assignments, needed to determine the presence or absence of particular gene functions in the given plant species, we included data of all 39 species available at the Ensembl Plants database and established gene families based on pairwise sequence identities and annotation information. Aside from performing profiling comparisons, we used machine learning approaches to predict pathway associations from phylogenetic profiles alone. Selected metabolic pathways were indeed found to be composed of gene families of greater than expected phylogenetic profile similarity. This was particularly evident for primary metabolism pathways, whereas for secondary pathways, both the available annotation in different species as well as the abstraction of functional association via distinct pathways proved limiting. While phylogenetic profile similarity was generally not found to correlate with gene co-expression, direct physical interactions of proteins were reflected by a significantly increased profile similarity suggesting an application of phylogenetic profiling methods as a filtering step in the identification of protein-protein interactions. 
This feasibility study highlights the potential and challenges associated with phylogenetic profiling methods for the detection of functional relationships between genes as well as the need to enlarge the set of plant genes with proven secondary metabolism involvement as well as the limitations of distinct pathways as abstractions of relationships between genes. PMID:29163570
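The profile comparison described in this abstract can be illustrated with a minimal sketch. It assumes that each gene family is represented as a 0/1 presence/absence vector over the same ordered set of genomes and that similarity is scored with the Jaccard index; the abstract does not state which similarity measure the authors used, and the family names and vectors below are hypothetical.

```python
def jaccard_similarity(profile_a, profile_b):
    """Jaccard similarity of two presence/absence profiles (sequences of 0/1)."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same set of genomes")
    # Genomes in which both families are present.
    both = sum(1 for a, b in zip(profile_a, profile_b) if a and b)
    # Genomes in which at least one family is present.
    either = sum(1 for a, b in zip(profile_a, profile_b) if a or b)
    return both / either if either else 0.0


# Hypothetical profiles over six genomes (illustration only, not real data).
profiles = {
    "family_A": [1, 1, 0, 1, 0, 1],
    "family_B": [1, 1, 0, 1, 0, 0],
    "family_C": [0, 0, 1, 0, 1, 0],
}

similar = jaccard_similarity(profiles["family_A"], profiles["family_B"])
dissimilar = jaccard_similarity(profiles["family_A"], profiles["family_C"])
```

Families with high pairwise similarity would then be candidates for a shared pathway; a real analysis would also need to correct for the similarity expected from the species phylogeny alone.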
Kolanowska, Marta; Mystkowska, Katarzyna; Kras, Marta; Dudek, Magdalena; Konowalik, Kamil
2016-01-01
The location of possible glacial refugia of six Apostasioideae representatives is estimated based on ecological niche modeling analysis. The distribution of their suitable niches during the last glacial maximum (LGM) is compared with their current potential and documented geographical ranges. The climatic factors limiting the studied species occurrences are evaluated and the niche overlap between the studied orchids is assessed and discussed. The predicted niche occupancy profiles and reconstruction of ancestral climatic tolerances suggest a high level of phylogenetic niche conservatism within Apostasioideae.
Increased phylogenetic resolution using target enrichment in Rubus
USDA-ARS's Scientific Manuscript database
Phylogenetic analyses in Rubus L. have been challenging due to polyploidy, hybridization, and apomixis within the genus. Wide morphological diversity occurs within and between species, contributing to challenges at lower and higher systematic levels. Phylogenetic inferences to date have been based o...
Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies.
Sayyari, Erfan; Mirarab, Siavash
2018-02-28
Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest.
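The idea behind a quartet-frequency test can be sketched as follows. Under the multi-species coalescent, a branch of zero length (a true polytomy) makes the three possible quartet topologies around that branch equally frequent among gene trees. The sketch below assumes a plain Pearson chi-square goodness-of-fit test against equal frequencies; it is an illustration of the principle and not the ASTRAL implementation, and the function name and counts are hypothetical.

```python
import math


def polytomy_chi_square(n1, n2, n3):
    """Test equal frequencies of three quartet topologies (null: polytomy).

    Returns the Pearson chi-square statistic and its p-value with 2 degrees
    of freedom. For 2 degrees of freedom the chi-square survival function
    has the closed form exp(-x / 2), so no external library is needed.
    """
    total = n1 + n2 + n3
    if total <= 0:
        raise ValueError("need at least one observed quartet")
    expected = total / 3.0
    statistic = sum((n - expected) ** 2 / expected for n in (n1, n2, n3))
    p_value = math.exp(-statistic / 2.0)
    return statistic, p_value


# Hypothetical counts: balanced counts are consistent with a polytomy,
# strongly skewed counts reject it.
balanced = polytomy_chi_square(100, 100, 100)
skewed = polytomy_chi_square(200, 50, 50)
```

A small p-value rejects the polytomy null, which supports keeping the branch; a large p-value means the data cannot rule out a polytomy.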
Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies
Sayyari, Erfan
2018-01-01
Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest. PMID:29495636
Lateral Gene Transfer Dynamics in the Ancient Bacterial Genus Streptomyces
McDonald, Bradon R.
2017-01-01
Lateral gene transfer (LGT) profoundly shapes the evolution of bacterial lineages. LGT across disparate phylogenetic groups and genome content diversity between related organisms suggest a model of bacterial evolution that views LGT as rampant and promiscuous. It has even driven the argument that species concepts and tree-based phylogenetics cannot be applied to bacteria. Here, we show that acquisition and retention of genes through LGT are surprisingly rare in the ubiquitous and biomedically important bacterial genus Streptomyces. Using a molecular clock, we estimate that the Streptomyces bacteria are ~380 million years old, indicating that this bacterial genus is as ancient as land vertebrates. Calibrating LGT rate to this geologic time span, we find that on average only 10 genes per million years were acquired and subsequently maintained. Over that same time span, Streptomyces accumulated thousands of point mutations. By explicitly incorporating evolutionary timescale into our analyses, we provide a dramatically different view on the dynamics of LGT and its impact on bacterial evolution. PMID:28588130
PAL: an object-oriented programming library for molecular evolution and phylogenetics.
Drummond, A; Strimmer, K
2001-07-01
Phylogenetic Analysis Library (PAL) is a collection of Java classes for use in molecular evolution and phylogenetics. PAL provides a modular environment for the rapid construction of both special-purpose and general analysis programs. PAL version 1.1 consists of 145 public classes or interfaces in 13 packages, including classes for models of character evolution, maximum-likelihood estimation, and the coalescent, with a total of more than 27000 lines of code. The PAL project is set up as a collaborative project to facilitate contributions from other researchers. AVAILABILITY: The program is free and is available at http://www.pal-project.org. It requires Java 1.1 or later. PAL is licensed under the GNU General Public License.
Li, H; Liu, J; Xiong, L; Zhang, H; Zhou, H; Yin, H; Jing, W; Li, J; Shi, Q; Wang, Y; Liu, J; Nie, L
2017-05-01
The softshell turtles (Trionychidae) are one of the most widely distributed reptile groups in the world, and fossils have been found on all continents except Antarctica. The phylogenetic relationships among members of this group have been previously studied; however, disagreements regarding its taxonomy persist, and its phylogeography and divergence times are still poorly understood. Here, we present a comprehensive mitogenomic study of softshell turtles. We sequenced the complete mitochondrial genomes of 10 softshell turtles, in addition to the GenBank sequences of Dogania subplana, Lissemys punctata and Trionyx triunguis, which cover all extant genera within Trionychidae except for Cyclanorbis and Cycloderma. These data were combined with other mitogenomes of turtles for phylogenetic analyses. Divergence time calibration and ancestral reconstruction were calculated using BEAST and RASP software, respectively. Our phylogenetic analyses indicate that Trionychidae is the sister taxon of Carettochelyidae, and support the monophyly of Trionychinae and Cyclanorbinae, which is consistent with morphological data and molecular analysis. Our phylogenetic analyses have established a sister taxon relationship between the Asian Rafetus and the Asian Palea + Pelodiscus + Dogania + Nilssonia + Amyda, whereas a previous study grouped the Asian Rafetus with the American Apalone. The results of divergence time estimates and ancestral area reconstruction show that extant Trionychidae originated in Asia at around 108 million years ago (Ma), and radiations mainly occurred during two warm periods, namely Late Cretaceous-Early Eocene and Oligocene. By combining the estimated divergence time and the reconstructed ancestral area of softshell turtles, we determined that the dispersal of softshell turtles out of Asia may have taken three routes. 
Furthermore, the times of dispersal seem to be in agreement with the time of the India-Asia collision and opening of the Bering Strait, which provide evidence for the accuracy of our estimation of divergence time. Overall, the mitogenomes of this group were used to explore the origin and dispersal route of Trionychidae and have provided new insights on the evolution of this group. © 2017 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2017 European Society For Evolutionary Biology.
Usui, Takuji; Butchart, Stuart H M; Phillimore, Albert B
2017-03-01
There are wide reports of advances in the timing of spring migration of birds over time and in relation to rising temperatures, though phenological responses vary substantially within and among species. An understanding of the ecological, life-history and geographic variables that predict this intra- and interspecific variation can guide our projections of how populations and species are likely to respond to future climate change. Here, we conduct phylogenetic meta-analyses addressing slope estimates of the timing of avian spring migration regressed on (i) year and (ii) temperature, representing a total of 413 species across five continents. We take into account slope estimation error and examine phylogenetic, ecological and geographic predictors of intra- and interspecific variation. We confirm earlier findings that on average birds have significantly advanced their spring migration time by 2·1 days per decade and 1·2 days °C⁻¹. We find that over time and in response to warmer spring conditions, short-distance migrants have advanced spring migratory phenology by more than long-distance migrants. We also find that larger bodied species show greater advance over time compared to smaller bodied species. Our results did not reveal any evidence that interspecific variation in migration response is predictable on the basis of species' habitat or diet. We detected a substantial phylogenetic signal in migration time in response to both year and temperature, suggesting that some of the shifts in migratory phenological response to climate are predictable on the basis of phylogeny. However, we estimate high levels of species and spatial variance relative to phylogenetic variance, which is consistent with plasticity in response to climate evolving fairly rapidly and being more influenced by adaptation to current local climate than by common descent. On average, avian spring migration times have advanced over time and as spring has become warmer. 
While we are able to identify predictors that explain some of the true among-species variation in response, substantial intra- and interspecific variation in migratory response remains to be explained. © 2016 The Authors. Journal of Animal Ecology published by John Wiley & Sons Ltd on behalf of British Ecological Society.
Shi, Cheng-Min; Yang, Ziheng
2018-01-01
The phylogenetic relationships among extant gibbon species remain unresolved despite numerous efforts using morphological, behavioral, and genetic data and the sequencing of whole genomes. A major challenge in reconstructing the gibbon phylogeny is the radiative speciation process, which resulted in extremely short internal branches in the species phylogeny and extensive incomplete lineage sorting with extensive gene-tree heterogeneity across the genome. Here, we analyze two genomic-scale data sets, with ∼10,000 putative noncoding and exonic loci, respectively, to estimate the species tree for the major groups of gibbons. We used the Bayesian full-likelihood method bpp under the multispecies coalescent model, which naturally accommodates incomplete lineage sorting and uncertainties in the gene trees. For comparison, we included three heuristic coalescent-based methods (mp-est, SVDQuartets, and astral) as well as concatenation. From both data sets, we infer the phylogeny for the four extant gibbon genera to be (Hylobates, (Nomascus, (Hoolock, Symphalangus))). We used simulation guided by the real data to evaluate the accuracy of the methods used. Astral, while not as efficient as bpp, performed well in estimation of the species tree even in presence of excessive incomplete lineage sorting. Concatenation, mp-est and SVDQuartets were unreliable when the species tree contains very short internal branches. Likelihood ratio test of gene flow suggests a small amount of migration from Hylobates moloch to H. pileatus, while cross-genera migration is absent or rare. Our results highlight the utility of coalescent-based methods in addressing challenging species tree problems characterized by short internal branches and rampant gene tree-species tree discordance. PMID:29087487
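The parenthetical tree reported in this abstract is written in a Newick-like notation. As a reading aid, the sketch below parses such a string into nested tuples and lists the clades it implies. The parser is a hypothetical helper that handles only names, commas and parentheses (no branch lengths or support values); it is not part of bpp or any other tool named above.

```python
def parse_newick(text):
    """Parse a names-and-parentheses Newick string into nested tuples."""
    tokens = text.replace(" ", "").rstrip(";")
    pos = 0

    def parse():
        nonlocal pos
        if tokens[pos] == "(":
            pos += 1
            children = [parse()]
            while tokens[pos] == ",":
                pos += 1
                children.append(parse())
            if tokens[pos] != ")":
                raise ValueError("expected ')' at position %d" % pos)
            pos += 1
            return tuple(children)
        # Otherwise read a leaf name up to the next structural character.
        start = pos
        while pos < len(tokens) and tokens[pos] not in ",()":
            pos += 1
        return tokens[start:pos]

    tree = parse()
    if pos != len(tokens):
        raise ValueError("unexpected trailing characters")
    return tree


def leaves(node):
    """Return the leaf names below a node, in left-to-right order."""
    if isinstance(node, str):
        return [node]
    return [name for child in node for name in leaves(child)]


def clades(node):
    """Return the leaf set of every internal node, root first."""
    if isinstance(node, str):
        return []
    found = [frozenset(leaves(node))]
    for child in node:
        found.extend(clades(child))
    return found


gibbons = parse_newick("(Hylobates, (Nomascus, (Hoolock, Symphalangus)))")
gibbon_clades = clades(gibbons)
```

Read this way, the reported topology groups Hoolock with Symphalangus, then adds Nomascus, with Hylobates as the first-diverging genus.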
Evolutionary origin and early biogeography of otophysan fishes (Ostariophysi: Teleostei).
Chen, Wei-Jen; Lavoué, Sébastien; Mayden, Richard L
2013-08-01
The biogeography of the mega-diverse, freshwater, and globally distributed Otophysi has received considerable attention. This attraction largely stems from assumptions as to their ancient origin, the clade being almost exclusively freshwater, and their suitability as to explanations of trans-oceanic distributions. Despite multiple hypotheses explaining present-day distributions, problems remain, precluding more parsimonious explanations. Underlying previous hypotheses are alternative phylogenies for Otophysi, uncertainties as to temporal diversification and assumptions integral to various explanations. We reexamine the origin and early diversification of this clade based on a comprehensive time-calibrated, molecular-based phylogenetic analysis and event-based approaches for ancestral range inference of lineages. Our results do not corroborate current phylogenetic classifications of otophysans. We demonstrate Siluriformes are never sister to Gymnotiformes and Characiformes are most likely nonmonophyletic. Divergence time estimates specify a split between Cypriniformes and Characiphysi with the fragmentation of Pangea. The early diversification of characiphysans either predated, or was contemporary with, the separation of Africa and South America, and involved a combination of within- and between-continental divergence events for these lineages. The intercontinental diversification of siluroids and characoids postdated major intercontinental tectonic fragmentations (<90 Mya). Post-tectonic drift dispersal events are hypothesized to account for their current distribution patterns. © 2013 The Author(s). Evolution © 2013 The Society for the Study of Evolution.
Tree-Based Unrooted Phylogenetic Networks.
Francis, A; Huber, K T; Moulton, V
2018-02-01
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.
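The definition given in this abstract (a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X) can be checked mechanically. The sketch below is a minimal illustration with hypothetical function names and example graphs; it tests the definition only and does not decide whether a network is tree-based, which is the harder question the paper studies.

```python
from collections import deque


def is_unrooted_network(edges, taxa):
    """Check the abstract's definition of an unrooted phylogenetic network on taxa."""
    adjacency = {}
    for u, v in edges:
        if u == v:
            return False  # a loop violates simplicity
        adjacency.setdefault(u, set())
        adjacency.setdefault(v, set())
        if v in adjacency[u]:
            return False  # a repeated edge violates simplicity
        adjacency[u].add(v)
        adjacency[v].add(u)
    if not adjacency:
        return False
    # Every vertex must have degree 1 (leaf) or 3 (internal).
    if any(len(nbrs) not in (1, 3) for nbrs in adjacency.values()):
        return False
    # The degree-1 vertices must be exactly the taxon set X.
    if {v for v, nbrs in adjacency.items() if len(nbrs) == 1} != set(taxa):
        return False
    # Breadth-first search to confirm the graph is connected.
    start = next(iter(adjacency))
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for w in adjacency[u]:
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return len(seen) == len(adjacency)


def is_phylogenetic_tree(edges, taxa):
    """A network is a tree when the connected graph has |V| - 1 edges."""
    vertices = {v for edge in edges for v in edge}
    return is_unrooted_network(edges, taxa) and len(edges) == len(vertices) - 1


# Hypothetical examples on X = {a, b, c, d}.
quartet_tree = [("a", "u"), ("b", "u"), ("u", "v"), ("c", "v"), ("d", "v")]
cycle_network = [("a", "p"), ("b", "q"), ("c", "r"), ("d", "s"),
                 ("p", "q"), ("q", "r"), ("r", "s"), ("s", "p")]
X = {"a", "b", "c", "d"}
```

Here `quartet_tree` satisfies both checks, while `cycle_network` is a valid network that contains a cycle and is therefore not a tree.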
KAPRAUN, DONALD F.
2005-01-01
• Background and Aims Multicellular eukaryotic algae are phylogenetically disparate. Nuclear DNA content estimates have been published for fewer than 1 % of the described species of Chlorophyta, Phaeophyta and Rhodophyta. The present investigation aims to summarize the state of our knowledge and to add substantially to our database of C-values for these algae. • Methods The DNA-localizing fluorochrome DAPI (4′, 6-diamidino-2-phenylindole) and RBC (chicken erythrocyte) standard were used to estimate 2C values with static microspectrophotometry. • Key Results 2C DNA contents for 85 species of Chlorophyta range from 0·2–6·1 pg, excluding the highly polyploid Charales and Desmidiales with DNA contents of up to 39·2 and 20·7 pg, respectively. 2C DNA contents for 111 species of Rhodophyta range from 0·1–2·8 pg, and for 44 species of Phaeophyta range from 0·2–1·8 pg. • Conclusions New availability of consensus higher-level molecular phylogenies provides a framework for viewing C-value data in a phylogenetic context. Both DNA content ranges and mean values are greater in taxa considered to be basal. It is proposed that the basal, ancestral genome in each algal group was quite small. Both mechanistic and ecological processes are discussed that could have produced the observed C-value ranges. PMID:15596456
Estimating hybridization in the presence of coalescence using phylogenetic intraspecific sampling.
Gerard, David; Gibbs, H Lisle; Kubatko, Laura
2011-10-06
A well-known characteristic of multi-locus data is that each locus has its own phylogenetic history which may differ substantially from the overall phylogenetic history of the species. Although the possibility that this arises through incomplete lineage sorting is often incorporated in models for the species-level phylogeny, it is much less common for hybridization to also be formally included in such models. We have modified the evolutionary model of Meng and Kubatko (2009) to incorporate intraspecific sampling of multiple individuals for estimation of speciation times and times of hybridization events for testing for hybridization in the presence of incomplete lineage sorting. We have also utilized a more efficient algorithm for obtaining our estimates. Using simulations, we demonstrate that our approach performs well under conditions motivated by an empirical data set for Sistrurus rattlesnakes where putative hybridization has occurred. We further demonstrate that the method is able to accurately detect the signature of hybridization in the data, while this signal may be obscured when other species-tree inference methods that ignore hybridization are used. Our approach is shown to be powerful in detecting hybridization when it is present. When applied to the Sistrurus data, we find no evidence of hybridization; instead, it appears that putative hybrid snakes in Missouri are most likely pure S. catenatus tergeminus in origin, which has significant conservation implications.
Prates, Ivan; Melo-Sampaio, Paulo Roberto; Drummond, Leandro de Oliveira; Teixeira, Mauro; Rodrigues, Miguel Trefaut; Carnaval, Ana Carolina
2017-08-01
Data on species ranges and phylogenetic relationships are key in historical biogeographical inference. In South America, our understanding of the evolutionary processes that underlie biodiversity patterns varies greatly across regions. Little is known, for instance, about the drivers of high endemism in the southern montane region of the Atlantic Rainforest. In this region, former biogeographic connections with other South American ecosystems have been invoked to explain the phylogenetic affinities of a number of endemic taxa. This may also be the case of the montane anole lizards Anolis nasofrontalis and A. pseudotigrinus, known from few specimens collected more than 40 years ago. We combine new genetic data with published sequences of species in the Dactyloa clade of Anolis to investigate the phylogenetic relationships of A. nasofrontalis and A. pseudotigrinus, as well as estimate divergence times from their closest relatives. Based on newly sampled and previously overlooked specimens, we provide a taxonomic re-description of those two taxa. Our phylogenetic analysis recovered six main clades within Dactyloa, five of which were previously referred to as species series (aequatorialis, heterodermus, latifrons, punctatus, roquet). A sixth clade clustered A. nasofrontalis and A. pseudotigrinus with A. dissimilis from western Amazonia, A. calimae from the Andes, A. neblininus from the Guiana Shield, and two undescribed Andean taxa. We therefore define a sixth species series within Dactyloa: the neblininus series. Close phylogenetic relationships between highly disjunct, narrowly-distributed anoles suggest that patches of suitable habitat connected the southern Atlantic Forest to western South America during the Miocene, in agreement with the age of former connections between the central Andes and the Brazilian Shield as a result of Andean orogeny. 
The data also support the view of recurrent evolution (or loss) of a twig anole-like phenotype in mainland anoles, in apparent association with the occurrence in montane settings. Our findings stress the value of complementary genetic sampling efforts across South American countries to advance studies of mainland anole taxonomy and evolution. Copyright © 2017 Elsevier Inc. All rights reserved.
Improved Maximum Parsimony Models for Phylogenetic Networks.
Van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2018-05-01
Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.
Liu, Chi; Yao, Minjie; Stegen, James C.; ...
2017-12-13
How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. Here, we started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightly increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Chi; Yao, Minjie; Stegen, James C.
How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. Here, we started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightly increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.
Liu, Chi; Yao, Minjie; Stegen, James C; Rui, Junpeng; Li, Jiabao; Li, Xiangzhen
2017-12-13
How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. We started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightly increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.
Cultural Phylogenetics of the Tupi Language Family in Lowland South America
Walker, Robert S.; Wichmann, Søren; Mailund, Thomas; Atkisson, Curtis J.
2012-01-01
Background Recent advances in automated assessment of basic vocabulary lists allow the construction of linguistic phylogenies useful for tracing dynamics of human population expansions, reconstructing ancestral cultures, and modeling transition rates of cultural traits over time. Methods Here we investigate the Tupi expansion, a widely-dispersed language family in lowland South America, with a distance-based phylogeny based on 40-word vocabulary lists from 48 languages. We coded 11 cultural traits across the diverse Tupi family including traditional warfare patterns, post-marital residence, corporate structure, community size, paternity beliefs, sibling terminology, presence of canoes, tattooing, shamanism, men's houses, and lip plugs. Results/Discussion The linguistic phylogeny supports a Tupi homeland in west-central Brazil with subsequent major expansions across much of lowland South America. Consistently, ancestral reconstructions of cultural traits over the linguistic phylogeny suggest that social complexity has tended to decline through time, most notably in the independent emergence of several nomadic hunter-gatherer societies. Estimated rates of cultural change across the Tupi expansion are on the order of only a few changes per 10,000 years, in accord with previous cultural phylogenetic results in other language families around the world, and indicate a conservative nature to much of human culture. PMID:22506065
Winkler, Isaac S; Blaschke, Jeremy D; Davis, Daniel J; Stireman, John O; O'Hara, James E; Cerretti, Pierfilippo; Moulton, John K
2015-07-01
Molecular phylogenetic studies at all taxonomic levels often infer rapid radiation events based on short, poorly resolved internodes. While such rapid episodes of diversification are an important and widespread evolutionary phenomenon, much of this poor phylogenetic resolution may be attributed to the continuing widespread use of "traditional" markers (mitochondrial, ribosomal, and some nuclear protein-coding genes) that are often poorly suited to resolve difficult, higher-level phylogenetic problems. Here we reconstruct phylogenetic relationships among a representative set of taxa of the parasitoid fly family Tachinidae and related outgroups of the superfamily Oestroidea. The Tachinidae are one of the most species rich, yet evolutionarily recent families of Diptera, providing an ideal case study for examining the differential performance of loci in resolving phylogenetic relationships and the benefits of adding more loci to phylogenetic analyses. We assess the phylogenetic utility of nine genes including both traditional genes (e.g., CO1 mtDNA, 28S rDNA) and nuclear protein-coding genes newly developed for phylogenetic analysis. Our phylogenetic findings, based on a limited set of taxa, include: a close relationship between Tachinidae and the calliphorid subfamily Polleninae, monophyly of Tachinidae and the subfamilies Exoristinae and Dexiinae, subfamily groupings of Dexiinae+Phasiinae and Tachininae+Exoristinae, and robust phylogenetic placement of the somewhat enigmatic genera Strongygaster, Euthera, and Ceracia. In contrast to poor resolution and phylogenetic incongruence of "traditional genes," we find that a more selective set of highly informative genes is able to more precisely identify regions of the phylogeny that experienced rapid radiation of lineages, while more accurately depicting their phylogenetic context. 
Although much expanded taxon sampling is necessary to effectively assess the monophyly of and relationships among major tachinid lineages and their relatives, we show that a small number of well-chosen nuclear protein-coding genes can successfully resolve even difficult phylogenetic problems. Copyright © 2015 Elsevier Inc. All rights reserved.
Graf, Daniel L; Jones, Hugh; Geneva, Anthony J; Pfeiffer, John M; Klunzinger, Michael W
2015-04-01
The freshwater mussel family Hyriidae (Mollusca: Bivalvia: Unionida) has a disjunct trans-Pacific distribution in Australasia and South America. Previous phylogenetic analyses have estimated the evolutionary relationships of the family and the major infra-familial taxa (Velesunioninae and Hyriinae: Hyridellini in Australia; Hyriinae: Hyriini, Castaliini, and Rhipidodontini in South America), but taxon and character sampling have been too incomplete to support a predictive classification or allow testing of biogeographical hypotheses. We sampled 30 freshwater mussel individuals representing the aforementioned hyriid taxa, as well as outgroup species representing the five other freshwater mussel families and their marine sister group (order Trigoniida). Our ingroup included representatives of all Australian genera. Phylogenetic relationships were estimated from three gene fragments (nuclear 28S, COI and 16S mtDNA) using maximum parsimony, maximum likelihood, and Bayesian inference, and we applied a Bayesian relaxed clock model calibrated with fossil dates to estimate node ages. Our analyses found good support for monophyly of the Hyriidae and the subfamilies and tribes, as well as the paraphyly of the Australasian taxa (Velesunioninae, (Hyridellini, (Rhipidodontini, (Castaliini, Hyriini)))). The Hyriidae was recovered as sister to a clade comprised of all other Recent freshwater mussel families. Our molecular date estimation supported Cretaceous origins of the major hyriid clades, pre-dating the Tertiary isolation of South America from Antarctica/Australia. We hypothesize that early diversification of the Hyriidae was driven by terrestrial barriers on Gondwana rather than marine barriers following disintegration of the super-continent. Copyright © 2015 Elsevier Inc. All rights reserved.
Ahnia, Hadjira; Bourebaba, Yasmina; Durán, David; Boulila, Farida; Palacios, José M; Rey, Luis; Ruiz-Argüeso, Tomás; Boulila, Abdelghani; Imperial, Juan
2018-04-04
We have characterized genetic, phenotypic and symbiotic properties of bacterial strains previously isolated from nitrogen-fixing nodules of Retama sphaerocarpa from Northern Algeria. Phylogenetic analyses of 16S rRNA genes and three concatenated housekeeping genes, recA, atpD and glnII, placed them in a new divergent group that is proposed to form a new Bradyrhizobium species, Bradyrhizobium algeriense sp. nov. (type strain RST89ᵀ, LMG 27618 and CECT 8363). Based on these phylogenetic markers and on genomic identity data derived from draft genomic sequences, Bradyrhizobium valentinum LmjM3ᵀ, Bradyrhizobium lablabi CCBAU 23086ᵀ, Bradyrhizobium retamae Ro19ᵀ, and Bradyrhizobium jicamae PAC68ᵀ are the closest relatives of B. algeriense RST89ᵀ, with sequence identities of 92-94% and Average Nucleotide Identities (ANIm) under 90%, well below the 95-96% species circumscription threshold. Likewise, a comparison of whole-cell proteomic patterns, estimated by Matrix-Assisted Laser Desorption/Ionization-Time-of-Flight (MALDI-TOF) mass spectrometric analysis, yielded almost identical spectra between B. algeriense strains but significant differences with B. valentinum, Bradyrhizobium paxllaeri, Bradyrhizobium icense, B. lablabi, B. jicamae and B. retamae. A phylogenetic tree based on symbiotic gene nodC revealed that the B. algeriense sequences cluster with sequences from the Bradyrhizobium symbiovar retamae, previously defined with B. retamae strains isolated from Retama monosperma. B. algeriense strains were able to establish effective symbioses with Retama raetam, Lupinus micranthus, Lupinus albus and Genista numidica, but not with Lupinus angustifolius or Glycine max. Copyright © 2018 Elsevier GmbH. All rights reserved.
A taxonomic and phylogenetic re-appraisal of the genus Curvularia
USDA-ARS's Scientific Manuscript database
Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...
Efficient FPT Algorithms for (Strict) Compatibility of Unrooted Phylogenetic Trees.
Baste, Julien; Paul, Christophe; Sau, Ignasi; Scornavacca, Celine
2017-04-01
In phylogenetics, a central problem is to infer the evolutionary relationships between a set of species X; these relationships are often depicted via a phylogenetic tree (a tree having its leaves labeled bijectively by elements of X and without degree-2 nodes) called the "species tree." One common approach for reconstructing a species tree consists in first constructing several phylogenetic trees from primary data (e.g., DNA sequences originating from some species in X), and then constructing a single phylogenetic tree maximizing the "concordance" with the input trees. The obtained tree is our estimation of the species tree and, when the input trees are defined on overlapping (but not identical) sets of labels, is called a "supertree." In this paper, we focus on two problems that are central when combining phylogenetic trees into a supertree: the compatibility and the strict compatibility problems for unrooted phylogenetic trees. These problems are strongly related, respectively, to the notions of "containing as a minor" and "containing as a topological minor" in the graph community. Both problems are known to be fixed parameter tractable in the number of input trees k, by using their expressibility in monadic second-order logic and a reduction to graphs of bounded treewidth. Motivated by the fact that the dependency on k of these algorithms is prohibitively large, we give the first explicit dynamic programming algorithms for solving these problems, both running in time [Formula: see text], where n is the total size of the input.
van Riemsdijk, Isolde; Arntzen, Jan W; Bogaerts, Sergé; Franzen, Michael; Litvinchuk, Spartak N; Olgun, Kurtuluş; Wielstra, Ben
2017-09-01
The banded newt (genus Ommatotriton) is widely distributed in the Near East (Anatolia, Caucasus and the Levant) - an understudied region from the perspective of phylogeography. The genus is polytypic, but the number of species included and the phylogenetic relationships between them are not settled. We sequenced two mitochondrial and two nuclear DNA markers throughout the range of Ommatotriton. For mtDNA we constructed phylogenetic trees, estimated divergence times using fossil calibration, and investigated changes in effective population size with Bayesian skyline plots and mismatch analyses. For nuDNA we constructed phylogenetic trees and haplotype networks. Species trees were constructed for all markers and nuDNA only. Species distribution models were projected on current and Last Glacial Maximum climate layers. We confirm the presence of three Ommatotriton species: O. nesterovi, O. ophryticus and O. vittatus. These species are genetically distinct and their most recent common ancestor was dated at ∼25Ma (Oligocene). No evidence of recent gene flow between species was found. The species show deep intraspecific genetic divergence, represented by geographically structured clades, with crown nodes of species dated ∼8-13Ma (Miocene to Early Quaternary); evidence of long-term in situ evolution and survival in multiple glacial refugia. While a species tree based on nuDNA suggested a sister species relationship between O. vittatus and O. ophryticus, when mtDNA was included, phylogenetic relationships were unresolved, and we refrain from accepting a particular phylogenetic hypothesis at this stage. While species distribution models suggest reduced and fragmented ranges during the Last Glacial Maximum, we found no evidence for strong population bottlenecks. We discuss our results in the light of other phylogeographic studies from the Near East. Our study underlines the important role of the Near East in generating and sustaining biodiversity. 
Copyright © 2017 Elsevier Inc. All rights reserved.
Kriebel, Ricardo; Khabbazian, Mohammad; Sytsma, Kenneth J
2017-01-01
The study of pollen morphology has historically allowed evolutionary biologists to assess phylogenetic relationships among Angiosperms, as well as to better understand the fossil record. During this process, pollen has mainly been studied by discretizing some of its main characteristics such as size, shape, and exine ornamentation. One large plant clade in which pollen has been used this way for phylogenetic inference and character mapping is the order Myrtales, composed of the small families Alzateaceae, Crypteroniaceae, and Penaeaceae (collectively the "CAP clade"), as well as the large families Combretaceae, Lythraceae, Melastomataceae, Myrtaceae, Onagraceae and Vochysiaceae. In this study, we present a novel way to study pollen evolution by using quantitative size and shape variables. We use morphometric and morphospace methods to evaluate pollen change in the order Myrtales using a time-calibrated, supermatrix phylogeny. We then test for conservatism, divergence, and morphological convergence of pollen and for correlation between the latitudinal gradient and pollen size and shape. To obtain an estimate of shape, Myrtales pollen images were extracted from the literature, and their outlines analyzed using elliptic Fourier methods. Shape and size variables were then analyzed in a phylogenetic framework under an Ornstein-Uhlenbeck process to test for shifts in size and shape during the evolutionary history of Myrtales. Few shifts in Myrtales pollen morphology were found, which indicates morphological conservatism. Heterocolpate, small pollen is ancestral, with the largest pollen in Onagraceae. Convergent shifts in shape but not size occurred in Myrtaceae and Onagraceae and are correlated to shifts in latitude and biogeography. A quantitative approach was applied for the first time to examine pollen evolution across a large time scale. Using phylogenetic-based morphometrics and an OU process, hypotheses of pollen size and shape were tested across Myrtales.
Convergent pollen shifts and position in the latitudinal gradient support the selective role of harmomegathy, the mechanism by which pollen grains accommodate their volume in response to water loss. PMID:29211730
Phylogenetic diversity and biodiversity indices on phylogenetic networks.
Wicke, Kristina; Fischer, Mareike
2018-04-01
In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.
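The tree-based quantities that this abstract starts from can be illustrated with a small sketch. It assumes a hypothetical three-taxon rooted tree (the names `TREE`, `LEAVES`, and all branch lengths are invented for illustration) and uses the usual textbook definitions of rooted phylogenetic diversity and the Fair Proportion Index; it is not the NetDiversity implementation and does not cover the network generalizations.

```python
# Hypothetical rooted tree: child -> (parent, branch length); "root" has no entry.
TREE = {
    "u": ("root", 1.0),
    "C": ("root", 3.0),
    "A": ("u", 2.0),
    "B": ("u", 2.0),
}
LEAVES = ["A", "B", "C"]


def path_to_root(node):
    """Return the edges (named by their child node) from node up to the root."""
    edges = []
    while node in TREE:
        edges.append(node)
        node = TREE[node][0]
    return edges


def phylogenetic_diversity(taxa):
    """Rooted PD: total length of the edges on the root-to-taxon paths."""
    edges = set()
    for taxon in taxa:
        edges.update(path_to_root(taxon))
    return sum(TREE[e][1] for e in edges)


def fair_proportion(leaf):
    """Split each edge length equally among the leaves descending from it."""
    total = 0.0
    for edge in path_to_root(leaf):
        n_descendants = sum(1 for x in LEAVES if edge in path_to_root(x))
        total += TREE[edge][1] / n_descendants
    return total


if __name__ == "__main__":
    print(phylogenetic_diversity(LEAVES))
    print({leaf: fair_proportion(leaf) for leaf in LEAVES})
```

On a tree, the Fair Proportion values of all leaves sum to the total phylogenetic diversity, which is a quick sanity check for any implementation.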
Yuko Ota; Mee-Sook Kim; Hitoshi Neda; Ned B. Klopfenstein; Eri Hasegawa
2011-01-01
An undetermined Armillaria species was collected on Amami-Oshima, a subtropical island of Japan. The phylogenetic position of the Armillaria sp. was determined using sequences of the elongation factor-1α (EF-1α) gene and the internal transcribed spacer (ITS) region (ITS1-5.8S-ITS2) of ribosomal DNA (rDNA). The phylogenetic analyses based on EF-1α and ITS sequences...
Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita
2017-06-01
The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21-tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive individual sequence-structure matrix to gain more resolution on the phylogenetic reconstruction. The results showed that ITS sequence-structure is the reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
A novel model for DNA sequence similarity analysis based on graph theory.
Qi, Xingqin; Wu, Qin; Zhang, Yusen; Fuller, Eddie; Zhang, Cun-Quan
2011-01-01
Determination of sequence similarity is one of the major steps in computational phylogenetic studies. During evolutionary history, not only mutations of individual nucleotides but also subsequent rearrangements have occurred. It has been one of the major tasks of computational biologists to develop novel mathematical descriptors for similarity analysis that capture these different kinds of mutation simultaneously. In this paper, in contrast to traditional methods (e.g., nucleotide frequency, geometric representations) as bases for the construction of mathematical descriptors, we construct novel mathematical descriptors based on graph theory. In particular, for each DNA sequence, we set up a weighted directed graph. The adjacency matrix of the directed graph is used to induce a representative vector for the DNA sequence. This new approach measures similarity based on both the ordering and the frequency of nucleotides, so that much more information is involved. As an application, the method is tested on a set of 0.9-kb mtDNA sequences of twelve different primate species. All output phylogenetic trees with various distance estimations have the same topology, and are generally consistent with the reported results from earlier studies, which supports the new method's efficiency. We also test the new method on a simulated data set, which shows that it performs better than the traditional global alignment method when subsequent rearrangements happen frequently during evolutionary history.
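A rough sketch of the general idea (sequence, then weighted directed graph, then adjacency-derived vector, then distance) follows. The abstract does not state how edges are weighted, so the 1/(j - i) weighting, the 4 x 4 nucleotide-level matrix, and the function names `descriptor` and `distance` are all assumptions made for illustration, not the authors' construction.

```python
from math import sqrt

NUCLEOTIDES = "ACGT"


def descriptor(seq):
    """Flattened 4x4 weight matrix of a nucleotide-level directed graph.

    Assumed weighting: every ordered pair of positions i < j adds
    1 / (j - i) to the edge (seq[i] -> seq[j]), so both the order and
    the frequency of nucleotides influence the result.
    """
    index = {c: k for k, c in enumerate(NUCLEOTIDES)}
    matrix = [[0.0] * 4 for _ in range(4)]
    for i, a in enumerate(seq):
        for j in range(i + 1, len(seq)):
            matrix[index[a]][index[seq[j]]] += 1.0 / (j - i)
    return [w for row in matrix for w in row]


def distance(seq1, seq2):
    """Euclidean distance between the two 16-dimensional descriptors."""
    v1, v2 = descriptor(seq1), descriptor(seq2)
    return sqrt(sum((x - y) ** 2 for x, y in zip(v1, v2)))


if __name__ == "__main__":
    print(distance("ACGTACGT", "ACGTTGCA"))
```

The pairwise distances produced this way can be fed to any distance-based tree builder; the quadratic loop is adequate for sequences of roughly the 0.9-kb length mentioned in the abstract.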
On the Shapley Value of Unrooted Phylogenetic Trees.
Wicke, Kristina; Fischer, Mareike
2018-01-17
The Shapley value, a solution concept from cooperative game theory, has recently been considered for both unrooted and rooted phylogenetic trees. Here, we focus on the Shapley value of unrooted trees and first revisit the so-called split counts of a phylogenetic tree and the Shapley transformation matrix that allows for the calculation of the Shapley value from the edge lengths of a tree. We show that non-isomorphic trees may have permutation-equivalent Shapley transformation matrices and permutation-equivalent null spaces. This implies that estimating the split counts associated with a tree or the Shapley values of its leaves does not suffice to reconstruct the correct tree topology. We then turn to the use of the Shapley value as a prioritization criterion in biodiversity conservation and compare it to a greedy solution concept. Here, we show that for certain phylogenetic trees, the Shapley value may fail as a prioritization criterion, meaning that the diversity spanned by the top k species (ranked by their Shapley values) cannot approximate the total diversity of all n species.
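To make the Shapley value on an unrooted tree concrete, here is a brute-force sketch on a hypothetical quartet tree AB|CD. The edge lengths and the names `EDGES`, `pd_unrooted`, and `shapley_values` are invented for illustration. It uses the standard cooperative-game definition directly rather than the split counts or the Shapley transformation matrix discussed in the abstract, so it only scales to a handful of leaves.

```python
from itertools import combinations
from math import factorial

LEAVES = ("A", "B", "C", "D")
# Each edge of the unrooted tree: (leaves on one side of the edge, length).
EDGES = [
    (frozenset("A"), 1.0),
    (frozenset("B"), 2.0),
    (frozenset("C"), 3.0),
    (frozenset("D"), 4.0),
    (frozenset("AB"), 5.0),  # internal edge separating {A, B} from {C, D}
]


def pd_unrooted(taxa):
    """Length of the minimal subtree connecting taxa (0 for fewer than two)."""
    taxa = set(taxa)
    # An edge is used exactly when taxa occur on both of its sides.
    return sum(length for side, length in EDGES if taxa & side and taxa - side)


def shapley_values():
    """Average marginal PD contribution of each leaf over all coalitions."""
    n = len(LEAVES)
    values = {}
    for leaf in LEAVES:
        others = [x for x in LEAVES if x != leaf]
        total = 0.0
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                gain = pd_unrooted(subset + (leaf,)) - pd_unrooted(subset)
                total += weight * gain
        values[leaf] = total
    return values


if __name__ == "__main__":
    print(shapley_values())
```

The values sum to the diversity of the full leaf set, which gives a simple consistency check; ranking leaves by these values is the prioritization criterion that the abstract compares with a greedy selection.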
Phylogenetic Diversity in the Macromolecular Composition of Microalgae
Finkel, Zoe V.; Follows, Mick J.; Liefer, Justin D.; Brown, Chris M.; Benner, Ina; Irwin, Andrew J.
2016-01-01
The elemental stoichiometry of microalgae reflects their underlying macromolecular composition and influences competitive interactions among species and their role in the food web and biogeochemistry. Here we provide a new estimate of the macromolecular composition of microalgae using a hierarchical Bayesian analysis of data compiled from the literature. The median macromolecular composition of nutrient-sufficient exponentially growing microalgae is 32.2% protein, 17.3% lipid, 15.0% carbohydrate, 17.3% ash, 5.7% RNA, 1.1% chlorophyll-a and 1.0% DNA as percent dry weight. Our analysis identifies significant phylogenetic differences in macromolecular composition undetected by previous studies due to small sample sizes and the large inherent variability in macromolecular pools. The phylogenetic differences in macromolecular composition lead to variations in carbon-to-nitrogen ratios that are consistent with independent observations. These phylogenetic differences in macromolecular and elemental composition reflect adaptations in cellular architecture and biochemistry; specifically in the cell wall, the light harvesting apparatus, and storage pools. PMID:27228080
Effectiveness of protected areas for vertebrates based on taxonomic and phylogenetic diversity.
Quan, Qing; Che, Xianli; Wu, Yongjie; Wu, Yuchun; Zhang, Qiang; Zhang, Min; Zou, Fasheng
2018-04-01
Establishing protected areas is the primary goal and tool for preventing irreversible biodiversity loss. However, the effectiveness of protected areas that target specific species has been questioned for some time, because targeting key species for conservation may impair the integral regional pool of species diversity, and because phylogenetic and functional diversity are seldom considered. We assessed the efficacy of protected areas in China for the conservation of phylogenetic diversity based on the ranges and phylogenies of 2279 terrestrial vertebrates. Phylogenetic and taxonomic diversity were strongly and positively correlated, and only 12.1-43.8% of priority conservation areas are currently protected. However, the patterns and coverage of phylogenetic diversity were affected when weighted by species richness. These results indicated that in China, protected areas targeting high species richness protected phylogenetic diversity well overall but failed to do so in some regions with more unique or threatened communities (e.g., coastal areas of eastern China, where severely threatened avian communities were less protected). Our results suggest that the current distribution of protected areas could be improved, although most protected areas protect both taxonomic and phylogenetic diversity. © 2017 Society for Conservation Biology.
Martínez-Aquino, Andrés; Vidal-Martínez, Victor M; Aguirre-Macedo, M Leopoldina
2017-01-01
The phylogenetic position of three taxa from two trematode genera, belonging to the subfamily Acanthostominae (Opisthorchioidea: Cryptogonimidae), were analysed using partial 28S ribosomal DNA (Domains 1–2) and internal transcribed spacers (ITS1–5.8S–ITS2). Bayesian inference and Maximum likelihood analyses of combined 28S rDNA and ITS1 + 5.8S + ITS2 sequences indicated the monophyly of the genus Acanthostomum (A. cf. americanum and A. burminis) and paraphyly of the Acanthostominae. These phylogenetic relationships were consistent in analyses of 28S alone and concatenated 28S + ITS1 + 5.8S + ITS2 sequences analyses. Based on molecular phylogenetic analyses, the subfamily Acanthostominae is therefore a paraphyletic taxon, in contrast with previous classifications based on morphological data. Phylogenetic patterns of host specificity inferred from adult stages of other cryptogonimid taxa are also well supported. However, analyses using additional genera and species are necessary to support the phylogenetic inferences from this study. Our molecular phylogenetic reconstruction linked two larval stages of A. cf. americanum cercariae and metacercariae. Here, we present the evolutionary and ecological implications of parasitic infections in freshwater and brackish environments. PMID:29250471
Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution
Kendall, Michelle; Colijn, Caroline
2016-01-01
Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Key words: phylogenetics, evolution, tree metrics, genetics, sequencing. PMID:27343287
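The abstract does not spell out how the metric is built, so the following is only a sketch of one construction of this kind, written under stated assumptions: each rooted tree is turned into a vector holding, for every pair of tips, the depth of their most recent common ancestor (blended between edge counts and branch lengths by a parameter `lam`), followed by one pendant-edge entry per tip, and trees are compared by Euclidean distance. The trees `T1` and `T2` and all function names are hypothetical.

```python
from itertools import combinations
from math import sqrt

# Hypothetical rooted trees: child -> (parent, branch length).
T1 = {"u": ("root", 1.0), "A": ("u", 1.0), "B": ("u", 1.0), "C": ("root", 2.0)}
T2 = {"u": ("root", 1.0), "A": ("u", 1.0), "C": ("u", 1.0), "B": ("root", 2.0)}
TIPS = ["A", "B", "C"]


def ancestors(tree, node):
    """Return [node, parent, ..., root]."""
    chain = [node]
    while node in tree:
        node = tree[node][0]
        chain.append(node)
    return chain


def depth(tree, node):
    """Return (edge count, summed branch length) from the root to node."""
    edges, length = 0, 0.0
    while node in tree:
        edges += 1
        length += tree[node][1]
        node = tree[node][0]
    return edges, length


def tree_vector(tree, tips, lam):
    """Pairwise MRCA depths followed by pendant-edge entries."""
    vec = []
    for a, b in combinations(sorted(tips), 2):
        anc_b = set(ancestors(tree, b))
        mrca = next(x for x in ancestors(tree, a) if x in anc_b)
        edges, length = depth(tree, mrca)
        vec.append((1 - lam) * edges + lam * length)
    for tip in sorted(tips):
        vec.append((1 - lam) * 1 + lam * tree[tip][1])
    return vec


def tree_distance(t1, t2, tips, lam=0.0):
    """Euclidean distance between the two tree vectors."""
    v1, v2 = tree_vector(t1, tips, lam), tree_vector(t2, tips, lam)
    return sqrt(sum((x - y) ** 2 for x, y in zip(v1, v2)))


if __name__ == "__main__":
    print(tree_distance(T1, T2, TIPS, lam=0.0))
    print(tree_distance(T1, T2, TIPS, lam=1.0))
```

With `lam=0.0` only topology matters, and with `lam=1.0` only branch lengths enter. Pairwise distances computed this way over a set of trees can then be clustered or projected to look for distinct groups of alternative trees.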
USDA-ARS's Scientific Manuscript database
An extensive phylogenetic analysis and genus-level taxonomic revision of Paranoplocephala Lühe, 1910 -like cestodes (Cyclophyllidea, Anoplocephalidae) are presented. The phylogenetic analysis is based on DNA sequences of two partial mitochondrial genes, i.e. cytochrome c oxidase subunit 1 (cox1) and...
Callejón, Rocío; Robles, María Del Rosario; Panei, Carlos Javier; Cutillas, Cristina
2016-08-01
A molecular phylogenetic hypothesis is presented for the genus Trichuris based on sequence data from mitochondrial cytochrome c oxidase 1 (cox1) and cytochrome b (cob). The taxa consisted of nine populations of whipworm from five species of Sigmodontinae rodents from Argentina. Bayesian Inference, Maximum Parsimony, and Maximum Likelihood methods were used to infer phylogenies for each gene separately but also for the combined mitochondrial data and the combined mitochondrial and nuclear dataset. Phylogenetic results based on cox1 and cob mitochondrial DNA (mtDNA) revealed three strongly resolved clades corresponding to three different species (Trichuris navonae, Trichuris bainae, and Trichuris pardinasi) showing phylogeographic variation, but relationships among Trichuris species were poorly resolved. Phylogenetic reconstruction based on concatenated sequences had greater phylogenetic resolution for delimiting species and intraspecific populations of Trichuris than those based on partitioned genes. Thus, populations of T. bainae and T. pardinasi could be affected by geographical factors and parasite-host co-divergence.
Hsieh, Chia-Hung; Ko, Chiun-Cheng; Chung, Cheng-Han; Wang, Hurng-Yi
2014-07-01
The sweet potato whitefly, Bemisia tabaci, is a highly differentiated species complex. Despite consisting of several morphologically indistinguishable entities and frequent invasions on all continents with important associated economic losses, the phylogenetic relationships, species status, and evolutionary history of this species complex are still debated. We sequenced and analyzed one mitochondrial and three single-copy nuclear genes from 9 of the 12 genetic groups of B. tabaci and 5 closely related species. Bayesian species delimitation was applied to investigate the speciation events of B. tabaci. The species statuses of the different genetic groups were strongly supported under different prior settings and phylogenetic scenarios. Divergence histories were estimated by a multispecies coalescence approach implemented in *BEAST. Based on the mitochondrial locus, B. tabaci originated 6.47 million years ago (MYA). Nevertheless, the time was 1.25 MYA based on nuclear loci. According to the method of approximate Bayesian computation, this difference is probably due to different degrees of migration among loci; i.e., although the mitochondrial locus had differentiated, gene flow at nuclear loci was still possible, a scenario similar to a parapatric mode of speciation. This is the first study in whiteflies using multilocus data and incorporating Bayesian coalescence approaches, both of which provide a more biologically realistic framework for delimiting species status and delineating the divergence history of B. tabaci. Our study illustrates that gene flow during species divergence should not be overlooked and has a great impact on divergence time estimation. Copyright © 2014 Elsevier Inc. All rights reserved.
Carotenuto, Francesco; Diniz-Filho, José Alexandre F.
2016-01-01
Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes—speciation, extinction and dispersal—in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. PMID:26977061
Estimating phylogenetic trees from genome-scale data.
Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V
2015-12-01
The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.
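The link between rapid radiation and gene tree heterogeneity can be illustrated with the standard three-taxon result from coalescent theory, which is not stated in the abstract itself: for a species tree ((A,B),C) whose internal branch has length t in coalescent units, the matching gene tree topology has probability 1 - (2/3)e^(-t) and each of the two discordant topologies has probability (1/3)e^(-t). The function name below is hypothetical.

```python
from math import exp


def gene_tree_probabilities(t):
    """Gene tree topology probabilities for species tree ((A,B),C).

    t is the internal branch length in coalescent units; with probability
    exp(-t) the A and B lineages fail to coalesce on that branch, and the
    three possible topologies are then equally likely.
    """
    discordant = exp(-t) / 3.0
    return {
        "((A,B),C)": 1.0 - 2.0 * discordant,
        "((A,C),B)": discordant,
        "((B,C),A)": discordant,
    }


if __name__ == "__main__":
    for t in (0.0, 0.1, 1.0, 5.0):
        print(t, gene_tree_probabilities(t))
```

As t approaches zero, all three topologies approach probability 1/3, which is why short internal branches from rapid radiations produce many conflicting gene trees from a single species tree.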
Villalobos, Fabricio; Carotenuto, Francesco; Raia, Pasquale; Diniz-Filho, José Alexandre F
2016-04-05
Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes--speciation, extinction and dispersal--in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. © 2016 The Author(s).
Lambert, Shea M; Reeder, Tod W; Wiens, John J
2015-01-01
Simulation studies suggest that coalescent-based species-tree methods are generally more accurate than concatenated analyses. However, these species-tree methods remain impractical for many large datasets. Thus, a critical but unresolved issue is when and why concatenated and coalescent species-tree estimates will differ. We predict such differences for branches in concatenated trees that are short, weakly supported, and have conflicting gene trees. We test these predictions in Scincidae, the largest lizard family, with data from 10 nuclear genes for 17 ingroup taxa and 44 genes for 12 taxa. We support our initial predictions, and suggest that simply considering uncertainty in concatenated trees may sometimes encompass the differences between these methods. We also found that relaxed-clock concatenated trees can be surprisingly similar to the species-tree estimate. Remarkably, the coalescent species-tree estimates had slightly lower support values when based on many more genes (44 vs. 10) and a small (∼30%) reduction in taxon sampling. Thus, taxon sampling may be more important than gene sampling when applying species-tree methods to deep phylogenetic questions. Finally, our coalescent species-tree estimates tentatively support division of Scincidae into three monophyletic subfamilies, a result otherwise found only in concatenated analyses with extensive species sampling. Copyright © 2014 Elsevier Inc. All rights reserved.
Molecular survey of basidiomycetes and divergence time estimation: An Indian perspective
Bhatt, Meghna; Mistri, Pankti; Joshi, Ishita; Ram, Hemal; Raval, Rinni; Thoota, Sruthi; Patel, Ankur; Raval, Dhrupa; Bhargava, Poonam; Soni, Subhash; Bagatharia, Snehal
2018-01-01
This study outlines the biodiversity of mushrooms of India. It reveals the molecular biodiversity and divergence time estimation of basidiomycetes from Gujarat, India. A total of 267 mushrooms were collected from 10 locations across the state. 225 ITS sequences were generated belonging to 105 species, 59 genera and 29 families. Phylogenetic analysis of Agaricaceae reveals a monophyletic clade of Podaxis differentiating it from Coprinus. Further, the ancient nature of Podaxis supports the hypothesis that gasteroid forms evolved from secotioid forms. Members of Polyporaceae appeared polyphyletic. Further, our results of a close phylogenetic relationship between Trametes and Lenzites lead us to propose that the genus Trametes may be enlarged to include Lenzites. The tricholomatoid clade shows a clear demarcation for Entolomataceae. However, Lyophyllaceae and Tricholomataceae could not be distinguished clearly. Distribution studies of the mushrooms showed omnipresence of Ganoderma and Schizophyllum. Further, divergence time estimation shows that Dacrymycetes evolved in the Neoproterozoic Era and Hymenochaetales diverged from Agaricomycetes during the Silurian period. PMID:29771956
Reversible polymorphism-aware phylogenetic models and their application to tree inference.
Schrempf, Dominik; Minh, Bui Quang; De Maio, Nicola; von Haeseler, Arndt; Kosiol, Carolin
2016-10-21
We present a reversible Polymorphism-Aware Phylogenetic Model (revPoMo) for species tree estimation from genome-wide data. revPoMo enables the reconstruction of large scale species trees for many within-species samples. It expands the alphabet of DNA substitution models to include polymorphic states, thereby naturally accounting for incomplete lineage sorting. We implemented revPoMo in the maximum likelihood software IQ-TREE. A simulation study and an application to great apes data show that the runtimes of our approach and standard substitution models are comparable but that revPoMo has much better accuracy in estimating trees, divergence times and mutation rates. The advantage of revPoMo is that an increase of sample size per species improves estimations but does not increase runtime. Therefore, revPoMo is a valuable tool with several applications, from speciation dating to species tree reconstruction. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
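To make the idea of an expanded alphabet concrete, the following is a minimal sketch that enumerates the states of a PoMo-style model. It assumes that polymorphic states are biallelic and that allele counts are tracked in a virtual population of size `n`; the function name `pomo_states` and the parameter `n` are hypothetical and chosen for illustration only. It is not the IQ-TREE implementation.

```python
from itertools import combinations

NUCLEOTIDES = "ACGT"

def pomo_states(n):
    """Enumerate states for an assumed virtual population of size n."""
    # Monomorphic (fixed) states: all n individuals carry the same base.
    states = [((base, n),) for base in NUCLEOTIDES]
    # Biallelic polymorphic states: i copies of one base, n - i of the other.
    for a, b in combinations(NUCLEOTIDES, 2):
        for i in range(1, n):
            states.append(((a, i), (b, n - i)))
    return states

states = pomo_states(10)
print(len(states))   # 4 fixed states plus 6 * (n - 1) polymorphic states
print(states[4])     # first polymorphic state: 1 copy of A, 9 copies of C
```

Under these assumptions the alphabet grows linearly with `n`, which is consistent with the abstract's point that polymorphism is handled inside the substitution model itself.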
Barkman, Todd J.; Chenery, Gordon; McNeal, Joel R.; Lyons-Weiler, James; Ellisens, Wayne J.; Moore, Gerry; Wolfe, Andrea D.; dePamphilis, Claude W.
2000-01-01
Plant phylogenetic estimates are most likely to be reliable when congruent evidence is obtained independently from the mitochondrial, plastid, and nuclear genomes with all methods of analysis. Here, results are presented from separate and combined genomic analyses of new and previously published data, including six and nine genes (8,911 bp and 12,010 bp, respectively) for different subsets of taxa that suggest Amborella + Nymphaeales (water lilies) are the first-branching angiosperm lineage. Before and after tree-independent noise reduction, most individual genomic compartments and methods of analysis estimated the Amborella + Nymphaeales basal topology with high support. Previous phylogenetic estimates placing Amborella alone as the first extant angiosperm branch may have been misled because of a series of specific problems with paralogy, suboptimal outgroups, long-branch taxa, and method dependence. Ancestral character state reconstructions differ between the two topologies and affect inferences about the features of early angiosperms. PMID:11069280
Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree
NASA Astrophysics Data System (ADS)
Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.
2017-02-01
Representation is a very important tool for communicating scientific concepts. Biologists produce phylogenetic representations to express their understanding of evolutionary relationships. The phylogenetic tree is a visual representation that depicts a hypothesis about evolutionary relationships and is widely used in the biological sciences. The use of phylogenetic trees is currently growing in many disciplines of biology. Consequently, learning about phylogenetic trees has become an important part of biological education and an interesting area for biology education research. However, research has shown that many students struggle with interpreting the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The study used a descriptive method. We used questionnaires, interviews, multiple choice and open-ended questions, reflective journals and observations. The findings showed that students experienced difficulties, especially in constructing a phylogenetic tree. The students’ responses indicated that the main reason for difficulties in constructing a phylogenetic tree is the difficulty of placing taxa in the tree based on the data provided, so that the constructed tree does not describe the actual evolutionary relationship (incorrect relatedness). Students also have difficulties in determining the sister group, synapomorphic characters and autapomorphies from the data provided (character table) and in comparing phylogenetic trees. According to the students, building a phylogenetic tree is more difficult than reading one. The findings of this study provide information to help undergraduate instructors and students overcome learning difficulties in reading and constructing phylogenetic trees.
Dutra Vieira, Thainá; Pegoraro de Macedo, Marcia Raquel; Fedatto Bernardon, Fabiana; Müller, Gertrud
2017-10-01
The nematode Diplotriaena bargusinica is a bird air sac parasite, and its taxonomy is based mainly on morphological and morphometric characteristics. Increasing knowledge of genetic information variability has spurred the use of DNA markers in conjunction with morphological data for inferring phylogenetic relationships in different taxa. Considering the potential of molecular biology in taxonomy, this study presents the morphological and molecular characterization of D. bargusinica, and establishes the phylogenetic position of the nematode in Spirurina. Twenty partial sequences of the 18S region of D. bargusinica rDNA were generated. Phylogenetic trees were obtained through the Maximum Likelihood and Bayesian Inference methods where both had similar topology. The group Diplotriaenoidea is monophyletic and the topologies generated corroborate the phylogenetic studies based on traditional and previously performed molecular taxonomy. This study is the first to generate molecular data associated with the morphology of the species. Copyright © 2017 Elsevier B.V. All rights reserved.
Moro, Marcelo Freire; Silva, Igor Aurélio; de Araújo, Francisca Soares; Nic Lughadha, Eimear; Meagher, Thomas R.; Martins, Fernando Roberto
2015-01-01
Seasonally dry tropical plant formations (SDTF) are likely to exhibit phylogenetic clustering owing to niche conservatism driven by a strong environmental filter (water stress), but heterogeneous edaphic environments and life histories may result in heterogeneity in degree of phylogenetic clustering. We investigated phylogenetic patterns across ecological gradients related to water availability (edaphic environment and climate) in the Caatinga, a SDTF in Brazil. Caatinga is characterized by semiarid climate and three distinct edaphic environments – sedimentary, crystalline, and inselberg –representing a decreasing gradient in soil water availability. We used two measures of phylogenetic diversity: Net Relatedness Index based on the entire phylogeny among species present in a site, reflecting long-term diversification; and Nearest Taxon Index based on the tips of the phylogeny, reflecting more recent diversification. We also evaluated woody species in contrast to herbaceous species. The main climatic variable influencing phylogenetic pattern was precipitation in the driest quarter, particularly for herbaceous species, suggesting that environmental filtering related to minimal periods of precipitation is an important driver of Caatinga biodiversity, as one might expect for a SDTF. Woody species tended to show phylogenetic clustering whereas herbaceous species tended towards phylogenetic overdispersion. We also found phylogenetic clustering in two edaphic environments (sedimentary and crystalline) in contrast to phylogenetic overdispersion in the third (inselberg). We conclude that while niche conservatism is evident in phylogenetic clustering in the Caatinga, this is not a universal pattern likely due to heterogeneity in the degree of realized environmental filtering across edaphic environments. Thus, SDTF, in spite of a strong shared environmental filter, are potentially heterogeneous in phylogenetic structuring. 
Our results support the need for scientifically informed conservation strategies in the Caatinga and other SDTF regions that have not previously been prioritized for conservation in order to take into account this heterogeneity. PMID:25798584
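The two indices named in this abstract can be sketched as standardized effect sizes of mean pairwise distance (for the Net Relatedness Index) and mean nearest-taxon distance (for the Nearest Taxon Index). The example below is a minimal illustration under stated assumptions: the six-taxon pool and its distances are toy values, and the null model (random draws of equally many taxa from the pool) is only one of several possible choices. It does not reproduce the study's data or software.

```python
import random
from itertools import combinations
from statistics import mean, stdev

def mpd(dist, taxa):
    """Mean pairwise phylogenetic distance among co-occurring taxa."""
    return mean(dist[frozenset(pair)] for pair in combinations(taxa, 2))

def mntd(dist, taxa):
    """Mean distance from each taxon to its nearest co-occurring relative."""
    return mean(min(dist[frozenset((t, u))] for u in taxa if u != t) for t in taxa)

def standardized_index(metric, dist, pool, community, n_null=999, seed=0):
    """Return -(observed - null mean) / null sd; positive values mean clustering."""
    rng = random.Random(seed)
    observed = metric(dist, community)
    null = [metric(dist, rng.sample(pool, len(community))) for _ in range(n_null)]
    return -(observed - mean(null)) / stdev(null)

# Toy pool (assumed): two clades of three taxa each.
clade = {"A": 1, "B": 1, "C": 1, "D": 2, "E": 2, "F": 2}
pool = sorted(clade)
dist = {frozenset((a, b)): (2.0 if clade[a] == clade[b] else 10.0)
        for a, b in combinations(pool, 2)}

nri_clustered = standardized_index(mpd, dist, pool, ["A", "B", "C"])
nti_clustered = standardized_index(mntd, dist, pool, ["A", "B", "C"])
nri_mixed = standardized_index(mpd, dist, pool, ["A", "B", "D"])
print(nri_clustered > 0, nti_clustered > 0, nri_mixed < 0)
```

A site drawn from a single clade scores positive (clustering), whereas a site mixing both clades scores negative (overdispersion), which mirrors the contrast the abstract reports between edaphic environments.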
Yang, Chien-Hui; Bracken-Grissom, Heather; Kim, Dohyup; Crandall, Keith A; Chan, Tin-Yam
2012-01-01
The slipper lobsters belong to the family Scyllaridae which contains a total of 20 genera and 89 species distributed across four subfamilies (Arctidinae, Ibacinae, Scyllarinae, and Theninae). We have collected nucleotide sequence data from regions of five different genes (16S, 18S, COI, 28S, H3) to estimate phylogenetic relationships among 54 species from the Scyllaridae with a focus on the species rich subfamily Scyllarinae. We have included in our analyses at least one representative from all 20 genera in the Scyllaridae and 35 of the 52 species within the Scyllarinae. Our resulting phylogenetic estimate shows the subfamilies are monophyletic, except for Ibacinae, which has paraphyletic relationships among genera. Many of the genera within the Scyllarinae form non-monophyletic groups, while the genera from all other subfamilies form well supported clades. We discuss the implications of this history on the evolution of morphological characters and ecological transitions (nearshore vs. offshore) within the slipper lobsters. Finally, we identify, through ancestral state character reconstructions, key morphological features diagnostic of the major clades of diversity within the Scyllaridae and relate this character evolution to current taxonomy and classification. Copyright © 2011 Elsevier Inc. All rights reserved.
AN ADAPTIVE RADIATION OF FROGS IN A SOUTHEAST ASIAN ISLAND ARCHIPELAGO
Blackburn, David C; Siler, Cameron D; Diesmos, Arvin C; McGuire, Jimmy A; Cannatella, David C; Brown, Rafe M
2013-01-01
Living amphibians exhibit a diversity of ecologies, life histories, and species-rich lineages that offers opportunities for studies of adaptive radiation. We characterize a diverse clade of frogs (Kaloula, Microhylidae) in the Philippine island archipelago as an example of an adaptive radiation into three primary habitat specialists or ecotypes. We use a novel phylogenetic estimate for this clade to evaluate the tempo of lineage accumulation and morphological diversification. Because species-level phylogenetic estimates for Philippine Kaloula are lacking, we employ dense population sampling to determine the appropriate evolutionary lineages for diversification analyses. We explicitly take phylogenetic uncertainty into account when calculating diversification and disparification statistics and fitting models of diversification. Following dispersal to the Philippines from Southeast Asia, Kaloula radiated rapidly into several well-supported clades. Morphological variation within Kaloula is partly explained by ecotype and accumulated at high levels during this radiation, including within ecotypes. We pinpoint an axis of morphospace related directly to climbing and digging behaviors and find patterns of phenotypic evolution suggestive of ecological opportunity with partitioning into distinct habitat specialists. We conclude by discussing the components of phenotypic diversity that are likely important in amphibian adaptive radiations. PMID:24033172
Wang, Ya; Gao, Bo Liang; Li, Xi Xi; Zhang, Zhi Bin; Yan, Ri Ming; Yang, Hui Lin; Zhu, Du
2015-11-01
The biodiversity of plant endophytic fungi is enormous. Numerous competent endophytic fungi are capable of providing different forms of fitness benefits to host plants and can also produce a wide array of bioactive natural products, which makes them a largely unexplored source of novel compounds with potential bioactivity. In this study, we provide first insights into the diversity of culturable endophytic fungi in Dongxiang wild rice (Oryza rufipogon Griff.) from China using rDNA-ITS phylogenetic analysis. Here, the potential of fungi in producing bioactive natural products was estimated based on the beta-ketosynthase detected in the polyketide synthase (PKS) gene cluster and on the bioassay of antagonistic activity against two rice phytopathogens Thanatephorus cucumeris and Xanthomonas oryzae. A total of 229 endophytic fungal strains were validated in 19 genera. Among the 24 representative strains, 13 strains displayed antagonistic activity against the phytopathogens. Furthermore, PKS genes were detected in 9 strains, indicating their potential for synthesising PKS compounds. Our study confirms the phylogenetic diversity of endophytic fungi in O. rufipogon G. and highlights that endophytic fungi are not only promising resources of biocontrol agents against phytopathogens of rice plants, but also of bioactive natural products and defensive secondary metabolites. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Banker, Sarah E; Wade, Elizabeth J; Simon, Chris
2017-11-01
Phylogenetic studies of multiple independently inherited nuclear genes considered in combination with patterns of inheritance of organelle DNA have provided considerable insight into the history of species evolution. In particular, investigations of cicadas in the New Zealand genus Kikihia have identified interesting cases where mitochondrial DNA (mtDNA) crosses species boundaries in some species pairs but not others. Previous phylogenetic studies focusing on mtDNA largely corroborated Kikihia species groups identified by song, morphology and ecology with the exception of a unique South Island mitochondrial haplotype clade, the Westlandica group. This newly identified group consists of diverse taxa previously classified as belonging to three different sub-generic clades. We sequenced five nuclear loci from multiple individuals from every species of Kikihia to assess the nuclear gene concordance for this newly-identified mtDNA lineage. Bayes Factor analysis of the constrained phylogeny suggests some support for the mtDNA-based hypotheses, despite the fact that neither concatenation nor multiple species tree methods resolve the Westlandica group as monophyletic. The nuclear analyses suggest a geographic distinction between clearly defined monophyletic North Island clades and unresolved South Island clades. We suggest that more extreme habitat modification on South Island during the Pliocene and Pleistocene resulted in secondary contact and hybridization between species pairs and a series of mitochondrial capture events followed by subsequent lineage evolution. Copyright © 2017 Elsevier Inc. All rights reserved.
Morcillo, Felipe; Ornelas-García, Claudia Patricia; Alcaraz, Lourdes; Matamoros, Wilfredo A; Doadrio, Ignacio
2016-01-01
Freshwater fishes of Profundulidae, which until now was composed of two subgenera, represent one of the few extant fish families endemic to Mesoamerica. In this study we investigated the phylogenetic relationships and evolutionary history of the eight recognized extant species (from 37 populations) of Profundulidae using three mitochondrial and one nuclear gene markers (∼2.9 Kbp). We applied a Bayesian species delimitation method as a first approach to resolving speciation patterns within Profundulidae considering two different scenarios, eight-species and twelve-species models, obtained in a previous phylogenetic analysis. Based on our results, each of the two subgenera was resolved as monophyletic, with a remarkable molecular divergence of 24.5% for mtDNA and 7.8% for nDNA uncorrected p distances, and thus we propose that they correspond to separate genera. Moreover, we propose a conservative taxonomic hypothesis with five species within Profundulus and three within Tlaloc, although both eight-species and twelve-species models were highly supported by the Bayesian species delimitation analysis, providing additional evidence of higher taxonomic diversity than currently recognized in this family. According to our divergence time estimates, the family originated during the Upper Oligocene 26 Mya, and Profundulus and Tlaloc diverged in the Upper Oligocene or Lower Miocene about 20 Mya. Copyright © 2015 Elsevier Inc. All rights reserved.
Advancing data reuse in phyloinformatics using an ontology-driven Semantic Web approach.
Panahiazar, Maryam; Sheth, Amit P; Ranabahu, Ajith; Vos, Rutger A; Leebens-Mack, Jim
2013-01-01
Phylogenetic analyses can resolve historical relationships among genes, organisms or higher taxa. Understanding such relationships can elucidate a wide range of biological phenomena, including, for example, the importance of gene and genome duplications in the evolution of gene function, the role of adaptation as a driver of diversification, or the evolutionary consequences of biogeographic shifts. Phyloinformaticists are developing data standards, databases and communication protocols (e.g. Application Programming Interfaces, APIs) to extend the accessibility of gene trees, species trees, and the metadata necessary to interpret these trees, thus enabling researchers across the life sciences to reuse phylogenetic knowledge. Specifically, Semantic Web technologies are being developed to make phylogenetic knowledge interpretable by web agents, thereby enabling intelligently automated, high-throughput reuse of results generated by phylogenetic research. This manuscript describes an ontology-driven, semantic problem-solving environment for phylogenetic analyses and introduces artefacts that can promote phyloinformatic efforts to promote accessibility of trees and underlying metadata. PhylOnt is an extensible ontology with concepts describing tree types and tree building methodologies including estimation methods, models and programs. In addition we present the PhylAnt platform for annotating scientific articles and NeXML files with PhylOnt concepts. The novelty of this work is the annotation of NeXML files and phylogenetic related documents with PhylOnt Ontology. This approach advances data reuse in phyloinformatics.
Molecular Phylogenetics: Concepts for a Newcomer.
Ajawatanawong, Pravech
Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.
Extending the BEAGLE library to a multi-FPGA platform
2013-01-01
Background: Maximum Likelihood (ML)-based phylogenetic inference using Felsenstein’s pruning algorithm is a standard method for estimating the evolutionary relationships amongst a set of species based on DNA sequence data, and is used in popular applications such as RAxML, PHYLIP, GARLI, BEAST, and MrBayes. The Phylogenetic Likelihood Function (PLF) and its associated scaling and normalization steps comprise the computational kernel for these tools. These computations are data intensive but contain fine grain parallelism that can be exploited by coprocessor architectures such as FPGAs and GPUs. A general purpose API called BEAGLE has recently been developed that includes optimized implementations of Felsenstein’s pruning algorithm for various data parallel architectures. In this paper, we extend the BEAGLE API to a multiple Field Programmable Gate Array (FPGA)-based platform called the Convey HC-1. Results: The core calculation of our implementation, which includes both the phylogenetic likelihood function (PLF) and the tree likelihood calculation, has an arithmetic intensity of 130 floating-point operations per 64 bytes of I/O, or 2.03 ops/byte. Its performance can thus be calculated as a function of the host platform’s peak memory bandwidth and the implementation’s memory efficiency, as 2.03 × peak bandwidth × memory efficiency. Our FPGA-based platform has a peak bandwidth of 76.8 GB/s and our implementation achieves a memory efficiency of approximately 50%, which gives an average throughput of 78 Gflops. This represents a ~40X speedup when compared with BEAGLE’s CPU implementation on a dual Xeon 5520 and 3X speedup versus BEAGLE’s GPU implementation on a Tesla T10 GPU for very large data sizes. The power consumption is 92 W, yielding a power efficiency of 1.7 Gflops per Watt. 
Conclusions: The use of data parallel architectures to achieve high performance for likelihood-based phylogenetic inference requires high memory bandwidth and a design methodology that emphasizes high memory efficiency. To achieve this objective, we integrated 32 pipelined processing elements (PEs) across four FPGAs. For the design of each PE, we developed a specialized synthesis tool to generate a floating-point pipeline with resource and throughput constraints to match the target platform. We have found that using low-latency floating-point operators can significantly reduce FPGA area and still meet timing requirements on the target platform. We found that this design methodology can achieve performance that exceeds that of a GPU-based coprocessor. PMID:23331707
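The bandwidth-bound performance model stated in the Results can be checked with a few lines of arithmetic. The sketch below simply evaluates arithmetic intensity × peak bandwidth × memory efficiency with the figures quoted in the abstract; the function and variable names are hypothetical and chosen for illustration.

```python
def arithmetic_intensity(flops, bytes_moved):
    """Floating-point operations performed per byte of I/O."""
    return flops / bytes_moved

def predicted_gflops(intensity, peak_bandwidth_gb_s, memory_efficiency):
    """Throughput of a memory-bound kernel: intensity x bandwidth x efficiency."""
    return intensity * peak_bandwidth_gb_s * memory_efficiency

# Figures quoted in the abstract.
intensity = arithmetic_intensity(130, 64)              # ops per byte
throughput = predicted_gflops(intensity, 76.8, 0.50)   # Gflops

print(round(intensity, 2))    # 2.03
print(round(throughput, 1))   # 78.0
```

Because the kernel is limited by memory traffic in this model, doubling the memory efficiency would double the predicted throughput, whereas adding arithmetic units alone would not change it.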
YBYRÁ facilitates comparison of large phylogenetic trees.
Machado, Denis Jacob
2015-07-01
The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, especially for large data sets. The programs that comprise this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .
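A topological distance based on shared clades, as in item (1), can be illustrated with a short sketch. The example assumes rooted trees written as nested tuples of leaf names and counts the non-trivial clades that two trees share or do not share. The representation and the function names are hypothetical; this is neither YBYRÁ's code nor the matching split distance computed by MSdist.

```python
def clades(tree):
    """Return (leaf set, set of clades) for a rooted tree given as nested tuples."""
    if isinstance(tree, str):
        return frozenset([tree]), set()   # a leaf defines no non-trivial clade
    leaves, found = frozenset(), set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found |= child_clades
    found.add(leaves)                     # the clade subtended by this node
    return leaves, found

def clade_comparison(tree_a, tree_b):
    """Return (unshared, shared) counts of non-trivial clades for two trees."""
    leaves_a, clades_a = clades(tree_a)
    leaves_b, clades_b = clades(tree_b)
    if leaves_a != leaves_b:
        raise ValueError("trees must have the same terminals")
    clades_a.discard(leaves_a)            # the root clade is present in every tree
    clades_b.discard(leaves_b)
    return len(clades_a ^ clades_b), len(clades_a & clades_b)

t1 = ((("A", "B"), "C"), ("D", "E"))
t2 = ((("A", "C"), "B"), ("D", "E"))
print(clade_comparison(t1, t2))
```

Here the two trees disagree only on whether A groups with B or with C, so two clades are unshared ({A, B} and {A, C}) and two are shared ({A, B, C} and {D, E}).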
Toussaint, Emmanuel F A; Sagata, Katayo; Surbakti, Suriani; Hendrich, Lars; Balke, Michael
2013-01-01
The Australasian archipelago is biologically extremely diverse as a result of a highly puzzling geological and biological evolution. Unveiling the underlying mechanisms has never been more attainable as molecular phylogenetic and geological methods improve, and has become a research priority considering increasing human-mediated loss of biodiversity. However, studies of finer scaled evolutionary patterns remain rare particularly for megadiverse Melanesian biota. While oceanic islands have received some attention in the region, likewise insular mountain blocks that serve as species pumps remain understudied, even though Australasia, for example, features some of the most spectacular tropical alpine habitats in the World. Here, we sequenced almost 2 kb of mitochondrial DNA from the widespread diving beetle Rhantus suturalis from across Australasia and the Indomalayan Archipelago, including remote New Guinean highlands. Based on expert taxonomy with a multigene phylogenetic backbone study, and combining molecular phylogenetics, phylogeography, divergence time estimation, and historical demography, we recover comparably low geographic signal, but complex phylogenetic relationships and population structure within R. suturalis. Four narrowly endemic New Guinea highland species are subordinated and two populations (New Guinea, New Zealand) seem to constitute cases of ongoing speciation. We reveal repeated colonization of remote mountain chains where haplotypes out of a core clade of very widespread haplotypes syntopically might occur with well-isolated ones. These results are corroborated by a Pleistocene origin approximately 2.4 Ma ago, followed by a sudden demographic expansion 600,000 years ago that may have been initiated through climatic adaptations. 
This study is a snapshot of the early stages of lineage diversification by peripatric speciation in Australasia, and supports New Guinea sky islands as cradles of evolution, in line with geological evidence suggesting very recent origin of high altitudes in the region. PMID:23610642
Nemati, Sara; Fazaeli, Asghar; Hajjaran, Homa; Khamesipour, Ali; Anbaran, Mohsen Falahati; Bozorgomid, Arezoo; Zarei, Fatah
2017-08-01
Despite the broad distribution of leishmaniasis among Iranians and animals across the country, little is known about the genetic characteristics of the causative agents. Applying both HSP70 PCR-RFLP and sequence analyses, this study aimed to evaluate the genetic diversity and phylogenetic relationships among Leishmania spp. isolated from Iranian endemic foci and available reference strains. A total of 36 Leishmania isolates from almost all districts across the country were genetically analyzed for the HSP70 gene using both PCR-RFLP and sequence analysis. The original HSP70 gene sequences were aligned along with homologous Leishmania sequences retrieved from NCBI, and subjected to the phylogenetic analysis. Basic parameters of genetic diversity were also estimated. The HSP70 PCR-RFLP presented 3 different electrophoretic patterns, with no further intraspecific variation, corresponding to 3 Leishmania species available in the country, L. tropica, L. major, and L. infantum. Phylogenetic analyses presented 5 major clades, corresponding to 5 species complexes. Iranian lineages, including L. major, L. tropica, and L. infantum, were distributed among 3 complexes L. major, L. tropica, and L. donovani. However, within the L. major and L. donovani species complexes, the HSP70 phylogeny was not able to distinguish clearly between the L. major and L. turanica isolates, and between the L. infantum, L. donovani, and L. chagasi isolates, respectively. Our results indicated that both HSP70 PCR-RFLP and sequence analyses are medically applicable tools for identification of Leishmania species in Iranian patients. However, the reduced genetic diversity of the target gene makes it inevitable that its phylogeny only resolves the major groups, namely, the species complexes.
Campbell, Matthew A; Alfaro, Michael E; Belasco, Max; López, J Andrés
2017-01-01
Phylogenetic inference based on evidence from DNA sequences has led to significant strides in the development of a stable and robustly supported framework for the vertebrate tree of life. To date, the bulk of those advances have relied on sequence data from a small number of genome regions that have proven unable to produce satisfactory answers to consistently recalcitrant phylogenetic questions. Here, we re-examine phylogenetic relationships among early-branching euteleostean fish lineages classically grouped in the Protacanthopterygii using DNA sequence data surrounding ultraconserved elements. We report and examine a dataset of thirty-four OTUs with 17,957 aligned characters from fifty-three nuclear loci. Phylogenetic analysis is conducted in concatenated, joint gene trees and species tree estimation and summary coalescent frameworks. All analytical frameworks yield supporting evidence for existing hypotheses of relationship for the placement of Lepidogalaxias salamandroides, monophyly of the Stomiatii and the presence of an esociform + salmonid clade. Lepidogalaxias salamandroides and the Esociformes + Salmoniformes are successive sister lineages to all other euteleosts in the majority of analyses. The concatenated and joint gene trees and species tree analysis types produce high support values for this arrangement. However, inter-relationships of Argentiniformes, Stomiatii and Neoteleostei remain uncertain as they varied by analysis type while receiving strong and contradictory indices of support. Topological differences between analysis types are also apparent within the otomorph and the percomorph taxa in the data set. Our results identify concordant areas with strong support for relationships within and between early-branching euteleost lineages but they also reveal limitations in the ability of larger datasets to conclusively resolve other aspects of that phylogeny. PMID:28929008
Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences
Yue, Yaojing; Guo, Xian; Guo, Tingting; Chu, Min; Wang, Fan; Han, Jilong; Feng, Ruilin; Sun, Xiaoping; Niu, Chune; Yang, Bohui; Guo, Jian; Yuan, Chao
2016-01-01
The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries) is not well understood, and little is known about this species’ genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D) were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau. PMID:27463976
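The summary statistics reported in this abstract (haplotype diversity, nucleotide diversity, average number of nucleotide differences) can be illustrated with a minimal sketch. It assumes Nei's standard definitions and a gap-free alignment of equal-length sequences. The abstract does not say which software or gap handling the authors used, and the toy sequences below are hypothetical.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(sequences):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(sequences)
    if n < 2:
        raise ValueError("need at least two sequences")
    counts = Counter(sequences)  # identical strings count as one haplotype
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def mean_pairwise_differences(sequences):
    """Average number of differing sites over all pairs of aligned sequences."""
    pairs = list(combinations(sequences, 2))
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)


def nucleotide_diversity(sequences):
    """Mean pairwise differences divided by alignment length (per-site value)."""
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to the same length")
    return mean_pairwise_differences(sequences) / length


# Hypothetical toy alignment, not data from the study.
toy = ["ACGTACGT", "ACGTACGA", "ACGTACGT", "TCGTACGA"]
print(haplotype_diversity(toy))
print(mean_pairwise_differences(toy))
print(nucleotide_diversity(toy))
```

Under these definitions the per-site nucleotide diversity is the average number of differences divided by the number of sites compared. That relation fits the reported values: 19.635 differences and a diversity of 0.019 imply roughly a thousand compared sites.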
Trait-based assembly and phylogenetic structure in northeast Pacific rockfish assemblages.
Ingram, Travis; Shurin, Jonathan B
2009-09-01
If natural communities are assembled according to deterministic rules, coexisting species will represent a nonrandom subset of the potential species pool. We tested for signatures of assembly rules in the distribution of species' traits in Pacific rockfish (Sebastes spp.) assemblages. We used morphology, dietary niche (estimated with stable nitrogen isotopes), and distribution data to identify traits that relate to local-scale resource use (the alpha-niche) and to environmental gradients (the beta-niche). We showed that gill raker morphology was related to trophic position (an alpha-niche axis), while relative eye size was associated with depth habitat (a beta-niche axis). We therefore hypothesized that, within assemblages of coexisting rockfish species, the gill raker trait would be overdispersed (evenly spaced) due to limiting similarity, while relative eye size would be clustered due to environmental filtering. We examined the evolutionary relatedness of coexisting species to ask whether phylogenetic community structure and trait distributions gave similar indications about the roles of assembly processes. We tested the trait distributions and phylogenetic structure of 30 published rockfish assemblages against a null model of random community assembly. As predicted, the gill raker trait tended to be more evenly spaced than expected by chance, as did overall body size, while relative eye size was more clustered than expected. Phylogenetic community structure appeared to reflect historical dispersal and speciation and did not provide consistent support for assembly rules. Our results indicate that rockfish community assembly is nonrandom with regard to species' traits and show how distinguishing traits related to the alpha- and beta-niches and incorporating functional morphology can provide for powerful tests of assembly rules.
Xu, Jinshi; Chen, Yu; Zhang, Lixia; Chai, Yongfu; Wang, Mao; Guo, Yaoxin; Li, Ting; Yue, Ming
2017-07-01
Community assembly processes are a primary focus of community ecology. Jointly using phylogeny-based and functional trait-based methods to explore these processes along environmental gradients is a useful way to explain how assembly mechanisms change in a changing world. Our study combined these methods to test assembly processes across wide gradients of elevation and other habitat environmental factors. We collected our data at 40 plots on Taibai Mountain, China, spanning more than 2,300 m of elevation, and then measured traits and environmental factors. Variance partitioning was used to distinguish the main environmental factors driving changes in phylogeny and traits among the 40 plots. Principal component analysis (PCA) was applied to combine the other environmental factors. Community assembly patterns along environmental gradients, based on phylogenetic and functional methods, were studied to explore assembly mechanisms. Phylogenetic signal was calculated for each community along environmental gradients in order to detect the variation of trait performance on phylogeny. Elevation showed better explanatory power than other environmental factors for the variance in phylogeny and in most traits. Phylogenetic structure and several functional structures were clustered at high elevation, while some conserved traits were overdispersed. A convergent tendency, which might be caused by filtering or competition along elevation, was detected based on functional traits. Leaf dry matter content (LDMC) and leaf nitrogen content along PCA axis 1 showed patterns that conflicted with those shown along elevation. LDMC exhibited the strongest phylogenetic signal. Only the phylogenetic signal of maximum plant height showed an explicable change along environmental gradients. Synthesis. Elevation is the best environmental factor for predicting changes in phylogeny and traits. 
Plant phylogenetic structure and some functional structures show environmental filtering in the alpine region, while different traits and phylogeny indicate different assembly processes in the middle- and low-altitude regions. The results highlight that deterministic processes dominate community assembly along large-scale environmental gradients. The performance of phylogeny and traits along gradients may be independent of each other. The novel method for calculating functional structure used in this study, and the focus on changes in phylogenetic signal along gradients, may provide more useful ways to detect community assembly mechanisms.
Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.
Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju
2015-01-01
Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.
Rapid diversification of Tragopogon and ecological associates in Eurasia.
Bell, C D; Mavrodiev, E V; Soltis, P S; Calaminus, A K; Albach, D C; Cellinese, N; Garcia-Jacas, N; Soltis, D E
2012-12-01
Tragopogon comprises approximately 150 described species distributed throughout Eurasia from Ireland and the UK to India and China with a few species in North Africa. Most of the species diversity is found in Eastern Europe to Western Asia. Previous phylogenetic analyses identified several major clades, generally corresponding to recognized taxonomic sections, although relationships both among these clades and among species within clades remain largely unresolved. These patterns are consistent with rapid diversification following the origin of Tragopogon, and this study addresses the timing and rate of diversification in Tragopogon. Using BEAST to simultaneously estimate a phylogeny and divergence times, we estimate the age of a major split and subsequent rapid divergence within Tragopogon to be ~2.6 Ma (and 1.7-5.4 Ma using various clock estimates). Based on the age estimates obtained with BEAST (HPD 1.7-5.4 Ma) for the origin of crown group Tragopogon and 200 estimated species (to accommodate a large number of cryptic species), the diversification rate of Tragopogon is approximately 0.84-2.71 species/Myr for the crown group, assuming low levels of extinction. This estimate is comparable in rate to a rapid Eurasian radiation in Dianthus (0.66-3.89 species/Myr), which occurs in the same or similar habitats. Using available data, we show that subclades of various plant taxa that occur in the same semi-arid habitats of Eurasia also represent rapid radiations occurring during roughly the same window of time (1.7-5.4 Ma), suggesting similar causal events. However, not all species-rich plant genera from the same habitats diverged at the same time, or at the same tempo. Radiations of several other clades in this same habitat (e.g. Campanula, Knautia, Scabiosa) occurred at earlier dates (45-4.28 Ma). 
Existing phylogenetic data and diversification estimates therefore indicate that, although some elements of these semi-arid communities radiated during the Plio-Pleistocene period, other clades sharing the same habitat appear to have diversified earlier. © 2012 The Authors. Journal of Evolutionary Biology © 2012 European Society For Evolutionary Biology.
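The diversification-rate range quoted in this abstract can be approximately reproduced with a short calculation. The sketch below assumes the widely used method-of-moments estimator for a crown group with no extinction, r = (ln n - ln 2) / t. The abstract says only "assuming low levels of extinction" and does not name the estimator, so this is an illustration and not the authors' exact procedure.

```python
import math


def crown_diversification_rate(n_species, crown_age_myr):
    """Net diversification rate (species/Myr) for a crown group, zero extinction.

    Assumed estimator: r = (ln n - ln 2) / t, where the ln 2 term reflects
    that a crown group starts from two lineages.
    """
    if n_species < 2 or crown_age_myr <= 0:
        raise ValueError("need n_species >= 2 and a positive crown age")
    return (math.log(n_species) - math.log(2)) / crown_age_myr


# Values taken from the abstract: 200 estimated species, crown age 1.7-5.4 Ma.
for age in (1.7, 5.4):
    rate = crown_diversification_rate(200, age)
    print(f"crown age {age} Ma: {rate:.2f} species/Myr")
```

With 200 species this gives about 2.71 species/Myr for a crown age of 1.7 Ma and about 0.85 species/Myr for 5.4 Ma. The published range is 0.84-2.71. The small gap at the lower bound may reflect a nonzero extinction fraction or rounding in the original analysis.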
The phylogenetic relationship of Alexandrium monilatum to other Alexandrium spp. was explored using 18S rDNA sequences. Maximum likelihood phylogenetic analysis of the combined rDNA sequences established that A. monilatum paired with Alexandrium taylori and that the pair was the ...
A phylogenetically-based nomenclature for Cordycipitaceae (Hypocreales)
USDA-ARS's Scientific Manuscript database
Changes in Article 59 of the International Code of Nomenclature for algae, fungi, and plants (ICN) disallow the use of dual nomenclatural systems for fungi. This change requires the reconciliation of competing names, ideally linked through culture based or molecular methods. The phylogenetic syste...
West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N
2014-07-01
The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. 
We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of closely related organisms, and discuss how it could be extended to future studies of multilocus rDNA systems. [concerted evolution; genome hybridisation; phylogenetic analysis; ribosomal DNA; whole genome sequencing; yeast]. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Pyron, R Alexander
2017-01-01
Here, I combine previously underutilized models and priors to perform more biologically realistic phylogenetic inference from morphological data, with an example from squamate reptiles. When coding morphological characters, it is often possible to denote ordered states with explicit reference to observed or hypothetical ancestral conditions. Using this logic, we can integrate across character-state labels and estimate meaningful rates of forward and backward transitions from plesiomorphy to apomorphy. I refer to this approach as MkA, for “asymmetric.” The MkA model incorporates the biological reality of limited reversal for many phylogenetically informative characters, and significantly increases likelihoods in the empirical data sets. Despite this, the phylogeny of Squamata remains contentious. Total-evidence analyses using combined morphological and molecular data and the MkA approach tend toward recent consensus estimates supporting a nested Iguania. However, support for this topology is not unambiguous across data sets or analyses, and no mechanism has been proposed to explain the widespread incongruence between partitions, or the hidden support for various topologies in those partitions. Furthermore, different morphological data sets produced by different authors contain both different characters and different states for the same or similar characters, resulting in drastically different placements for many important fossil lineages. Effort is needed to standardize ontology for morphology, resolve incongruence, and estimate a robust phylogeny. The MkA approach provides a preliminary avenue for investigating morphological evolution while accounting for temporal evidence and asymmetry in character-state changes.
Distance-Based Phylogenetic Methods Around a Polytomy.
Davidson, Ruth; Sullivant, Seth
2014-01-01
Distance-based phylogenetic algorithms attempt to solve the NP-hard least-squares phylogeny problem by mapping an arbitrary dissimilarity map representing biological data to a tree metric. The set of all dissimilarity maps is a Euclidean space properly containing the space of all tree metrics as a polyhedral fan. Outputs of distance-based tree reconstruction algorithms such as UPGMA and neighbor-joining are points in the maximal cones in the fan. Tree metrics with polytomies lie at the intersections of maximal cones. A phylogenetic algorithm divides the space of all dissimilarity maps into regions based upon which combinatorial tree is reconstructed by the algorithm. Comparison of phylogenetic methods can be done by comparing the geometry of these regions. We use polyhedral geometry to compare the local nature of the subdivisions induced by least-squares phylogeny, UPGMA, and neighbor-joining when the true tree has a single polytomy with exactly four neighbors. Our results suggest that in some circumstances, UPGMA and neighbor-joining poorly match least-squares phylogeny.
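The distinction between an arbitrary dissimilarity map and a tree metric can be made concrete with the classical four-point condition: for every four taxa, the two largest of the three pairwise-sum totals must be equal. The sketch below checks that condition on quadruples of distinct taxa. The helper names and the toy distances are hypothetical, and the code does not reproduce the polyhedral analysis described in the abstract.

```python
from itertools import combinations


def quartet_sums(d, i, j, k, l):
    """Return the three pairwise-sum totals for one quadruple of taxa."""
    def dist(x, y):
        return d[frozenset((x, y))]
    return (dist(i, j) + dist(k, l),
            dist(i, k) + dist(j, l),
            dist(i, l) + dist(j, k))


def satisfies_four_point(d, taxa, tol=1e-9):
    """True if, for every quadruple, the two largest totals agree within tol."""
    for quad in combinations(taxa, 4):
        low, mid, high = sorted(quartet_sums(d, *quad))
        if high - mid > tol:
            return False
    return True


def make_map(**pairs):
    """Build a dissimilarity map from keyword arguments such as ab=2."""
    return {frozenset(name): value for name, value in pairs.items()}


# Resolved quartet ab|cd: unit pendant edges and a unit internal edge.
resolved = make_map(ab=2, cd=2, ac=3, ad=3, bc=3, bd=3)
# Star tree (a polytomy): the internal edge has length zero.
star = make_map(ab=2, cd=2, ac=2, ad=2, bc=2, bd=2)
# A perturbed map that no tree can realize exactly.
noisy = make_map(ab=2, cd=2, ac=3.5, ad=3, bc=3, bd=3)

for name, d in (("resolved", resolved), ("star", star), ("noisy", noisy)):
    print(name, quartet_sums(d, *"abcd"), satisfies_four_point(d, "abcd"))
```

For the star tree all three totals tie, which is one way to see why metrics with a polytomy sit on the boundary shared by several resolved topologies. The perturbed map fails the condition, so a reconstruction method has to choose a nearby tree metric for it.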
Thuillard, Marc; Fraix-Burnet, Didier
2015-01-01
This article presents an innovative approach to phylogenies based on the reduction of multistate characters to binary-state characters. We show that the reduction to binary characters' approach can be applied to both character- and distance-based phylogenies and provides a unifying framework to explain simply and intuitively the similarities and differences between distance- and character-based phylogenies. Building on these results, this article gives a possible explanation on why phylogenetic trees obtained from a distance matrix or a set of characters are often quite reasonable despite lateral transfers of genetic material between taxa. In the presence of lateral transfers, outer planar networks furnish a better description of evolution than phylogenetic trees. We present a polynomial-time reconstruction algorithm for perfect outer planar networks with a fixed number of states, characters, and lateral transfers.
An Improved Binary Differential Evolution Algorithm to Infer Tumor Phylogenetic Trees.
Liang, Ying; Liao, Bo; Zhu, Wen
2017-01-01
Tumourigenesis is a mutation accumulation process, which is likely to start with a mutated founder cell. The evolutionary nature of tumor development makes phylogenetic models suitable for inferring tumor evolution through genetic variation data. Copy number variation (CNV) is the major genetic marker of the genome with more genes, disease loci, and functional elements involved. Fluorescence in situ hybridization (FISH) accurately measures the copy numbers of multiple genes in hundreds of single cells. We propose an improved binary differential evolution algorithm, BDEP, to infer tumor phylogenetic trees based on the FISH platform. Topological analysis of the tumor progression trees shows that the pathway of tumor subcell expansion varies greatly during different stages of tumor formation. The classification experiment shows that tree-based features are better than data-based features at distinguishing tumors. The constructed phylogenetic trees perform well in characterizing the tumor development process, outperforming other similar algorithms.
Moreira, Xoaquín; Abdala-Roberts, Luis; Galmán, Andrea; Francisco, Marta; Fuente, María de la; Butrón, Ana; Rasmann, Sergio
2018-06-07
Biogeographical factors and phylogenetic history are key determinants of inter-specific variation in plant defences. However, few studies have conducted broad-scale geographical comparisons of plant defences while controlling for phylogenetic relationships, and, in doing so, none have separated constitutive from induced defences. This gap has limited our understanding of how historical or large-scale processes mediate biogeographical patterns in plant defences since these may be contingent upon shared evolutionary history and phylogenetic constraints. We conducted a phylogenetically-controlled experiment testing for differences in constitutive leaf chemical defences and their inducibility between Palearctic and Nearctic oak species (Quercus, total 18 species). We induced defences in one-year old plants by inflicting damage by gypsy moth larvae (Lymantria dispar), estimated the amount of leaf area consumed, and quantified various groups of phenolic compounds. There was no detectable phylogenetic signal for constitutive or induced levels of most defensive traits except for constitutive condensed tannins, as well as no phylogenetic signal in leaf herbivory. We did, however, find marked differences in defence levels between oak species from each region: Palearctic species had higher levels of constitutive condensed tannins, but less constitutive lignins and less constitutive and induced hydrolysable tannins compared with Nearctic species. Additionally, Palearctic species had lower levels of leaf damage compared with Nearctic species. These differences in leaf damage, lignins and hydrolysable (but not condensed) tannins were lost after accounting for phylogeny, suggesting that geographical structuring of phylogenetic relationships mediated biogeographical differences in defences and herbivore resistance. 
Together, these findings suggest that historical processes and large-scale drivers have shaped differences in allocation to constitutive defences (and in turn resistance) between Palearctic and Nearctic oaks. Moreover, although evidence of phylogenetic conservatism in the studied traits is rather weak, shared evolutionary history appears to mediate some of these biogeographical patterns in allocation to chemical defences. Copyright © 2018 Elsevier Ltd. All rights reserved.
Bertels, Frederic; Marzel, Alex; Leventhal, Gabriel; Mitov, Venelin; Fellay, Jacques; Günthard, Huldrych F; Böni, Jürg; Yerly, Sabine; Klimkait, Thomas; Aubert, Vincent; Battegay, Manuel; Rauch, Andri; Cavassini, Matthias; Calmy, Alexandra; Bernasconi, Enos; Schmid, Patrick; Scherrer, Alexandra U; Müller, Viktor; Bonhoeffer, Sebastian; Kouyos, Roger; Regoes, Roland R
2018-01-01
Pathogen strains may differ in virulence because they attain different loads in their hosts, or because they induce different disease-causing mechanisms independent of their load. In evolutionary ecology, the latter is referred to as "per-parasite pathogenicity". Using viral load and CD4+ T-cell measures from 2014 HIV-1 subtype B-infected individuals enrolled in the Swiss HIV Cohort Study, we investigated if virulence, measured as the rate of decline of CD4+ T cells, and per-parasite pathogenicity are heritable from donor to recipient. We estimated heritability by donor-recipient regressions applied to 196 previously identified transmission pairs, and by phylogenetic mixed models applied to a phylogenetic tree inferred from HIV pol sequences. Regressing the CD4+ T-cell declines and per-parasite pathogenicities of the transmission pairs did not yield heritability estimates significantly different from zero. With the phylogenetic mixed model, however, our best estimate for the heritability of the CD4+ T-cell decline is 17% (5-30%), and that of the per-parasite pathogenicity is 17% (4-29%). Further, we confirm that the set-point viral load is heritable, and estimate a heritability of 29% (12-46%). Interestingly, the pattern of evolution of all these traits differs significantly from neutrality, and is most consistent with stabilizing selection for the set-point viral load, and with directional selection for the CD4+ T-cell decline and the per-parasite pathogenicity. Our analysis shows that the viral genotype affects virulence mainly by modulating the per-parasite pathogenicity, while the indirect effect via the set-point viral load is minor. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Bertels, Frederic; Marzel, Alex; Leventhal, Gabriel; Mitov, Venelin; Fellay, Jacques; Günthard, Huldrych F; Böni, Jürg; Yerly, Sabine; Klimkait, Thomas; Aubert, Vincent; Battegay, Manuel; Rauch, Andri; Cavassini, Matthias; Calmy, Alexandra; Bernasconi, Enos; Schmid, Patrick; Scherrer, Alexandra U; Müller, Viktor; Bonhoeffer, Sebastian; Kouyos, Roger; Regoes, Roland R
2018-01-01
Pathogen strains may differ in virulence because they attain different loads in their hosts, or because they induce different disease-causing mechanisms independent of their load. In evolutionary ecology, the latter is referred to as “per-parasite pathogenicity”. Using viral load and CD4+ T-cell measures from 2014 HIV-1 subtype B-infected individuals enrolled in the Swiss HIV Cohort Study, we investigated if virulence, measured as the rate of decline of CD4+ T cells, and per-parasite pathogenicity are heritable from donor to recipient. We estimated heritability by donor–recipient regressions applied to 196 previously identified transmission pairs, and by phylogenetic mixed models applied to a phylogenetic tree inferred from HIV pol sequences. Regressing the CD4+ T-cell declines and per-parasite pathogenicities of the transmission pairs did not yield heritability estimates significantly different from zero. With the phylogenetic mixed model, however, our best estimate for the heritability of the CD4+ T-cell decline is 17% (5–30%), and that of the per-parasite pathogenicity is 17% (4–29%). Further, we confirm that the set-point viral load is heritable, and estimate a heritability of 29% (12–46%). Interestingly, the pattern of evolution of all these traits differs significantly from neutrality, and is most consistent with stabilizing selection for the set-point viral load, and with directional selection for the CD4+ T-cell decline and the per-parasite pathogenicity. Our analysis shows that the viral genotype affects virulence mainly by modulating the per-parasite pathogenicity, while the indirect effect via the set-point viral load is minor. PMID:29029206
Genome-wide heterogeneity of nucleotide substitution model fit.
Arbiza, Leonardo; Patricio, Mateus; Dopazo, Hernán; Posada, David
2011-01-01
At a genomic scale, the patterns that have shaped molecular evolution are believed to be largely heterogeneous. Consequently, comparative analyses should use appropriate probabilistic substitution models that capture the main features under which different genomic regions have evolved. While efforts have concentrated on the development and understanding of model selection techniques, no descriptions of overall relative substitution model fit at the genome level have been reported. Here, we provide a characterization of best-fit substitution models across three genomic data sets including coding regions from mammals, vertebrates, and Drosophila (24,000 alignments). According to the Akaike Information Criterion (AIC), 82 of 88 models considered were selected as best-fit models on at least one occasion, although with very different frequencies. Most parameter estimates also varied broadly among genes. Patterns found for vertebrates and Drosophila were quite similar and often more complex than those found in mammals. Phylogenetic trees derived from models in the 95% confidence interval set showed much less variance and were significantly closer to the tree estimated under the best-fit model than trees derived from models outside this interval. Although alternative criteria selected simpler models than the AIC, they suggested similar patterns. Altogether, our results show that at a genomic scale, different gene alignments for the same set of taxa are best explained by a large variety of different substitution models and that model choice has implications for different parameter estimates, including the inferred phylogenetic trees. After taking into account the differences related to sample size, our results suggest a noticeable diversity in the underlying evolutionary process. Altogether, we conclude that the use of model selection techniques is important to obtain consistent phylogenetic estimates from real data at a genomic scale.
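The AIC-based selection and the "95% confidence interval set" of models mentioned in this abstract can be sketched as follows. The sketch assumes the common construction in which models are ranked by Akaike weight and added until the cumulative weight reaches 0.95. The abstract does not spell out the procedure, and the log-likelihoods below are hypothetical values for a single alignment.

```python
import math


def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 ln L (lower is better)."""
    return 2 * n_params - 2 * log_likelihood


def akaike_weights(aic_scores):
    """Convert AIC scores to normalized relative likelihoods (weights sum to 1)."""
    best = min(aic_scores.values())
    rel = {m: math.exp(-0.5 * (a - best)) for m, a in aic_scores.items()}
    total = sum(rel.values())
    return {m: r / total for m, r in rel.items()}


def confidence_set(weights, level=0.95):
    """Smallest set of top-ranked models whose cumulative weight reaches level."""
    chosen, cumulative = [], 0.0
    for model, w in sorted(weights.items(), key=lambda kv: kv[1], reverse=True):
        chosen.append(model)
        cumulative += w
        if cumulative >= level:
            break
    return chosen


# Hypothetical fits: (log-likelihood, free substitution-model parameters).
# Branch lengths are omitted because, on a fixed tree, they add the same
# constant to every model's AIC and so do not change the ranking.
fits = {"JC": (-1250.0, 0), "HKY": (-1210.0, 4),
        "GTR": (-1207.5, 8), "GTR+G": (-1204.0, 9)}
scores = {m: aic(lnl, k) for m, (lnl, k) in fits.items()}
weights = akaike_weights(scores)
print(scores)
print(confidence_set(weights))
```

In this toy case the best model alone carries less than 95% of the weight, so the confidence set also includes the next two models. That illustrates why trees estimated under several near-best models are worth comparing.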
Mystkowska, Katarzyna; Kras, Marta; Dudek, Magdalena
2016-01-01
The location of possible glacial refugia of six Apostasioideae representatives is estimated based on ecological niche modeling analysis. The distribution of their suitable niches during the last glacial maximum (LGM) is compared with their current potential and documented geographical ranges. The climatic factors limiting the studied species occurrences are evaluated and the niche overlap between the studied orchids is assessed and discussed. The predicted niche occupancy profiles and reconstruction of ancestral climatic tolerances suggest high level of phylogenetic niche conservatism within Apostasioideae. PMID:27635348
Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter
2017-12-01
Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.
Bayesian Total-Evidence Dating Reveals the Recent Crown Radiation of Penguins
Heath, Tracy A.; Ksepka, Daniel T.; Stadler, Tanja; Welch, David; Drummond, Alexei J.
2017-01-01
The total-evidence approach to divergence time dating uses molecular and morphological data from extant and fossil species to infer phylogenetic relationships, species divergence times, and macroevolutionary parameters in a single coherent framework. Current model-based implementations of this approach lack an appropriate model for the tree describing the diversification and fossilization process and can produce estimates that lead to erroneous conclusions. We address this shortcoming by providing a total-evidence method implemented in a Bayesian framework. This approach uses a mechanistic tree prior to describe the underlying diversification process that generated the tree of extant and fossil taxa. Previous attempts to apply the total-evidence approach have used tree priors that do not account for the possibility that fossil samples may be direct ancestors of other samples, that is, ancestors of fossil or extant species or of clades. The fossilized birth–death (FBD) process explicitly models the diversification, fossilization, and sampling processes and naturally allows for sampled ancestors. This model was recently applied to estimate divergence times based on molecular data and fossil occurrence dates. We incorporate the FBD model and a model of morphological trait evolution into a Bayesian total-evidence approach to dating species phylogenies. We apply this method to extant and fossil penguins and show that the modern penguins radiated much more recently than has been previously estimated, with the basal divergence in the crown clade occurring at ~12.7 Ma and most splits leading to extant species occurring in the last 2 myr. 
Our results demonstrate that including stem-fossil diversity can greatly improve the estimates of the divergence times of crown taxa. The method is available in BEAST2 (version 2.4) software www.beast2.org with packages SA (version at least 1.1.4) and morph-models (version at least 1.0.4) installed. [Birth–death process; calibration; divergence times; MCMC; phylogenetics.] PMID:28173531
Ritchie, Andrew M; Lo, Nathan; Ho, Simon Y W
2017-05-01
In Bayesian phylogenetic analyses of genetic data, prior probability distributions need to be specified for the model parameters, including the tree. When Bayesian methods are used for molecular dating, available tree priors include those designed for species-level data, such as the pure-birth and birth-death priors, and coalescent-based priors designed for population-level data. However, molecular dating methods are frequently applied to data sets that include multiple individuals across multiple species. Such data sets violate the assumptions of both the speciation and coalescent-based tree priors, making it unclear which should be chosen and whether this choice can affect the estimation of node times. To investigate this problem, we used a simulation approach to produce data sets with different proportions of within- and between-species sampling under the multispecies coalescent model. These data sets were then analyzed under pure-birth, birth-death, constant-size coalescent, and skyline coalescent tree priors. We also explored the ability of Bayesian model testing to select the best-performing priors. We confirmed the applicability of our results to empirical data sets from cetaceans, phocids, and coregonid whitefish. Estimates of node times were generally robust to the choice of tree prior, but some combinations of tree priors and sampling schemes led to large differences in the age estimates. In particular, the pure-birth tree prior frequently led to inaccurate estimates for data sets containing a mixture of inter- and intraspecific sampling, whereas the birth-death and skyline coalescent priors produced stable results across all scenarios. Model testing provided an adequate means of rejecting inappropriate tree priors. Our results suggest that tree priors do not strongly affect Bayesian molecular dating results in most cases, even when severely misspecified. 
However, the choice of tree prior can be significant for the accuracy of dating results in the case of data sets with mixed inter- and intraspecies sampling. [Bayesian phylogenetic methods; model testing; molecular dating; node time; tree prior.] © The authors 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Jeon, Sun Jeong; Nguyen, Thi Thuong Thuong; Lee, Hyang Burm
2015-09-01
A seed-borne fungus, Curvularia sp. EML-KWD01, was isolated from an indigenous wheat seed by standard blotter method. This fungus was characterized based on the morphological characteristics and molecular phylogenetic analysis. Phylogenetic status of the fungus was determined using sequences of three loci: rDNA internal transcribed spacer, large ribosomal subunit, and glyceraldehyde 3-phosphate dehydrogenase gene. Multi loci sequencing analysis revealed that this fungus was Curvularia spicifera within Curvularia group 2 of family Pleosporaceae.
Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.
Kendall, Michelle; Colijn, Caroline
2016-10-01
Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Keywords: phylogenetics, evolution, tree metrics, genetics, sequencing. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chen, Zhuo; Xu, Shixia; Zhou, Kaiya; Yang, Guang
2011-10-27
Background A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. Results An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Monodontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Monodontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Conclusions Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. 
Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future. PMID:22029548
The relevance of phylogeny to studies of global change.
Edwards, Erika J; Still, Christopher J; Donoghue, Michael J
2007-05-01
Phylogenetic thinking has infiltrated many areas of biological research, but has had little impact on studies of global ecology or climate change. Here, we illustrate how phylogenetic information can be relevant to understanding vegetation-atmosphere dynamics at ecosystem or global scales by re-analyzing a data set of carbonic anhydrase (CA) activity in leaves that was used to estimate terrestrial gross primary productivity. The original calculations relied on what appeared to be low CA activity exclusively in C4 grasses, but our analyses indicate that such activity might instead characterize the PACCAD grass lineage, which includes many widespread C3 species. We outline how phylogenetics can guide better taxon sampling of key physiological traits, and discuss how the emerging field of phyloinformatics presents a promising new framework for scaling from organism physiology to global processes.
Kim, Jiyeon; Kern, Elizabeth; Kim, Taeho; Sim, Mikang; Kim, Jaebum; Kim, Yuseob; Park, Chungoo; Nadler, Steven A; Park, Joong-Ki
2017-02-01
Plectida is an important nematode order with species that occupy many different biological niches. The order includes free-living aquatic and soil-dwelling species, but its phylogenetic position has remained uncertain. We sequenced the complete mitochondrial genomes of two members of this order, Plectus acuminatus and Plectus aquatilis and compared them with those of other major nematode clades. The genome size and base composition of these species are similar to other nematodes; 14,831 and 14,372 bp, respectively, with AT contents of 71.0% and 70.1%. Gene content was also similar to other nematodes, but gene order and coding direction of Plectus mtDNAs were dissimilar from other chromadorean species. P. acuminatus and P. aquatilis are the first chromadorean species found to contain a gene inversion. We reconstructed mitochondrial genome phylogenetic trees using nucleotide and amino acid datasets from 87 nematodes that represent major nematode clades, including the Plectus sequences. Trees from phylogenetic analyses using maximum likelihood and Bayesian methods depicted Plectida as the sister group to other sequenced chromadorean nematodes. This finding is consistent with several phylogenetic results based on SSU rDNA, but disagrees with a classification based on morphology. Mitogenomes representing other basal chromadorean groups (Araeolaimida, Monhysterida, Desmodorida, Chromadorida) are needed to confirm their phylogenetic relationships. Copyright © 2016 Elsevier Inc. All rights reserved.
Ruggiero, Adriana
2017-01-01
The latitudinal diversity gradient has been considered a consequence of a shift in the impact of abiotic and biotic factors that limit species distributions from the poles to the equator, thus influencing species richness variation. It has also been considered the outcome of evolutionary processes that vary over geographical space. We used six South American mammal groups to test the association of environmental and evolutionary factors and the ecological structuring of mammal assemblages with spatial variation in taxonomic richness (TR), at a spatial resolution of 110 km × 110 km, at tropical and extra-tropical latitudes. Based on attributes that represent what mammal species do in ecosystems, we estimated ecological diversity (ED) as a mean pairwise ecological distance between all co-occurring taxa. The mean pairwise phylogenetic distance between all co-occurring taxa (AvPD) was used as an estimation of phylogenetic diversity. Geographically Weighted Regression analyses performed separately for each mammal group identified tropical and extra-tropical high R² areas where environmental and evolutionary factors strongly accounted for richness variation. Temperature was the most important predictor of TR in high R² areas outside the tropics, as was AvPD within the tropics. The proportion of TR variation accounted for by environment (either independently or combined with AvPD) was higher in tropical areas of high richness and low ecological diversity than in tropical areas of high richness and high ecological diversity. In conclusion, we confirmed a shift in the impact of environmental factors, mainly temperature, that best account for mammal richness variation in extra-tropical regions, whereas phylogenetic diversity best accounts for richness variation within the tropics. Environment in combination with evolutionary history explained the coexistence of a high number of ecologically similar species within the tropics. 
Consideration of the influence of contemporary environmental variables and evolutionary history is crucial to understanding of the latitudinal diversity gradient. PMID:28873434
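The mean pairwise distance that underlies both ED and AvPD in the abstract above can be sketched as follows. This is only an illustration, assuming a precomputed symmetric distance matrix; the function name `mean_pairwise_distance` and the toy distances are hypothetical and are not taken from the study.

```python
from itertools import combinations


def mean_pairwise_distance(dist, taxa):
    """Mean of dist[a][b] over all unordered pairs of co-occurring taxa."""
    pairs = list(combinations(taxa, 2))
    if not pairs:
        raise ValueError("need at least two co-occurring taxa")
    return sum(dist[a][b] for a, b in pairs) / len(pairs)


# Made-up distances among three taxa sharing one grid cell (illustration only).
toy_dist = {
    "A": {"A": 0.0, "B": 2.0, "C": 6.0},
    "B": {"A": 2.0, "B": 0.0, "C": 6.0},
    "C": {"A": 6.0, "B": 6.0, "C": 0.0},
}
avpd = mean_pairwise_distance(toy_dist, ["A", "B", "C"])
```

The same function would serve for ED if the matrix held ecological rather than phylogenetic distances; how the study built either matrix is not stated in the abstract.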
Moretzsohn, Márcio C.; Gouvea, Ediene G.; Inglis, Peter W.; Leal-Bertioli, Soraya C. M.; Valls, José F. M.; Bertioli, David J.
2013-01-01
Background and Aims The genus Arachis contains 80 described species. Section Arachis is of particular interest because it includes cultivated peanut, an allotetraploid, and closely related wild species, most of which are diploids. This study aimed to analyse the genetic relationships of multiple accessions of section Arachis species using two complementary methods. Microsatellites allowed the analysis of inter- and intraspecific variability. Intron sequences from single-copy genes allowed phylogenetic analysis including the separation of the allotetraploid genome components. Methods Intron sequences and microsatellite markers were used to reconstruct phylogenetic relationships in section Arachis through maximum parsimony and genetic distance analyses. Key Results Although high intraspecific variability was evident, there was good support for most species. However, some problems were revealed, notably a probable polyphyletic origin for A. kuhlmannii. The validity of the genome groups was well supported. The F, K and D genomes grouped close to the A genome group. The 2n = 18 species grouped closer to the B genome group. The phylogenetic tree based on the intron data strongly indicated that A. duranensis and A. ipaënsis are the ancestors of A. hypogaea and A. monticola. Intron nucleotide substitutions allowed the ages of divergences of the main genome groups to be estimated at a relatively recent 2·3–2·9 million years ago. This age and the number of species described indicate a much higher speciation rate for section Arachis than for legumes in general. Conclusions The analyses revealed relationships between the species and genome groups and showed a generally high level of intraspecific genetic diversity. The improved knowledge of species relationships should facilitate the utilization of wild species for peanut improvement. The estimates of speciation rates in section Arachis are high, but not unprecedented. 
We suggest these high rates may be linked to the peculiar reproductive biology of Arachis. PMID:23131301
Knowles, L Lacey; Huang, Huateng; Sukumaran, Jeet; Smith, Stephen A
2018-03-01
Discordant gene trees are commonly encountered when sequences from thousands of loci are applied to estimate phylogenetic relationships. Several processes contribute to this discord. Yet, we have no methods that jointly model different sources of conflict when estimating phylogenies. An alternative to analyzing entire genomes or all the sequenced loci is to identify a subset of loci for phylogenetic analysis. If we can identify data partitions that are most likely to reflect descent from a common ancestor (i.e., discordant loci that indeed reflect incomplete lineage sorting [ILS], as opposed to some other process, such as lateral gene transfer [LGT]), we can analyze this subset using powerful coalescent-based species-tree approaches. Test data sets were simulated where discord among loci could arise from ILS and LGT. Data sets were analyzed using the newly developed program CLASSIPHY (Huang et al., ) to assess whether our ability to distinguish the cause of discord among loci varied when ILS and LGT occurred in the recent versus deep past and whether the accuracy of these inferences was affected by the mutational process. We show that accuracy of probabilistic classification of individual loci by the cause of discord differed when ILS and LGT events occurred more recently compared with the distant past and that the signal-to-noise ratio arising from the mutational process contributes to difficulties in inferring LGT data partitions. We discuss our findings in terms of the promise and limitations of identifying subsets of loci for species-tree inference that will not violate the underlying coalescent model (i.e., data partitions in which ILS, and not LGT, contributes to discord). We also discuss the empirical implications of our work given the many recalcitrant nodes in the tree of life (e.g., origins of angiosperms, amniotes, or Neoaves), and recent arguments for concatenating loci. © 2018 Botanical Society of America.
Mahony, Stephen; Foley, Nicole M; Biju, S D; Teeling, Emma C
2017-03-01
Molecular dating studies typically need fossils to calibrate the analyses. Unfortunately, the fossil record is extremely poor or presently nonexistent for many species groups, rendering such dating analysis difficult. One such group is the Asian horned frogs (Megophryinae). Sampling all generic nomina, we combined a novel ∼5 kb dataset composed of four nuclear and three mitochondrial gene fragments to produce a robust phylogeny, with an extensive external morphological study to produce a working taxonomy for the group. Expanding the molecular dataset to include out-groups of fossil-represented ancestral anuran families, we compared the priorless RelTime dating method with the widely used prior-based Bayesian timetree method, MCMCtree, utilizing a novel combination of fossil priors for anuran phylogenetic dating. The phylogeny was then subjected to ancestral phylogeographic analyses, and dating estimates were compared with likely biogeographic vicariant events. Phylogenetic analyses demonstrated that previously proposed systematic hypotheses were incorrect due to the paraphyly of genera. Molecular phylogenetic, morphological, and timetree results support the recognition of Megophryinae as a single genus, Megophrys, with a subgenus level classification. Timetree results using RelTime better corresponded with the known fossil record for the out-group anuran tree. For the priorless in-group, it also outperformed MCMCtree when node date estimates were compared with likely influential historical biogeographic events, providing novel insights into the evolutionary history of this pan-Asian anuran group. Given a relatively small molecular dataset, and limited prior knowledge, this study demonstrates that the computationally rapid RelTime dating tool may outperform more popular and complex prior reliant timetree methodologies. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. 
Enterovirus D68 in Viet Nam (2009-2015).
Ny, Nguyen Thi Han; Anh, Nguyen To; Hang, Vu Thi Ty; Nguyet, Lam Anh; Thanh, Tran Tan; Ha, Do Quang; Minh, Ngo Ngoc Quang; Ha, Do Lien Anh; McBride, Angela; Tuan, Ha Manh; Baker, Stephen; Tam, Pham Thi Thanh; Phuc, Tran My; Huong, Dang Thao; Loi, Tran Quoc; Vu, Nguyen Tran Anh; Hung, Nguyen Van; Minh, Tran Thi Thuy; Xang, Nguyen Van; Dong, Nguyen; Nghia, Ho Dang Trung; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H Rogier; Anscombe, Catherine; Le Van, Tan
2017-01-01
Since 1962, enterovirus D68 (EV-D68) has been implicated in multiple outbreaks and sporadic cases of respiratory infection worldwide, but especially in the USA and Europe with an increasing frequency between 2010 and 2014. We describe the detection, associated clinical features and molecular characterization of EV-D68 in central and southern Viet Nam between 2009 and 2015. Enterovirus/rhinovirus PCR positive respiratory or CSF samples taken from children and adults with respiratory/central nervous system infections in Viet Nam were tested by an EV-D68 specific PCR. The included samples were derived from 3 different observational studies conducted at referral hospitals across central and southern Viet Nam between 2009 and 2015. Whole-genome sequencing was carried out using a MiSeq based approach. Phylogenetic reconstruction and estimation of evolutionary rate and recombination were carried out in BEAST and Recombination Detection Program, respectively. EV-D68 was detected in 21/625 (3.4%) enterovirus/rhinovirus PCR positive respiratory samples but in none of the 15 CSF samples. All the EV-D68 patients were young children (age range: 11.8 - 24.5 months) and had moderate respiratory infections. Phylogenetic analysis suggested that the Vietnamese sequences clustered with those from Asian countries, of which 9 fell in the B1 clade, and the remaining sequence was identified within the A2 clade. One intra sub-clade recombination event was detected, representing the second reported recombination within EV-D68. The evolutionary rate of EV-D68 was estimated to be 5.12 × 10⁻³ substitutions/site/year. Phylogenetic analysis indicated that the virus was imported into Viet Nam in 2008. We have demonstrated for the first time that EV-D68 has been circulating at low levels in Viet Nam since 2008, associated with moderate acute respiratory infection in children. 
EV-D68 in Viet Nam is most closely related to Asian viruses, and clusters separately from recent US and European viruses that were suggested to be associated with acute flaccid paralysis.
Quintero, Ignacio; Wiens, John J
2013-08-01
A key question in predicting responses to anthropogenic climate change is: how quickly can species adapt to different climatic conditions? Here, we take a phylogenetic approach to this question. We use 17 time-calibrated phylogenies representing the major tetrapod clades (amphibians, birds, crocodilians, mammals, squamates, turtles) and climatic data from distributions of > 500 extant species. We estimate rates of change based on differences in climatic variables between sister species and estimated times of their splitting. We compare these rates to predicted rates of climate change from 2000 to 2100. Our results are striking: matching projected changes for 2100 would require rates of niche evolution that are > 10,000 times faster than rates typically observed among species, for most variables and clades. Despite many caveats, our results suggest that adaptation to projected changes in the next 100 years would require rates that are largely unprecedented based on observed rates among vertebrate species. © 2013 John Wiley & Sons Ltd/CNRS.
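The comparison described in the abstract above, observed rates between sister species against the rate needed to track projected change, can be sketched as follows. This is a simplified illustration, assuming the rate is the absolute difference in a climatic variable divided by the time since the split; the study's exact estimator may differ. The function names and all numbers are hypothetical and made up for the example, not results from the paper.

```python
def sister_pair_rate(value_a, value_b, split_time_myr):
    """Absolute difference in a climatic variable per million years."""
    if split_time_myr <= 0:
        raise ValueError("split time must be positive")
    return abs(value_a - value_b) / split_time_myr


def required_rate(projected_change, years):
    """Rate per million years needed to track a change over `years`."""
    if years <= 0:
        raise ValueError("years must be positive")
    return projected_change / years * 1_000_000


# Made-up inputs: sister species differing by 2 degC, split 2 Myr ago,
# and a hypothetical 4 degC change projected over 100 years.
observed = sister_pair_rate(20.0, 22.0, 2.0)
needed = required_rate(4.0, 100)
ratio = needed / observed  # how many times faster than the observed rate
```

Expressing both rates in the same unit (change per million years) is what makes the ratio meaningful.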
Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano
2007-01-01
Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612
Qu, Xiao-Jian; Jin, Jian-Jun; Chaw, Shu-Miaw; Li, De-Zhu; Yi, Ting-Shuang
2017-01-01
Long-branch attraction (LBA) is a major obstacle in phylogenetic reconstruction. The phylogenetic relationships among Juniperus (J), Cupressus (C) and the Hesperocyparis-Callitropsis-Xanthocyparis (HCX) subclades of Cupressoideae are controversial. Our initial analyses of plastid protein-coding gene matrix revealed both J and C with much longer stem branches than those of HCX, so their sister relationships may be attributed to LBA. We used multiple measures including data filtering and modifying, evolutionary model selection and coalescent phylogenetic reconstruction to alleviate the LBA artifact. Data filtering by strictly removing unreliable aligned regions and removing substitution saturation genes and rapidly evolving sites could significantly reduce branch lengths of subclades J and C and recovered a relationship of J (C, HCX). In addition, using coalescent phylogenetic reconstruction could elucidate the LBA artifact and recovered J (C, HCX). However, some valid methods for other taxa were inefficient in alleviating the LBA artifact in J-C-HCX. Different strategies should be carefully considered and justified to reduce LBA in phylogenetic reconstruction of different groups. Three subclades of J-C-HCX were estimated to have experienced ancient rapid divergence within a short period, which could be another major obstacle in resolving relationships. Furthermore, our plastid phylogenomic analyses fully resolved the intergeneric relationships of Cupressoideae. PMID:28120880
Inferring epidemiological parameters from phylogenetic information for the HIV-1 epidemic among MSM
NASA Astrophysics Data System (ADS)
Quax, Rick; van de Vijver, David A. M. C.; Frentz, Dineke; Sloot, Peter M. A.
2013-09-01
The HIV-1 epidemic in Europe is primarily sustained by a dynamic topology of sexual interactions among MSM who have individual immune systems and behavior. This epidemiological process shapes the phylogeny of the virus population. Both epidemic modeling and phylogenetics have a long history; however, it remains difficult to use phylogenetic data to infer epidemiological parameters such as the structure of the sexual network and the per-act infectiousness. This is because phylogenetic data is necessarily incomplete and ambiguous. Here we show that the cluster-size distribution indeed contains information about epidemiological parameters using detailed numerical experiments. We simulate the HIV epidemic among MSM many times using the Monte Carlo method with all parameter values and their ranges taken from literature. For each simulation and the corresponding set of parameter values we calculate the likelihood of reproducing an observed cluster-size distribution. The result is an estimated likelihood distribution of all parameters from the phylogenetic data, in particular the structure of the sexual network, the per-act infectiousness, and the risk behavior reduction upon diagnosis. These likelihood distributions encode the knowledge provided by the observed cluster-size distribution, which we quantify using information theory. Our work suggests that the growing body of genetic data of patients can be exploited to understand the underlying epidemiological process.
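One way to score how well a single simulation run reproduces an observed cluster-size distribution is sketched below. The abstract does not specify the likelihood the authors used, so this assumes a multinomial likelihood with a small pseudocount; the function name `cluster_log_likelihood`, the `pseudocount` parameter, and the toy cluster sizes are all hypothetical.

```python
import math
from collections import Counter


def cluster_log_likelihood(observed_sizes, simulated_sizes, pseudocount=0.5):
    """Multinomial log-likelihood (up to a constant) of the observed cluster
    sizes under the size frequencies produced by one simulation run."""
    sim_counts = Counter(simulated_sizes)
    # Sizes seen in either sample; the pseudocount avoids log(0) for sizes
    # that the simulation never produced.
    support = set(sim_counts) | set(observed_sizes)
    total = len(simulated_sizes) + pseudocount * len(support)
    loglik = 0.0
    for size, n_obs in Counter(observed_sizes).items():
        p = (sim_counts[size] + pseudocount) / total
        loglik += n_obs * math.log(p)
    return loglik


# Made-up cluster sizes: one run resembles the observation, one does not.
observed = [1, 1, 2, 3]
ll_close = cluster_log_likelihood(observed, [1, 1, 1, 2, 2, 3])
ll_far = cluster_log_likelihood(observed, [5, 5, 5, 6, 6, 7])
```

Repeating this over many simulated parameter sets would give a likelihood value per parameter set, in the spirit of the procedure the abstract describes.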
Patterns of co-speciation and host switching in primate malaria parasites.
Garamszegi, László Zsolt
2009-05-22
The evolutionary history of many parasites is dependent on the evolution of their hosts, leading to an association between host and parasite phylogenies. However, frequent host switches across broad phylogenetic distances may weaken this close evolutionary link, especially when vectors are involved in parasite transmission, as is the case for malaria pathogens. Several studies suggested that the evolution of the primate-infective malaria lineages may be constrained by the phylogenetic relationships of their hosts, and that lateral switches between distantly related hosts may have occurred. However, no systematic analysis has quantified the degree of phylogenetic association between primates and their malaria parasites. Here phylogenetic approaches have been used to discriminate statistically between events due to co-divergence, duplication, extinction and host switches that can potentially cause historical association between Plasmodium parasites and their primate hosts. A Bayesian reconstruction of parasite phylogeny based on genetic information for six genes served as basis for the analyses, which could account for uncertainties about the evolutionary hypotheses of malaria parasites. Related lineages of primate-infective Plasmodium tend to infect hosts within the same taxonomic family. Different analyses testing for congruence between host and parasite phylogenies unanimously revealed a significant association between the corresponding evolutionary trees. The most important factor that resulted in this association was host switching, but depending on the parasite phylogeny considered, co-speciation and duplication may have also played some additional role. Sorting seemed to be a relatively infrequent event, and can occur only under extreme co-evolutionary scenarios. 
The concordance between host and parasite phylogenies is heterogeneous: while the evolution of some malaria pathogens is strongly dependent on the phylogenetic history of their primate hosts, the congruent evolution is less emphasized for other parasite lineages (e.g. for human malaria parasites). Estimation of ancestral states of host use along the phylogenetic tree of parasites revealed that lateral transfers across distantly related hosts were likely to occur in several cases. Parasites cannot infect all available hosts, and they should preferentially infect hosts that provide a similar environment for reproduction. Marginally significant evidence suggested that there might be a consistent variation within host ranges in terms of physiology. The evolution of primate malarias is constrained by the phylogenetic associations of their hosts. Some parasites can preserve a great flexibility to infect hosts across a large phylogenetic distance, thus host switching can be an important factor in mediating host ranges observed in nature. Due to this inherent flexibility and the potential exposure to various vectors, the emergence of new malaria disease in primates including humans cannot be predicted from the phylogeny of parasites.
The phylogenetic distribution of extrafloral nectaries in plants.
Weber, Marjorie G; Keeler, Kathleen H
2013-06-01
Understanding the evolutionary patterns of ecologically relevant traits is a central goal in plant biology. However, for most important traits, we lack the comprehensive understanding of their taxonomic distribution needed to evaluate their evolutionary mode and tempo across the tree of life. Here we evaluate the broad phylogenetic patterns of a common plant-defence trait found across vascular plants: extrafloral nectaries (EFNs), plant glands that secrete nectar and are located outside the flower. EFNs typically defend plants indirectly by attracting invertebrate predators who reduce herbivory. Records of EFNs published over the last 135 years were compiled. After accounting for changes in taxonomy, phylogenetic comparative methods were used to evaluate patterns of EFN evolution, using a phylogeny of over 55 000 species of vascular plants. Using comparisons of parametric and non-parametric models, the true number of species with EFNs likely to exist beyond the current list was estimated. To date, EFNs have been reported in 3941 species representing 745 genera in 108 families, about 1-2 % of vascular plant species and approx. 21 % of families. They are found in 33 of 65 angiosperm orders. Foliar nectaries are known in four of 36 fern families. Extrafloral nectaries are unknown in early angiosperms, magnoliids and gymnosperms. They occur throughout monocotyledons, yet most EFNs are found within eudicots, with the bulk of species with EFNs being rosids. Phylogenetic analyses strongly support the repeated gain and loss of EFNs across plant clades, especially in more derived dicot families, and suggest that EFNs are found in a minimum of 457 independent lineages. However, model selection methods estimate that the number of unreported cases of EFNs may be as high as the number of species already reported. EFNs are widespread and evolutionarily labile traits that have repeatedly evolved a remarkable number of times in vascular plants. 
Our current understanding of the phylogenetic patterns of EFNs makes them powerful candidates for future work exploring the drivers of their evolutionary origins, shifts, and losses.
De Palma, Adriana; Kuhlmann, Michael; Bugter, Rob; Ferrier, Simon; Hoskins, Andrew J; Potts, Simon G; Roberts, Stuart P M; Schweiger, Oliver; Purvis, Andy
2017-12-01
Agricultural intensification and urbanization are important drivers of biodiversity change in Europe. Different aspects of bee community diversity vary in their sensitivity to these pressures, as well as independently influencing ecosystem service provision (pollination). To obtain a more comprehensive understanding of human impacts on bee diversity across Europe, we assess multiple, complementary indices of diversity. The study covers 1,446 sites across Europe. We collated data on bee occurrence and abundance from the published literature and supplemented them with the PREDICTS database. Using Rao's Quadratic Entropy, we assessed how species, functional and phylogenetic diversity of 1,446 bee communities respond to land-use characteristics including land-use class, cropland intensity, human population density and distance to roads. We combined these models with statistically downscaled estimates of land use in 2005 to estimate and map, at a scale of approximately 1 km², the losses in diversity relative to a semi-natural/natural baseline (the predicted diversity of an uninhabited grid square, consisting only of semi-natural/natural vegetation). We show that, relative to the predicted local diversity in uninhabited semi-natural/natural habitat, half of all EU27 countries have lost over 10% of their average local species diversity and two-thirds of countries have lost over 5% of their average local functional and phylogenetic diversity. All diversity measures were generally lower in pasture and higher-intensity cropland than in semi-natural/natural vegetation, but facets of diversity showed less consistent responses to human population density. These differences have led to marked spatial mismatches in losses: losses in phylogenetic diversity were in some areas almost 20 percentage points (pp.) more severe than losses in species diversity, but in other areas losses were almost 40 pp. less severe.
These results highlight the importance of exploring multiple measures of diversity when prioritizing and evaluating conservation actions, as species-diverse assemblages may be phylogenetically and functionally impoverished, potentially threatening pollination service provision.
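For readers unfamiliar with the index named in this abstract, Rao's Quadratic Entropy is the expected dissimilarity between two individuals drawn at random from a community, Q = Σᵢ Σⱼ dᵢⱼ pᵢ pⱼ. The sketch below is only an illustration of that formula; the function name, the toy abundances and the toy distance matrix are assumptions made here and are not taken from the study.

```python
def rao_quadratic_entropy(abundances, distances):
    """Rao's Q = sum over i, j of d_ij * p_i * p_j.

    abundances: raw counts per species (hypothetical toy data).
    distances:  symmetric matrix of pairwise dissimilarities
                (taxonomic, functional or phylogenetic).
    """
    total = sum(abundances)
    p = [a / total for a in abundances]  # relative abundances
    n = len(p)
    return sum(distances[i][j] * p[i] * p[j]
               for i in range(n) for j in range(n))


# Toy community: three species, all pairs equally dissimilar (d = 1).
abundances = [10, 10, 20]
distances = [[0, 1, 1],
             [1, 0, 1],
             [1, 1, 0]]
q = rao_quadratic_entropy(abundances, distances)
```

With all pairwise distances set to 1, Q reduces to the Gini-Simpson index (1 minus the sum of squared relative abundances), which is why the same formula can express species, functional and phylogenetic diversity by swapping the distance matrix.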
The Role of the Phylogenetic Diversity Measure, PD, in Bio-informatics: Getting the Definition Right
Faith, Daniel P.
2007-01-01
A recent paper in this journal (Faith and Baker, 2006) described bio-informatics challenges in the application of the PD (phylogenetic diversity) measure of Faith (1992a), and highlighted the use of the root of the phylogenetic tree, as implied by the original definition of PD. A response paper (Crozier et al. 2006) stated that 1) the Faith (1992a) PD definition did not include the use of the root of the tree, and 2) Moritz and Faith (1998) changed the PD definition to include the root. Both characterizations are here refuted. Examples from Faith (1992a, 1992b) document the link from the definition to the use of the root of the overall tree, and a survey of papers over the past 15 years by Faith and colleagues demonstrates that the stated PD definition has remained the same as that in the original 1992 study. PD’s estimation of biodiversity at the level of “feature diversity” is seen to have provided the original rationale for the measure’s consideration of the root of the phylogenetic tree. PMID:19455221
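To make the role of the root concrete, the following minimal sketch computes PD for a set of taxa as the total length of all branches on the paths connecting those taxa to the root of the overall tree. The tree encoding (a child-to-parent dictionary), the function name and the toy branch lengths are assumptions made for illustration; they are not drawn from the papers discussed.

```python
def faith_pd(parent, taxa):
    """Sum branch lengths on the paths from each taxon up to the root.

    parent: dict mapping node -> (parent_node, branch_length);
            the root has no entry (hypothetical encoding).
    taxa:   iterable of tip names whose PD is wanted.
    """
    visited = set()
    total = 0.0
    for tip in taxa:
        node = tip
        # Walk upward until the root or an already-counted branch is reached.
        while node in parent and node not in visited:
            visited.add(node)
            up, length = parent[node]
            total += length
            node = up
    return total


# Toy tree: ((A:1, B:1):2, C:3) with the root above n1 and C.
toy_tree = {"A": ("n1", 1.0), "B": ("n1", 1.0),
            "n1": ("root", 2.0), "C": ("root", 3.0)}
pd_single = faith_pd(toy_tree, ["A"])
pd_pair = faith_pd(toy_tree, ["A", "B"])
pd_all = faith_pd(toy_tree, ["A", "B", "C"])
```

Under this root-inclusive reading, even a single taxon has non-zero PD (the path from A to the root), which is the practical difference from a definition that only spans the minimum path among the selected taxa.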
Hagopian, Raffi; Davidson, John R; Datta, Ruchira S; Samad, Bushra; Jarvis, Glen R; Sjölander, Kimmen
2010-07-01
We present the jump-start simultaneous alignment and tree construction using hidden Markov models (SATCHMO-JS) web server for simultaneous estimation of protein multiple sequence alignments (MSAs) and phylogenetic trees. The server takes as input a set of sequences in FASTA format, and outputs a phylogenetic tree and MSA; these can be viewed online or downloaded from the website. SATCHMO-JS is an extension of the SATCHMO algorithm, and employs a divide-and-conquer strategy to jump-start SATCHMO at a higher point in the phylogenetic tree, reducing the computational complexity of the progressive all-versus-all HMM-HMM scoring and alignment. Results on a benchmark dataset of 983 structurally aligned pairs from the PREFAB benchmark dataset show that SATCHMO-JS provides a statistically significant improvement in alignment accuracy over MUSCLE, Multiple Alignment using Fast Fourier Transform (MAFFT), ClustalW and the original SATCHMO algorithm. The SATCHMO-JS webserver is available at http://phylogenomics.berkeley.edu/satchmo-js. The datasets used in these experiments are available for download at http://phylogenomics.berkeley.edu/satchmo-js/supplementary/.
Armored kinorhynch-like scalidophoran animals from the early Cambrian.
Zhang, Huaqiao; Xiao, Shuhai; Liu, Yunhuan; Yuan, Xunlai; Wan, Bin; Muscente, A D; Shao, Tiequan; Gong, Hao; Cao, Guohua
2015-11-26
Morphology-based phylogenetic analyses support the monophyly of the Scalidophora (Kinorhyncha, Loricifera, Priapulida) and Nematoida (Nematoda, Nematomorpha), together constituting the monophyletic Cycloneuralia that is the sister group of the Panarthropoda. Kinorhynchs are unique among living cycloneuralians in having a segmented body with repeated cuticular plates, longitudinal muscles, dorsoventral muscles, and ganglia. Molecular clock estimates suggest that kinorhynchs may have diverged in the Ediacaran Period. Remarkably, no kinorhynch fossils have been discovered, in sharp contrast to priapulids and loriciferans that are represented by numerous Cambrian fossils. Here we describe several early Cambrian (~535 million years old) kinorhynch-like fossils, including the new species Eokinorhynchus rarus and two unnamed but related forms. E. rarus has characteristic scalidophoran features, including an introvert with pentaradially arranged hollow scalids. Its trunk bears at least 20 annuli each consisting of numerous small rectangular plates, and is armored with five pairs of large and bilaterally placed sclerites. Its trunk annuli are reminiscent of the epidermis segments of kinorhynchs. A phylogenetic analysis resolves E. rarus as a stem-group kinorhynch. Thus, the fossil record confirms that all three scalidophoran phyla diverged no later than the Cambrian Period.
Sato, Jun J.; Ohdachi, Satoshi D.; Echenique-Diaz, Lazaro M.; Borroto-Páez, Rafael; Begué-Quiala, Gerardo; Delgado-Labañino, Jorge L.; Gámez-Díez, Jorgelino; Alvarez-Lemus, José; Nguyen, Son Truong; Yamaguchi, Nobuyuki; Kita, Masaki
2016-01-01
The Cuban solenodon (Solenodon cubanus) is one of the most enigmatic mammals and is an extremely rare species with a distribution limited to a small part of the island of Cuba. Despite its rarity, in 2012 seven individuals of S. cubanus were captured and sampled successfully for DNA analysis, providing new insights into the evolutionary origin of this species and into the origins of the Caribbean fauna, which remain controversial. We conducted molecular phylogenetic analyses of five nuclear genes (Apob, Atp7a, Bdnf, Brca1 and Rag1; total, 4,602 bp) from 35 species of the mammalian order Eulipotyphla. Based on Bayesian relaxed molecular clock analyses, the family Solenodontidae diverged from other eulipotyphlans in the Paleocene, after the bolide impact on the Yucatan Peninsula, and S. cubanus diverged from the Hispaniolan solenodon (S. paradoxus) in the Early Pliocene. The strikingly recent divergence time estimates suggest that S. cubanus and its ancestral lineage originated via over-water dispersal rather than vicariance events, as had previously been hypothesised. PMID:27498968
Hrbek, Tomas; Stölting, Kai N; Bardakci, Fevzi; Küçük, Fahrettin; Wildekamp, Rudolf H; Meyer, Axel
2004-07-01
We investigated the phylogenetic relationships of Pseudophoxinus (Cyprinidae: Leuciscinae) species from central Anatolia, Turkey to test the hypothesis of geographic speciation driven by early Pliocene orogenic events. We analyzed 1141 aligned base pairs of the complete cytochrome b mitochondrial gene. Phylogenetic relationships reconstructed by maximum likelihood, Bayesian likelihood, and maximum parsimony methods are identical, and generally well supported. Species and clades are restricted to geologically well-defined units, and are deeply divergent from each other. The basal diversification of central Anatolian Pseudophoxinus is estimated to have occurred approximately 15 million years ago. Our results are in agreement with a previous study of the Anatolian fish genus Aphanius that also shows a diversification pattern driven by the Pliocene orogenic events. The distribution of clades of Aphanius and Pseudophoxinus overlap, and areas of distribution comprise the same geological units. The geological history of Anatolia is likely to have had a major impact on the diversification history of many taxa occupying central Anatolia; many of these taxa are likely to be still unrecognized as distinct. Copyright 2004 Elsevier Inc.
Pinzón C., Jorge H.; Beach-Letendre, Joshuah; Weil, Ernesto; Mydlarz, Laura D.
2014-01-01
Diseases affect coral species fitness and contribute significantly to the deterioration of coral reefs. The increase in frequency and severity of disease outbreaks has made evaluating and determining coral resistance a priority. Phylogenetic patterns in immunity and disease can provide important insight into how corals may respond to current and future environmental and/or biologically induced diseases. The purpose of this study was to determine if immunity, number of diseases and disease prevalence show a phylogenetic signal among Caribbean corals. We characterized the constitutive levels of six distinct innate immune traits in 14 Caribbean coral species and tested for the presence of a phylogenetic signal on each trait. Results indicate that constitutive levels of some individual immune-related processes (i.e. melanin concentration, peroxidase and inhibition of bacterial growth), as well as their combination, show a phylogenetic signal. Additionally, both the number of diseases affecting each species and disease prevalence (as measures of disease burden) show a significant phylogenetic signal. The phylogenetic signal of immune-related processes, combined with estimates of species divergence times, indicates that among the studied species, those belonging to older lineages tend to resist/fight infections better than more recently diverged coral lineages. This result, combined with the increasingly stressful conditions on corals in the Caribbean, suggests that future reefs in the region will likely be dominated by older lineages while modern species may face local population declines and/or geographic extinction. PMID:25133685
Miklós, István
2003-10-01
As more and more genomes have been sequenced, genomic data are rapidly accumulating. Genome-wide mutations are believed to be more neutral than local mutations such as substitutions, insertions and deletions; therefore, phylogenetic investigations based on inversions, transpositions and inverted transpositions are less biased by the hypothesis of neutral evolution. Although efficient algorithms exist for obtaining the inversion distance of two signed permutations, there is no reliable algorithm when both inversions and transpositions are considered. Moreover, different types of mutations happen at different rates, and it is not clear how to weight them in a distance-based approach. We introduce a Markov Chain Monte Carlo method for genome rearrangement based on a stochastic model of evolution, which can estimate the number of different evolutionary events needed to sort a signed permutation. The performance of the method was tested on simulated data, and the estimated numbers of different types of mutations were reliable. Human and Drosophila mitochondrial data were also analysed with the new method. The mixing time of the Markov Chain is short both in terms of CPU time and number of proposals. The source code in C is available on request from the author.
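As background for the terminology in this abstract, a signed permutation lists genes in order with a sign for orientation, and an inversion reverses a contiguous segment while flipping every sign in it. The sketch below illustrates only that elementary operation; it is not the MCMC method described above, and the function name and toy genome are assumptions made for illustration.

```python
def invert(perm, i, j):
    """Apply an inversion to positions i..j (inclusive, 0-based).

    The segment is reversed and each gene's sign (orientation) is flipped.
    """
    segment = [-gene for gene in reversed(perm[i:j + 1])]
    return perm[:i] + segment + perm[j + 1:]


# Toy genome: genes 2 and 3 sit in reversed order and orientation.
genome = [1, -3, -2, 4]
sorted_genome = invert(genome, 1, 2)
```

Here a single inversion restores the identity permutation, so the inversion distance of this toy genome is 1. Transpositions and inverted transpositions move a segment elsewhere, which is why combining the event types makes the distance problem harder.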
High-confidence prediction of global interactomes based on genome-wide coevolutionary networks
Juan, David; Pazos, Florencio; Valencia, Alfonso
2008-01-01
Interacting or functionally related protein families tend to have similar phylogenetic trees. Based on this observation, techniques have been developed to predict interaction partners. The observed degree of similarity between the phylogenetic trees of two proteins is the result of many different factors besides the actual interaction or functional relationship between them. Such factors influence the performance of interaction predictions. One aspect that can influence this similarity is related to the fact that a given protein interacts with many others, and hence it must adapt to all of them. Accordingly, the interaction or coadaptation signal within its tree is a composite of the influence of all of the interactors. Here, we introduce a new estimator of coevolution to overcome this and other problems. Instead of relying on the individual value of tree similarity between two proteins, we use the whole network of similarities between all of the pairs of proteins within a genome to reassess the similarity of that pair, thereby taking into account its coevolutionary context. We show that this approach offers a substantial improvement in interaction prediction performance, providing a degree of accuracy/coverage comparable with, or in some cases better than, that of experimental techniques. Moreover, important information on the structure, function, and evolution of macromolecular complexes can be inferred with this methodology. PMID:18199838
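The abstract does not spell out how the similarity between two phylogenetic trees is measured. One common choice, assumed here purely for illustration, is the Pearson correlation between the pairwise distance matrices derived from the two trees for the same set of species. The sketch below shows that pairwise score only, not the network-level reassessment the authors introduce; all names and numbers are hypothetical.

```python
from math import sqrt


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def tree_similarity(dist_a, dist_b):
    """Pearson correlation between two species-by-species distance matrices."""
    x, y = upper_triangle(dist_a), upper_triangle(dist_b)
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Toy distances for two protein families over the same three species.
family_a = [[0, 2, 4], [2, 0, 6], [4, 6, 0]]
family_b = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
r = tree_similarity(family_a, family_b)
```

A score near 1 means the two families have diverged in step across species. Because every pair of families shares the underlying species tree, raw scores tend to be inflated, which is one motivation for looking at the whole network of scores as the abstract describes.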
HIV Migration Between Blood and Cerebrospinal Fluid or Semen Over Time
Chaillon, Antoine; Gianella, Sara; Wertheim, Joel O.; Richman, Douglas D.; Mehta, Sanjay R.; Smith, David M.
2014-01-01
Previous studies reported associations between neuropathogenesis and human immunodeficiency virus (HIV) compartmentalization in cerebrospinal fluid (CSF) and between sexual transmission and human immunodeficiency virus type 1 (HIV) compartmentalization in semen. It remains unclear, however, how compartmentalization dynamics change over time. To address this, we used statistical methods and Bayesian phylogenetic approaches to reconstruct temporal dynamics of HIV migration between blood and CSF and between blood and the male genital tract. We investigated 11 HIV-infected individuals with paired semen and blood samples and 4 individuals with paired CSF and blood samples. Aligned partial HIV env sequences were analyzed by (1) phylogenetic reconstruction, using a Bayesian Markov-chain Monte Carlo approach; (2) evaluation of viral compartmentalization, using tree-based and distance-based methods; and (3) analysis of migration events, using a discrete Bayesian asymmetric phylogeographic approach of diffusion with Markov jump counts estimation. Finally, we evaluated potential correlates of viral gene flow across anatomical compartments. We observed bidirectional replenishment of viral compartments and asynchronous peaks of viral migration from and to blood over time, suggesting that disruption of viral compartment is transient and directionally selected. These findings imply that viral subpopulations in anatomical sites are an active part of the whole viral population and that compartmental reservoirs could have implications in future eradication studies. PMID:24302756
Suárez-Villota, Elkin Y; González-Wevar, Claudio A; Gallardo, Milton H; Vásquez, Rodrigo A; Poulin, Elie
2016-12-01
Endemic to South America, octodontid rodents are remarkable for being the only mammalian taxon in which allotetraploidy has been documented. The taxon's extensive morpho-physiological radiation associated with niche shifts has allowed testing phylogeographic hypotheses. Using maximum likelihood and Bayesian inference analyses, applied to all nominal species of octodontids, phylogenetic reconstructions based on sequences of 12S rRNA and the growth hormone receptor gene are presented. Species boundaries were determined by coalescent analyses and divergence times among taxa were estimated based on mutation rates. Two main clades associated with the Andean orogenesis were recognized. The essentially western clade comprises the genera Aconaemys, Octodon, Spalacopus, and Octodontomys, whereas the eastern one includes the genera Octomys, Pipanacoctomys, Salinoctomys, and Tympanoctomys. Genetic relationships, coalescent analyses, and genetic distance supported the specific status given to Octodon pacificus and that given to Pipanacoctomys aureus as a species of Tympanoctomys. However, these analyses failed to recognize Salinoctomys loschalchalerosorum as a valid taxon considering its position within the diversity of Tympanoctomys barrerae. Although the origin of genome duplication remains contentious, the coincidence of the basal clade split with distinctive modes of karyotypic evolution across the Andes emphasizes the role of physiographic barriers and westerlies in shaping different edaphological conditions, selective grounds, and concomitantly distinct adaptations within the octodontids. Copyright © 2016 Elsevier Inc. All rights reserved.
Blastocystis phylogeny among various isolates from humans to insects.
Yoshikawa, Hisao; Koyama, Yukiko; Tsuchiya, Erika; Takami, Kazutoshi
2016-12-01
Blastocystis is a common unicellular eukaryotic parasite found not only in humans, but also in various kinds of animal species worldwide. Since Blastocystis isolates are morphologically indistinguishable, many molecular biological approaches have been applied to classify these isolates. The complete or partial sequences of the small subunit rRNA gene (SSU rDNA) are mainly used for comparisons and phylogenetic analyses among Blastocystis isolates. However, various lengths of the partial SSU rDNA sequence have been used for phylogenetic inference among genetically different isolates. Based on the complete SSU rDNA sequences, a consensus terminology of nine subtypes (STs) of Blastocystis sp., supported by nine monophyletic clades, was proposed in 2007. Thereafter, eight additional STs comprising non-human mammalian Blastocystis isolates have been reported based on the phylogeny of SSU rDNA sequences, while STs 11 and 12 were only proposed on the basis of partial sequences. Although many sequence data from mammalian and avian Blastocystis are registered in GenBank, only limited data on SSU rDNA are available for poikilotherm-derived Blastocystis isolates. Therefore, the phylogenetic positions of the reptilian/amphibian Blastocystis clades are unstable. The phylogenetic inference of various STs comprising mammalian and/or avian Blastocystis isolates was verified herein based on comparisons between partial and complete SSU rDNA sequences, and the phylogenetic positions of reptilian and amphibian Blastocystis isolates were also investigated using 14 new Blastocystis isolates from reptiles with all known isolates from other reptilians, amphibians, and insects registered in GenBank. Copyright © 2016. Published by Elsevier Ireland Ltd.
Constructing phylogenetic trees using interacting pathways.
Wan, Peng; Che, Dongsheng
2013-01-01
Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches to constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to the accumulation of huge amounts of biological data, which in turn has changed the way biological studies are conducted in various respects. In this paper, we report our approach to building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering to two domains of organisms, eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.
Goggin, C L; Barker, S C
1993-07-01
Parasites of the genus Perkinsus destroy marine molluscs worldwide. Their phylogenetic position within the kingdom Protista is controversial. Nucleotide sequence data (1792 bp) from the small subunit rRNA gene of Perkinsus sp. from Anadara trapezia (Mollusca: Bivalvia) from Moreton Bay, Queensland, were used to examine the phylogenetic affinities of this enigmatic genus. These data were aligned with nucleotide sequences from 6 apicomplexans, 3 ciliates, 3 flagellates, a dinoflagellate, 3 fungi, maize and human. Phylogenetic trees were constructed after analysis with maximum parsimony and distance matrix methods. Our analyses indicate that Perkinsus is phylogenetically closer to dinoflagellates and to coccidean and piroplasm apicomplexans than to fungi or flagellates.
NASA Technical Reports Server (NTRS)
Wheeler, Ward C.
2003-01-01
The problem of determining the minimum cost hypothetical ancestral sequences for a given cladogram is known to be NP-complete (Wang and Jiang, 1994). Traditionally, point estimations of hypothetical ancestral sequences have been used to gain heuristic upper bounds on cladogram cost. These include procedures with such diverse approaches as non-additive optimization of multiple sequence alignment, direct optimization (Wheeler, 1996), and fixed-state character optimization (Wheeler, 1999). A method is proposed here which, by extending fixed-state character optimization, replaces the estimation process with a search. This form of optimization examines a diversity of potential state solutions for cost-efficient hypothetical ancestral sequences and can result in greatly more parsimonious cladograms. Additionally, such an approach can be applied to other NP-complete phylogenetic optimization problems such as genomic break-point analysis. © 2003 The Willi Hennig Society. Published by Elsevier Science (USA). All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy
Phylogenetic analyses were done for the Shewanella strains isolated from the Baltic Sea (38 strains), the US DOE Hanford uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains), with three outgroup strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of the 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of the Shewanella genome selected from a shared gene list of the Shewanella strains with whole genomes sequenced, based on their average nucleotide identity (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and on DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains, share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at the species and strain levels within the Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WCGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an ideal alternative to conventional DNA-DNA hybridization methods and that it is superior to phylogenetic methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.
Thornhill, Andrew H; Popple, Lindsay W; Carter, Richard J; Ho, Simon Y W; Crisp, Michael D
2012-04-01
The identification and application of reliable fossil calibrations represents a key component of many molecular studies of evolutionary timescales. In studies of plants, most paleontological calibrations are associated with macrofossils. However, the pollen record can also inform age calibrations if fossils matching extant pollen groups are found. Recent work has shown that pollen of the myrtle family, Myrtaceae, can be classified into a number of morphological groups that are synapomorphic with molecular groups. By assembling a data matrix of pollen morphological characters from extant and fossil Myrtaceae, we were able to measure the fit of 26 pollen fossils to a molecular phylogenetic tree using parsimony optimisation of characters. We identified eight Myrtaceidites fossils as appropriate for calibration based on the most parsimonious placements of these fossils on the tree. These fossils were used to inform age constraints in a Bayesian phylogenetic analysis of a sequence alignment comprising two sequences from the chloroplast genome (matK and ndhF) and one nuclear locus (ITS), sampled from 106 taxa representing 80 genera. Three additional analyses were calibrated by placing pollen fossils using geographic and morphological information (eight calibrations), macrofossils (five calibrations), and macrofossils and pollen fossils in combination (12 calibrations). The addition of new fossil pollen calibrations led to older crown ages than have previously been found for tribes such as Eucalypteae and Myrteae. Estimates of rate variation among lineages were affected by the choice of calibrations, suggesting that the use of multiple calibrations can improve estimates of rate heterogeneity among lineages. This study illustrates the potential of including pollen-based calibrations in molecular studies of divergence times. Copyright © 2011 Elsevier Inc. All rights reserved.
Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin
2016-01-01
The joint inference of selection and past demography remains a costly and demanding task. We used next-generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and considering both FST and the difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main targets of selection, such as PaPRR3 and PaGI. PMID:27172202
Chen, Jin-Min; Zhou, Wei-Wei; Poyarkov, Nikolay A; Stuart, Bryan L; Brown, Rafe M; Lathrop, Amy; Wang, Ying-Yong; Yuan, Zhi-Yong; Jiang, Ke; Hou, Mian; Chen, Hong-Man; Suwannapoom, Chatmongkon; Nguyen, Sang Ngoc; Duong, Tang Van; Papenfuss, Theodore J; Murphy, Robert W; Zhang, Ya-Ping; Che, Jing
2017-01-01
The horned toad assemblage, genus Megophrys sensu lato, currently includes three groups previously recognized as the genera Atympanophrys, Xenophrys and Megophrys sensu stricto. The taxonomic status and species composition of the three groups remain controversial due to conflicting phenotypic analyses and insufficient phylogenetic reconstruction; likewise, the position of the monotypic Borneophrys remains uncertain with respect to the horned toads. Further, the diversity of the horned toads remains poorly understood, especially for widespread species. Herein, we evaluate species-level diversity based on 45 of the 57 described species from throughout southern China, Southeast Asia and the Himalayas using Bayesian inference trees and the Generalized Mixed Yule Coalescent (GMYC) approach. We estimate the phylogeny using both mitochondrial and nuclear DNA data. Analyses reveal statistically significant mito-nuclear discordance. All analyses resolve paraphyly for horned toads involving multiple strongly supported clades. These clades correspond with geography. We resurrect the genera Atympanophrys and Xenophrys from the synonymy of Megophrys to eliminate paraphyly of Megophrys s.l. and to account for the morphological, molecular and biogeographic differences among these groups, but we also provide an alternative option. Our study suggests that Borneophrys is a junior synonym of Megophrys sensu stricto. We provide an estimated timeframe for the horned toads. The mitochondrial and nuclear trees indicate the presence of many putative undescribed species. Widespread species, such as Xenophrys major and X. minor, likely have dramatically underestimated diversity. The integration of morphological and molecular evidence can validate this discovery. Montane forest dynamics appear to play a significant role in driving diversification of horned toads. Copyright © 2016 Elsevier Inc. All rights reserved.
Evolutionary Diversification of New Caledonian Araucaria
Kranitz, Mai Lan; Biffin, Edward; Clark, Alexandra; Hollingsworth, Michelle L.; Ruhsam, Markus; Gardner, Martin F.; Thomas, Philip; Mill, Robert R.; Ennos, Richard A.; Gaudeul, Myriam; Lowe, Andrew J.; Hollingsworth, Peter M.
2014-01-01
New Caledonia is a global biodiversity hotspot. Hypotheses for its biotic richness suggest either that the island is a ‘museum’ for an old Gondwana biota or alternatively it has developed following relatively recent long distance dispersal and in situ radiation. The conifer genus Araucaria (Araucariaceae) comprises 19 species globally with 13 endemic to this island. With a typically Gondwanan distribution, Araucaria is particularly well suited to testing alternative biogeographic hypotheses concerning the origins of New Caledonian biota. We derived phylogenetic estimates using 11 plastid and rDNA ITS2 sequence data for a complete sampling of Araucaria (including multiple accessions of each of the 13 New Caledonian Araucaria species). In addition, we developed a dataset comprising 4 plastid regions for a wider taxon sample to facilitate fossil based molecular dating. Following statistical analyses to identify a credible and internally consistent set of fossil constraints, divergence times estimated using a Bayesian relaxed clock approach were contrasted with geological scenarios to explore the biogeographic history of Araucaria. The phylogenetic data resolve relationships within Araucariaceae and among the main lineages in Araucaria, but provide limited resolution within the monophyletic New Caledonian species group. Divergence time estimates suggest a Late Cretaceous-Cenozoic radiation of extant Araucaria and a Neogene radiation of the New Caledonian lineage. A molecular timescale for the evolution of Araucariaceae supports a relatively recent radiation, and suggests that earlier (pre-Cenozoic) fossil types assigned to Araucaria may have affinities elsewhere in Araucariaceae. 
While additional data will be required to adequately resolve relationships among the New Caledonian species, their recent origin is consistent with overwater dispersal following Eocene emersion of New Caledonia but is too old to support a single dispersal from Australia to Norfolk Island for the radiation of the Pacific Araucaria sect. Eutacta clade. PMID:25340350
Inward, Daegan J G; Vogler, Alfried P; Eggleton, Paul
2007-09-01
The first comprehensive combined molecular and morphological phylogenetic analysis of the major groups of termites is presented. This was based on the analysis of three genes (cytochrome oxidase II, 12S and 28S) and worker characters for approximately 250 species of termites. Parsimony analysis of the aligned dataset showed that the monophyly of Hodotermitidae, Kalotermitidae and Termitidae were well supported, while Termopsidae and Rhinotermitidae were both paraphyletic on the estimated cladogram. Within Termitidae, the most diverse and ecologically most important family, the monophyly of Macrotermitinae, Foraminitermitinae, Apicotermitinae, Syntermitinae and Nasutitermitinae were all broadly supported, but Termitinae was paraphyletic. The pantropical genera Termes, Amitermes and Nasutitermes were all paraphyletic on the estimated cladogram, with at least 17 genera nested within Nasutitermes, given the presently accepted generic limits. Key biological features were mapped onto the cladogram. It was not possible to reconstruct the evolution of true workers unambiguously, as it was as parsimonious to assume a basal evolution of true workers and subsequent evolution of pseudergates, as to assume a basal condition of pseudergates and subsequent evolution of true workers. However, true workers were only found in species with either separate- or intermediate-type nests, so that the mapping of nest habit and worker type onto the cladogram were perfectly correlated. Feeding group evolution, however, showed a much more complex pattern, particularly within the Termitidae, where it proved impossible to estimate unambiguously the ancestral state within the family (which is associated with the loss of worker gut flagellates). 
However, one biologically plausible optimization implies an initial evolution from wood-feeding to fungus-growing, proposed as the ancestral condition within the Termitidae, followed by the very early evolution of soil-feeding and subsequent re-evolution of wood-feeding in numerous lineages.
Inoue, Jun G; Kumazawa, Yoshinori; Miya, Masaki; Nishida, Mutsumi
2009-06-01
The continental distributions of freshwater fishes in the family Notopteridae (Osteoglossomorpha) across Africa, India, and Southeast Asia constitute a long standing and enigmatic problem of freshwater biogeography. The migrational pathway of the Asian notopterids has been discussed in light of two competing schemes: the first posits recent transcontinental dispersal while the second relies on distributions being shaped by ancient vicariance associated with plate-tectonic events. In this study, we determined complete mitochondrial DNA sequences from 10 osteoglossomorph fishes to estimate phylogenetic relationships using partitioned Bayesian and maximum likelihood methods and divergence dates of the family Notopteridae with a partitioned Bayesian approach. We used six species representing the major lineages of the Notopteridae and seven species from the remaining osteoglossomorph families. Fourteen more-derived teleosts, nine basal actinopterygians, two coelacanths, and one shark were used as outgroups. Phylogenetic analyses indicated that the African and Asian notopterids formed a sister group to each other and that these notopterids were a sister to a clade comprising two African families (Mormyridae and Gymnarchidae). Estimated divergence time between the African and Asian notopterids dated back to the early Cretaceous when India-Madagascar separated from the African part of Gondwanaland. Thus, estimated time of divergence based on the molecular evidence is at odds with the recent dispersal model. It can be reconciled with the geological and paleontological evidence to support the vicariance model in which the Asian notopterids diverged from the African notopterids in Gondwanaland and migrated into Eurasia on the Indian subcontinent from the Cretaceous to the Tertiary. 
However, we could not exclude an alternative explanation that the African and Asian notopterids diverged in Pangea before its complete separation into Laurasia and Gondwanaland, to which these two lineages were later confined, respectively.
Seven new dolphin mitochondrial genomes and a time-calibrated phylogeny of whales
Xiong, Ye; Brandley, Matthew C; Xu, Shixia; Zhou, Kaiya; Yang, Guang
2009-01-01
Background The phylogeny of Cetacea (whales) is not fully resolved with substantial support. The ambiguous and conflicting results of multiple phylogenetic studies may be the result of the use of too little data, phylogenetic methods that do not adequately capture the complex nature of DNA evolution, or both. There is also evidence that the generic taxonomy of Delphinidae (dolphins) underestimates its diversity. To remedy these problems, we sequenced the complete mitochondrial genomes of seven dolphins and analyzed these data with partitioned Bayesian analyses. Moreover, we incorporate a newly-developed "relaxed" molecular clock to model heterogeneous rates of evolution among cetacean lineages. Results The "deep" phylogenetic relationships are well supported including the monophyly of Cetacea and Odontoceti. However, there is ambiguity in the phylogenetic affinities of two of the river dolphin clades Platanistidae (Indian River dolphins) and Lipotidae (Yangtze River dolphins). The phylogenetic analyses support a sister relationship between Delphinidae and Monodontidae + Phocoenidae. Additionally, there is statistically significant support for the paraphyly of Tursiops (bottlenose dolphins) and Stenella (spotted dolphins). Conclusion Our phylogenetic analysis of complete mitochondrial genomes using recently developed models of rate autocorrelation resolved the phylogenetic relationships of the major cetacean lineages with a high degree of confidence. Our results indicate that a rapid radiation of lineages explains the lack of support for the placement of Platanistidae and Lipotidae. Moreover, our estimation of molecular divergence dates indicates that these radiations occurred in the Middle to Late Oligocene and Middle Miocene, respectively. 
Furthermore, by collecting and analyzing seven new mitochondrial genomes, we provide strong evidence that the delphinid genera Tursiops and Stenella are not monophyletic, and the current taxonomy masks potentially interesting patterns of morphological, physiological, behavioral, and ecological evolution. PMID:19166626
Ayres, Daniel L.; Darling, Aaron; Zwickl, Derrick J.; Beerli, Peter; Holder, Mark T.; Lewis, Paul O.; Huelsenbeck, John P.; Ronquist, Fredrik; Swofford, David L.; Cummings, Michael P.; Rambaut, Andrew; Suchard, Marc A.
2012-01-01
Abstract Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting models of evolution, these approaches remain computationally demanding. High-throughput sequencing can yield data for thousands of taxa, but scaling to such problems using serial computing often necessitates the use of nonstatistical or approximate approaches. The recent emergence of graphics processing units (GPUs) provides an opportunity to leverage their excellent floating-point computational performance to accelerate statistical phylogenetic inference. A specialized library for phylogenetic calculation would allow existing software packages to make more effective use of available computer hardware, including GPUs. Adoption of a common library would also make it easier for other emerging computing architectures, such as field programmable gate arrays, to be used in the future. We present BEAGLE, an application programming interface (API) and library for high-performance statistical phylogenetic inference. The API provides a uniform interface for performing phylogenetic likelihood calculations on a variety of compute hardware platforms. The library includes a set of efficient implementations and can currently exploit hardware including GPUs using NVIDIA CUDA, central processing units (CPUs) with Streaming SIMD Extensions and related processor supplementary instruction sets, and multicore CPUs via OpenMP. To demonstrate the advantages of a common API, we have incorporated the library into several popular phylogenetic software packages. The BEAGLE library is free open source software licensed under the Lesser GPL and available from http://beagle-lib.googlecode.com. An example client program is available as public domain software. PMID:21963610
Crandall, K A; Harris, D J; Fetzner, J W
2000-01-01
Despite their widespread use as model organisms, the phylogenetic status of the around 520 species of freshwater crayfish is still in doubt. One hypothesis suggests two distinct origins of freshwater crayfish as indicated by their geographical distribution, with two centres of origin near the two present centres of diversity; one in south-eastern United States and the other in Victoria, Australia. An alternative theory proposes a single (monophyletic) origin of freshwater crayfish. Here we use over 3000 nucleotides from three different gene regions in estimating phylogenetic relationships among freshwater crayfish and related Crustacea. We show clear evidence for monophyly of freshwater crayfish and for the sister-group relationship between crayfish and clawed lobsters. Monophyly of the superfamilies Astacoidea and Parastacoidea is also supported. However, the monophyly of the family Cambaridae is questioned with the genus Cambaroides being associated with the Astacidae. PMID:11467432
Phylogenetic Status and Timescale for the Diversification of Steno and Sotalia Dolphins
Cunha, Haydée A.; Moraes, Lucas C.; Medeiros, Bruna V.; Lailson-Brito, José; da Silva, Vera M. F.; Solé-Cava, Antonio M.; Schrago, Carlos G.
2011-01-01
Molecular data have provided many insights into cetacean evolution but some unsettled issues still remain. We estimated the topology and timing of cetacean evolutionary relationships using Bayesian and maximum likelihood analyses of complete mitochondrial genomes. In order to clarify the phylogenetic placement of Sotalia and Steno within the Delphinidae, we sequenced three new delphinid mitogenomes. Our analyses support three delphinid clades: one joining Steno and Sotalia (supporting the revised subfamily Stenoninae); another placing Sousa within the Delphininae; and a third, the Globicephalinae, which includes Globicephala, Feresa, Pseudorca, Peponocephala and Grampus. We also conclude that Orcinus does not belong in the Globicephalinae, but Orcaella may be part of that subfamily. Divergence dates were estimated using the relaxed molecular clock calibrated with fossil data. We hypothesise that the timing of separation of the marine and Amazonian Sotalia species (2.3 Ma) coincided with the establishment of the modern Amazon River basin. PMID:22163290
The Biogeography of Deep Time Phylogenetic Reticulation.
Burbrink, Frank T; Gehara, Marcelo
2018-03-09
Most phylogenies are typically represented as purely bifurcating. However, as genomic data has become more common in phylogenetic studies, it is not unusual to find reticulation among terminal lineages or among internal nodes (deep time reticulation; DTR). In these situations, gene flow must have happened in the same or adjacent geographic areas for these DTRs to have occurred and therefore biogeographic reconstruction should provide similar area estimates for parental nodes, provided extinction or dispersal has not eroded these patterns. We examine the phylogeny of the widely distributed New World kingsnakes (Lampropeltis), determine if DTR is present in this group, and estimate the ancestral area for reticulation. Importantly, we develop a new method that uses coalescent simulations in a machine learning framework to show conclusively that this phylogeny is best represented as reticulating at deeper time. Using joint probabilities of ancestral area reconstructions on the bifurcating parental lineages from the reticulating node, we show that this reticulation likely occurred in northwestern Mexico/southwestern US and subsequently led to the diversification of the Mexican kingsnakes. This region has been previously identified as an area important for understanding speciation and secondary contact with gene flow in snakes and other squamates. This research shows that phylogenetic reticulation is common, even in well-studied groups, and that the geographic scope of ancient hybridization is recoverable.
Soejima, Akiko; Tanabe, Akifumi S; Takayama, Izumi; Kawahara, Takayuki; Watanabe, Kuniaki; Nakazawa, Miyuki; Mishima, Misako; Yahara, Tetsukazu
2017-11-01
The genus Stevia comprises approximately 200 species, which are distributed in North and South America, and are representative of the species diversity of the Asteraceae in the New World. We reconstructed the phylogenetic relationships using sequences of ITS and cpDNA and estimated the divergence times of the major clade of this genus. Our results suggested that Stevia originated in Mexico 7.0-7.3 million years ago (Mya). Two large clades, one with shrub species and another with herb species, were separated at about 6.6 Mya. The phylogenetic reconstruction suggested that an ancestor of Stevia was a small shrub in temperate pine-oak forests and the evolutionary change from a shrub state to a herb state occurred only once. A Brazilian clade was nested in a Mexican herb clade, and its origin was estimated to be 5.2 Mya, suggesting that the migration from North America to South America occurred after the formation of the Isthmus of Panama. The species diversity in Mexico appears to reflect the habitat diversity within the temperate pine-oak forest zone. The presence of many conspecific diploid-polyploid clades in the phylogenetic tree reflects the high frequency of polyploidization among the perennial Stevia species.
An adaptive radiation of frogs in a southeast Asian island archipelago.
Blackburn, David C; Siler, Cameron D; Diesmos, Arvin C; McGuire, Jimmy A; Cannatella, David C; Brown, Rafe M
2013-09-01
Living amphibians exhibit a diversity of ecologies, life histories, and species-rich lineages that offers opportunities for studies of adaptive radiation. We characterize a diverse clade of frogs (Kaloula, Microhylidae) in the Philippine island archipelago as an example of an adaptive radiation into three primary habitat specialists or ecotypes. We use a novel phylogenetic estimate for this clade to evaluate the tempo of lineage accumulation and morphological diversification. Because species-level phylogenetic estimates for Philippine Kaloula are lacking, we employ dense population sampling to determine the appropriate evolutionary lineages for diversification analyses. We explicitly take phylogenetic uncertainty into account when calculating diversification and disparification statistics and fitting models of diversification. Following dispersal to the Philippines from Southeast Asia, Kaloula radiated rapidly into several well-supported clades. Morphological variation within Kaloula is partly explained by ecotype and accumulated at high levels during this radiation, including within ecotypes. We pinpoint an axis of morphospace related directly to climbing and digging behaviors and find patterns of phenotypic evolution suggestive of ecological opportunity with partitioning into distinct habitat specialists. We conclude by discussing the components of phenotypic diversity that are likely important in amphibian adaptive radiations. © 2013 The Authors. Evolution published by Wiley Periodicals, Inc. on behalf of The Society for the Study of Evolution.
The HIV-1 epidemic in Bolivia is dominated by subtype B and CRF12_BF "family" strains.
Guimarães, Monick L; Velarde-Dunois, Ketty G; Segurondo, David; Morgado, Mariza G
2012-01-16
Molecular epidemiological studies of HIV-1 in South America have revealed the occurrence of subtypes B, F1 and BF1 recombinants. Even so, little information concerning the HIV-1 molecular epidemiology in Bolivia is available. In this study we performed phylogenetic analyses from samples collected in Bolivia at two different points in time over a 10 year span. We analyzed these samples to estimate the trends in the HIV subtype and recombinant forms over time. Fifty one HIV-1 positive samples were collected in Bolivia over two distinct periods (1996 and 2005). These samples were genetically characterized based on partial pol protease/reverse transcriptase (pr/rt) and env regions. Alignment and neighbor-joining (NJ) phylogenetic analyses were established from partial env (n = 37) and all pol sequences using Mega 4. The remaining 14 env sequences from 1996 were previously characterized based on HMA-env (Heteroduplex mobility assay). The Simplot v.3.5.1 program was used to verify intragenic recombination, and SplitsTree 4.0 was employed to confirm the phylogenetic relationship of the BF1 recombinant samples. Phylogenetic analysis of both env and pol regions confirmed the predominance of "pure" subtype B (72.5%) samples circulating in Bolivia and revealed a high prevalence of BF1 genotypes (27.5%). Eleven out of 14 BF1 recombinants displayed a mosaic structure identical or similar to that described for the CRF12_BF variant, one sample was classified as CRF17_BF, and two others were F1pol/Benv. No "pure" HIV-1 subtype F1 or B" variant of subtype B was detected in the present study. Of note, samples characterized as CRF12_BF-related were depicted only in 2005. HIV-1 genetic diversity in Bolivia is mostly driven by subtype B followed by BF1 recombinant strains from the CRF12_BF "family". No significant temporal changes were detected between the mid-1990s and the mid-2000s for subtype B (76.2% vs 70.0%) or BF1 recombinant (23.8% vs 30.0%) samples from Bolivia. PMID:22248191
Ge, Zai-Wei; Yang, Zhu L.; Pfister, Donald H.; Carbone, Matteo; Bau, Tolgor; Smith, Matthew E.
2014-01-01
The family Cudoniaceae (Rhytismatales, Ascomycota) was erected to accommodate the “earth tongue fungi” in the genera Cudonia and Spathularia. There have been no recent taxonomic studies of these genera, and the evolutionary relationships within and among these fungi are largely unknown. Here we explore the molecular phylogenetic relationships within Cudonia and Spathularia using maximum likelihood and Bayesian inference analyses based on 111 collections from across the Northern Hemisphere. Phylogenies based on the combined data from ITS, nrLSU, rpb2 and tef-1α sequences support the monophyly of three main clades, the /flavida, /velutipes, and /cudonia clades. The genus Cudonia and the family Cudoniaceae are supported as monophyletic groups, while the genus Spathularia is not monophyletic. Although Cudoniaceae is monophyletic, our analyses agree with previous studies that this family is nested within the Rhytismataceae. Our phylogenetic analyses circumscribe 32 species-level clades, including the putative recognition of 23 undescribed phylogenetic species. Our molecular phylogeny also revealed an unexpectedly high species diversity of Cudonia and Spathularia in eastern Asia, with 16 (out of 21) species-level clades of Cudonia and 8 (out of 11) species-level clades of Spathularia. We estimate that the divergence time of the Cudoniaceae was in the Paleogene approximately 28 million years ago (Mya) and that the ancestral area for this group of fungi was in eastern Asia based on the current data. We hypothesize that the large-scale geological and climatic events in the Oligocene (e.g. the global cooling and the uplift of the Tibetan plateau) may have triggered evolutionary radiations in this group of fungi in East Asia. This work provides a foundation for future studies on the phylogeny, diversity, and evolution of Cudonia and Spathularia and highlights the need for more molecular studies on collections from Europe and North America. PMID:25084276
Suchan, Tomasz; Espíndola, Anahí; Rutschmann, Sereina; Emerson, Brent C; Gori, Kevin; Dessimoz, Christophe; Arrigo, Nils; Ronikier, Michał; Alvarez, Nadir
2017-09-01
Determining phylogenetic relationships among recently diverged species has long been a challenge in evolutionary biology. Cytoplasmic DNA markers, which have been widely used, notably in the context of molecular barcoding, have not always proved successful in resolving such phylogenies. However, with the advent of next-generation-sequencing technologies and associated techniques of reduced genome representation, phylogenies of closely related species have been resolved in much greater detail in the last couple of years. Here we examine the potential and limitations of one such technique, Restriction-site Associated DNA (RAD) sequencing, a method that produces thousands of (mostly) anonymous nuclear markers, in disentangling the phylogeny of the fly genus Chiastocheta (Diptera: Anthomyiidae). In Europe, this genus encompasses seven species of seed predators, which have been widely studied in the context of their ecological and evolutionary interactions with the plant Trollius europaeus (Ranunculaceae). So far, phylogenetic analyses using mitochondrial markers have failed to resolve the monophyly of most of the species from this recently diversified genus, suggesting that their taxonomy may need a revision. However, relying on a single, non-recombining marker and ignoring potential incongruences between mitochondrial and nuclear loci may provide an incomplete account of the lineage history. In this study, we applied both classical Sanger sequencing of three mtDNA regions and RAD-sequencing to reconstruct the phylogeny of the genus. In contrast with results based on mitochondrial markers, RAD-sequencing analyses retrieved the monophyly of all seven species, in agreement with the morphological species assignment. We found robust nuclear-based species assignment of individual samples, and low levels of estimated contemporary gene flow among them. 
However, despite recovering species' monophyly, interspecific relationships varied depending on the set of RAD loci considered, producing contradictory topologies. Moreover, coalescence-based phylogenetic analyses revealed low supports for most of the interspecific relationships. Our results indicate that despite the higher performance of RAD-sequencing in terms of species trees resolution compared to cytoplasmic markers, reconstructing inter-specific relationships among recently-diverged lineages may lie beyond the possibilities offered by large sets of RAD-sequencing markers in cases of strong gene tree incongruence. Copyright © 2017 Elsevier Inc. All rights reserved.
DHLAS: A web-based information system for statistical genetic analysis of HLA population data.
Thriskos, P; Zintzaras, E; Germenis, A
2007-03-01
DHLAS (database HLA system) is a user-friendly, web-based information system for the analysis of human leukocyte antigen (HLA) data from population studies. DHLAS has been developed using JAVA and the R system; it runs on a Java Virtual Machine, and its web-based user interface is powered by the servlet engine TOMCAT. It utilizes STRUTS, a Model-View-Controller framework, and uses several GNU packages to perform several of its tasks. The database engine it relies upon for fast access is MySQL, but others can be used as well. The system estimates metrics, performs statistical testing and produces graphs required for HLA population studies: (i) Hardy-Weinberg equilibrium (calculated using both asymptotic and exact tests), (ii) genetic distances (Euclidean or Nei), (iii) phylogenetic trees using the unweighted pair group method with averages and the neighbor-joining method, (iv) linkage disequilibrium (pairwise and overall, including variance estimations), (v) haplotype frequencies (estimated using the expectation-maximization algorithm) and (vi) discriminant analysis. The main merit of DHLAS is the incorporation of a database; thus, the data can be stored and manipulated along with integrated genetic data analysis procedures. In addition, it has an open architecture allowing the inclusion of other functions and procedures.
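To make item (i) concrete, the following is a minimal sketch of an asymptotic (chi-square) Hardy-Weinberg test. It is not DHLAS code: the function name `hwe_chi_square` is hypothetical, and the sketch assumes a biallelic locus for brevity, whereas HLA loci are highly multi-allelic and would need the general multi-allele form or an exact test.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic for Hardy-Weinberg equilibrium at a biallelic locus.

    Hypothetical helper for illustration only; compares observed genotype
    counts with the counts expected from the estimated allele frequencies.
    """
    n = n_aa + n_ab + n_bb
    # Allele frequency of A estimated from the genotype counts.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Sum of (O - E)^2 / E over the three genotype classes.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


if __name__ == "__main__":
    # Made-up counts: a heterozygote deficit relative to expectation.
    print(hwe_chi_square(30, 40, 30))
```

With one degree of freedom, the statistic would be compared against the chi-square distribution; for small samples or rare alleles an exact test is usually preferred, which matches the abstract's mention of both approaches.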
De Wever, Aaike; Leliaert, Frederik; Verleyen, Elie; Vanormelingen, Pieter; Van der Gucht, Katleen; Hodgson, Dominic A.; Sabbe, Koen; Vyverman, Wim
2009-01-01
Recent data revealed that metazoans such as mites and springtails have persisted in Antarctica throughout several glacial–interglacial cycles, which contradicts the existing paradigm that terrestrial life was wiped out by successive glacial events and that the current inhabitants are recent colonizers. We used molecular phylogenetic techniques to study Antarctic microchlorophyte strains isolated from lacustrine habitats from maritime and continental Antarctica. The 14 distinct chlorophycean and trebouxiophycean lineages observed point to a wide phylogenetic diversity of apparently endemic Antarctic lineages at different taxonomic levels. This supports the hypothesis that long-term survival took place in glacial refugia, resulting in a specific Antarctic flora. The majority of the lineages have estimated ages between 17 and 84 Ma and probably diverged from their closest relatives around the time of the opening of Drake Passage (30–45 Ma), while some lineages with longer branch lengths have estimated ages that precede the break-up of Gondwana. The variation in branch length and estimated age points to several independent but rare colonization events. PMID:19625320
ERIC Educational Resources Information Center
Maier, Caroline Alexandra
2001-01-01
Presents an activity in which students seek answers to questions about evolutionary relationships by using genetic databases and bioinformatics software. Students build genetic distance matrices and phylogenetic trees based on molecular sequence data using web-based resources. Provides a flowchart of steps involved in accessing, retrieving, and…
Phylogenetic classification of bony fishes.
Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo
2017-07-06
Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny (www.deepfin.org). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases where morphological support exists for the groups being classified. 
This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution for more taxa than previous versions, based on more densely sampled phylogenetic trees. The classification presented in this study represents, unlike any other, the most up-to-date hypothesis of the Tree of Life of fishes.
Nodal distances for rooted phylogenetic trees.
Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel
2010-08-01
Dissimilarity measures for (possibly weighted) phylogenetic trees based on the comparison of their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem, by splitting in a suitable way each path length between two taxa into two lengths. We prove that the resulting splitted path lengths matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces $M_n(\mathbb{R})$ of real-valued $n \times n$ matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using $L^p$ metrics on $M_n(\mathbb{R})$, with $p \in \mathbb{R}_{>0}$.
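The idea of splitting each path length in two can be sketched as follows. This is an illustrative reading of the abstract, not the authors' implementation: it assumes that entry (i, j) of the matrix is the distance from the most recent common ancestor of i and j down to i, so that the classical path length between i and j is the sum of entries (i, j) and (j, i). The sketch handles unweighted trees only, takes p ≥ 1, and all function names and the child-to-parent encoding are hypothetical.

```python
def depth_map(parent):
    """Depth of every node, counting arcs from the root (root depth is 0)."""
    def depth(node):
        d = 0
        while node in parent:
            node = parent[node]
            d += 1
        return d
    nodes = set(parent) | set(parent.values())
    return {v: depth(v) for v in nodes}


def lca(parent, a, b):
    """Most recent common ancestor of nodes a and b."""
    ancestors = {a}
    node = a
    while node in parent:
        node = parent[node]
        ancestors.add(node)
    node = b
    while node not in ancestors:
        node = parent[node]
    return node


def split_path_matrix(parent, taxa):
    """Entry (i, j) is the number of arcs from lca(i, j) down to taxon i."""
    depth = depth_map(parent)
    return [[depth[i] - depth[lca(parent, i, j)] for j in taxa] for i in taxa]


def lp_distance(m1, m2, p=1.0):
    """Entrywise L^p distance between two matrices (assumes p >= 1)."""
    total = sum(abs(x - y) ** p
                for r1, r2 in zip(m1, m2) for x, y in zip(r1, r2))
    return total ** (1.0 / p)


# Two rooted trees on taxa a, b, c, encoded as child -> parent maps.
t1 = {"a": "u", "b": "u", "u": "r", "c": "r"}   # ((a,b),c)
t2 = {"b": "v", "c": "v", "v": "r", "a": "r"}   # (a,(b,c))
taxa = ["a", "b", "c"]
m1 = split_path_matrix(t1, taxa)
m2 = split_path_matrix(t2, taxa)
print(lp_distance(m1, m2, p=1.0))
```

The asymmetry of the matrix is the point: entries (a, c) and (c, a) differ whenever the two taxa sit at different depths below their common ancestor, which is information the symmetric path-length vector discards.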
J. Geml; I. Timling; C.H. Robinson; N. Lennon; H.C. Nusbaum; C. Brochmann; M.E. Noordeloos; D.L. Taylor
2011-01-01
Current evidence from temperate studies suggests that ectomycorrhizal (ECM) fungi require overland routes for migration because of their obligate symbiotic associations with woody plants. Despite their key roles in arctic ecosystems, the phylogenetic diversity and phylogeography of arctic ECM fungi remains little known. Here we assess the phylogenetic diversity of ECM...
Kassian, Alexei
2015-01-01
A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456
Sites, J.W.; Morando, M.; Highton, R.; Huber, F.; Jung, R.E.
2004-01-01
The Shenandoah salamander (Plethodon shenandoah), known from isolated talus slopes on three of the highest mountains in Shenandoah National Park, is listed as state-endangered in Virginia and federally endangered under the U.S. Endangered Species Act. A 1999 paper by G. R. Thurow described P. shenandoah-like salamanders from three localities further south in the Blue Ridge Physiographic Province, which, if confirmed, would represent a range extension for P. shenandoah of approximately 90 km from its nearest known locality. Samples collected from two of these three localities were included in a molecular phylogenetic study of the known populations of P. shenandoah, and all other recognized species in the Plethodon cinereus group, using a 792 bp region of the mitochondrial cytochrome-b gene. Phylogenetic estimates were based on Bayesian, maximum likelihood, and maximum parsimony methods, and topologies were examined for placement of the new P. shenandoah-like samples relative to all others. All topologies recovered all haplotypes of the P. shenandoah-like animals nested within P. cinereus, and a statistical comparison of the best likelihood tree topology with one with an enforced (Thurow + Shenandoah P. shenandoah) clade revealed that the unconstrained tree had a significantly lower -ln L score (P < 0.05, using the Shimodaira-Hasegawa test) than the constraint tree. This result and other anecdotal information give us no solid reason to consider the Thurow report valid. The current recovery program for P. shenandoah should remain focused on populations in Shenandoah National Park.
High mitochondrial mutation rates estimated from deep-rooting Costa Rican pedigrees
Madrigal, Lorena; Melendez-Obando, Mauricio; Villegas-Palma, Ramon; Barrantes, Ramiro; Raventos, Henrieta; Pereira, Reynaldo; Luiselli, Donata; Pettener, Davide; Barbujani, Guido
2012-01-01
Estimates of mutation rates for the noncoding hypervariable Region I (HVR-I) of mitochondrial DNA (mtDNA) vary widely, depending on whether they are inferred from phylogenies (assuming that molecular evolution is clock-like) or directly from pedigrees. All pedigree-based studies so far were conducted on populations of European origin. In this paper we analyzed 19 deep-rooting pedigrees in a population of mixed origin in Costa Rica. We calculated two estimates of the HVR-I mutation rate, one considering all apparent mutations, and one disregarding changes at sites known to be mutational hot spots and eliminating genealogy branches which might be suspected to include errors or unrecognized adoptions along the female lines. At the end of this procedure, we still observed a mutation rate equal to 1.24 × 10^-6 per site per year, i.e., at least three-fold as high as estimates derived from phylogenies. Our results confirm that mutation rates observed in pedigrees are much higher than estimated assuming a neutral model of long-term HVR-I evolution. We argue that, until the cause of these discrepancies is fully understood, both lower estimates (i.e., those derived from phylogenetic comparisons) and higher, direct estimates such as those obtained in this study should be considered when modeling evolutionary and demographic processes. PMID:22460349
The chordate proteome history database.
Levasseur, Anthony; Paganini, Julien; Dainat, Jacques; Thompson, Julie D; Poch, Olivier; Pontarotti, Pierre; Gouret, Philippe
2012-01-01
The chordate proteome history database (http://ioda.univ-provence.fr) comprises some 20,000 evolutionary analyses of proteins from chordate species. Our main objective was to characterize and study the evolutionary histories of the chordate proteome, and in particular to detect genomic events and automatic functional searches. Firstly, phylogenetic analyses based on high quality multiple sequence alignments and a robust phylogenetic pipeline were performed for the whole protein and for each individual domain. Novel approaches were developed to identify orthologs/paralogs, and predict gene duplication/gain/loss events and the occurrence of new protein architectures (domain gains, losses and shuffling). These important genetic events were localized on the phylogenetic trees and on the genomic sequence. Secondly, the phylogenetic trees were enhanced by the creation of phylogroups, whereby groups of orthologous sequences created using OrthoMCL were corrected based on the phylogenetic trees; gene family size and gene gain/loss in a given lineage could be deduced from the phylogroups. For each ortholog group obtained from the phylogenetic or the phylogroup analysis, functional information and expression data can be retrieved. Database searches can be performed easily using biological objects: protein identifier, keyword or domain, but can also be based on events, eg, domain exchange events can be retrieved. To our knowledge, this is the first database that links group clustering, phylogeny and automatic functional searches along with the detection of important events occurring during genome evolution, such as the appearance of a new domain architecture.
Jing, Hongmei; Lacap, Donnabella C; Lau, Chui Yim; Pointing, Stephen B
2006-04-01
The 16S rRNA gene-defined bacterial diversity of tropical intertidal geothermal vents subject to varying degrees of seawater inundation was investigated. Shannon-Weaver diversity estimates of clone library-derived sequences revealed that the hottest pools located above the mean high-water mark that did not experience seawater inundation were most diverse, followed by those that were permanently submerged below the mean low-water mark. Pools located in the intertidal were the least biodiverse, and this is attributed to the fluctuating conditions caused by periodic seawater inundation rather than physicochemical conditions per se. Phylogenetic analysis revealed that a ubiquitous Oscillatoria-like phylotype accounted for 83% of clones. Synechococcus-like phylotypes were also encountered at each location, whilst others belonging to the Chroococcales, Oscillatoriales, and other non-phototrophic bacteria occurred only at specific locations along the gradient. All cyanobacterial phylotypes displayed highest phylogenetic affinity to terrestrial thermophilic counterparts rather than marine taxa.
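As an illustration of the diversity estimate named above, here is a minimal sketch of the Shannon(-Weaver) index, H' = -Σ p_i ln p_i, computed from clone counts per phylotype. The function name and the counts are hypothetical; the abstract does not report per-pool clone counts, so the numbers below only mimic a library dominated by a single phylotype.

```python
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln(p_i)) over non-empty classes."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


if __name__ == "__main__":
    # Hypothetical clone libraries of 100 clones each.
    dominated = [83, 10, 7]      # one phylotype holds most clones
    even = [34, 33, 33]          # clones spread almost evenly
    print(shannon_index(dominated), shannon_index(even))
```

The index rises with both the number of phylotypes and the evenness of their abundances, so a library dominated by one phylotype scores lower than an equally rich but even one.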
Daniels, Savel R; Phiri, Ethel E; Klaus, Sebastian; Albrecht, Christian; Cumberlidge, Neil
2015-07-01
Phylogenetic reconstruction, divergence time estimations and ancestral range estimation were undertaken for 66% of the Afrotropical freshwater crab fauna (Potamonautidae) based on four partial DNA loci (12S rRNA, 16S rRNA, cytochrome oxidase one [COI], and histone 3). The present study represents the most comprehensive taxonomic sampling of any freshwater crab family globally, and explores the impact of paleodrainage interconnectivity on cladogenesis among freshwater crabs. Phylogenetic analyses of the total evidence data using maximum-likelihood (ML), maximum parsimony (MP), and Bayesian inference (BI) produced a robust statistically well-supported tree topology that reaffirmed the monophyly of the Afrotropical freshwater crab fauna. The estimated divergence times suggest that the Afrotropical Potamonautidae diverged during the Eocene. Cladogenesis within and among several genera occurred predominantly during the Miocene, which was associated with major tectonic and climatic ameliorations throughout the region. Paleodrainage connectivity was observed with specimens from the Nilo-Sudan and East African coast proving to be sister to specimens from the Upper Guinea Forests in West Africa. In addition, we observed strong sister taxon affinity between specimens from East Africa and the Congo basin, including specimens from Lake Tanganyika, while the southern African fauna was retrieved as sister to the Angolan taxa. Within the East African clade we observed two independent transoceanic dispersal events, one to the Seychelles Archipelago and a second to Madagascar, while we observe a single transoceanic dispersal event from West Africa to São Tomé. 
The ancestral area estimation suggested a West African/East African ancestral range for the family with multiple dispersal events between southern Africa and East Africa, and between East Africa and Central Africa. The taxonomic implications of our results are discussed in light of the widespread paraphyly evident among a number of genera. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Dembo, Mana; Radovčić, Davorka; Garvin, Heather M; Laird, Myra F; Schroeder, Lauren; Scott, Jill E; Brophy, Juliet; Ackermann, Rebecca R; Musiba, Chares M; de Ruiter, Darryl J; Mooers, Arne Ø; Collard, Mark
2016-08-01
Homo naledi is a recently discovered species of fossil hominin from South Africa. A considerable amount is already known about H. naledi but some important questions remain unanswered. Here we report a study that addressed two of them: "Where does H. naledi fit in the hominin evolutionary tree?" and "How old is it?" We used a large supermatrix of craniodental characters for both early and late hominin species and Bayesian phylogenetic techniques to carry out three analyses. First, we performed a dated Bayesian analysis to generate estimates of the evolutionary relationships of fossil hominins including H. naledi. Then we employed Bayes factor tests to compare the strength of support for hypotheses about the relationships of H. naledi suggested by the best-estimate trees. Lastly, we carried out a resampling analysis to assess the accuracy of the age estimate for H. naledi yielded by the dated Bayesian analysis. The analyses strongly supported the hypothesis that H. naledi forms a clade with the other Homo species and Australopithecus sediba. The analyses were more ambiguous regarding the position of H. naledi within the (Homo, Au. sediba) clade. A number of hypotheses were rejected, but several others were not. Based on the available craniodental data, Homo antecessor, Asian Homo erectus, Homo habilis, Homo floresiensis, Homo sapiens, and Au. sediba could all be the sister taxon of H. naledi. According to the dated Bayesian analysis, the most likely age for H. naledi is 912 ka. This age estimate was supported by the resampling analysis. Our findings have a number of implications. Most notably, they support the assignment of the new specimens to Homo, cast doubt on the claim that H. naledi is simply a variant of H. erectus, and suggest H. naledi is younger than has been previously proposed. Copyright © 2016 Elsevier Ltd. All rights reserved.
Pérez, María Encarnación; Pol, Diego
2012-01-01
Background: Caviidae is a diverse group of caviomorph rodents that is broadly distributed in South America and is divided into three highly divergent extant lineages: Caviinae (cavies), Dolichotinae (maras), and Hydrochoerinae (capybaras). The fossil record of Caviidae is only abundant and diverse since the late Miocene. Caviids belong to Cavioidea sensu stricto (Cavioidea s.s.), which also includes a diverse assemblage of extinct taxa recorded from the late Oligocene to the middle Miocene of South America (“eocardiids”). Results: A phylogenetic analysis combining morphological and molecular data is presented here, evaluating the time of diversification of selected nodes based on the calibration of phylogenetic trees with fossil taxa and the use of relaxed molecular clocks. This analysis reveals three major phases of diversification in the evolutionary history of Cavioidea s.s. The first two phases involve two successive radiations of extinct lineages that occurred during the late Oligocene and the early Miocene. The third phase consists of the diversification of Caviidae. The initial split of caviids is dated as middle Miocene by the fossil record. This date falls within the 95% higher probability distribution estimated by the relaxed Bayesian molecular clock, although the mean age estimates are 3.5 to 7 Myr older. The initial split of caviids is followed by an obscure period of poor fossil record (referred to here as the Mayoan gap) and then by the appearance of highly differentiated modern lineages of caviids, which evidently occurred in the late Miocene as indicated by both the fossil record and molecular clock estimates. Conclusions: The integrated approach used here allowed us to identify the agreements and discrepancies between the fossil record and molecular clock estimates on the timing of the major events in cavioid evolution, revealing evolutionary patterns that would not have been possible to gather using molecular or paleontological data alone. 
PMID:23144757
Lloyd, G T; Bapst, D W; Friedman, M; Davis, K E
2016-11-01
Branch lengths, measured in character changes, are an essential requirement of clock-based divergence estimation, regardless of whether the fossil calibrations used represent nodes or tips. However, a separate set of divergence time approaches are typically used to date palaeontological trees, which may lack such branch lengths. Among these methods, sophisticated probabilistic approaches have recently emerged, in contrast with simpler algorithms relying on minimum node ages. Here, using a novel phylogenetic hypothesis for Mesozoic dinosaurs, we apply two such approaches to estimate divergence times for: (i) Dinosauria, (ii) Avialae (the earliest birds) and (iii) Neornithes (crown birds). We find: (i) the plausibility of a Permian origin for dinosaurs to be dependent on whether Nyasasaurus is the oldest dinosaur, (ii) a Middle to Late Jurassic origin of avian flight regardless of whether Archaeopteryx or Aurornis is considered the first bird and (iii) a Late Cretaceous origin for Neornithes that is broadly congruent with other node- and tip-dating estimates. Demonstrating the feasibility of probabilistic time-scaling further opens up divergence estimation to the rich histories of extinct biodiversity in the fossil record, even in the absence of detailed character data. © 2016 The Authors.
Montes-Pérez, Rubén C; García, Adán W Echeverría; Castro, Jorge Zavala; Gamboa, Militza G Alfaro
2006-09-01
The objective of this work was to estimate the nucleotide variation between two groups of tepezcuintles (Agouti paca) from the states of Campeche and Quintana Roo, Mexico, and within members of each group. Blood samples were collected from eleven A. paca kept in captivity. DNA from leukocytes was used for Random Amplified Polymorphic DNA (RAPD) analysis. Primers three, 5'-d(GTAGACCCGT)-3', and six, 5'-d(CCCGTCAGCA)-3', were selected from the Amersham kit (Ready.To.Go. RAPD Analysis Beads, Amersham Pharmacia Biotech) because they produced an adequate number of bands. The electrophoretic pattern of bands obtained was analyzed using software for phylogenetic analysis based on the UPGMA method, to estimate the units of nucleotide variation. The phylogenetic tree obtained with primer three reveals a dichotomous grouping between the animals from both states in the Yucatan Peninsula, showing a divergence value of 1.983 nucleotides per hundred. Animals from Quintana Roo show a grouping with primer six; an additional grouping was observed with animals from Campeche. Nucleotide variation between both groups was 2.118 nucleotides per hundred. The nucleotide variation for the two primers within the groups from both states showed fluctuating values from 0.46 to 1.68 nucleotides per hundred, which indicates that nucleotide variation between the two groups of animals is around two nucleotides per hundred and, within the groups, less than 1.7 nucleotides per hundred.
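The UPGMA method named in this abstract can be sketched in a few lines. This is a generic illustration, not the software used in the study: the function name, the labels, and the distance matrix are made up, and the sketch assumes that a pairwise distance matrix has already been derived from the band patterns.

```python
def upgma(labels, dist):
    """Cluster taxa with UPGMA.

    Returns the tree as nested tuples plus a list of (subtree, height)
    pairs, where height is half the distance at which the pair was joined.
    """
    n = len(labels)
    clusters = {i: (labels[i], 1) for i in range(n)}          # id -> (tree, size)
    d = {(i, j): dist[i][j] for i in range(n) for j in range(i + 1, n)}
    next_id = n
    merges = []
    while len(clusters) > 1:
        # Pick the closest pair of current clusters.
        (a, b), dab = min(d.items(), key=lambda kv: kv[1])
        tree_a, size_a = clusters.pop(a)
        tree_b, size_b = clusters.pop(b)
        # Size-weighted average distance from the merged cluster to the rest.
        new = {}
        for c in clusters:
            dac = d[(min(a, c), max(a, c))]
            dbc = d[(min(b, c), max(b, c))]
            new[(c, next_id)] = (size_a * dac + size_b * dbc) / (size_a + size_b)
        d = {k: v for k, v in d.items() if a not in k and b not in k}
        d.update(new)
        clusters[next_id] = ((tree_a, tree_b), size_a + size_b)
        merges.append(((tree_a, tree_b), dab / 2))
        next_id += 1
    return next(iter(clusters.values()))[0], merges


# Made-up distances: A and B are close, C is distant from both.
tree, merges = upgma(["A", "B", "C"],
                     [[0, 2, 6],
                      [2, 0, 6],
                      [6, 6, 0]])
print(tree)
```

UPGMA assumes a constant rate of change across lineages, which is why the join height is simply half the inter-cluster distance; when that assumption is doubtful, neighbor joining is a common alternative.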
Benítez-Benítez, C; Escudero, M; Rodríguez-Sánchez, F; Martín-Bravo, S; Jiménez-Mejías, P
2018-04-01
Estimating species' ability to adapt to environmental changes is crucial to understand their past and future response to climate change. The Mediterranean Basin has experienced remarkable climatic changes since the Miocene, which have greatly influenced the evolution of the Mediterranean flora. Here, we examine the evolutionary history and biogeographic patterns of two sedge sister species (Carex, Cyperaceae) restricted to the western Mediterranean Basin, but with a Pliocene fossil record in central Europe. In particular, we estimated the evolution of climatic niches through time and its influence on lineage differentiation. We carried out a dated phylogenetic-phylogeographic study based on seven DNA regions (nDNA and ptDNA) and fingerprinting data (AFLPs), and modelled ecological niches and species distributions for the Pliocene, Pleistocene and present. Phylogenetic and divergence time analyses revealed that both species form a monophyletic lineage that originated in the late Pliocene-early Pleistocene. We detected clear genetic differentiation between both species with distinct genetic clusters in disjunct areas, indicating the predominant role of geographic barriers limiting gene flow. We found a remarkable shift in the climatic requirements between Pliocene and extant populations, although the niche seems to have been relatively conserved since the Pleistocene split of both species. This study highlights how an integrative approach combining different data sources and analyses, including fossils, allows solid and robust inferences about the evolutionary history of a plant group since the Pliocene. © 2018 John Wiley & Sons Ltd.
Leaché, Adam D; Banbury, Barbara L; Linkem, Charles W; de Oca, Adrián Nieto-Montes
2016-03-22
Resolving the short phylogenetic branches that result from rapid evolutionary diversification often requires large numbers of loci. We collected targeted sequence capture data from 585 nuclear loci (541 ultraconserved elements and 44 protein-coding genes) to estimate the phylogenetic relationships among iguanian lizards in the North American genus Sceloporus. We tested for diversification rate shifts to determine if rapid radiation in the genus is correlated with chromosomal evolution. The phylogenomic trees that we obtained for Sceloporus using concatenation and coalescent-based species tree inference provide strong support for the monophyly and interrelationships among nearly all major groups. The diversification analysis supported one rate shift on the Sceloporus phylogeny approximately 20-25 million years ago that is associated with the doubling of the speciation rate from 0.06 species/million years (Ma) to 0.15 species/Ma. The posterior probability for this rate shift occurring on the branch leading to the Sceloporus species groups exhibiting increased chromosomal diversity is high (posterior probability = 0.997). Despite high levels of gene tree discordance, we were able to estimate a phylogenomic tree for Sceloporus that solves some of the taxonomic problems caused by previous analyses of fewer loci. The taxonomic changes that we propose using this new phylogenomic tree help clarify the number and composition of the major species groups in the genus. Our study provides new evidence for a putative link between chromosomal evolution and the rapid divergence and radiation of Sceloporus across North America.
NASA Astrophysics Data System (ADS)
Poff, N.; Vieira, N. K.; Simmons, M. P.; Olden, J. D.; Kondratieff, B. C.; Finn, D. S.
2005-05-01
The use of species traits as indicators of environmental disturbance is being considered for biomonitoring programs globally. As such, methods to select relevant and informative traits for inclusion in biometrics need to be developed. In this research, we identified 20 traits of aquatic insects within six trait groups: morphology, mobility, life-history strategy, thermal tolerance, feeding guild and ecology (e.g., habitat preference). We constructed phylogenetic trees for 1) all lotic insect species of North America and 2) all Ephemeroptera, Plecoptera and Trichoptera species based on morphology- and molecular-based analyses and classifications. We then measured variability (i.e., plasticity) of the 20 traits and six trait groups across the two phylogenetic trees. Traits with higher degrees of plasticity indicated traits that were less phylogenetically constrained, and were considered informative for biomonitoring purposes. Thermal tolerance, rheophily, body size at maturity and feeding guild showed the highest plasticity across both phylogenetic trees. Two mobility traits, occurrence in drift and adult dispersal distance, showed moderate plasticity. By contrast, adult exiting ability, degree of attachment, adult lifespan and body shape showed low variability and were thus less informative. Plastic species traits that are less phylogenetically constrained may be most useful in detecting community change along environmental gradients.
Phylogenetic congruence between subtropical trees and their associated fungi.
Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao
2016-12-01
Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK, rbcL, atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.
Zheng, Xiaoyan; Cai, Danying; Potter, Daniel; Postman, Joseph; Liu, Jing; Teng, Yuanwen
2014-11-01
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence datasets. Phylogenetic trees based on both cpDNA and nuclear LFY2int2-N (LN) data resulted in poor resolution; in particular, only five primary species were monophyletic in the LN tree. A phylogenetic network of LN suggested that reticulation caused by hybridization is one of the major evolutionary processes for Pyrus species. Polytomies of the gene trees and the star-like structure of cpDNA networks suggested that rapid radiation is another major evolutionary process, especially for the occidental species. Pyrus calleryana and P. regelii were the earliest diverged Pyrus species. Two North African species, P. cordata, P. spinosa and P. betulaefolia were descendants of primitive stock Pyrus species and still share some common molecular characters. Southwestern China, where a large number of P. pashia populations are found, is probably the most important diversification center of Pyrus. More accessions and nuclear genes are needed for further understanding the evolutionary histories of Pyrus. Copyright © 2014 Elsevier Inc. All rights reserved.
USDA-ARS's Scientific Manuscript database
The 10 species of Streptomyces implicated as the etiological agents in scab disease of potatoes or soft rot disease of sweet potatoes are distributed among 7 different phylogenetic clades in analyses based on 16S rRNA gene sequences, but high sequence similarity of this gene among Streptomyces speci...
Liu, Ai-Rong; Chen, Shuang-Chen; Wu, Shang-Ying; Xu, Tong; Guo, Liang-Dong; Jeewon, Rajesh; Wei, Ji-Guang
2010-11-01
Previous phylogenetic studies based on DNA sequence data have partially resolved taxonomic relationships among Pestalotiopsis species. There are still some morphological characters whose phylogenetic significance has not been assessed properly due to limited taxon sampling, in particular the degree of pigmentation of median cells. In this study, the stability of pigmentation of median cells of conidia in Pestalotiopsis species was evaluated in subculture, and a molecular phylogenetic analysis was conducted on 45 strains belonging to 26 species in order to reappraise the pigmentation of median cells for its significance in the taxonomy of Pestalotiopsis. Phylogenetic relationships were inferred from nucleotide sequences in ITS regions (ITS1, 5.8S and ITS2) and β-tubulin 2 gene (tub2). The results showed that pigmentation of median cells was stable and it could be a key character in the taxonomy of Pestalotiopsis species. Instead of "concolorous" and "versicolor" proposed by Steyaert (1949), "brown to olivaceous" and "umber to fuliginous" are described and proposed in this paper. Copyright © 2010. Published by Elsevier Inc.
Gottschling, Marc; Soehner, Sylvia; Zinssmeister, Carmen; John, Uwe; Plötner, Jörg; Schweikert, Michael; Aligizaki, Katerina; Elbrächter, Malte
2012-01-01
The phylogenetic relationships of the Dinophyceae (Alveolata) are not sufficiently resolved at present. The Thoracosphaeraceae (Peridiniales) are the only group of the Alveolata that include members with calcareous coccoid stages; this trait is considered apomorphic. Although the coccoid stage apparently is not calcareous, Bysmatrum has been assigned to the Thoracosphaeraceae based on thecal morphology. We tested the monophyly of the Thoracosphaeraceae using large sets of ribosomal RNA sequence data of the Alveolata including the Dinophyceae. Phylogenetic analyses were performed using Maximum Likelihood and Bayesian approaches. The Thoracosphaeraceae were monophyletic, but included also a number of non-calcareous dinophytes (such as Pentapharsodinium and Pfiesteria) and even parasites (such as Duboscquodinium and Tintinnophagus). Bysmatrum had an isolated and uncertain phylogenetic position outside the Thoracosphaeraceae. The phylogenetic relationships among calcareous dinophytes appear complex, and the assumption of the single origin of the potential to produce calcareous structures is challenged. The application of concatenated ribosomal RNA sequence data may prove promising for phylogenetic reconstructions of the Dinophyceae in future. Copyright © 2011 Elsevier GmbH. All rights reserved.
Piccin-Santos, Viviane; Brandão, Marcelo Mendes; Bittencourt-Oliveira, Maria Do Carmo
2014-08-01
Selection of genes that have not been horizontally transferred for prokaryote phylogenetic inferences is regarded as a challenging task. The markers internal transcribed spacer of ribosomal genes (16S-23S ITS) and phycocyanin intergenic spacer (PC-IGS), based on the operons of ribosomal and phycocyanin genes respectively, are among the most used markers in cyanobacteria. The region of the ribosomal genes has been considered stable, whereas the phycocyanin operon may have undergone horizontal transfer. To investigate the occurrence of horizontal transfer of PC-IGS, phylogenetic trees of Geitlerinema and Microcystis strains were generated using PC-IGS and 16S-23S ITS and compared. Phylogenetic trees based on the two markers were mostly congruent for Geitlerinema and Microcystis, indicating a common evolutionary history among ribosomal and phycocyanin genes with no evidence for horizontal transfer of PC-IGS. Thus, PC-IGS is a suitable marker, along with 16S-23S ITS for phylogenetic studies of cyanobacteria. © 2014 Phycological Society of America.
Nebenzahl-Guimaraes, Hanna; Verhagen, Lilly M; Borgdorff, Martien W; van Soolingen, Dick
2015-10-01
The aim of this study was to determine if mycobacterial lineages affect infection risk, clustering, and disease progression among Mycobacterium tuberculosis cases in The Netherlands. Multivariate negative binomial regression models adjusted for patient-related factors and stratified by patient ethnicity were used to determine the association between phylogenetic lineages and infectivity (mean number of positive contacts around each patient) and clustering (as defined by number of secondary cases within 2 years after diagnosis of an index case sharing the same fingerprint) indices. An estimate of progression to disease by each risk factor was calculated as a bootstrapped risk ratio of the clustering index by the infectivity index. Compared to the Euro-American reference, Mycobacterium africanum showed significantly lower infectivity and clustering indices in the foreign-born population, while Mycobacterium bovis showed significantly lower infectivity and clustering indices in the native population. Significantly lower infectivity was also observed for the East African Indian lineage in the foreign-born population. Smear positivity was a significant risk factor for increased infectivity and increased clustering. Estimates of progression to disease were significantly associated with age, sputum-smear status, and behavioral risk factors, such as alcohol and intravenous drug abuse, but not with phylogenetic lineages. In conclusion, we found evidence of a bacteriological factor influencing indicators of a strain's transmissibility, namely, a decreased ability to infect and a lower clustering index in ancient phylogenetic lineages compared to their modern counterparts. Confirmation of these findings via follow-up studies using tuberculin skin test conversion data should have important implications on M. tuberculosis control efforts. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Dool, Serena E; Puechmaille, Sebastien J; Foley, Nicole M; Allegrini, Benjamin; Bastian, Anna; Mutumi, Gregory L; Maluleke, Tinyiko G; Odendaal, Lizelle J; Teeling, Emma C; Jacobs, David S
2016-04-01
Despite many studies illustrating the perils of utilising mitochondrial DNA in phylogenetic studies, it remains one of the most widely used genetic markers for this purpose. Over the last decade, nuclear introns have been proposed as alternative markers for phylogenetic reconstruction. However, the resolution capabilities of mtDNA and nuclear introns have rarely been quantified and compared. In the current study we generated a novel ∼5kb dataset comprising six nuclear introns and a mtDNA fragment. We assessed the relative resolution capabilities of the six intronic fragments with respect to each other, when used in various combinations together, and when compared to the traditionally used mtDNA. We focused on a major clade in the horseshoe bat family (Afro-Palaearctic clade; Rhinolophidae) as our case study. This old, widely distributed and speciose group contains a high level of conserved morphology. This morphological stasis renders the reconstruction of the phylogeny of this group with traditional morphological characters complex. We sampled multiple individuals per species to represent their geographic distributions as best as possible (122 individuals, 24 species, 68 localities). We reconstructed the species phylogeny using several complementary methods (partitioned Maximum Likelihood and Bayesian and Bayesian multispecies-coalescent) and made inferences based on consensus across these methods. We computed pairwise comparisons based on Robinson-Foulds tree distance metric between all Bayesian topologies generated (27,000) for every gene(s) and visualised the tree space using multidimensional scaling (MDS) plots. Using our supported species phylogeny we estimated the ancestral state of key traits of interest within this group, e.g. echolocation peak frequency which has been implicated in speciation. 
Our results revealed many potential cryptic species within this group, even in taxa where this was not suspected a priori and also found evidence for mtDNA introgression. We demonstrated that by using just two introns one can recover a better supported species tree than when using the mtDNA alone, despite the shorter overall length of the combined introns. Additionally, when combining any single intron with mtDNA, we showed that the result is highly similar to the mtDNA gene tree and far from the true species tree and therefore this approach should be avoided. We caution against the indiscriminate use of mtDNA in phylogenetic studies and advocate for pilot studies to select nuclear introns. The selection of marker type and number is a crucial step that is best based on critical examination of preliminary or previously published data. Based on our findings and previous publications, we recommend the following markers to recover phylogenetic relationships between recently diverged taxa (<20 My) in bats and other mammals: ACOX2, COPS7A, BGN, ROGDI and STAT5A. Copyright © 2016 Elsevier Inc. All rights reserved.
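The pairwise Robinson-Foulds comparison described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the nested-tuple tree encoding, the function names, and the toy five-taxon trees are all assumptions made for illustration. Each tree is reduced to its set of non-trivial bipartitions, and the distance is the number of bipartitions found in one tree but not the other.

```python
def leaf_set(tree):
    """Collect the leaf labels of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def splits(tree):
    """Return the non-trivial bipartitions of a tree, treated as unrooted.

    Each bipartition is stored as the side that does NOT contain a fixed
    reference taxon, so the same split always gets the same representation.
    """
    all_leaves = leaf_set(tree)
    reference = min(all_leaves)  # arbitrary but deterministic choice
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        clade = frozenset().union(*(visit(child) for child in node))
        side = all_leaves - clade if reference in clade else clade
        # Skip trivial splits (a single leaf, or nothing, on one side).
        if 1 < len(side) < len(all_leaves) - 1:
            found.add(side)
        return clade

    visit(tree)
    return found


def rf_distance(tree_a, tree_b):
    """Robinson-Foulds distance: size of the symmetric difference of splits."""
    if leaf_set(tree_a) != leaf_set(tree_b):
        raise ValueError("trees must share the same taxon set")
    return len(splits(tree_a) ^ splits(tree_b))


# Hypothetical toy topologies (not taken from the study).
t1 = ((("A", "B"), "C"), ("D", "E"))
t2 = ((("A", "C"), "B"), ("D", "E"))
print(rf_distance(t1, t2))
```

A matrix of such distances over all sampled topologies is the kind of input a multidimensional scaling routine would take to draw the tree-space plots mentioned in the abstract.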
Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A
2006-03-01
Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.
Alonso, Conchita; Pérez, Ricardo; Bazaga, Pilar; Herrera, Carlos M.
2015-01-01
DNA cytosine methylation is a widespread epigenetic mechanism in eukaryotes, and plant genomes commonly are densely methylated. Genomic methylation can be associated with functional consequences such as mutational events, genomic instability or altered gene expression, but little is known about interspecific variation in global cytosine methylation in plants. In this paper, we compare global cytosine methylation estimates obtained by HPLC and use a phylogenetically-informed analytical approach to test for significance of evolutionary signatures of this trait across 54 angiosperm species in 25 families. We evaluate whether interspecific variation in global cytosine methylation is statistically related to phylogenetic distance and also whether it is evolutionarily correlated with genome size (C-value). Global cytosine methylation varied widely between species, ranging between 5.3% (Arabidopsis) and 39.2% (Narcissus). Differences between species were related to their evolutionary trajectories, as denoted by the strong phylogenetic signal underlying interspecific variation. Global cytosine methylation and genome size were evolutionarily correlated, as revealed by the significant relationship between the corresponding phylogenetically independent contrasts. On average, a ten-fold increase in genome size entailed an increase of about 10% in global cytosine methylation. Results show that global cytosine methylation is an evolving trait in angiosperms whose evolutionary trajectory is significantly linked to changes in genome size, and suggest that the evolutionary implications of epigenetic mechanisms are likely to vary between plant lineages. PMID:25688257
Rearrangement moves on rooted phylogenetic networks
Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2017-01-01
Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic networks, that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves (named rNNI and rSPR) reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results, which separate the contributions of horizontal and vertical moves, we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. 
Our goal is to provide a solid basis for practical phylogenetic network reconstruction. PMID:28763439
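To make the idea of a rearrangement move concrete, the following Python sketch enumerates nearest-neighbor interchange neighbors of a rooted binary tree, which is the tree case that the abstract says rNNI reduces to. It does not implement the network moves of the paper. The nested 2-tuple encoding and the helper names `nni_neighbors` and `clades` are assumptions for illustration only.

```python
def nni_neighbors(tree):
    """List the rooted binary trees one nearest-neighbor interchange away.

    A tree is a leaf label (str) or a 2-tuple of subtrees. For every internal
    edge parent -> child, the child's sibling is swapped with each of the
    child's two subtrees in turn.
    """
    if isinstance(tree, str):
        return []
    left, right = tree
    result = []
    if not isinstance(left, str):
        a, b = left
        result.append(((a, right), b))  # swap sibling `right` with b
        result.append(((right, b), a))  # swap sibling `right` with a
    if not isinstance(right, str):
        c, d = right
        result.append((d, (c, left)))   # swap sibling `left` with d
        result.append((c, (left, d)))   # swap sibling `left` with c
    # Moves applied deeper in the tree leave the other subtree untouched.
    result.extend((n, right) for n in nni_neighbors(left))
    result.extend((left, n) for n in nni_neighbors(right))
    return result


def clades(tree):
    """Set of leaf sets below every node; equal for trees that differ only
    in the left/right order of children."""
    found = set()

    def visit(node):
        if isinstance(node, str):
            leaves = frozenset([node])
        else:
            leaves = visit(node[0]) | visit(node[1])
        found.add(leaves)
        return leaves

    visit(tree)
    return frozenset(found)


# Hypothetical four-taxon caterpillar tree.
start = ((("A", "B"), "C"), "D")
for neighbor in nni_neighbors(start):
    print(neighbor)
```

Under this encoding a rooted binary tree with n leaves has n - 2 internal edges and two swaps per edge, so the sketch returns 2(n - 2) neighbors; repeated application of such moves is what a local search heuristic uses to walk through tree space.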
NASA Astrophysics Data System (ADS)
Porco, David; Deharveng, Louis
2009-08-01
The phylogeny of Collembola, originally discussed from a morphological point of view, has more recently benefited from novel insights brought by molecular analyses. Both morphological and molecular characters produced a well-resolved phylogenetic hypothesis including all orders, most families, and a large number of genera. However, several conflicting points exist between molecular and morphological data, and new characters are clearly needed to resolve these inconsistencies. In this study the usefulness of a new character type not previously used in the phylogenetic study of Collembola was tested: the epicuticular chemical compounds. Our phylogenetic analysis was based on 380 compounds from 26 Collembola species. The results show good resolution for terminal branches but not for internal nodes. This is probably due to the partial involvement of epicuticular lipids in ecological functions such as water conservation and sexual attraction. Thus, this character type is appropriate for reconstructing phylogenetic relationships among recently diversified groups.
Ota, Yuko; Yamanaka, Takashi; Murata, Hitoshi; Neda, Hitoshi; Ohta, Akira; Kawai, Masataka; Yamada, Akiyoshi; Konno, Miki; Tanaka, Chihiro
2012-01-01
Tricholoma matsutake (S. Ito & S. Imai) Singer and its allied species are referred to as matsutake worldwide and are the most economically important edible mushrooms in Japan. They are widely distributed in the northern hemisphere and establish ectomycorrhizal relationships with conifer and broadleaf trees. To clarify relationships among T. matsutake and its allies, and to delimit phylogenetic species, we analyzed multilocus datasets (ITS, megB1, tef, gpd) with samples that were correctly identified based on morphological characteristics. Phylogenetic analyses clearly identified four major groups: matsutake, T. bakamatsutake, T. fulvocastaneum and T. caligatum; the latter three species were outside the matsutake group. The haplotype analyses and median-joining haplotype network analyses showed that the matsutake group included four closely related but clearly distinct taxa (T. matsutake, T. anatolicum, Tricholoma sp. from Mexico and T. magnivelare) from different geographical regions; these were considered to be distinct phylogenetic species.
COI (cytochrome oxidase-I) sequence based studies of Carangid fishes from Kakinada coast, India.
Persis, M; Chandra Sekhar Reddy, A; Rao, L M; Khedkar, G D; Ravinder, K; Nasruddin, K
2009-09-01
Mitochondrial DNA cytochrome oxidase-1 (COI) gene sequences were analyzed for species identification and to assess phylogenetic relationships among commercially important Indian carangid fish species of very high food value. Sequence analysis of the COI gene very clearly indicated that all the 28 fish species fell into five distinct groups, which are genetically distant from each other and exhibited identical phylogenetic reservation. The COI gene sequences from all 28 fishes provide sufficient phylogenetic information and evidence of evolutionary relationships to distinguish the carangid species unambiguously. This study proves the utility of the mtDNA COI gene sequence based approach in identifying fish species at a faster pace.
Accurate Phylogenetic Tree Reconstruction from Quartets: A Heuristic Approach
Reaz, Rezwana; Bayzid, Md. Shamsuzzoha; Rahman, M. Sohel
2014-01-01
Supertree methods construct trees on a set of taxa (species) by combining many smaller trees on overlapping subsets of the entire set of taxa. A ‘quartet’ is an unrooted tree over 4 taxa, hence the quartet-based supertree methods combine many 4-taxon unrooted trees into a single and coherent tree over the complete set of taxa. Quartet-based phylogeny reconstruction methods have been receiving considerable attention in recent years. An accurate and efficient quartet-based method might be competitive with the current best phylogenetic tree reconstruction methods (such as maximum likelihood or Bayesian MCMC analyses), without being as computationally intensive. In this paper, we present a novel and highly accurate quartet-based phylogenetic tree reconstruction method. We performed an extensive experimental study to evaluate the accuracy and scalability of our approach on both simulated and biological datasets. PMID:25117474
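As a small illustration of what a quartet is, the Python sketch below extracts the 4-taxon topologies induced by a larger tree; these are the building blocks that quartet-based methods then try to reassemble. It is not the reconstruction heuristic of the paper, and the nested-tuple encoding and function names are hypothetical.

```python
from itertools import combinations


def clade_sets(tree):
    """Leaf sets below every internal node of a nested-tuple tree."""
    found = []

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        leaves = frozenset().union(*(visit(child) for child in node))
        found.append(leaves)
        return leaves

    visit(tree)
    return found


def induced_quartet(tree, four_taxa):
    """Return the quartet topology ab|cd that `tree` induces on four taxa.

    The result is a frozenset holding the two pairs. If no edge separates
    the taxa two-against-two (an unresolved star), None is returned.
    """
    four = frozenset(four_taxa)
    if len(four) != 4:
        raise ValueError("exactly four distinct taxa are required")
    for clade in clade_sets(tree):
        inside = clade & four
        if len(inside) == 2:
            # The edge above this clade splits the four taxa into two pairs.
            return frozenset([inside, four - inside])
    return None


def all_quartets(tree):
    """Map every 4-subset of the leaves to its induced quartet topology."""
    leaves = sorted(frozenset().union(*clade_sets(tree)))
    return {
        frozenset(subset): induced_quartet(tree, subset)
        for subset in combinations(leaves, 4)
    }


# Hypothetical five-taxon tree: it induces C(5, 4) = 5 quartets.
tree = ((("A", "B"), "C"), ("D", "E"))
for taxa, topology in all_quartets(tree).items():
    print(sorted(taxa), [sorted(pair) for pair in topology])
```

Because the number of quartets grows as C(n, 4), practical quartet-based methods usually work from a sample or from quartets estimated directly from data, which is where heuristics become necessary.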
Dengue Virus Type 4 Phylogenetics in Brazil 2011: Looking beyond the Veil
de Souza, Renato Pereira; Rocco, Iray M.; Maeda, Adriana Y.; Spenassatto, Carine; Bisordi, Ivani; Suzuki, Akemi; Silveira, Vivian R.; Silva, Sarai J. S.; Azevedo, Roberta M.; Tolentino, Fernanda M.; Assis, Jaqueline C.; Bassi, Margarida G.; Dambrós, Bibiana P.; Tumioto, Gabriela L.; Gregianini, Tatiana S.; Souza, Luiza Terezinha M.; Timenetsky, Maria do Carmo S. T.; Santos, Cecília L. S.
2011-01-01
Dengue Fever and Dengue Hemorrhagic Fever are diseases affecting approximately 100 million people/year and are a major concern in developing countries. In the present study, the phylogenetic relationships of six strains of the first autochthonous cases of DENV-4 infection that occurred in Sao Paulo State, Parana State and Rio Grande do Sul State, Brazil, in 2011 were studied. Nucleotide sequences of the envelope gene were determined and compared with sequences representative of the genotypes I, II, III and Sylvatic for DEN4 retrieved from GenBank. We employed a Bayesian phylogenetic approach to reconstruct the phylogenetic relationships of Brazilian DENV-4 and we estimated evolutionary rates and dates of divergence for DENV-4 found in Brazil in 2011. All samples sequenced in this study were located in Genotype II. The studied strains are monophyletic and our data suggest that they have been evolving separately for at least 4 to 6 years. Our data suggest that the virus might have been present in the region for some time, without being noticed by Health Surveillance Services due to a low level of circulation and a higher prevalence of DENV-1 and DENV-2. PMID:22216365
Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks.
Oh, S June; Joung, Je-Gun; Chang, Jeong-Ho; Zhang, Byoung-Tak
2006-06-06
To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees. To compare the structures of metabolic networks in organisms, we adopted the exponential graph kernel, which is a kernel-based approach with a labeled graph that includes a label matrix and an adjacency matrix. To construct the phylogenetic trees, we used an unweighted pair-group method with arithmetic mean, i.e., a hierarchical clustering algorithm. We applied the kernel-based network profiling method in a comparative analysis of nine carbohydrate metabolic networks from 81 biological species encompassing Archaea, Eukaryota, and Eubacteria. The resulting phylogenetic hierarchies generally support the tripartite scheme of three domains rather than the two domains of prokaryotes and eukaryotes. By combining the kernel machines with metabolic information, the method infers the context of biosphere development that covers physiological events required for adaptation by genetic reconstruction. 
The results show that one may obtain a global view of the tree of life by comparing the metabolic pathway structures using meta-level information rather than sequence information. This method may yield further information about biological evolution, such as the history of horizontal transfer of each gene, by studying the detailed structure of the phylogenetic tree constructed by the kernel-based method.
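The tree-building step named in this abstract, the unweighted pair-group method with arithmetic mean (UPGMA), can be sketched in a few lines of Python. This is only an illustration: the graph-kernel similarity computation is not reproduced, and the distance values, the pair-keyed dictionary input, and the function name `upgma` are assumptions.

```python
from itertools import combinations


def upgma(names, distances):
    """Cluster taxa with UPGMA and return the topology as nested tuples.

    names:     list of taxon labels
    distances: dict mapping a pair (a, b) of labels to their distance
    """
    dist = {frozenset(pair): value for pair, value in distances.items()}
    size = {name: 1 for name in names}  # number of leaves in each cluster
    active = list(names)

    while len(active) > 1:
        # Pick the closest pair of current clusters.
        a, b = min(combinations(active, 2), key=lambda p: dist[frozenset(p)])
        merged = (a, b)
        size[merged] = size[a] + size[b]
        active = [c for c in active if c not in (a, b)]
        # Distance to the merged cluster is the size-weighted mean, which
        # equals the average over all leaf pairs.
        for c in active:
            dist[frozenset((merged, c))] = (
                size[a] * dist[frozenset((a, c))]
                + size[b] * dist[frozenset((b, c))]
            ) / size[merged]
        active.append(merged)

    return active[0]


# Hypothetical pairwise distances between four organisms.
toy = {
    ("A", "B"): 2.0, ("A", "C"): 6.0, ("B", "C"): 6.0,
    ("A", "D"): 10.0, ("B", "D"): 10.0, ("C", "D"): 10.0,
}
print(upgma(["A", "B", "C", "D"], toy))
```

UPGMA needs dissimilarities, so a kernel similarity matrix would first have to be converted to distances; how the authors did that conversion is not stated in the abstract.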
Sun, Cheng; Yu, Guoliang; Bao, Manzhu; Zheng, Bo; Ning, Guogui
2014-06-27
Unusual traits found in only a few plant species often have potential biological significance for plant evolution. The genus Helwingia Willd., a dioecious medicinal shrub in the order Aquifoliales, has an unusual floral architecture: an epiphyllous inflorescence. The potential significance and possible evolutionary origin of this species are not well understood because few biological and genetic data are available. In addition, the advent of genomics-based technologies has revolutionized the study of plant species that lack genomic information. Morphological and biological patterns were detailed through anatomical and pollination analyses. An RNA sequencing based transcriptomic analysis was undertaken, and a high-resolution phylogenetic analysis was conducted based on single-copy genes in more than 80 species of seed plants, including H. japonica. We verified that a potential fusion of the rachis to the leaf midvein facilitates insect pollination. RNA sequencing yielded a total of 111450 unigenes; half of them had significant similarity with proteins in the public database, and 20281 unigenes were mapped to 119 pathways. The phylogenetic analysis based on single-copy genes indicates that Helwingia is closer to Euasterids II than to Euasterids, congruent with previous reports using plastid sequences. The odd floral architecture adapts Helwingia to insect pollination by allowing the leaf to host insects larger than the flower, a character rarely shared by other insect-pollinated plants. Furthermore, the present transcriptome greatly enriches the genomic information available for Helwingia species, and the phylogenetic analysis based on nuclear genes greatly improves the resolution and robustness of phylogenetic reconstruction for H. japonica.
CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis
2011-01-01
Background: The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Results: Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. Conclusions: CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore. PMID:21496247
CDAO-store: ontology-driven data integration for phylogenetic analysis.
Chisham, Brandon; Wright, Ben; Le, Trung; Son, Tran Cao; Pontelli, Enrico
2011-04-15
The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore.
Pangaea and the Out-of-Africa Model of Varicella-Zoster Virus Evolution and Phylogeography.
Grose, Charles
2012-09-01
The goal of this minireview is to provide an overview of varicella-zoster virus (VZV) phylogenetics and phylogeography when placed in the broad context of geologic time. Planet Earth was formed over 4 billion years ago, and the supercontinent Pangaea coalesced around 400 million years ago (mya). Based on detailed tree-building models, the base of the phylogenetic tree of the Herpesviridae family has been estimated at 400 mya. Subsequently, Pangaea split into Laurasia and Gondwanaland; in turn, Africa rifted from Gondwanaland. Based on available data, the hypothesis of this minireview is that the ancestral alphaherpesvirus VZV coevolved in simians, apes, and hominins in Africa. When anatomically modern humans first crossed over the Red Sea 60,000 years ago, VZV was carried along in their dorsal root ganglia. Currently, there are five VZV clades, distinguishable by single nucleotide polymorphisms. These clades likely represent continued VZV coevolution, as humans with latent VZV infection left Arabia and dispersed into Asia (clades 2 and 5) and Europe (clades 1, 3, and 4). The prototype VZV sequence contains nearly 125,000 bp, divided into 70 open reading frames. Generally, isolates within a clade display >99.9% identity to one another, while members of one clade compared to a second clade show 99.8% identity to one another. Recently, four different VZV genotypes that do not segregate into the previously defined five clades have been identified, a result indicating a wider than anticipated diversity among newly collected VZV strains around the world.
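To put the identity figures in this abstract into absolute terms, the short Python sketch below converts them into approximate nucleotide differences. It assumes identity is measured over the full genome length of roughly 125,000 bp given in the abstract; the function name and the tenths-of-a-percent convention are illustrative choices made to keep the arithmetic in integers.

```python
GENOME_BP = 125_000  # approximate VZV genome length stated in the abstract


def differing_positions(genome_bp, identity_tenths_of_percent):
    """Number of differing positions for an identity given in tenths of a percent (999 = 99.9%)."""
    return genome_bp * (1000 - identity_tenths_of_percent) // 1000


within_clade_limit = differing_positions(GENOME_BP, 999)  # >99.9% identity: fewer than this
between_clades = differing_positions(GENOME_BP, 998)      # 99.8% identity: about this many

print(within_clade_limit, between_clades)
```

Under these assumptions, isolates within a clade differ at fewer than about 125 positions, while isolates from different clades differ at roughly 250 positions.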
Evidence of transoceanic dispersion of the genus Vanilla based on plastid DNA phylogenetic analysis.
Bouetard, Anthony; Lefeuvre, Pierre; Gigant, Rodolphe; Bory, Séverine; Pignal, Marc; Besse, Pascale; Grisoni, Michel
2010-05-01
The phylogeny and biogeographical history of the genus Vanilla were investigated using four chloroplast genes (psbB, psbC, psaB and rbcL) from 47 accessions of Vanilla chosen from the ex situ CIRAD collection maintained on Reunion Island, together with additional sequences from GenBank. Bayesian methods provided a fairly well-supported reconstruction of the phylogeny of the subfamily Vanilloideae and, more particularly, of the genus Vanilla. Three major phylogenetic groups were differentiated within the genus, which disagrees with the current classification into two sections (Foliosae and Aphyllae) based on morphological traits. Recent Bayesian relaxed molecular clock methods made it possible to test the two main hypotheses on the phylogeography of the genus Vanilla. An early radiation of the genus, with diversification by vicariance following the break-up of Gondwana 95 million years ago (Mya), was incompatible with the accepted age of origin of the angiosperms. Based on the recently estimated age of the Vanilloideae of 71 Mya, we conclude that the genus Vanilla would have appeared approximately 34 Mya in South America, when the continents were already separated. Nevertheless, under either of the two extreme scenarios tested, at least three long-distance migration events are needed to explain the present distribution of Vanilla species in tropical areas. These transoceanic dispersals could have occurred via a transoceanic passageway such as the Rio Grande Ridge, with the involvement of floating vegetation mats and migratory birds.
Pangaea and the Out-of-Africa Model of Varicella-Zoster Virus Evolution and Phylogeography
2012-01-01
The goal of this minireview is to provide an overview of varicella-zoster virus (VZV) phylogenetics and phylogeography when placed in the broad context of geologic time. Planet Earth was formed over 4 billion years ago, and the supercontinent Pangaea coalesced around 400 million years ago (mya). Based on detailed tree-building models, the base of the phylogenetic tree of the Herpesviridae family has been estimated at 400 mya. Subsequently, Pangaea split into Laurasia and Gondwanaland; in turn, Africa rifted from Gondwanaland. Based on available data, the hypothesis of this minireview is that the ancestral alphaherpesvirus VZV coevolved in simians, apes, and hominins in Africa. When anatomically modern humans first crossed over the Red Sea 60,000 years ago, VZV was carried along in their dorsal root ganglia. Currently, there are five VZV clades, distinguishable by single nucleotide polymorphisms. These clades likely represent continued VZV coevolution, as humans with latent VZV infection left Arabia and dispersed into Asia (clades 2 and 5) and Europe (clades 1, 3, and 4). The prototype VZV sequence contains nearly 125,000 bp, divided into 70 open reading frames. Generally, isolates within a clade display >99.9% identity to one another, while members of one clade compared to a second clade show 99.8% identity to one another. Recently, four different VZV genotypes that do not segregate into the previously defined five clades have been identified, a result indicating a wider than anticipated diversity among newly collected VZV strains around the world. PMID:22761371
Toussaint, Emmanuel F A; Morinière, Jérôme; Müller, Chris J; Kunte, Krushnamegh; Turlin, Bernard; Hausmann, Axel; Balke, Michael
2015-10-01
The charismatic tropical Polyura Nawab butterflies are distributed across twelve biodiversity hotspots in the Indomalayan/Australasian archipelago. In this study, we tested an array of species delimitation methods and compared the results to existing morphology-based taxonomy. We sequenced two mitochondrial and two nuclear gene fragments to reconstruct phylogenetic relationships within Polyura using both Bayesian inference and maximum likelihood. Based on this phylogenetic framework, we used the recently introduced bGMYC, BPP and PTP methods to investigate species boundaries. Based on our results, we describe two new species, Polyura paulettae Toussaint sp. n. and Polyura smilesi Toussaint sp. n., propose one synonym, and raise five populations to species status. Most of the newly recognized species are single-island endemics, likely resulting from the recent and highly complex geological history of the Indomalayan-Australasian archipelago. Surprisingly, we also find two newly recognized species in the Indomalayan region, where additional biotic or abiotic factors have fostered speciation. Species delimitation methods were largely congruent and succeeded in cross-validating most extant morphological species. PTP and BPP seem to yield more consistent and robust estimates of species boundaries with respect to morphological characters, while bGMYC delivered contrasting results depending on the gene trees considered. Our findings demonstrate the efficiency of comparative approaches using molecular species delimitation methods on empirical data. They also pave the way for the investigation of less well-known groups to unveil patterns of species richness and catalogue Earth's concealed, and therefore unappreciated, diversity.
Kent, Angela D.; Smith, Dan J.; Benson, Barbara J.; Triplett, Eric W.
2003-01-01
Culture-independent DNA fingerprints are commonly used to assess the diversity of a microbial community. However, relating species composition to community profiles produced by community fingerprint methods is not straightforward. Terminal restriction fragment length polymorphism (T-RFLP) is a community fingerprint method in which phylogenetic assignments may be inferred from the terminal restriction fragment (T-RF) sizes through the use of web-based resources that predict T-RF sizes for known bacteria. The process quickly becomes computationally intensive due to the need to analyze profiles produced by multiple restriction digests and the complexity of profiles generated by natural microbial communities. A web-based tool is described here that rapidly generates phylogenetic assignments from submitted community T-RFLP profiles based on a database of fragments produced by known 16S rRNA gene sequences. Users have the option of submitting a customized database generated from unpublished sequences or from a gene other than the 16S rRNA gene. This phylogenetic assignment tool allows users to employ T-RFLP to simultaneously analyze microbial community diversity and species composition. An analysis of the variability of bacterial species composition throughout the water column in a humic lake was carried out to demonstrate the functionality of the phylogenetic assignment tool. This method was validated by comparing the results generated by this program with results from a 16S rRNA gene clone library. PMID:14602639
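The matching step described in this abstract, predicting T-RF sizes from known sequences and comparing them with observed profiles from several digests, can be sketched in Python as follows. This is a simplified illustration and not the published tool: it assumes each database sequence starts at the 5' end of the labeled primer, takes the T-RF size to be the distance from that end to the first cut site, and uses hypothetical toy sequences, function names, and a hypothetical size tolerance.

```python
# Recognition site and cut offset within the site (MspI: C^CGG, RsaI: GT^AC).
ENZYMES = {"MspI": ("CCGG", 1), "RsaI": ("GTAC", 2)}


def predict_trf(seq, enzyme):
    """Predicted terminal fragment length in bases, or None if the enzyme never cuts."""
    site, offset = ENZYMES[enzyme]
    pos = seq.upper().find(site)
    if pos < 0:
        return None
    return pos + offset


def assign(observed, database, tolerance=1):
    """Return database entries whose predicted T-RFs match every observed digest.

    observed: {enzyme name: observed fragment size}
    database: {taxon name: sequence starting at the labeled primer}
    """
    hits = []
    for name, seq in database.items():
        matches_all = True
        for enzyme, size in observed.items():
            predicted = predict_trf(seq, enzyme)
            if predicted is None or abs(predicted - size) > tolerance:
                matches_all = False
                break
        if matches_all:
            hits.append(name)
    return hits


# Hypothetical toy sequences, far shorter than real 16S rRNA genes.
database = {
    "taxon_X": "AAAACCGGTTTTGTACAA",
    "taxon_Y": "AACCGGTTTTTTTTGTACAA",
}

print(assign({"MspI": 5, "RsaI": 14}, database))
```

Requiring agreement across several digests is what narrows the candidate list, since many unrelated sequences can share a fragment size for any single enzyme.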