2017-01-01
The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts of data are analyzed. In the recent decade, alignment-free approaches have enabled genome-scale phylogenetic inference. Here we evaluate three alignment-free methods: ACS, CVTree, and Kr for phylogenetic inference with 16s rRNA gene data. We use a taxonomic gold standard to compare the accuracy of alignment-free phylogenetic inference with that of common microbiome-wide phylogenetic inference pipelines based on PyNAST and MUSCLE alignments with FastTree and RAxML. We re-simulate fecal communities from Human Microbiome Project data to evaluate the performance of the methods on datasets with properties of real data. Our comparisons show that alignment-free methods are not inferior to alignment-based methods in giving accurate and robust phylogenic trees. Moreover, consensus ensembles of alignment-free phylogenies are superior to those built from alignment-based methods in their ability to highlight community differences in low power settings. In addition, the overall running times of alignment-based and alignment-free phylogenetic inference are comparable. Taken together our empirical results suggest that alignment-free methods provide a viable approach for microbiome-wide phylogenetic inference. PMID:29136663
Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution
Kendall, Michelle; Colijn, Caroline
2016-01-01
Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Key words: phylogenetics, evolution, tree metrics, genetics, sequencing. PMID:27343287
Phylogenetic overdispersion of plant species in southern Brazilian savannas.
Silva, I A; Batalha, M A
2009-08-01
Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutionary processes. Based on these assumptions, we may infer mechanisms of contemporary coexistence from the phylogenetic relationships of the species in a community. We studied the phylogenetic structure of plant communities in four cerrado sites, in southeastern Brazil. We calculated two raw phylogenetic distances among the species sampled. We estimated the phylogenetic structure by comparing the observed phylogenetic distances to the distribution of phylogenetic distances in null communities. We obtained null communities by randomizing the phylogenetic relationships of the regional pool of species. We found a phylogenetic overdispersion of the cerrado species. Phylogenetic overdispersion has several explanations, depending on the phylogenetic history of traits and contemporary ecological interactions. However, based on coexistence models between grasses and trees, density-dependent ecological forces, and the evolutionary history of the cerrado flora, we argue that the phylogenetic overdispersion of cerrado species is predominantly due to competitive interactions, herbivores and pathogen attacks, and ecological speciation. Future studies will need to include information on the phylogenetic history of plant traits.
Genomic Repeat Abundances Contain Phylogenetic Signal
Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.
2015-01-01
A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464
Buchwalter, David B; Cain, Daniel J; Martin, Caitrin A; Xie, Lingtian; Luoma, Samuel N; Garland, Theodore
2008-06-17
We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature.
Buchwalter, David B.; Cain, Daniel J.; Martin, Caitrin A.; Xie, Lingtian; Luoma, Samuel N.; Garland, Theodore
2008-01-01
We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. PMID:18559853
Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.
Kendall, Michelle; Colijn, Caroline
2016-10-01
Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. phylogenetics, evolution, tree metrics, genetics, sequencing. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks.
Oh, S June; Joung, Je-Gun; Chang, Jeong-Ho; Zhang, Byoung-Tak
2006-06-06
To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees. To compare the structures of metabolic networks in organisms, we adopted the exponential graph kernel, which is a kernel-based approach with a labeled graph that includes a label matrix and an adjacency matrix. To construct the phylogenetic trees, we used an unweighted pair-group method with arithmetic mean, i.e., a hierarchical clustering algorithm. We applied the kernel-based network profiling method in a comparative analysis of nine carbohydrate metabolic networks from 81 biological species encompassing Archaea, Eukaryota, and Eubacteria. The resulting phylogenetic hierarchies generally support the tripartite scheme of three domains rather than the two domains of prokaryotes and eukaryotes. By combining the kernel machines with metabolic information, the method infers the context of biosphere development that covers physiological events required for adaptation by genetic reconstruction. The results show that one may obtain a global view of the tree of life by comparing the metabolic pathway structures using meta-level information rather than sequence information. This method may yield further information about biological evolution, such as the history of horizontal transfer of each gene, by studying the detailed structure of the phylogenetic tree constructed by the kernel-based method.
Distance-Based Phylogenetic Methods Around a Polytomy.
Davidson, Ruth; Sullivant, Seth
2014-01-01
Distance-based phylogenetic algorithms attempt to solve the NP-hard least-squares phylogeny problem by mapping an arbitrary dissimilarity map representing biological data to a tree metric. The set of all dissimilarity maps is a Euclidean space properly containing the space of all tree metrics as a polyhedral fan. Outputs of distance-based tree reconstruction algorithms such as UPGMA and neighbor-joining are points in the maximal cones in the fan. Tree metrics with polytomies lie at the intersections of maximal cones. A phylogenetic algorithm divides the space of all dissimilarity maps into regions based upon which combinatorial tree is reconstructed by the algorithm. Comparison of phylogenetic methods can be done by comparing the geometry of these regions. We use polyhedral geometry to compare the local nature of the subdivisions induced by least-squares phylogeny, UPGMA, and neighbor-joining when the true tree has a single polytomy with exactly four neighbors. Our results suggest that in some circumstances, UPGMA and neighbor-joining poorly match least-squares phylogeny.
Buchwalter, D.B.; Cain, D.J.; Martin, C.A.; Xie, Lingtian; Luoma, S.N.; Garland, T.
2008-01-01
We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. ?? 2008 by The National Academy of Sciences of the USA.
Zhang, Peng
2012-01-01
Background Universal nuclear protein-coding locus (NPCL) markers that are applicable across diverse taxa and show good phylogenetic discrimination have broad applications in molecular phylogenetic studies. For example, RAG1, a representative NPCL marker, has been successfully used to make phylogenetic inferences within all major osteichthyan groups. However, such markers with broad working range and high phylogenetic performance are still scarce. It is necessary to develop more universal NPCL markers comparable to RAG1 for osteichthyan phylogenetics. Methodology/Principal Findings We developed three long universal NPCL markers (>1.6 kb each) based on single-copy nuclear genes (KIAA1239, SACS and TTN) that possess large exons and exhibit the appropriate evolutionary rates. We then compared their phylogenetic utilities with that of the reference marker RAG1 in 47 jawed vertebrate species. In comparison with RAG1, each of the three long universal markers yielded similar topologies and branch supports, all in congruence with the currently accepted osteichthyan phylogeny. To compare their phylogenetic performance visually, we also estimated the phylogenetic informativeness (PI) profile for each of the four long universal NPCL markers. The PI curves indicated that SACS performed best over the whole timescale, while RAG1, KIAA1239 and TTN exhibited similar phylogenetic performances. In addition, we compared the success of nested PCR and standard PCR when amplifying NPCL marker fragments. The amplification success rate and efficiency of the nested PCR were overwhelmingly higher than those of standard PCR. Conclusions/Significance Our work clearly demonstrates the superiority of nested PCR over the conventional PCR in phylogenetic studies and develops three long universal NPCL markers (KIAA1239, SACS and TTN) with the nested PCR strategy. The three markers exhibit high phylogenetic utilities in osteichthyan phylogenetics and can be widely used as pilot genes for phylogenetic questions of osteichthyans at different taxonomic levels. PMID:22720083
A methodological investigation of hominoid craniodental morphology and phylogenetics.
Bjarnason, Alexander; Chamberlain, Andrew T; Lockwood, Charles A
2011-01-01
The evolutionary relationships of extant great apes and humans have been largely resolved by molecular studies, yet morphology-based phylogenetic analyses continue to provide conflicting results. In order to further investigate this discrepancy we present bootstrap clade support of morphological data based on two quantitative datasets, one dataset consisting of linear measurements of the whole skull from 5 hominoid genera and the second dataset consisting of 3D landmark data from the temporal bone of 5 hominoid genera, including 11 sub-species. Using similar protocols for both datasets, we were able to 1) compare distance-based phylogenetic methods to cladistic parsimony of quantitative data converted into discrete character states, 2) vary outgroup choice to observe its effect on phylogenetic inference, and 3) analyse male and female data separately to observe the effect of sexual dimorphism on phylogenies. Phylogenetic analysis was sensitive to methodological decisions, particularly outgroup selection, where designation of Pongo as an outgroup and removal of Hylobates resulted in greater congruence with the proposed molecular phylogeny. The performance of distance-based methods also justifies their use in phylogenetic analysis of morphological data. It is clear from our analyses that hominoid phylogenetics ought not to be used as an example of conflict between the morphological and molecular, but as an example of how outgroup and methodological choices can affect the outcome of phylogenetic analysis. Copyright © 2010 Elsevier Ltd. All rights reserved.
Okamoto, Takuji; Maruyama, Akihiko; Imura, Satoshi; Takeyama, Haruko; Naganuma, Takeshi
2004-05-01
Halomonas variabilis and phylogenetically related organisms were isolated from various habitats such as Antarctic terrain and saline ponds, deep-sea sediment, deep-sea waters affected by hydrothermal plumes, and hydrothermal vent fluids. Ten strains were selected for physiological and phylogenetic characterization in detail. All of those strains were found to be piezotolerant and psychrotolerant, as well as euryhaline halophilic or halotolerant. Their stress tolerance may facilitate their wide occurrence, even in so-called extreme environments. The 16S rDNA-based phylogenetic relationship was complemented by analyses of the DNA gyrase subunit B gene (gyrB) and genes involved in the synthesis of the major compatible solute, ectoine: diaminobutyric acid aminotransferase gene (ectB) and ectoine synthase gene (ectC). The phylogenetic relationships of H. variabilis and related organisms were very similar in terms of 16S rDNA, gyrB, and ectB. The ectC-based tree was inconsistent with the other phylogenetic trees. For that reason, ectC was inferred to derive from horizontal transfer.
Klassen, Jonathan L.
2010-01-01
Background Carotenoids are multifunctional, taxonomically widespread and biotechnologically important pigments. Their biosynthesis serves as a model system for understanding the evolution of secondary metabolism. Microbial carotenoid diversity and evolution has hitherto been analyzed primarily from structural and biosynthetic perspectives, with the few phylogenetic analyses of microbial carotenoid biosynthetic proteins using either used limited datasets or lacking methodological rigor. Given the recent accumulation of microbial genome sequences, a reappraisal of microbial carotenoid biosynthetic diversity and evolution from the perspective of comparative genomics is warranted to validate and complement models of microbial carotenoid diversity and evolution based upon structural and biosynthetic data. Methodology/Principal Findings Comparative genomics were used to identify and analyze in silico microbial carotenoid biosynthetic pathways. Four major phylogenetic lineages of carotenoid biosynthesis are suggested composed of: (i) Proteobacteria; (ii) Firmicutes; (iii) Chlorobi, Cyanobacteria and photosynthetic eukaryotes; and (iv) Archaea, Bacteroidetes and two separate sub-lineages of Actinobacteria. Using this phylogenetic framework, specific evolutionary mechanisms are proposed for carotenoid desaturase CrtI-family enzymes and carotenoid cyclases. Several phylogenetic lineage-specific evolutionary mechanisms are also suggested, including: (i) horizontal gene transfer; (ii) gene acquisition followed by differential gene loss; (iii) co-evolution with other biochemical structures such as proteorhodopsins; and (iv) positive selection. Conclusions/Significance Comparative genomics analyses of microbial carotenoid biosynthetic proteins indicate a much greater taxonomic diversity then that identified based on structural and biosynthetic data, and divides microbial carotenoid biosynthesis into several, well-supported phylogenetic lineages not evident previously. This phylogenetic framework is applicable to understanding the evolution of specific carotenoid biosynthetic proteins or the unique characteristics of carotenoid biosynthetic evolution in a specific phylogenetic lineage. Together, these analyses suggest a “bramble” model for microbial carotenoid biosynthesis whereby later biosynthetic steps exhibit greater evolutionary plasticity and reticulation compared to those closer to the biosynthetic “root”. Structural diversification may be constrained (“trimmed”) where selection is strong, but less so where selection is weaker. These analyses also highlight likely productive avenues for future research and bioprospecting by identifying both gaps in current knowledge and taxa which may particularly facilitate carotenoid diversification. PMID:20582313
One tree to link them all: a phylogenetic dataset for the European tetrapoda.
Roquet, Cristina; Lavergne, Sébastien; Thuiller, Wilfried
2014-08-08
Since the ever-increasing availability of phylogenetic informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies are still lacking for many large groups and regions, which are necessary for comprehensive large-scale eco-phylogenetic analyses. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of supermatrix applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available on the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data was available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty.
Phylogenetic patterns of climatic, habitat and trophic niches in a European avian assemblage
Pearman, Peter B; Lavergne, Sébastien; Roquet, Cristina; Wüest, Rafael; Zimmermann, Niklaus E; Thuiller, Wilfried
2014-01-01
Aim The origins of ecological diversity in continental species assemblages have long intrigued biogeographers. We apply phylogenetic comparative analyses to disentangle the evolutionary patterns of ecological niches in an assemblage of European birds. We compare phylogenetic patterns in trophic, habitat and climatic niche components. Location Europe. Methods From polygon range maps and handbook data we inferred the realized climatic, habitat and trophic niches of 405 species of breeding birds in Europe. We fitted Pagel's lambda and kappa statistics, and conducted analyses of disparity through time to compare temporal patterns of ecological diversification on all niche axes together. All observed patterns were compared with expectations based on neutral (Brownian) models of niche divergence. Results In this assemblage, patterns of phylogenetic signal (lambda) suggest that related species resemble each other less in regard to their climatic and habitat niches than they do in their trophic niche. Kappa estimates show that ecological divergence does not gradually increase with divergence time, and that this punctualism is stronger in climatic niches than in habitat and trophic niches. Observed niche disparity markedly exceeds levels expected from a Brownian model of ecological diversification, thus providing no evidence for past phylogenetic niche conservatism in these multivariate niches. Levels of multivariate disparity are greatest for the climatic niche, followed by disparity of the habitat and the trophic niches. Main conclusions Phylogenetic patterns in the three niche components differ within this avian assemblage. Variation in evolutionary rates (degree of gradualism, constancy through the tree) and/or non-random macroecological sampling probably lead here to differences in the phylogenetic structure of niche components. Testing hypotheses on the origin of these patterns requires more complete phylogenetic trees of the birds, and extended ecological data on different niche components for all bird species. PMID:24790525
Inferring Phylogenetic Networks Using PhyloNet.
Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay
2018-07-01
PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyzes and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.
YBYRÁ facilitates comparison of large phylogenetic trees.
Machado, Denis Jacob
2015-07-01
The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, specially for large data sets. The programs that comprises this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .
Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter
2017-12-01
Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.
The phylogenetic roots of human lethal violence.
Gómez, José María; Verdú, Miguel; González-Megías, Adela; Méndez, Marcos
2016-10-13
The psychological, sociological and evolutionary roots of conspecific violence in humans are still debated, despite attracting the attention of intellectuals for over two millennia. Here we propose a conceptual approach towards understanding these roots based on the assumption that aggression in mammals, including humans, has a significant phylogenetic component. By compiling sources of mortality from a comprehensive sample of mammals, we assessed the percentage of deaths due to conspecifics and, using phylogenetic comparative tools, predicted this value for humans. The proportion of human deaths phylogenetically predicted to be caused by interpersonal violence stood at 2%. This value was similar to the one phylogenetically inferred for the evolutionary ancestor of primates and apes, indicating that a certain level of lethal violence arises owing to our position within the phylogeny of mammals. It was also similar to the percentage seen in prehistoric bands and tribes, indicating that we were as lethally violent then as common mammalian evolutionary history would predict. However, the level of lethal violence has changed through human history and can be associated with changes in the socio-political organization of human populations. Our study provides a detailed phylogenetic and historical context against which to compare levels of lethal violence observed throughout our history.
Nodal distances for rooted phylogenetic trees.
Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel
2010-08-01
Dissimilarity measures for (possibly weighted) phylogenetic trees based on the comparison of their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem, by splitting in a suitable way each path length between two taxa into two lengths. We prove that the resulting splitted path lengths matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces M(n)(R) of real-valued n x n matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using L(p) metrics on M(n)(R), with p [epsilon] R(>0).
Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree
NASA Astrophysics Data System (ADS)
Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.
2017-02-01
Representation is a very important communication tool to communicate scientific concepts. Biologists produce phylogenetic representation to express their understanding of evolutionary relationships. The phylogenetic tree is visual representation depict a hypothesis about the evolutionary relationship and widely used in the biological sciences. Phylogenetic tree currently growing for many disciplines in biology. Consequently, learning about phylogenetic tree become an important part of biological education and an interesting area for biology education research. However, research showed many students often struggle with interpreting the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The method of this study is a descriptive method. In this study, we used questionnaires, interviews, multiple choice and open-ended questions, reflective journals and observations. The findings showed students experiencing difficulties, especially in constructing a phylogenetic tree. The students’ responds indicated that main reasons for difficulties in constructing a phylogenetic tree are difficult to placing taxa in a phylogenetic tree based on the data provided so that the phylogenetic tree constructed does not describe the actual evolutionary relationship (incorrect relatedness). Students also have difficulties in determining the sister group, character synapomorphy, autapomorphy from data provided (character table) and comparing among phylogenetic tree. According to them building the phylogenetic tree is more difficult than reading the phylogenetic tree. Finding this studies provide information to undergraduate instructor and students to overcome learning difficulties of reading and constructing phylogenetic tree.
Comparative transcriptomics of early dipteran development
2013-01-01
Background Modern sequencing technologies have massively increased the amount of data available for comparative genomics. Whole-transcriptome shotgun sequencing (RNA-seq) provides a powerful basis for comparative studies. In particular, this approach holds great promise for emerging model species in fields such as evolutionary developmental biology (evo-devo). Results We have sequenced early embryonic transcriptomes of two non-drosophilid dipteran species: the moth midge Clogmia albipunctata, and the scuttle fly Megaselia abdita. Our analysis includes a third, published, transcriptome for the hoverfly Episyrphus balteatus. These emerging models for comparative developmental studies close an important phylogenetic gap between Drosophila melanogaster and other insect model systems. In this paper, we provide a comparative analysis of early embryonic transcriptomes across species, and use our data for a phylogenomic re-evaluation of dipteran phylogenetic relationships. Conclusions We show how comparative transcriptomics can be used to create useful resources for evo-devo, and to investigate phylogenetic relationships. Our results demonstrate that de novo assembly of short (Illumina) reads yields high-quality, high-coverage transcriptomic data sets. We use these data to investigate deep dipteran phylogenetic relationships. Our results, based on a concatenation of 160 orthologous genes, provide support for the traditional view of Clogmia being the sister group of Brachycera (Megaselia, Episyrphus, Drosophila), rather than that of Culicomorpha (which includes mosquitoes and blackflies). PMID:23432914
Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John
2014-01-01
Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723
Comparing Mycobacterium tuberculosis genomes using genome topology networks.
Jiang, Jianping; Gu, Jianlei; Zhang, Liang; Zhang, Chenyi; Deng, Xiao; Dou, Tonghai; Zhao, Guoping; Zhou, Yan
2015-02-14
Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene duplication and genome rearrangement, can lead to different phenotypes among strains, and an investigation of genes affected by SVs may extend our knowledge of the relationships between SVs and phenotypes in microbes, especially in pathogenic bacteria. In this work, we introduce a 'Genome Topology Network' (GTN) method based on gene homology and gene locations to analyze genomic SVs and perform phylogenetic analysis. Furthermore, the concept of 'unfixed ortholog' has been proposed, whose members are affected by SVs in genome topology among close species. To improve the precision of 'unfixed ortholog' recognition, a strategy to detect annotation differences and complete gene annotation was applied. To assess the GTN method, a set of thirteen complete M. tuberculosis genomes was analyzed as a case study. GTNs with two different gene homology-assigning methods were built, the Clusters of Orthologous Groups (COG) method and the orthoMCL clustering method, and two phylogenetic trees were constructed accordingly, which may provide additional insights into whole genome-based phylogenetic analysis. We obtained 24 unfixable COG groups, of which most members were related to immunogenicity and drug resistance, such as PPE-repeat proteins (COG5651) and transcriptional regulator TetR gene family members (COG1309). The GTN method has been implemented in PERL and released on our website. The tool can be downloaded from http://homepage.fudan.edu.cn/zhouyan/gtn/ , and allows re-annotating the 'lost' genes among closely related genomes, analyzing genes affected by SVs, and performing phylogenetic analysis. With this tool, many immunogenic-related and drug resistance-related genes were found to be affected by SVs in M. tuberculosis genomes. We believe that the GTN method will be suitable for the exploration of genomic SVs in connection with biological features of bacterial strains, and that GTN-based phylogenetic analysis will provide additional insights into whole genome-based phylogenetic analysis.
Phylogenetic search through partial tree mixing
2012-01-01
Background Recent advances in sequencing technology have created large data sets upon which phylogenetic inference can be performed. Current research is limited by the prohibitive time necessary to perform tree search on a reasonable number of individuals. This research develops new phylogenetic algorithms that can operate on tens of thousands of species in a reasonable amount of time through several innovative search techniques. Results When compared to popular phylogenetic search algorithms, better trees are found much more quickly for large data sets. These algorithms are incorporated in the PSODA application available at http://dna.cs.byu.edu/psoda Conclusions The use of Partial Tree Mixing in a partition based tree space allows the algorithm to quickly converge on near optimal tree regions. These regions can then be searched in a methodical way to determine the overall optimal phylogenetic solution. PMID:23320449
Kim, Jiyeon; Kern, Elizabeth; Kim, Taeho; Sim, Mikang; Kim, Jaebum; Kim, Yuseob; Park, Chungoo; Nadler, Steven A; Park, Joong-Ki
2017-02-01
Plectida is an important nematode order with species that occupy many different biological niches. The order includes free-living aquatic and soil-dwelling species, but its phylogenetic position has remained uncertain. We sequenced the complete mitochondrial genomes of two members of this order, Plectus acuminatus and Plectus aquatilis and compared them with those of other major nematode clades. The genome size and base composition of these species are similar to other nematodes; 14,831 and 14,372bp, respectively, with AT contents of 71.0% and 70.1%. Gene content was also similar to other nematodes, but gene order and coding direction of Plectus mtDNAs were dissimilar from other chromadorean species. P. acuminatus and P. aquatilis are the first chromadorean species found to contain a gene inversion. We reconstructed mitochondrial genome phylogenetic trees using nucleotide and amino acid datasets from 87 nematodes that represent major nematode clades, including the Plectus sequences. Trees from phylogenetic analyses using maximum likelihood and Bayesian methods depicted Plectida as the sister group to other sequenced chromadorean nematodes. This finding is consistent with several phylogenetic results based on SSU rDNA, but disagrees with a classification based on morphology. Mitogenomes representing other basal chromadorean groups (Araeolaimida, Monhysterida, Desmodorida, Chromadorida) are needed to confirm their phylogenetic relationships. Copyright © 2016 Elsevier Inc. All rights reserved.
Piccin-Santos, Viviane; Brandão, Marcelo Mendes; Bittencourt-Oliveira, Maria Do Carmo
2014-08-01
Selection of genes that have not been horizontally transferred for prokaryote phylogenetic inferences is regarded as a challenging task. The markers internal transcribed spacer of ribosomal genes (16S-23S ITS) and phycocyanin intergenic spacer (PC-IGS), based on the operons of ribosomal and phycocyanin genes respectively, are among the most used markers in cyanobacteria. The region of the ribosomal genes has been considered stable, whereas the phycocyanin operon may have undergone horizontal transfer. To investigate the occurrence of horizontal transfer of PC-IGS, phylogenetic trees of Geitlerinema and Microcystis strains were generated using PC-IGS and 16S-23S ITS and compared. Phylogenetic trees based on the two markers were mostly congruent for Geitlerinema and Microcystis, indicating a common evolutionary history among ribosomal and phycocyanin genes with no evidence for horizontal transfer of PC-IGS. Thus, PC-IGS is a suitable marker, along with 16S-23S ITS for phylogenetic studies of cyanobacteria. © 2014 Phycological Society of America.
Multivariate Phylogenetic Comparative Methods: Evaluations, Comparisons, and Recommendations.
Adams, Dean C; Collyer, Michael L
2018-01-01
Recent years have seen increased interest in phylogenetic comparative analyses of multivariate data sets, but to date the varied proposed approaches have not been extensively examined. Here we review the mathematical properties required of any multivariate method, and specifically evaluate existing multivariate phylogenetic comparative methods in this context. Phylogenetic comparative methods based on the full multivariate likelihood are robust to levels of covariation among trait dimensions and are insensitive to the orientation of the data set, but display increasing model misspecification as the number of trait dimensions increases. This is because the expected evolutionary covariance matrix (V) used in the likelihood calculations becomes more ill-conditioned as trait dimensionality increases, and as evolutionary models become more complex. Thus, these approaches are only appropriate for data sets with few traits and many species. Methods that summarize patterns across trait dimensions treated separately (e.g., SURFACE) incorrectly assume independence among trait dimensions, resulting in nearly a 100% model misspecification rate. Methods using pairwise composite likelihood are highly sensitive to levels of trait covariation, the orientation of the data set, and the number of trait dimensions. The consequences of these debilitating deficiencies are that a user can arrive at differing statistical conclusions, and therefore biological inferences, simply from a dataspace rotation, like principal component analysis. By contrast, algebraic generalizations of the standard phylogenetic comparative toolkit that use the trace of covariance matrices are insensitive to levels of trait covariation, the number of trait dimensions, and the orientation of the data set. Further, when appropriate permutation tests are used, these approaches display acceptable Type I error and statistical power. We conclude that methods summarizing information across trait dimensions, as well as pairwise composite likelihood methods should be avoided, whereas algebraic generalizations of the phylogenetic comparative toolkit provide a useful means of assessing macroevolutionary patterns in multivariate data. Finally, we discuss areas in which multivariate phylogenetic comparative methods are still in need of future development; namely highly multivariate Ornstein-Uhlenbeck models and approaches for multivariate evolutionary model comparisons. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Kawaida, Hitomi; Ohba, Kohki; Koutake, Yuhki; Shimizu, Hiroshi; Tachida, Hidenori; Kobayakawa, Yoshitaka
2013-03-01
Although many physiological studies have been reported on the symbiosis between hydra and green algae, very little information from a molecular phylogenetic aspect of symbiosis is available. In order to understand the origin and evolution of symbiosis between the two organisms, we compared the phylogenetic relationships among symbiotic green algae with the phylogenetic relationships among host hydra strains. To do so, we reconstructed molecular phylogenetic trees of several strains of symbiotic chlorella harbored in the endodermal epithelial cells of viridissima group hydra strains and investigated their congruence with the molecular phylogenetic trees of the host hydra strains. To examine the species specificity between the host and the symbiont with respect to the genetic distance, we also tried to introduce chlorella strains into two aposymbiotic strains of viridissima group hydra in which symbiotic chlorella had been eliminated in advance. We discussed the origin and history of symbiosis between hydra and green algae based on the analysis. Copyright © 2012 Elsevier Inc. All rights reserved.
16S and 23S plastid rDNA phylogenies of Prototheca species and their auxanographic phenotypes.
Ewing, Aren; Brubaker, Shane; Somanchi, Aravind; Yu, Esther; Rudenko, George; Reyes, Nina; Espina, Karen; Grossman, Arthur; Franklin, Scott
2014-08-01
Because algae have become more accepted as sources of human nutrition, phylogenetic analysis can help resolve the taxonomy of taxa that have not been well studied. This can help establish algal evolutionary relationships. Here, we compare Auxenochlorella protothecoides and 23 strains of Prototheca based on their complete 16S and partial 23S plastid rDNA sequences along with nutrient utilization (auxanographic) profiles. These data demonstrate that some of the species groupings are not in agreement with the molecular phylogenetic analyses and that auxanographic profiles are poor predictors of phylogenetic relationships.
16S and 23S plastid rDNA phylogenies of Prototheca species and their auxanographic phenotypes1
Ewing, Aren; Brubaker, Shane; Somanchi, Aravind; Yu, Esther; Rudenko, George; Reyes, Nina; Espina, Karen; Grossman, Arthur; Franklin, Scott
2014-01-01
Because algae have become more accepted as sources of human nutrition, phylogenetic analysis can help resolve the taxonomy of taxa that have not been well studied. This can help establish algal evolutionary relationships. Here, we compare Auxenochlorella protothecoides and 23 strains of Prototheca based on their complete 16S and partial 23S plastid rDNA sequences along with nutrient utilization (auxanographic) profiles. These data demonstrate that some of the species groupings are not in agreement with the molecular phylogenetic analyses and that auxanographic profiles are poor predictors of phylogenetic relationships. PMID:25937672
Phylogenetics of modern birds in the era of genomics
Edwards, Scott V; Bryan Jennings, W; Shedlock, Andrew M
2005-01-01
In the 14 years since the first higher-level bird phylogenies based on DNA sequence data, avian phylogenetics has witnessed the advent and maturation of the genomics era, the completion of the chicken genome and a suite of technologies that promise to add considerably to the agenda of avian phylogenetics. In this review, we summarize current approaches and data characteristics of recent higher-level bird studies and suggest a number of as yet untested molecular and analytical approaches for the unfolding tree of life for birds. A variety of comparative genomics strategies, including adoption of objective quality scores for sequence data, analysis of contiguous DNA sequences provided by large-insert genomic libraries, and the systematic use of retroposon insertions and other rare genomic changes all promise an integrated phylogenetics that is solidly grounded in genome evolution. The avian genome is an excellent testing ground for such approaches because of the more balanced representation of single-copy and repetitive DNA regions than in mammals. Although comparative genomics has a number of obvious uses in avian phylogenetics, its application to large numbers of taxa poses a number of methodological and infrastructural challenges, and can be greatly facilitated by a ‘community genomics’ approach in which the modest sequencing throughputs of single PI laboratories are pooled to produce larger, complementary datasets. Although the polymerase chain reaction era of avian phylogenetics is far from complete, the comparative genomics era—with its ability to vastly increase the number and type of molecular characters and to provide a genomic context for these characters—will usher in a host of new perspectives and opportunities for integrating genome evolution and avian phylogenetics. PMID:16024355
Nonbinary Tree-Based Phylogenetic Networks.
Jetten, Laura; van Iersel, Leo
2018-01-01
Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.
Iiyama, Kazuhiro; Otao, Masahiro; Mori, Kazuki; Mon, Hiroaki; Lee, Jae Man; Kusakabe, Takahiro; Tashiro, Kousuke; Asano, Shin-Ichiro; Yasunaga-Aoki, Chisa
2014-01-01
To determine the phylogenetic relationship among Paenibacillus species, putative replication origin regions were compared. In the rsmG-gyrA region, gene arrangements in Paenibacillus species were identical to those of Bacillus species, with the exception of an open reading frame (orf14) positioned between gyrB and gyrA, which was observed only in Paenibacillus species. The orf14 product was homologous to the endospore-associated proteins YheC and YheD of Bacillus subtilis. Phylogenetic analysis based on the YheCD proteins suggested that Orf14 could be categorized into the YheC group. In the Paenibacillus genome, DnaA box clusters were found in rpmH-dnaA and dnaA-dnaN intergenic regions, known as box regions C and R, respectively; this localization was similar to that observed in B. halodurans. A phylogenetic tree based on the nucleotide sequences of the whole replication origin regions suggested that P. popilliae, P. thiaminolyticus, and P. dendritiformis are closely related species.
Wang, Shuai; Wei, Wei; Luo, Xuenong; Cai, Xuepeng
2014-01-01
Besides the complete genome, different partial genomic sequences of Hepatitis E virus (HEV) have been used in genotyping studies, making it difficult to compare the results based on them. No commonly agreed partial region for HEV genotyping has been determined. In this study, we used a statistical method to evaluate the phylogenetic performance of each partial genomic sequence from a genome wide, by comparisons of evolutionary distances between genomic regions and the full-length genomes of 101 HEV isolates to identify short genomic regions that can reproduce HEV genotype assignments based on full-length genomes. Several genomic regions, especially one genomic region at the 3'-terminal of the papain-like cysteine protease domain, were detected to have relatively high phylogenetic correlations with the full-length genome. Phylogenetic analyses confirmed the identical performances between these regions and the full-length genome in genotyping, in which the HEV isolates involved could be divided into reasonable genotypes. This analysis may be of value in developing a partial sequence-based consensus classification of HEV species.
Phylogenetic congruence between subtropical trees and their associated fungi.
Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao
2016-12-01
Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi in has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK , rbcL , atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.
Endress, Peter K.; Davis, Charles C.; Matthews, Merran L.
2013-01-01
Background and Aims Malpighiales are one of the largest angiosperm orders and have undergone radical systematic restructuring based on molecular phylogenetic studies. The clade has been recalcitrant to molecular phylogenetic reconstruction, but has become much more resolved at the suprafamilial level. It now contains so many newly identified clades that there is an urgent need for comparative studies to understand their structure, biology and evolution. This is especially true because the order contains a disproportionally large diversity of rain forest species and includes numerous agriculturally important plants. This study is a first broad systematic step in this endeavour. It focuses on a comparative structural overview of the flowers across all recently identified suprafamilial clades of Malpighiales, and points towards areas that desperately need attention. Methods The phylogenetic comparative analysis of floral structure for the order is based on our previously published studies on four suprafamilial clades of Malpighiales, including also four related rosid orders (Celastrales, Crossosomatales, Cucurbitales, Oxalidales). In addition, the results are compiled from a survey of over 3000 publications on macrosystematics, floral structure and embryology across all orders of the core eudicots. Key Results Most new suprafamilial clades within Malpighiales are well supported by floral structural features. Inner morphological structures of the gynoecium (i.e. stigmatic lobes, inner shape of the locules, placentation, presence of obturators) and ovules (i.e. structure of the nucellus, thickness of the integuments, presence of vascular bundles in the integuments, presence of an endothelium in the inner integument) appear to be especially suitable for characterizing suprafamilial clades within Malpighiales. Conclusions Although the current phylogenetic reconstruction of Malpighiales is much improved compared with earlier versions, it is incomplete, and further focused phylogenetic and morphological studies are needed. Once all major subclades of Malpighiales are elucidated, more in-depth studies on promising structural features can be conducted. In addition, once the phylogenetic tree of Malpighiales, including closely related orders, is more fully resolved, character optimization studies will be possible to reconstruct evolution of structural and biological features within the order. PMID:23486341
Xu, Jinshi; Chen, Yu; Zhang, Lixia; Chai, Yongfu; Wang, Mao; Guo, Yaoxin; Li, Ting; Yue, Ming
2017-07-01
Community assembly processes is the primary focus of community ecology. Using phylogenetic-based and functional trait-based methods jointly to explore these processes along environmental gradients are useful ways to explain the change of assembly mechanisms under changing world. Our study combined these methods to test assembly processes in wide range gradients of elevation and other habitat environmental factors. We collected our data at 40 plots in Taibai Mountain, China, with more than 2,300 m altitude difference in study area and then measured traits and environmental factors. Variance partitioning was used to distinguish the main environment factors leading to phylogeny and traits change among 40 plots. Principal component analysis (PCA) was applied to colligate other environment factors. Community assembly patterns along environmental gradients based on phylogenetic and functional methods were studied for exploring assembly mechanisms. Phylogenetic signal was calculated for each community along environmental gradients in order to detect the variation of trait performance on phylogeny. Elevation showed a better explanatory power than other environment factors for phylogenetic and most traits' variance. Phylogenetic and several functional structure clustered at high elevation while some conserved traits overdispersed. Convergent tendency which might be caused by filtering or competition along elevation was detected based on functional traits. Leaf dry matter content (LDMC) and leaf nitrogen content along PCA 1 axis showed conflicting patterns comparing to patterns showed on elevation. LDMC exhibited the strongest phylogenetic signal. Only the phylogenetic signal of maximum plant height showed explicable change along environmental gradients. Synthesis . Elevation is the best environment factors for predicting phylogeny and traits change. Plant's phylogenetic and some functional structures show environmental filtering in alpine region while it shows different assembly processes in middle- and low-altitude region by different trait/phylogeny. The results highlight deterministic processes dominate community assembly in large-scale environmental gradients. Performance of phylogeny and traits along gradients may be independent with each other. The novel method for calculating functional structure which we used in this study and the focus of phylogenetic signal change along gradients may provide more useful ways to detect community assembly mechanisms.
Rostral horn evolution among agamid lizards of the genus ceratophora endemic to Sri Lanka
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schulte II, James A.; Macey, J. Robert; Pethiyagoda, Rohan
2001-07-10
The first phylogenetic hypothesis for the Sri Lankan agamid lizard genus Ceratophora is presented based on 1670 aligned base positions (472 parsimony informative) of mitochondrial DNA sequences, representing coding regions for eight tRNAs, ND2, and portions of ND1 and COI. Phylogenetic analysis reveals multiple origins and possibly losses of rostral horns in the evolutionary history of Ceratophora. Our data suggest a middle Miocene origin of Ceratophora with the most recent branching of recognized species occurring at the Pliocene/Pleistocene boundary. Haplotype divergence suggests that an outgroup species, Lyriocephalus scutatus, dates at least to the Pliocene. These phylogenetic results provide a frameworkmore » for comparative studies of the behavioral ecological importance of horn evolution in this group.« less
The Independent Evolution Method Is Not a Viable Phylogenetic Comparative Method
2015-01-01
Phylogenetic comparative methods (PCMs) use data on species traits and phylogenetic relationships to shed light on evolutionary questions. Recently, Smaers and Vinicius suggested a new PCM, Independent Evolution (IE), which purportedly employs a novel model of evolution based on Felsenstein’s Adaptive Peak Model. The authors found that IE improves upon previous PCMs by producing more accurate estimates of ancestral states, as well as separate estimates of evolutionary rates for each branch of a phylogenetic tree. Here, we document substantial theoretical and computational issues with IE. When data are simulated under a simple Brownian motion model of evolution, IE produces severely biased estimates of ancestral states and changes along individual branches. We show that these branch-specific changes are essentially ancestor-descendant or “directional” contrasts, and draw parallels between IE and previous PCMs such as “minimum evolution”. Additionally, while comparisons of branch-specific changes between variables have been interpreted as reflecting the relative strength of selection on those traits, we demonstrate through simulations that regressing IE estimated branch-specific changes against one another gives a biased estimate of the scaling relationship between these variables, and provides no advantages or insights beyond established PCMs such as phylogenetically independent contrasts. In light of our findings, we discuss the results of previous papers that employed IE. We conclude that Independent Evolution is not a viable PCM, and should not be used in comparative analyses. PMID:26683838
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy
Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the averagemore » nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.« less
Using tree diversity to compare phylogenetic heuristics.
Sul, Seung-Jin; Matthews, Suzanne; Williams, Tiffani L
2009-04-29
Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees-especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest.
A fully resolved consensus between fully resolved phylogenetic trees.
Quitzau, José Augusto Amgarten; Meidanis, João
2006-03-31
Nowadays, there are many phylogeny reconstruction methods, each with advantages and disadvantages. We explored the advantages of each method, putting together the common parts of trees constructed by several methods, by means of a consensus computation. A number of phylogenetic consensus methods are already known. Unfortunately, there is also a taboo concerning consensus methods, because most biologists see them mainly as comparators and not as phylogenetic tree constructors. We challenged this taboo by defining a consensus method that builds a fully resolved phylogenetic tree based on the most common parts of fully resolved trees in a given collection. We also generated results showing that this consensus is in a way a kind of "median" of the input trees; as such it can be closer to the correct tree in many situations.
Pandey, Ravi S; Saxena, Garima; Bhattacharya, Debashish; Qiu, Huan; Azad, Rajeev K
2017-02-01
Identification of horizontal gene transfers (HGTs) has primarily relied on phylogenetic tree based methods, which require a rich sampling of sequenced genomes to ensure a reliable inference. Because the success of phylogenetic approaches depends on the breadth and depth of the database, researchers usually apply stringent filters to detect only the most likely gene transfers in the genomes of interest. One such study focused on a highly conservative estimate of trans-domain gene transfers in the extremophile eukaryote, Galdieria sulphuraria (Galdieri) Merola (Rhodophyta), by applying multiple filters in their phylogenetic pipeline. This led to the identification of 75 inter-domain acquisitions from Bacteria or Archaea. Because of the evolutionary, ecological, and potential biotechnological significance of foreign genes in algae, alternative approaches and pipelines complementing phylogenetics are needed for a more comprehensive assessment of HGT. We present here a novel pipeline that uncovered 17 novel foreign genes of prokaryotic origin in G. sulphuraria, results that are supported by multiple lines of evidence including composition-based, comparative data, and phylogenetics. These genes encode a variety of potentially adaptive functions, from metabolite transport to DNA repair. © 2016 Phycological Society of America.
2017-01-01
We assessed changes in phylogenetic diversity of angiosperm flora on six oceanic islands located in the southeastern Pacific Ocean, by comparing flora from two periods: the pre-European colonization of islands and current times. We hypothesize that, in the time between these periods, extinction of local plant species and addition of exotic plants modified phylogenetic-α-diversity at different levels (deeper and terminal phylogeny) and increased phylo-β-diversity among islands. Based on floristic studies, we assembled a phylogenetic tree from occurrence data that includes 921 species, of which 165 and 756 were native or exotic in origin, respectively. Then, we studied change in the phylo-α-diversity and phylo-β-diversity (1 –Phylosor) by comparing pre-European and current times. Despite extinction of 18 native angiosperm species, an increase in species richness and phylo-α-diversity was observed for all islands studied, attributed to introduction of exotic plants (between 6 to 477 species per island). We did not observe significant variation of mean phylogenetic distance (MPD), a measure of the ‘deeper’ phylogenetic diversity of assemblages (e.g., orders, families), suggesting that neither extinctions nor introductions altered phylogenetic structure of the angiosperms of these islands. In regard to phylo-β-diversity, we detected temporal turnover (variation in phylogenetic composition) between periods to flora (0.38 ± 0.11). However, when analyses were performed only considering native plants, we did not observe significant temporal turnover between periods (0.07 ± 0.06). These results indicate that introduction of exotic angiosperms has contributed more notably than extinctions to the configuration of plant assemblages and phylogenetic diversity on the studied islands. Because phylogenetic diversity is closely related to functional diversity (species trait variations and roles performed by organisms), our results suggests that the introduction of exotic plants to these islands could have detrimental impacts for ecosystem functions and ecosystem services that islands provide (e.g. productivity). PMID:28763508
The comparative ecology and biogeography of parasites
Poulin, Robert; Krasnov, Boris R.; Mouillot, David; Thieltges, David W.
2011-01-01
Comparative ecology uses interspecific relationships among traits, while accounting for the phylogenetic non-independence of species, to uncover general evolutionary processes. Applied to biogeographic questions, it can be a powerful tool to explain the spatial distribution of organisms. Here, we review how comparative methods can elucidate biogeographic patterns and processes, using analyses of distributional data on parasites (fleas and helminths) as case studies. Methods exist to detect phylogenetic signals, i.e. the degree of phylogenetic dependence of a given character, and either to control for these signals in statistical analyses of interspecific data, or to measure their contribution to variance. Parasite–host interactions present a special case, as a given trait may be a parasite trait, a host trait or a property of the coevolved association rather than of one participant only. For some analyses, it is therefore necessary to correct simultaneously for both parasite phylogeny and host phylogeny, or to evaluate which has the greatest influence on trait expression. Using comparative approaches, we show that two fundamental properties of parasites, their niche breadth, i.e. host specificity, and the nature of their life cycle, can explain interspecific and latitudinal variation in the sizes of their geographical ranges, or rates of distance decay in the similarity of parasite communities. These findings illustrate the ways in which phylogenetically based comparative methods can contribute to biogeographic research. PMID:21768153
Tandem repeats analysis for the high resolution phylogenetic analysis of Yersinia pestis
Pourcel, C; André-Mazeaud, F; Neubauer, H; Ramisse, F; Vergnaud, G
2004-01-01
Background Yersinia pestis, the agent of plague, is a young and highly monomorphic species. Three biovars, each one thought to be associated with the last three Y. pestis pandemics, have been defined based on biochemical assays. More recently, DNA based assays, including DNA sequencing, IS typing, DNA arrays, have significantly improved current knowledge on the origin and phylogenetic evolution of Y. pestis. However, these methods suffer either from a lack of resolution or from the difficulty to compare data. Variable number of tandem repeats (VNTRs) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses in a growing number of pathogens and have given promising results for Y. pestis as well. Results In this study we have genotyped 180 Y. pestis isolates by multiple locus VNTR analysis (MLVA) using 25 markers. Sixty-one different genotypes were observed. The three biovars were distributed into three main branches, with some exceptions. In particular, the Medievalis phenotype is clearly heterogeneous, resulting from different mutation events in the napA gene. Antiqua strains from Asia appear to hold a central position compared to Antiqua strains from Africa. A subset of 7 markers is proposed for the quick comparison of a new strain with the collection typed here. This can be easily achieved using a Web-based facility, specifically set-up for running such identifications. Conclusion Tandem-repeat typing may prove to be a powerful complement to the existing phylogenetic tools for Y. pestis. Typing can be achieved quickly at a low cost in terms of consumables, technical expertise and equipment. The resulting data can be easily compared between different laboratories. The number and selection of markers will eventually depend upon the type and aim of investigations. PMID:15186506
Kent, Angela D.; Smith, Dan J.; Benson, Barbara J.; Triplett, Eric W.
2003-01-01
Culture-independent DNA fingerprints are commonly used to assess the diversity of a microbial community. However, relating species composition to community profiles produced by community fingerprint methods is not straightforward. Terminal restriction fragment length polymorphism (T-RFLP) is a community fingerprint method in which phylogenetic assignments may be inferred from the terminal restriction fragment (T-RF) sizes through the use of web-based resources that predict T-RF sizes for known bacteria. The process quickly becomes computationally intensive due to the need to analyze profiles produced by multiple restriction digests and the complexity of profiles generated by natural microbial communities. A web-based tool is described here that rapidly generates phylogenetic assignments from submitted community T-RFLP profiles based on a database of fragments produced by known 16S rRNA gene sequences. Users have the option of submitting a customized database generated from unpublished sequences or from a gene other than the 16S rRNA gene. This phylogenetic assignment tool allows users to employ T-RFLP to simultaneously analyze microbial community diversity and species composition. An analysis of the variability of bacterial species composition throughout the water column in a humic lake was carried out to demonstrate the functionality of the phylogenetic assignment tool. This method was validated by comparing the results generated by this program with results from a 16S rRNA gene clone library. PMID:14602639
Krajewski, C; Fain, M G; Buckley, L; King, D G
1999-11-01
ki ctes over whether molecular sequence data should be partitioned for phylogenetic analysis often confound two types of heterogeneity among partitions. We distinguish historical heterogeneity (i.e., different partitions have different evolutionary relationships) from dynamic heterogeneity (i.e., different partitions show different patterns of sequence evolution) and explore the impact of the latter on phylogenetic accuracy and precision with a two-gene, mitochondrial data set for cranes. The well-established phylogeny of cranes allows us to contrast tree-based estimates of relevant parameter values with estimates based on pairwise comparisons and to ascertain the effects of incorporating different amounts of process information into phylogenetic estimates. We show that codon positions in the cytochrome b and NADH dehydrogenase subunit 6 genes are dynamically heterogenous under both Poisson and invariable-sites + gamma-rates versions of the F84 model and that heterogeneity includes variation in base composition and transition bias as well as substitution rate. Estimates of transition-bias and relative-rate parameters from pairwise sequence comparisons were comparable to those obtained as tree-based maximum likelihood estimates. Neither rate-category nor mixed-model partitioning strategies resulted in a loss of phylogenetic precision relative to unpartitioned analyses. We suggest that weighted-average distances provide a computationally feasible alternative to direct maximum likelihood estimates of phylogeny for mixed-model analyses of large, dynamically heterogenous data sets. Copyright 1999 Academic Press.
Phylogenetic classification of bony fishes.
Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo
2017-07-06
Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution for more taxa than previous versions, based on more densely sampled phylogenetic trees. The classification presented in this study represents, unlike any other, the most up-to-date hypothesis of the Tree of Life of fishes.
CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis
2011-01-01
Background The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Results Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. Conclusions CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore. PMID:21496247
CDAO-store: ontology-driven data integration for phylogenetic analysis.
Chisham, Brandon; Wright, Ben; Le, Trung; Son, Tran Cao; Pontelli, Enrico
2011-04-15
The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore.
Phylogeny, host-parasite relationship and zoogeography
1999-01-01
Phylogeny is the evolutionary history of a group or the lineage of organisms and is reconstructed based on morphological, molecular and other characteristics. The genealogical relationship of a group of taxa is often expressed as a phylogenetic tree. The difficulty in categorizing the phylogeny is mainly due to the existence of frequent homoplasies that deceive observers. At the present time, cladistic analysis is believed to be one of the most effective methods of reconstructing a phylogenetic tree. Excellent computer program software for phylogenetic analysis is available. As an example, cladistic analysis was applied for nematode genera of the family Acuariidae, and the phylogenetic tree formed was compared with the system used currently. Nematodes in the genera Nippostrongylus and Heligmonoides were also analyzed, and the validity of the reconstructed phylogenetic trees was observed from a zoogeographical point of view. Some of the theories of parasite evolution were briefly reviewed as well. Coevolution of parasites and humans was discussed with special reference to the evolutionary relationship between Enterobius and primates. PMID:10634036
NASA Technical Reports Server (NTRS)
Villanueva, E.; Delihas, N.; Luehrsen, K. R.; Fox, G. E.; Gibson, J.
1985-01-01
The complete nucleotide sequences of 5S ribosomal RNAs from Rhodocyclus gelatinosa, Rhodobacter sphaeroides, and Pseudomonas cepacia were determined. Comparisons of these 5S RNA sequences show that rather than being phylogenetically related to one another, the two photosynthetic bacterial 5S RNAs share more sequence and signature homology with the RNAs of two nonphotosynthetic strains. Rhodobacter sphaeroides is specifically related to Paracoccus denitrificans and Rc. gelatinosa is related to Ps. cepacia. These results support earlier 16S ribosomal RNA studies and add two important groups to the 5S RNA data base. Unique 5S RNA structural features previously found in P. denitrificans are present also in the 5S RNA of Rb. sphaeroides; these provide the basis for subdivisional signatures. The immediate consequence of obtaining these new sequences is that it is possible to clarify the phylogenetic origins of the plant mitochondrion. In particular, a close phylogenetic relationship is found between the plant mitochondria and members of the alpha subdivision of the purple photosynthetic bacteria, namely, Rb. sphaeroides, P. denitrificans, and Rhodospirillum rubrum.
Ali, Khalil H Al; El-Badry, Ayman A; Ali, Mouhanad Al; El-Sayed, Wael S M; El-Beshbishy, Hesham A
2016-06-01
Aedes aegypti is the main vector of the yellow fever and dengue virus. This mosquito has become the major indirect cause of morbidity and mortality of the human worldwide. Dengue virus activity has been reported recently in the western areas of Saudi Arabia. There is no vaccine for dengue virus until now, and the control of the disease depends on the control of the vector. The present study has aimed to perform phylogenetic analysis of Aedes aegypti based on mitochondrial NADH dehydrogenase subunit 4 ( ND4 ) gene at Almadinah, Saudi Arabia in order to get further insight into the epidemiology and transmission of this vector. Mitochondrial ND4 gene was sequenced in the eight isolated Aedes aegypti mosquitoes from Almadinah, Saudi Arabia, sequences were aligned, and phylogenetic analysis were performed and compared with 54 sequences of Aedes reported in the previous studies from Mexico, Thailand, Brazil, and Africa. Our results suggest that increased gene flow among Aedes aegypti populations occurs between Africa and Saudi Arabia. Phylogenetic relationship analysis showed two genetically distinct Aedes aegypti in Saudi Arabia derived from dual African ancestor.
Enumerating all maximal frequent subtrees in collections of phylogenetic trees
2014-01-01
Background A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474
Enumerating all maximal frequent subtrees in collections of phylogenetic trees.
Deepak, Akshay; Fernández-Baca, David
2014-01-01
A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.
A review of criticisms of phylogenetic nomenclature: is taxonomic freedom the fundamental issue?
Bryant, Harold N; Cantino, Philip D
2002-02-01
The proposal to implement a phylogenetic nomenclatural system governed by the PhyloCode), in which taxon names are defined by explicit reference to common descent, has met with strong criticism from some proponents of phylogenetic taxonomy (taxonomy based on the principle of common descent in which only clades and species are recognized). We examine these criticisms and find that some of the perceived problems with phylogenetic nomenclature are based on misconceptions, some are equally true of the current rank-based nomenclatural system, and some will be eliminated by implementation of the PhyloCode. Most of the criticisms are related to an overriding concern that, because the meanings of names are associated with phylogenetic pattern which is subject to change, the adoption of phylogenetic nomenclature will lead to increased instability in the content of taxa. This concern is associated with the fact that, despite the widespread adoption of the view that taxa are historical entities that are conceptualized based on ancestry, many taxonomists also conceptualize taxa based on their content. As a result, critics of phylogenetic nomenclature have argued that taxonomists should be free to emend the content of taxa without constraints imposed by nomenclatural decisions. However, in phylogenetic nomenclature the contents of taxa are determined, not by the taxonomist, but by the combination of the phylogenetic definition of the name and a phylogenetic hypothesis. Because the contents of taxa, once their names are defined, can no longer be freely modified by taxonomists, phylogenetic nomenclature is perceived as limiting taxonomic freedom. We argue that the form of taxonomic freedom inherent to phylogenetic nomenclature is appropriate to phylogenetic taxonomy in which taxa are considered historical entities that are discovered through phylogenetic analysis and are not human constructs.
Vaux, Felix; Trewick, Steven A; Crampton, James S; Marshall, Bruce A; Beu, Alan G; Hills, Simon F K; Morgan-Richards, Mary
2018-06-15
The relationship between morphology and inheritance is of perennial interest in evolutionary biology and palaeontology. Using three marine snail genera Penion, Antarctoneptunea and Kelletia, we investigate whether systematics based on shell morphology accurately reflect evolutionary lineages indicated by molecular phylogenetics. Members of these gastropod genera have been a taxonomic challenge due to substantial variation in shell morphology, conservative radular and soft tissue morphology, few known ecological differences, and geographical overlap between numerous species. Sampling all sixteen putative taxa identified across the three genera, we infer mitochondrial and nuclear ribosomal DNA phylogenetic relationships within the group, and compare this to variation in adult shell shape and size. Results of phylogenetic analysis indicate that each genus is monophyletic, although the status of some phylogenetically derived and likely more recently evolved taxa within Penion is uncertain. The recently described species P. lineatus is supported by genetic evidence. Morphology, captured using geometric morphometric analysis, distinguishes the genera and matches the molecular phylogeny, although using the same dataset, species and phylogenetic subclades are not identified with high accuracy. Overall, despite abundant variation, we find that shell morphology accurately reflects genus-level classification and the corresponding deep phylogenetic splits identified in this group of marine snails. Copyright © 2018 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Arit, Turkan; Keskin, Burak; Firuzan, Esin; Cavas, Cagin Kandemir; Liu, Liwei; Cavas, Levent
2018-04-01
The report entitled "L. Liu, D. Li, F. Bai, A relative Lempel-Ziv complexity: Application to comparing biological sequences, Chem. Phys. Lett. 530 (2012) 107-112" mentions on the powerful construction of phylogenetic trees based on Lempel-Ziv algorithm. On the other hand, the method explained in the paper does not give promising result on the data set on invasive Caulerpa taxifolia in the Mediterranean Sea. The phylogenetic trees are obtained by the proposed method of the aforementioned paper in this short note.
Yi, Zhenzhen; Strüder-Kypke, Michaela; Hu, Xiaozhong; Lin, Xiaofeng; Song, Weibo
2014-02-01
In order to assess how dataset-selection for multi-gene analyses affects the accuracy of inferred phylogenetic trees in ciliates, we chose five genes and the genus Paramecium, one of the most widely used model protist genera, and compared tree topologies of the single- and multi-gene analyses. Our empirical study shows that: (1) Using multiple genes improves phylogenetic accuracy, even when their one-gene topologies are in conflict with each other. (2) The impact of missing data on phylogenetic accuracy is ambiguous: resolution power and topological similarity, but not number of represented taxa, are the most important criteria of a dataset for inclusion in concatenated analyses. (3) As an example, we tested the three classification models of the genus Paramecium with a multi-gene based approach, and only the monophyly of the subgenus Paramecium is supported. Copyright © 2013 Elsevier Inc. All rights reserved.
Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.
2011-01-01
The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275
Cophenetic metrics for phylogenetic trees, after Sokal and Rohlf.
Cardona, Gabriel; Mir, Arnau; Rosselló, Francesc; Rotger, Lucía; Sánchez, David
2013-01-16
Phylogenetic tree comparison metrics are an important tool in the study of evolution, and hence the definition of such metrics is an interesting problem in phylogenetics. In a paper in Taxon fifty years ago, Sokal and Rohlf proposed to measure quantitatively the difference between a pair of phylogenetic trees by first encoding them by means of their half-matrices of cophenetic values, and then comparing these matrices. This idea has been used several times since then to define dissimilarity measures between phylogenetic trees but, to our knowledge, no proper metric on weighted phylogenetic trees with nested taxa based on this idea has been formally defined and studied yet. Actually, the cophenetic values of pairs of different taxa alone are not enough to single out phylogenetic trees with weighted arcs or nested taxa. For every (rooted) phylogenetic tree T, let its cophenetic vectorφ(T) consist of all pairs of cophenetic values between pairs of taxa in T and all depths of taxa in T. It turns out that these cophenetic vectors single out weighted phylogenetic trees with nested taxa. We then define a family of cophenetic metrics dφ,p by comparing these cophenetic vectors by means of Lp norms, and we study, either analytically or numerically, some of their basic properties: neighbors, diameter, distribution, and their rank correlation with each other and with other metrics. The cophenetic metrics can be safely used on weighted phylogenetic trees with nested taxa and no restriction on degrees, and they can be computed in O(n2) time, where n stands for the number of taxa. The metrics dφ,1 and dφ,2 have positive skewed distributions, and they show a low rank correlation with the Robinson-Foulds metric and the nodal metrics, and a very high correlation with each other and with the splitted nodal metrics. The diameter of dφ,p, for p⩾1 , is in O(n(p+2)/p), and thus for low p they are more discriminative, having a wider range of values.
A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.
Rajan, Vaibhav
2013-03-01
Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, automated methods are necessary to handle the much larger data sets being prepared today. In this study, we introduce the concept of subsplits and demonstrate their use in extracting phylogenetic signal from alignments. We design a clustering approach for alignment masking where each cluster contains similar columns-similarity being defined on the basis of compatible subsplits; our approach then identifies noisy clusters and eliminates them. Trees inferred from the columns in the retained clusters are found to be topologically closer to the reference trees. We test our method on numerous standard benchmarks (both synthetic and biological data sets) and compare its performance with other methods of alignment masking. We find that our method can eliminate sites more accurately than other methods, particularly on divergent data, and can improve the topologies of the inferred trees in likelihood-based analyses. Software available upon request from the author.
2013-01-01
Background Dendropsophus is a monophyletic anuran genus with a diploid number of 30 chromosomes as an important synapomorphy. However, the internal phylogenetic relationships of this genus are poorly understood. Interestingly, an intriguing interspecific variation in the telocentric chromosome number has been useful in species identification. To address certain uncertainties related to one of the species groups of Dendropsophus, the D. microcephalus group, we carried out a cytogenetic analysis combined with phylogenetic inferences based on mitochondrial sequences, which aimed to aid in the analysis of chromosomal characters. Populations of Dendropsophus nanus, Dendropsophus walfordi, Dendropsophus sanborni, Dendropsophus jimi and Dendropsophus elianeae, ranging from the extreme south to the north of Brazil, were cytogenetically compared. A mitochondrial region of the ribosomal 12S gene from these populations, as well as from 30 other species of Dendropsophus, was used for the phylogenetic inferences. Phylogenetic relationships were inferred using maximum parsimony and Bayesian analyses. Results The species D. nanus and D. walfordi exhibited identical karyotypes (2n = 30; FN = 52), with four pairs of telocentric chromosomes and a NOR located on metacentric chromosome pair 13. In all of the phylogenetic hypotheses, the paraphyly of D. nanus and D. walfordi was inferred. D. sanborni from Botucatu-SP and Torres-RS showed the same karyotype as D. jimi, with 5 pairs of telocentric chromosomes (2n = 30; FN = 50) and a terminal NOR in the long arm of the telocentric chromosome pair 12. Despite their karyotypic similarity, these species were not found to compose a monophyletic group. Finally, the phylogenetic and cytogenetic analyses did not cluster the specimens of D. elianeae according to their geographical occurrence or recognized morphotypes. Conclusions We suggest that a taxonomic revision of the taxa D. nanus and D. walfordi is quite necessary. We also observe that the number of telocentric chromosomes is useful to distinguish among valid species in some cases, although it is unchanged in species that are not necessarily closely related phylogenetically. Therefore, inferences based on this chromosomal character must be made with caution; a proper evolutionary analysis of the karyotypic variation in Dendropsophus depends on further characterization of the telocentric chromosomes found in this group. PMID:23822759
Singh, Jitendra P; Singh, Ak; Bajpai, Anju; Ahmad, Iffat Zareen
2014-01-01
The Indian black berry (Syzygium cumini Skeels) has a great nutraceutical and medicinal properties. As in other fruit crops, the fruit characteristics are important attributes for differentiation were also determined for different accessions of S. cumini. The fruit weight, length, breadth, length: breadth ratio, pulp weight, pulp content, seed weight and pulp: seed ratio significantly varied in different accessions. Molecular characterization was carried out using PCR based RAPD technique. Out of 80 RAPD primers, only 18 primers produced stable polymorphisms that were used to examine the phylogenetic relationship. A sum of 207 loci were generated out of which 201 loci found polymorphic. The average genetic dissimilarity was 97 per cent among jamun accessions. The phylogenetic relationship was also determined by principal coordinates analysis (PCoA) that explained 46.95 per cent cumulative variance. The two-dimensional PCoA analysis showed grouping of the different accessions that were plotted into four sub-plots, representing clustering of accessions. The UPGMA (r = 0.967) and NJ (r = 0.987) dendrogram constructed based on the dissimilarity matrix revealed a good degree of fit with the cophenetic correlation value. The dendrogram grouped the accessions into three main clusters according to their eco-geographical regions which given useful insight into their phylogenetic relationships.
Chen, Jonathan S.; Reddy, Vamsee; Chen, Joshua H.; Shlykov, Maksim A.; Zheng, Wei Hao; Cho, Jaehoon; Yen, Ming Ren; Saier, Milton H.
2012-01-01
Transport proteins function in the translocation of ions, solutes and macromolecules across cellular and organellar membranes. These integral membrane proteins fall into >600 families as tabulated in the Transporter Classification Database (www.tcdb.org). Recent studies, some of which are reported here, define distant phylogenetic relationships between families with the creation of superfamilies. Several of these are analyzed using a novel set of programs designed to allow reliable prediction of phylogenetic trees when sequence divergence is too great to allow the use of multiple alignments. These new programs, called SuperfamilyTree1 and 2 (SFT1 and 2), allow display of protein and family relationships, respectively, based on thousands of comparative BLAST scores rather than multiple alignments. Superfamilies analyzed include: (1) Aerolysins, (2) RTX Toxins, (3) Defensins, (4) Ion Transporters, (5) Bile/Arsenite/Riboflavin Transporters, (6) Cation: Proton Antiporters, and (7) the Glucose/Fructose/Lactose superfamily within the prokaryotic phosphoenol pyruvate-dependent Phosphotransferase System. In addition to defining the phylogenetic relationships of the proteins and families within these seven superfamilies, evidence is provided showing that the SFT programs outperform programs that are based on multiple alignments whenever sequence divergence of superfamily members is extensive. The SFT programs should be applicable to virtually any superfamily of proteins or nucleic acids. PMID:22286036
Yu, Xiaoyu; Reva, Oleg N
2018-01-01
Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA.
Yu, Xiaoyu; Reva, Oleg N
2018-01-01
Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA. PMID:29511354
Inquiry-Based Approach to Understanding Common Descent
ERIC Educational Resources Information Center
Parker, Monica
2010-01-01
In this inquiry-based activity, students catalog external and internal characteristics of four different classes of animals during dissection exercises. On the basis of their accumulated data, students compare and contrast the animals, devise a phylogenetic tree, and provide reasonable characteristics for extinct transitional organisms. (Contains…
Fortunato, Laura; Jordan, Fiona
2010-01-01
Accurate reconstruction of prehistoric social organization is important if we are to put together satisfactory multidisciplinary scenarios about, for example, the dispersal of human groups. Such considerations apply in the case of Indo-European and Austronesian, two large-scale language families that are thought to represent Neolithic expansions. Ancestral kinship patterns have mostly been inferred through reconstruction of kin terminologies in ancestral proto-languages using the linguistic comparative method, and through geographical or distributional arguments based on the comparative patterns of kin terms and ethnographic kinship ‘facts’. While these approaches are detailed and valuable, the processes through which conclusions have been drawn from the data fail to provide explicit criteria for systematic testing of alternative hypotheses. Here, we use language trees derived using phylogenetic tree-building techniques on Indo-European and Austronesian vocabulary data. With these trees, ethnographic data and Bayesian phylogenetic comparative methods, we statistically reconstruct past marital residence and infer rates of cultural change between different residence forms, showing Proto-Indo-European to be virilocal and Proto-Malayo-Polynesian uxorilocal. The instability of uxorilocality and the rare loss of virilocality once gained emerge as common features of both families. PMID:21041215
Humphreys-Pereira, Danny A; Elling, Axel A
2014-01-01
Root-knot nematodes (Meloidogyne spp.) are among the most important plant pathogens. In this study, the mitochondrial (mt) genomes of the root-knot nematodes, M. chitwoodi and M. incognita were sequenced. PCR analyses suggest that both mt genomes are circular, with an estimated size of 19.7 and 18.6-19.1kb, respectively. The mt genomes each contain a large non-coding region with tandem repeats and the control region. The mt gene arrangement of M. chitwoodi and M. incognita is unlike that of other nematodes. Sequence alignments of the two Meloidogyne mt genomes showed three translocations; two in transfer RNAs and one in cox2. Compared with other nematode mt genomes, the gene arrangement of M. chitwoodi and M. incognita was most similar to Pratylenchus vulnus. Phylogenetic analyses (Maximum Likelihood and Bayesian inference) were conducted using 78 complete mt genomes of diverse nematode species. Analyses based on nucleotides and amino acids of the 12 protein-coding mt genes showed strong support for the monophyly of class Chromadorea, but only amino acid-based analyses supported the monophyly of class Enoplea. The suborder Spirurina was not monophyletic in any of the phylogenetic analyses, contradicting the Clade III model, which groups Ascaridomorpha, Spiruromorpha and Oxyuridomorpha based on the small subunit ribosomal RNA gene. Importantly, comparisons of mt gene arrangement and tree-based methods placed Meloidogyne as sister taxa of Pratylenchus, a migratory plant endoparasitic nematode, and not with the sedentary endoparasitic Heterodera. Thus, comparative analyses of mt genomes suggest that sedentary endoparasitism in Meloidogyne and Heterodera is based on convergent evolution. Copyright © 2014 Elsevier B.V. All rights reserved.
Anchoring quartet-based phylogenetic distances and applications to species tree reconstruction.
Sayyari, Erfan; Mirarab, Siavash
2016-11-11
Inferring species trees from gene trees using the coalescent-based summary methods has been the subject of much attention, yet new scalable and accurate methods are needed. We introduce DISTIQUE, a new statistically consistent summary method for inferring species trees from gene trees under the coalescent model. We generalize our results to arbitrary phylogenetic inference problems; we show that two arbitrarily chosen leaves, called anchors, can be used to estimate relative distances between all other pairs of leaves by inferring relevant quartet trees. This results in a family of distance-based tree inference methods, with running times ranging between quadratic to quartic in the number of leaves. We show in simulated studies that DISTIQUE has comparable accuracy to leading coalescent-based summary methods and reduced running times.
Li, Wei; Zhang, Xin-Cheng; Zhao, Jian; Shi, Yan; Zhu, Xin-Ping
2015-01-25
Cuora trifasciata has become one of the most critically endangered species in the world. The complete mitochondrial genome of C. trifasciata (Chinese three-striped box turtle) was determined in this study. Its mitochondrial genome is a 16,575-bp-long circular molecule that consists of 37 genes that are typically found in other vertebrates. And the basic characteristics of the C. trifasciata mitochondrial genome were also determined. Moreover, a comparison of C. trifasciata with Cuora cyclornata, Cuora pani and Cuora aurocapitata indicated that the four mitogenomics differed in length, codons, overlaps, 13 protein-coding genes (PCGs), ND3, rRNA genes, control region, and other aspects. Phylogenetic analysis with Bayesian inference and maximum likelihood based on 12 protein-coding genes of the genus Cuora indicated the phylogenetic position of C. trifasciata within Cuora. The phylogenetic analysis also showed that C. trifasciata from Vietnam and China formed separate monophyletic clades with different Cuora species. The results of nucleotide base compositions, protein-coding genes and phylogenetic analysis showed that C. trifasciata from these two countries may represent different Cuora species. Copyright © 2014 Elsevier B.V. All rights reserved.
Geiger, Matthias F; McCrary, Jeffrey K; Schliewen, Ulrich K
2010-09-01
Nicaraguan Midas cichlids from crater lakes have recently attracted attention as potential model systems for speciation research, but no attempt has been made to comprehensively reconstruct phylogenetic relationships of this highly diverse and recently evolved species complex. We present a first AFLP (2793 loci) and mtDNA based phylogenetic hypothesis including all described and several undescribed species from six crater lakes (Apoyeque, Apoyo, Asososca Leon, Masaya, Tiscapa and Xiloá), the two great Lakes Managua and Nicaragua and the San Juan River. Our analyses demonstrate that the relationships between the Midas cichlid members are complex, and that phylogenetic information from different markers and methods do not always yield congruent results. Nevertheless, monophyly support for crater lake assemblages from Lakes Apoyeque, Apoyo, A. Leon is high as compared to those from L. Xiloá indicating occurrence of sympatric speciation. Further, we demonstrate that a 'three species' concept for the Midas cichlid complex is inapplicable and consequently that an individualized and voucher based approach in speciation research of the Midas cichlid complex is necessary at least as long as there is no comprehensive revision of the species complex available. Copyright 2010 Elsevier Inc. All rights reserved.
SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
2013-01-01
Background Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408
SUNPLIN: simulation with uncertainty for phylogenetic investigations.
Martins, Wellington S; Carmo, Welton C; Longo, Humberto J; Rosa, Thierson C; Rangel, Thiago F
2013-11-15
Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.
Qian, Hong; Chen, Shengbin; Zhang, Jin-Long
2017-07-17
Niche-based and neutrality-based theories are two major classes of theories explaining the assembly mechanisms of local communities. Both theories have been frequently used to explain species diversity and composition in local communities but their relative importance remains unclear. Here, we analyzed 57 assemblages of angiosperm trees in 0.1-ha forest plots across China to examine the effects of environmental heterogeneity (relevant to niche-based processes) and spatial contingency (relevant to neutrality-based processes) on phylogenetic structure of angiosperm tree assemblages distributed across a wide range of environment and space. Phylogenetic structure was quantified with six phylogenetic metrics (i.e., phylogenetic diversity, mean pairwise distance, mean nearest taxon distance, and the standardized effect sizes of these three metrics), which emphasize on different depths of evolutionary histories and account for different degrees of species richness effects. Our results showed that the variation in phylogenetic metrics explained independently by environmental variables was on average much greater than that explained independently by spatial structure, and the vast majority of the variation in phylogenetic metrics was explained by spatially structured environmental variables. We conclude that niche-based processes have played a more important role than neutrality-based processes in driving phylogenetic structure of angiosperm tree species in forest communities in China.
Tanabe, Akifumi S
2011-09-01
Proportional and separate models able to apply different combination of substitution rate matrix (SRM) and among-site rate variation model (ASRVM) to each locus are frequently used in phylogenetic studies of multilocus data. A proportional model assumes that branch lengths are proportional among partitions and a separate model assumes that each partition has an independent set of branch lengths. However, the selection from among nonpartitioned (i.e., a common combination of models is applied to all-loci concatenated sequences), proportional and separate models is usually based on the researcher's preference rather than on any information criteria. This study describes two programs, 'Kakusan4' (for DNA sequences) and 'Aminosan' (for amino-acid sequences), which allow the selection of evolutionary models based on several types of information criteria. The programs can handle both multilocus and single-locus data, in addition to providing an easy-to-use wizard interface and a noninteractive command line interface. In the case of multilocus data, SRMs and ASRVMs are compared at each locus and at all-loci concatenated sequences, after which nonpartitioned, proportional and separate models are compared based on information criteria. The programs also provide model configuration files for mrbayes, paup*, phyml, raxml and Treefinder to support further phylogenetic analysis using a selected model. When likelihoods are optimized by Treefinder, the best-fit models were found to differ depending on the data set. Furthermore, differences in the information criteria among nonpartitioned, proportional and separate models were much larger than those among the nonpartitioned models. These findings suggest that selecting from nonpartitioned, proportional and separate models results in a better phylogenetic tree. Kakusan4 and Aminosan are available at http://www.fifthdimension.jp/. They are licensed under gnugpl Ver.2, and are able to run on Windows, MacOS X and Linux. © 2011 Blackwell Publishing Ltd.
M Naresh Kumar, C V; Anthony Johnson, A M; R Sai Gopal, D V
2007-12-01
Chikungunya virus has caused numerous large outbreaks in India. Suspected blood samples from the epidemic were collected and characterized for the identification of the responsible causative from Rayalaseema region of Andhra Pradesh. RT-PCR was used for screening of suspected blood samples. Primers were designed to amplify partial E1 gene and the amplified fragment was cloned and sequenced. The sequence was analyzed and compared with other geographical isolates to find the phylogenetic relationship. The sequence was submitted to the Gen bank DNA database (accession DQ888620). Comparative nucleotide homology analysis of the AP Ra-CTR isolate with the other isolates revealed 94.7+/-3.6 per cent of homology of CHIKAPRa-CTR with other isolates of Chikungunya virus at nucleotide level and 96.8+/-3.2 per cent of homology at amino acid level. The current epidemic was caused by the Central African genotype of CHIKV, grouped in Central Africa cluster in phylogenetic trees generated based on nucleotide and amino acid sequences.
Homologization of the flight musculature of zygoptera (insecta: odonata) and neoptera (insecta).
Büsse, Sebastian; Genet, Cécile; Hörnschemeyer, Thomas
2013-01-01
Among the winged insects (Pterygota) the Dragonflies and Damselflies (Odonata) are unique for several reasons. Behaviourally they are aerial predators that hunt and catch their prey in flight, only. Morphologically the flight apparatus of Odonata is significantly different from what is found in the remaining Pterygota. However, to understand the phylogenetic relationships of winged insects and the origin and evolution of insect flight in general, it is essential to know how the elements of the odonatan flight apparatus relate to those of the other Pterygota. Here we present a comprehensive, comparative morphological investigation of the thoracic flight musculature of damselflies (Zygoptera). Based on our new data we propose a homologization scheme for the thoracic musculature throughout Pterygota. The new homology hypotheses will allow for future comparative work and especially for phylogenetic analyses using characters of the thoracic musculature throughout all winged insects. This will contribute to understand the early evolution of pterygote insects and their basal phylogenetic relationship.
Homologization of the Flight Musculature of Zygoptera (Insecta: Odonata) and Neoptera (Insecta)
Büsse, Sebastian; Genet, Cécile; Hörnschemeyer, Thomas
2013-01-01
Among the winged insects (Pterygota) the Dragonflies and Damselflies (Odonata) are unique for several reasons. Behaviourally they are aerial predators that hunt and catch their prey in flight, only. Morphologically the flight apparatus of Odonata is significantly different from what is found in the remaining Pterygota. However, to understand the phylogenetic relationships of winged insects and the origin and evolution of insect flight in general, it is essential to know how the elements of the odonatan flight apparatus relate to those of the other Pterygota. Here we present a comprehensive, comparative morphological investigation of the thoracic flight musculature of damselflies (Zygoptera). Based on our new data we propose a homologization scheme for the thoracic musculature throughout Pterygota. The new homology hypotheses will allow for future comparative work and especially for phylogenetic analyses using characters of the thoracic musculature throughout all winged insects. This will contribute to understand the early evolution of pterygote insects and their basal phylogenetic relationship. PMID:23457479
Julien, Clavel; Leandro, Aristide; Hélène, Morlon
2018-06-19
Working with high-dimensional phylogenetic comparative datasets is challenging because likelihood-based multivariate methods suffer from low statistical performances as the number of traits p approaches the number of species n and because some computational complications occur when p exceeds n. Alternative phylogenetic comparative methods have recently been proposed to deal with the large p small n scenario but their use and performances are limited. Here we develop a penalized likelihood framework to deal with high-dimensional comparative datasets. We propose various penalizations and methods for selecting the intensity of the penalties. We apply this general framework to the estimation of parameters (the evolutionary trait covariance matrix and parameters of the evolutionary model) and model comparison for the high-dimensional multivariate Brownian (BM), Early-burst (EB), Ornstein-Uhlenbeck (OU) and Pagel's lambda models. We show using simulations that our penalized likelihood approach dramatically improves the estimation of evolutionary trait covariance matrices and model parameters when p approaches n, and allows for their accurate estimation when p equals or exceeds n. In addition, we show that penalized likelihood models can be efficiently compared using Generalized Information Criterion (GIC). We implement these methods, as well as the related estimation of ancestral states and the computation of phylogenetic PCA in the R package RPANDA and mvMORPH. Finally, we illustrate the utility of the new proposed framework by evaluating evolutionary models fit, analyzing integration patterns, and reconstructing evolutionary trajectories for a high-dimensional 3-D dataset of brain shape in the New World monkeys. We find a clear support for an Early-burst model suggesting an early diversification of brain morphology during the ecological radiation of the clade. Penalized likelihood offers an efficient way to deal with high-dimensional multivariate comparative data.
Subbotin, S A; Vierstraete, A; De Ley, P; Rowe, J; Waeyenberge, L; Moens, M; Vanfleteren, J R
2001-10-01
The ITS1, ITS2, and 5.8S gene sequences of nuclear ribosomal DNA from 40 taxa of the family Heteroderidae (including the genera Afenestrata, Cactodera, Heterodera, Globodera, Punctodera, Meloidodera, Cryphodera, and Thecavermiculatus) were sequenced and analyzed. The ITS regions displayed high levels of sequence divergence within Heteroderinae and compared to outgroup taxa. Unlike recent findings in root knot nematodes, ITS sequence polymorphism does not appear to complicate phylogenetic analysis of cyst nematodes. Phylogenetic analyses with maximum-parsimony, minimum-evolution, and maximum-likelihood methods were performed with a range of computer alignments, including elision and culled alignments. All multiple alignments and phylogenetic methods yielded similar basic structure for phylogenetic relationships of Heteroderidae. The cyst-forming nematodes are represented by six main clades corresponding to morphological characters and host specialization, with certain clades assuming different positions depending on alignment procedure and/or method of phylogenetic inference. Hypotheses of monophyly of Punctoderinae and Heteroderinae are, respectively, strongly and moderately supported by the ITS data across most alignments. Close relationships were revealed between the Avenae and the Sacchari groups and between the Humuli group and the species H. salixophila within Heteroderinae. The Goettingiana group occupies a basal position within this subfamily. The validity of the genera Afenestrata and Bidera was tested and is discussed based on molecular data. We conclude that ITS sequence data are appropriate for studies of relationships within the different species groups and less so for recovery of more ancient speciations within Heteroderidae. Copyright 2001 Academic Press.
On Tree-Based Phylogenetic Networks.
Zhang, Louxin
2016-07-01
A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.
Cheng, Rubin; Zheng, Xiaodong; Ma, Yuanyuan; Li, Qi
2013-01-01
In the present study, we determined the complete mitochondrial DNA (mtDNA) sequences of two species of Cistopus, namely C. chinensis and C. taiwanicus, and conducted a comparative mt genome analysis across the class Cephalopoda. The mtDNA length of C. chinensis and C. taiwanicus are 15706 and 15793 nucleotides with an AT content of 76.21% and 76.5%, respectively. The sequence identity of mtDNA between C. chinensis and C. taiwanicus was 88%, suggesting a close relationship. Compared with C. taiwanicus and other octopods, C. chinensis encoded two additional tRNA genes, showing a novel gene arrangement. In addition, an unusual 23 poly (A) signal structure is found in the ATP8 coding region of C. chinensis. The entire genome and each protein coding gene of the two Cistopus species displayed notable levels of AT and GC skews. Based on sliding window analysis among Octopodiformes, ND1 and DN5 were considered to be more reliable molecular beacons. Phylogenetic analyses based on the 13 protein-coding genes revealed that C. chinensis and C. taiwanicus form a monophyletic group with high statistical support, consistent with previous studies based on morphological characteristics. Our results also indicated that the phylogenetic position of the genus Cistopus is closer to Octopus than to Amphioctopus and Callistoctopus. The complete mtDNA sequence of C. chinensis and C. taiwanicus represent the first whole mt genomes in the genus Cistopus. These novel mtDNA data will be important in refining the phylogenetic relationships within Octopodiformes and enriching the resource of markers for systematic, population genetic and evolutionary biological studies of Cephalopoda. PMID:24358345
Bayesian models for comparative analysis integrating phylogenetic uncertainty.
de Villemereuil, Pierre; Wells, Jessie A; Edwards, Robert D; Blomberg, Simon P
2012-06-28
Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language.
Bayesian models for comparative analysis integrating phylogenetic uncertainty
2012-01-01
Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language. PMID:22741602
MASTtreedist: visualization of tree space based on maximum agreement subtree.
Huang, Hong; Li, Yongji
2013-01-01
Phylogenetic tree construction process might produce many candidate trees as the "best estimates." As the number of constructed phylogenetic trees grows, the need to efficiently compare their topological or physical structures arises. One of the tree comparison's software tools, the Mesquite's Tree Set Viz module, allows the rapid and efficient visualization of the tree comparison distances using multidimensional scaling (MDS). Tree-distance measures, such as Robinson-Foulds (RF), for the topological distance among different trees have been implemented in Tree Set Viz. New and sophisticated measures such as Maximum Agreement Subtree (MAST) can be continuously built upon Tree Set Viz. MAST can detect the common substructures among trees and provide more precise information on the similarity of the trees, but it is NP-hard and difficult to implement. In this article, we present a practical tree-distance metric: MASTtreedist, a MAST-based comparison metric in Mesquite's Tree Set Viz module. In this metric, the efficient optimizations for the maximum weight clique problem are applied. The results suggest that the proposed method can efficiently compute the MAST distances among trees, and such tree topological differences can be translated as a scatter of points in two-dimensional (2D) space. We also provide statistical evaluation of provided measures with respect to RF-using experimental data sets. This new comparison module provides a new tree-tree pairwise comparison metric based on the differences of the number of MAST leaves among constructed phylogenetic trees. Such a new phylogenetic tree comparison metric improves the visualization of taxa differences by discriminating small divergences of subtree structures for phylogenetic tree reconstruction.
Hua, Jimeng; Li, Ming; Dong, Pengzhi; Cui, Ying; Xie, Qiang; Bu, Wenjun
2009-01-01
Background The true water bugs are grouped in infraorder Nepomorpha (Insecta: Hemiptera: Heteroptera) and are of great economic importance. The phylogenetic relationships within Nepomorpha and the taxonomic hierarchies of Pleoidea and Aphelocheiroidea are uncertain. Most of the previous studies were based on morphological characters without algorithmic assessment. In the latest study, the molecular markers employed in phylogenetic analyses were partial sequences of 16S rDNA and 18S rDNA with a total length about 1 kb. Up to now, no mitochondrial genome of the true water bugs has been sequenced, which is one of the largest data sets that could be compared across animal taxa. In this study we analyzed the unresolved problems in Nepomorpha using evidence from mitochondrial genomes. Results Nine mitochondrial genomes of Nepomorpha and five of other hemipterans were sequenced. These mitochondrial genomes contain the commonly found 37 genes without gene rearrangements. Based on the nucleotide sequences of mt-genomes, Pleoidea is not a member of the Nepomorpha and Aphelocheiroidea should be grouped back into Naucoroidea. Phylogenetic relationships among the superfamilies of Nepomorpha were resolved robustly. Conclusion The mt-genome is an effective data source for resolving intraordinal phylogenetic problems at the superfamily level within Heteroptera. The mitochondrial genomes of the true water bugs are typical insect mt-genomes. Based on the nucleotide sequences of the mt-genomes, we propose the Pleoidea to be a separate heteropteran infraorder. The infraorder Nepomorpha consists of five superfamilies with the relationships (Corixoidea + ((Naucoroidea + Notonectoidea) + (Ochteroidea + Nepoidea))). PMID:19523246
Kuramae, Eiko E; Robert, Vincent; Echavarri-Erasun, Carlos; Boekhout, Teun
2007-01-01
Background The construction of robust and well resolved phylogenetic trees is important for our understanding of many, if not all biological processes, including speciation and origin of higher taxa, genome evolution, metabolic diversification, multicellularity, origin of life styles, pathogenicity and so on. Many older phylogenies were not well supported due to insufficient phylogenetic signal present in the single or few genes used in phylogenetic reconstructions. Importantly, single gene phylogenies were not always found to be congruent. The phylogenetic signal may, therefore, be increased by enlarging the number of genes included in phylogenetic studies. Unfortunately, concatenation of many genes does not take into consideration the evolutionary history of each individual gene. Here, we describe an approach to select informative phylogenetic proteins to be used in the Tree of Life (TOL) and barcoding projects by comparing the cophenetic correlation coefficients (CCC) among individual protein distance matrices of proteins, using the fungi as an example. The method demonstrated that the quality and number of concatenated proteins is important for a reliable estimation of TOL. Approximately 40–45 concatenated proteins seem needed to resolve fungal TOL. Results In total 4852 orthologous proteins (KOGs) were assigned among 33 fungal genomes from the Asco- and Basidiomycota and 70 of these represented single copy proteins. The individual protein distance matrices based on 531 concatenated proteins that has been used for phylogeny reconstruction before [14] were compared one with another in order to select those with the highest CCC, which then was used as a reference. This reference distance matrix was compared with those of the 70 single copy proteins selected and their CCC values were calculated. Sixty four KOGs showed a CCC above 0.50 and these were further considered for their phylogenetic potential. Proteins belonging to the cellular processes and signaling KOG category seem more informative than those belonging to the other three categories: information storage and processing; metabolism; and the poorly characterized category. After concatenation of 40 proteins the topology of the phylogenetic tree remained stable, but after concatenation of 60 or more proteins the bootstrap support values of some branches decreased, most likely due to the inclusion of proteins with lowers CCC values. The selection of protein sequences to be used in various TOL projects remains a critical and important process. The method described in this paper will contribute to a more objective selection of phylogenetically informative protein sequences. Conclusion This study provides candidate protein sequences to be considered as phylogenetic markers in different branches of fungal TOL. The selection procedure described here will be useful to select informative protein sequences to resolve branches of TOL that contain few or no species with completely sequenced genomes. The robust phylogenetic trees resulting from this method may contribute to our understanding of organismal diversification processes. The method proposed can be extended easily to other branches of TOL. PMID:17688684
Schweizer, Manuel; Ayé, Raffael; Kashkarov, Roman; Roth, Tobias
2014-01-01
Although phylogenetic diversity has been suggested to be relevant from a conservation point of view, its role is still limited in applied nature conservation. Recently, the practice of investing conservation resources based on threatened species was identified as a reason for the slow integration of phylogenetic diversity in nature conservation planning. One of the main arguments is based on the observation that threatened species are not evenly distributed over the phylogenetic tree. However this argument seems to dismiss the fact that conservation action is a spatially explicit process, and even if threatened species are not evenly distributed over the phylogenetic tree, the occurrence of threatened species could still indicate areas with above average phylogenetic diversity and consequently could protect phylogenetic diversity. Here we aim to study the selection of important bird areas in Central Asia, which were nominated largely based on the presence of threatened bird species. We show that although threatened species occurring in Central Asia do not capture phylogenetically more distinct species than expected by chance, the current spatially explicit conservation approach of selecting important bird areas covers above average taxonomic and phylogenetic diversity of breeding and wintering birds. We conclude that the spatially explicit processes of conservation actions need to be considered in the current discussion of whether new prioritization methods are needed to complement conservation action based on threatened species. PMID:25337861
Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong
2013-07-04
Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationship among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR methods to amplify the dnaA, murC and pyrG gene sequences, which were determined to assess the suitability as phylogenetic markers. Then, we estimated the genetic distance and constructed the phylogenetic trees including 16S rRNA and above mentioned three housekeeping genes combining with published corresponding sequences. By comparing the phylogenetic trees, the topology of three housekeeping genes trees were consistent with that of 16S rRNA gene. The homology of closely related Leu. citreum species among dnaA, murC, pyrG and 16S rRNA gene sequences were different, ranged from75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% 100%, respectively. The phylogenetic relationship of three housekeeping genes sequences were highly consistent with the results of 16S rRNA gene sequence, while the genetic distance of these housekeeping genes were extremely high than 16S rRNA gene. Consequently, the dnaA, murC and pyrG gene are suitable for classification and identification closely related Leu. citreum species.
Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.
Sferra, Gabriella; Fratini, Federica; Ponzi, Marta; Pizzi, Elisabetta
2017-09-05
Elaboration of powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks in the post-genomic era. Phylogenetic profiling allows the prediction of protein-protein interactions at a whole genome level in both Prokaryotes and Eukaryotes. For this reason it is considered one of the most promising methods. Here, we propose an improvement of phylogenetic profiling that enables handling of large genomic datasets and infer global protein-protein interactions. This method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation that makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms phylogenetic profiling methods previously described based on the mutual information and Pearson's correlation as measures of profile similarity. In this work, we constructed and assessed robust reference sets and propose the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.
Posada, David; Buckley, Thomas R
2004-10-01
Model selection is a topic of special relevance in molecular phylogenetics that affects many, if not all, stages of phylogenetic inference. Here we discuss some fundamental concepts and techniques of model selection in the context of phylogenetics. We start by reviewing different aspects of the selection of substitution models in phylogenetics from a theoretical, philosophical and practical point of view, and summarize this comparison in table format. We argue that the most commonly implemented model selection approach, the hierarchical likelihood ratio test, is not the optimal strategy for model selection in phylogenetics, and that approaches like the Akaike Information Criterion (AIC) and Bayesian methods offer important advantages. In particular, the latter two methods are able to simultaneously compare multiple nested or nonnested models, assess model selection uncertainty, and allow for the estimation of phylogenies and model parameters using all available models (model-averaged inference or multimodel inference). We also describe how the relative importance of the different parameters included in substitution models can be depicted. To illustrate some of these points, we have applied AIC-based model averaging to 37 mitochondrial DNA sequences from the subgenus Ohomopterus(genus Carabus) ground beetles described by Sota and Vogler (2001).
A comparative test of phylogenetic diversity indices.
Schweiger, Oliver; Klotz, Stefan; Durka, Walter; Kühn, Ingolf
2008-09-01
Traditional measures of biodiversity, such as species richness, usually treat species as being equal. As this is obviously not the case, measuring diversity in terms of features accumulated over evolutionary history provides additional value to theoretical and applied ecology. Several phylogenetic diversity indices exist, but their behaviour has not yet been tested in a comparative framework. We provide a test of ten commonly used phylogenetic diversity indices based on 40 simulated phylogenies of varying topology. We restrict our analysis to a topological fully resolved tree without information on branch lengths and species lists with presence-absence data. A total of 38,000 artificial communities varying in species richness covering 5-95% of the phylogenies were created by random resampling. The indices were evaluated based on their ability to meet a priori defined requirements. No index meets all requirements, but three indices turned out to be more suitable than others under particular conditions. Average taxonomic distinctness (AvTD) and intensive quadratic entropy (J) are calculated by averaging and are, therefore, unbiased by species richness while reflecting phylogeny per se well. However, averaging leads to the violation of set monotonicity, which requires that species extinction cannot increase the index. Total taxonomic distinctness (TTD) sums up distinctiveness values for particular species across the community. It is therefore strongly linked to species richness and reflects phylogeny per se weakly but satisfies set monotonicity. We suggest that AvTD and J are best applied to studies that compare spatially or temporally rather independent communities that potentially vary strongly in their phylogenetic composition-i.e. where set monotonicity is a more negligible issue, but independence of species richness is desired. In contrast, we suggest that TTD be used in studies that compare rather interdependent communities where changes occur more gradually by species extinction or introduction. Calculating AvTD or TTD, depending on the research question, in addition to species richness is strongly recommended.
Cross-validation to select Bayesian hierarchical models in phylogenetics.
Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C
2016-05-26
Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.
Chen, Tsute; Siddiqui, Huma; Olsen, Ingar
2017-01-01
Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica . All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/.
Chen, Tsute; Siddiqui, Huma; Olsen, Ingar
2017-01-01
Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/. PMID:28261563
Karyotype Evolution in Birds: From Conventional Staining to Chromosome Painting
Ferguson-Smith, Malcolm A.
2018-01-01
In the last few decades, there have been great efforts to reconstruct the phylogeny of Neoaves based mainly on DNA sequencing. Despite the importance of karyotype data in phylogenetic studies, especially with the advent of fluorescence in situ hybridization (FISH) techniques using different types of probes, the use of chromosomal data to clarify phylogenetic proposals is still minimal. Additionally, comparative chromosome painting in birds is restricted to a few orders, while in mammals, for example, virtually all orders have already been analyzed using this method. Most reports are based on comparisons using Gallus gallus probes, and only a small number of species have been analyzed with more informative sets of probes, such as those from Leucopternis albicollis and Gyps fulvus, which show ancestral macrochromosomes rearranged in alternative patterns. Despite this, it is appropriate to review the available cytogenetic information and possible phylogenetic conclusions. In this report, the authors gather both classical and molecular cytogenetic data and describe some interesting and unique characteristics of karyotype evolution in birds. PMID:29584697
PoMo: An Allele Frequency-Based Approach for Species Tree Estimation
De Maio, Nicola; Schrempf, Dominik; Kosiol, Carolin
2015-01-01
Incomplete lineage sorting can cause incongruencies of the overall species-level phylogenetic tree with the phylogenetic trees for individual genes or genomic segments. If these incongruencies are not accounted for, it is possible to incur several biases in species tree estimation. Here, we present a simple maximum likelihood approach that accounts for ancestral variation and incomplete lineage sorting. We use a POlymorphisms-aware phylogenetic MOdel (PoMo) that we have recently shown to efficiently estimate mutation rates and fixation biases from within and between-species variation data. We extend this model to perform efficient estimation of species trees. We test the performance of PoMo in several different scenarios of incomplete lineage sorting using simulations and compare it with existing methods both in accuracy and computational speed. In contrast to other approaches, our model does not use coalescent theory but is allele frequency based. We show that PoMo is well suited for genome-wide species tree estimation and that on such data it is more accurate than previous approaches. PMID:26209413
Karyotype Evolution in Birds: From Conventional Staining to Chromosome Painting.
Kretschmer, Rafael; Ferguson-Smith, Malcolm A; de Oliveira, Edivaldo Herculano Correa
2018-03-27
In the last few decades, there have been great efforts to reconstruct the phylogeny of Neoaves based mainly on DNA sequencing. Despite the importance of karyotype data in phylogenetic studies, especially with the advent of fluorescence in situ hybridization (FISH) techniques using different types of probes, the use of chromosomal data to clarify phylogenetic proposals is still minimal. Additionally, comparative chromosome painting in birds is restricted to a few orders, while in mammals, for example, virtually all orders have already been analyzed using this method. Most reports are based on comparisons using Gallus gallus probes, and only a small number of species have been analyzed with more informative sets of probes, such as those from Leucopternis albicollis and Gyps fulvus , which show ancestral macrochromosomes rearranged in alternative patterns. Despite this, it is appropriate to review the available cytogenetic information and possible phylogenetic conclusions. In this report, the authors gather both classical and molecular cytogenetic data and describe some interesting and unique characteristics of karyotype evolution in birds.
Kress, W John; Erickson, David L; Swenson, Nathan G; Thompson, Jill; Uriarte, Maria; Zimmerman, Jess K
2010-11-09
Species number, functional traits, and phylogenetic history all contribute to characterizing the biological diversity in plant communities. The phylogenetic component of diversity has been particularly difficult to quantify in species-rich tropical tree assemblages. The compilation of previously published (and often incomplete) data on evolutionary relationships of species into a composite phylogeny of the taxa in a forest, through such programs as Phylomatic, has proven useful in building community phylogenies although often of limited resolution. Recently, DNA barcodes have been used to construct a robust community phylogeny for nearly 300 tree species in a forest dynamics plot in Panama using a supermatrix method. In that study sequence data from three barcode loci were used to generate a well-resolved species-level phylogeny. Here we expand upon this earlier investigation and present results on the use of a phylogenetic constraint tree to generate a community phylogeny for a diverse, tropical forest dynamics plot in Puerto Rico. This enhanced method of phylogenetic reconstruction insures the congruence of the barcode phylogeny with broadly accepted hypotheses on the phylogeny of flowering plants (i.e., APG III) regardless of the number and taxonomic breadth of the taxa sampled. We also compare maximum parsimony versus maximum likelihood estimates of community phylogenetic relationships as well as evaluate the effectiveness of one- versus two- versus three-gene barcodes in resolving community evolutionary history. As first demonstrated in the Panamanian forest dynamics plot, the results for the Puerto Rican plot illustrate that highly resolved phylogenies derived from DNA barcode sequence data combined with a constraint tree based on APG III are particularly useful in comparative analysis of phylogenetic diversity and will enhance research on the interface between community ecology and evolution.
Cheng, Yiming; Perocchi, Fabiana
2015-07-01
ProtPhylo is a web-based tool to identify proteins that are functionally linked to either a phenotype or a protein of interest based on co-evolution. ProtPhylo infers functional associations by comparing protein phylogenetic profiles (co-occurrence patterns of orthology relationships) for more than 9.7 million non-redundant protein sequences from all three domains of life. Users can query any of 2048 fully sequenced organisms, including 1678 bacteria, 255 eukaryotes and 115 archaea. In addition, they can tailor ProtPhylo to a particular kind of biological question by choosing among four main orthology inference methods based either on pair-wise sequence comparisons (One-way Best Hits and Best Reciprocal Hits) or clustering of orthologous proteins across multiple species (OrthoMCL and eggNOG). Next, ProtPhylo ranks phylogenetic neighbors of query proteins or phenotypic properties using the Hamming distance as a measure of similarity between pairs of phylogenetic profiles. Candidate hits can be easily and flexibly prioritized by complementary clues on subcellular localization, known protein-protein interactions, membrane spanning regions and protein domains. The resulting protein list can be quickly exported into a csv text file for further analyses. ProtPhylo is freely available at http://www.protphylo.org. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
How does cognition evolve? Phylogenetic comparative psychology
Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria
2014-01-01
Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850
How does cognition evolve? Phylogenetic comparative psychology.
MacLean, Evan L; Matthews, Luke J; Hare, Brian A; Nunn, Charles L; Anderson, Rindy C; Aureli, Filippo; Brannon, Elizabeth M; Call, Josep; Drea, Christine M; Emery, Nathan J; Haun, Daniel B M; Herrmann, Esther; Jacobs, Lucia F; Platt, Michael L; Rosati, Alexandra G; Sandel, Aaron A; Schroepfer, Kara K; Seed, Amanda M; Tan, Jingzhi; van Schaik, Carel P; Wobber, Victoria
2012-03-01
Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.
Comparative Study of Lectin Domains in Model Species: New Insights into Evolutionary Dynamics
Van Holle, Sofie; De Schutter, Kristof; Eggermont, Lore; Tsaneva, Mariya; Dang, Liuyi; Van Damme, Els J. M.
2017-01-01
Lectins are present throughout the plant kingdom and are reported to be involved in diverse biological processes. In this study, we provide a comparative analysis of the lectin families from model species in a phylogenetic framework. The analysis focuses on the different plant lectin domains identified in five representative core angiosperm genomes (Arabidopsis thaliana, Glycine max, Cucumis sativus, Oryza sativa ssp. japonica and Oryza sativa ssp. indica). The genomes were screened for genes encoding lectin domains using a combination of Basic Local Alignment Search Tool (BLAST), hidden Markov models, and InterProScan analysis. Additionally, phylogenetic relationships were investigated by constructing maximum likelihood phylogenetic trees. The results demonstrate that the majority of the lectin families are present in each of the species under study. Domain organization analysis showed that most identified proteins are multi-domain proteins, owing to the modular rearrangement of protein domains during evolution. Most of these multi-domain proteins are widespread, while others display a lineage-specific distribution. Furthermore, the phylogenetic analyses reveal that some lectin families evolved to be similar to the phylogeny of the plant species, while others share a closer evolutionary history based on the corresponding protein domain architecture. Our results yield insights into the evolutionary relationships and functional divergence of plant lectins. PMID:28587095
Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis
2016-09-02
Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal and could be useful in guiding the choice of phylogenetic markers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Open Reading Frame Phylogenetic Analysis on the Cloud
2013-01-01
Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843
Leaché, Adam D.; Banbury, Barbara L.; Felsenstein, Joseph; de Oca, Adrián nieto-Montes; Stamatakis, Alexandros
2015-01-01
Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the presence of missing data. Phylogenetic analysis of RAD loci requires careful attention to model assumptions, especially if downstream analyses depend on branch lengths. PMID:26227865
Tetrapods on the EDGE: Overcoming data limitations to identify phylogenetic conservation priorities
Gray, Claudia L.; Wearn, Oliver R.; Owen, Nisha R.
2018-01-01
The scale of the ongoing biodiversity crisis requires both effective conservation prioritisation and urgent action. As extinction is non-random across the tree of life, it is important to prioritise threatened species which represent large amounts of evolutionary history. The EDGE metric prioritises species based on their Evolutionary Distinctiveness (ED), which measures the relative contribution of a species to the total evolutionary history of their taxonomic group, and Global Endangerment (GE), or extinction risk. EDGE prioritisations rely on adequate phylogenetic and extinction risk data to generate meaningful priorities for conservation. However, comprehensive phylogenetic trees of large taxonomic groups are extremely rare and, even when available, become quickly out-of-date due to the rapid rate of species descriptions and taxonomic revisions. Thus, it is important that conservationists can use the available data to incorporate evolutionary history into conservation prioritisation. We compared published and new methods to estimate missing ED scores for species absent from a phylogenetic tree whilst simultaneously correcting the ED scores of their close taxonomic relatives. We found that following artificial removal of species from a phylogenetic tree, the new method provided the closest estimates of their “true” ED score, differing from the true ED score by an average of less than 1%, compared to the 31% and 38% difference of the previous methods. The previous methods also substantially under- and over-estimated scores as more species were artificially removed from a phylogenetic tree. We therefore used the new method to estimate ED scores for all tetrapods. From these scores we updated EDGE prioritisation rankings for all tetrapod species with IUCN Red List assessments, including the first EDGE prioritisation for reptiles. Further, we identified criteria to identify robust priority species in an effort to further inform conservation action whilst limiting uncertainty and anticipating future phylogenetic advances. PMID:29641585
Reconstruction of phylogenetic trees of prokaryotes using maximal common intervals.
Heydari, Mahdi; Marashi, Sayed-Amir; Tusserkani, Ruzbeh; Sadeghi, Mehdi
2014-10-01
One of the fundamental problems in bioinformatics is phylogenetic tree reconstruction, which can be used for classifying living organisms into different taxonomic clades. The classical approach to this problem is based on a marker such as 16S ribosomal RNA. Since evolutionary events like genomic rearrangements are not included in reconstructions of phylogenetic trees based on single genes, much effort has been made to find other characteristics for phylogenetic reconstruction in recent years. With the increasing availability of completely sequenced genomes, gene order can be considered as a new solution for this problem. In the present work, we applied maximal common intervals (MCIs) in two or more genomes to infer their distance and to reconstruct their evolutionary relationship. Additionally, measures based on uncommon segments (UCS's), i.e., those genomic segments which are not detected as part of any of the MCIs, are also used for phylogenetic tree reconstruction. We applied these two types of measures for reconstructing the phylogenetic tree of 63 prokaryotes with known COG (clusters of orthologous groups) families. Similarity between the MCI-based (resp. UCS-based) reconstructed phylogenetic trees and the phylogenetic tree obtained from NCBI taxonomy browser is as high as 93.1% (resp. 94.9%). We show that in the case of this diverse dataset of prokaryotes, tree reconstruction based on MCI and UCS outperforms most of the currently available methods based on gene orders, including breakpoint distance and DCJ. We additionally tested our new measures on a dataset of 13 closely-related bacteria from the genus Prochlorococcus. In this case, distances like rearrangement distance, breakpoint distance and DCJ proved to be useful, while our new measures are still appropriate for phylogenetic reconstruction. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Che, Xianli; Zhang, Min; Zhao, Yanyan; Zhang, Qiang; Quan, Qing; Møller, Anders; Zou, Fasheng
2018-01-19
Ecological differences may be related to community component divisions between Oriental (west) and Sino-Japanese (east) realms, and such differences may result in weak geographical breaks in migratory species that are highly mobile. Here, we conducted comparative phylogenetic and functional structure analyses of wintering waterbird communities in southern China across two realms and subsequently examined possible climate drivers of the observed patterns. An analysis based on such highly migratory species is particularly telling because migration is bound to reduce or completely eliminate any divergence between communities. Phylogenetic and functional structure of eastern communities showed over-dispersion while western communities were clustered. Basal phylogenetic and functional turnover of western communities was significant lower than that of eastern communities. The break between eastern and western communities was masked by these two realms. Geographic patterns were related to mean temperature changes and temperature fluctuations, suggesting that temperature may filter waterbird lineages and traits, thus underlying geographical community divisions. These results suggest phylogenetic and functional divisions in southern China, coinciding with biogeography. This study shows that temperature fluctuations constitute an essential mechanism shaping geographical divisions that have largely gone undetected previously, even under climate change.
Treelink: data integration, clustering and visualization of phylogenetic trees.
Allende, Christian; Sohn, Erik; Little, Cedric
2015-12-29
Phylogenetic trees are central to a wide range of biological studies. In many of these studies, tree nodes need to be associated with a variety of attributes. For example, in studies concerned with viral relationships, tree nodes are associated with epidemiological information, such as location, age and subtype. Gene trees used in comparative genomics are usually linked with taxonomic information, such as functional annotations and events. A wide variety of tree visualization and annotation tools have been developed in the past, however none of them are intended for an integrative and comparative analysis. Treelink is a platform-independent software for linking datasets and sequence files to phylogenetic trees. The application allows an automated integration of datasets to trees for operations such as classifying a tree based on a field or showing the distribution of selected data attributes in branches and leafs. Genomic and proteonomic sequences can also be linked to the tree and extracted from internal and external nodes. A novel clustering algorithm to simplify trees and display the most divergent clades was also developed, where validation can be achieved using the data integration and classification function. Integrated geographical information allows ancestral character reconstruction for phylogeographic plotting based on parsimony and likelihood algorithms. Our software can successfully integrate phylogenetic trees with different data sources, and perform operations to differentiate and visualize those differences within a tree. File support includes the most popular formats such as newick and csv. Exporting visualizations as images, cluster outputs and genomic sequences is supported. Treelink is available as a web and desktop application at http://www.treelinkapp.com .
Assigning protein functions by comparative genome analysis protein phylogenetic profiles
Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.
2003-05-13
A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
Wang, Cheng-Long; Ding, Meng-Qi; Zou, Chen-Yan; Zhu, Xue-Mei; Tang, Yu; Zhou, Mei-Liang; Shao, Ji-Rong
2017-07-26
Buckwheat is a nutritional and economically crop belonging to Polygonaceae, Fagopyrum. To better understand the mutation patterns and evolution trend in the chloroplast (cp) genome of buckwheat, and found sufficient number of variable regions to explore the phylogenetic relationships of this genus, two complete cp genomes of buckwheat including Fagopyrum dibotrys (F. dibotrys) and Fagopyrum luojishanense (F. luojishanense) were sequenced, and other two Fagopyrum cp genomes were used for comparative analysis. After morphological analysis, the main difference among these buckwheat were height, leaf shape, seeds and flower type. F. luojishanense was distinguishable from the cultivated species easily. Although the F. dibotrys and two cultivated species has some similarity, they different in habit and component contents. The cp genome of F. dibotrys was 159,320 bp while the F. luojishanense was 159,265 bp. 48 and 61 SSRs were found in F. dibotrys and F. luojishanense respectively. Meanwhile, 10 highly variable regions among these buckwheat species were located precisely. The phylogenetic relationships among four Fagopyrum species based on complete cp genomes was showed. The results suggested that F. dibotrys is more closely related to Fagopyrum tataricum. These data provided valuable genetic information for Fagopyrum species identification, taxonomy, phylogenetic study and molecular breeding.
A three-parameter model for classifying anurans into four genera based on advertisement calls.
Gingras, Bruno; Fitch, William Tecumseh
2013-01-01
The vocalizations of anurans are innate in structure and may therefore contain indicators of phylogenetic history. Thus, advertisement calls of species which are more closely related phylogenetically are predicted to be more similar than those of distant species. This hypothesis was evaluated by comparing several widely used machine-learning algorithms. Recordings of advertisement calls from 142 species belonging to four genera were analyzed. A logistic regression model, using mean values for dominant frequency, coefficient of variation of root-mean square energy, and spectral flux, correctly classified advertisement calls with regard to genus with an accuracy above 70%. Similar accuracy rates were obtained using these parameters with a support vector machine model, a K-nearest neighbor algorithm, and a multivariate Gaussian distribution classifier, whereas a Gaussian mixture model performed slightly worse. In contrast, models based on mel-frequency cepstral coefficients did not fare as well. Comparable accuracy levels were obtained on out-of-sample recordings from 52 of the 142 original species. The results suggest that a combination of low-level acoustic attributes is sufficient to discriminate efficiently between the vocalizations of these four genera, thus supporting the initial premise and validating the use of high-throughput algorithms on animal vocalizations to evaluate phylogenetic hypotheses.
Homology and the optimization of DNA sequence data
NASA Technical Reports Server (NTRS)
Wheeler, W.
2001-01-01
Three methods of nucleotide character analysis are discussed. Their implications for molecular sequence homology and phylogenetic analysis are compared. The criterion of inter-data set congruence, both character based and topological, are applied to two data sets to elucidate and potentially discriminate among these parsimony-based ideas. c2001 The Willi Hennig Society.
A Well-Resolved Phylogeny of the Trees of Puerto Rico Based on DNA Barcode Sequence Data
Muscarella, Robert; Uriarte, María; Erickson, David L.; Swenson, Nathan G.; Zimmerman, Jess K.; Kress, W. John
2014-01-01
Background The use of phylogenetic information in community ecology and conservation has grown in recent years. Two key issues for community phylogenetics studies, however, are (i) low terminal phylogenetic resolution and (ii) arbitrarily defined species pools. Methodology/principal findings We used three DNA barcodes (plastid DNA regions rbcL, matK, and trnH-psbA) to infer a phylogeny for 527 native and naturalized trees of Puerto Rico, representing the vast majority of the entire tree flora of the island (89%). We used a maximum likelihood (ML) approach with and without a constraint tree that enforced monophyly of recognized plant orders. Based on 50% consensus trees, the ML analyses improved phylogenetic resolution relative to a comparable phylogeny generated with Phylomatic (proportion of internal nodes resolved: constrained ML = 74%, unconstrained ML = 68%, Phylomatic = 52%). We quantified the phylogenetic composition of 15 protected forests in Puerto Rico using the constrained ML and Phylomatic phylogenies. We found some evidence that tree communities in areas of high water stress were relatively phylogenetically clustered. Reducing the scale at which the species pool was defined (from island to soil types) changed some of our results depending on which phylogeny (ML vs. Phylomatic) was used. Overall, the increased terminal resolution provided by the ML phylogeny revealed additional patterns that were not observed with a less-resolved phylogeny. Conclusions/significance With the DNA barcode phylogeny presented here (based on an island-wide species pool), we show that a more fully resolved phylogeny increases power to detect nonrandom patterns of community composition in several Puerto Rican tree communities. Especially if combined with additional information on species functional traits and geographic distributions, this phylogeny will (i) facilitate stronger inferences about the role of historical processes in governing the assembly and composition of Puerto Rican forests, (ii) provide insight into Caribbean biogeography, and (iii) aid in incorporating evolutionary history into conservation planning. PMID:25386879
A well-resolved phylogeny of the trees of Puerto Rico based on DNA barcode sequence data.
Muscarella, Robert; Uriarte, María; Erickson, David L; Swenson, Nathan G; Zimmerman, Jess K; Kress, W John
2014-01-01
The use of phylogenetic information in community ecology and conservation has grown in recent years. Two key issues for community phylogenetics studies, however, are (i) low terminal phylogenetic resolution and (ii) arbitrarily defined species pools. We used three DNA barcodes (plastid DNA regions rbcL, matK, and trnH-psbA) to infer a phylogeny for 527 native and naturalized trees of Puerto Rico, representing the vast majority of the entire tree flora of the island (89%). We used a maximum likelihood (ML) approach with and without a constraint tree that enforced monophyly of recognized plant orders. Based on 50% consensus trees, the ML analyses improved phylogenetic resolution relative to a comparable phylogeny generated with Phylomatic (proportion of internal nodes resolved: constrained ML = 74%, unconstrained ML = 68%, Phylomatic = 52%). We quantified the phylogenetic composition of 15 protected forests in Puerto Rico using the constrained ML and Phylomatic phylogenies. We found some evidence that tree communities in areas of high water stress were relatively phylogenetically clustered. Reducing the scale at which the species pool was defined (from island to soil types) changed some of our results depending on which phylogeny (ML vs. Phylomatic) was used. Overall, the increased terminal resolution provided by the ML phylogeny revealed additional patterns that were not observed with a less-resolved phylogeny. With the DNA barcode phylogeny presented here (based on an island-wide species pool), we show that a more fully resolved phylogeny increases power to detect nonrandom patterns of community composition in several Puerto Rican tree communities. Especially if combined with additional information on species functional traits and geographic distributions, this phylogeny will (i) facilitate stronger inferences about the role of historical processes in governing the assembly and composition of Puerto Rican forests, (ii) provide insight into Caribbean biogeography, and (iii) aid in incorporating evolutionary history into conservation planning.
Forti, Lucas Rodriguez; Lingnau, Rodrigo; Encarnação, Lais Carvalho; Bertoluci, Jaime; Toledo, Luís Felipe
2017-01-01
Phenotypic traits, such as the frog advertisement call, are generally correlated with interspecific genetic variation, and, as a consequence of strong sexual selection, these behaviors may carry a phylogenetic signal. However, variation in acoustic traits is not always correlated with genetic differences between populations (intraspecific variation); phenotypic plasticity and environmental variables may explain part of such variation. For example, local processes can affect acoustic properties in different lineages due to differences in physical structure, climatic conditions, and biotic interactions, particularly when populations are isolated. However, acoustic traits can be used to test phylogenetic hypotheses. We analyzed the advertisement calls of Dendropsophus elegans males from 18 sites and compared them with those of four closely related congeneric species, in order to test for differences between inter and intraspecific variation. We analyzed 451 calls of 45 males of these five species. Because males from distant sites were grouped together without population congruence, differences found in advertisement calls among individuals were not correlated with phylogeographical clades. Phylogenetic and cluster analyses of the D. elegans clades and those of closely related species grouped all five species into the same topology, as reported by previous molecular and morphological phylogenies. However, the topology of the D. elegans phylogeographical clades did not match the topology previously reported. Acoustic communication in D. elegans seems to be conserved among populations, and the phylogeographical history of the species does not explain the variation among lineages in call properties, despite some congruent phylogenetic signals evident at the species level. Based on molecular clocks retrieved from the literature, it seems that more than 6.5 million years of divergence (late Miocene) are necessary to allow significant changes to occur in the acoustic properties of these treefrog calls, making it possible to recover their phylogenetic history only based on acoustic evidence.
Slabbinck, Bram; Waegeman, Willem; Dawyndt, Peter; De Vos, Paul; De Baets, Bernard
2010-01-30
Machine learning techniques have shown to improve bacterial species classification based on fatty acid methyl ester (FAME) data. Nonetheless, FAME analysis has a limited resolution for discrimination of bacteria at the species level. In this paper, we approach the species classification problem from a taxonomic point of view. Such a taxonomy or tree is typically obtained by applying clustering algorithms on FAME data or on 16S rRNA gene data. The knowledge gained from the tree can then be used to evaluate FAME-based classifiers, resulting in a novel framework for bacterial species classification. In view of learning in a taxonomic framework, we consider two types of trees. First, a FAME tree is constructed with a supervised divisive clustering algorithm. Subsequently, based on 16S rRNA gene sequence analysis, phylogenetic trees are inferred by the NJ and UPGMA methods. In this second approach, the species classification problem is based on the combination of two different types of data. Herein, 16S rRNA gene sequence data is used for phylogenetic tree inference and the corresponding binary tree splits are learned based on FAME data. We call this learning approach 'phylogenetic learning'. Supervised Random Forest models are developed to train the classification tasks in a stratified cross-validation setting. In this way, better classification results are obtained for species that are typically hard to distinguish by a single or flat multi-class classification model. FAME-based bacterial species classification is successfully evaluated in a taxonomic framework. Although the proposed approach does not improve the overall accuracy compared to flat multi-class classification, it has some distinct advantages. First, it has better capabilities for distinguishing species on which flat multi-class classification fails. Secondly, the hierarchical classification structure allows to easily evaluate and visualize the resolution of FAME data for the discrimination of bacterial species. Summarized, by phylogenetic learning we are able to situate and evaluate FAME-based bacterial species classification in a more informative context.
2010-01-01
Background Machine learning techniques have shown to improve bacterial species classification based on fatty acid methyl ester (FAME) data. Nonetheless, FAME analysis has a limited resolution for discrimination of bacteria at the species level. In this paper, we approach the species classification problem from a taxonomic point of view. Such a taxonomy or tree is typically obtained by applying clustering algorithms on FAME data or on 16S rRNA gene data. The knowledge gained from the tree can then be used to evaluate FAME-based classifiers, resulting in a novel framework for bacterial species classification. Results In view of learning in a taxonomic framework, we consider two types of trees. First, a FAME tree is constructed with a supervised divisive clustering algorithm. Subsequently, based on 16S rRNA gene sequence analysis, phylogenetic trees are inferred by the NJ and UPGMA methods. In this second approach, the species classification problem is based on the combination of two different types of data. Herein, 16S rRNA gene sequence data is used for phylogenetic tree inference and the corresponding binary tree splits are learned based on FAME data. We call this learning approach 'phylogenetic learning'. Supervised Random Forest models are developed to train the classification tasks in a stratified cross-validation setting. In this way, better classification results are obtained for species that are typically hard to distinguish by a single or flat multi-class classification model. Conclusions FAME-based bacterial species classification is successfully evaluated in a taxonomic framework. Although the proposed approach does not improve the overall accuracy compared to flat multi-class classification, it has some distinct advantages. First, it has better capabilities for distinguishing species on which flat multi-class classification fails. Secondly, the hierarchical classification structure allows to easily evaluate and visualize the resolution of FAME data for the discrimination of bacterial species. Summarized, by phylogenetic learning we are able to situate and evaluate FAME-based bacterial species classification in a more informative context. PMID:20113515
Miadlikowska, Jolanta; Kauff, Frank; Högnabba, Filip; Oliver, Jeffrey C.; Molnár, Katalin; Fraker, Emily; Gaya, Ester; Hafellner, Josef; Hofstetter, Valérie; Gueidan, Cécile; Otálora, Mónica A.G.; Hodkinson, Brendan; Kukwa, Martin; Lücking, Robert; Björk, Curtis; Sipman, Harrie J.M.; Burgaz, Ana Rosa; Thell, Arne; Passo, Alfredo; Myllys, Leena; Goward, Trevor; Fernández-Brime, Samantha; Hestmark, Geir; Lendemer, James; Lumbsch, H. Thorsten; Schmull, Michaela; Schoch, Conrad; Sérusiaux, Emmanuël; Maddison, David R.; Arnold, A. Elizabeth; Lutzoni, François; Stenroos, Soili
2014-01-01
The Lecanoromycetes is the largest class of lichenized Fungi, and one of the most species-rich classes in the kingdom. Here we provide a multigene phylogenetic synthesis (using three ribosomal RNA-coding and two protein-coding genes) of the Lecanoromycetes based on 642 newly generated and 3329 publicly available sequences representing 1139 taxa, 317 genera, 66 families, 17 orders and five subclasses (four currently recognized: Acarosporomycetidae, Lecanoromycetidae, Ostropomycetidae, Umbilicariomycetidae; and one provisionarily recognized, ‘Candelariomycetidae’). Maximum likelihood phylogenetic analyses on four multigene datasets assembled using a cumulative supermatrix approach with a progressively higher number of species and missing data (5-gene, 5+4-gene, 5+4+3-gene and 5+4+3+2-gene datasets) show that the current classification includes non-monophyletic taxa at various ranks, which need to be recircumscribed and require revisionary treatments based on denser taxon sampling and more loci. Two newly circumscribed orders (Arctomiales and Hymeneliales in the Ostropomycetidae) and three families (Ramboldiaceae and Psilolechiaceae in the Lecanorales, and Strangosporaceae in the Lecanoromycetes inc. sed.) are introduced. The potential resurrection of the families Eigleraceae and Lopadiaceae is considered here to alleviate phylogenetic and classification disparities. An overview of the photobionts associated with the main fungal lineages in the Lecanoromycetes based on available published records is provided. A revised schematic classification at the family level in the phylogenetic context of widely accepted and newly revealed relationships across Lecanoromycetes is included. The cumulative addition of taxa with an increasing amount of missing data (i.e., a cumulative supermatrix approach, starting with taxa for which sequences were available for all five targeted genes and ending with the addition of taxa for which only two genes have been sequenced) revealed relatively stable relationships for many families and orders. However, the increasing number of taxa without the addition of more loci also resulted in an expected substantial loss of phylogenetic resolving power and support (especially for deep phylogenetic relationships), potentially including the misplacements of several taxa. Future phylogenetic analyses should include additional single copy protein-coding markers in order to improve the tree of the Lecanoromycetes. As part of this study, a new module (“Hypha”) of the freely available Mesquite software was developed to compare and display the internodal support values derived from this cumulative supermatrix approach. PMID:24747130
Is geography an accurate predictor of evolutionary history in the millipede family Xystodesmidae?
Marek, Paul E.
2017-01-01
For the past several centuries, millipede taxonomists have used the morphology of male copulatory structures (modified legs called gonopods), which are strongly variable and suggestive of species-level differences, as a source to understand taxon relationships. Millipedes in the family Xystodesmidae are blind, dispersal-limited and have narrow habitat requirements. Therefore, geographical proximity may instead be a better predictor of evolutionary relationship than morphology, especially since gonopodal anatomy is extremely divergent and similarities may be masked by evolutionary convergence. Here we provide a phylogenetics-based test of the power of morphological versus geographical character sets for resolving phylogenetic relationships in xystodesmid millipedes. Molecular data from 90 species-group taxa in the family were included in a six-gene phylogenetic analysis to provide the basis for comparing trees generated from these alternative character sets. The molecular phylogeny was compared to topologies representing three hypotheses: (1) a prior classification formulated using morphological and geographical data, (2) hierarchical groupings derived from Euclidean geographical distance, and (3) one based solely on morphological data. Euclidean geographical distance was not found to be a better predictor of evolutionary relationship than the prior classification, the latter of which was the most similar to the molecular topology. However, all three of the alternative topologies were highly divergent (Bayes factor >10) from the molecular topology, with the tree inferred exclusively from morphology being the most divergent. The results of this analysis show that a high degree of morphological convergence from substantial gonopod shape divergence generated spurious phylogenetic relationships. These results indicate the impact that a high degree of morphological homoplasy may have had on prior treatments of the family. Using the results of our phylogenetic analysis, we make several changes to the classification of the family, including transferring the rare state-threatened species Sigmoria whiteheadi Shelley, 1986 to the genus Apheloria Chamberlin, 1921—a relationship not readily apparent based on morphology alone. We show that while gonopod differences are a premier source of taxonomic characters to diagnose species pairwise, the traits should be viewed critically as taxonomic features uniting higher levels. PMID:29038750
Pereira, Edson H L; Reis, Roberto E
2017-05-11
A phylogenetic study of the Loricariidae with emphasis on the Neoplecostominae is presented based on a maximum parsimony analysis of 268 phenotypic characters encompassing osteology, arthrology, and external morphology. Results support previous hypotheses of the monophyly of the Neoplecostominae and each of the included genera: Hirtella, Isbrueckerichthys, Kronichthys, Neoplecostomus, Pareiorhaphis, and Pareiorhina. In addition, previously undiscovered diversity was revealed within the subfamily as an additional genus-level taxon, herein described as Euryochus. Relationships among neoplecostomine genera are: (Kronichthys (Euryochus ((Hirtella + Pareiorhaphis) (Pareiorhina (Isbrueckerichthys + Neoplecostomus))))). Additional undescribed diversity was also detected among most neoplecostomine genera and the Hypoptopomatinae. In addition, recently discovered genera Nannoplecostomus and Microplecostomus were included in the analysis, and were identified as sequential sister-taxa to Neoplecostominae + Hypoptopomatinae, which are currently not included in any subfamily and regarded as incertae sedis in Loricariidae. The three species of Lithogenes were included in an encompassing phylogenetic analysis for the first time, and were identified as a monophyletic unit and sister group to all remaining loricariids. The other loricariid subfamilies were also corroborated as monophyletic, and presented the following interrelationships (Lithogeninae (Delturinae (Loricariinae (Hypostominae (Nannoplecostomus (Microplecostomus (Hypoptopomatinae + Neoplecostominae). The Neoplecostominae and its genera are phylogenetically diagnosed, and hypothesized relationships are compared to those of previous morphological and molecular phylogenetic studies.
Phylo-VISTA: Interactive visualization of multiple DNA sequence alignments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shah, Nameeta; Couronne, Olivier; Pennacchio, Len A.
The power of multi-sequence comparison for biological discovery is well established. The need for new capabilities to visualize and compare cross-species alignment data is intensified by the growing number of genomic sequence datasets being generated for an ever-increasing number of organisms. To be efficient these visualization algorithms must support the ability to accommodate consistently a wide range of evolutionary distances in a comparison framework based upon phylogenetic relationships. Results: We have developed Phylo-VISTA, an interactive tool for analyzing multiple alignments by visualizing a similarity measure for multiple DNA sequences. The complexity of visual presentation is effectively organized using a frameworkmore » based upon interspecies phylogenetic relationships. The phylogenetic organization supports rapid, user-guided interspecies comparison. To aid in navigation through large sequence datasets, Phylo-VISTA leverages concepts from VISTA that provide a user with the ability to select and view data at varying resolutions. The combination of multiresolution data visualization and analysis, combined with the phylogenetic framework for interspecies comparison, produces a highly flexible and powerful tool for visual data analysis of multiple sequence alignments. Availability: Phylo-VISTA is available at http://www-gsd.lbl. gov/phylovista. It requires an Internet browser with Java Plugin 1.4.2 and it is integrated into the global alignment program LAGAN at http://lagan.stanford.edu« less
A Unique Box in 28S rRNA Is Shared by the Enigmatic Insect Order Zoraptera and Dictyoptera
Dang, Kai; Wu, Haoyang; Wang, Ying; Xie, Qiang; Bu, Wenjun
2013-01-01
The position of the Zoraptera remains one of the most challenging and uncertain concerns in ordinal-level phylogenies of the insects. Zoraptera have been viewed as having a close relationship with five different groups of Polyneoptera, or as being allied to the Paraneoptera or even Holometabola. Although rDNAs have been widely used in phylogenetic studies of insects, the application of the complete 28S rDNA are still scattered in only a few orders. In this study, a secondary structure model of the complete 28S rRNAs of insects was reconstructed based on all orders of Insecta. It was found that one length-variable region, D3-4, is particularly distinctive. The length and/or sequence of D3-4 is conservative within each order of Polyneoptera, but it can be divided into two types between the different orders of the supercohort, of which the enigmatic order Zoraptera and Dictyoptera share one type, while the remaining orders of Polyneoptera share the other. Additionally, independent evidence from phylogenetic results support the clade (Zoraptera+Dictyoptera) as well. Thus, the similarity of D3-4 between Zoraptera and Dictyoptera can serve as potentially valuable autapomorphy or synapomorphy in phylogeny reconstruction. The clades of (Plecoptera+Dermaptera) and ((Grylloblattodea+Mantophasmatodea)+(Embiodea+Phasmatodea)) were also recovered in the phylogenetic study. In addition, considering the other studies based on rDNAs, this study reached the highest congruence with previous phylogenetic studies of Holometabola based on nuclear protein coding genes or morphology characters. Future comparative studies of secondary structures across deep divergences and additional taxa are likely to reveal conserved patterns, structures and motifs that can provide support for major phylogenetic lineages. PMID:23301099
Phylobetadiversity among forest types in the Brazilian Atlantic Forest complex.
Duarte, Leandro Da Silva; Bergamin, Rodrigo Scarton; Marcilio-Silva, Vinícius; Seger, Guilherme Dubal Dos Santos; Marques, Márcia Cristina Mendes
2014-01-01
Phylobetadiversity is defined as the phylogenetic resemblance between communities or biomes. Analyzing phylobetadiversity patterns among different vegetation physiognomies within a single biome is crucial to understand the historical affinities between them. Based on the widely accepted idea that different forest physiognomies within the Southern Brazilian Atlantic Forest constitute different facies of a single biome, we hypothesize that more recent phylogenetic nodes should drive phylobetadiversity gradients between the different forest types within the Atlantic Forest, as the phylogenetic divergence among those forest types is biogeographically recent. We compiled information from 206 checklists describing the occurrence of shrub/tree species across three different forest physiognomies within the Southern Brazilian Atlantic Forest (Dense, Mixed and Seasonal forests). We analyzed intra-site phylogenetic structure (phylogenetic diversity, net relatedness index and nearest taxon index) and phylobetadiversity between plots located at different forest types, using five different methods differing in sensitivity to either basal or terminal nodes (phylogenetic fuzzy weighting, COMDIST, COMDISTNT, UniFrac and Rao's H). Mixed forests showed higher phylogenetic diversity and overdispersion than the other forest types. Furthermore, all forest types differed from each other in relation phylobetadiversity patterns, particularly when phylobetadiversity methods more sensitive to terminal nodes were employed. Mixed forests tended to show higher phylogenetic differentiation to Dense and Seasonal forests than these latter from each other. The higher phylogenetic diversity and phylobetadiversity levels found in Mixed forests when compared to the others likely result from the biogeographical origin of several taxa occurring in these forests. On one hand, Mixed forests shelter several temperate taxa, like the conifers Araucaria and Podocarpus. On the other hand, tropical groups, like Myrtaceae, are also very representative of this forest type. We point out to the need of more attention to Mixed forests as a conservation target within the Brazilian Atlantic Forest given their high phylogenetic uniqueness.
Phylobetadiversity among Forest Types in the Brazilian Atlantic Forest Complex
Duarte, Leandro Da Silva; Bergamin, Rodrigo Scarton; Marcilio-Silva, Vinícius; Seger, Guilherme Dubal Dos Santos; Marques, Márcia Cristina Mendes
2014-01-01
Phylobetadiversity is defined as the phylogenetic resemblance between communities or biomes. Analyzing phylobetadiversity patterns among different vegetation physiognomies within a single biome is crucial to understand the historical affinities between them. Based on the widely accepted idea that different forest physiognomies within the Southern Brazilian Atlantic Forest constitute different facies of a single biome, we hypothesize that more recent phylogenetic nodes should drive phylobetadiversity gradients between the different forest types within the Atlantic Forest, as the phylogenetic divergence among those forest types is biogeographically recent. We compiled information from 206 checklists describing the occurrence of shrub/tree species across three different forest physiognomies within the Southern Brazilian Atlantic Forest (Dense, Mixed and Seasonal forests). We analyzed intra-site phylogenetic structure (phylogenetic diversity, net relatedness index and nearest taxon index) and phylobetadiversity between plots located at different forest types, using five different methods differing in sensitivity to either basal or terminal nodes (phylogenetic fuzzy weighting, COMDIST, COMDISTNT, UniFrac and Rao’s H). Mixed forests showed higher phylogenetic diversity and overdispersion than the other forest types. Furthermore, all forest types differed from each other in relation phylobetadiversity patterns, particularly when phylobetadiversity methods more sensitive to terminal nodes were employed. Mixed forests tended to show higher phylogenetic differentiation to Dense and Seasonal forests than these latter from each other. The higher phylogenetic diversity and phylobetadiversity levels found in Mixed forests when compared to the others likely result from the biogeographical origin of several taxa occurring in these forests. On one hand, Mixed forests shelter several temperate taxa, like the conifers Araucaria and Podocarpus. On the other hand, tropical groups, like Myrtaceae, are also very representative of this forest type. We point out to the need of more attention to Mixed forests as a conservation target within the Brazilian Atlantic Forest given their high phylogenetic uniqueness. PMID:25121495
Pan-genome and phylogeny of Bacillus cereus sensu lato.
Bazinet, Adam L
2017-08-02
Bacillus cereus sensu lato (s. l.) is an ecologically diverse bacterial group of medical and agricultural significance. In this study, I use publicly available genomes and novel bioinformatic workflows to characterize the B. cereus s. l. pan-genome and perform the largest phylogenetic and population genetic analyses of this group to date in terms of the number of genes and taxa included. With these fundamental data in hand, I identify genes associated with particular phenotypic traits (i.e., "pan-GWAS" analysis), and quantify the degree to which taxa sharing common attributes are phylogenetically clustered. A rapid k-mer based approach (Mash) was used to create reduced representations of selected Bacillus genomes, and a fast distance-based phylogenetic analysis of this data (FastME) was performed to determine which species should be included in B. cereus s. l. The complete genomes of eight B. cereus s. l. species were annotated de novo with Prokka, and these annotations were used by Roary to produce the B. cereus s. l. pan-genome. Scoary was used to associate gene presence and absence patterns with various phenotypes. The orthologous protein sequence clusters produced by Roary were filtered and used to build HaMStR databases of gene models that were used in turn to construct phylogenetic data matrices. Phylogenetic analyses used RAxML, DendroPy, ClonalFrameML, PAUP*, and SplitsTree. Bayesian model-based population genetic analysis assigned taxa to clusters using hierBAPS. The genealogical sorting index was used to quantify the phylogenetic clustering of taxa sharing common attributes. The B. cereus s. l. pan-genome currently consists of ≈60,000 genes, ≈600 of which are "core" (common to at least 99% of taxa sampled). Pan-GWAS analysis revealed genes associated with phenotypes such as isolation source, oxygen requirement, and ability to cause diseases such as anthrax or food poisoning. Extensive phylogenetic analyses using an unprecedented amount of data produced phylogenies that were largely concordant with each other and with previous studies. Phylogenetic support as measured by bootstrap probabilities increased markedly when all suitable pan-genome data was included in phylogenetic analyses, as opposed to when only core genes were used. Bayesian population genetic analysis recommended subdividing the three major clades of B. cereus s. l. into nine clusters. Taxa sharing common traits and species designations exhibited varying degrees of phylogenetic clustering. All phylogenetic analyses recapitulated two previously used classification systems, and taxa were consistently assigned to the same major clade and group. By including accessory genes from the pan-genome in the phylogenetic analyses, I produced an exceptionally well-supported phylogeny of 114 complete B. cereus s. l. genomes. The best-performing methods were used to produce a phylogeny of all 498 publicly available B. cereus s. l. genomes, which was in turn used to compare three different classification systems and to test the monophyly status of various B. cereus s. l. species. The majority of the methodology used in this study is generic and could be leveraged to produce pan-genome estimates and similarly robust phylogenetic hypotheses for other bacterial groups.
Poe, Steven; Nieto-Montes de Oca, Adrián; Torres-Carvajal, Omar; De Queiroz, Kevin; Velasco, Julián A; Truett, Brad; Gray, Levi N; Ryan, Mason J; Köhler, Gunther; Ayala-Varela, Fernando; Latella, Ian
2017-09-01
Anolis lizards (anoles) are textbook study organisms in evolution and ecology. Although several topics in evolutionary biology have been elucidated by the study of anoles, progress in some areas has been hampered by limited phylogenetic information on this group. Here, we present a phylogenetic analysis of all 379 extant species of Anolis, with new phylogenetic data for 139 species including new DNA data for 101 species. We use the resulting estimates as a basis for defining anole clade names under the principles of phylogenetic nomenclature and to examine the biogeographic history of anoles. Our new taxonomic treatment achieves the supposed advantages of recent subdivisions of anoles that employed ranked Linnaean-based nomenclature while avoiding the pitfalls of those approaches regarding artificial constraints imposed by ranks. Our biogeographic analyses demonstrate complexity in the dispersal history of anoles, including multiple crossings of the Isthmus of Panama, two invasions of the Caribbean, single invasions to Jamaica and Cuba, and a single evolutionary dispersal from the Caribbean to the mainland that resulted in substantial anole diversity. Our comprehensive phylogenetic estimate of anoles should prove useful for rigorous testing of many comparative evolutionary hypotheses. [Anoles; biogeography; lizards; Neotropics; phylogeny; taxonomy]. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Phylogenetic diversity in the genus Bacillus as seen by 16S rRNA sequencing studies
NASA Technical Reports Server (NTRS)
Rossler, D.; Ludwig, W.; Schleifer, K. H.; Lin, C.; McGill, T. J.; Wisotzkey, J. D.; Jurtshuk, P. Jr; Fox, G. E.
1991-01-01
Comparative sequence analysis of 16S ribosomal (r)RNAs or DNAs of Bacillus alvei, B. laterosporus, B. macerans, B. macquariensis, B. polymyxa and B. stearothermophilus revealed the phylogenetic diversity of the genus Bacillus. Based on the presently available data set of 16S rRNA sequences from bacilli and relatives at least four major "Bacillus clusters" can be defined: a "Bacillus subtilis cluster" including B. stearothermophilus, a "B. brevis cluster" including B. laterosporus, a "B. alvei cluster" including B. macerans, B. maquariensis and B. polymyxa and a "B. cycloheptanicus branch".
Speciation and Neutral Molecular Evolution in One-Dimensional Closed Population
NASA Astrophysics Data System (ADS)
Semovski, Sergei V.; Bukin, Yuri S.; Sherbakov, Dmitry Yu.
Models are presented suitable for a description of speciation processes arising due to reproductive isolation depending on genetic distance. The main attention is paid to the model of a one-dimensional closed population, which describes the evolution of littoral benthic organisms. In order to correspond the modeling results to the results obtained in the course of experimental phylogenetic studies, all individual-based models described here involve neutrally evolving and maternally inherited DNA sequence. Sub-samples of the resulting sequences were used for a posteriori phylogenetic inferences which then were compared to the "true" evolutionary histories.
Phylogenetic tree construction based on 2D graphical representation
NASA Astrophysics Data System (ADS)
Liao, Bo; Shan, Xinzhou; Zhu, Wen; Li, Renfa
2006-04-01
A new approach based on the two-dimensional (2D) graphical representation of the whole genome sequence [Bo Liao, Chem. Phys. Lett., 401(2005) 196.] is proposed to analyze the phylogenetic relationships of genomes. The evolutionary distances are obtained through measuring the differences among the 2D curves. The fuzzy theory is used to construct phylogenetic tree. The phylogenetic relationships of H5N1 avian influenza virus illustrate the utility of our approach.
The problem and promise of scale dependency in community phylogenetics.
Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K
2006-10-01
The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.
Shi, Qing-Hui; Sun, Xiao-Yan; Wang, Yun-Liang; Hao, Jia-Sheng; Yang, Qun
2015-01-01
Nymphalidae is the largest family of butterflies with their phylogenetic relationships not adequately approached to date. The mitochondrial genomes (mitogenomes) of 11 new nymphalid species were reported and a comparative mitogenomic analysis was conducted together with other 22 available nymphalid mitogenomes. A phylogenetic analysis of the 33 species from all 13 currently recognized nymphalid subfamilies was done based on the mitogenomic data set with three Lycaenidae species as the outgroups. The mitogenome comparison showed that the eleven new mitogenomes were similar with those of other butterflies in gene content and order. The reconstructed phylogenetic trees reveal that the nymphalids are made up of five major clades (the nymphaline, heliconiine, satyrine, danaine and libytheine clades), with sister relationship between subfamilies Cyrestinae and Biblidinae, and most likely between subfamilies Morphinae and Satyrinae. This whole mitogenome-based phylogeny is generally congruent with those of former studies based on nuclear-gene and mitogenomic analyses, but differs considerably from the result of morphological cladistic analysis, such as the basal position of Libytheinae in morpho-phylogeny is not confirmed in molecular studies. However, we found that the mitogenomic phylogeny established herein is compatible with selected morphological characters (including developmental and adult morpho-characters).
Garcillán-Barcia, M Pilar; Ruiz del Castillo, Belén; Alvarado, Andrés; de la Cruz, Fernando; Martínez-Martínez, Luis
2015-01-01
Degenerate Primer MOB Typing is a PCR-based protocol for the classification of γ-proteobacterial transmissible plasmids in five phylogenetic relaxase MOB families. It was applied to a multiresistant E. coli collection, previously characterized by PCR-based replicon-typing, in order to compare both methods. Plasmids from 32 clinical isolates of multiresistant E. coli (19 extended spectrum beta-lactamase producers and 13 non producers) and their transconjugants were analyzed. A total of 95 relaxases were detected, at least one per isolate, underscoring the high potential of these strains for antibiotic-resistance transmission. MOBP12 and MOBF12 plasmids were the most abundant. Most MOB subfamilies detected were present in both subsets of the collection, indicating a shared mobilome among multiresistant E. coli. The plasmid profile obtained by both methods was compared, which provided useful data upon which decisions related to the implementation of detection methods in the clinic could be based. The phylogenetic depth at which replicon and MOB-typing classify plasmids is different. While replicon-typing aims at plasmid replication regions with non-degenerate primers, MOB-typing classifies plasmids into relaxase subfamilies using degenerate primers. As a result, MOB-typing provides a deeper phylogenetic depth than replicon-typing and new plasmid groups are uncovered. Significantly, MOB typing identified 17 plasmids and an integrative and conjugative element, which were not detected by replicon-typing. Four of these backbones were different from previously reported elements. Copyright © 2014 Elsevier Inc. All rights reserved.
An ordination of life histories using morphological proxies: capital vs. income breeding in insects.
Davis, Robert B; Javoiš, Juhan; Kaasik, Ants; Õunap, Erki; Tammaru, Toomas
2016-08-01
Predictive classifications of life histories are essential for evolutionary ecology. While attempts to apply a single approach to all organisms may be overambitious, recent advances suggest that more narrow ordination schemes can be useful. However, these schemes mostly lack easily observable proxies of the position of a species on respective axes. It has been proposed that, in insects, the degree of capital (vs. income) breeding, reflecting the importance of adult feeding for reproduction, correlates with various ecological traits at the level of among-species comparison. We sought to prove these ideas via rigorous phylogenetic comparative analyses. We used experimentally derived life-history data for 57 species of European Geometridae (Lepidoptera), and an original phylogenetic reconstruction. The degree of capital breeding was estimated based on morphological proxies, including relative abdomen size of females. Applying Brownian-motion-based comparative analyses (with an original update to include error estimates), we demonstrated the associations between the degree of capital breeding and larval diet breadth, sexual size dimorphism, and reproductive season. Ornstein-Uhlenbeck model based phylogenetic analysis suggested a causal relationship between the degree of capital breeding and diet breadth. Our study indicates that the gradation from capital to income breeding is an informative axis to ordinate life-history strategies in flying insects which are affected by the fecundity vs. mobility trade off, with the availability of easy to record proxies contributing to its predictive power in practical contexts. © 2016 by the Ecological Society of America.
Comparative Analysis of Begonia Plastid Genomes and Their Utility for Species-Level Phylogenetics
Harrison, Nicola; Harrison, Richard J.
2016-01-01
Recent, rapid radiations make species-level phylogenetics difficult to resolve. We used a multiplexed, high-throughput sequencing approach to identify informative genomic regions to resolve phylogenetic relationships at low taxonomic levels in Begonia from a survey of sixteen species. A long-range PCR method was used to generate draft plastid genomes to provide a strong phylogenetic backbone, identify fast evolving regions and provide informative molecular markers for species-level phylogenetic studies in Begonia. PMID:27058864
A phylogenetic Kalman filter for ancestral trait reconstruction using molecular data.
Lartillot, Nicolas
2014-02-15
Correlation between life history or ecological traits and genomic features such as nucleotide or amino acid composition can be used for reconstructing the evolutionary history of the traits of interest along phylogenies. Thus far, however, such ancestral reconstructions have been done using simple linear regression approaches that do not account for phylogenetic inertia. These reconstructions could instead be seen as a genuine comparative regression problem, such as formalized by classical generalized least-square comparative methods, in which the trait of interest and the molecular predictor are represented as correlated Brownian characters coevolving along the phylogeny. Here, a Bayesian sampler is introduced, representing an alternative and more efficient algorithmic solution to this comparative regression problem, compared with currently existing generalized least-square approaches. Technically, ancestral trait reconstruction based on a molecular predictor is shown to be formally equivalent to a phylogenetic Kalman filter problem, for which backward and forward recursions are developed and implemented in the context of a Markov chain Monte Carlo sampler. The comparative regression method results in more accurate reconstructions and a more faithful representation of uncertainty, compared with simple linear regression. Application to the reconstruction of the evolution of optimal growth temperature in Archaea, using GC composition in ribosomal RNA stems and amino acid composition of a sample of protein-coding genes, confirms previous findings, in particular, pointing to a hyperthermophilic ancestor for the kingdom. The program is freely available at www.phylobayes.org.
Salvi, Daniele; Macali, Armando; Mariottini, Paolo
2014-01-01
The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematic of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on a concatenated (ML and BA) and coalescent based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassotreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663
Salvi, Daniele; Macali, Armando; Mariottini, Paolo
2014-01-01
The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematic of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on a concatenated (ML and BA) and coalescent based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.
Loh, Jin Phang; Gao, Qiu Han Christine; Lee, Vernon J; Tetteh, Kevin; Drakeley, Chris
2016-01-01
INTRODUCTION Although there have been several phylogenetic studies on Plasmodium knowlesi (P. knowlesi), only cytochrome c oxidase subunit 1 (COX1) gene analysis has shown some geographical differentiation between the isolates of different countries. METHODS Phylogenetic analysis of locally acquired P. knowlesi infections, based on circumsporozoite, small subunit ribosomal ribonucleic acid (SSU rRNA), merozoite surface protein 1 and COX1 gene targets, was performed. The results were compared with the published sequences of regional isolates from Malaysia and Thailand. RESULTS Phylogenetic analysis of the circumsporozoite, SSU rRNA and merozoite surface protein 1 gene sequences for regional P. knowlesi isolates showed no obvious differentiation that could be attributed to their geographical origin. However, COX1 gene analysis showed that it was possible to differentiate between Singapore-acquired P. knowlesi infections and P. knowlesi infections from Peninsular Malaysia and Sarawak, Borneo, Malaysia. CONCLUSION The ability to differentiate between locally acquired P. knowlesi infections and imported P. knowlesi infections has important utility for the monitoring of P. knowlesi malaria control programmes in Singapore. PMID:26805667
Whole genome sequencing data and de novo draft assemblies for 66 teleost species
Malmstrøm, Martin; Matschiner, Michael; Tørresen, Ole K.; Jakobsen, Kjetill S.; Jentoft, Sissel
2017-01-01
Teleost fishes comprise more than half of all vertebrate species, yet genomic data are only available for 0.2% of their diversity. Here, we present whole genome sequencing data for 66 new species of teleosts, vastly expanding the availability of genomic data for this important vertebrate group. We report on de novo assemblies based on low-coverage (9–39×) sequencing and present detailed methodology for all analyses. To facilitate further utilization of this data set, we present statistical analyses of the gene space completeness and verify the expected phylogenetic position of the sequenced genomes in a large mitogenomic context. We further present a nuclear marker set used for phylogenetic inference and evaluate each gene tree in relation to the species tree to test for homogeneity in the phylogenetic signal. Collectively, these analyses illustrate the robustness of this highly diverse data set and enable extensive reuse of the selected phylogenetic markers and the genomic data in general. This data set covers all major teleost lineages and provides unprecedented opportunities for comparative studies of teleosts. PMID:28094797
Loh, Jin Phang; Gao, Qiu Han Christine; Lee, Vernon J; Tetteh, Kevin; Drakeley, Chris
2016-12-01
Although there have been several phylogenetic studies on Plasmodium knowlesi (P. knowlesi), only cytochrome c oxidase subunit 1 (COX1) gene analysis has shown some geographical differentiation between the isolates of different countries. Phylogenetic analysis of locally acquired P. knowlesi infections, based on circumsporozoite, small subunit ribosomal ribonucleic acid (SSU rRNA), merozoite surface protein 1 and COX1 gene targets, was performed. The results were compared with the published sequences of regional isolates from Malaysia and Thailand. Phylogenetic analysis of the circumsporozoite, SSU rRNA and merozoite surface protein 1 gene sequences for regional P. knowlesi isolates showed no obvious differentiation that could be attributed to their geographical origin. However, COX1 gene analysis showed that it was possible to differentiate between Singapore-acquired P. knowlesi infections and P. knowlesi infections from Peninsular Malaysia and Sarawak, Borneo, Malaysia. The ability to differentiate between locally acquired P. knowlesi infections and imported P. knowlesi infections has important utility for the monitoring of P. knowlesi malaria control programmes in Singapore. Copyright: © Singapore Medical Association
Diversification of land plants: insights from a family-level phylogenetic analysis.
Fiz-Palacios, Omar; Schneider, Harald; Heinrichs, Jochen; Savolainen, Vincent
2011-11-21
Some of the evolutionary history of land plants has been documented based on the fossil record and a few broad-scale phylogenetic analyses, especially focusing on angiosperms and ferns. Here, we reconstructed phylogenetic relationships among all 706 families of land plants using molecular data. We dated the phylogeny using multiple fossils and a molecular clock technique. Applying various tests of diversification that take into account topology, branch length, numbers of extant species as well as extinction, we evaluated diversification rates through time. We also compared these diversification profiles against the distribution of the climate modes of the Phanerozoic. We found evidence for the radiations of ferns and mosses in the shadow of angiosperms coinciding with the rather warm Cretaceous global climate. In contrast, gymnosperms and liverworts show a signature of declining diversification rates during geological time periods of cool global climate. This broad-scale phylogenetic analysis helps to reveal the successive waves of diversification that made up the diversity of land plants we see today. Both warm temperatures and wet climate may have been necessary for the rise of the diversity under a successive lineage replacement scenario.
Phylogenetic community structure: temporal variation in fish assemblage
Santorelli, Sergio; Magnusson, William; Ferreira, Efrem; Caramaschi, Erica; Zuanon, Jansen; Amadio, Sidnéia
2014-01-01
Hypotheses about phylogenetic relationships among species allow inferences about the mechanisms that affect species coexistence. Nevertheless, most studies assume that phylogenetic patterns identified are stable over time. We used data on monthly samples of fish from a single lake over 10 years to show that the structure in phylogenetic assemblages varies over time and conclusions depend heavily on the time scale investigated. The data set was organized in guild structures and temporal scales (grouped at three temporal scales). Phylogenetic distance was measured as the mean pairwise distances (MPD) and as mean nearest-neighbor distance (MNTD). Both distances were based on counts of nodes. We compared the observed values of MPD and MNTD with values that were generated randomly using null model independent swap. A serial runs test was used to assess the temporal independence of indices over time. The phylogenetic pattern in the whole assemblage and the functional groups varied widely over time. Conclusions about phylogenetic clustering or dispersion depended on the temporal scales. Conclusions about the frequency with which biotic processes and environmental filters affect the local assembly do not depend only on taxonomic grouping and spatial scales. While these analyzes allow the assertion that all proposed patterns apply to the fish assemblages in the floodplain, the assessment of the relative importance of these processes, and how they vary depending on the temporal scale and functional group studied, cannot be determined with the effort commonly used. It appears that, at least in the system that we studied, the assemblages are forming and breaking continuously, resulting in various phylogeny-related structures that makes summarizing difficult. PMID:25360256
Ma, Yue; Tuskan, Gerald A.
2018-01-01
The existence of complete genome sequences makes it important to develop different approaches for classification of large-scale data sets and to make extraction of biological insights easier. Here, we propose an approach for classification of complete proteomes/protein sets based on protein distributions on some basic attributes. We demonstrate the usefulness of this approach by determining protein distributions in terms of two attributes: protein lengths and protein intrinsic disorder contents (ID). The protein distributions based on L and ID are surveyed for representative proteome organisms and protein sets from the three domains of life. The two-dimensional maps (designated as fingerprints here) from the protein distribution densities in the LD space defined by ln(L) and ID are then constructed. The fingerprints for different organisms and protein sets are found to be distinct with each other, and they can therefore be used for comparative studies. As a test case, phylogenetic trees have been constructed based on the protein distribution densities in the fingerprints of proteomes of organisms without performing any protein sequence comparison and alignments. The phylogenetic trees generated are biologically meaningful, demonstrating that the protein distributions in the LD space may serve as unique phylogenetic signals of the organisms at the proteome level. PMID:29686995
Mameaux, Sabine; Cockram, James; Thiel, Thomas; Steuernagel, Burkhard; Stein, Nils; Taudien, Stefan; Jack, Peter; Werner, Peter; Gray, John C; Greenland, Andy J; Powell, Wayne
2012-01-01
The genomes of cereals such as wheat (Triticum aestivum) and barley (Hordeum vulgare) are large and therefore problematic for the map-based cloning of agronomicaly important traits. However, comparative approaches within the Poaceae permit transfer of molecular knowledge between species, despite their divergence from a common ancestor sixty million years ago. The finding that null variants of the rice gene cytokinin oxidase/dehydrogenase 2 (OsCKX2) result in large yield increases provides an opportunity to explore whether similar gains could be achieved in other Poaceae members. Here, phylogenetic, molecular and comparative analyses of CKX families in the sequenced grass species rice, brachypodium, sorghum, maize and foxtail millet, as well as members identified from the transcriptomes/genomes of wheat and barley, are presented. Phylogenetic analyses define four Poaceae CKX clades. Comparative analyses showed that CKX phylogenetic groupings can largely be explained by a combination of local gene duplication, and the whole-genome duplication event that predates their speciation. Full-length OsCKX2 homologues in barley (HvCKX2.1, HvCKX2.2) and wheat (TaCKX2.3, TaCKX2.4, TaCKX2.5) are characterized, with comparative analysis at the DNA, protein and genetic/physical map levels suggesting that true CKX2 orthologs have been identified. Furthermore, our analysis shows CKX2 genes in barley and wheat have undergone a Triticeae-specific gene-duplication event. Finally, by identifying ten of the eleven CKX genes predicted to be present in barley by comparative analyses, we show that next-generation sequencing approaches can efficiently determine the gene space of large-genome crops. Together, this work provides the foundation for future functional investigation of CKX family members within the Poaceae. © 2011 National Institute of Agricultural Botany (NIAB). Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Harrison, Nigel; Dickinson, Matthew
2008-08-01
Phytoplasma phylogenetics has focused primarily on sequences of the non-coding 16S rRNA gene and the 16S-23S rRNA intergenic spacer region (16-23S ISR), and primers that enable amplification of these regions from all phytoplasmas by PCR are well established. In this study, primers based on the secA gene have been developed into a semi-nested PCR assay that results in a sequence of the expected size (about 480 bp) from all 34 phytoplasmas examined, including strains representative of 12 16Sr groups. Phylogenetic analysis of secA gene sequences showed similar clustering of phytoplasmas when compared with clusters resolved by similar sequence analyses of a 16-23S ISR-23S rRNA gene contig or of the 16S rRNA gene alone. The main differences between trees were in the branch lengths, which were elongated in the 16-23S ISR-23S rRNA gene tree when compared with the 16S rRNA gene tree and elongated still further in the secA gene tree, despite this being a shorter sequence. The improved resolution in the secA gene-derived phylogenetic tree resulted in the 16SrII group splitting into two distinct clusters, while phytoplasmas associated with coconut lethal yellowing-type diseases split into three distinct groups, thereby supporting past proposals that they represent different candidate species within 'Candidatus Phytoplasma'. The ability to differentiate 16Sr groups and subgroups by virtual RFLP analysis of secA gene sequences suggests that this gene may provide an informative alternative molecular marker for pathogen identification and diagnosis of phytoplasma diseases.
High-resolution phylogenetic microbial community profiling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Singer, Esther; Coleman-Derr, Devin; Bowman, Brett
2014-03-17
The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less
A molecular phylogeny of the stingless bee genus Melipona (Hymenoptera: Apidae).
Ramírez, Santiago R; Nieh, James C; Quental, Tiago B; Roubik, David W; Imperatriz-Fonseca, Vera L; Pierce, Naomi E
2010-08-01
Stingless bees (Meliponini) constitute a diverse group of highly eusocial insects that occur throughout tropical regions around the world. The meliponine genus Melipona is restricted to the New World tropics and has over 50 described species. Melipona, like Apis, possesses the remarkable ability to use representational communication to indicate the location of foraging patches. Although Melipona has been the subject of numerous behavioral, ecological, and genetic studies, the evolutionary history of this genus remains largely unexplored. Here, we implement a multigene phylogenetic approach based on nuclear, mitochondrial, and ribosomal loci, coupled with molecular clock methods, to elucidate the phylogenetic relationships and antiquity of subgenera and species of Melipona. Our phylogenetic analysis resolves the relationship among subgenera and tends to agree with morphology-based classification hypotheses. Our molecular clock analysis indicates that the genus Melipona shared a most recent common ancestor at least approximately 14-17 million years (My) ago. These results provide the groundwork for future comparative analyses aimed at understanding the evolution of complex communication mechanisms in eusocial Apidae. Copyright 2010 Elsevier Inc. All rights reserved.
Phylogeny and species traits predict bird detectability
Solymos, Peter; Matsuoka, Steven M.; Stralberg, Diana; Barker, Nicole K. S.; Bayne, Erin M.
2018-01-01
Avian acoustic communication has resulted from evolutionary pressures and ecological constraints. We therefore expect that auditory detectability in birds might be predictable by species traits and phylogenetic relatedness. We evaluated the relationship between phylogeny, species traits, and field‐based estimates of the two processes that determine species detectability (singing rate and detection distance) for 141 bird species breeding in boreal North America. We used phylogenetic mixed models and cross‐validation to compare the relative merits of using trait data only, phylogeny only, or the combination of both to predict detectability. We found a strong phylogenetic signal in both singing rates and detection distances; however the strength of phylogenetic effects was less than expected under Brownian motion evolution. The evolution of behavioural traits that determine singing rates was found to be more labile, leaving more room for species to evolve independently, whereas detection distance was mostly determined by anatomy (i.e. body size) and thus the laws of physics. Our findings can help in disentangling how complex ecological and evolutionary mechanisms have shaped different aspects of detectability in boreal birds. Such information can greatly inform single‐ and multi‐species models but more work is required to better understand how to best correct possible biases in phylogenetic diversity and other community metrics.
DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.
Kelly, Steven; Maini, Philip K
2013-01-01
The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.
De Cremer, Koen; Piérard, Denis; Hendrickx, Marijke
2016-01-01
Recently, the Fusarium genus has been narrowed based upon phylogenetic analyses and a Fusarium-like clade was adopted. The few species of the Fusarium-like clade were moved to new, re-installed or existing genera or provisionally retained as "Fusarium." Only a limited number of reference strains and DNA marker sequences are available for this clade and not much is known about its actual species diversity. Here, we report six strains, preserved by the Belgian fungal culture collection BCCM/IHEM as a Fusarium species, that belong to the Fusarium-like clade. They showed a slow growth and produced pionnotes, typical morphological characteristics of many Fusarium-like species. Multilocus sequencing with comparative sequence analyses in GenBank and phylogenetic analyses, using reference sequences of type material, confirmed that they were indeed member of the Fusarium-like clade. One strain was identified as "Fusarium" ciliatum whereas another strain was identified as Fusicolla merismoides. The four remaining strains were shown to represent a unique phylogenetic lineage in the Fusarium-like clade and were also found morphologically distinct from other members of the Fusarium-like clade. Based upon phylogenetic considerations, a new genus, Pseudofusicolla gen. nov., and a new species, Pseudofusicolla belgica sp. nov., were installed for this lineage. A formal description is provided in this study. Additional sampling will be required to gather isolates other than the historical strains presented in the present study as well as to further reveal the actual species diversity in the Fusarium-like clade. PMID:27790062
Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria
Gao, Beile
2012-01-01
Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973
Species divergence and phylogenetic variation of ecophysiological traits in lianas and trees.
Rios, Rodrigo S; Salgado-Luarte, Cristian; Gianoli, Ernesto
2014-01-01
The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [A(max)], dark respiration rate [R(d)], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that R(d) evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for R(d), while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. R(d) followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for A(max). R(d) may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades.
Species Divergence and Phylogenetic Variation of Ecophysiological Traits in Lianas and Trees
Rios, Rodrigo S.; Salgado-Luarte, Cristian; Gianoli, Ernesto
2014-01-01
The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [Amax], dark respiration rate [Rd], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that Rd evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for Rd, while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. Rd followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for Amax. Rd may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades. PMID:24914958
GENOME-WIDE COMPARATIVE ANALYSIS OF PHYLOGENETIC TREES: THE PROKARYOTIC FOREST OF LIFE
Puigbò, Pere; Wolf, Yuri I.; Koonin, Eugene V.
2013-01-01
Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance (SD) method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the applications methods used to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a ‘species tree’. PMID:22399455
Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life.
Puigbò, Pere; Wolf, Yuri I; Koonin, Eugene V
2012-01-01
Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."
Betancur-R, Ricardo; Ortí, Guillermo; Pyron, Robert Alexander
2015-05-01
The marine-freshwater boundary is a major biodiversity gradient and few groups have colonised both systems successfully. Fishes have transitioned between habitats repeatedly, diversifying in rivers, lakes and oceans over evolutionary time. However, their history of habitat colonisation and diversification is unclear based on available fossil and phylogenetic data. We estimate ancestral habitats and diversification and transition rates using a large-scale phylogeny of extant fish taxa and one containing a massive number of extinct species. Extant-only phylogenetic analyses indicate freshwater ancestry, but inclusion of fossils reveal strong evidence of marine ancestry in lineages now restricted to freshwaters. Diversification and colonisation dynamics vary asymmetrically between habitats, as marine lineages colonise and flourish in rivers more frequently than the reverse. Our study highlights the importance of including fossils in comparative analyses, showing that freshwaters have played a role as refuges for ancient fish lineages, a signal erased by extinction in extant-only phylogenies. © 2015 John Wiley & Sons Ltd/CNRS.
Lüssen, Arne; Falk, Thomas M; Villwock, Wolfgang
2003-10-01
Patterns of molecular genetic differentiation among taxa of the "agassii species complex" (Parenti, 1984) were analysed based on partial mtDNA control region sequences. Special attention has been paid to Chilean populations of Orestias agassii and species from isolated lakes of northern Chile, e.g., O. agassii, Orestias chungarensis, Orestias parinacotensis, Orestias laucaensis, and Orestias ascotanensis. Orestias tschudii, Orestias luteus, and Orestias ispi were analysed comparatively. Our findings support the utility of mtDNA control region sequences for phylogenetic studies within the "agassii species complex" and confirmed the monophyly of this particular lineage, excluding O. luteus. However, the monophyly of further morphologically defined lineages within the "agassii complex" appears doubtful. No support was found for the utility of these data sets for inferring phylogenetic relationships between more distantly related taxa originating from Lake Titicaca.
Macrini, Thomas E; Flynn, John J; Ni, Xijun; Croft, Darin A; Wyss, André R
2013-01-01
The phylogenetic relationships of notoungulates, an extinct group of predominantly South American herbivores, remain poorly resolved with respect to both other placental mammals and among one another. Most previous phylogenetic analyses of notoungulates have not included characters of the internal cranium, not least because few such features, including the bony labyrinth, have been described for members of the group. Here we describe the inner ears of the notoungulates Altitypotherium chucalensis (Mesotheriidae), Pachyrukhos moyani (Hegetotheriidae) and Cochilius sp. (Interatheriidae) based on reconstructions of bony labyrinths obtained from computed tomography imagery. Comparisons of the bony labyrinths of these taxa with the basally diverging notoungulate Notostylops murinus (Notostylopidae), an isolated petrosal from Itaboraí, Brazil, referred to Notoungulata, and six therian outgroups, yielded an inner ear character matrix of 25 potentially phylogenetically informative characters, 14 of them novel to this study. Two equivocally optimized character states potentially support a pairing of Mesotheriidae and Hegetotheriidae, whereas four others may be diagnostic of Notoungulata. Three additional characters are potentially informative for diagnosing more inclusive clades: one for crown Placentalia; another for a clade containing Kulbeckia, Zalambdalestes, and Placentalia; and a third for Eutheria (crown Placentalia plus stem taxa). Several other characters are apomorphic for at least one notoungulate in our study and are of potential interest for broader taxonomic sampling within Notoungulata to clarify currently enigmatic interrelationships. Measures of the semicircular canals were used to infer agility (e.g. capable of quick movements vs. lethargic movements) of these taxa. Agility scores calculated from these data generally corroborate interpretations based on postcranial remains of these or closely related species. We provide estimates of the low-frequency hearing limits in notoungulates based on the ratio of radii of the apical and basal turns of the cochlea. These limits range from 15 Hz in Notostylops to 149 Hz in Pachyrukhos, values comparable to the Asian elephant (Elephas maximus) and the California sea lion (Zalophus californianus) when hearing in air, respectively. PMID:24102069
Transforming phylogenetic networks: Moving beyond tree space.
Huber, Katharina T; Moulton, Vincent; Wu, Taoyang
2016-09-07
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.
Opposing Patterns of Seasonal Change in Functional and Phylogenetic Diversity of Tadpole Assemblages
Strauß, Axel; Guilhaumon, François; Randrianiaina, Roger Daniel; Wollenberg Valero, Katharina C.; Vences, Miguel; Glos, Julian
2016-01-01
Assemblages that are exposed to recurring temporal environmental changes can show changes in their ecological properties. These can be expressed by differences in diversity and assembly rules. Both can be identified using two measures of diversity: functional (FD) and phylogenetic diversity (PD). Frog communities are understudied in this regard, especially during the tadpole life stage. We utilised tadpole assemblages from Madagascan rainforest streams to test predictions of seasonal changes on diversity and assemblage composition and on diversity measures. From the warm-wet to the cool-dry season, species richness (SR) of tadpole assemblages decreased. Also FD and PD decreased, but FD less and PD more than expected by chance. During the dry season, tadpole assemblages were characterised by functional redundancy (among assemblages—with increasing SR), high FD (compared to a null model), and low PD (phylogenetic clustering; compared to a null model). Although mutually contradictory at first glance, these results indicate competition as tadpole community assembly driving force. This is true during the limiting cool-dry season but not during the more suitable warm-wet season. We thereby show that assembly rules can strongly depend on season, that comparing FD and PD can reveal such forces, that FD and PD are not interchangeable, and that conclusions on assembly rules based on FD alone are critical. PMID:27014867
A comparative perspective on longevity: the effect of body size dominates over ecology in moths.
Holm, S; Davis, R B; Javoiš, J; Õunap, E; Kaasik, A; Molleman, F; Tammaru, T
2016-12-01
Both physiologically and ecologically based explanations have been proposed to account for among-species differences in lifespan, but they remain poorly tested. Phylogenetically explicit comparative analyses are still scarce and those that exist are biased towards homoeothermic vertebrates. Insect studies can significantly contribute as lifespan can feasibly be measured in a high number of species, and the selective forces that have shaped it may differ largely between species and from those acting on larger animals. We recorded adult lifespan in 98 species of geometrid moths. Phylogenetic comparative analyses were applied to study variation in species-specific values of lifespan and to reveal its ecological and life-history correlates. Among-species and between-gender differences in lifespan were found to be notably limited; there was also no evidence of phylogenetic signal in this trait. Larger moth species were found to live longer, with this result supporting a physiological rather than ecological explanation of this relationship. Species-specific lifespan values could not be explained by traits such as reproductive season and larval diet breadth, strengthening the evidence for the dominance of physiological determinants of longevity over ecological ones. © 2016 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.
A Format for Phylogenetic Placements
Matsen, Frederick A.; Hoffman, Noah G.; Gallagher, Aaron; Stamatakis, Alexandros
2012-01-01
We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988
A format for phylogenetic placements.
Matsen, Frederick A; Hoffman, Noah G; Gallagher, Aaron; Stamatakis, Alexandros
2012-01-01
We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.
Misra, Gopal; Gupta, Sarika; Subramanian, Alagesan; Parida, Swarup Kumar; Chattopadhyay, Debasis; Prasad, Manoj
2013-01-01
Foxtail millet ( Setaria italica L.) is a tractable experimental model crop for studying functional genomics of millets and bioenergy grasses. But the limited availability of genomic resources, particularly expressed sequence-based genic markers is significantly impeding its genetic improvement. Considering this, we attempted to develop EST-derived-SSR (eSSR) markers and utilize them in germplasm characterization, cross-genera transferability and in silico comparative mapping. From 66,027 foxtail millet EST sequences 24,828 non-redundant ESTs were deduced, representing ~16 Mb, which revealed 534 (~2%) eSSRs in 495 SSR containing ESTs at a frequency of 1/30 kb. A total of 447 pp were successfully designed, of which 327 were mapped physically onto nine chromosomes. About 106 selected primer pairs representing the foxtail millet genome showed high-level of cross-genera amplification at an average of ~88% in eight millets and four non-millet species. Broad range of genetic diversity (0.02–0.65) obtained in constructed phylogenetic tree using 40 eSSR markers demonstrated its utility in germplasm characterizations and phylogenetics. Comparative mapping of physically mapped eSSR markers showed considerable proportion of sequence-based orthology and syntenic relationship between foxtail millet chromosomes and sorghum (~68%), maize (~61%) and rice (~42%) chromosomes. Synteny analysis of eSSRs of foxtail millet, rice, maize and sorghum suggested the nested chromosome fusion frequently observed in grass genomes. Thus, for the first time we had generated large-scale eSSR markers in foxtail millet and demonstrated their utility in germplasm characterization, transferability, phylogenetics and comparative mapping studies in millets and bioenergy grass species. PMID:23805325
Kumari, Kajal; Muthamilarasan, Mehanathan; Misra, Gopal; Gupta, Sarika; Subramanian, Alagesan; Parida, Swarup Kumar; Chattopadhyay, Debasis; Prasad, Manoj
2013-01-01
Foxtail millet (Setariaitalica L.) is a tractable experimental model crop for studying functional genomics of millets and bioenergy grasses. But the limited availability of genomic resources, particularly expressed sequence-based genic markers is significantly impeding its genetic improvement. Considering this, we attempted to develop EST-derived-SSR (eSSR) markers and utilize them in germplasm characterization, cross-genera transferability and in silico comparative mapping. From 66,027 foxtail millet EST sequences 24,828 non-redundant ESTs were deduced, representing ~16 Mb, which revealed 534 (~2%) eSSRs in 495 SSR containing ESTs at a frequency of 1/30 kb. A total of 447 pp were successfully designed, of which 327 were mapped physically onto nine chromosomes. About 106 selected primer pairs representing the foxtail millet genome showed high-level of cross-genera amplification at an average of ~88% in eight millets and four non-millet species. Broad range of genetic diversity (0.02-0.65) obtained in constructed phylogenetic tree using 40 eSSR markers demonstrated its utility in germplasm characterizations and phylogenetics. Comparative mapping of physically mapped eSSR markers showed considerable proportion of sequence-based orthology and syntenic relationship between foxtail millet chromosomes and sorghum (~68%), maize (~61%) and rice (~42%) chromosomes. Synteny analysis of eSSRs of foxtail millet, rice, maize and sorghum suggested the nested chromosome fusion frequently observed in grass genomes. Thus, for the first time we had generated large-scale eSSR markers in foxtail millet and demonstrated their utility in germplasm characterization, transferability, phylogenetics and comparative mapping studies in millets and bioenergy grass species.
Meyers-Muñoz, María Angélica; van der Velde, Gerard; van der Meij, Sancia E.T.; Stoffels, Bart E.M.W.; van Alen, Theo; Tuti, Yosephine; Hoeksema, Bert W.
2016-01-01
Abstract Plakobranchus papua Meyers-Muñoz & van der Velde, sp. n. from West Papua (Papua Barat province, Indonesia), is described based on its external morphology, colour pattern, internal anatomy, radula and reproductive system. In a molecular phylogenetic study specimens of this new species were compared with those of ten candidate taxa under the name Plakobranchus ocellatus van Hasselt, 1824. DNA analyses of COI mtDNA showed a clear distinction between Plakobranchus papua sp. n. and “Plakobranchus ocellatus”. Plakobranchus papua, sp. n. also differed from all taxa that have been synonymised with Plakobranchus ocellatus. The genus is in dire need of taxonomic revision, preferably based on an integrative analysis involving morphology and DNA of all known Plakobranchus varieties. PMID:27408559
Phylogenetic incongruence in the Drosophila melanogaster species group
Wong, Alex; Jensen, Jeffrey D.; Pool, John E.; Aquadro, Charles F.
2007-01-01
Drosophila melanogaster and its close relatives are used extensively in comparative biology. Despite the importance of phylogenetic information for such studies, relationships between some melanogaster species group members are unclear due to conflicting phylogenetic signals at different loci. In this study, we use twelve nuclear loci (eleven coding and one non-coding) to assess the degree of phylogenetic incongruence in this model system. We focus on two nodes: (1) The node joining the D. erecta-D. orena, D. melanogaster-D. simulans, and D. yakuba-D. teissieri lineages, and (2) The node joining the lineages leading to the melanogaster, takahashii, and eugracilis subgroups. We find limited evidence for incongruence at the first node; our data, as well as those of several previous studies, strongly support monophyly of a clade consisting of D. erecta-D. orena and D. yakuba-D. teissieri. By contrast, using likelihood based tests of congruence, we find robust evidence for topological incongruence at the second node. Different loci support different relationships among the melanogaster, takahashii and eugracilis subgroups, and the observed incongruence is not easily attributable to homoplasy, non-equilibrium base composition, or positive selection on a subset of loci. We argue that lineage sorting in the common ancestor of these three subgroups is the most plausible explanation for our observations. Such lineage sorting may lead to biased estimation of tree topology and evolutionary rates, and may confound inferences of positive selection. PMID:17071113
2011-01-01
Background Efforts to solve higher-level evolutionary relationships within the class Insecta by using mitochondrial genomic data are hindered due to fast sequence evolution of several groups, most notably Hymenoptera, Strepsiptera, Phthiraptera, Hemiptera and Thysanoptera. Accelerated rates of substitution on their sequences have been shown to have negative consequences in phylogenetic inference. In this study, we tested several methodological approaches to recover phylogenetic signal from whole mitochondrial genomes. As a model, we used two classical problems in insect phylogenetics: The relationships within Paraneoptera and within Holometabola. Moreover, we assessed the mitochondrial phylogenetic signal limits in the deeper Eumetabola dataset, and we studied the contribution of individual genes. Results Long-branch attraction (LBA) artefacts were detected in all the datasets. Methods using Bayesian inference outperformed maximum likelihood approaches, and LBA was avoided in Paraneoptera and Holometabola when using protein sequences and the site-heterogeneous mixture model CAT. The better performance of this method was evidenced by resulting topologies matching generally accepted hypotheses based on nuclear and/or morphological data, and was confirmed by cross-validation and simulation analyses. Using the CAT model, the order Strepsiptera was recovered as sister to Coleoptera for the first time using mitochondrial sequences, in agreement with recent results based on large nuclear and morphological datasets. Also the Hymenoptera-Mecopterida association was obtained, leaving Coleoptera and Strepsiptera as the basal groups of the holometabolan insects, which coincides with one of the two main competing hypotheses. For the Paraneroptera, the currently accepted non-monophyly of Homoptera was documented as a phylogenetic novelty for mitochondrial data. However, results were not satisfactory when exploring the entire Eumetabola, revealing the limits of the phylogenetic signal that can be extracted from Insecta mitogenomes. Based on the combined use of the five best topology-performing genes we obtained comparable results to whole mitogenomes, highlighting the important role of data quality. Conclusion We show for the first time that mitogenomic data agrees with nuclear and morphological data for several of the most controversial insect evolutionary relationships, adding a new independent source of evidence to study relationships among insect orders. We propose that deeper divergences cannot be inferred with the current available methods due to sequence saturation and compositional bias inconsistencies. Our exploratory analysis indicates that the CAT model is the best dealing with LBA and it could be useful for other groups and datasets with similar phylogenetic difficulties. PMID:22032248
Sumner, Jeremy G; Taylor, Amelia; Holland, Barbara R; Jarvis, Peter D
2017-12-01
Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transformation properties (in the case of Markov invariants). While both approaches have been valued for their intrinsic mathematical interest, it is not clear how they relate to each other, and to what extent they can be used as practical tools for inference of phylogenetic trees. In this paper, by focusing on the special case of binary sequence data and quartets of taxa, we are able to view these two different polynomial-based approaches within a common framework. To motivate the discussion, we present three desirable statistical properties that we argue any invariant-based phylogenetic method should satisfy: (1) sensible behaviour under reordering of input sequences; (2) stability as the taxa evolve independently according to a Markov process; and (3) explicit dependence on the assumption of a continuous-time process. Motivated by these statistical properties, we develop and explore several new phylogenetic inference methods. In particular, we develop a statistically bias-corrected version of the Markov invariants approach which satisfies all three properties. We also extend previous work by showing that the phylogenetic invariants can be implemented in such a way as to satisfy property (3). A simulation study shows that, in comparison to other methods, our new proposed approach based on bias-corrected Markov invariants is extremely powerful for phylogenetic inference. The binary case is of particular theoretical interest as-in this case only-the Markov invariants can be expressed as linear combinations of the phylogenetic invariants. A wider implication of this is that, for models with more than two states-for example DNA sequence alignments with four-state models-we find that methods which rely on phylogenetic invariants are incapable of satisfying all three of the stated statistical properties. This is because in these cases the relevant Markov invariants belong to a class of polynomials independent from the phylogenetic invariants.
Fu, Chao-Nan; Li, Hong-Tao; Milne, Richard; Zhang, Ting; Ma, Peng-Fei; Yang, Jing; Li, De-Zhu; Gao, Lian-Ming
2017-12-08
The Cornales is the basal lineage of the asterids, the largest angiosperm clade. Phylogenetic relationships within the order were previously not fully resolved. Fifteen plastid genomes representing 14 species, ten genera and seven families of Cornales were newly sequenced for comparative analyses of genome features, evolution, and phylogenomics based on different partitioning schemes and filtering strategies. All plastomes of the 14 Cornales species had the typical quadripartite structure with a genome size ranging from 156,567 bp to 158,715 bp, which included two inverted repeats (25,859-26,451 bp) separated by a large single-copy region (86,089-87,835 bp) and a small single-copy region (18,250-18,856 bp) region. These plastomes encoded the same set of 114 unique genes including 31 transfer RNA, 4 ribosomal RNA and 79 coding genes, with an identical gene order across all examined Cornales species. Two genes (rpl22 and ycf15) contained premature stop codons in seven and five species respectively. The phylogenetic relationships among all sampled species were fully resolved with maximum support. Different filtering strategies (none, light and strict) of sequence alignment did not have an effect on these relationships. The topology recovered from coding and noncoding data sets was the same as for the whole plastome, regardless of filtering strategy. Moreover, mutational hotspots and highly informative regions were identified. Phylogenetic relationships among families and intergeneric relationships within family of Cornales were well resolved. Different filtering strategies and partitioning schemes do not influence the relationships. Plastid genomes have great potential to resolve deep phylogenetic relationships of plants.
USDA-ARS?s Scientific Manuscript database
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence d...
A program to compute the soft Robinson-Foulds distance between phylogenetic networks.
Lu, Bingxin; Zhang, Louxin; Leong, Hon Wai
2017-03-14
Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both the problems are NP-complete. A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson-Foulds distance between phylogenetic networks. Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, the distribution of the Soft Robinson-Foulds distance between phylogenetic networks is demonstrated to be unlikely normal by our simulation data.
Phylogenetic diversity measures based on Hill numbers.
Chao, Anne; Chiu, Chun-Huo; Jost, Lou
2010-11-27
We propose a parametric class of phylogenetic diversity (PD) measures that are sensitive to both species abundance and species taxonomic or phylogenetic distances. This work extends the conventional parametric species-neutral approach (based on 'effective number of species' or Hill numbers) to take into account species relatedness, and also generalizes the traditional phylogenetic approach (based on 'total phylogenetic length') to incorporate species abundances. The proposed measure quantifies 'the mean effective number of species' over any time interval of interest, or the 'effective number of maximally distinct lineages' over that time interval. The product of the measure and the interval length quantifies the 'branch diversity' of the phylogenetic tree during that interval. The new measures generalize and unify many existing measures and lead to a natural definition of taxonomic diversity as a special case. The replication principle (or doubling property), an important requirement for species-neutral diversity, is generalized to PD. The widely used Rao's quadratic entropy and the phylogenetic entropy do not satisfy this essential property, but a simple transformation converts each to our measures, which do satisfy the property. The proposed approach is applied to forest data for interpreting the effects of thinning.
2010-01-01
Background Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. Results This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Conclusions Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service. PMID:21034504
Forti, Lucas Rodriguez; Lingnau, Rodrigo; Encarnação, Lais Carvalho; Bertoluci, Jaime; Toledo, Luís Felipe
2017-01-01
Phenotypic traits, such as the frog advertisement call, are generally correlated with interspecific genetic variation, and, as a consequence of strong sexual selection, these behaviors may carry a phylogenetic signal. However, variation in acoustic traits is not always correlated with genetic differences between populations (intraspecific variation); phenotypic plasticity and environmental variables may explain part of such variation. For example, local processes can affect acoustic properties in different lineages due to differences in physical structure, climatic conditions, and biotic interactions, particularly when populations are isolated. However, acoustic traits can be used to test phylogenetic hypotheses. We analyzed the advertisement calls of Dendropsophus elegans males from 18 sites and compared them with those of four closely related congeneric species, in order to test for differences between inter and intraspecific variation. We analyzed 451 calls of 45 males of these five species. Because males from distant sites were grouped together without population congruence, differences found in advertisement calls among individuals were not correlated with phylogeographical clades. Phylogenetic and cluster analyses of the D. elegans clades and those of closely related species grouped all five species into the same topology, as reported by previous molecular and morphological phylogenies. However, the topology of the D. elegans phylogeographical clades did not match the topology previously reported. Acoustic communication in D. elegans seems to be conserved among populations, and the phylogeographical history of the species does not explain the variation among lineages in call properties, despite some congruent phylogenetic signals evident at the species level. Based on molecular clocks retrieved from the literature, it seems that more than 6.5 million years of divergence (late Miocene) are necessary to allow significant changes to occur in the acoustic properties of these treefrog calls, making it possible to recover their phylogenetic history only based on acoustic evidence. PMID:28235089
Modeling adaptive kernels from probabilistic phylogenetic trees.
Nicotra, Luca; Micheli, Alessio
2009-01-01
Modeling phylogenetic interactions is an open issue in many computational biology problems. In the context of gene function prediction we introduce a class of kernels for structured data leveraging on a hierarchical probabilistic modeling of phylogeny among species. We derive three kernels belonging to this setting: a sufficient statistics kernel, a Fisher kernel, and a probability product kernel. The new kernels are used in the context of support vector machine learning. The kernels adaptivity is obtained through the estimation of the parameters of a tree structured model of evolution using as observed data phylogenetic profiles encoding the presence or absence of specific genes in a set of fully sequenced genomes. We report results obtained in the prediction of the functional class of the proteins of the budding yeast Saccharomyces cerevisae which favorably compare to a standard vector based kernel and to a non-adaptive tree kernel function. A further comparative analysis is performed in order to assess the impact of the different components of the proposed approach. We show that the key features of the proposed kernels are the adaptivity to the input domain and the ability to deal with structured data interpreted through a graphical model representation.
Marko Gomez-Hernandez; Guadalupe Williams-Linera; D. Jean Lodge; Roger Guevara; Eduardo Ruiz-Sanchez; Etelvina Gandara
2016-01-01
Phylogenetic information provides insight into the ecological and evolutionary processes that organize species assemblages. We compared patterns of phylogenetic diversity among macromycete and woody plant communities along a steep elevational gradient in eastern Mexico to better understand the evolutionary processes that structure their communities. Macrofungi and...
A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.
Takahashi, Tetsumi; Moreno, Edmundo
2015-12-01
The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group were diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.
Comparing Ontogenetic and Phylogenetic Stages of Human Development
ERIC Educational Resources Information Center
Clarken, Rodney H.
2005-01-01
This paper will present evidence to support ontogenetic and phylogenetic parallels and draw from these comparisons to further illuminate our understanding of micro and macro human development. Individual and collective stages of physical, psychological and spiritual development will be compared and their homologous structures examined.…
2017-01-01
North America’s Great Basin has long been of interest to biologists due to its high level of organismal endemicity throughout its endorheic watersheds. One example of such a group is the subfamily Empetricthyinae. In this paper, we analyzed the relationships of the Empetrichtyinae and assessed the validity of the subspecies designations given by Williams and Wilde within the group using concatenated phylogenetic tree estimation and species tree estimation. Samples from 19 populations were included covering the entire distribution of the three extant species of Empetricthyinae–Crenichthys nevadae, Crenichthys baileyi and Empetricthys latos. Three nuclear introns (S8 intron 4, S7 intron 1, and P0 intron 1) and one mitochondrial gene (Cytb) were sequenced for phylogenetic analysis. Using these sequences, we generated two separate hypotheses of the evolutionary relationships of Empetrichtyinae- one based on the mitochondrial data and one based on the nuclear data using Bayesian phylogenetics. Haplotype networks were also generated to look at the relationships of the populations within Empetrichthyinae. After comparing the two phylogenetic hypotheses, species trees were generated using *BEAST with the nuclear data to further test the validity of the subspecies within Empetrichthyinae. The mitochondrial analyses supported four lineages within C. baileyi and 2 within C. nevadae. The concatenated nuclear tree was more conserved, supporting one clade and an unresolved polytomy in both species. The species tree analysis supported the presence of two species within both C. baileyi and C. nevadae. Based on the results of these analyses, the subspecies designations of Williams and Wilde are not valid, rather a conservative approach suggests there are two species within C. nevadae and two species within C. baileyi. No structure was found for E. latos or the populations of Empetricthyinae. This study represents one of many demonstrating the invalidity of subspecies and their detriment to species identification, conservation, and understanding. PMID:29077708
Bulgarella, Mariana; Heimpel, George E
2015-09-01
Parasite host range can be influenced by physiological, behavioral, and ecological factors. Combining data sets on host-parasite associations with phylogenetic information of the hosts and the parasites involved can generate evolutionary hypotheses about the selective forces shaping host range. Here, we analyzed associations between the nest-parasitic flies in the genus Philornis and their host birds on Trinidad. Four of ten Philornis species were only reared from one species of bird. Of the parasite species with more than one host bird species, P. falsificus was the least specific and P. deceptivus the most specific attacking only Passeriformes. Philornis flies in Trinidad thus include both specialists and generalists, with varying degrees of specificity within the generalists. We used three quantities to more formally compare the host range of Philornis flies: the number of bird species attacked by each species of Philornis, a phylogenetically informed host specificity index (Poulin and Mouillot's S TD), and a branch length-based S TD. We then assessed the phylogenetic signal of these measures of host range for 29 bird species. None of these measures showed significant phylogenetic signal, suggesting that clades of Philornis did not differ significantly in their ability to exploit hosts. We also calculated two quantities of parasite species load for the birds - the parasite species richness, and a variant of the S TD index based on nodes rather than on taxonomic levels - and assessed the signal of these measures on the bird phylogeny. We did not find significant phylogenetic signal for the parasite species load or the node-based S TD index. Finally, we calculated the parasite associations for all bird pairs using the Jaccard index and regressed these similarity values against the number of nodes in the phylogeny separating bird pairs. This analysis showed that Philornis on Trinidad tend to feed on closely related bird species more often than expected by chance.
Guo, Hao-Bo; Ma, Yue; Tuskan, Gerald A.; ...
2018-01-01
The existence of complete genome sequences makes it important to develop different approaches for classification of large-scale data sets and to make extraction of biological insights easier. Here, we propose an approach for classification of complete proteomes/protein sets based on protein distributions on some basic attributes. We demonstrate the usefulness of this approach by determining protein distributions in terms of two attributes: protein lengths and protein intrinsic disorder contents (ID). The protein distributions based on L and ID are surveyed for representative proteome organisms and protein sets from the three domains of life. The two-dimensional maps (designated as fingerprints here)more » from the protein distribution densities in the LD space defined by ln( L ) and ID are then constructed. The fingerprints for different organisms and protein sets are found to be distinct with each other, and they can therefore be used for comparative studies. As a test case, phylogenetic trees have been constructed based on the protein distribution densities in the fingerprints of proteomes of organisms without performing any protein sequence comparison and alignments. The phylogenetic trees generated are biologically meaningful, demonstrating that the protein distributions in the LD space may serve as unique phylogenetic signals of the organisms at the proteome level.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guo, Hao-Bo; Ma, Yue; Tuskan, Gerald A.
The existence of complete genome sequences makes it important to develop different approaches for classification of large-scale data sets and to make extraction of biological insights easier. Here, we propose an approach for classification of complete proteomes/protein sets based on protein distributions on some basic attributes. We demonstrate the usefulness of this approach by determining protein distributions in terms of two attributes: protein lengths and protein intrinsic disorder contents (ID). The protein distributions based on L and ID are surveyed for representative proteome organisms and protein sets from the three domains of life. The two-dimensional maps (designated as fingerprints here)more » from the protein distribution densities in the LD space defined by ln( L ) and ID are then constructed. The fingerprints for different organisms and protein sets are found to be distinct with each other, and they can therefore be used for comparative studies. As a test case, phylogenetic trees have been constructed based on the protein distribution densities in the fingerprints of proteomes of organisms without performing any protein sequence comparison and alignments. The phylogenetic trees generated are biologically meaningful, demonstrating that the protein distributions in the LD space may serve as unique phylogenetic signals of the organisms at the proteome level.« less
Complete mitogenome of Asiatic lion resolves phylogenetic status within Panthera.
Bagatharia, Snehal B; Joshi, Madhvi N; Pandya, Rohan V; Pandit, Aanal S; Patel, Riddhi P; Desai, Shivangi M; Sharma, Anu; Panchal, Omkar; Jasmani, Falguni P; Saxena, Akshay K
2013-08-23
The origin, evolution and speciation of the lion, has been subject of interest, debate and study. The present surviving lions of the genus Panthera comprise of eight sub-species inclusive of Asiatic lion Panthera leo persica of India's Gir forest. Except for the Asiatic lion, the other seven subspecies are found in different parts of Africa. There have been different opinions regarding the phylogenetic status of Panthera leo, as well as classifying lions of different geographic regions into subspecies and races. In the present study, mitogenome sequence of P. leo persica deduced, using Ion Torrent PGM to assess phylogeny and evolution which may play an increasingly important role in conservation biology. The mtDNA sequence of P. leo persica is 17,057 bp in length with 40.8% GC content. Annotation of mitogenome revealed total 37 genes, including 13 protein coding, 2 rRNA and 22 tRNA. Phylogenetic analysis based on whole mitogenome, suggests Panthera pardus as a neighbouring species to P. leo with species divergence at ~2.96 mya. This work presents first report on complete mitogenome of Panthera leo persica. It sheds light on the phylogenetic and evolutionary status within and across Felidae members. The result compared and evaluated with earlier reports of Felidae shows alteration of phylogenetic status and species evolution. This study may provide information on genetic diversity and population stability.
Complete mitogenome of asiatic lion resolves phylogenetic status within Panthera
2013-01-01
Background The origin, evolution and speciation of the lion, has been subject of interest, debate and study. The present surviving lions of the genus Panthera comprise of eight sub-species inclusive of Asiatic lion Panthera leo persica of India's Gir forest. Except for the Asiatic lion, the other seven subspecies are found in different parts of Africa. There have been different opinions regarding the phylogenetic status of Panthera leo, as well as classifying lions of different geographic regions into subspecies and races. In the present study, mitogenome sequence of P. leo persica deduced, using Ion Torrent PGM to assess phylogeny and evolution which may play an increasingly important role in conservation biology. Results The mtDNA sequence of P. leo persica is 17,057 bp in length with 40.8% GC content. Annotation of mitogenome revealed total 37 genes, including 13 protein coding, 2 rRNA and 22 tRNA. Phylogenetic analysis based on whole mitogenome, suggests Panthera pardus as a neighbouring species to P. leo with species divergence at ~2.96 mya. Conclusion This work presents first report on complete mitogenome of Panthera leo persica. It sheds light on the phylogenetic and evolutionary status within and across Felidae members. The result compared and evaluated with earlier reports of Felidae shows alteration of phylogenetic status and species evolution. This study may provide information on genetic diversity and population stability. PMID:23968279
Diversification of land plants: insights from a family-level phylogenetic analysis
2011-01-01
Background Some of the evolutionary history of land plants has been documented based on the fossil record and a few broad-scale phylogenetic analyses, especially focusing on angiosperms and ferns. Here, we reconstructed phylogenetic relationships among all 706 families of land plants using molecular data. We dated the phylogeny using multiple fossils and a molecular clock technique. Applying various tests of diversification that take into account topology, branch length, numbers of extant species as well as extinction, we evaluated diversification rates through time. We also compared these diversification profiles against the distribution of the climate modes of the Phanerozoic. Results We found evidence for the radiations of ferns and mosses in the shadow of angiosperms coinciding with the rather warm Cretaceous global climate. In contrast, gymnosperms and liverworts show a signature of declining diversification rates during geological time periods of cool global climate. Conclusions This broad-scale phylogenetic analysis helps to reveal the successive waves of diversification that made up the diversity of land plants we see today. Both warm temperatures and wet climate may have been necessary for the rise of the diversity under a successive lineage replacement scenario. PMID:22103931
New substitution models for rooting phylogenetic trees.
Williams, Tom A; Heaps, Sarah E; Cherlin, Svetlana; Nye, Tom M W; Boys, Richard J; Embley, T Martin
2015-09-26
The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made. © 2015 The Authors.
A congruent phylogenomic signal places eukaryotes within the Archaea.
Williams, Tom A; Foster, Peter G; Nye, Tom M W; Cox, Cymon J; Embley, T Martin
2012-12-22
Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook 'three domains' tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation.
Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A
2018-05-01
Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.
Xu, Weijia; Ozer, Stuart; Gutell, Robin R
2009-01-01
With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.
Xu, Weijia; Ozer, Stuart; Gutell, Robin R.
2010-01-01
With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure. PMID:20502534
Igea, Javier; Bogarín, Diego; Papadopulos, Alexander S T; Savolainen, Vincent
2015-02-01
Speciation on islands, and particularly the divergence of species in situ, has long been debated. Here, we present one of the first, complete assessments of the geographic modes of speciation for the flora of a small oceanic island. Cocos Island (Costa Rica) is pristine; it is located 550 km off the Pacific coast of Central America. It harbors 189 native plant species, 33 of which are endemic. Using phylogenetic data from insular and mainland congeneric species, we show that all of the endemic species are derived from independent colonization events rather than in situ speciation. This is in sharp contrast to the results of a study carried out in a comparable system, Lord Howe Island (Australia), where as much as 8.2% of the plant species were the product of sympatric speciation. Differences in physiography and age between the islands may be responsible for the contrasting patterns of speciation observed. Importantly, comparing phylogenetic assessments of the modes of speciation with taxonomy-based measures shows that widely used island biogeography approaches overestimate rates of in situ speciation. © 2014 The Author(s). Evolution © 2014 The Society for the Study of Evolution.
Effects of Phylogenetic Tree Style on Student Comprehension
NASA Astrophysics Data System (ADS)
Dees, Jonathan Andrew
Phylogenetic trees are powerful tools of evolutionary biology that have become prominent across the life sciences. Consequently, learning to interpret and reason from phylogenetic trees is now an essential component of biology education. However, students often struggle to understand these diagrams, even after explicit instruction. One factor that has been observed to affect student understanding of phylogenetic trees is style (i.e., diagonal or bracket). The goal of this dissertation research was to systematically explore effects of style on student interpretations and construction of phylogenetic trees in the context of an introductory biology course. Before instruction, students were significantly more accurate with bracket phylogenetic trees for a variety of interpretation and construction tasks. Explicit instruction that balanced the use of diagonal and bracket phylogenetic trees mitigated some, but not all, style effects. After instruction, students were significantly more accurate for interpretation tasks involving taxa relatedness and construction exercises when using the bracket style. Based on this dissertation research and prior studies on style effects, I advocate for introductory biology instructors to use only the bracket style. Future research should examine causes of style effects and variables other than style to inform the development of research-based instruction that best supports student understanding of phylogenetic trees.
The origin and evolution of Basigin(BSG) gene: A comparative genomic and phylogenetic analysis.
Zhu, Xinyan; Wang, Shenglan; Shao, Mingjie; Yan, Jie; Liu, Fei
2017-07-01
Basigin (BSG), also known as extracellular matrix metalloproteinase inducer (EMMPRIN) or cluster of differentiation 147 (CD147), plays various fundamental roles in the intercellular recognition involved in immunologic phenomena, differentiation, and development. In this study, we aimed to compare the similarities and differences of BSG among organisms and explore possible evolutionary relationships based on the comparison result. We used the extensive BLAST tool to search the metazoan genomes, N-glycosylation sites, the transmembrane region and other functional sites. We then identified BSG homologs from genomic sequences and analyzed their phylogenetic relationships. We identified that BSG genes exist not only in the vertebrate metazoans but also in the invertebrate metazoans such as Amphioxus B. floridae, D. melanogaster, A. mellifera, S. japonicum, C. gigas, and T. patagoniensis. After sequence analysis, we confirmed that only vertebrate metazoans and Cephalochordate (amphioxus B. floridae) have the classic structure (a signal peptide, two Ig-like domains (IgC2 and IgI), a transmembrane region, and an intracellular domain). The invertebrate metazoans (excluding amphioxus B. floridae) lack the N-terminal signal peptides and IgC2 domain. We then generated a phylogenetic tree, genome organization comparison, and chromosomal disposition analysis based on the biological information obtained from the NCBI and Ensembl databases. Finally, we established the possible evolutionary scenario of the BSG gene, which showed the restricted exon rearrangement that has occurred during evolution, forming the present-day BSG gene. Copyright © 2017 Elsevier Ltd. All rights reserved.
Analyzing endocrine system conservation and evolution.
Bonett, Ronald M
2016-08-01
Analyzing variation in rates of evolution can provide important insights into the factors that constrain trait evolution, as well as those that promote diversification. Metazoan endocrine systems exhibit apparent variation in evolutionary rates of their constituent components at multiple levels, yet relatively few studies have quantified these patterns and analyzed them in a phylogenetic context. This may be in part due to historical and current data limitations for many endocrine components and taxonomic groups. However, recent technological advancements such as high-throughput sequencing provide the opportunity to collect large-scale comparative data sets for even non-model species. Such ventures will produce a fertile data landscape for evolutionary analyses of nucleic acid and amino acid based endocrine components. Here I summarize evolutionary rate analyses that can be applied to categorical and continuous endocrine traits, and also those for nucleic acid and protein-based components. I emphasize analyses that could be used to test whether other variables (e.g., ecology, ontogenetic timing of expression, etc.) are related to patterns of rate variation and endocrine component diversification. The application of phylogenetic-based rate analyses to comparative endocrine data will greatly enhance our understanding of the factors that have shaped endocrine system evolution. Copyright © 2016 Elsevier Inc. All rights reserved.
Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees.
Martínez-Aquino, Andrés
2016-08-01
Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host-parasite. In the past 2 decades, algorithms have been developed for cophylogetenic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetical inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, the advances in phylogenetic biogeography applying methods based on parametric process models and combined Bayesian approaches, can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a "compass" when "walking" through jungles of tangled phylogenetic trees.
Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees
2016-01-01
Abstract Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host–parasite. In the past 2 decades, algorithms have been developed for cophylogetenic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetical inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, the advances in phylogenetic biogeography applying methods based on parametric process models and combined Bayesian approaches, can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a “compass” when “walking” through jungles of tangled phylogenetic trees. PMID:29491928
Lessard-Therrien, Malie; Davies, T Jonathan; Bolmgren, Kjell
2014-05-01
Climate change is affecting high-altitude and high-latitude communities in significant ways. In the short growing season of subarctic habitats, it is essential that the timing and duration of phenological phases match favorable environmental conditions. We explored the time of the first appearance of flowers (first flowering day, FFD) and flowering duration across subarctic species composing different communities, from boreal forest to tundra, along an elevational gradient (600-800 m). The study was conducted on Mount Irony (856 m), North-East Canada (54°90'N, 67°16'W) during summer 2012. First, we quantified phylogenetic signal in FFD at different spatial scales. Second, we used phylogenetic comparative methods to explore the relationship between FFD, flowering duration, and elevation. We found that the phylogenetic signal for FFD was stronger at finer spatial scales and at lower elevations, indicating that closely related species tend to flower at similar times when the local environment is less harsh. The comparatively weaker phylogenetic signal at higher elevation may be indicative of convergent evolution for FFD. Flowering duration was correlated significantly with mean FFD, with later-flowering species having a longer flowering duration, but only at the lowest elevation. Our results indicate significant evolutionary conservatism in responses to phenological cues, but high phenotypic plasticity in flowering times. We suggest that phylogenetic relationships should be considered in the search for predictions and drivers of flowering time in comparative analyses, because species cannot be considered as statistically independent. Further, phenological drivers should be measured at spatial scales such that variation in flowering matches variation in environment.
TreeCmp: Comparison of Trees in Polynomial Time
Bogdanowicz, Damian; Giaro, Krzysztof; Wróbel, Borys
2012-01-01
When a phylogenetic reconstruction does not result in one tree but in several, tree metrics permit finding out how far the reconstructed trees are from one another. They also permit to assess the accuracy of a reconstruction if a true tree is known. TreeCmp implements eight metrics that can be calculated in polynomial time for arbitrary (not only bifurcating) trees: four for unrooted (Matching Split metric, which we have recently proposed, Robinson-Foulds, Path Difference, Quartet) and four for rooted trees (Matching Cluster, Robinson-Foulds cluster, Nodal Splitted and Triple). TreeCmp is the first implementation of Matching Split/Cluster metrics and the first efficient and convenient implementation of Nodal Splitted. It allows to compare relatively large trees. We provide an example of the application of TreeCmp to compare the accuracy of ten approaches to phylogenetic reconstruction with trees up to 5000 external nodes, using a measure of accuracy based on normalized similarity between trees.
SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.
Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H
2014-02-26
Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.
2009-01-01
Background Shigella flexneri is one of the causative agents of shigellosis, a major cause of childhood mortality in developing countries. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related bacterial isolates for investigation of disease outbreaks and provide information for establishing phylogenetic patterns among isolates. The present study aimed to develop an MLVA method for S. flexneri and the VNTR loci identified were tested on 242 S. flexneri isolates to evaluate their variability in various serotypes. The isolates were also analyzed by pulsed-field gel electrophoresis (PFGE) to compare the discriminatory power and to evaluate the usefulness of MLVA as a tool for phylogenetic analysis of S. flexneri. Results Thirty-six VNTR loci were identified by exploring the repeat sequence loci in genomic sequences of Shigella species and by testing the loci on nine isolates of different subserotypes. The VNTR loci in different serotype groups differed greatly in their variability. The discriminatory power of an MLVA assay based on four most variable VNTR loci was higher, though not significantly, than PFGE for the total isolates, a panel of 2a isolates, which were relatively diverse, and a panel of 4a/Y isolates, which were closely-related. Phylogenetic groupings based on PFGE patterns and MLVA profiles were considerably concordant. The genetic relationships among the isolates were correlated with serotypes. The phylogenetic trees constructed using PFGE patterns and MLVA profiles presented two distinct clusters for the isolates of serotype 3 and one distinct cluster for each of the serotype groups, 1a/1b/NT, 2a/2b/X/NT, 4a/Y, and 6. Isolates that had different serotypes but had closer genetic relatedness than those with the same serotype were observed between serotype Y and subserotype 4a, serotype X and subserotype 2b, subserotype 1a and 1b, and subserotype 3a and 3b. Conclusions The 36 VNTR loci identified exhibited considerably different degrees of variability among S. flexneri serotype groups. VNTR locus could be highly variable in a serotype but invariable in others. MLVA assay based on four highly variable loci could display a comparable resolving power to PFGE in discriminating isolates. MLVA is also a prominent molecular tool for phylogenetic analysis of S. flexneri; the resulting data are beneficial to establish clear clonal patterns among different serotype groups and to discern clonal groups among isolates within the same serotype. As highly variable VNTR loci could be serotype-specific, a common MLVA protocol that consists of only a small set of loci, for example four to eight loci, and that provides high resolving power to all S. flexneri serotypes may not be obtainable. PMID:20042119
Cornillon, P A; Pontier, D; Rochet, M J
2000-02-21
Comparative methods are used to investigate the attributes of present species or higher taxa. Difficulties arise from the phylogenetic heritage: taxa are not independent and neglecting phylogenetic inertia can lead to inaccurate results. Within-species variations in life-history traits are also not negligible, but most comparative methods are not designed to take them into account. Taxa are generally described by a single value for each trait. We have developed a new model which permits the incorporation of both the phylogenetic relationships among populations and within-species variations. This is an extension of classical autoregressive models. This family of models was used to study the effect of fishing on six demographic traits measured on 77 populations of teleost fishes. Copyright 2000 Academic Press.
USDA-ARS?s Scientific Manuscript database
The family Rutaceae encompasses several genera including the economically important genus Citrus. In this study, we selected 22 citrus relatives belonging to the various sub groups of Rutaceae and compared the sequences of three gene fragments. The accessions selected belong to the subfamily Rutoide...
Housworth, E A; Martins, E P
2001-01-01
Statistical randomization tests in evolutionary biology often require a set of random, computer-generated trees. For example, earlier studies have shown how large numbers of computer-generated trees can be used to conduct phylogenetic comparative analyses even when the phylogeny is uncertain or unknown. These methods were limited, however, in that (in the absence of molecular sequence or other data) they allowed users to assume that no phylogenetic information was available or that all possible trees were known. Intermediate situations where only a taxonomy or other limited phylogenetic information (e.g., polytomies) are available are technically more difficult. The current study describes a procedure for generating random samples of phylogenies while incorporating limited phylogenetic information (e.g., four taxa belong together in a subclade). The procedure can be used to conduct comparative analyses when the phylogeny is only partially resolved or can be used in other randomization tests in which large numbers of possible phylogenies are needed.
Drury, J P; Grether, G F; Garland, T; Morlon, H
2018-05-01
Much ecological and evolutionary theory predicts that interspecific interactions often drive phenotypic diversification and that species phenotypes in turn influence species interactions. Several phylogenetic comparative methods have been developed to assess the importance of such processes in nature; however, the statistical properties of these methods have gone largely untested. Focusing mainly on scenarios of competition between closely-related species, we assess the performance of available comparative approaches for analyzing the interplay between interspecific interactions and species phenotypes. We find that many currently used statistical methods often fail to detect the impact of interspecific interactions on trait evolution, that sister-taxa analyses are particularly unreliable in general, and that recently developed process-based models have more satisfactory statistical properties. Methods for detecting predictors of species interactions are generally more reliable than methods for detecting character displacement. In weighing the strengths and weaknesses of different approaches, we hope to provide a clear guide for empiricists testing hypotheses about the reciprocal effect of interspecific interactions and species phenotypes and to inspire further development of process-based models.
Zhou, Xuming; Xu, Shixia; Xu, Junxiao; Chen, Bingyao; Zhou, Kaiya; Yang, Guang
2012-01-01
Abstract Although great progress has been made in resolving the relationships of placental mammals, the position of several clades in Laurasiatheria remain controversial. In this study, we performed a phylogenetic analysis of 97 orthologs (46,152 bp) for 15 taxa, representing all laurasiatherian orders. Additionally, phylogenetic trees of laurasiatherian mammals with draft genome sequences were reconstructed based on 1608 exons (2,175,102 bp). Our reconstructions resolve the interordinal relationships within Laurasiatheria and corroborate the clades Scrotifera, Fereuungulata, and Cetartiodactyla. Furthermore, we tested alternative topologies within Laurasiatheria, and among alternatives for the phylogenetic position of Perissodactyla, a sister-group relationship with Cetartiodactyla receives the highest support. Thus, Pegasoferae (Perissodactyla + Carnivora + Pholidota + Chiroptera) does not appear to be a natural group. Divergence time estimates from these genes were compared with published estimates for splits within Laurasiatheria. Our estimates were similar to those of several studies and suggest that the divergences among these orders occurred within just a few million years. PMID:21900649
Disentangling the Trichoderma viridescens complex.
Jaklitsch, W M; Samuels, G J; Ismaiel, A; Voglmayr, H
2013-12-01
Trichoderma viridescens is recognised as a species complex. Multigene analyses based on the translation elongation factor 1-alpha encoding gene (tef1), a part of the rpb2 gene, encoding the second largest RNA polymerase subunit and the larger subunit of ATP citrate lyase (acl1) reveals 13 phylogenetic species with little or no phenotypic differentiation. This is the first use of acl1 in Trichoderma phylogenetics. The typification of T. viridescens s.str. is clarified and Hypocrea viridescens is replaced by the new name T. paraviridescens. Besides these two species, eleven are phylogenetically recognised and T. olivascens, T. viridarium, T. virilente, T. trixiae, T. viridialbum, T. appalachiense, T. neosinense, T. composticola, T. nothescens and T. sempervirentis are formally described and illustrated. Several species produce yellow diffusing pigment on cornmeal dextrose agar, particularly after storage at 15 °C, while T. olivascens is characterised by the formation of an olivaceous pigment. The results are compared with earlier publications on this group of species.
Fan, Chunyu; Tan, Lingzhao; Zhang, Chunyu; Zhao, Xiuhai; von Gadow, Klaus
2017-10-30
One of the core issues of forest community ecology is the exploration of how ecological processes affect community structure. The relative importance of different processes is still under debate. This study addresses four questions: (1) how is the taxonomic structure of a forest community affected by spatial scale? (2) does the taxonomic structure reveal effects of local processes such as environmental filtering, dispersal limitation or interspecific competition at a local scale? (3) does the effect of local processes on the taxonomic structure vary with the spatial scale? (4) does the analysis based on taxonomic structures provide similar insights when compared with the use of phylogenetic information? Based on the data collected in two large forest observational field studies, the taxonomic structures of the plant communities were analyzed at different sampling scales using taxonomic ratios (number of genera/number of species, number of families/number of species), and the relationship between the number of higher taxa and the number of species. Two random null models were used and the "standardized effect size" (SES) of taxonomic ratios was calculated, to assess possible differences between the observed and simulated taxonomic structures, which may be caused by specific ecological processes. We further applied a phylogeny-based method to compare results with those of the taxonomic approach. As expected, the taxonomic ratios decline with increasing grain size. The quantitative relationship between genera/families and species, described by a linearized power function, showed a good fit. With the exception of the family-species relationship in the Jiaohe study area, the exponents of the genus/family-species relationships did not show any scale dependent effects. The taxonomic ratios of the observed communities had significantly lower values than those of the simulated random community under the test of two null models at almost all scales. Null Model 2 which considered the spatial dispersion of species generated a taxonomic structure which proved to be more consistent with that in the observed community. As sampling sizes increased from 20 m × 20 m to 50 m × 50 m, the magnitudes of SESs of taxonomic ratios increased. Based on the phylogenetic analysis, we found that the Jiaohe plot was phylogenetically clustered at almost all scales. We detected significant phylogenetically overdispersion at the 20 m × 20 m and 30 m × 30 m scales in the Liangshui plot. The results suggest that the effect of abiotic filtering is greater than the effects of interspecific competition in shaping the local community at almost all scales. Local processes influence the taxonomic structures, but their combined effects vary with the spatial scale. The taxonomic approach provides similar insights as the phylogenetic approach, especially when we applied a more conservative null model. Analysing taxonomic structure may be a useful tool for communities where well-resolved phylogenetic data are not available.
LEEBENS-MACK, JIM; VISION, TODD; BRENNER, ERIC; BOWERS, JOHN E.; CANNON, STEVEN; CLEMENT, MARK J.; CUNNINGHAM, CLIFFORD W.; dePAMPHILIS, CLAUDE; deSALLE, ROB; DOYLE, JEFF J.; EISEN, JONATHAN A.; GU, XUN; HARSHMAN, JOHN; JANSEN, ROBERT K.; KELLOGG, ELIZABETH A.; KOONIN, EUGENE V.; MISHLER, BRENT D.; PHILIPPE, HERVÉ; PIRES, J. CHRIS; QIU, YIN-LONG; RHEE, SEUNG Y.; SJÖLANDER, KIMMEN; SOLTIS, DOUGLAS E.; SOLTIS, PAMELA S.; STEVENSON, DENNIS W.; WALL, KERR; WARNOW, TANDY; ZMASEK, CHRISTIAN
2011-01-01
In the eight years since phylogenomics was introduced as the intersection of genomics and phylogenetics, the field has provided fundamental insights into gene function, genome history and organismal relationships. The utility of phylogenomics is growing with the increase in the number and diversity of taxa for which whole genome and large transcriptome sequence sets are being generated. We assert that the synergy between genomic and phylogenetic perspectives in comparative biology would be enhanced by the development and refinement of minimal reporting standards for phylogenetic analyses. Encouraged by the development of the Minimum Information About a Microarray Experiment (MIAME) standard, we propose a similar roadmap for the development of a Minimal Information About a Phylogenetic Analysis (MIAPA) standard. Key in the successful development and implementation of such a standard will be broad participation by developers of phylogenetic analysis software, phylogenetic database developers, practitioners of phylogenomics, and journal editors. PMID:16901231
Metabolic Pathway Assignment of Plant Genes based on Phylogenetic Profiling–A Feasibility Study
Weißenborn, Sandra; Walther, Dirk
2017-01-01
Despite many developed experimental and computational approaches, functional gene annotation remains challenging. With the rapidly growing number of sequenced genomes, the concept of phylogenetic profiling, which predicts functional links between genes that share a common co-occurrence pattern across different genomes, has gained renewed attention as it promises to annotate gene functions based on presence/absence calls alone. We applied phylogenetic profiling to the problem of metabolic pathway assignments of plant genes with a particular focus on secondary metabolism pathways. We determined phylogenetic profiles for 40,960 metabolic pathway enzyme genes with assigned EC numbers from 24 plant species based on sequence and pathway annotation data from KEGG and Ensembl Plants. For gene sequence family assignments, needed to determine the presence or absence of particular gene functions in the given plant species, we included data of all 39 species available at the Ensembl Plants database and established gene families based on pairwise sequence identities and annotation information. Aside from performing profiling comparisons, we used machine learning approaches to predict pathway associations from phylogenetic profiles alone. Selected metabolic pathways were indeed found to be composed of gene families of greater than expected phylogenetic profile similarity. This was particularly evident for primary metabolism pathways, whereas for secondary pathways, both the available annotation in different species as well as the abstraction of functional association via distinct pathways proved limiting. While phylogenetic profile similarity was generally not found to correlate with gene co-expression, direct physical interactions of proteins were reflected by a significantly increased profile similarity suggesting an application of phylogenetic profiling methods as a filtering step in the identification of protein-protein interactions. This feasibility study highlights the potential and challenges associated with phylogenetic profiling methods for the detection of functional relationships between genes as well as the need to enlarge the set of plant genes with proven secondary metabolism involvement as well as the limitations of distinct pathways as abstractions of relationships between genes. PMID:29163570
Diversity and Phylogenetic Distribution of Extracellular Microbial Peptidases
NASA Astrophysics Data System (ADS)
Nguyen, Trang; Mueller, Ryan; Myrold, David
2017-04-01
Depolymerization of proteinaceous compounds by extracellular proteolytic enzymes is a bottleneck in the nitrogen cycle, limiting the rate of the nitrogen turnover in soils. Protein degradation is accomplished by a diverse range of extracellular (secreted) peptidases. Our objective was to better understand the evolution of these enzymes and how their functional diversity corresponds to known phylogenetic diversity. Peptidase subfamilies from 110 archaeal, 1,860 bacterial, and 97 fungal genomes were extracted from the MEROPS database along with corresponding SSU sequences for each genome from the SILVA database, resulting in 43,177 secreted peptidases belonging to 34 microbial phyla and 149 peptidase subfamilies. We compared the distribution of each peptidase subfamily across all taxa to the phylogenetic relationships of these organisms based on their SSU gene sequences. The occurrence and abundance of genes coding for secreted peptidases varied across microbial taxa, distinguishing the peptidase complement of the three microbial kingdoms. Bacteria had the highest frequency of secreted peptidase coding genes per 1,000 genes and contributed from 1% to 6% of the gene content. Fungi only had a slightly higher number of secreted peptidase gene content than archaea, standardized by the total genes. The relative abundance profiles of secreted peptidases in each microbial kingdom also varied, in which aspartic family was found to be the greatest in fungi (25%), whereas it was only 12% in archaea and 4% in bacteria. Serine, metallo, and cysteine families consistently contributed widely up to 75% of the secreted peptidase abundance across the three kingdoms. Overall, bacteria had a much wider collection of secreted peptidases, whereas fungi and archaea shared most of their secreted peptidase families. Principle coordinate analysis of the peptidase subfamily-based dissimilarities showed distinguishable clusters for different groups of microorganisms. The distribution of secreted peptidases was found to be significantly correlated with phylogenetic relationships within kingdoms (archaea rMantel=0.364, p=0.001; bacteria rMantel=0.257, p=0.001, and fungi rMantel=0.281, p=0.005), inferring an evolutionary relationship where subsets of phylogenetically related organisms share similar types of secreted peptidases. We also tested the phylogenetic signal strength of each peptidase subfamily for each microbial kingdom based on the binary traits of the distribution (presence or absence of secreted peptidase subfamilies in individual species). About one-third of the peptidase subfamilies displayed a strong evolutionary signal; the rest were phylogenetically over-dispersed, suggesting that these subfamilies are randomly distributed across the tree of life or the result of events such as horizontal gene transfer. Study of the diversity and phylogenetic distribution of secreted peptidases offered a mechanistic basis to anticipate the proteolytic potential function of microbial communities.
Increased phylogenetic resolution using target enrichment in Rubus
USDA-ARS?s Scientific Manuscript database
Phylogenetic analyses in Rubus L. have been challenging due to polyploidy, hybridization, and apomixis within the genus. Wide morphological diversity occurs within and between species, contributing to challenges at lower and higher systematic levels. Phylogenetic inferences to date have been based o...
[A phylogenetic analysis of plant communities of Teberda Biosphere Reserve].
Shulakov, A A; Egorov, A V; Onipchenko, V G
2016-01-01
Phylogenetic analysis of communities is based on the comparison of distances on the phylogenetic tree between species of a community under study and those distances in random samples taken out of local flora. It makes it possible to determine to what extent a community composition is formed by more closely related species (i.e., "clustered") or, on the opposite, it is more even and includes species that are less related with each other. The first case is usually interpreted as a result of strong influence caused by abiotic factors, due to which species with similar ecology, a priori more closely related, would remain: In the second case, biotic factors, such as competition, may come to the fore and lead to forming a community out of distant clades due to divergence of their ecological niches: The aim of this' study Was Ad explore the phylogenetic structure in communities of the northwestern Caucasus at two spatial scales - the scale of area from 4 to 100 m2 and the smaller scale within a community. The list of local flora of the alpine belt has been composed using the database of geobotanic descriptions carried out in Teberda Biosphere Reserve at true altitudes exceeding.1800 m. It includes 585 species of flowering plants belonging to 57 families. Basal groups of flowering plants are.not represented in the list. At the scale of communities of three classes, namely Thlaspietea rotundifolii - commumties formed on screes and pebbles, Calluno-Ulicetea - alpine meadow, and Mulgedio-Aconitetea subalpine meadows, have not demonstrated significant distinction of phylogenetic structure. At intra level, for alpine meadows the larger share of closely related species. (clustered community) is detected. Significantly clustered happen to be those communities developing on rocks (class Asplenietea trichomanis) and alpine (class Juncetea trifidi). At the same time, alpine lichen proved to have even phylogenetic structure at the small scale. Alpine (class Salicetea herbaceae) that develop under conditions of winter snow accumulation were more,even at the both.scale, i.e., contained more diverse and distantly related plant species compared with random samples. (Scheuchzerio-Caricetea fuscae) aquatic communities in cold (Montio-Cardaminetea), sedge meadows (Carici rupestris-Kobresietea bellardii), and communities, in which shrubs and predominated (juniper and rhododendron elfin woods, class Loiseleurio-Vaccinietea), have been studied only at the larger scale and showed significant evenness of species composition, i.e., were phylogenetically more diverse compared with random samples.
Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology.
Soltis, P S; Soltis, D E; Chase, M W
1999-11-25
Comparative biology requires a firm phylogenetic foundation to uncover and understand patterns of diversification and evaluate hypotheses of the processes responsible for these patterns. In the angiosperms, studies of diversification in floral form, stamen organization, reproductive biology, photosynthetic pathway, nitrogen-fixing symbioses and life histories have relied on either explicit or implied phylogenetic trees. Furthermore, to understand the evolution of specific genes and gene families, evaluate the extent of conservation of plant genomes and make proper sense of the huge volume of molecular genetic data available for model organisms such as Arabidopsis, Antirrhinum, maize, rice and wheat, a phylogenetic perspective is necessary. Here we report the results of parsimony analyses of DNA sequences of the plastid genes rbcL and atpB and the nuclear 18S rDNA for 560 species of angiosperms and seven non-flowering seed plants and show a well-resolved and well-supported phylogenetic tree for the angiosperms for use in comparative biology.
Borkent, Art
2018-01-17
The order Diptera is remarkably diverse, not only in species but in morphological variation in every life stage, making them excellent candidates for phylogenetic analysis. Such analysis has been hampered by methods that have severely restricted character state interpretation. Morphological-based phylogenies should be based on a deep understanding of the morphology, development and function of character states, and have extensive outgroup comparisons made to determine their polarity. Character states clearly vary in their value for determining phylogenetic relationships and this needs to be studied and utilized. Characters themselves need more explicit discussion, including how some may be developmentally or functionally related to other characters (and potentially not independent indicators of genealogical relationship). The current practice by many, of filling a matrix with poorly understood character states and highly limited outgroup comparisons, is unacceptable if the results are to be a valid reflection of the actual history of the group.Parsimony analysis is not an objective interpretation of phylogenetic relationships when all characters are treated as equal in value. Exact mathematical values applied to characters are entirely arbitrary and are generally used to produce a phylogeny that the author considers as reasonable. Mathematical appraisal of a given node is similarly inconsequential because characters do not have an intrinsic mathematical value. Bremer support, for example, provides values that have no biological reality but provide the pretence of objectivity. Cladists need to focus their attention on testing the validity of each synapomorphy proposed, as the basis for all further phylogenetic interpretation, rather than the testing of differing phylogenies through various comparative programs.Current phylogenetic analyses have come to increasingly depend on DNA sequence-based characters, in spite of their tumultuous history of inconsistent results. Until such time as sequences can be shown to produce predictive phylogenies (i.e., using Hennigian logic), independent of morphological analysis, they should be viewed with caution and certainly not as a panacea as they are commonly portrayed.The purported comprehensive analyses of phylogenetic relationships between families of Diptera by Wiegmann et al. (2011) and Lambkin et al. (2013) have serious flaws and cannot be considered as the "Periodic Table" of such relationships as originally heralded.Systematists working on Diptera have a plethora of complex and informative morphological synapomorphies in every life stage, either described or awaiting study. Many lineages have the potential of providing a wealth of evolutionary stories to share with other biologists if we produce stable phylogenies based on weighted synapomorphies and interpreted to elucidate the zoogeographic and bionomic divergence of the group. Some lineages are devoid of convincing synapomorphies and, in spite of our desires, should be recognized as being largely uninterpretable.
Tree-Based Unrooted Phylogenetic Networks.
Francis, A; Huber, K T; Moulton, V
2018-02-01
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.
Improved Maximum Parsimony Models for Phylogenetic Networks.
Van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2018-05-01
Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.
Mammalian phylogenetic diversity-area relationships at a continental scale
Mazel, Florent; Renaud, Julien; Guilhaumon, François; Mouillot, David; Gravel, Dominique; Thuiller, Wilfried
2015-01-01
In analogy to the species-area relationship (SAR), one of the few laws in Ecology, the phylogenetic diversity-area relationship (PDAR) describes the tendency of phylogenetic diversity (PD) to increase with area. Although investigating PDAR has the potential to unravel the underlying processes shaping assemblages across spatial scales and to predict PD loss through habitat reduction, it has been little investigated so far. Focusing on PD has noticeable advantages compared to species richness (SR) since PD also gives insights on processes such as speciation/extinction, assembly rules and ecosystem functioning. Here we investigate the universality and pervasiveness of the PDAR at continental scale using terrestrial mammals as study case. We define the relative robustness of PD (compared to SR) to habitat loss as the area between the standardized PDAR and standardized SAR (i.e. standardized by the diversity of the largest spatial window) divided by the area under the standardized SAR only. This metric quantifies the relative increase of PD robustness compared to SR robustness. We show that PD robustness is higher than SR robustness but that it varies among continents. We further use a null model approach to disentangle the relative effect of phylogenetic tree shape and non random spatial distribution of evolutionary history on the PDAR. We find that for most spatial scales and for all continents except Eurasia, PDARs are not different from expected by a model using only the observed SAR and the shape of the phylogenetic tree at continental scale. Interestingly, we detect a strong phylogenetic structure of the Eurasian PDAR that can be predicted by a model that specifically account for a finer biogeographical delineation of this continent. In conclusion, the relative robustness of PD to habitat loss compared to species richness is determined by the phylogenetic tree shape but also depends on the spatial structure of PD. PMID:26649401
Ortí, G; Meyer, A
1996-04-01
The rate and pattern of DNA evolution of ependymin, a single-copy gene coding for a highly expressed glycoprotein in the brain matrix of teleost fishes, is characterized and its phylogenetic utility for fish systematics is assessed. DNA sequences were determined from catfish, electric fish, and characiforms and compared with published ependymin sequences from cyprinids, salmon, pike, and herring. Among these groups, ependymin amino acid sequences were highly divergent (up to 60% sequence difference), but had surprisingly similar hydropathy profiles and invariant glycosylation sites, suggesting that functional properties of the proteins are conserved. Comparison of base composition at third codon positions and introns revealed AT-rich introns and GC-rich third codon positions, suggesting that the biased codon usage observed might not be due to mutational bias. Phylogenetic information content of third codon positions was surprisingly high and sufficient to recover the most basal nodes of the tree, in spite of the observation that pairwise distances (at third codon positions) were well above the presumed saturation level. This finding can be explained by the high proportion of phylogenetically informative nonsynonymous changes at third codon positions among these highly divergent proteins. Ependymin DNA sequences have established the first molecular evidence for the monophyly of a group containing salmonids and esociforms. In addition, ependymin suggests a sister group relationship of electric fish (Gymnotiformes) and Characiformes, constituting a significant departure from currently accepted classifications. However, relationships among characiform lineages were not completely resolved by ependymin sequences in spite of seemingly appropriate levels of variation among taxa and considerably low levels of homoplasy in the data (consistency index = 0.7). If the diversification of Characiformes took place in an "explosive" manner, over a relatively short period of time this pattern should also be observed using other phylogenetic markers. Poor conservation of ependymin's primary structure hinders the design of efficient primers for PCR that could be used in wide-ranging fish systematic studies. However, alternative methods like PCR amplification from cDNA used here should provide promising comparative sequence data for the resolution of phylogenetic relationships among other basal lineages of teleost fishes.
Liu, Chi; Yao, Minjie; Stegen, James C.; ...
2017-12-13
How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. Here, we started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightlymore » increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Chi; Yao, Minjie; Stegen, James C.
How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. Here, we started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightlymore » increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.« less
Liu, Chi; Yao, Minjie; Stegen, James C; Rui, Junpeng; Li, Jiabao; Li, Xiangzhen
2017-12-13
How press disturbance (long-term) influences the phylogenetic turnover of soil microbial communities responding to pulse disturbances (short-term) is not fully known. Understanding the complex connections between the history of environmental conditions, assembly processes and microbial community dynamics is necessary to predict microbial response to perturbation. We started by investigating phylogenetic spatial turnover (based on DNA) of soil prokaryotic communities after long-term nitrogen (N) deposition and temporal turnover (based on RNA) of communities responding to pulse by conducting short-term rewetting experiments. The results showed that moderate N addition increased ecological stochasticity and phylogenetic diversity. In contrast, high N addition slightly increased homogeneous selection and decreased phylogenetic diversity. Examining the system with higher phylogenetic resolution revealed a moderate contribution of variable selection across the whole N gradient. The moisture pulse experiment showed that high N soils had higher rates of phylogenetic turnover across short phylogenetic distances and significant changes in community compositions through time. Long-term N input history influenced spatial turnover of microbial communities, but the dominant community assembly mechanisms differed across different N deposition gradients. We further revealed an interaction between press and pulse disturbances whereby deterministic processes were particularly important following pulse disturbances in high N soils.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Medina, Monica; Collins, Timothy M.; Walsh, Patrick J.
2000-08-10
Sea hares within the genus Aplysia are important neurobiological model organisms, and as studies based on different Aplysia species appear in the literature, a phylogenetic framework has become essential. We present a phylogenetic hypothesis for this genus, based on portions of two mitochondrial genes (12S and 16S). In addition, we reconstruct the evolution of several behavioral characters of interest to neurobiologists in order to illustrate the potential benefits of a phylogeny for the genus Aplysia. These benefits include the determination of ancestral traits, the direction and timing of evolution of characters, prediction of the distribution of traits, and identification ofmore » cases of independent acquisition of traits within lineages. This last benefit may prove especially useful in understanding the linkage between behaviors and their underlying neurological basis.« less
Dating human cultural capacity using phylogenetic principles
Lind, J.; Lindenfors, P.; Ghirlanda, S.; Lidén, K.; Enquist, M.
2013-01-01
Humans have genetically based unique abilities making complex culture possible; an assemblage of traits which we term “cultural capacity”. The age of this capacity has for long been subject to controversy. We apply phylogenetic principles to date this capacity, integrating evidence from archaeology, genetics, paleoanthropology, and linguistics. We show that cultural capacity is older than the first split in the modern human lineage, and at least 170,000 years old, based on data on hyoid bone morphology, FOXP2 alleles, agreement between genetic and language trees, fire use, burials, and the early appearance of tools comparable to those of modern hunter-gatherers. We cannot exclude that Neanderthals had cultural capacity some 500,000 years ago. A capacity for complex culture, therefore, must have existed before complex culture itself. It may even originated long before. This seeming paradox is resolved by theoretical models suggesting that cultural evolution is exceedingly slow in its initial stages. PMID:23648831
Yeast species diversity in apple juice for cider production evidenced by culture-based method.
Lorenzini, Marilinda; Simonato, Barbara; Zapparoli, Giacomo
2018-05-07
Identification of yeasts isolated from apple juices of two cider houses (one located in a plain area and one in an alpine area) was carried out by culture-based method. Wallerstein Laboratory Nutrient Agar was used as medium for isolation and preliminary yeasts identification. A total of 20 species of yeasts belonging to ten different genera were identified using both BLAST algorithm for pairwise sequence comparison and phylogenetic approaches. A wide variety of non-Saccharomyces species was found. Interestingly, Candida railenensis, Candida cylindracea, Hanseniaspora meyeri, Hanseniaspora pseudoguilliermondii, and Metschnikowia sinensis were recovered for the first time in the yeast community of an apple environment. Phylogenetic analysis revealed a better resolution in identifying Metschnikowia and Moesziomyces isolates than comparative analysis using the GenBank or YeastIP gene databases. This study provides important data on yeast microbiota of apple juice and evidenced differences between two geographical cider production areas in terms of species composition.
Chesser, R.T.; ten, Have J.
2007-01-01
Evolutionary relationships of the scrub-birds Atrichornis were investigated using complete sequences of the recombination-activating gene RAG-1 and the proto-oncogene c-mos for two individuals of the noisy scrub-bird Atrichornis clamosus. Phylogenetic analysis revealed that Atrichornis was sister to the genus Menura (the lyrebirds) and that these two genera (the Menurae) were sister to the rest of the oscine passerines. A sister relationship between Atrichornis and Menura supports the traditional view, based on morphology and DNA hybridization, that these taxa are closely related. Similarly, a sister relationship with the remaining oscine passerines agrees with the morphological distinctiveness of Atrichornis and Menura, although this result contradicts conclusions based on DNA hybridization studies. Although Atrichornis is very well known morphologically, previous conclusions regarding its relationships were hampered by a lack of comparative knowledge of other passerines, making concurrence of the sequence data of particular significance. ?? Dt. Ornithologen-Gesellschaft e.V. 2007.
Winkler, Isaac S; Blaschke, Jeremy D; Davis, Daniel J; Stireman, John O; O'Hara, James E; Cerretti, Pierfilippo; Moulton, John K
2015-07-01
Molecular phylogenetic studies at all taxonomic levels often infer rapid radiation events based on short, poorly resolved internodes. While such rapid episodes of diversification are an important and widespread evolutionary phenomenon, much of this poor phylogenetic resolution may be attributed to the continuing widespread use of "traditional" markers (mitochondrial, ribosomal, and some nuclear protein-coding genes) that are often poorly suited to resolve difficult, higher-level phylogenetic problems. Here we reconstruct phylogenetic relationships among a representative set of taxa of the parasitoid fly family Tachinidae and related outgroups of the superfamily Oestroidea. The Tachinidae are one of the most species rich, yet evolutionarily recent families of Diptera, providing an ideal case study for examining the differential performance of loci in resolving phylogenetic relationships and the benefits of adding more loci to phylogenetic analyses. We assess the phylogenetic utility of nine genes including both traditional genes (e.g., CO1 mtDNA, 28S rDNA) and nuclear protein-coding genes newly developed for phylogenetic analysis. Our phylogenetic findings, based on a limited set of taxa, include: a close relationship between Tachinidae and the calliphorid subfamily Polleninae, monophyly of Tachinidae and the subfamilies Exoristinae and Dexiinae, subfamily groupings of Dexiinae+Phasiinae and Tachininae+Exoristinae, and robust phylogenetic placement of the somewhat enigmatic genera Strongygaster, Euthera, and Ceracia. In contrast to poor resolution and phylogenetic incongruence of "traditional genes," we find that a more selective set of highly informative genes is able to more precisely identify regions of the phylogeny that experienced rapid radiation of lineages, while more accurately depicting their phylogenetic context. Although much expanded taxon sampling is necessary to effectively assess the monophyly of and relationships among major tachinid lineages and their relatives, we show that a small number of well-chosen nuclear protein-coding genes can successfully resolve even difficult phylogenetic problems. Copyright © 2015 Elsevier Inc. All rights reserved.
Phylogenomics of African guenons.
Moulin, Sibyle; Gerbault-Seureau, Michèle; Dutrillaux, Bernard; Richard, Florence Anne
2008-01-01
The karyotypes of 28 specimens belonging to 26 species of Cercopithecinae have been compared with each other and with human karyotype by chromosome banding and, for some of them, by Zoo-FISH (human painting probes) techniques. The study includes the first description of the karyotypes of four species and a synonym of Cercopithecus nictitans. The chromosomal homologies obtained provide us with new data on a large number of rearrangements. This allows us to code chromosomal characters to draw Cercopithecini phylogenetic trees, which are compared to phylogenetic data based on DNA sequences. Our findings show that some of the superspecies proposed by Kingdon (1997 The Kingdon Field Guide to African Mammals, Academic Press.) and Groves (2001 Primates Taxonomy, Smithsonian Institution Press) do not form homogeneous groups and that the genus Cercopithecus is paraphyletic, in agreement with previous molecular analyses. The evolution of Cercopithecini karyotypes is mainly due to non-centromeric chromosome fissions and centromeric shifts or inversions. Non-Robertsonian translocations occurred in C. hamlyni and C. neglectus. The position of chromosomal rearrangements in the phylogenetic tree leads us to propose that the Cercopithecini evolution proceeded by either repeated fission events facilitated by peculiar genomic structures or successive reticulate phases, in which heterozygous populations for few rearranged chromosomes were present, allowing the spreading of chromosomal forms in various combinations, before the speciation process.
A taxonomic and phylogenetic re-appraisal of the genus Curvularia
USDA-ARS?s Scientific Manuscript database
Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...
Cai, J; Collins, M D
1994-04-01
The 16S rRNA gene sequence of Melissococcus pluton, the causative agent of European foulbrood disease, was determined in order to investigate the phylogenetic relationships between this organism and other low-G + C-content gram-positive bacteria. A comparative sequence analysis revealed that M. pluton is a close phylogenetic relative of the genus Enterococcus.
Cao, Huiluo; Hong, Yiguo; Li, Meng; Gu, Ji-Dong
2011-11-01
The phylogenetic diversity of ammonia-oxidizing archaea (AOA) was surveyed in the surface sediments from the northern part of the South China Sea (SCS). The distribution pattern of AOA in the western Pacific was discussed through comparing the SCS with other areas in the western Pacific including Changjiang Estuary and the adjacent East China Sea where high input of anthropogenic nitrogen was evident, the tropical West Pacific Continental Margins close to the Philippines, the deep-sea methane seep sediments in the Okhotsk Sea, the cold deep sea of Northeastern Japan Sea, and the hydrothermal field in the Southern Okinawa Trough. These various environments provide a wide spectrum of physical and chemical conditions for a better understanding of the distribution pattern and diversities of AOA in the western Pacific. Under these different conditions, the distinct community composition between shallow and deep-sea sediments was clearly delineated based on the UniFrac PCoA and Jackknife Environmental Cluster analyses. Phylogenetic analyses showed that a few ammonia-oxidizing archaeal subclades in the marine water column/sediment clade and endemic lineages were indicative phylotypes for some environments. Higher phylogenetic diversity was observed in the Philippines while lower diversity in the hydrothermal vent habitat. Water depth and possibly with other environmental factors could be the main driving forces to shape the phylogenetic diversity of AOA observed, not only in the SCS but also in the whole western Pacific. The multivariate regression tree analysis also supported this observation consistently. Moreover, the functions of current and other climate factors were also discussed in comparison of phylogenetic diversity. The information collectively provides important insights into the ecophysiological requirements of uncultured ammonia-oxidizing archaeal lineages in the western Pacific Ocean.
NASA Astrophysics Data System (ADS)
Kuo, Chun Wei; Hao Huang, Kuan; Hsu, Bing Mu; Tsai, Hsien Lung; Tseng, Shao Feng; Kao, Po Min; Shen, Shu Min; Chou Chiu, Yi; Chen, Jung Sheng
2013-04-01
Salmonella is one of the most important pathogens of waterborne diseases with outbreaks from contaminated water reported worldwide. In addition, Salmonella spp. can survive for long periods in aquatic environments. To realize genotypes and serovars of Salmonella in aquatic environments, we isolated the Salmonella strains by selective culture plates to identify the serovars of Salmonella by serological assay, and identify the genotypes by Multilocus sequence typing (MLST) based on the sequence data from University College Cork (UCC), respectively. The results show that 36 stream water samples (30.1%) and 18 drinking water samples (23.3%) were confirmed the existence of Salmonella using culture method combined PCR specific invA gene amplification. In this study, 24 cultured isolates of Salmonella from water samples were classified to fifteen Salmonella enterica serovars. In addition, we construct phylogenetic analysis using phylogenetic tree and Minimum spanning tree (MST) method to analyze the relationship of clinical, environmental, and geographical data. Phylogenetic tree showed that four main clusters and our strains can be distributed in all. The genotypes of isolates from stream water are more biodiversity while comparing the Salmonella strains genotypes from drinking water sources. According to MST data, we can found the positive correlation between serovars and genotypes of Salmonella. Previous studies revealed that the result of Pulsed field gel electrophoresis (PFGE) method can predict the serovars of Salmonella strain. Hence, we used the MLST data combined phylogenetic analysis to identify the serovars of Salmonella strain and achieved effectiveness. While using the geographical data combined phylogenetic analysis, the result showed that the dominant strains were existed in whole stream area in rainy season. Keywords: Salmonella spp., MLST, phylogenetic analysis, PFGE
Phylogenetic diversity and biodiversity indices on phylogenetic networks.
Wicke, Kristina; Fischer, Mareike
2018-04-01
In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.
Yuko Ota; Mee-Sook Kim; Hitoshi Neda; Ned B. Klopfenstein; Eri Hasegawa
2011-01-01
An undetermined Armillaria species was collected on Amami-Oshima, a subtropical island of Japan. The phylogenetic position of the Armillaria sp. was determined using sequences of the elongation factor-1a (EF-1a) gene and the internal transcribed spacer (ITS) region (ITS1-5.8S-ITS2) of ribosomal DNA (rDNA). The phylogenetic analyses based on EF-1a and ITS sequences...
Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita
2017-06-01
The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21-tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive individual sequence-structure matrix to gain more resolution on the phylogenetic reconstruction. The results showed that ITS sequence-structure is the reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
Effectiveness of protected areas for vertebrates based on taxonomic and phylogenetic diversity.
Quan, Qing; Che, Xianli; Wu, Yongjie; Wu, Yuchun; Zhang, Qiang; Zhang, Min; Zou, Fasheng
2018-04-01
Establishing protected areas is the primary goal and tool for preventing irreversible biodiversity loss. However, the effectiveness of protected areas that target specific species has been questioned for some time because targeting key species for conservation may impair the integral regional pool of species diversity and phylogenetic and functional diversity are seldom considered. We assessed the efficacy of protected areas in China for the conservation of phylogenetic diversity based on the ranges and phylogenies of 2279 terrestrial vertebrates. Phylogenetic and taxonomic diversity were strongly and positively correlated, and only 12.1-43.8% of priority conservation areas are currently protected. However, the patterns and coverage of phylogenetic diversity were affected when weighted by species richness. These results indicated that in China, protected areas targeting high species richness protected phylogenetic diversity well overall but failed to do so in some regions with more unique or threatened communities (e.g., coastal areas of eastern China, where severely threatened avian communities were less protected). Our results suggest that the current distribution of protected areas could be improved, although most protected areas protect both taxonomic and phylogenetic diversity. © 2017 Society for Conservation Biology.
Vidal-Martínez, Victor M.
2017-01-01
The phylogenetic position of three taxa from two trematode genera, belonging to the subfamily Acanthostominae (Opisthorchioidea: Cryptogonimidae), were analysed using partial 28S ribosomal DNA (Domains 1–2) and internal transcribed spacers (ITS1–5.8S–ITS2). Bayesian inference and Maximum likelihood analyses of combined 28S rDNA and ITS1 + 5.8S + ITS2 sequences indicated the monophyly of the genus Acanthostomum (A. cf. americanum and A. burminis) and paraphyly of the Acanthostominae. These phylogenetic relationships were consistent in analyses of 28S alone and concatenated 28S + ITS1 + 5.8S + ITS2 sequences analyses. Based on molecular phylogenetic analyses, the subfamily Acanthostominae is therefore a paraphyletic taxon, in contrast with previous classifications based on morphological data. Phylogenetic patterns of host specificity inferred from adult stages of other cryptogonimid taxa are also well supported. However, analyses using additional genera and species are necessary to support the phylogenetic inferences from this study. Our molecular phylogenetic reconstruction linked two larval stages of A. cf. americanum cercariae and metacercariae. Here, we present the evolutionary and ecological implications of parasitic infections in freshwater and brackish environments. PMID:29250471
Martínez-Aquino, Andrés; Vidal-Martínez, Victor M; Aguirre-Macedo, M Leopoldina
2017-01-01
The phylogenetic position of three taxa from two trematode genera, belonging to the subfamily Acanthostominae (Opisthorchioidea: Cryptogonimidae), were analysed using partial 28S ribosomal DNA (Domains 1-2) and internal transcribed spacers (ITS1-5.8S-ITS2). Bayesian inference and Maximum likelihood analyses of combined 28S rDNA and ITS1 + 5.8S + ITS2 sequences indicated the monophyly of the genus Acanthostomum ( A. cf. americanum and A. burminis ) and paraphyly of the Acanthostominae . These phylogenetic relationships were consistent in analyses of 28S alone and concatenated 28S + ITS1 + 5.8S + ITS2 sequences analyses. Based on molecular phylogenetic analyses, the subfamily Acanthostominae is therefore a paraphyletic taxon, in contrast with previous classifications based on morphological data. Phylogenetic patterns of host specificity inferred from adult stages of other cryptogonimid taxa are also well supported. However, analyses using additional genera and species are necessary to support the phylogenetic inferences from this study. Our molecular phylogenetic reconstruction linked two larval stages of A. cf. americanum cercariae and metacercariae. Here, we present the evolutionary and ecological implications of parasitic infections in freshwater and brackish environments.
Computing all hybridization networks for multiple binary phylogenetic input trees.
Albrecht, Benjamin
2015-07-30
The computation of phylogenetic trees on the same set of species that are based on different orthologous genes can lead to incongruent trees. One possible explanation for this behavior are interspecific hybridization events recombining genes of different species. An important approach to analyze such events is the computation of hybridization networks. This work presents the first algorithm computing the hybridization number as well as a set of representative hybridization networks for multiple binary phylogenetic input trees on the same set of taxa. To improve its practical runtime, we show how this algorithm can be parallelized. Moreover, we demonstrate the efficiency of the software Hybroscale, containing an implementation of our algorithm, by comparing it to PIRNv2.0, which is so far the best available software computing the exact hybridization number for multiple binary phylogenetic trees on the same set of taxa. The algorithm is part of the software Hybroscale, which was developed specifically for the investigation of hybridization networks including their computation and visualization. Hybroscale is freely available(1) and runs on all three major operating systems. Our simulation study indicates that our approach is on average 100 times faster than PIRNv2.0. Moreover, we show how Hybroscale improves the interpretation of the reported hybridization networks by adding certain features to its graphical representation.
Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D
2015-02-01
The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.
The bacterial composition of chlorinated drinking water was analyzed using 16S rRNA gene clone libraries derived from DNA extracts of 12 samples and compared to clone libraries previously generated using RNA extracts from the same samples. Phylogenetic analysis of 761 DNA-based ...
USDA-ARS?s Scientific Manuscript database
An extensive phylogenetic analysis and genus-level taxonomic revision of Paranoplocephala Lühe, 1910 -like cestodes (Cyclophyllidea, Anoplocephalidae) are presented. The phylogenetic analysis is based on DNA sequences of two partial mitochondrial genes, i.e. cytochrome c oxidase subunit 1 (cox1) and...
Phylogenetic comparative methods on phylogenetic networks with reticulations.
Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile
2018-04-25
The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. All the methods developed here are available in the Julia package PhyloNetworks.
Callejón, Rocío; Robles, María Del Rosario; Panei, Carlos Javier; Cutillas, Cristina
2016-08-01
A molecular phylogenetic hypothesis is presented for the genus Trichuris based on sequence data from mitochondrial cytochrome c oxidase 1 (cox1) and cytochrome b (cob). The taxa consisted of nine populations of whipworm from five species of Sigmodontinae rodents from Argentina. Bayesian Inference, Maximum Parsimony, and Maximum Likelihood methods were used to infer phylogenies for each gene separately but also for the combined mitochondrial data and the combined mitochondrial and nuclear dataset. Phylogenetic results based on cox1 and cob mitochondrial DNA (mtDNA) revealed three clades strongly resolved corresponding to three different species (Trichuris navonae, Trichuris bainae, and Trichuris pardinasi) showing phylogeographic variation, but relationships among Trichuris species were poorly resolved. Phylogenetic reconstruction based on concatenated sequences had greater phylogenetic resolution for delimiting species and populations intra-specific of Trichuris than those based on partitioned genes. Thus, populations of T. bainae and T. pardinasi could be affected by geographical factors and co-divergence parasite-host.
Singh, Dadabhai T; Trehan, Rahul; Schmidt, Bertil; Bretschneider, Timo
2008-01-01
Preparedness for a possible global pandemic caused by viruses such as the highly pathogenic influenza A subtype H5N1 has become a global priority. In particular, it is critical to monitor the appearance of any new emerging subtypes. Comparative phyloinformatics can be used to monitor, analyze, and possibly predict the evolution of viruses. However, in order to utilize the full functionality of available analysis packages for large-scale phyloinformatics studies, a team of computer scientists, biostatisticians and virologists is needed--a requirement which cannot be fulfilled in many cases. Furthermore, the time complexities of many algorithms involved leads to prohibitive runtimes on sequential computer platforms. This has so far hindered the use of comparative phyloinformatics as a commonly applied tool in this area. In this paper the graphical-oriented workflow design system called Quascade and its efficient usage for comparative phyloinformatics are presented. In particular, we focus on how this task can be effectively performed in a distributed computing environment. As a proof of concept, the designed workflows are used for the phylogenetic analysis of neuraminidase of H5N1 isolates (micro level) and influenza viruses (macro level). The results of this paper are hence twofold. Firstly, this paper demonstrates the usefulness of a graphical user interface system to design and execute complex distributed workflows for large-scale phyloinformatics studies of virus genes. Secondly, the analysis of neuraminidase on different levels of complexity provides valuable insights of this virus's tendency for geographical based clustering in the phylogenetic tree and also shows the importance of glycan sites in its molecular evolution. The current study demonstrates the efficiency and utility of workflow systems providing a biologist friendly approach to complex biological dataset analysis using high performance computing. In particular, the utility of the platform Quascade for deploying distributed and parallelized versions of a variety of computationally intensive phylogenetic algorithms has been shown. Secondly, the analysis of the utilized H5N1 neuraminidase datasets at macro and micro levels has clearly indicated a pattern of spatial clustering of the H5N1 viral isolates based on geographical distribution rather than temporal or host range based clustering.
Sun, Bian Na; Wei, Luan Luan; Shen, He Ding; Wu, Hong Xi; Wang, Dong Feng
2016-09-01
We generated complete mitochondrial genome sequences data for 4 genera (Onchidium, Platevindex, Paraoncidium and Peronia) in Onchidiidae to construct a phylogenetic tree in conjunction with other 9 existing data among gastropods. The topology showed that the taxa clustered into two main groups of four species, one of which included Onchidium struma and the Platevindex mortoni, the other Paraoncidium reevesii and Peronia verruculata. The process in Pulmonata from sea to land in accordance with the evolution of respiratory organs from branchial gills to pulmonary cavity has been shown. This will also constitute a framework for phylogeny evolution analysis, systematic classfication of Onchidiidae and other euthyneurans (pulmonates and opisthobranchs).
Tsuji, Masaharu; Yokota, Yuji; Kudoh, Sakae; Hoshino, Tamotsu
2015-06-01
Milk fat curdle is difficult to remove from sewage. In an attempt to identify an appropriate agent for bio-remediation of milk fat curdle, Mrakia strains were collected from the Skarvsnes ice-free area of Antarctica. A total of 27 strains were isolated and tested for their ability to decompose milk fat at temperatures ranging from 4°C to 15°C. All strains could decompose milk fat at 4°C and 10°C. Phylogenetic analysis and comparison of the decomposition ability of milk fat (DAMF) revealed that the DAMF may be useful for predicting the outcome of phylogenetic analysis based on ITS sequences. Copyright © 2015 Elsevier Inc. All rights reserved.
Aging in the natural world: comparative data reveal similar mortality patterns across primates.
Bronikowski, Anne M; Altmann, Jeanne; Brockman, Diane K; Cords, Marina; Fedigan, Linda M; Pusey, Anne; Stoinski, Tara; Morris, William F; Strier, Karen B; Alberts, Susan C
2011-03-11
Human senescence patterns-late onset of mortality increase, slow mortality acceleration, and exceptional longevity-are often described as unique in the animal world. Using an individual-based data set from longitudinal studies of wild populations of seven primate species, we show that contrary to assumptions of human uniqueness, human senescence falls within the primate continuum of aging; the tendency for males to have shorter life spans and higher age-specific mortality than females throughout much of adulthood is a common feature in many, but not all, primates; and the aging profiles of primate species do not reflect phylogenetic position. These findings suggest that mortality patterns in primates are shaped by local selective forces rather than phylogenetic history.
Shapcott, Alison; Forster, Paul I.; Guymer, Gordon P.; McDonald, William J. F.; Faith, Daniel P.; Erickson, David; Kress, W. John
2015-01-01
Australian rainforests have been fragmented due to past climatic changes and more recently landscape change as a result of clearing for agriculture and urban spread. The subtropical rainforests of South Eastern Queensland are significantly more fragmented than the tropical World Heritage listed northern rainforests and are subject to much greater human population pressures. The Australian rainforest flora is relatively taxonomically rich at the family level, but less so at the species level. Current methods to assess biodiversity based on species numbers fail to adequately capture this richness at higher taxonomic levels. We developed a DNA barcode library for the SE Queensland rainforest flora to support a methodology for biodiversity assessment that incorporates both taxonomic diversity and phylogenetic relationships. We placed our SE Queensland phylogeny based on a three marker DNA barcode within a larger international rainforest barcode library and used this to calculate phylogenetic diversity (PD). We compared phylo- diversity measures, species composition and richness and ecosystem diversity of the SE Queensland rainforest estate to identify which bio subregions contain the greatest rainforest biodiversity, subregion relationships and their level of protection. We identified areas of highest conservation priority. Diversity was not correlated with rainforest area in SE Queensland subregions but PD was correlated with both the percent of the subregion occupied by rainforest and the diversity of regional ecosystems (RE) present. The patterns of species diversity and phylogenetic diversity suggest a strong influence of historical biogeography. Some subregions contain significantly more PD than expected by chance, consistent with the concept of refugia, while others were significantly phylogenetically clustered, consistent with recent range expansions. PMID:25803607
Shapcott, Alison; Forster, Paul I; Guymer, Gordon P; McDonald, William J F; Faith, Daniel P; Erickson, David; Kress, W John
2015-01-01
Australian rainforests have been fragmented due to past climatic changes and more recently landscape change as a result of clearing for agriculture and urban spread. The subtropical rainforests of South Eastern Queensland are significantly more fragmented than the tropical World Heritage listed northern rainforests and are subject to much greater human population pressures. The Australian rainforest flora is relatively taxonomically rich at the family level, but less so at the species level. Current methods to assess biodiversity based on species numbers fail to adequately capture this richness at higher taxonomic levels. We developed a DNA barcode library for the SE Queensland rainforest flora to support a methodology for biodiversity assessment that incorporates both taxonomic diversity and phylogenetic relationships. We placed our SE Queensland phylogeny based on a three marker DNA barcode within a larger international rainforest barcode library and used this to calculate phylogenetic diversity (PD). We compared phylo- diversity measures, species composition and richness and ecosystem diversity of the SE Queensland rainforest estate to identify which bio subregions contain the greatest rainforest biodiversity, subregion relationships and their level of protection. We identified areas of highest conservation priority. Diversity was not correlated with rainforest area in SE Queensland subregions but PD was correlated with both the percent of the subregion occupied by rainforest and the diversity of regional ecosystems (RE) present. The patterns of species diversity and phylogenetic diversity suggest a strong influence of historical biogeography. Some subregions contain significantly more PD than expected by chance, consistent with the concept of refugia, while others were significantly phylogenetically clustered, consistent with recent range expansions.
Dutra Vieira, Thainá; Pegoraro de Macedo, Marcia Raquel; Fedatto Bernardon, Fabiana; Müller, Gertrud
2017-10-01
The nematode Diplotriaena bargusinica is a bird air sac parasite, and its taxonomy is based mainly on morphological and morphometric characteristics. Increasing knowledge of genetic information variability has spurred the use of DNA markers in conjunction with morphological data for inferring phylogenetic relationships in different taxa. Considering the potential of molecular biology in taxonomy, this study presents the morphological and molecular characterization of D. bargusinica, and establishes the phylogenetic position of the nematode in Spirurina. Twenty partial sequences of the 18S region of D. bargusinica rDNA were generated. Phylogenetic trees were obtained through the Maximum Likelihood and Bayesian Inference methods where both had similar topology. The group Diplotriaenoidea is monophyletic and the topologies generated corroborate the phylogenetic studies based on traditional and previously performed molecular taxonomy. This study is the first to generate molecular data associated with the morphology of the species. Copyright © 2017 Elsevier B.V. All rights reserved.
Contextualising primate origins--an ecomorphological framework.
Soligo, Christophe; Smaers, Jeroen B
2016-04-01
Ecomorphology - the characterisation of the adaptive relationship between an organism's morphology and its ecological role - has long been central to theories of the origin and early evolution of the primate order. This is exemplified by two of the most influential theories of primate origins: Matt Cartmill's Visual Predation Hypothesis, and Bob Sussman's Angiosperm Co-Evolution Hypothesis. However, the study of primate origins is constrained by the absence of data directly documenting the events under investigation, and has to rely instead on a fragmentary fossil record and the methodological assumptions inherent in phylogenetic comparative analyses of extant species. These constraints introduce particular challenges for inferring the ecomorphology of primate origins, as morphology and environmental context must first be inferred before the relationship between the two can be considered. Fossils can be integrated in comparative analyses and observations of extant model species and laboratory experiments of form-function relationships are critical for the functional interpretation of the morphology of extinct species. Recent developments have led to important advancements, including phylogenetic comparative methods based on more realistic models of evolution, and improved methods for the inference of clade divergence times, as well as an improved fossil record. This contribution will review current perspectives on the origin and early evolution of primates, paying particular attention to their phylogenetic (including cladistic relationships and character evolution) and environmental (including chronology, geography, and physical environments) contextualisation, before attempting an up-to-date ecomorphological synthesis of primate origins. © 2016 Anatomical Society.
Mira, Alex; Pushker, Ravindra; Legault, Boris A; Moreira, David; Rodríguez-Valera, Francisco
2004-01-01
Background The phylogenetic position and evolutionary relationships of Fusobacteria remain uncertain. Especially intriguing is their relatedness to low G+C Gram positive bacteria (Firmicutes) by ribosomal molecular phylogenies, but their possession of a typical gram negative outer membrane. Taking advantage of the recent completion of the Fusobacterium nucleatum genome sequence we have examined the evolutionary relationships of Fusobacterium genes by phylogenetic analysis and comparative genomics tools. Results The data indicate that Fusobacterium has a core genome of a very different nature to other bacterial lineages, and branches out at the base of Firmicutes. However, depending on the method used, 35–56% of Fusobacterium genes appear to have a xenologous origin from bacteroidetes, proteobacteria, spirochaetes and the Firmicutes themselves. A high number of hypothetical ORFs with unusual codon usage and short lengths were found and hypothesized to be remnants of transferred genes that were discarded. Some proteins and operons are also hypothesized to be of mixed ancestry. A large portion of the Gram-negative cell wall-related genes seems to have been transferred from proteobacteria. Conclusions Many instances of similarity to other inhabitants of the dental plaque that have been sequenced were found. This suggests that the close physical contact found in this environment might facilitate horizontal gene transfer, supporting the idea of niche-specific gene pools. We hypothesize that at a point in time, probably associated to the rise of mammals, a strong selective pressure might have existed for a cell with a Clostridia-like metabolic apparatus but with the adhesive and immune camouflage features of Proteobacteria. PMID:15566569
Moro, Marcelo Freire; Silva, Igor Aurélio; de Araújo, Francisca Soares; Nic Lughadha, Eimear; Meagher, Thomas R.; Martins, Fernando Roberto
2015-01-01
Seasonally dry tropical plant formations (SDTF) are likely to exhibit phylogenetic clustering owing to niche conservatism driven by a strong environmental filter (water stress), but heterogeneous edaphic environments and life histories may result in heterogeneity in degree of phylogenetic clustering. We investigated phylogenetic patterns across ecological gradients related to water availability (edaphic environment and climate) in the Caatinga, a SDTF in Brazil. Caatinga is characterized by semiarid climate and three distinct edaphic environments – sedimentary, crystalline, and inselberg –representing a decreasing gradient in soil water availability. We used two measures of phylogenetic diversity: Net Relatedness Index based on the entire phylogeny among species present in a site, reflecting long-term diversification; and Nearest Taxon Index based on the tips of the phylogeny, reflecting more recent diversification. We also evaluated woody species in contrast to herbaceous species. The main climatic variable influencing phylogenetic pattern was precipitation in the driest quarter, particularly for herbaceous species, suggesting that environmental filtering related to minimal periods of precipitation is an important driver of Caatinga biodiversity, as one might expect for a SDTF. Woody species tended to show phylogenetic clustering whereas herbaceous species tended towards phylogenetic overdispersion. We also found phylogenetic clustering in two edaphic environments (sedimentary and crystalline) in contrast to phylogenetic overdispersion in the third (inselberg). We conclude that while niche conservatism is evident in phylogenetic clustering in the Caatinga, this is not a universal pattern likely due to heterogeneity in the degree of realized environmental filtering across edaphic environments. Thus, SDTF, in spite of a strong shared environmental filter, are potentially heterogeneous in phylogenetic structuring. Our results support the need for scientifically informed conservation strategies in the Caatinga and other SDTF regions that have not previously been prioritized for conservation in order to take into account this heterogeneity. PMID:25798584
Pineda-Peña, Andrea-Clemencia; Faria, Nuno Rodrigues; Imbrechts, Stijn; Libin, Pieter; Abecasis, Ana Barroso; Deforche, Koen; Gómez-López, Arley; Camacho, Ricardo J; de Oliveira, Tulio; Vandamme, Anne-Mieke
2013-10-01
To investigate differences in pathogenesis, diagnosis and resistance pathways between HIV-1 subtypes, an accurate subtyping tool for large datasets is needed. We aimed to evaluate the performance of automated subtyping tools to classify the different subtypes and circulating recombinant forms using pol, the most sequenced region in clinical practice. We also present the upgraded version 3 of the Rega HIV subtyping tool (REGAv3). HIV-1 pol sequences (PR+RT) for 4674 patients retrieved from the Portuguese HIV Drug Resistance Database, and 1872 pol sequences trimmed from full-length genomes retrieved from the Los Alamos database were classified with statistical-based tools such as COMET, jpHMM and STAR; similarity-based tools such as NCBI and Stanford; and phylogenetic-based tools such as REGA version 2 (REGAv2), REGAv3, and SCUEAL. The performance of these tools, for pol, and for PR and RT separately, was compared in terms of reproducibility, sensitivity and specificity with respect to the gold standard which was manual phylogenetic analysis of the pol region. The sensitivity and specificity for subtypes B and C was more than 96% for seven tools, but was variable for other subtypes such as A, D, F and G. With regard to the most common circulating recombinant forms (CRFs), the sensitivity and specificity for CRF01_AE was ~99% with statistical-based tools, with phylogenetic-based tools and with Stanford, one of the similarity based tools. CRF02_AG was correctly identified for more than 96% by COMET, REGAv3, Stanford and STAR. All the tools reached a specificity of more than 97% for most of the subtypes and the two main CRFs (CRF01_AE and CRF02_AG). Other CRFs were identified only by COMET, REGAv2, REGAv3, and SCUEAL and with variable sensitivity. When analyzing sequences for PR and RT separately, the performance for PR was generally lower and variable between the tools. Similarity and statistical-based tools were 100% reproducible, but this was lower for phylogenetic-based tools such as REGA (~99%) and SCUEAL (~96%). REGAv3 had an improved performance for subtype B and CRF02_AG compared to REGAv2 and is now able to also identify all epidemiologically relevant CRFs. In general the best performing tools, in alphabetical order, were COMET, jpHMM, REGAv3, and SCUEAL when analyzing pure subtypes in the pol region, and COMET and REGAv3 when analyzing most of the CRFs. Based on this study, we recommend to confirm subtyping with 2 well performing tools, and be cautious with the interpretation of short sequences. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Naushad, Sohail; Adeolu, Mobolaji; Goel, Nisha; Khadka, Bijendra; Al-Dahwi, Aqeel; Gupta, Radhey S.
2015-01-01
The genera Actinobacillus, Haemophilus, and Pasteurella exhibit extensive polyphyletic branching in phylogenetic trees and do not represent coherent clusters of species. In this study, we have utilized molecular signatures identified through comparative genomic analyses in conjunction with genome based and multilocus sequence based phylogenetic analyses to clarify the phylogenetic and taxonomic boundary of these genera. We have identified large clusters of Actinobacillus, Haemophilus, and Pasteurella species which represent the “sensu stricto” members of these genera. We have identified 3, 7, and 6 conserved signature indels (CSIs), which are specifically shared by sensu stricto members of Actinobacillus, Haemophilus, and Pasteurella, respectively. We have also identified two different sets of CSIs that are unique characteristics of the pathogen containing genera Aggregatibacter and Mannheimia, respectively. It is now possible to demarcate the genera Actinobacillus sensu stricto, Haemophilus sensu stricto, and Pasteurella sensu stricto on the basis of discrete molecular signatures. The other members of the genera Actinobacillus, Haemophilus, and Pasteurella that do not fall within the “sensu stricto” clades and do not contain these molecular signatures should be reclassified as other genera. The CSIs identified here also provide useful diagnostic targets for the identification of current and novel members of the indicated genera. PMID:25821780
NASA Astrophysics Data System (ADS)
Kantachumpoo, Attachai; Uwai, Shinya; Noiraksar, Thidarat; Komatsu, Teruhisa
2015-06-01
The marine brown algal genus Sargassum has been investigated extensively based on genetic information. In this report, we performed the first comparative study of morphological and molecular data among common species of Sargassum found in Thailand and explored the phylogenetic diversity within the genus. Our results revealed an incongruent pattern for species classification in Thai Sargassum. Morphologically, our Sargassum specimens were distinguishable and represented 8 species, namely, S. aquifolium (Turner) C.Agardh, Sargassum baccularia (Mertens) C. Agardh, S. cinereum J. Agardh, S. ilicifolium (Turner) C.Agardh, S. oligocystum Montagne, S. plagiophyllum C. Agardh, S. polycystum C. Agardh and S. swartzii (Turuner) C. Agardh. In contrast, using three different methods, phylogenetic analysis of nuclear ribosomal internal transcribed spacer 2 (ITS2) revealed six distinct clades, including S. baccularia/ S. oligosyntum clade, S. aquifolium/ S. swartzii clade, S. cinereum clade, S. aquifolium/ S. ilicifolium clade, S. polycystum clade, and S. plagiophyllum clade, which was suggestive of a phenotypic plasticity species complex. Our molecular data also confirmed the paraphyletic relationship in the section Binderianae and suggested that this section requires reassessment. Overall, further studies are required to increase our understanding of the taxonomy, phylogenetic relationships and species boundaries among Sargassum species in Thailand.
Gupta, Payal; Saini, Raman; Dash, Prasanta K
2017-07-01
Phospholipase A 2 (PLA 2 ) belongs to class of lipolytic enzymes (EC 3.1.1.4). Lysophosphatidic acid (LPA) and free fatty acids (FFAs) are the products of PLA 2 catalyzed hydrolysis of phosphoglycerides at sn-2 position. LPA and FFA that act as second mediators involved in the development and maturation of plants and animals. Mining of flax genome identified two phospholipase A 2 encoding genes, viz., LusPLA 2 I and LusPLA 2 II (Linum usitatissimum secretory phospholipase A 2 ). Molecular simulation of LusPLA 2 s with already characterized plant sPLA 2 s revealed the presence of conserved motifs and signature domains necessary to classify them as secretory phospholipase A 2 . Phylogenetic analysis of flax sPLA 2 with representative sPLA 2 s from other organisms revealed that they evolved rapidly via gene duplication/deletion events and shares a common ancestor. Our study is the first report of detailed phylogenetic analysis for secretory phospholipase A 2 in flax. Comparative genomic analysis of two LusPLA 2 s with earlier reported plant sPLA 2 s, based on their gene architectures, sequence similarities, and domain structures are presented elucidating the uniqueness of flax sPLA 2 .
Picante: R tools for integrating phylogenies and ecology.
Kembel, Steven W; Cowan, Peter D; Helmus, Matthew R; Cornwell, William K; Morlon, Helene; Ackerly, David D; Blomberg, Simon P; Webb, Campbell O
2010-06-01
Picante is a software package that provides a comprehensive set of tools for analyzing the phylogenetic and trait diversity of ecological communities. The package calculates phylogenetic diversity metrics, performs trait comparative analyses, manipulates phenotypic and phylogenetic data, and performs tests for phylogenetic signal in trait distributions, community structure and species interactions. Picante is a package for the R statistical language and environment written in R and C, released under a GPL v2 open-source license, and freely available on the web (http://picante.r-forge.r-project.org) and from CRAN (http://cran.r-project.org).
Comparing Phylogenetic Trees by Matching Nodes Using the Transfer Distance Between Partitions
Giaro, Krzysztof
2017-01-01
Abstract Ability to quantify dissimilarity of different phylogenetic trees describing the relationship between the same group of taxa is required in various types of phylogenetic studies. For example, such metrics are used to assess the quality of phylogeny construction methods, to define optimization criteria in supertree building algorithms, or to find horizontal gene transfer (HGT) events. Among the set of metrics described so far in the literature, the most commonly used seems to be the Robinson–Foulds distance. In this article, we define a new metric for rooted trees—the Matching Pair (MP) distance. The MP metric uses the concept of the minimum-weight perfect matching in a complete bipartite graph constructed from partitions of all pairs of leaves of the compared phylogenetic trees. We analyze the properties of the MP metric and present computational experiments showing its potential applicability in tasks related to finding the HGT events. PMID:28177699
Comparing Phylogenetic Trees by Matching Nodes Using the Transfer Distance Between Partitions.
Bogdanowicz, Damian; Giaro, Krzysztof
2017-05-01
Ability to quantify dissimilarity of different phylogenetic trees describing the relationship between the same group of taxa is required in various types of phylogenetic studies. For example, such metrics are used to assess the quality of phylogeny construction methods, to define optimization criteria in supertree building algorithms, or to find horizontal gene transfer (HGT) events. Among the set of metrics described so far in the literature, the most commonly used seems to be the Robinson-Foulds distance. In this article, we define a new metric for rooted trees-the Matching Pair (MP) distance. The MP metric uses the concept of the minimum-weight perfect matching in a complete bipartite graph constructed from partitions of all pairs of leaves of the compared phylogenetic trees. We analyze the properties of the MP metric and present computational experiments showing its potential applicability in tasks related to finding the HGT events.
Molecular Phylogenetics: Concepts for a Newcomer.
Ajawatanawong, Pravech
Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.
Phylogenetic inference under varying proportions of indel-induced alignment gaps
Dwivedi, Bhakti; Gadagkar, Sudhindra R
2009-01-01
Background The effect of alignment gaps on phylogenetic accuracy has been the subject of numerous studies. In this study, we investigated the relationship between the total number of gapped sites and phylogenetic accuracy, when the gaps were introduced (by means of computer simulation) to reflect indel (insertion/deletion) events during the evolution of DNA sequences. The resulting (true) alignments were subjected to commonly used gap treatment and phylogenetic inference methods. Results (1) In general, there was a strong – almost deterministic – relationship between the amount of gap in the data and the level of phylogenetic accuracy when the alignments were very "gappy", (2) gaps resulting from deletions (as opposed to insertions) contributed more to the inaccuracy of phylogenetic inference, (3) the probabilistic methods (Bayesian, PhyML & "MLε, " a method implemented in DNAML in PHYLIP) performed better at most levels of gap percentage when compared to parsimony (MP) and distance (NJ) methods, with Bayesian analysis being clearly the best, (4) methods that treat gapped sites as missing data yielded less accurate trees when compared to those that attribute phylogenetic signal to the gapped sites (by coding them as binary character data – presence/absence, or as in the MLε method), and (5) in general, the accuracy of phylogenetic inference depended upon the amount of available data when the gaps resulted from mainly deletion events, and the amount of missing data when insertion events were equally likely to have caused the alignment gaps. Conclusion When gaps in an alignment are a consequence of indel events in the evolution of the sequences, the accuracy of phylogenetic analysis is likely to improve if: (1) alignment gaps are categorized as arising from insertion events or deletion events and then treated separately in the analysis, (2) the evolutionary signal provided by indels is harnessed in the phylogenetic analysis, and (3) methods that utilize the phylogenetic signal in indels are developed for distance methods too. When the true homology is known and the amount of gaps is 20 percent of the alignment length or less, the methods used in this study are likely to yield trees with 90–100 percent accuracy. PMID:19698168
Harrison, Luke B; Larsson, Hans C E
2015-03-01
Likelihood-based methods are commonplace in phylogenetic systematics. Although much effort has been directed toward likelihood-based models for molecular data, comparatively less work has addressed models for discrete morphological character (DMC) data. Among-character rate variation (ACRV) may confound phylogenetic analysis, but there have been few analyses of the magnitude and distribution of rate heterogeneity among DMCs. Using 76 data sets covering a range of plants, invertebrate, and vertebrate animals, we used a modified version of MrBayes to test equal, gamma-distributed and lognormally distributed models of ACRV, integrating across phylogenetic uncertainty using Bayesian model selection. We found that in approximately 80% of data sets, unequal-rates models outperformed equal-rates models, especially among larger data sets. Moreover, although most data sets were equivocal, more data sets favored the lognormal rate distribution relative to the gamma rate distribution, lending some support for more complex character correlations than in molecular data. Parsimony estimation of the underlying rate distributions in several data sets suggests that the lognormal distribution is preferred when there are many slowly evolving characters and fewer quickly evolving characters. The commonly adopted four rate category discrete approximation used for molecular data was found to be sufficient to approximate a gamma rate distribution with discrete characters. However, among the two data sets tested that favored a lognormal rate distribution, the continuous distribution was better approximated with at least eight discrete rate categories. Although the effect of rate model on the estimation of topology was difficult to assess across all data sets, it appeared relatively minor between the unequal-rates models for the one data set examined carefully. As in molecular analyses, we argue that researchers should test and adopt the most appropriate model of rate variation for the data set in question. As discrete characters are increasingly used in more sophisticated likelihood-based phylogenetic analyses, it is important that these studies be built on the most appropriate and carefully selected underlying models of evolution. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Adams, Dean C
2014-09-01
Phylogenetic signal is the tendency for closely related species to display similar trait values due to their common ancestry. Several methods have been developed for quantifying phylogenetic signal in univariate traits and for sets of traits treated simultaneously, and the statistical properties of these approaches have been extensively studied. However, methods for assessing phylogenetic signal in high-dimensional multivariate traits like shape are less well developed, and their statistical performance is not well characterized. In this article, I describe a generalization of the K statistic of Blomberg et al. that is useful for quantifying and evaluating phylogenetic signal in highly dimensional multivariate data. The method (K(mult)) is found from the equivalency between statistical methods based on covariance matrices and those based on distance matrices. Using computer simulations based on Brownian motion, I demonstrate that the expected value of K(mult) remains at 1.0 as trait variation among species is increased or decreased, and as the number of trait dimensions is increased. By contrast, estimates of phylogenetic signal found with a squared-change parsimony procedure for multivariate data change with increasing trait variation among species and with increasing numbers of trait dimensions, confounding biological interpretations. I also evaluate the statistical performance of hypothesis testing procedures based on K(mult) and find that the method displays appropriate Type I error and high statistical power for detecting phylogenetic signal in high-dimensional data. Statistical properties of K(mult) were consistent for simulations using bifurcating and random phylogenies, for simulations using different numbers of species, for simulations that varied the number of trait dimensions, and for different underlying models of trait covariance structure. Overall these findings demonstrate that K(mult) provides a useful means of evaluating phylogenetic signal in high-dimensional multivariate traits. Finally, I illustrate the utility of the new approach by evaluating the strength of phylogenetic signal for head shape in a lineage of Plethodon salamanders. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.
Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju
2015-01-01
Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.
The phylogenetic relationship of Alexandrium monilatum to other Alexandrium spp. was explored using 18S rDNA sequences. Maximum likelilhood phylogenetic analysis of the combined rDNA sequences established that A. monilatum paired with Alexandrium taylori and that the pair was the...
The phylogenetic relationship of Alexandrium monilatum to other Alexandrium spp. was explored using 18S rDNA sequences. Maximum likelihood phylogenetic analysis of the combined rDNA sequences established that A. monilatum paired with Alexandrium taylori and that the pair was the ...
A phylogenetically-based nomenclature for Cordycipitaceae (Hypocreales)
USDA-ARS?s Scientific Manuscript database
Changes in Article 59 of the International Code of Nomenclature for algae, fungi, and plants (ICN) disallow the use of dual nomenclatural systems for fungi. This change requires the reconciliation of competing names, ideally linked through culture based or molecular methods. The phylogenetic syste...
Cardoso Oelemann, Maranibia; Gomes, Harrison M.; Willery, Eve; Possuelo, Lia; Batista Lima, Karla Valéria; Allix-Béguec, Caroline; Locht, Camille; Goguet de la Salmonière, Yves-Olivier L.; Gutierrez, Maria Cristina; Suffys, Philip; Supply, Philip
2011-01-01
Background Genotyping of Mycobacterium tuberculosis isolates is a powerful tool for epidemiological control of tuberculosis (TB) and phylogenetic exploration of the pathogen. Standardized PCR-based typing, based on 15 to 24 mycobacterial interspersed repetitive unit-variable number of tandem repeat (MIRU-VNTR) loci combined with spoligotyping, has been shown to have adequate resolution power for tracing TB transmission and to be useful for predicting diverse strain lineages in European settings. Its informative value needs to be tested in high TB-burden countries, where the use of genotyping is often complicated by dominance of geographically specific, genetically homogeneous strain lineages. Methodology/Principal Findings We tested this genotyping system for molecular epidemiological analysis of 369 M. tuberculosis isolates from 3 regions of Brazil, a high TB-burden country. Deligotyping, targeting 43 large sequence polymorphisms (LSPs), and the MIRU-VNTRplus identification database were used to assess phylogenetic predictions. High congruence between the different typing results consistently revealed the countrywide supremacy of the Latin-American-Mediterranean (LAM) lineage, comprised of three main branches. In addition to an already known RDRio branch, at least one other branch characterized by a phylogenetically informative LAM3 spoligo-signature seems to be globally distributed beyond Brazil. Nevertheless, by distinguishing 321 genotypes in this strain population, combined MIRU-VNTR typing and spoligotyping demonstrated the presence of multiple distinct clones. The use of 15 to 24 loci discriminated 21 to 25% more strains within the LAM lineage, compared to a restricted lineage-specific locus set suggested to be used after SNP analysis. Noteworthy, 23 of the 28 molecular clusters identified were exclusively composed of patient isolates from a same region, consistent with expected patterns of mostly local TB transmission. Conclusions/Significance Standard MIRU-VNTR typing combined with spoligotyping can reveal epidemiologically meaningful clonal diversity behind a dominant M. tuberculosis strain lineage in a high TB-burden country and is useful to explore international phylogenetical ramifications. PMID:21464915
Verdú, Miguel; Traveset, Anna
2004-02-01
Most studies using meta-analysis try to establish relationships between traits across taxa from interspecific databases and, thus, the phylogenetic relatedness among these taxa should be taken into account to avoid pseudoreplication derived from common ancestry. This paper illustrates, with a representative example of the relationship between seed size and the effect of frugivore's gut on seed germination, that meta-analytic procedures can also be phylogenetically corrected by means of the comparative method. The conclusions obtained in the meta-analytical and phylogenetical approaches are very different. The meta-analysis revealed that the positive effects that gut passage had on seed germination increased with seed size in the case of gut passage through birds whereas decreased in the case of gut passage through non-flying mammals. However, once the phylogenetic relatedness among plant species was taken into account, the effects of gut passage on seed germination did not depend on seed size and were similar between birds and non-flying mammals. Some methodological considerations are given to improve the bridge between the meta-analysis and the comparative method.
NASA Astrophysics Data System (ADS)
Eder, Wolfgang; Ives Torres-Silva, Ana; Hohenegger, Johann
2017-04-01
Phylogenetic analysis and trees based on molecular data are broadly applied and used to infer genetical and biogeographic relationship in recent larger foraminifera. Molecular phylogenetic is intensively used within recent nummulitids, however for fossil representatives these trees are only of minor informational value. Hence, within paleontological studies a phylogenetic approach through morphometric analysis is of much higher value. To tackle phylogenetic relationships within the nummulitid family, a much higher number of morphological character must be measured than are commonly used in biometric studies, where mostly parameters describing embryonic size (e.g., proloculus diameter, deuteroloculus diameter) and/or the marginal spiral (e.g., spiral diagrams, spiral indices) are studied. For this purpose 11 growth-independent and/or growth-invariant characters have been used to describe the morphological variability of equatorial thin sections of seven Carribbean nummulitid taxa (Nummulites striatoreticulatus, N. macgillavry, Palaeonummulites willcoxi, P.floridensis, P. soldadensis, P.trinitatensis and P.ocalanus) and one outgroup taxon (Ranikothalia bermudezi). Using these characters, phylogenetic trees were calculated using a restricted maximum likelihood algorithm (REML), and results are cross-checked by ordination and cluster analysis. Square-change parsimony method has been run to reconstruct ancestral states, as well as to simulate the evolution of the chosen characters along the calculated phylogenetic tree and, independent - contrast analysis was used to estimate confidence intervals. Based on these simulations, phylogenetic tendencies of certain characters proposed for nummulitids (e.g., Cope's rule or nepionic acceleration) can be tested, whether these tendencies are valid for the whole family or only for certain clades. At least, within the Carribean nummulitids, phylogenetic trends along some growth-independent characters of the embryo (e.g., first chamber length and P/D ratio) and some growth-invariant characters of the chamber sequence (e.g., backbend angle, initial chamber base length and chamber length increase) are evident.
van Keulen, H; Gutell, R R; Gates, M A; Campbell, S R; Erlandsen, S L; Jarroll, E L; Kulda, J; Meyer, E A
1993-01-01
Complete small-subunit rRNA (SSU-rRNA) coding region sequences were determined for two species of the intestinal parasite Giardia: G. ardeae and G. muris, both belonging to the order Diplomonadida, and a free-living member of this order, Hexamita sp. These sequences were compared to published SSU-rDNA sequences from a third member of the genus Giardia, G. duodenalis (often called G. intestinalis or G. lamblia) and various representative organisms from other taxa. Of the three Giardia sequences analyzed, the SSU-rRNA from G. muris is the smallest (1432 bases as compared to 1435 and 1453 for G. ardeae and G. duodenalis, respectively) and has the lowest G+C content (58.9%). The Hexamita SSU-rRNA is the largest in this group, containing 1550 bases. Because the sizes of the SSU-rRNA are prokaryotic rather than typically eukaryotic, the secondary structures of the SSU-rRNAs were constructed. These structures show a number of typically eukaryotic signature sequences. Sequence alignments based on constraints imposed by secondary structure were used for construction of a phylogenetic tree for these four taxa. The results show that of the four diplomonads represented, the Giardia species form a distinct group. The other diplomonad Hexamita and the microsporidium Vairimorpha necatrix appear to be distinct from Giardia.
Thuillard, Marc; Fraix-Burnet, Didier
2015-01-01
This article presents an innovative approach to phylogenies based on the reduction of multistate characters to binary-state characters. We show that the reduction to binary characters' approach can be applied to both character- and distance-based phylogenies and provides a unifying framework to explain simply and intuitively the similarities and differences between distance- and character-based phylogenies. Building on these results, this article gives a possible explanation on why phylogenetic trees obtained from a distance matrix or a set of characters are often quite reasonable despite lateral transfers of genetic material between taxa. In the presence of lateral transfers, outer planar networks furnish a better description of evolution than phylogenetic trees. We present a polynomial-time reconstruction algorithm for perfect outer planar networks with a fixed number of states, characters, and lateral transfers.
An Improved Binary Differential Evolution Algorithm to Infer Tumor Phylogenetic Trees.
Liang, Ying; Liao, Bo; Zhu, Wen
2017-01-01
Tumourigenesis is a mutation accumulation process, which is likely to start with a mutated founder cell. The evolutionary nature of tumor development makes phylogenetic models suitable for inferring tumor evolution through genetic variation data. Copy number variation (CNV) is the major genetic marker of the genome with more genes, disease loci, and functional elements involved. Fluorescence in situ hybridization (FISH) accurately measures multiple gene copy number of hundreds of single cells. We propose an improved binary differential evolution algorithm, BDEP, to infer tumor phylogenetic tree based on FISH platform. The topology analysis of tumor progression tree shows that the pathway of tumor subcell expansion varies greatly during different stages of tumor formation. And the classification experiment shows that tree-based features are better than data-based features in distinguishing tumor. The constructed phylogenetic trees have great performance in characterizing tumor development process, which outperforms other similar algorithms.
Schwenk, K; Posada, D; Hebert, P D
2000-01-01
We examined phylogenetic relationships among Daphnia using mitochondrial DNA (mtDNA) sequences from the small subunit ribosomal RNA (12S), cytochrome c oxidase subunit I and nuclear DNA sequences from the first and second internal transcribed spacer representing 1612 base positions. Phylogenetic analyses using several species of the three main Daphnia subgenera, Ctenodaphnia, Hyalodaphnia and Daphnia, revealed that the Hyalodaphnia are a monophyletic sister group of the Daphnia. Most Hyalodaphnia species occur on one continent, whereas only three are found in North America and Europe. Endemicity of species is associated with variation in thermal tolerance and habitat differentiation. Although many species of the Hyalodaphnia are known to hybridize in nature, mtDNA divergence is relatively high ca. 9%) compared to other hybridizing arthropods (ca. 3%). Reproductive isolation in Daphnia seems to evolve significantly slower than genetic isolation. We related these findings to what is known about the ecology and genetics of Daphnia in order to better understand the evolutionary diversification of lineages. The relationship of these data to phylogenetic patterns is discussed in the context of speciation processes in Daphnia. PMID:11052533
Sharifiyazdi, Hassan; Nazifi, Saeed; Shirzad Aski, Hesamaddin; Shayegh, Hossein
2014-09-01
Hemoplasmas are the trivial name for a group of erythrocyte-parasitizing bacteria of the genus Mycoplasma. This study is the first report of hemoplasma infection in Small Indian Mongoose (Herpestes Javanicus) based on molecular analysis of 16S rDNA. Whole blood samples were collected by sterile methods, from 14 live captured mongooses, in the south of Iran. Candidatus Mycoplasma turicensis (CMt)-like hemoplasma was detected in blood samples from one animal tested. BLAST search and phylogenetic analysis of partial 16S rDNA sequence (933bp) of the hemoplasma from Small Indian mongoose (KJ530704) revealed only 96-97% identity to the previously described CMt followed by 95% and 91% similarity with Mycoplasma coccoides and Mycoplasma haemomuris, respectively. Accordingly, the Iranian mongoose CMt isolate showed a high intra-specific genetic variation compared to all previously reported CMt strains in GenBank. Further molecular studies using multiple phylogenetic markers are required to characterize the exact species of Mongoose-derived hemoplasma. Copyright © 2014 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.
2003-06-01
OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally importantmore » for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.« less
Legionella busanensis sp. nov., isolated from cooling tower water in Korea.
Park, Mi-Yeoun; Ko, Kwan Soo; Lee, Hae Kyung; Park, Man-Suk; Kook, Yoon-Hoh
2003-01-01
Three Legionella-like micro-organisms, isolated from cooling tower water of a building in Busan, Korea, were characterized by a variety of biochemical and molecular phylogenetic tests. Analyses of whole-cell fatty acids and results of biochemical tests revealed that these three isolates are distinct from previously described Legionella species. Furthermore, results of comparative analyses of 16S rDNA (1476-1488 bp), mip (408 bp) and rpoB (300 bp) sequences also confirmed that these strains represent a novel species within the genus Legionella. The 16S rDNA sequences of the three Korean isolates had similarities of less than 95.8% to other Legionella species. Phylogenetic trees formed by analysis of the 16S rRNA, rpoB and mip genes revealed that the isolates formed a distinct cluster within the genus Legionella. Based on the evaluated phenotypic and phylogenetic characteristics, it is proposed that these Korean isolates from water be classified as a novel species, Legionella busanensis sp. nov.; the type strain is strain K9951T (=KCTC 12084T =ATCC BAA-518T).
Phylogeny of the Genus Flavivirus
Kuno, Goro; Chang, Gwong-Jen J.; Tsuchiya, K. Richard; Karabatsos, Nick; Cropp, C. Bruce
1998-01-01
We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses. PMID:9420202
Phylogeny of the genus Flavivirus.
Kuno, G; Chang, G J; Tsuchiya, K R; Karabatsos, N; Cropp, C B
1998-01-01
We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses.
Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang
2011-06-01
Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. DNA partial sequence of electron transport chains (ETCs) protein genes were obtained in order to understand the components of ETC and genomic-specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. DNA partial sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb (3)-type terminal oxidase and cytochrome caa (3)-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.
Allometry of sexual size dimorphism in turtles: a comparison of mass and length data.
Regis, Koy W; Meik, Jesse M
2017-01-01
The macroevolutionary pattern of Rensch's Rule (positive allometry of sexual size dimorphism) has had mixed support in turtles. Using the largest carapace length dataset and only large-scale body mass dataset assembled for this group, we determine (a) whether turtles conform to Rensch's Rule at the order, suborder, and family levels, and (b) whether inferences regarding allometry of sexual size dimorphism differ based on choice of body size metric used for analyses. We compiled databases of mean body mass and carapace length for males and females for as many populations and species of turtles as possible. We then determined scaling relationships between males and females for average body mass and straight carapace length using traditional and phylogenetic comparative methods. We also used regression analyses to evalutate sex-specific differences in the variance explained by carapace length on body mass. Using traditional (non-phylogenetic) analyses, body mass supports Rensch's Rule, whereas straight carapace length supports isometry. Using phylogenetic independent contrasts, both body mass and straight carapace length support Rensch's Rule with strong congruence between metrics. At the family level, support for Rensch's Rule is more frequent when mass is used and in phylogenetic comparative analyses. Turtles do not differ in slopes of sex-specific mass-to-length regressions and more variance in body size within each sex is explained by mass than by carapace length. Turtles display Rensch's Rule overall and within families of Cryptodires, but not within Pleurodire families. Mass and length are strongly congruent with respect to Rensch's Rule across turtles, and discrepancies are observed mostly at the family level (the level where Rensch's Rule is most often evaluated). At macroevolutionary scales, the purported advantages of length measurements over weight are not supported in turtles.
Guo, Zhong-Long; Wang, Juan; Shen, Yu-Ying
2015-01-01
Insect mitochondrial genome (mitogenome) are the most extensively used genetic information for molecular evolution, phylogenetics and population genetics. Pentatomomorpha (>14,000 species) is the second largest infraorder of Heteroptera and of great economic importance. To better understand the diversity and phylogeny within Pentatomomorpha, we sequenced and annotated the complete mitogenome of Corizus tetraspilus (Hemiptera: Rhopalidae), an important pest of alfalfa in China. We analyzed the main features of the C. tetraspilus mitogenome, and provided a comparative analysis with four other Coreoidea species. Our results reveal that gene content, gene arrangement, nucleotide composition, codon usage, rRNA structures and sequences of mitochondrial transcription termination factor are conserved in Coreoidea. Comparative analysis shows that different protein-coding genes have been subject to different evolutionary rates correlated with the G+C content. All the transfer RNA genes found in Coreoidea have the typical clover leaf secondary structure, except for trnS1 (AGN) which lacks the dihydrouridine (DHU) arm and possesses a unusual anticodon stem (9 bp vs. the normal 5 bp). The control regions (CRs) among Coreoidea are highly variable in size, of which the CR of C. tetraspilus is the smallest (440 bp), making the C. tetraspilus mitogenome the smallest (14,989 bp) within all completely sequenced Coreoidea mitogenomes. No conserved motifs are found in the CRs of Coreoidea. In addition, the A+T content (60.68%) of the CR of C. tetraspilus is much lower than that of the entire mitogenome (74.88%), and is lowest among Coreoidea. Phylogenetic analyses based on mitogenomic data support the monophyly of each superfamily within Pentatomomorpha, and recognize a phylogenetic relationship of (Aradoidea + (Pentatomoidea + (Lygaeoidea + (Pyrrhocoroidea + Coreoidea)))). PMID:26042898
Steinke, Dirk; Salzburger, Walter; Meyer, Axel
2006-06-01
The power of comparative phylogenomic analyses also depends on the amount of data that are included in such studies. We used expressed sequence tags (ESTs) from fish model species as a proof of principle approach in order to test the reliability of using ESTs for phylogenetic inference. As expected, the robustness increases with the amount of sequences. Although some progress has been made in the elucidation of the phylogeny of teleosts, relationships among the main lineages of the derived fish (Euteleostei) remain poorly defined and are still debated. We performed a phylogenomic analysis of a set of 42 of orthologous genes from 10 available fish model systems from seven different orders (Salmoniformes, Siluriformes, Cypriniformes, Tetraodontiformes, Cyprinodontiformes, Beloniformes, and Perciformes) of euteleostean fish to estimate divergence times and evolutionary relationships among those lineages. All 10 fish species serve as models for developmental, aquaculture, genomic, and comparative genetic studies. The phylogenetic signal and the strength of the contribution of each of the 42 orthologous genes were estimated with randomly chosen data subsets. Our study revealed a molecular phylogeny of higher-level relationships of derived teleosts, which indicates that the use of multiple genes produces robust phylogenies, a finding that is expected to apply to other phylogenetic issues among distantly related taxa. Our phylogenomic analyses confirm that the euteleostean superorders Ostariophysi and Acanthopterygii are monophyletic and the Protacanthopterygii and Ostariophysi are sister clades. In addition, and contrary to the traditional phylogenetic hypothesis, our analyses determine that killifish (Cyprinodontiformes), medaka (Beloniformes), and cichlids (Perciformes) appear to be more closely related to each other than either of them is to pufferfish (Tetraodontiformes). All 10 lineages split before or during the fragmentation of the supercontinent Pangea in the Jurassic.
Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty
Baele, Guy; Lemey, Philippe; Suchard, Marc A.
2016-01-01
Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse, but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of “working distributions” to facilitate—or shorten—the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this purpose, we introduce a “working” distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different “working” distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses. PMID:26526428
Do chromosome numbers reflect phylogeny? New counts for Bombacoideae and a review of Malvaceae s.l.
Marinho, Rafaela C; Mendes-Rodrigues, Clesnan; Balao, Francisco; Ortiz, Pedro L; Yamagishi-Costa, Júlia; Bonetti, Ana M; Oliveira, Paulo E
2014-09-01
• Whole genome duplication (WGD) and specific polyploidy events marked turning points for angiosperm genome structure and evolution. Therefore, cytogenetic studies of polyploidy-prone groups such as the tropical Malvaceae and plant formations such as as the Brazilian Cerrado have gained further importance. We present new chromosome counts for Cerrado Bombacoideae and revised chromosome numbers for the Malvaceae s.l., compare these between subfamilies, and relate them to phylogenetic signal.• We studied the chromosome number of Eriotheca candolleana, E. gracilipes, E. pubescens, Pachira glabra, Pseudobombax longiflorum, and P. tomentosum. We also compared Eriotheca species ploidy levels using flow cytometry. We compiled chromosome numbers for 557 species of Malvaceae s.l., including 37 Bombacoideae species. We included this information in a phylogenetic reconstruction based on chloroplast matK-trnK DNA to evaluate chromosome evolution of the Malvaceae s.l. and the Bombacoideae in particular.• The Cerrado Bombacoideae presented consistently high chromosome numbers. Numbers for Eriotheca species were among the highest and varied among populations. Flow cytometry analyses showed similar 1Cx DNA for all cytotypes and indicated neopolyploidy. Chromosome numbers differed between subfamilies, with the lowest numbers in the Malvoideae and Byttnerioideae and the highest in Tilioideae. Chromosome numbers had significant phylogenetic signal for Bombacoideae but not for Malvoideae or Malvaceae s.l.• Clearly distinct chromosome numbers allied to monophyly provide some support for a circumscription of the Bombacoideae and distinction within the Malvaceae. The phylogenetic signal for chromosome number supports the idea of an ancient WGD and further neopolyploidy events as important evolutionary trends for the Bombacoideae. © 2014 Botanical Society of America, Inc.
Schluessel, Vera; Bennett, Michael B; Bleckmann, Horst; Blomberg, Simon; Collin, Shaun P
2008-11-01
This study investigated the relationship between olfactory morphology, habitat occupancy, and lifestyle in 21 elasmobranch species in a phylogenetic context. Four measures of olfactory capability, that is, the number of olfactory lamellae, the surface area of the olfactory epithelium, the mass of the olfactory bulb, and the mass of the olfactory rosette were compared between individual species and groups, comprised of species with similar habitat and/or lifestyle. Statistical analyses using generalized least squares phylogenetic regression revealed that bentho-pelagic sharks and rays possess significantly more olfactory lamellae and larger sensory epithelial surface areas than benthic species. There was no significant correlation between either olfactory bulb or rosette mass and habitat type. There was also no significant difference between the number of lamellae or the size of the sensory surface area in groups comprised of species with similar diets, that is, groups preying predominantly on crustaceans, cephalopods, echinoderms, polychaetes, molluscs, or teleosts. However, some groups had significantly larger olfactory bulb or rosette masses than others. There was little evidence to support a correlation between phylogeny and morphology, indicating that differences in olfactory capabilities are the result of functional rather than phylogenetic adaptations. All olfactory epithelia exhibited microvilli and cilia, with microvilli in both nonsensory and sensory areas, and cilia only in sensory areas. Cilia over the sensory epithelia originated from supporting cells. In contrast to teleosts, which possess ciliated and microvillous olfactory receptor types, no ciliated olfactory receptor cells were observed. This is the first comprehensive study comparing olfactory morphology to several aspects of elasmobranch ecology in a phylogenetic context.
Resolving the Lophiostoma bipolare complex: Generic delimitations within Lophiostomataceae.
Hashimoto, A; Hirayama, K; Takahashi, H; Matsumura, M; Okada, G; Chen, C Y; Huang, J W; Kakishima, M; Ono, T; Tanaka, K
2018-06-01
Lophiostoma bipolare was taxonomically revised based on the morphological observations and phylogenetic analyses of molecular data from nuclear rDNA SSU-ITS-LSU, TUB , tef1 , and rpb2 genes. Twenty-nine strains were morphologically similar to Lo . bipolare . A total of 174 sequences were generated from the Lo . bipolare complex. Phylogenetic analyses based on TUB sequence revealed 11 distinct species within the Lo. bipolare complex. Morphological features of the ascospores and the anatomical structure of the ascomata from both field collections as well as axenic culture, which have been reported previously as variable features at intraspecific levels, were compared to evaluate the taxonomic reliability of these features. To clarify the generic position of the 11 species, phylogenetic analyses were done on SSU-ITS-LSU- tef1 - rpb2 gene sequences. The Lo . bipolare complex shared phylogenetic relationships with Pseudolophiostoma and Vaginatispora , and formed an additional five distinct clades from other members of Lophiostomataceae . According to its phylogenetic position, Lo. bipolare sensu stricto was distantly related to Lophiostoma s. str., and formed an independent clade within Lophiostomataceae. Lophiostoma bipolare s. str. could be distinguished from the other lophiostomataceous genera by the clypeus around the ostiolar neck and by the thin and uniformly thick peridium. A novel genus described as Lentistoma was established to accommodate this species, and the epitypification of Lentistoma bipolare (basionym: Massarina bipolaris ) was proposed. Other lineages of the Lo. bipolare complex could not be separated on the basis of the ascospore size and sheath variations, but were distinguished based on ascomatal features, such as the existence of the clypeus, brown hyphae surrounding the peridium, and the contexture of the peridium, which were stable indicators of generic boundaries in Lophiostomataceae . Four additional new genera with five new species were recognised based on these morphological differences: Crassiclypeus ( C . aquaticus ), Flabellascoma ( F . cycadicola and F . minimum ), Leptoparies ( Lep . palmarum ), and Pseudopaucispora ( Pseudop . brunneospora ). Three new species were added to Pseudolophiostoma ( Pseudol . cornisporum , Pseudol . obtusisporum , and Pseudol . tropicum ) and two new species were added to Vaginatispora ( V . amygdali and V . scabrispora ). The re-evaluation of the validity of several previously recognised genera resulted in the introduction of two new genera with new combinations for Lophiostoma pseudoarmatisporum as Parapaucispora pseudoarmatispora and Vaginatispora fuckelii as Neovaginatispora fuckelii .
Undergraduate Students’ Initial Ability in Understanding Phylogenetic Tree
NASA Astrophysics Data System (ADS)
Sa'adah, S.; Hidayat, T.; Sudargo, Fransisca
2017-04-01
The Phylogenetic tree is a visual representation depicts a hypothesis about the evolutionary relationship among taxa. Evolutionary experts use this representation to evaluate the evidence for evolution. The phylogenetic tree is currently growing for many disciplines in biology. Consequently, learning about the phylogenetic tree has become an important part of biological education and an interesting area of biology education research. Skill to understanding and reasoning of the phylogenetic tree, (called tree thinking) is an important skill for biology students. However, research showed many students have difficulty in interpreting, constructing, and comparing among the phylogenetic tree, as well as experiencing a misconception in the understanding of the phylogenetic tree. Students are often not taught how to reason about evolutionary relationship depicted in the diagram. Students are also not provided with information about the underlying theory and process of phylogenetic. This study aims to investigate the initial ability of undergraduate students in understanding and reasoning of the phylogenetic tree. The research method is the descriptive method. Students are given multiple choice questions and an essay that representative by tree thinking elements. Each correct answer made percentages. Each student is also given questionnaires. The results showed that the undergraduate students’ initial ability in understanding and reasoning phylogenetic tree is low. Many students are not able to answer questions about the phylogenetic tree. Only 19 % undergraduate student who answered correctly on indicator evaluate the evolutionary relationship among taxa, 25% undergraduate student who answered correctly on indicator applying concepts of the clade, 17% undergraduate student who answered correctly on indicator determines the character evolution, and only a few undergraduate student who can construct the phylogenetic tree.
Vieira, Willie A S; Lima, Waléria G; Nascimento, Eduardo S; Michereff, Sami J; Câmara, Marcos P S; Doyle, Vinson P
2017-01-01
Developing a comprehensive and reliable taxonomy for the Colletotrichum gloeosporioides species complex will require adopting data standards on the basis of an understanding of how methodological choices impact morphological evaluations and phylogenetic inference. We explored the impact of methodological choices in a morphological and molecular evaluation of Colletotrichum species associated with banana in Brazil. The choice of alignment filtering algorithm has a significant impact on topological inference and the retention of phylogenetically informative sites. Similarly, the choice of phylogenetic marker affects the delimitation of species boundaries, particularly if low phylogenetic signal is confounded with strong discordance, and inference of the species tree from multiple-gene trees. According to both phylogenetic informativeness profiling and Bayesian concordance analyses, the most informative loci are DNA lyase (APN2), intergenic spacer (IGS) between DNA lyase and the mating-type locus MAT1-2-1 (APN2/MAT-IGS), calmodulin (CAL), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), glutamine synthetase (GS), β-tubulin (TUB2), and a new marker, the intergenic spacer between GAPDH and an hypothetical protein (GAP2-IGS). Cornmeal agar minimizes the variance in conidial dimensions compared with potato dextrose agar and synthetic nutrient-poor agar, such that species are more readily distinguishable based on phenotypic differences. We apply these insights to investigate the diversity of Colletotrichum species associated with banana anthracnose in Brazil and report C. musae, C. tropicale, C. theobromicola, and C. siamense in association with banana anthracnose. One lineage did not cluster with any previously described species and is described here as C. chrysophilum.
Streptococcus himalayensis sp. nov., isolated from the respiratory tract of Marmota himalayana.
Niu, Lina; Lu, Shan; Lai, Xin-He; Hu, Shoukui; Chen, Cuixia; Zhang, Gui; Yang, Jing; Jin, Dong; Wang, Yi; Lan, Ruiting; Lu, Gang; Xie, Yingping; Ye, Changyun; Xu, Jianguo
2017-02-01
Five strains of Gram-positive-staining, catalase-negative, coccus-shaped, chain-forming organisms isolated separately from the respiratory tracts of five Marmota himalayana animals in the Qinghai-Tibet Plateau of China were subjected to phenotypic and molecular taxonomic analyses. Comparative analysis of the 16S rRNA gene indicated that these singular organisms represent a new member of the genus Streptococcus, being phylogenetically closest to Streptococcus marmotae DSM 101995T (98.4 % similarity). The groEL, sodA and rpoB sequence analysis showed interspecies similarity values between HTS2T and Streptococcus. marmotae DSM 101995T, its closest phylogenetic relative based on 16S rRNA gene sequences, of 98.2, 78.8 and 93.7 %, respectively. A whole-genome phylogenetic tree built from 82 core genes of genomes from 16 species of the genus Streptococcus validated that HTS2T forms a distinct subline and exhibits specific phylogenetic affinity with S. marmotae. In silico DNA-DNA hybridization of HTS2T showed an estimated DNA reassociation value of 40.5 % with Streptococcus. marmotae DSM 101995T. On the basis of their phenotypic characteristics and phylogenetic findings, it is proposed that the five isolates be classified as representatives of a novel species of the genus Streptococcus, Streptococcus himalayensis sp. nov. The type strain is HTS2T (=DSM 101997T=CGMCC 1.15533T). The genome of Streptococcus himalayensis sp. nov. strain HTS2T contains 2195 genes with a size of 2 275 471 bp and a mean DNA G+C content of 41.3 mol%.
Phylogenetic effective sample size.
Bartoszek, Krzysztof
2016-10-21
In this paper I address the question-how large is a phylogenetic sample? I propose a definition of a phylogenetic effective sample size for Brownian motion and Ornstein-Uhlenbeck processes-the regression effective sample size. I discuss how mutual information can be used to define an effective sample size in the non-normal process case and compare these two definitions to an already present concept of effective sample size (the mean effective sample size). Through a simulation study I find that the AICc is robust if one corrects for the number of species or effective number of species. Lastly I discuss how the concept of the phylogenetic effective sample size can be useful for biodiversity quantification, identification of interesting clades and deciding on the importance of phylogenetic correlations. Copyright © 2016 Elsevier Ltd. All rights reserved.
Moreira, Xoaquín; Abdala-Roberts, Luis; Galmán, Andrea; Francisco, Marta; Fuente, María de la; Butrón, Ana; Rasmann, Sergio
2018-06-07
Biogeographical factors and phylogenetic history are key determinants of inter-specific variation in plant defences. However, few studies have conducted broad-scale geographical comparisons of plant defences while controlling for phylogenetic relationships, and, in doing so, none have separated constitutive from induced defences. This gap has limited our understanding of how historical or large-scale processes mediate biogeographical patterns in plant defences since these may be contingent upon shared evolutionary history and phylogenetic constraints. We conducted a phylogenetically-controlled experiment testing for differences in constitutive leaf chemical defences and their inducibility between Palearctic and Nearctic oak species (Quercus, total 18 species). We induced defences in one-year old plants by inflicting damage by gypsy moth larvae (Lymantria dispar), estimated the amount of leaf area consumed, and quantified various groups of phenolic compounds. There was no detectable phylogenetic signal for constitutive or induced levels of most defensive traits except for constitutive condensed tannins, as well as no phylogenetic signal in leaf herbivory. We did, however, find marked differences in defence levels between oak species from each region: Palearctic species had higher levels of constitutive condensed tannins, but less constitutive lignins and less constitutive and induced hydrolysable tannins compared with Nearctic species. Additionally, Palearctic species had lower levels of leaf damage compared with Nearctic species. These differences in leaf damage, lignins and hydrolysable (but not condensed) tannins were lost after accounting for phylogeny, suggesting that geographical structuring of phylogenetic relationships mediated biogeographical differences in defences and herbivore resistance. Together, these findings suggest that historical processes and large-scale drivers have shaped differences in allocation to constitutive defences (and in turn resistance) between Palearctic and Nearctic oaks. Moreover, although evidence of phylogenetic conservatism in the studied traits is rather weak, shared evolutionary history appears to mediate some of these biogeographical patterns in allocation to chemical defences. Copyright © 2018 Elsevier Ltd. All rights reserved.
Phylogeny of the Genus Drosophila
O’Grady, Patrick M.; DeSalle, Rob
2018-01-01
Understanding phylogenetic relationships among taxa is key to designing and implementing comparative analyses. The genus Drosophila, which contains over 1600 species, is one of the most important model systems in the biological sciences. For over a century, one species in this group, Drosophila melanogaster, has been key to studies of animal development and genetics, genome organization and evolution, and human disease. As whole-genome sequencing becomes more cost-effective, there is increasing interest in other members of this morphologically, ecologically, and behaviorally diverse genus. Phylogenetic relationships within Drosophila are complicated, and the goal of this paper is to provide a review of the recent taxonomic changes and phylogenetic relationships in this genus to aid in further comparative studies. PMID:29716983
Jeon, Sun Jeong; Nguyen, Thi Thuong Thuong; Lee, Hyang Burm
2015-09-01
A seed-borne fungus, Curvularia sp. EML-KWD01, was isolated from an indigenous wheat seed by standard blotter method. This fungus was characterized based on the morphological characteristics and molecular phylogenetic analysis. Phylogenetic status of the fungus was determined using sequences of three loci: rDNA internal transcribed spacer, large ribosomal subunit, and glyceraldehyde 3-phosphate dehydrogenase gene. Multi loci sequencing analysis revealed that this fungus was Curvularia spicifera within Curvularia group 2 of family Pleosporaceae.
Zautner, Andreas Erich; Masanta, Wycliffe Omurwa; Tareen, Abdul Malik; Weig, Michael; Lugert, Raimond; Groß, Uwe; Bader, Oliver
2013-11-07
Campylobacter jejuni, the most common bacterial pathogen causing gastroenteritis, shows a wide genetic diversity. Previously, we demonstrated by the combination of multi locus sequence typing (MLST)-based UPGMA-clustering and analysis of 16 genetic markers that twelve different C. jejuni subgroups can be distinguished. Among these are two prominent subgroups. The first subgroup contains the majority of hyperinvasive strains and is characterized by a dimeric form of the chemotaxis-receptor Tlp7(m+c). The second has an extended amino acid metabolism and is characterized by the presence of a periplasmic asparaginase (ansB) and gamma-glutamyl-transpeptidase (ggt). Phyloproteomic principal component analysis (PCA) hierarchical clustering of MALDI-TOF based intact cell mass spectrometry (ICMS) spectra was able to group particular C. jejuni subgroups of phylogenetic related isolates in distinct clusters. Especially the aforementioned Tlp7(m+c)(+) and ansB+/ ggt+ subgroups could be discriminated by PCA. Overlay of ICMS spectra of all isolates led to the identification of characteristic biomarker ions for these specific C. jejuni subgroups. Thus, mass peak shifts can be used to identify the C. jejuni subgroup with an extended amino acid metabolism. Although the PCA hierarchical clustering of ICMS-spectra groups the tested isolates into a different order as compared to MLST-based UPGMA-clustering, the isolates of the indicator-groups form predominantly coherent clusters. These clusters reflect phenotypic aspects better than phylogenetic clustering, indicating that the genes corresponding to the biomarker ions are phylogenetically coupled to the tested marker genes. Thus, PCA clustering could be an additional tool for analyzing the relatedness of bacterial isolates.
The Evolution of Tyrosine-Recombinase Elements in Nematoda
Szitenberg, Amir; Koutsovoulos, Georgios; Blaxter, Mark L.; Lunt, David H.
2014-01-01
Transposable elements can be categorised into DNA and RNA elements based on their mechanism of transposition. Tyrosine recombinase elements (YREs) are relatively rare and poorly understood, despite sharing characteristics with both DNA and RNA elements. Previously, the Nematoda have been reported to have a substantially different diversity of YREs compared to other animal phyla: the Dirs1-like YRE retrotransposon was encountered in most animal phyla but not in Nematoda, and a unique Pat1-like YRE retrotransposon has only been recorded from Nematoda. We explored the diversity of YREs in Nematoda by sampling broadly across the phylum and including 34 genomes representing the three classes within Nematoda. We developed a method to isolate and classify YREs based on both feature organization and phylogenetic relationships in an open and reproducible workflow. We also ensured that our phylogenetic approach to YRE classification identified truncated and degenerate elements, informatively increasing the number of elements sampled. We identified Dirs1-like elements (thought to be absent from Nematoda) in the nematode classes Enoplia and Dorylaimia indicating that nematode model species do not adequately represent the diversity of transposable elements in the phylum. Nematode Pat1-like elements were found to be a derived form of another Pat1-like element that is present more widely in animals. Several sequence features used widely for the classification of YREs were found to be homoplasious, highlighting the need for a phylogenetically-based classification scheme. Nematode model species do not represent the diversity of transposable elements in the phylum. PMID:25197791
The evolution of tyrosine-recombinase elements in Nematoda.
Szitenberg, Amir; Koutsovoulos, Georgios; Blaxter, Mark L; Lunt, David H
2014-01-01
Transposable elements can be categorised into DNA and RNA elements based on their mechanism of transposition. Tyrosine recombinase elements (YREs) are relatively rare and poorly understood, despite sharing characteristics with both DNA and RNA elements. Previously, the Nematoda have been reported to have a substantially different diversity of YREs compared to other animal phyla: the Dirs1-like YRE retrotransposon was encountered in most animal phyla but not in Nematoda, and a unique Pat1-like YRE retrotransposon has only been recorded from Nematoda. We explored the diversity of YREs in Nematoda by sampling broadly across the phylum and including 34 genomes representing the three classes within Nematoda. We developed a method to isolate and classify YREs based on both feature organization and phylogenetic relationships in an open and reproducible workflow. We also ensured that our phylogenetic approach to YRE classification identified truncated and degenerate elements, informatively increasing the number of elements sampled. We identified Dirs1-like elements (thought to be absent from Nematoda) in the nematode classes Enoplia and Dorylaimia indicating that nematode model species do not adequately represent the diversity of transposable elements in the phylum. Nematode Pat1-like elements were found to be a derived form of another Pat1-like element that is present more widely in animals. Several sequence features used widely for the classification of YREs were found to be homoplasious, highlighting the need for a phylogenetically-based classification scheme. Nematode model species do not represent the diversity of transposable elements in the phylum.
Toussaint, Emmanuel F A; Morinière, Jérôme; Müller, Chris J; Kunte, Krushnamegh; Turlin, Bernard; Hausmann, Axel; Balke, Michael
2015-10-01
The charismatic tropical Polyura Nawab butterflies are distributed across twelve biodiversity hotspots in the Indomalayan/Australasian archipelago. In this study, we tested an array of species delimitation methods and compared the results to existing morphology-based taxonomy. We sequenced two mitochondrial and two nuclear gene fragments to reconstruct phylogenetic relationships within Polyura using both Bayesian inference and maximum likelihood. Based on this phylogenetic framework, we used the recently introduced bGMYC, BPP and PTP methods to investigate species boundaries. Based on our results, we describe two new species Polyura paulettae Toussaint sp. n. and Polyura smilesi Toussaint sp. n., propose one synonym, and five populations are raised to species status. Most of the newly recognized species are single-island endemics likely resulting from the recent highly complex geological history of the Indomalayan-Australasian archipelago. Surprisingly, we also find two newly recognized species in the Indomalayan region where additional biotic or abiotic factors have fostered speciation. Species delimitation methods were largely congruent and succeeded to cross-validate most extant morphological species. PTP and BPP seem to yield more consistent and robust estimations of species boundaries with respect to morphological characters while bGMYC delivered contrasting results depending on the different gene trees considered. Our findings demonstrate the efficiency of comparative approaches using molecular species delimitation methods on empirical data. They also pave the way for the investigation of less well-known groups to unveil patterns of species richness and catalogue Earth's concealed, therefore unappreciated diversity. Published by Elsevier Inc.
Welcome to pandoraviruses at the ‘Fourth TRUC’ club
Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier
2015-01-01
Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9–2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the ‘Fourth TRUC’ club, encompassing distinct life forms compared with cellular organisms. PMID:26042093
Welcome to pandoraviruses at the 'Fourth TRUC' club.
Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier
2015-01-01
Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9-2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the 'Fourth TRUC' club, encompassing distinct life forms compared with cellular organisms.
González-Durán, Gustavo A; Targino, Mariane; Rada, Marco; Grant, Taran
2017-03-13
We evaluate the monophyly and phylogenetic relationships of the Pristimantis leptolophus species group and describe its external morphology, osteology, and some myological characteristics. We also compare the P. leptolophus species group to other related species groups. The P. leptolophus group is not monophyletic due to the inclusion of P. acatallelus, formerly believed to be part of the P. devillei group. The revised P. leptolophus group is composed of nine named species and six unnamed species. Based on our results, we recognize a new species group, the P. boulengeri species group, composed of eight species, many of which were previously assigned to the P. lacrimosus species group.
Leekitcharoenphon, Pimlapas; Friis, Carsten; Zankari, Ea; Svendsen, Christina Aaby; Price, Lance B; Rahmani, Maral; Herrero-Fresno, Ana; Fashae, Kayode; Vandenberg, Olivier; Aarestrup, Frank M; Hendriksen, Rene S
2013-10-15
Salmonella enterica serovar Typhimurium ST313 is an invasive and phylogenetically distinct lineage present in sub-Saharan Africa. We report the presence of S. Typhimurium ST313 from patients in the Democratic Republic of Congo and Nigeria. Eighteen S. Typhimurium ST313 isolates were characterized by antimicrobial susceptibility testing, pulsed-field gel electrophoresis (PFGE), and multilocus sequence typing (MLST). Additionally, six of the isolates were characterized by whole genome sequence typing (WGST). The presence of a putative virulence determinant was examined in 177 Salmonella isolates belonging to 57 different serovars. All S. Typhimurium ST313 isolates harbored resistant genes encoded by blaTEM1b, catA1, strA/B, sul1, and dfrA1. Additionally, aac(6')1aa gene was detected. Phylogenetic analyses revealed close genetic relationships among Congolese and Nigerian isolates from both blood and stool. Comparative genomic analyses identified a putative virulence fragment (ST313-TD) unique to S. Typhimurium ST313 and S. Dublin. We showed in a limited number of isolates that S. Typhimurium ST313 is a prevalent sequence-type causing gastrointestinal diseases and septicemia in patients from Nigeria and DRC. We found three distinct phylogenetic clusters based on the origin of isolation suggesting some spatial evolution. Comparative genomics showed an interesting putative virulence fragment (ST313-TD) unique to S. Typhimurium ST313 and invasive S. Dublin.
Jaeggi, Adrian V; Kramer, Karen L; Hames, Raymond; Kiely, Evan J; Gomes, Cristina; Kaplan, Hillard; Gurven, Michael
2017-04-01
Grooming has important utilitarian and social functions in primates but little is known about grooming and its functional analogues in traditional human societies. We compare human grooming to typical primate patterns to test its hygienic and social functions. Bayesian phylogenetic analyses were used to derive expected human grooming time given the potential associations between grooming, group size, body size, terrestriality, and several climatic variables across 69 primate species. This was compared against observed times dedicated to grooming, other hygienic behavior, and conversation among the Maya, Pumé, Sanöma, Tsimane', Yanomamö, and Ye'kwana (mean number of behavioral scans = 23,514). Expected grooming time for humans was 4% (95% Credible Interval = 0.07%-14%), similar to values observed in primates, based largely on terrestriality and phylogenetic signal (mean λ = 0.56). No other covariates strongly associated with grooming across primates. Observed grooming time across societies was 0.8%, lower than 89% of the expected values. However, the observed times dedicated to any hygienic behavior (3.0%) or "vocal grooming," that is conversation (7.3%), fell within the expected range. We found (i) that human grooming may be a (recent) phylogenetic outlier when defined narrowly as parasite removal but not defined broadly as personal hygiene, (ii) there was no support for thermoregulatory functions of grooming, and (iii) no support for the "vocal grooming" hypothesis of language having evolved as a less time-consuming means of bonding. Thus, human grooming reflects decreased hygienic needs, but similar social needs compared to primate grooming. © 2017 Wiley Periodicals, Inc.
Tanglegrams for rooted phylogenetic trees and networks
Scornavacca, Celine; Zickmann, Franziska; Huson, Daniel H.
2011-01-01
Motivation: In systematic biology, one is often faced with the task of comparing different phylogenetic trees, in particular in multi-gene analysis or cospeciation studies. One approach is to use a tanglegram in which two rooted phylogenetic trees are drawn opposite each other, using auxiliary lines to connect matching taxa. There is an increasing interest in using rooted phylogenetic networks to represent evolutionary history, so as to explicitly represent reticulate events, such as horizontal gene transfer, hybridization or reassortment. Thus, the question arises how to define and compute a tanglegram for such networks. Results: In this article, we present the first formal definition of a tanglegram for rooted phylogenetic networks and present a heuristic approach for computing one, called the NN-tanglegram method. We compare the performance of our method with existing tree tanglegram algorithms and also show a typical application to real biological datasets. For maximum usability, the algorithm does not require that the trees or networks are bifurcating or bicombining, or that they are on identical taxon sets. Availability: The algorithm is implemented in our program Dendroscope 3, which is freely available from www.dendroscope.org. Contact: scornava@informatik.uni-tuebingen.de; huson@informatik.uni-tuebingen.de PMID:21685078
Palmer, A. Richard
1996-01-01
Phylogenetic analyses of asymmetry variation offer a powerful tool for exploring the interplay between ontogeny and evolution because (i) conspicuous asymmetries exist in many higher metazoans with widely varying modes of development, (ii) patterns of bilateral variation within species may identify genetically and environmentally triggered asymmetries, and (iii) asymmetries arising at different times during development may be more sensitive to internal cytoplasmic inhomogeneities compared to external environmental stimuli. Using four broadly comparable asymmetry states (symmetry, antisymmetry, dextral, and sinistral), and two stages at which asymmetry appears developmentally (larval and postlarval), I evaluated relations between ontogenetic and phylogenetic patterns of asymmetry variation. Among 140 inferred phylogenetic transitions between asymmetry states, recorded from 11 classes in five phyla, directional asymmetry (dextral or sinistral) evolved directly from symmetrical ancestors proportionally more frequently among larval asymmetries. In contrast, antisymmetry, either as an end state or as a transitional stage preceding directional asymmetry, was confined primarily to postlarval asymmetries. The ontogenetic origin of asymmetry thus significantly influences its subsequent evolution. Furthermore, because antisymmetry typically signals an environmentally triggered asymmetry, the phylogenetic transition from antisymmetry to directional asymmetry suggests that many cases of laterally fixed asymmetries evolved via genetic assimilation. PMID:8962039
USDA-ARS?s Scientific Manuscript database
Arylamine N-acetyltransferases (NATs) are xenobiotic metabolizing enzymes characterized in several bacteria and eukaryotic organisms. We report a comprehensive phylogenetic analysis employing an exhaustive dataset of NAT-homologous sequences recovered through inspection of 2445 genomes. We describe ...
Parks, Donovan H; Beiko, Robert G
2013-01-01
High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.
Mishra, Priyanka; Kumar, Amit; Rodrigues, Vereena; Shukla, Ashutosh K; Sundaresan, Velusamy
2016-01-01
The internal transcribed spacer (ITS) region is situated between 18S and 26S in a polycistronic rRNA precursor transcript. It had been proved to be the most commonly sequenced region across plant species to resolve phylogenetic relationships ranging from shallow to deep taxonomic levels. Despite several taxonomical revisions in Cassiinae, a stable phylogeny remains elusive at the molecular level, particularly concerning the delineation of species in the genera Cassia, Senna and Chamaecrista . This study addresses the comparative potential of ITS datasets (ITS1, ITS2 and concatenated) in resolving the underlying morphological disparity in the highly complex genera, to assess their discriminatory power as potential barcode candidates in Cassiinae. A combination of experimental data and an in-silico approach based on threshold genetic distances, sequence similarity based and hierarchical tree-based methods was performed to decipher the discriminating power of ITS datasets on 18 different species of Cassiinae complex. Lab-generated s equences were compared against those available in the GenBank using BLAST and were aligned through MUSCLE 3.8.31 and analysed in PAUP 4.0 and BEAST1.8 using parsimony ratchet, maximum likelihood and Bayesian inference (BI) methods of gene and species tree reconciliation with bootstrapping. DNA barcoding gap was realized based on the Kimura two-parameter distance model (K2P) in TaxonDNA and MEGA. Based on the K2P distance, significant divergences between the inter- and intra-specific genetic distances were observed, while the presence of a DNA barcoding gap was obvious. The ITS1 region efficiently identified 81.63% and 90% of species using TaxonDNA and BI methods, respectively. The PWG-distance method based on simple pairwise matching indicated the significance of ITS1 whereby highest number of variable (210) and informative sites (206) were obtained. The BI tree-based methods outperformed the similarity-based methods producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. The reticulated phylogenetic hypothesis using the ITS1 region mainly supported the relationship between the species of Cassiinae established by traditional morphological methods. The ITS1 region showed a higher discrimination power and desirable characteristics as compared to ITS2 and ITS1 + 2, thereby concluding to be the locus of choice. Considering the complexity of the group and the underlying biological ambiguities, the results presented here are encouraging for developing DNA barcoding as a useful tool for resolving taxonomical challenges in corroboration with morphological framework.
New high through put approach to study ancient microbial phylogenetic diversity in permafrost
NASA Astrophysics Data System (ADS)
Spirina, E.; Cole, J.; Chai, B.; Gilichinksy, D.; Tiedje, J.
2003-04-01
The study of microbial diversity in the deep ancient permafrost can help to answer many questions: (1) what kind of mechanisms keeps microbial cells alive, (2) how many of phylogenetic groups exist in situ and never had been cultivated, (3) what is the difference between modern and ancient microorganisms? From this point, distinct environments were examined: Arctic and Antarctic modern soil and permafrost. 16S rDNA genes were amplified from genomic DNA extracted from both original frozen samples and the same samples incubated at 10oC for 8 weeks under both aerobic and anaerobic conditions to determine those capable to grow. High throughput DNA sequencing was performed on the cloned PCR products to obtain partial 16S rDNA gene sequences. The unique script was written to automatically compare over 2,000 partial sequences with those rrn sequences in the Ribosomal Database Project (RDP) release 8.1 using the SEQUENCE MATCH. Sequences were grouped into categories from the RDPs phylogenetic hierarchy based on the closest database matches. Investigation revealed significant microbial diversity; two phylogenetic groups were predominant in all samples: Proteobacteria and Gram Positive Bacteria. Microbial community composition within those groups is different from sample to sample. However, similar genera, such as Arthrobacter, Bacillus, Citrobacter, Caulobacter, Comamonas, Flavobacterium, Nocardioides, Pseudomonas, Rhodocyclus, Rhodococcus, Sphingobacterium, Sphingomonas, Streptococcus, Terrabacter appeared in both polar regions. The greatest microbial diversity was detected in Arctic surface samples. According to RDPs phylogenetic hierarchy those organisms are related to Proteobacteria_SD, Gram Positive Bacteria_SD, Leptospirillum-Nitrospira, Nitrospina_SD, Flexibacter-Cytophaga-Bacteroides, Planctomyces and Relatives. Both the aerobic and anaerobic low temperatures soil incubation yielded some microbes not detected in the original samples. It should be possible, using phylogenetic diversity from the same organisms from modern top layers to the several millions years old, to find out what are the differences among members of the same species as we go back in time. Then, if we compare those mutations rate with geological time, we can speculate on how fast or slow evolution or adaptation takes place and for that particular type of organism. This is a beginning of studies concerning the biological clocks extending back the duration of the permanently frozen state in the terrestrial and extraterrestrial soils, i. e. the age of biota.
Kim, W J; Ji, Y; Choi, G; Kang, Y M; Yang, S; Moon, B C
2016-08-05
This study was performed to identify and analyze the phylogenetic relationship among four herbaceous species of the genus Paeonia, P. lactiflora, P. japonica, P. veitchii, and P. suffruticosa, using DNA barcodes. These four species, which are commonly used in traditional medicine as Paeoniae Radix and Moutan Radicis Cortex, are pharmaceutically defined in different ways in the national pharmacopoeias in Korea, Japan, and China. To authenticate the different species used in these medicines, we evaluated rDNA-internal transcribed spacers (ITS), matK and rbcL regions, which provide information capable of effectively distinguishing each species from one another. Seventeen samples were collected from different geographic regions in Korea and China, and DNA barcode regions were amplified using universal primers. Comparative analyses of these DNA barcode sequences revealed species-specific nucleotide sequences capable of discriminating the four Paeonia species. Among the entire sequences of three barcodes, marker nucleotides were identified at three positions in P. lactiflora, eleven in P. japonica, five in P. veitchii, and 25 in P. suffruticosa. Phylogenetic analyses also revealed four distinct clusters showing homogeneous clades with high resolution at the species level. The results demonstrate that the analysis of these three DNA barcode sequences is a reliable method for identifying the four Paeonia species and can be used to authenticate Paeoniae Radix and Moutan Radicis Cortex at the species level. Furthermore, based on the assessment of amplicon sizes, inter/intra-specific distances, marker nucleotides, and phylogenetic analysis, rDNA-ITS was the most suitable DNA barcode for identification of these species.
FPGA Acceleration of the phylogenetic likelihood function for Bayesian MCMC inference methods.
Zierke, Stephanie; Bakos, Jason D
2010-04-12
Likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations. As such it contains a high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA's on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resultant co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors. We use the MrBayes 3 tool as a framework for designing our co-processor. For large datasets, we estimate that our accelerated MrBayes, if run on a current-generation FPGA, achieves a 10x speedup relative to software running on a state-of-the-art server-class microprocessor. The FPGA-based implementation achieves its performance by deeply pipelining the likelihood computations, performing multiple floating-point operations in parallel, and through a natural log approximation that is chosen specifically to leverage a deeply pipelined custom architecture. Heterogeneous computing, which combines general-purpose processors with special-purpose co-processors such as FPGAs and GPUs, is a promising approach for high-performance phylogeny inference as shown by the growing body of literature in this field. FPGAs in particular are well-suited for this task because of their low power consumption as compared to many-core processors and Graphics Processor Units (GPUs).
Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan
2012-01-01
This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques.
2010-01-01
Background Animal mitochondrial genomes are potential models for molecular evolution and markers for phylogenetic and population studies. Previous research has shown interesting features in hymenopteran mitochondrial genomes. Here, we conducted a comparative study of mitochondrial genomes of the family Braconidae, one of the largest families of Hymenoptera, and assessed the utility of mitochondrial genomic data for phylogenetic inference at three different hierarchical levels, i.e., Braconidae, Hymenoptera, and Holometabola. Results Seven mitochondrial genomes from seven subfamilies of Braconidae were sequenced. Three of the four sequenced A+T-rich regions are shown to be inverted. Furthermore, all species showed reversal of strand asymmetry, suggesting that inversion of the A+T-rich region might be a synapomorphy of the Braconidae. Gene rearrangement events occurred in all braconid species, but gene rearrangement rates were not taxonomically correlated. Most rearranged genes were tRNAs, except those of Cotesia vestalis, in which 13 protein-coding genes and 14 tRNA genes changed positions or/and directions through three kinds of gene rearrangement events. Remote inversion is posited to be the result of two independent recombination events. Evolutionary rates were lower in species of the cyclostome group than those of noncyclostomes. Phylogenetic analyses based on complete mitochondrial genomes and secondary structure of rrnS supported a sister-group relationship between Aphidiinae and cyclostomes. Many well accepted relationships within Hymenoptera, such as paraphyly of Symphyta and Evaniomorpha, a sister-group relationship between Orussoidea and Apocrita, and monophyly of Proctotrupomorpha, Ichneumonoidea and Aculeata were robustly confirmed. New hypotheses, such as a sister-group relationship between Evanioidea and Aculeata, were generated. Among holometabolous insects, Hymenoptera was shown to be the sister to all other orders. Mecoptera was recovered as the sister-group of Diptera. Neuropterida (Neuroptera + Megaloptera), and a sister-group relationship with (Diptera + Mecoptera) were supported across all analyses. Conclusions Our comparative studies indicate that mitochondrial genomes are a useful phylogenetic tool at the ordinal level within Holometabola, at the superfamily within Hymenoptera and at the subfamily level within Braconidae. Variation at all of these hierarchical levels suggests that the utility of mitochondrial genomes is likely to be a valuable tool for systematics in other groups of arthropods. PMID:20537196
Wang, Ran; Gamon, John A; Cavender-Bares, Jeannine; Townsend, Philip A; Zygielbaum, Arthur I
2018-03-01
Remote sensing has been used to detect plant biodiversity in a range of ecosystems based on the varying spectral properties of different species or functional groups. However, the most appropriate spatial resolution necessary to detect diversity remains unclear. At coarse resolution, differences among spectral patterns may be too weak to detect. In contrast, at fine resolution, redundant information may be introduced. To explore the effect of spatial resolution, we studied the scale dependence of spectral diversity in a prairie ecosystem experiment at Cedar Creek Ecosystem Science Reserve, Minnesota, USA. Our study involved a scaling exercise comparing synthetic pixels resampled from high-resolution images within manipulated diversity treatments. Hyperspectral data were collected using several instruments on both ground and airborne platforms. We used the coefficient of variation (CV) of spectral reflectance in space as the indicator of spectral diversity and then compared CV at different scales ranging from 1 mm 2 to 1 m 2 to conventional biodiversity metrics, including species richness, Shannon's index, Simpson's index, phylogenetic species variation, and phylogenetic species evenness. In this study, higher species richness plots generally had higher CV. CV showed higher correlations with Shannon's index and Simpson's index than did species richness alone, indicating evenness contributed to the spectral diversity. Correlations with species richness and Simpson's index were generally higher than with phylogenetic species variation and evenness measured at comparable spatial scales, indicating weaker relationships between spectral diversity and phylogenetic diversity metrics than with species diversity metrics. High resolution imaging spectrometer data (1 mm 2 pixels) showed the highest sensitivity to diversity level. With decreasing spatial resolution, the difference in CV between diversity levels decreased and greatly reduced the optical detectability of biodiversity. The optimal pixel size for distinguishing α diversity in these prairie plots appeared to be around 1 mm to 10 cm, a spatial scale similar to the size of an individual herbaceous plant. These results indicate a strong scale-dependence of the spectral diversity-biodiversity relationships, with spectral diversity best able to detect a combination of species richness and evenness, and more weakly detecting phylogenetic diversity. These findings can be used to guide airborne studies of biodiversity and develop more effective large-scale biodiversity sampling methods. ©2018 The Authors Ecological Applications published by Wiley Periodicals, Inc. on behalf of Ecological Society of America.
Wei, Shu-jun; Shi, Min; Sharkey, Michael J; van Achterberg, Cornelis; Chen, Xue-xin
2010-06-11
Animal mitochondrial genomes are potential models for molecular evolution and markers for phylogenetic and population studies. Previous research has shown interesting features in hymenopteran mitochondrial genomes. Here, we conducted a comparative study of mitochondrial genomes of the family Braconidae, one of the largest families of Hymenoptera, and assessed the utility of mitochondrial genomic data for phylogenetic inference at three different hierarchical levels, i.e., Braconidae, Hymenoptera, and Holometabola. Seven mitochondrial genomes from seven subfamilies of Braconidae were sequenced. Three of the four sequenced A+T-rich regions are shown to be inverted. Furthermore, all species showed reversal of strand asymmetry, suggesting that inversion of the A+T-rich region might be a synapomorphy of the Braconidae. Gene rearrangement events occurred in all braconid species, but gene rearrangement rates were not taxonomically correlated. Most rearranged genes were tRNAs, except those of Cotesia vestalis, in which 13 protein-coding genes and 14 tRNA genes changed positions or/and directions through three kinds of gene rearrangement events. Remote inversion is posited to be the result of two independent recombination events. Evolutionary rates were lower in species of the cyclostome group than those of noncyclostomes. Phylogenetic analyses based on complete mitochondrial genomes and secondary structure of rrnS supported a sister-group relationship between Aphidiinae and cyclostomes. Many well accepted relationships within Hymenoptera, such as paraphyly of Symphyta and Evaniomorpha, a sister-group relationship between Orussoidea and Apocrita, and monophyly of Proctotrupomorpha, Ichneumonoidea and Aculeata were robustly confirmed. New hypotheses, such as a sister-group relationship between Evanioidea and Aculeata, were generated. Among holometabolous insects, Hymenoptera was shown to be the sister to all other orders. Mecoptera was recovered as the sister-group of Diptera. Neuropterida (Neuroptera + Megaloptera), and a sister-group relationship with (Diptera + Mecoptera) were supported across all analyses. Our comparative studies indicate that mitochondrial genomes are a useful phylogenetic tool at the ordinal level within Holometabola, at the superfamily within Hymenoptera and at the subfamily level within Braconidae. Variation at all of these hierarchical levels suggests that the utility of mitochondrial genomes is likely to be a valuable tool for systematics in other groups of arthropods.
Blastocystis phylogeny among various isolates from humans to insects.
Yoshikawa, Hisao; Koyama, Yukiko; Tsuchiya, Erika; Takami, Kazutoshi
2016-12-01
Blastocystis is a common unicellular eukaryotic parasite found not only in humans, but also in various kinds of animal species worldwide. Since Blastocystis isolates are morphologically indistinguishable, many molecular biological approaches have been applied to classify these isolates. The complete or partial sequences of the small subunit rRNA gene (SSU rDNA) are mainly used for comparisons and phylogenetic analyses among Blastocystis isolates. However, various lengths of the partial SSU rDNA sequence have been used for phylogenetic inference among genetically different isolates. Based on the complete SSU rDNA sequences, consensus terminology of nine subtypes (STs) of Blastocystis sp. that were supported by phylogenetically monophyletic nine clades was proposed in 2007. Thereafter, eight additional kinds of STs comprising non-human mammalian Blastocystis isolates have been reported based on the phylogeny of SSU rDNA sequences, while STs 11 and 12 were only proposed on the base of partial sequences. Although many sequence data from mammalian and avian Blastocystis are registered in GenBank, only limited data on SSU rDNA are available for poikilotherm-derived Blastocystis isolates. Therefore, the phylogenetic positions of the reptilian/amphibian Blastocystis clades are unstable. The phylogenetic inference of various STs comprising mammalian and/or avian Blastocystis isolates was verified herein based on comparisons between partial and complete SSU rDNA sequences, and the phylogenetic positions of reptilian and amphibian Blastocystis isolates were also investigated using 14 new Blastocystis isolates from reptiles with all known isolates from other reptilians, amphibians, and insects registered in GenBank. Copyright © 2016. Published by Elsevier Ireland Ltd.
Constructing phylogenetic trees using interacting pathways.
Wan, Peng; Che, Dongsheng
2013-01-01
Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches of constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to accumulation of huge amounts of biological data, which in turn changed the way of biological studies in various aspects. In this paper, we report our approach of building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering on two domains of organisms-eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.
Goggin, C L; Barker, S C
1993-07-01
Parasites of the genus Perkinsus destroy marine molluscs worldwide. Their phylogenetic position within the kingdom Protista is controversial. Nucleotide sequence data (1792 bp) from the small subunit rRNA gene of Perkinsus sp. from Anadara trapezia (Mollusca: Bivalvia) from Moreton Bay, Queensland, was used to examine the phylogenetic affinities of this enigmatic genus. These data were aligned with nucleotide sequences from 6 apicomplexans, 3 ciliates, 3 flagellates, a dinoflagellate, 3 fungi, maize and human. Phylogenetic trees were constructed after analysis with maximum parsimony and distance matrix methods. Our analyses indicate that Perkinsus is phylogenetically closer to dinoflagellates and to coccidean and piroplasm apicomplexans than to fungi or flagellates.
Jin, Yi; Qian, Hong; Yu, Mingjian
2015-01-01
Investigating patterns of phylogenetic structure across different life stages of tree species in forests is crucial to understanding forest community assembly, and investigating forest gap influence on the phylogenetic structure of forest regeneration is necessary for understanding forest community assembly. Here, we examine the phylogenetic structure of tree species across life stages from seedlings to canopy trees, as well as forest gap influence on the phylogenetic structure of forest regeneration in a forest of the subtropical region in China. We investigate changes in phylogenetic relatedness (measured as NRI) of tree species from seedlings, saplings, treelets to canopy trees; we compare the phylogenetic turnover (measured as βNRI) between canopy trees and seedlings in forest understory with that between canopy trees and seedlings in forest gaps. We found that phylogenetic relatedness generally increases from seedlings through saplings and treelets up to canopy trees, and that phylogenetic relatedness does not differ between seedlings in forest understory and those in forest gaps, but phylogenetic turnover between canopy trees and seedlings in forest understory is lower than that between canopy trees and seedlings in forest gaps. We conclude that tree species tend to be more closely related from seedling to canopy layers, and that forest gaps alter the seedling phylogenetic turnover of the studied forest. It is likely that the increasing trend of phylogenetic clustering as tree stem size increases observed in this subtropical forest is primarily driven by abiotic filtering processes, which select a set of closely related evergreen broad-leaved tree species whose regeneration has adapted to the closed canopy environments of the subtropical forest developed under the regional monsoon climate.
Jin, Yi; Qian, Hong; Yu, Mingjian
2015-01-01
Investigating patterns of phylogenetic structure across different life stages of tree species in forests is crucial to understanding forest community assembly, and investigating forest gap influence on the phylogenetic structure of forest regeneration is necessary for understanding forest community assembly. Here, we examine the phylogenetic structure of tree species across life stages from seedlings to canopy trees, as well as forest gap influence on the phylogenetic structure of forest regeneration in a forest of the subtropical region in China. We investigate changes in phylogenetic relatedness (measured as NRI) of tree species from seedlings, saplings, treelets to canopy trees; we compare the phylogenetic turnover (measured as βNRI) between canopy trees and seedlings in forest understory with that between canopy trees and seedlings in forest gaps. We found that phylogenetic relatedness generally increases from seedlings through saplings and treelets up to canopy trees, and that phylogenetic relatedness does not differ between seedlings in forest understory and those in forest gaps, but phylogenetic turnover between canopy trees and seedlings in forest understory is lower than that between canopy trees and seedlings in forest gaps. We conclude that tree species tend to be more closely related from seedling to canopy layers, and that forest gaps alter the seedling phylogenetic turnover of the studied forest. It is likely that the increasing trend of phylogenetic clustering as tree stem size increases observed in this subtropical forest is primarily driven by abiotic filtering processes, which select a set of closely related evergreen broad-leaved tree species whose regeneration has adapted to the closed canopy environments of the subtropical forest developed under the regional monsoon climate. PMID:26098916
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chakraborty, Sandeep; Rao, Basuthkar J.; Baker, Nathan A.
2013-04-01
Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evolutionary relationship in these proteins which occasionally remains undetected due to considerable sequence divergence. Structural alignment programs have been developed to unravel such fuzzy relationships. However, none of these structure based methods have used electrostatic properties to discriminate between spatially equivalent residues. We present a methodology for MSA of a set of related proteins with known structures using electrostatic properties as an additional discriminator (STEEP). STEEP first extracts a profile, then generates a multiple structural superimposition providing a consolidated spatial framework for comparing residues and finally emits themore » MSA. Residues that are aligned differently by including or excluding electrostatic properties can be targeted by directed evolution experiments to transform the enzymatic properties of one protein into another. We have compared STEEP results to those obtained from a MSA program (ClustalW) and a structural alignment method (MUSTANG) for chymotrypsin serine proteases. Subsequently, we used PhyML to generate phylogenetic trees for the serine and metallo-β-lactamase superfamilies from the STEEP generated MSA, and corroborated the accepted relationships in these superfamilies. We have observed that STEEP acts as a functional classifier when electrostatic congruence is used as a discriminator, and thus identifies potential targets for directed evolution experiments. In summary, STEEP is unique among phylogenetic methods for its ability to use electrostatic congruence to specify mutations that might be the source of the functional divergence in a protein family. Based on our results, we also hypothesize that the active site and its close vicinity contains enough information to infer the correct phylogeny for related proteins.« less
Fossils matter: improved estimates of divergence times in Pinus reveal older diversification.
Saladin, Bianca; Leslie, Andrew B; Wüest, Rafael O; Litsios, Glenn; Conti, Elena; Salamin, Nicolas; Zimmermann, Niklaus E
2017-04-04
The taxonomy of pines (genus Pinus) is widely accepted and a robust gene tree based on entire plastome sequences exists. However, there is a large discrepancy in estimated divergence times of major pine clades among existing studies, mainly due to differences in fossil placement and dating methods used. We currently lack a dated molecular phylogeny that makes use of the rich pine fossil record, and this study is the first to estimate the divergence dates of pines based on a large number of fossils (21) evenly distributed across all major clades, in combination with applying both node and tip dating methods. We present a range of molecular phylogenetic trees of Pinus generated within a Bayesian framework. We find the origin of crown Pinus is likely up to 30 Myr older (Early Cretaceous) than inferred in most previous studies (Late Cretaceous) and propose generally older divergence times for major clades within Pinus than previously thought. Our age estimates vary significantly between the different dating approaches, but the results generally agree on older divergence times. We present a revised list of 21 fossils that are suitable to use in dating or comparative analyses of pines. Reliable estimates of divergence times in pines are essential if we are to link diversification processes and functional adaptation of this genus to geological events or to changing climates. In addition to older divergence times in Pinus, our results also indicate that node age estimates in pines depend on dating approaches and the specific fossil sets used, reflecting inherent differences in various dating approaches. The sets of dated phylogenetic trees of pines presented here provide a way to account for uncertainties in age estimations when applying comparative phylogenetic methods.
Callejón, R; Halajian, A; de Rojas, M; Marrugal, A; Guevara, D; Cutillas, C
2012-05-25
Comparative morphological, biometrical and molecular studies of Trichuris discolor isolated from Bos taurus from Spain and Iran was carried out. Furthermore, Trichuris ovis isolated from B. taurus and Capra hircus from Spain has been, molecularly, analyzed. Morphological studies revealed clear differences between T. ovis and T. discolor isolated from B. taurus but differences were not observed between populations of T. discolor isolated from different geographical regions. Nevertheless, the molecular studies based on the amplification and sequencing of the internal transcribed spacers 1 and 2 ribosomal DNA and 16S partial gene mitochondrial DNA showed clear differences between both populations of T. discolor from Spain and Iran suggesting two cryptic species. Phylogenetic studies corroborated these data. Thus, phylogenetic trees based on ITS1, ITS2 and 16S partial gene sequences showed that individuals of T. discolor from B. taurus from Iran clustered together and separated, with high bootstrap values, of T. discolor isolated from B. taurus from Spain, while populations of T. ovis from B. taurus and C. hircus from Spain clustered together but separated with high bootstrap values of both populations of T. discolor. Furthermore, a comparative phylogenetic study has been carried out with the ITS1and ITS2 sequences of Trichuris species from different hosts. Three clades were observed: the first clustered all the species of Trichuris parasitizing herbivores (T. discolor, T. ovis, Trichuris leporis and Trichuris skrjabini), the second clustered all the species of Trichuris parasitizing omnivores (Trichuris trichiura and Trichuris suis) and finally, the third clustered species of Trichuris parasitizing carnivores (Trichuris muris, Trichuris arvicolae and Trichuris vulpis). Copyright © 2011 Elsevier B.V. All rights reserved.
QueTAL: a suite of tools to classify and compare TAL effectors functionally and phylogenetically
Pérez-Quintero, Alvaro L.; Lamy, Léo; Gordon, Jonathan L.; Escalon, Aline; Cunnac, Sébastien; Szurek, Boris; Gagnevin, Lionel
2015-01-01
Transcription Activator-Like (TAL) effectors from Xanthomonas plant pathogenic bacteria can bind to the promoter region of plant genes and induce their expression. DNA-binding specificity is governed by a central domain made of nearly identical repeats, each determining the recognition of one base pair via two amino acid residues (a.k.a. Repeat Variable Di-residue, or RVD). Knowing how TAL effectors differ from each other within and between strains would be useful to infer functional and evolutionary relationships, but their repetitive nature precludes reliable use of traditional alignment methods. The suite QueTAL was therefore developed to offer tailored tools for comparison of TAL effector genes. The program DisTAL considers each repeat as a unit, transforms a TAL effector sequence into a sequence of coded repeats and makes pair-wise alignments between these coded sequences to construct trees. The program FuncTAL is aimed at finding TAL effectors with similar DNA-binding capabilities. It calculates correlations between position weight matrices of potential target DNA sequence predicted from the RVD sequence, and builds trees based on these correlations. The programs accurately represented phylogenetic and functional relationships between TAL effectors using either simulated or literature-curated data. When using the programs on a large set of TAL effector sequences, the DisTAL tree largely reflected the expected species phylogeny. In contrast, FuncTAL showed that TAL effectors with similar binding capabilities can be found between phylogenetically distant taxa. This suite will help users to rapidly analyse any TAL effector genes of interest and compare them to other available TAL genes and should improve our understanding of TAL effectors evolution. It is available at http://bioinfo-web.mpl.ird.fr/cgi-bin2/quetal/quetal.cgi. PMID:26284082
Meineke, Till; Manisseri, Chithra; Voigt, Christian A.
2014-01-01
The production of ethanol from pretreated plant biomass during fermentation is a strategy to mitigate climate change by substituting fossil fuels. However, biomass conversion is mainly limited by the recalcitrant nature of the plant cell wall. To overcome recalcitrance, the optimization of the plant cell wall for subsequent processing is a promising approach. Based on their phylogenetic proximity to existing and emerging energy crops, model plants have been proposed to study bioenergy-related cell wall biochemistry. One example is Brachypodium distachyon, which has been considered as a general model plant for cell wall analysis in grasses. To test whether relative phylogenetic proximity would be sufficient to qualify as a model plant not only for cell wall composition but also for the complete process leading to bioethanol production, we compared the processing of leaf and stem biomass from the C3 grasses B. distachyon and Triticum aestivum (wheat) with the C4 grasses Zea mays (maize) and Miscanthus x giganteus, a perennial energy crop. Lambda scanning with a confocal laser-scanning microscope allowed a rapid qualitative analysis of biomass saccharification. A maximum of 108–117 mg ethanol·g−1 dry biomass was yielded from thermo-chemically and enzymatically pretreated stem biomass of the tested plant species. Principal component analysis revealed that a relatively strong correlation between similarities in lignocellulosic ethanol production and phylogenetic relation was only given for stem and leaf biomass of the two tested C4 grasses. Our results suggest that suitability of B. distachyon as a model plant for biomass conversion of energy crops has to be specifically tested based on applied processing parameters and biomass tissue type. PMID:25133818
Yu, Xiang-Qin; Drew, Bryan T; Yang, Jun-Bo; Gao, Lian-Ming; Li, De-Zhu
2017-01-01
Schima is an ecologically and economically important woody genus in tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder utilization of the genus for economic purposes. In the present study, we conducted comparative analysis among the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilo base pairs (kb). They consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, which corresponds well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.
Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.
Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M
2012-02-01
The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).
Vrancken, Bram; Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Longdon, Ben; Günthard, Huldrych F.; Suchard, Marc A.
2014-01-01
Phylogenetic signal quantifies the degree to which resemblance in continuously-valued traits reflects phylogenetic relatedness. Measures of phylogenetic signal are widely used in ecological and evolutionary research, and are recently gaining traction in viral evolutionary studies. Standard estimators of phylogenetic signal frequently condition on data summary statistics of the repeated trait observations and fixed phylogenetics trees, resulting in information loss and potential bias. To incorporate the observation process and phylogenetic uncertainty in a model-based approach, we develop a novel Bayesian inference method to simultaneously estimate the evolutionary history and phylogenetic signal from molecular sequence data and repeated multivariate traits. Our approach builds upon a phylogenetic diffusion framework that model continuous trait evolution as a Brownian motion process and incorporates Pagel’s λ transformation parameter to estimate dependence among traits. We provide a computationally efficient inference implementation in the BEAST software package. We evaluate the synthetic performance of the Bayesian estimator of phylogenetic signal against standard estimators, and demonstrate the use of our coherent framework to address several virus-host evolutionary questions, including virulence heritability for HIV, antigenic evolution in influenza and HIV, and Drosophila sensitivity to sigma virus infection. Finally, we discuss model extensions that will make useful contributions to our flexible framework for simultaneously studying sequence and trait evolution. PMID:25780554
TreSpEx—Detection of Misleading Signal in Phylogenetic Reconstructions Based on Tree Information
Struck, Torsten H
2014-01-01
Phylogenies of species or genes are commonplace nowadays in many areas of comparative biological studies. However, for phylogenetic reconstructions one must refer to artificial signals such as paralogy, long-branch attraction, saturation, or conflict between different datasets. These signals might eventually mislead the reconstruction even in phylogenomic studies employing hundreds of genes. Unfortunately, there has been no program allowing the detection of such effects in combination with an implementation into automatic process pipelines. TreSpEx (Tree Space Explorer) now combines different approaches (including statistical tests), which utilize tree-based information like nodal support or patristic distances (PDs) to identify misleading signals. The program enables the parallel analysis of hundreds of trees and/or predefined gene partitions, and being command-line driven, it can be integrated into automatic process pipelines. TreSpEx is implemented in Perl and supported on Linux, Mac OS X, and MS Windows. Source code, binaries, and additional material are freely available at http://www.annelida.de/research/bioinformatics/software.html. PMID:24701118
The power and pitfalls of HIV phylogenetics in public health.
Brooks, James I; Sandstrom, Paul A
2013-07-25
Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.
Mendes, Joana; Harris, D James; Carranza, Salvador; Salvi, Daniele
2016-07-01
Estimating the phylogeny of lacertid lizards, and particularly the tribe Lacertini has been challenging, possibly due to the fast radiation of this group resulting in a hard polytomy. However this is still an open question, as concatenated data primarily from mitochondrial markers have been used so far whereas in a recent phylogeny based on a compilation of these data within a squamate supermatrix the basal polytomy seems to be resolved. In this study, we estimate phylogenetic relationships between all Lacertini genera using for the first time DNA sequences from five fast evolving nuclear genes (acm4, mc1r, pdc, βfib and reln) and two mitochondrial genes (nd4 and 12S). We generated a total of 529 sequences from 88 species and used Maximum Likelihood and Bayesian Inference methods based on concatenated multilocus dataset as well as a coalescent-based species tree approach with the aim of (i) shedding light on the basal relationships of Lacertini (ii) assessing the monophyly of genera which were previously questioned, and (iii) discussing differences between estimates from this and previous studies based on different markers, and phylogenetic methods. Results uncovered (i) a new phylogenetic clade formed by the monotypic genera Archaeolacerta, Zootoca, Teira and Scelarcis; and (ii) support for the monophyly of the Algyroides clade, with two sister species pairs represented by western (A. marchi and A. fitzingeri) and eastern (A. nigropunctatus and A. moreoticus) lineages. In both cases the members of these groups show peculiar morphology and very different geographical distributions, suggesting that they are relictual groups that were once diverse and widespread. They probably originated about 11-13 million years ago during early events of speciation in the tribe, and the split between their members is estimated to be only slightly older. This scenario may explain why mitochondrial markers (possibly saturated at higher divergence levels) or slower nuclear markers used in previous studies (likely lacking enough phylogenetic signal) failed to recover these relationships. Finally, the phylogenetic position of most remaining genera was unresolved, corroborating the hypothesis of a hard polytomy in the Lacertini phylogeny due to a fast radiation. This is in agreement with all previous studies but in sharp contrast with a recent squamate megaphylogeny. We show that the supermatrix approach may provide high support for incorrect nodes that are not supported either by original sequence data or by new data from this study. This finding suggests caution when using megaphylogenies to integrate inter-generic relationships in comparative ecological and evolutionary studies. Copyright © 2016 Elsevier Inc. All rights reserved.
Martínez-Azorín, Mario; Crespo, Manuel B.; Juan, Ana; Fay, Michael F.
2011-01-01
Background and Aims The taxonomic arrangement within subfamily Ornithogaloideae (Hyacinthaceae) has been a matter of controversy in recent decades: several new taxonomic treatments have been proposed, based exclusively on plastid DNA sequences, and these have resulted in classifications which are to a great extent contradictory. Some authors have recognized only a single genus Ornithogalum for the whole subfamily, including 250–300 species of variable morphology, whereas others have recognized many genera. In the latter case, the genera are inevitably much smaller and they are better defined morphologically. However, some are not monophyletic as circumscribed. Methods Phylogenetic analyses of Ornithogaloideae were based on nucleotide sequences of four plastid regions (trnL intron, trnL-F spacer, rbcL and matK) and a nuclear region (ITS). Eighty species covering all relevant taxonomic groups previously recognized in the subfamily were sampled. Parsimony and Bayesian analyses were performed. The molecular data were compared with a matrix of 34 morphological characters. Key Results Combinations of plastid and nuclear data yielded phylogenetic trees which are better resolved than those obtained with any plastid region alone or plastid regions in combination. Three main clades are found, corresponding to the previously recognized tribes Albuceae, Dipcadieae and Ornithogaleae. In these, up to 19 clades are described which are definable by morphology and biogeography. These mostly correspond to previously described taxa, though some need recircumscription. Morphological characters are assessed for their diagnostic value for taxonomy in the subfamily. Conclusions On the basis of the phylogenetic analyses, 19 monophyletic genera are accepted within Ornithogaloideae: Albuca, Avonsera, Battandiera, Cathissa, Coilonox, Dipcadi, Eliokarmos, Elsiea, Ethesia, Galtonia, Honorius, Loncomelos, Melomphis, Neopatersonia, Nicipe, Ornithogalum, Pseudogaltonia, Stellarioides and Trimelopter. Each of these has a particular syndrome of morphological characters. As a result, 105 new combinations are made and two new names are proposed to accommodate the taxa studied in the new arrangement. A short morphological diagnosis, synonymy, details of distribution and an identification key are presented. PMID:21163815
Huszar, Tunde I; Jobling, Mark A; Wetton, Jon H
2018-04-12
Short tandem repeats on the male-specific region of the Y chromosome (Y-STRs) are permanently linked as haplotypes, and therefore Y-STR sequence diversity can be considered within the robust framework of a phylogeny of haplogroups defined by single nucleotide polymorphisms (SNPs). Here we use massively parallel sequencing (MPS) to analyse the 23 Y-STRs in Promega's prototype PowerSeq™ Auto/Mito/Y System kit (containing the markers of the PowerPlex® Y23 [PPY23] System) in a set of 100 diverse Y chromosomes whose phylogenetic relationships are known from previous megabase-scale resequencing. Including allele duplications and alleles resulting from likely somatic mutation, we characterised 2311 alleles, demonstrating 99.83% concordance with capillary electrophoresis (CE) data on the same sample set. The set contains 267 distinct sequence-based alleles (an increase of 58% compared to the 169 detectable by CE), including 60 novel Y-STR variants phased with their flanking sequences which have not been reported previously to our knowledge. Variation includes 46 distinct alleles containing non-reference variants of SNPs/indels in both repeat and flanking regions, and 145 distinct alleles containing repeat pattern variants (RPV). For DYS385a,b, DYS481 and DYS390 we observed repeat count variation in short flanking segments previously considered invariable, and suggest new MPS-based structural designations based on these. We considered the observed variation in the context of the Y phylogeny: several specific haplogroup associations were observed for SNPs and indels, reflecting the low mutation rates of such variant types; however, RPVs showed less phylogenetic coherence and more recurrence, reflecting their relatively high mutation rates. In conclusion, our study reveals considerable additional diversity at the Y-STRs of the PPY23 set via MPS analysis, demonstrates high concordance with CE data, facilitates nomenclature standardisation, and places Y-STR sequence variants in their phylogenetic context. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Coscarón, Sixto; Coscarón, María Del Carmen; Gil-Azevedo, Leonardo H
2013-02-20
The previously unknown pupa of Heterostomus curvipalpis Bigot is described. The morphology of the pupa of Heterostomus is compared with the pupae of Xylophagomorpha, Tabanomorpha and Stratiomyomorpha families, based on five characters, and its phylogenetic position is discussed. Currently placed in Xylophagidae, we conclude that the pupa of Heterostomus shares most characters with the pupae of Pelecorhynchidae.
Karaulov, Alexander; Aleshkin, Vladimir; Slobodenyuk, Vladimir; Grechishnikova, Olga; Afanasyev, Stanislav; Lapin, Boris; Dzhikidze, Eteri; Nesvizhsky, Yuriy; Evsegneeva, Irina; Voropayeva, Elena; Afanasyev, Maxim; Aleshkin, Andrei; Metelskaya, Valeria; Yegorova, Ekaterina; Bayrakova, Alexandra
2010-01-01
Based on the results of the comparative analysis concerning relatedness and evolutional difference of the 16S-23S nucleotide sequences of the middle ribosomal cluster and 23S rRNA I domain, and based on identification of phylogenetic position for Chlamydophila pneumoniae and Chlamydia trichomatis strains released from monkeys, relatedness of the above stated isolates with similar strains released from humans and with strains having nucleotide sequences presented in the GenBank electronic database has been detected for the first time ever. Position of these isolates in the Chlamydiaceae family phylogenetic tree has been identified. The evolutional position of the investigated original Chlamydia and Chlamydophila strains close to analogous strains from the Gen-Bank electronic database has been demonstrated. Differences in the 16S-23S nucleotide sequence of the middle ribosomal cluster and 23S rRNA I domain of plasmid and nonplasmid Chlamydia trachomatis strains released from humans and monkeys relative to different genotype groups (group B-B, Ba, D, Da, E, L1, L2, L2a; intermediate group-F, G, Ga) have been revealed for the first time ever. Abnormality in incA chromosomal gene expression resulting in Chlamydia life development cycle disorder, and decrease of Chlamydia virulence can be related to probable changes in the nucleotide sequence of the gene under consideration.
Vela, Ana I; Casas-Díaz, Encarna; Lavín, Santiago; Domínguez, Lucas; Fernández-Garayzábal, Jose F
2015-09-01
Four isolates of an unknown Gram-stain-positive, catalase-negative coccus-shaped organism, isolated from the pharynx of four wild rabbits, were characterized by phenotypic and molecular genetic methods. The micro-organisms were tentatively assigned to the genus Streptococcus based on cellular morphological and biochemical criteria, although the organisms did not appear to correspond to any species with a validly published name. Comparative 16S rRNA gene sequencing confirmed their identification as members of the genus Streptococcus, being most closely related phylogenetically to Streptococcus porcorum 682-03(T) (96.9% 16S rRNA gene sequence similarity). Analysis of rpoB and sodA gene sequences showed divergence values between the novel species and S. porcorum 682-03(T) (the closest phylogenetic relative determined from 16S rRNA gene sequences) of 18.1 and 23.9%, respectively. The novel bacterial isolate could be distinguished from the type strain of S. porcorum by several biochemical characteristics, such as the production of glycyl-tryptophan arylamidase and α-chymotrypsin, and the non-acidification of different sugars. Based on both phenotypic and phylogenetic findings, it is proposed that the unknown bacterium be assigned to a novel species of the genus Streptococcus, and named Streptococcus pharyngis sp. nov. The type strain is DICM10-00796B(T) ( = CECT 8754(T) = CCUG 66496(T)).
ERIC Educational Resources Information Center
Maier, Caroline Alexandra
2001-01-01
Presents an activity in which students seek answers to questions about evolutionary relationships by using genetic databases and bioinformatics software. Students build genetic distance matrices and phylogenetic trees based on molecular sequence data using web-based resources. Provides a flowchart of steps involved in accessing, retrieving, and…
Pearson, Alannah; Groves, Colin; Cardini, Andrea
2015-11-01
In 2004, an analysis by Lockwood and colleagues of hard-tissue morphology, using geometric morphometrics on the temporal bone, succeeded in recovering the correct phylogeny of living hominids without resorting to potentially problematic methods for transforming continuous shape variables into meristic characters. That work has increased hope that by using modern analytical methods and phylogenetically informative anatomical data we might one day be able to accurately infer the relationships of hominins, including the closest extinct relatives of modern humans. In the present study, using 3D virtually generated models of the hominid temporal bone and a larger suite of geometric morphometric and comparative techniques, we have re-examined the evidence for a Pan-Homo clade. Despite differences in samples, as well as the type of raw data, the effect of measurement error (and especially landmark digitization by a different operator), but also a broader perspective brought in by our diverse set of approaches, our reanalysis largely supports Lockwood and colleagues' original results. However, by focusing not only mainly on shape (as in the original 2004 analysis) but also on size and 'size-corrected' (non-allometric) shape, we demonstrate that the strong phylogenetic signal in the temporal bone is largely related to similarities in size. Thus, with this study, we are not suggesting the use of a single 'character', such as size, for phylogenetic inference, but we do challenge the common view that shape, with its highly complex and multivariate nature, is necessarily more phylogenetically informative than size and that actually size and size-related shape variation (i.e., allometry) confound phylogenetic inference based on morphology. This perspective may in fact be less generalizable than often believed. Thus, while we confirm the original findings by Lockwood et al., we provide a deep reinterpretation of their nature and potential implications for hominid phylogenetics and we show how crucial it is not to overlook size in geometric morphometric analyses. Copyright © 2015 Elsevier Ltd. All rights reserved.
J. Geml; I. Timling; C.H. Robinson; N. Lennon; H.C. Nusbaum; C. Brochmann; M.E. Noordeloos; D.L. Taylor
2011-01-01
Current evidence from temperate studies suggests that ectomycorrhizal (ECM) fungi require overland routes for migration because of their obligate symbiotic associations with woody plants. Despite their key roles in arctic ecosystems, the phylogenetic diversity and phylogeography of arctic ECM fungi remains little known. Here we assess the phylogenetic diversity of ECM...
Lee, Chang Soo; Lee, Jiyoung
2010-09-01
A rapid and specific gyrB-based real-time PCR system has been developed for detecting Bacteroides fragilis as a human-specific marker of fecal contamination. Its specificity and sensitivity was evaluated by comparison with other 16S rRNA gene-based primers using closely related Bacteroides and Prevotella. Many studies have used 16S rRNA gene-based method targeting Bacteroides because this genus is relatively abundant in human feces and is useful for microbial source tracking. However, 16S rRNA gene-based primers are evolutionarily too conserved among taxa to discriminate between human-specific species of Bacteroides and other closely related genera, such as Prevotella. Recently, one of the housekeeping genes, gyrB, has been used as an alternative target in multilocus sequence analysis (MLSA) to provide greater phylogenetic resolution. In this study, a new B. fragilis-specific primer set (Bf904F/Bf958R) was designed by alignments of 322 gyrB genes and was compared with the performance of the 16S rRNA gene-based primers in the presence of B. fragilis, Bacteroides ovatus and Prevotella melaninogenica. Amplicons were sequenced and a phylogenetic tree was constructed to confirm the specificity of the primers to B. fragilis. The gyrB-based primers successfully discriminated B. fragilis from B. ovatus and P. melaninogenica. Real-time PCR results showed that the gyrB primer set had a comparable sensitivity in the detection of B. fragilis when compared with the 16S rRNA primer set. The host-specificity of our gyrB-based primer set was validated with human, pig, cow, and dog fecal samples. The gyrB primer system had superior human-specificity. The gyrB-based system can rapidly detect human-specific fecal source and can be used for improved source tracking of human contamination. (c) 2010 Elsevier B.V. All rights reserved.
dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees.
Wise, Michael J
2016-01-01
Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. Finally, sequences with high cladistic information produce more consistent trees for the same taxa.
dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees
2016-01-01
Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. Finally, sequences with high cladistic information produce more consistent trees for the same taxa. PMID:27898695
Kassian, Alexei
2015-01-01
A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies.
Kassian, Alexei
2015-01-01
A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456
Dor, Roi; Carling, Matthew D; Lovette, Irby J; Sheldon, Frederick H; Winkler, David W
2012-10-01
The New World swallow genus Tachycineta comprises nine species that collectively have a wide geographic distribution and remarkable variation both within- and among-species in ecologically important traits. Existing phylogenetic hypotheses for Tachycineta are based on mitochondrial DNA sequences, thus they provide estimates of a single gene tree. In this study we sequenced multiple individuals from each species at 16 nuclear intron loci. We used gene concatenated approaches (Bayesian and maximum likelihood) as well as coalescent-based species tree inference to reconstruct phylogenetic relationships of the genus. We examined the concordance and conflict between the nuclear and mitochondrial trees and between concatenated and coalescent-based inferences. Our results provide an alternative phylogenetic hypothesis to the existing mitochondrial DNA estimate of phylogeny. This new hypothesis provides a more accurate framework in which to explore trait evolution and examine the evolution of the mitochondrial genome in this group. Copyright © 2012 Elsevier Inc. All rights reserved.
Dool, Serena E; Puechmaille, Sebastien J; Foley, Nicole M; Allegrini, Benjamin; Bastian, Anna; Mutumi, Gregory L; Maluleke, Tinyiko G; Odendaal, Lizelle J; Teeling, Emma C; Jacobs, David S
2016-04-01
Despite many studies illustrating the perils of utilising mitochondrial DNA in phylogenetic studies, it remains one of the most widely used genetic markers for this purpose. Over the last decade, nuclear introns have been proposed as alternative markers for phylogenetic reconstruction. However, the resolution capabilities of mtDNA and nuclear introns have rarely been quantified and compared. In the current study we generated a novel ∼5kb dataset comprising six nuclear introns and a mtDNA fragment. We assessed the relative resolution capabilities of the six intronic fragments with respect to each other, when used in various combinations together, and when compared to the traditionally used mtDNA. We focused on a major clade in the horseshoe bat family (Afro-Palaearctic clade; Rhinolophidae) as our case study. This old, widely distributed and speciose group contains a high level of conserved morphology. This morphological stasis renders the reconstruction of the phylogeny of this group with traditional morphological characters complex. We sampled multiple individuals per species to represent their geographic distributions as best as possible (122 individuals, 24 species, 68 localities). We reconstructed the species phylogeny using several complementary methods (partitioned Maximum Likelihood and Bayesian and Bayesian multispecies-coalescent) and made inferences based on consensus across these methods. We computed pairwise comparisons based on Robinson-Foulds tree distance metric between all Bayesian topologies generated (27,000) for every gene(s) and visualised the tree space using multidimensional scaling (MDS) plots. Using our supported species phylogeny we estimated the ancestral state of key traits of interest within this group, e.g. echolocation peak frequency which has been implicated in speciation. Our results revealed many potential cryptic species within this group, even in taxa where this was not suspected a priori and also found evidence for mtDNA introgression. We demonstrated that by using just two introns one can recover a better supported species tree than when using the mtDNA alone, despite the shorter overall length of the combined introns. Additionally, when combining any single intron with mtDNA, we showed that the result is highly similar to the mtDNA gene tree and far from the true species tree and therefore this approach should be avoided. We caution against the indiscriminate use of mtDNA in phylogenetic studies and advocate for pilot studies to select nuclear introns. The selection of marker type and number is a crucial step that is best based on critical examination of preliminary or previously published data. Based on our findings and previous publications, we recommend the following markers to recover phylogenetic relationships between recently diverged taxa (<20 My) in bats and other mammals: ACOX2, COPS7A, BGN, ROGDI and STAT5A. Copyright © 2016 Elsevier Inc. All rights reserved.
Diogo, Rui; Ziermann, Janine M; Linde-Medina, Marta
2015-05-01
The notion of scala naturae dates back to thinkers such as Aristotle, who placed plants below animals and ranked the latter along a graded scale of complexity from 'lower' to 'higher' animals, such as humans. In the last decades, evolutionary biologists have tended to move from one extreme (i.e. the idea of scala naturae or the existence of a general evolutionary trend in complexity from 'lower' to "higher" taxa, with Homo sapiens as the end stage) to the other, opposite, extreme (i.e. to avoid using terms such as 'phylogenetically basal' and 'anatomically plesiomorphic' taxa, which are seen as the undesired vestige of old teleological theories). The latter view tries to avoid any possible connotations with the original anthropocentric idea of a scala naturae crowned by man and, in that sense, it can be regarded as a more politically correct view. In the past years and months there has been renewed interest in these topics, which have been discussed in various papers and monographs that tend to subscribe, in general, to the points defended in the more politically correct view. Importantly, most evolutionary and phylogenetic studies of tetrapods and other vertebrates, and therefore most discussions on the scala naturae and related issues have been based on hard tissue and, more recently, on molecular data. Here we provide the first discussion of these topics based on a comparative myological study of all the major vertebrate clades and of myological cladistic and Bayesian phylogenetic analyses of bony fish and tetrapods, including Primates. We specifically (i) contradict the notions of a scala naturae or evolutionary progressive trends leading to more complexity in 'higher' animals and culminating in Homo sapiens, and (ii) stress that the refutation of these old notions does not necessarily mean that one should not keep using the terms 'phylogenetically basal' and particularly 'anatomically plesiomorphic' to refer to groups such as the urodeles within the Tetrapoda, or the strepsirrhines and lemurs within the Primates, for instance. This review will contribute to improving our understanding of these broad evolutionary issues and of the evolution of the vertebrate Bauplans, and hopefully will stimulate future phylogenetic, evolutionary and developmental studies of these clades. © 2014 The Authors. Biological Reviews © 2014 Cambridge Philosophical Society.
2011-01-01
Background The genus Pyrus belongs to the tribe Pyreae (the former subfamily Maloideae) of the family Rosaceae, and includes one of the most important commercial fruit crops, pear. The phylogeny of Pyrus has not been definitively reconstructed. In our previous efforts, the internal transcribed spacer region (ITS) revealed a poorly resolved phylogeny due to non-concerted evolution of nrDNA arrays. Therefore, introns of low copy nuclear genes (LCNG) are explored here for improved resolution. However, paralogs and lineage sorting are still two challenges for applying LCNGs in phylogenetic studies, and at least two independent nuclear loci should be compared. In this work the second intron of LEAFY and the alcohol dehydrogenase gene (Adh) were selected to investigate their molecular evolution and phylogenetic utility. Results DNA sequence analyses revealed a complex ortholog and paralog structure of Adh genes in Pyrus and Malus, the pears and apples. Comparisons between sequences from RT-PCR and genomic PCR indicate that some Adh homologs are putatively nonfunctional. A partial region of Adh1 was sequenced for 18 Pyrus species and three subparalogs representing Adh1-1 were identified. These led to poorly resolved phylogenies due to low sequence divergence and the inclusion of putative recombinants. For the second intron of LEAFY, multiple inparalogs were discovered for both LFY1int2 and LFY2int2. LFY1int2 is inadequate for phylogenetic analysis due to lineage sorting of two inparalogs. LFY2int2-N, however, showed a relatively high sequence divergence and led to the best-resolved phylogeny. This study documents the coexistence of outparalogs and inparalogs, and lineage sorting of these paralogs and orthologous copies. It reveals putative recombinants that can lead to incorrect phylogenetic inferences, and presents an improved phylogenetic resolution of Pyrus using LFY2int2-N. Conclusions Our study represents the first phylogenetic analyses based on LCNGs in Pyrus. Ancient and recent duplications lead to a complex structure of Adh outparalogs and inparalogs in Pyrus and Malus, resulting in neofunctionalization, nonfunctionalization and possible subfunctionalization. Among all investigated orthologs, LFY2int2-N is the best nuclear marker for phylogenetic reconstruction of Pyrus due to suitable sequence divergence and the absence of lineage sorting. PMID:21917170
The chordate proteome history database.
Levasseur, Anthony; Paganini, Julien; Dainat, Jacques; Thompson, Julie D; Poch, Olivier; Pontarotti, Pierre; Gouret, Philippe
2012-01-01
The chordate proteome history database (http://ioda.univ-provence.fr) comprises some 20,000 evolutionary analyses of proteins from chordate species. Our main objective was to characterize and study the evolutionary histories of the chordate proteome, and in particular to detect genomic events and automatic functional searches. Firstly, phylogenetic analyses based on high quality multiple sequence alignments and a robust phylogenetic pipeline were performed for the whole protein and for each individual domain. Novel approaches were developed to identify orthologs/paralogs, and predict gene duplication/gain/loss events and the occurrence of new protein architectures (domain gains, losses and shuffling). These important genetic events were localized on the phylogenetic trees and on the genomic sequence. Secondly, the phylogenetic trees were enhanced by the creation of phylogroups, whereby groups of orthologous sequences created using OrthoMCL were corrected based on the phylogenetic trees; gene family size and gene gain/loss in a given lineage could be deduced from the phylogroups. For each ortholog group obtained from the phylogenetic or the phylogroup analysis, functional information and expression data can be retrieved. Database searches can be performed easily using biological objects: protein identifier, keyword or domain, but can also be based on events, eg, domain exchange events can be retrieved. To our knowledge, this is the first database that links group clustering, phylogeny and automatic functional searches along with the detection of important events occurring during genome evolution, such as the appearance of a new domain architecture.
Cisneros, Laura M; Fagan, Matthew E; Willig, Michael R
2016-01-01
Assembly of species into communities following human disturbance (e.g., deforestation, fragmentation) may be governed by spatial (e.g., dispersal) or environmental (e.g., niche partitioning) mechanisms. Variation partitioning has been used to broadly disentangle spatial and environmental mechanisms, and approaches utilizing functional and phylogenetic characteristics of communities have been implemented to determine the relative importance of particular environmental (or niche-based) mechanisms. Nonetheless, few studies have integrated these quantitative approaches to comprehensively assess the relative importance of particular structuring processes. We employed a novel variation partitioning approach to evaluate the relative importance of particular spatial and environmental drivers of taxonomic, functional, and phylogenetic aspects of bat communities in a human-modified landscape in Costa Rica. Specifically, we estimated the amount of variation in species composition (taxonomic structure) and in two aspects of functional and phylogenetic structure (i.e., composition and dispersion) along a forest loss and fragmentation gradient that are uniquely explained by landscape characteristics (i.e., environment) or space to assess the importance of competing mechanisms. The unique effects of space on taxonomic, functional and phylogenetic structure were consistently small. In contrast, landscape characteristics (i.e., environment) played an appreciable role in structuring bat communities. Spatially-structured landscape characteristics explained 84% of the variation in functional or phylogenetic dispersion, and the unique effects of landscape characteristics significantly explained 14% of the variation in species composition. Furthermore, variation in bat community structure was primarily due to differences in dispersion of species within functional or phylogenetic space along the gradient, rather than due to differences in functional or phylogenetic composition. Variation among bat communities was related to environmental mechanisms, especially niche-based (i.e., environmental) processes, rather than spatial mechanisms. High variation in functional or phylogenetic dispersion, as opposed to functional or phylogenetic composition, suggests that loss or gain of niche space is driving the progressive loss or gain of species with particular traits from communities along the human-modified gradient. Thus, environmental characteristics associated with landscape structure influence functional or phylogenetic aspects of bat communities by effectively altering the ways in which species partition niche space.
Fagan, Matthew E.; Willig, Michael R.
2016-01-01
Background Assembly of species into communities following human disturbance (e.g., deforestation, fragmentation) may be governed by spatial (e.g., dispersal) or environmental (e.g., niche partitioning) mechanisms. Variation partitioning has been used to broadly disentangle spatial and environmental mechanisms, and approaches utilizing functional and phylogenetic characteristics of communities have been implemented to determine the relative importance of particular environmental (or niche-based) mechanisms. Nonetheless, few studies have integrated these quantitative approaches to comprehensively assess the relative importance of particular structuring processes. Methods We employed a novel variation partitioning approach to evaluate the relative importance of particular spatial and environmental drivers of taxonomic, functional, and phylogenetic aspects of bat communities in a human-modified landscape in Costa Rica. Specifically, we estimated the amount of variation in species composition (taxonomic structure) and in two aspects of functional and phylogenetic structure (i.e., composition and dispersion) along a forest loss and fragmentation gradient that are uniquely explained by landscape characteristics (i.e., environment) or space to assess the importance of competing mechanisms. Results The unique effects of space on taxonomic, functional and phylogenetic structure were consistently small. In contrast, landscape characteristics (i.e., environment) played an appreciable role in structuring bat communities. Spatially-structured landscape characteristics explained 84% of the variation in functional or phylogenetic dispersion, and the unique effects of landscape characteristics significantly explained 14% of the variation in species composition. Furthermore, variation in bat community structure was primarily due to differences in dispersion of species within functional or phylogenetic space along the gradient, rather than due to differences in functional or phylogenetic composition. Discussion Variation among bat communities was related to environmental mechanisms, especially niche-based (i.e., environmental) processes, rather than spatial mechanisms. High variation in functional or phylogenetic dispersion, as opposed to functional or phylogenetic composition, suggests that loss or gain of niche space is driving the progressive loss or gain of species with particular traits from communities along the human-modified gradient. Thus, environmental characteristics associated with landscape structure influence functional or phylogenetic aspects of bat communities by effectively altering the ways in which species partition niche space. PMID:27761338
USDA-ARS?s Scientific Manuscript database
Phylogenetic relatedness among ascomycetous yeast genera (subphylum Saccharomycotina, phylum Ascomycota) has been uncertain. In the present study, type species of 70 currently recognized genera are compared from divergence in the nearly entire nuclear gene sequences for large subunit rRNA, small sub...
Constructing Student Problems in Phylogenetic Tree Construction.
ERIC Educational Resources Information Center
Brewer, Steven D.
Evolution is often equated with natural selection and is taught from a primarily functional perspective while comparative and historical approaches, which are critical for developing an appreciation of the power of evolutionary theory, are often neglected. This report describes a study of expert problem-solving in phylogenetic tree construction.…
USDA-ARS?s Scientific Manuscript database
The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on ...
Meyer, Joana B; Trapiello, Estefanía; Senn-Irlet, Beatrice; Sieber, Thomas N; Cornejo, Carolina; Aghayeva, Dilzara; González, Ana J; Prospero, Simone
2017-08-01
In this paper we resolve the taxonomic status of the fungus Diplodina castaneae (Ascomycetes, Diaporthales, Gnomoniaceae) which occurs on the European chestnut (Castanea sativa) as endophyte and as the causal agent of Javart disease. Specimens from Switzerland, Spain, and Azerbaijan were sequenced at five nuclear loci (β-tubulin, EF-1α, ITS, LSU, and RPB2). Phylogenies were inferred to place D. castaneae in the Gnomoniaceae family. Moreover, growth rates and morphological characteristics on different agar media were assessed and compared to those of Gnomoniopsis castaneae, which can easily be confused with D. castaneae. Based on morphological and phylogenetic characteristics, we propose to reallocate D. castaneae to the genus Sirococcus, as S. castaneae comb. nov. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Unrealistic phylogenetic trees may improve phylogenetic footprinting.
Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo
2017-06-01
The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. : martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Disentangling the phylogenetic and ecological components of spider phenotypic variation.
Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo
2014-01-01
An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure.
Disentangling the Phylogenetic and Ecological Components of Spider Phenotypic Variation
Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo
2014-01-01
An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure. PMID:24651264
Alignment methods: strategies, challenges, benchmarking, and comparative overview.
Löytynoja, Ari
2012-01-01
Comparative evolutionary analyses of molecular sequences are solely based on the identities and differences detected between homologous characters. Errors in this homology statement, that is errors in the alignment of the sequences, are likely to lead to errors in the downstream analyses. Sequence alignment and phylogenetic inference are tightly connected and many popular alignment programs use the phylogeny to divide the alignment problem into smaller tasks. They then neglect the phylogenetic tree, however, and produce alignments that are not evolutionarily meaningful. The use of phylogeny-aware methods reduces the error but the resulting alignments, with evolutionarily correct representation of homology, can challenge the existing practices and methods for viewing and visualising the sequences. The inter-dependency of alignment and phylogeny can be resolved by joint estimation of the two; methods based on statistical models allow for inferring the alignment parameters from the data and correctly take into account the uncertainty of the solution but remain computationally challenging. Widely used alignment methods are based on heuristic algorithms and unlikely to find globally optimal solutions. The whole concept of one correct alignment for the sequences is questionable, however, as there typically exist vast numbers of alternative, roughly equally good alignments that should also be considered. This uncertainty is hidden by many popular alignment programs and is rarely correctly taken into account in the downstream analyses. The quest for finding and improving the alignment solution is complicated by the lack of suitable measures of alignment goodness. The difficulty of comparing alternative solutions also affects benchmarks of alignment methods and the results strongly depend on the measure used. As the effects of alignment error cannot be predicted, comparing the alignments' performance in downstream analyses is recommended.
Phylogenetic signal, feeding behaviour and brain volume in Neotropical bats.
Rojas, D; Mancina, C A; Flores-Martínez, J J; Navarro, L
2013-09-01
Comparative correlational studies of brain size and ecological traits (e.g. feeding habits and habitat complexity) have increased our knowledge about the selective pressures on brain evolution. Studies conducted in bats as a model system assume that shared evolutionary history has a maximum effect on the traits. However, this effect has not been quantified. In addition, the effect of levels of diet specialization on brain size remains unclear. We examined the role of diet on the evolution of brain size in Mormoopidae and Phyllostomidae using two comparative methods. Body mass explained 89% of the variance in brain volume. The effect of feeding behaviour (either characterized as feeding habits, as levels of specialization on a type of item or as handling behaviour) on brain volume was also significant albeit not consistent after controlling for body mass and the strength of the phylogenetic signal (λ). Although the strength of the phylogenetic signal of brain volume and body mass was high when tested individually, λ values in phylogenetic generalized least squares models were significantly different from 1. This suggests that phylogenetic independent contrasts models are not always the best approach for the study of ecological correlates of brain size in New World bats. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.
Fossil embryos from the Middle and Late Cambrian period of Hunan, south China.
Dong, Xi-Ping; Donoghue, Philip C J; Cheng, Hong; Liu, Jian-Bo
2004-01-15
Comparative embryology is integral to uncovering the pattern and process of metazoan phylogeny, but it relies on the assumption that life histories of living taxa are representative of their antecedents. Fossil embryos provide a crucial test of this assumption and, potentially, insight into the evolution of development, but because discoveries so far lack phylogenetic constraint, their significance is moot. Here we describe a collection of embryos from the Middle and Late Cambrian period (500 million years ago) of Hunan, south China, that preserves stages of development from cleavage to the pre-hatching embryo of a direct-developing animal comparable to living Scalidophora (phyla Priapulida, Kinorhyncha, Loricifera). The latest-stage embryos show affinity to the Lower Cambrian embryo Markuelia, whose life-history strategy contrasts both with the primitive condition inferred for metazoan phyla and with many proposed hypotheses of affinity, all of which prescribe indirect development. Phylogenetic tests based on these embryological data suggest a stem Scalidophora affinity. These discoveries corroborate, rather than contradict, the predictions of comparative embryology, providing direct historical support for the view that the life-history strategies of living taxa are representative of their stem lineages.
NASA Astrophysics Data System (ADS)
Poff, N.; Vieira, N. K.; Simmons, M. P.; Olden, J. D.; Kondratieff, B. C.; Finn, D. S.
2005-05-01
The use of species traits as indicators of environmental disturbance is being considered for biomonitoring programs globally. As such, methods to select relevant and informative traits for inclusion in biometrics need to be developed. In this research, we identified 20 traits of aquatic insects within six trait groups: morphology, mobility, life-history strategy, thermal tolerance, feeding guild and ecology (e.g., habitat preference). We constructed phylogenetic trees for 1) all lotic insect species of North America and 2) all Ephemeroptera, Plecoptera and Trichoptera species based on morphology- and molecular-based analyses and classifications. We then measured variability (i.e., plasticity) of the 20 traits and six trait groups across the two phylogenetic trees. Traits with higher degrees of plasticity indicated traits that were less phylogenetically constrained, and were considered informative for biomonitoring purposes. Thermal tolerance, rheophily, body size at maturity and feeding guild showed the highest plasticity across both phylogenetic trees. Two mobility traits, occurrence in drift and adult dispersal distance, showed moderate plasticity. By contrast, adult exiting ability, degree of attachment, adult lifespan and body shape showed low variability and were thus less informative. Plastic species traits that are less phylogenetically constrained may be most useful in detecting community change along environmental gradients.
Zheng, Xiaoyan; Cai, Danying; Potter, Daniel; Postman, Joseph; Liu, Jing; Teng, Yuanwen
2014-11-01
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence datasets. Phylogenetic trees based on both cpDNA and nuclear LFY2int2-N (LN) data resulted in poor resolution, especially, only five primary species were monophyletic in the LN tree. A phylogenetic network of LN suggested that reticulation caused by hybridization is one of the major evolutionary processes for Pyrus species. Polytomies of the gene trees and star-like structure of cpDNA networks suggested rapid radiation is another major evolutionary process, especially for the occidental species. Pyrus calleryana and P. regelii were the earliest diverged Pyrus species. Two North African species, P. cordata, P. spinosa and P. betulaefolia were descendent of primitive stock Pyrus species and still share some common molecular characters. Southwestern China, where a large number of P. pashia populations are found, is probably the most important diversification center of Pyrus. More accessions and nuclear genes are needed for further understanding the evolutionary histories of Pyrus. Copyright © 2014 Elsevier Inc. All rights reserved.
Nauheimer, Lars; Schley, Rowan J; Clements, Mark A; Micheneau, Claire; Nargar, Katharina
2018-06-02
Australia harbours a rich and highly endemic orchid flora, with c. 90 % of species endemic to the country. Despite that, the biogeographic history of Australasian orchid lineages is only poorly understood. Here we examined evolutionary relationships and the spatio-temporal evolution of the sun orchids (Thelymitra, 119 species), which display disjunct distribution patterns frequently found in Australasian orchid lineages. Phylogenetic analyses were conducted based on one nuclear (ITS) and three plastid markers (matK, psbJ-petA, ycf1) using Maximum Likelihood and Bayesian inference. Divergence time estimations were carried out with a relaxed molecular clock in a Bayesian framework. Ancestral ranges were estimated using the dispersal-extinction-cladogenesis model and an area coding based on major disjunctions. The phylogenetic analyses clarified intergeneric relationships within Thelymitrinae, with Epiblema being sister to Thelymitra plus Calochilus, both of which were well-supported. Within Thelymitra, eight major and several minor clades were retrieved in the nuclear and plastid phylogenetic reconstructions. Five major clades corresponded to species complexes previously recognized based on morphological characters, whereas other previously recognized species groups were found to be paraphyletic. Conflicting signals between the nuclear and plastid phylogenetic reconstructions provided support for hybridization and plastid capture events both in the deeper evolutionary history of the genus and more recently. Divergence time estimation placed the origin of Thelymitra in the late Miocene (c. 10.8 Ma) and the origin of the majority of the main clades within Thelymitra during the late Pliocene and early Pleistocene, with the majority of extant species arising during the Pleistocene. Ancestral range reconstruction revealed that the early diversification of the genus in the late Miocene and Pliocene took place predominantly in southwest Australia, where most species with highly restricted distributional ranges occur. Several long-distance dispersal events eastwards across the Nullarbor Plain were inferred, recurrently resulting in lineage divergence within the genus. The predominant eastwards direction of long-distance dispersal events in Thelymitra highlights the importance of the West Wind Drift for the present-day distribution of the genus, giving rise to the Thelymitra floras of Tasmania, New Zealand and New Caledonia, which were inferred to be of comparatively recent origin. Copyright © 2018. Published by Elsevier Inc.
USDA-ARS?s Scientific Manuscript database
The 10 species of Streptomyces implicated as the etiological agents in scab disease of potatoes or soft rot disease of sweet potatoes are distributed among 7 different phylogenetic clades in analyses based on 16S rRNA gene sequences, but high sequence similarity of this gene among Streptomyces speci...
Liu, Ai-Rong; Chen, Shuang-Chen; Wu, Shang-Ying; Xu, Tong; Guo, Liang-Dong; Jeewon, Rajesh; Wei, Ji-Guang
2010-11-01
Previous phylogenetic studies based on DNA sequence data have partially resolved taxonomic relationships among Pestalotiopsis species. There are still some morphological characters whose phylogenetic significance have not been assessed properly due to limited taxon sampling, in particular the degree of pigmentation of median cells. In this study, the stability of pigmentation of median cells of conidia in Pestalotiopsis species was evaluated in subculture, and a molecular phylogenetic analysis was conducted on 45 strains belonging to 26 species in order to reappraise the pigmentation of median cells for its significance in the taxonomy of Pestalotiopsis. Phylogenetic relationships were inferred from nucleotide sequences in ITS regions (ITS1, 5.8S and ITS2) and β-tubulin 2 gene (tub2). The results showed that pigmentation of median cells was stable and it could be a key character in the taxonomy of Pestalotiopsis species. Instead of "concolorous" and "versicolor" proposed by Steyeart (1949), "brown to olivaceous" and "umber to fuliginous" are described and proposed in this paper. Copyright © 2010. Published by Elsevier Inc.
Gottschling, Marc; Soehner, Sylvia; Zinssmeister, Carmen; John, Uwe; Plötner, Jörg; Schweikert, Michael; Aligizaki, Katerina; Elbrächter, Malte
2012-01-01
The phylogenetic relationships of the Dinophyceae (Alveolata) are not sufficiently resolved at present. The Thoracosphaeraceae (Peridiniales) are the only group of the Alveolata that include members with calcareous coccoid stages; this trait is considered apomorphic. Although the coccoid stage apparently is not calcareous, Bysmatrum has been assigned to the Thoracosphaeraceae based on thecal morphology. We tested the monophyly of the Thoracosphaeraceae using large sets of ribosomal RNA sequence data of the Alveolata including the Dinophyceae. Phylogenetic analyses were performed using Maximum Likelihood and Bayesian approaches. The Thoracosphaeraceae were monophyletic, but included also a number of non-calcareous dinophytes (such as Pentapharsodinium and Pfiesteria) and even parasites (such as Duboscquodinium and Tintinnophagus). Bysmatrum had an isolated and uncertain phylogenetic position outside the Thoracosphaeraceae. The phylogenetic relationships among calcareous dinophytes appear complex, and the assumption of the single origin of the potential to produce calcareous structures is challenged. The application of concatenated ribosomal RNA sequence data may prove promising for phylogenetic reconstructions of the Dinophyceae in future. Copyright © 2011 Elsevier GmbH. All rights reserved.
Zhou, Xiaofan; Shen, Xing-Xing; Hittinger, Chris Todd
2018-01-01
Abstract The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs’ relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses. PMID:29177474
2012-01-01
Background The circumscription of the avian superfamily Sylvioidea is a matter of long ongoing debate. While the overall inclusiveness has now been mostly agreed on and 20 families recognised, the phylogenetic relationships among the families are largely unknown. We here present a phylogenetic hypothesis for Sylvioidea based on one mitochondrial and six nuclear markers, in total ~6.3 kbp, for 79 ingroup species representing all currently recognised families and some species with uncertain affinities, making this the most comprehensive analysis of this taxon. Results The resolution, especially of the deeper nodes, is much improved compared to previous studies. However, many relationships among families remain uncertain and are in need of verification. Most families themselves are very well supported based on the total data set and also by indels. Our data do not support the inclusion of Hylia in Cettiidae, but do not strongly reject a close relationship with Cettiidae either. The genera Scotocerca and Erythrocercus are closely related to Cettiidae, but separated by relatively long internodes. The families Paridae, Remizidae and Stenostiridae clustered among the outgroup taxa and not within Sylvioidea. Conclusions Although the phylogenetic position of Hylia is uncertain, we tentatively support the recognition of the family Hyliidae Bannerman, 1923 for this genus and Pholidornis. We propose new family names for the genera Scotocerca and Erythrocercus, Scotocercidae and Erythrocercidae, respectively, rather than including these in Cettiidae, and we formally propose the name Macrosphenidae, which has been in informal use for some time. We recommend that Paridae, Remizidae and Stenostiridae are not included in Sylvioidea. We also briefly discuss the problems of providing a morphological diagnosis when proposing a new family-group name (or genus-group name) based on a clade. PMID:22920688
Genome-Based Taxonomic Classification of Bacteroidetes
Hahnke, Richard L.; Meier-Kolthoff, Jan P.; García-López, Marina; Mukherjee, Supratim; Huntemann, Marcel; Ivanova, Natalia N.; Woyke, Tanja; Kyrpides, Nikos C.; Klenk, Hans-Peter; Göker, Markus
2016-01-01
The bacterial phylum Bacteroidetes, characterized by a distinct gliding motility, occurs in a broad variety of ecosystems, habitats, life styles, and physiologies. Accordingly, taxonomic classification of the phylum, based on a limited number of features, proved difficult and controversial in the past, for example, when decisions were based on unresolved phylogenetic trees of the 16S rRNA gene sequence. Here we use a large collection of type-strain genomes from Bacteroidetes and closely related phyla for assessing their taxonomy based on the principles of phylogenetic classification and trees inferred from genome-scale data. No significant conflict between 16S rRNA gene and whole-genome phylogenetic analysis is found, whereas many but not all of the involved taxa are supported as monophyletic groups, particularly in the genome-scale trees. Phenotypic and phylogenomic features support the separation of Balneolaceae as new phylum Balneolaeota from Rhodothermaeota and of Saprospiraceae as new class Saprospiria from Chitinophagia. Epilithonimonas is nested within the older genus Chryseobacterium and without significant phenotypic differences; thus merging the two genera is proposed. Similarly, Vitellibacter is proposed to be included in Aequorivita. Flexibacter is confirmed as being heterogeneous and dissected, yielding six distinct genera. Hallella seregens is a later heterotypic synonym of Prevotella dentalis. Compared to values directly calculated from genome sequences, the G+C content mentioned in many species descriptions is too imprecise; moreover, corrected G+C content values have a significantly better fit to the phylogeny. Corresponding emendations of species descriptions are provided where necessary. Whereas most observed conflict with the current classification of Bacteroidetes is already visible in 16S rRNA gene trees, as expected whole-genome phylogenies are much better resolved. PMID:28066339
Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A
2006-03-01
Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.
Extending the BEAGLE library to a multi-FPGA platform.
Jin, Zheming; Bakos, Jason D
2013-01-19
Maximum Likelihood (ML)-based phylogenetic inference using Felsenstein's pruning algorithm is a standard method for estimating the evolutionary relationships amongst a set of species based on DNA sequence data, and is used in popular applications such as RAxML, PHYLIP, GARLI, BEAST, and MrBayes. The Phylogenetic Likelihood Function (PLF) and its associated scaling and normalization steps comprise the computational kernel for these tools. These computations are data intensive but contain fine grain parallelism that can be exploited by coprocessor architectures such as FPGAs and GPUs. A general purpose API called BEAGLE has recently been developed that includes optimized implementations of Felsenstein's pruning algorithm for various data parallel architectures. In this paper, we extend the BEAGLE API to a multiple Field Programmable Gate Array (FPGA)-based platform called the Convey HC-1. The core calculation of our implementation, which includes both the phylogenetic likelihood function (PLF) and the tree likelihood calculation, has an arithmetic intensity of 130 floating-point operations per 64 bytes of I/O, or 2.03 ops/byte. Its performance can thus be calculated as a function of the host platform's peak memory bandwidth and the implementation's memory efficiency, as 2.03 × peak bandwidth × memory efficiency. Our FPGA-based platform has a peak bandwidth of 76.8 GB/s and our implementation achieves a memory efficiency of approximately 50%, which gives an average throughput of 78 Gflops. This represents a ~40X speedup when compared with BEAGLE's CPU implementation on a dual Xeon 5520 and 3X speedup versus BEAGLE's GPU implementation on a Tesla T10 GPU for very large data sizes. The power consumption is 92 W, yielding a power efficiency of 1.7 Gflops per Watt. The use of data parallel architectures to achieve high performance for likelihood-based phylogenetic inference requires high memory bandwidth and a design methodology that emphasizes high memory efficiency. To achieve this objective, we integrated 32 pipelined processing elements (PEs) across four FPGAs. For the design of each PE, we developed a specialized synthesis tool to generate a floating-point pipeline with resource and throughput constraints to match the target platform. We have found that using low-latency floating-point operators can significantly reduce FPGA area and still meet timing requirement on the target platform. We found that this design methodology can achieve performance that exceeds that of a GPU-based coprocessor.
Angelini, David R.; Jockusch, Elizabeth L.
2008-01-01
Model species often provide initial hypotheses and tools for studies of development, genetics, and molecular evolution in closely related species. Flour beetles of the genus Tribolium MacLeay (1825) are one group with potential for such comparative studies. Tribolium castaneum (Herbst 1797) is an increasingly useful developmental genetic system. The convenience with which congeneric and other species of tenebrionid flour beetles can be reared in the laboratory makes this group attractive for comparative studies on a small phylogenetic scale. Here we present the results of phylogenetic analyses of relationships among the major pest species of Tribolium based on two mitochondrial and three nuclear markers (cytochrome oxidase 1, 16S ribosomal DNA, wingless, 28S ribosomal DNA, histone H3). The utility of partitioning the dataset in a manner informed by biological structure and function is demonstrated by comparing various partitioning strategies. In parsimony and partitioned Bayesian analyses of the combined dataset, the castaneum and confusum species groups are supported as monophyletic and as each other’s closest relatives. However, a sister group relationship between this clade and Tribolium brevicornis (Leconte 1859) is not supported. Therefore, we suggest transferring brevicornis group species to the genus Aphanotus Leconte (1862). The inferred phylogeny provides an evolutionary framework for comparative studies using flour beetles. PMID:18024090
Peña, Arantxa; Busquets, Antonio; Gomila, Margarita; ...
2016-09-01
Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less
Starrett, James; Derkarabetian, Shahan; Richart, Casey H.; Cabrero, Allan; Hedin, Marshal
2016-01-01
Abstract The monotypic genus Cryptomaster Briggs, 1969 was described based on individuals from a single locality in southwestern Oregon. The described species Cryptomaster leviathan Briggs, 1969 was named for its large body size compared to most travunioid Laniatores. However, as the generic name suggests, Cryptomaster are notoriously difficult to find, and few subsequent collections have been recorded for this genus. Here, we increase sampling of Cryptomaster to 15 localities, extending their known range from the Coast Range northeast to the western Cascade Mountains of southern Oregon. Phylogenetic analyses of mitochondrial and nuclear DNA sequence data reveal deep phylogenetic breaks consistent with independently evolving lineages. We use discovery and validation species delimitation approaches to generate and test species hypotheses, including a coalescent species delimitation method to test multi-species hypotheses. For delimited species, we use light microscopy and SEM to discover diagnostic morphological characters. Although Cryptomaster has a small geographic distribution, this taxon is consistent with other short-range endemics in having deep phylogenetic breaks indicative of species level divergences. Herein we describe Cryptomaster behemoth sp. n., and provide morphological diagnostic characters for identifying Cryptomaster leviathan and Cryptomaster behemoth. PMID:26877685
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peña, Arantxa; Busquets, Antonio; Gomila, Margarita
Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less
Benedict, John C.; Smith, Selena Y.; Specht, Chelsea D.; Collinson, Margaret E.; Leong-Škorničková, Jana; Parkinson, Dilworth Y.; Marone, Federica
2016-01-01
Phenotypic variation can be attributed to genetic heritability as well as biotic and abiotic factors. Across Zingiberales, there is a high variation in the number of species per clade and in phenotypic diversity. Factors contributing to this phenotypic variation have never been studied in a phylogenetic or ecological context. Seeds of 166 species from all eight families in Zingiberales were analyzed for 51 characters using synchrotron based 3D X-ray tomographic microscopy to determine phylogenetically informative characters and to understand the distribution of morphological disparity within the order. All families are distinguishable based on seed characters. Non-metric multidimensional scaling analyses show Zingiberaceae occupy the largest seed morphospace relative to the other families, and environmental analyses demonstrate that Zingiberaceae inhabit both temperate and tropical regions, while other Zingiberales are almost exclusively tropical. Temperate species do not cluster in morphospace nor do they share a common suite of character states. This suggests that the diversity seen is not driven by adaptation to temperate niches; rather, the morphological disparity seen likely reflects an underlying genetic plasticity that allowed Zingiberaceae to repeatedly colonize temperate environments. The notable morphoanatomical variety in Zingiberaceae seeds may account for their extraordinary ecological success and high species diversity as compared to other Zingiberales. PMID:27594701
Rearrangement moves on rooted phylogenetic networks
Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2017-01-01
Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network—that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves—named rNNI and rSPR—reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results—separating the contributions of horizontal and vertical moves—we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide a solid basis for practical phylogenetic network reconstruction. PMID:28763439
NASA Astrophysics Data System (ADS)
Porco, David; Deharveng, Louis
2009-08-01
The phylogeny of Collembola, originally discussed from a morphological point of view, has more recently benefited from novel insights brought by molecular analyses. Both morphological and molecular characters produced a well-resolved phylogenetic hypothesis including all orders, most families, and a large number of genera. However, several conflicting points exist between molecular and morphological data, and new characters are clearly needed to resolve these inconsistencies. In this study the usefulness of a new character type not previously used in the phylogenetic study of Collembola was tested: the epicuticular chemical compounds. Our phylogenetic analysis was based on 380 compounds from 26 Collembola species. The results show good resolution for terminal branches but not for internal nodes. This is probably due to the partial involvement of epicuticular lipids in ecological functions such as water conservation and sexual attraction. Thus, this character type is appropriate for reconstructing phylogenetic relationships among recently diversified groups.
Ota, Yuko; Yamanaka, Takashi; Murata, Hitoshi; Neda, Hitoshi; Ohta, Akira; Kawai, Masataka; Yamada, Akiyoshi; Konno, Miki; Tanaka, Chihiro
2012-01-01
Tricholoma matsutake (S. Ito & S. Imai) Singer and its allied species are referred to as matsutake worldwide and are the most economically important edible mushrooms in Japan. They are widely distributed in the northern hemisphere and established an ectomycorrhizal relationship with conifer and broadleaf trees. To clarify relationships among T. matsutake and its allies, and to delimit phylogenetic species, we analyzed multilocus datasets (ITS, megB1, tef, gpd) with samples that were correctly identified based on morphological characteristics. Phylogenetic analyses clearly identified four major groups: matsutake, T. bakamatsutake, T. fulvocastaneum and T. caligatum; the latter three species were outside the matsutake group. The haplotype analyses and median-joining haplotype network analyses showed that the matsutake group included four closely related but clearly distinct taxa (T. matsutake, T. anatolicum, Tricholoma sp. from Mexico and T. magnivelare) from different geographical regions; these were considered to be distinct phylogenetic species.
Phylogeny of anaerobic fungi (phylum Neocallimastigomycota), with contributions from yak in China.
Wang, Xuewei; Liu, Xingzhong; Groenewald, Johannes Z
2017-01-01
The phylum Neocallimastigomycota contains eight genera (about 20 species) of strictly anaerobic fungi. The evolutionary relationships of these genera are uncertain due to insufficient sequence data to infer their phylogenies. Based on morphology and molecular phylogeny, thirteen isolates obtained from yak faeces and rumen digesta in China were assigned to Neocallimastix frontalis (nine isolates), Orpinomyces joyonii (two isolates) and Caecomyces sp. (two isolates), respectively. The phylogenetic relationships of the eight genera were evaluated using complete ITS and partial LSU sequences, compared to the ITS1 region which has been widely used in this phylum in the past. Five monophyletic lineages corresponding to six of the eight genera were statistically supported. Isolates of Caecomyces and Cyllamyces were present in a single lineage and could not be separated properly. Members of Neocallimastigomycota with uniflagellate zoospores represented by Piromyces were polyphyletic. The Piromyces-like genus Oontomyces was consistently closely related to the traditional Anaeromyces, and separated the latter genus into two clades. The phylogenetic position of the Piromyces-like genus Buwchfawromyces remained unresolved. Orpinomyces and Neocallimastix, sharing polyflagellate zoospores, were supported as sister genera in the LSU phylogeny. Apparently ITS, specifically ITS1 alone, is not a good marker to resolve the generic affinities of the studied fungi. The LSU sequences are easier to align and appear to work well to resolve generic relationships. This study provides a comparative phylogenetic revision of Neocallimastigomycota isolates known from culture and sequence data.
The complete chloroplast genome sequence of Dodonaea viscosa: comparative and phylogenetic analyses.
Saina, Josphat K; Gichira, Andrew W; Li, Zhi-Zhong; Hu, Guang-Wan; Wang, Qing-Feng; Liao, Kuo
2018-02-01
The plant chloroplast (cp) genome is a highly conserved structure which is beneficial for evolution and systematic research. Currently, numerous complete cp genome sequences have been reported due to high throughput sequencing technology. However, there is no complete chloroplast genome of genus Dodonaea that has been reported before. To better understand the molecular basis of Dodonaea viscosa chloroplast, we used Illumina sequencing technology to sequence its complete genome. The whole length of the cp genome is 159,375 base pairs (bp), with a pair of inverted repeats (IRs) of 27,099 bp separated by a large single copy (LSC) 87,204 bp, and small single copy (SSC) 17,972 bp. The annotation analysis revealed a total of 115 unique genes of which 81 were protein coding, 30 tRNA, and four ribosomal RNA genes. Comparative genome analysis with other closely related Sapindaceae members showed conserved gene order in the inverted and single copy regions. Phylogenetic analysis clustered D. viscosa with other species of Sapindaceae with strong bootstrap support. Finally, a total of 249 SSRs were detected. Moreover, a comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates in D. viscosa showed very low values. The availability of cp genome reported here provides a valuable genetic resource for comprehensive further studies in genetic variation, taxonomy and phylogenetic evolution of Sapindaceae family. In addition, SSR markers detected will be used in further phylogeographic and population structure studies of the species in this genus.
COI (cytochrome oxidase-I) sequence based studies of Carangid fishes from Kakinada coast, India.
Persis, M; Chandra Sekhar Reddy, A; Rao, L M; Khedkar, G D; Ravinder, K; Nasruddin, K
2009-09-01
Mitochondrial DNA, cytochrome oxidase-1 gene sequences were analyzed for species identification and phylogenetic relationship among the very high food value and commercially important Indian carangid fish species. Sequence analysis of COI gene very clearly indicated that all the 28 fish species fell into five distinct groups, which are genetically distant from each other and exhibited identical phylogenetic reservation. All the COI gene sequences from 28 fishes provide sufficient phylogenetic information and evolutionary relationship to distinguish the carangid species unambiguously. This study proves the utility of mtDNA COI gene sequence based approach in identifying fish species at a faster pace.
Accurate Phylogenetic Tree Reconstruction from Quartets: A Heuristic Approach
Reaz, Rezwana; Bayzid, Md. Shamsuzzoha; Rahman, M. Sohel
2014-01-01
Supertree methods construct trees on a set of taxa (species) combining many smaller trees on the overlapping subsets of the entire set of taxa. A ‘quartet’ is an unrooted tree over taxa, hence the quartet-based supertree methods combine many -taxon unrooted trees into a single and coherent tree over the complete set of taxa. Quartet-based phylogeny reconstruction methods have been receiving considerable attentions in the recent years. An accurate and efficient quartet-based method might be competitive with the current best phylogenetic tree reconstruction methods (such as maximum likelihood or Bayesian MCMC analyses), without being as computationally intensive. In this paper, we present a novel and highly accurate quartet-based phylogenetic tree reconstruction method. We performed an extensive experimental study to evaluate the accuracy and scalability of our approach on both simulated and biological datasets. PMID:25117474
Halsey, Lewis G
2013-06-01
The slope of the typically linear relationship between metabolic rate and walking speed represents the net cost of transport (NCOT). The extrapolated y-intercept is often greater than resting metabolic rate, thus representing a fixed cost associated with pedestrian transport including body maintenance costs. The full cause of the elevated y-intercept remains elusive and it could simply represent experimental stresses. The present literature-based study compares the mass-independent energetic cost of pedestrian locomotion in birds (excluding those with an upright posture, i.e. penguins), represented by the y-intercept, to a known predictor of cost of transport, hip height. Both phylogenetically informed and non-phylogenetically informed analyses were undertaken to determine if patterns of association between hip height, body mass, and the y-intercept are robust with respect to the method of analysis. Body mass and hip height were significant predictors of the y-intercept in the best phylogenetically-informed and non-phylogenetically informed models. Thus there is evidence that, in birds at least, the elevated y-intercept is a legitimate component of locomotion energy expenditure. Hip height is probably a good proxy of effective limb length and thus perhaps birds with greater hip heights have lower y-intercepts because their longer legs more efficiently accommodate body motion and/or because their limbs are more aligned with the ground reaction forces. Copyright © 2013 Elsevier Inc. All rights reserved.
Park, J.-M.; Manen, J.-F.; Colwell, A.E.; Schneeweiss, G.M.
2008-01-01
The phylogenetic relationships of the non-photosynthetic Orobanche sensu lato (Orobanchaceae), which includes some of the economically most important parasitic weeds, remain insufficiently understood and controversial. This concerns both the phylogenetic relationships within the genus, in particular its monophyly or lack thereof, and the relationships to other holoparasitic genera such as Cistanche or Conopholis. Here we present the first comprehensive phylogenetic study of this group based on a region from the plastid genome (rps2 gene). Although substitution rates appear to be elevated compared to the photosynthetic members of Orobanchaceae, relationships among the major lineages Cistanche, Conopholis plus Epifagus, Boschniakia rossica (Cham. & Schltdl.) B. Fedtsch., B. himalaica Hook. f. & Thomson, B. hookeri Walp. plus B. strobilacea A. Gray, and Orobanche s. l. remain unresolved. Resolution within Orobanche, however, is much better. In agreement with morphological, cytological and other molecular phylogenetic evidence, five lineages, corresponding to the four traditionally recognised sections (Gymnocaulis, Myzorrhiza, Orobanche, Trionychon) and O. latisquama Reut. ex Boiss. (of sect. Orobanche), can be distinguished. A combined analysis of plastid rps2 and nuclear ITS sequences of the holoparasitic genera results in more resolved and better supported trees, although the relationships among Orobanche s. l., Cistanche, and the clade including the remaining genera is unresolved. Therefore, rps2 is a marker from the plastid genome that is well-suited to be used in combination with other already established nuclear markers for resolving generic relationships of Orobanche and related genera. ?? 2008 The Botanical Society of Japan and Springer.
Multilocus inference of species trees and DNA barcoding.
Mallo, Diego; Posada, David
2016-09-05
The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary process such as incomplete lineage sorting, gene duplication and loss and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree-gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or inexistent.This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.
Sun, Cheng; Yu, Guoliang; Bao, Manzhu; Zheng, Bo; Ning, Guogui
2014-06-27
Odd traits in few of plant species usually implicate potential biology significances in plant evolutions. The genus Helwingia Willd, a dioecious medical shrub in Aquifoliales order, has an odd floral architecture-epiphyllous inflorescence. The potential significances and possible evolutionary origin of this specie are not well understood due to poorly available data of biological and genetic studies. In addition, the advent of genomics-based technologies has widely revolutionized plant species with unknown genomic information. Morphological and biological pattern were detailed via anatomical and pollination analyses. An RNA sequencing based transcriptomic analysis were undertaken and a high-resolution phylogenetic analysis was conducted based on single-copy genes in more than 80 species of seed plants, including H. japonica. It is verified that a potential fusion of rachis to the leaf midvein facilitates insect pollination. RNA sequencing yielded a total of 111450 unigenes; half of them had significant similarity with proteins in the public database, and 20281 unigenes were mapped to 119 pathways. Deduced from the phylogenetic analysis based on single-copy genes, the group of Helwingia is closer with Euasterids II and rather than Euasterids, congruent with previous reports using plastid sequences. The odd flower architecture make H. Willd adapt to insect pollination by hosting those insects larger than the flower in size via leave, which has little common character that other insect pollination plants hold. Further the present transcriptome greatly riches genomics information of Helwingia species and nucleus genes based phylogenetic analysis also greatly improve the resolution and robustness of phylogenetic reconstruction in H. japonica.
Ned B. Klopfenstein; John W. Hanna; Amy L. Ross-Davis; Jane E. Stewart; Yuko Ota; Rosario Medel-Ortiz; Miguel Armando Lopez-Ramirez; Ruben Damian Elias-Roman; Dionicio Alvarado-Rosales; Mee-Sook Kim
2013-01-01
Armillaria plays diverse ecological roles in forests worldwide, which has inspired interest in understanding phylogenetic relationships within and among species of this genus. Previous rDNA sequence-based phylogenetic analyses of Armillaria have shown general relationships among widely divergent taxa, but rDNA sequences were not reliable for separating closely related...
Phylogenetic turnover along local environmental gradients in tropical forest communities.
Baldeck, C A; Kembel, S W; Harms, K E; Yavitt, J B; John, R; Turner, B L; Madawala, S; Gunatilleke, N; Gunatilleke, S; Bunyavejchewin, S; Kiratiprayoon, S; Yaacob, A; Supardi, M N N; Valencia, R; Navarrete, H; Davies, S J; Chuyong, G B; Kenfack, D; Thomas, D W; Dalling, J W
2016-10-01
While the importance of local-scale habitat niches in shaping tree species turnover along environmental gradients in tropical forests is well appreciated, relatively little is known about the influence of phylogenetic signal in species' habitat niches in shaping local community structure. We used detailed maps of the soil resource and topographic variation within eight 24-50 ha tropical forest plots combined with species phylogenies created from the APG III phylogeny to examine how phylogenetic beta diversity (indicating the degree of phylogenetic similarity of two communities) was related to environmental gradients within tropical tree communities. Using distance-based redundancy analysis we found that phylogenetic beta diversity, expressed as either nearest neighbor distance or mean pairwise distance, was significantly related to both soil and topographic variation in all study sites. In general, more phylogenetic beta diversity within a forest plot was explained by environmental variables this was expressed as nearest neighbor distance versus mean pairwise distance (3.0-10.3 % and 0.4-8.8 % of variation explained among plots, respectively), and more variation was explained by soil resource variables than topographic variables using either phylogenetic beta diversity metric. We also found that patterns of phylogenetic beta diversity expressed as nearest neighbor distance were consistent with previously observed patterns of niche similarity among congeneric species pairs in these plots. These results indicate the importance of phylogenetic signal in local habitat niches in shaping the phylogenetic structure of tropical tree communities, especially at the level of close phylogenetic neighbors, where similarity in habitat niches is most strongly preserved.
A detailed phylogeny for the Methanomicrobiales
NASA Technical Reports Server (NTRS)
Rouviere, P.; Mandelco, L.; Winker, S.; Woese, C. R.
1992-01-01
The small subunit rRNA sequence of twenty archaea, members of the Methanomicrobiales, permits a detailed phylogenetic tree to be inferred for the group. The tree confirms earlier studies, based on far fewer sequences, in showing the group to be divided into two major clusters, temporarily designated the "methanosarcina" group and the "methanogenium" group. The tree also defines phylogenetic relationships within these two groups, which in some cases do not agree with the phylogenetic relationships implied by current taxonomic names--a problem most acute for the genus Methanogenium and its relatives. The present phylogenetic characterization provides the basis for a consistent taxonomic restructuring of this major methanogenic taxon.
Swenson, Nathan G; Enquist, Brian J
2009-08-01
Species diversity is promoted and maintained by ecological and evolutionary processes operating on species attributes through space and time. The degree to which variability in species function regulates distribution and promotes coexistence of species has been debated. Previous work has attempted to quantify the relative importance of species function by using phylogenetic relatedness as a proxy for functional similarity. The key assumption of this approach is that function is phylogenetically conserved. If this assumption is supported, then the phylogenetic dispersion in a community should mirror the functional dispersion. Here we quantify functional trait dispersion along several key axes of tree life-history variation and on multiple spatial scales in a Neotropical dry-forest community. We next compare these results to previously reported patterns of phylogenetic dispersion in this same forest. We find that, at small spatial scales, coexisting species are typically more functionally clustered than expected, but traits related to adult and regeneration niches are overdispersed. This outcome was repeated when the analyses were stratified by size class. Some of the trait dispersion results stand in contrast to the previously reported phylogenetic dispersion results. In order to address this inconsistency we examined the strength of phylogenetic signal in traits at different depths in the phylogeny. We argue that: (1) while phylogenetic relatedness may be a good general multivariate proxy for ecological similarity, it may have a reduced capacity to depict the functional mechanisms behind species coexistence when coexisting species simultaneously converge and diverge in function; and (2) the previously used metric of phylogenetic signal provided erroneous inferences about trait dispersion when married with patterns of phylogenetic dispersion.
Phylogenetic patterns and the adaptive evolution of osmoregulation in fiddler crabs (Brachyura, Uca)
Faria, Samuel Coelho; Provete, Diogo Borges; Thurman, Carl Leo
2017-01-01
Salinity is the primary driver of osmoregulatory evolution in decapods, and may have influenced their diversification into different osmotic niches. In semi-terrestrial crabs, hyper-osmoregulatory ability favors sojourns into burrows and dilute media, and provides a safeguard against hemolymph dilution; hypo-osmoregulatory ability underlies emersion capability and a life more removed from water sources. However, most comparative studies have neglected the roles of the phylogenetic and environmental components of inter-specific physiological variation, hindering evaluation of phylogenetic patterns and the adaptive nature of osmoregulatory evolution. Semi-terrestrial fiddler crabs (Uca) inhabit fresh to hyper-saline waters, with species from the Americas occupying higher intertidal habitats than Indo-west Pacific species mainly found in the low intertidal zone. Here, we characterize numerous osmoregulatory traits in all ten fiddler crabs found along the Atlantic coast of Brazil, and we employ phylogenetic comparative methods using 24 species to test for: (i) similarities of osmoregulatory ability among closely related species; (ii) salinity as a driver of osmoregulatory evolution; (iii) correlation between salt uptake and secretion; and (iv) adaptive peaks in osmoregulatory ability in the high intertidal American lineages. Our findings reveal that osmoregulation in Uca exhibits strong phylogenetic patterns in salt uptake traits. Salinity does not correlate with hyper/hypo-regulatory abilities, but drives hemolymph osmolality at ambient salinities. Osmoregulatory traits have evolved towards three adaptive peaks, revealing a significant contribution of hyper/hypo-regulatory ability in the American clades. Thus, during the evolutionary history of fiddler crabs, salinity has driven some of the osmoregulatory transformations that underpin habitat diversification, although others are apparently constrained phylogenetically. PMID:28182764
Mitochondrial DNA haplogroup phylogeny of the dog: Proposal for a cladistic nomenclature.
Fregel, Rosa; Suárez, Nicolás M; Betancor, Eva; González, Ana M; Cabrera, Vicente M; Pestano, José
2015-05-01
Canis lupus familiaris mitochondrial DNA analysis has increased in recent years, not only for the purpose of deciphering dog domestication but also for forensic genetic studies or breed characterization. The resultant accumulation of data has increased the need for a normalized and phylogenetic-based nomenclature like those provided for human maternal lineages. Although a standardized classification has been proposed, haplotype names within clades have been assigned gradually without considering the evolutionary history of dog mtDNA. Moreover, this classification is based only on the D-loop region, proven to be insufficient for phylogenetic purposes due to its high number of recurrent mutations and the lack of relevant information present in the coding region. In this study, we design 1) a refined mtDNA cladistic nomenclature from a phylogenetic tree based on complete sequences, classifying dog maternal lineages into haplogroups defined by specific diagnostic mutations, and 2) a coding region SNP analysis that allows a more accurate classification into haplogroups when combined with D-loop sequencing, thus improving the phylogenetic information obtained in dog mitochondrial DNA studies. Copyright © 2015 Elsevier B.V. All rights reserved.
Mooers, Arne Ø.; Caccone, Adalgisa; Russello, Michael A.
2016-01-01
In the midst of the current biodiversity crisis, conservation efforts might profitably be directed towards ensuring that extinctions do not result in inordinate losses of evolutionary history. Numerous methods have been developed to evaluate the importance of species based on their contribution to total phylogenetic diversity on trees and networks, but existing methods fail to take complementarity into account, and thus cannot identify the best order or subset of taxa to protect. Here, we develop a novel iterative calculation of the heightened evolutionary distinctiveness and globally endangered metric (I-HEDGE) that produces the optimal ranked list for conservation prioritization, taking into account complementarity and based on both phylogenetic diversity and extinction probability. We applied this metric to a phylogenetic network based on mitochondrial control region data from extant and recently extinct giant Galápagos tortoises, a highly endangered group of closely related species. We found that the restoration of two extinct species (a project currently underway) will contribute the greatest gain in phylogenetic diversity, and present an ordered list of rankings that is the optimum complementarity set for conservation prioritization. PMID:27635324
Jensen, Evelyn L; Mooers, Arne Ø; Caccone, Adalgisa; Russello, Michael A
2016-01-01
In the midst of the current biodiversity crisis, conservation efforts might profitably be directed towards ensuring that extinctions do not result in inordinate losses of evolutionary history. Numerous methods have been developed to evaluate the importance of species based on their contribution to total phylogenetic diversity on trees and networks, but existing methods fail to take complementarity into account, and thus cannot identify the best order or subset of taxa to protect. Here, we develop a novel iterative calculation of the heightened evolutionary distinctiveness and globally endangered metric (I-HEDGE) that produces the optimal ranked list for conservation prioritization, taking into account complementarity and based on both phylogenetic diversity and extinction probability. We applied this metric to a phylogenetic network based on mitochondrial control region data from extant and recently extinct giant Galápagos tortoises, a highly endangered group of closely related species. We found that the restoration of two extinct species (a project currently underway) will contribute the greatest gain in phylogenetic diversity, and present an ordered list of rankings that is the optimum complementarity set for conservation prioritization.
Molecular phylogeny and systematics of the Echinostomatoidea Looss, 1899 (Platyhelminthes: Digenea).
Tkach, Vasyl V; Kudlai, Olena; Kostadinova, Aneta
2016-03-01
The Echinostomatoidea is a large, cosmopolitan group of digeneans currently including nine families and 105 genera, the vast majority parasitic, as adults, in birds with relatively few taxa parasitising mammals, reptiles and, exceptionally, fish. Despite the complex structure, diverse content and substantial species richness of the group, almost no attempt has been made to elucidate its phylogenetic relationships at the suprageneric level based on molecules due to the lack of data. Herein, we evaluate the consistency of the present morphology-based classification system of the Echinostomatoidea with the phylogenetic relationships of its members based on partial sequences of the nuclear lsrRNA gene for a broad diversity of taxa (80 species, representing eight families and 40 genera), including representatives of five subfamilies of the Echinostomatidae, which currently exhibits the most complex taxonomic structure within the superfamily. This first comprehensive phylogeny for the Echinostomatoidea challenged the current systematic framework based on comparative morphology. A morphology-based evaluation of this new molecular framework resulted in a number of systematic and nomenclatural changes consistent with the phylogenetic estimates of the generic and suprageneric boundaries and a new phylogeny-based classification of the Echinostomatoidea. In the current systematic treatment: (i) the rank of two family level lineages, the former Himasthlinae and Echinochasminae, is elevated to full family status; (ii) Caballerotrema is distinguished at the family level; (iii) the content and diagnosis of the Echinostomatidae (sensu stricto) (s. str.) are revised to reflect its phylogeny, resulting in the abolition of the Nephrostominae and Chaunocephalinae as synonyms of the Echinostomatidae (s. str.); (iv) Artyfechinostomum, Cathaemasia, Rhopalias and Ribeiroia are re-allocated within the Echinostomatidae (s. str.), resulting in the abolition of the Cathaemasiidae, Rhopaliidae and Ribeiroiinae, which become synonyms of the Echinostomatidae (s. str.); and (v) refinements of the generic boundaries within the Echinostomatidae (s. str.), Psilostomidae and Fasciolidae are made. Copyright © 2015 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Yu, J; Blom, J; Glaeser, S P; Jaenicke, S; Juhre, T; Rupp, O; Schwengers, O; Spänig, S; Goesmann, A
2017-11-10
The rapid development of next generation sequencing technology has greatly increased the amount of available microbial genomes. As a result of this development, there is a rising demand for fast and automated approaches in analyzing these genomes in a comparative way. Whole genome sequencing also bears a huge potential for obtaining a higher resolution in phylogenetic and taxonomic classification. During the last decade, several software tools and platforms have been developed in the field of comparative genomics. In this manuscript, we review the most commonly used platforms and approaches for ortholog group analyses with a focus on their potential for phylogenetic and taxonomic research. Furthermore, we describe the latest improvements of the EDGAR platform for comparative genome analyses and present recent examples of its application for the phylogenomic analysis of different taxa. Finally, we illustrate the role of the EDGAR platform as part of the BiGi Center for Microbial Bioinformatics within the German network on Bioinformatics Infrastructure (de.NBI). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Maheux, Andrée F; Sellam, Adnane; Piché, Yves; Boissinot, Maurice; Pelletier, René; Boudreau, Dominique K; Picard, François J; Trépanier, Hélène; Boily, Marie-Josée; Ouellette, Marc; Roy, Paul H; Bergeron, Michel G
2016-12-01
Successful treatment of a Candida infection relies on 1) an accurate identification of the pathogenic fungus and 2) on its susceptibility to antifungal drugs. In the present study we investigated the level of correlation between phylogenetical evolution and susceptibility of pathogenic Candida spp. to antifungal drugs. For this, we compared a phylogenetic tree, assembled with the concatenated sequences (2475-bp) of the ATP2, TEF1, and TUF1 genes from 20 representative Candida species, with published minimal inhibitory concentrations (MIC) of the four principal antifungal drug classes commonly used in the treatment of candidiasis: polyenes, triazoles, nucleoside analogues, and echinocandins. The phylogenetic tree revealed three distinct phylogenetic clusters among Candida species. Species within a given phylogenetic cluster have generally similar susceptibility profiles to antifungal drugs and species within Clusters II and III were less sensitive to antifungal drugs than Cluster I species. These results showed that phylogenetical relationship between clusters and susceptibility to several antifungal drugs could be used to guide therapy when only species identification is available prior to information pertaining to its resistance profile. An extended study comprising a large panel of clinical samples should be conducted to confirm the efficiency of this approach in the treatment of candidiasis. Copyright © 2016. Published by Elsevier B.V.
Brankovics, Balázs; van Dam, Peter; Rep, Martijn; de Hoog, G Sybren; J van der Lee, Theo A; Waalwijk, Cees; van Diepeningen, Anne D
2017-09-18
The Fusarium oxysporum species complex (FOSC) contains several phylogenetic lineages. Phylogenetic studies identified two to three major clades within the FOSC. The mitochondrial sequences are highly informative phylogenetic markers, but have been mostly neglected due to technical difficulties. A total of 61 complete mitogenomes of FOSC strains were de novo assembled and annotated. Length variations and intron patterns support the separation of three phylogenetic species. The variable region of the mitogenome that is typical for the genus Fusarium shows two new variants in the FOSC. The variant typical for Fusarium is found in members of all three clades, while variant 2 is found in clades 2 and 3 and variant 3 only in clade 2. The extended set of loci analyzed using a new implementation of the genealogical concordance species recognition method support the identification of three phylogenetic species within the FOSC. Comparative analysis of the mitogenomes in the FOSC revealed ongoing mitochondrial recombination within, but not between phylogenetic species. The recombination indicates the presence of a parasexual cycle in F. oxysporum. The obstacles hindering the usage of the mitogenomes are resolved by using next generation sequencing and selective genome assemblers, such as GRAbB. Complete mitogenome sequences offer a stable basis and reference point for phylogenetic and population genetic studies.
Hou, Zehai; Li, Qinglong; Wei, Cong
2014-01-01
The present investigation provides comparative morphometrics of the final instar nymphs of three dominant cicada species, i.e., Cryptotympana atrata (Fabricius), Meimuna mongolica (Distant) and Platypleura kaempferi (Fabricius), in Guanzhong Plain, China. Particularly, characters on the antennae, legs, and apex of abdomen of both males and females of these three species were investigated and analyzed. In addition, the numbers of hind tibial spines of the final instar nymphs of 21 representatives of Cicadoidea were compared. The results provide useful characteristics for nymph identification of related species and for further taxonomic and phylogenetic analysis of Cicadoidea.
Hou, Zehai; Li, Qinglong; Wei, Cong
2014-01-01
Abstract The present investigation provides comparative morphometrics of the final instar nymphs of three dominant cicada species, i.e., Cryptotympana atrata (Fabricius), Meimuna mongolica (Distant) and Platypleura kaempferi (Fabricius), in Guanzhong Plain, China. Particularly, characters on the antennae, legs, and apex of abdomen of both males and females of these three species were investigated and analyzed. In addition, the numbers of hind tibial spines of the final instar nymphs of 21 representatives of Cicadoidea were compared. The results provide useful characteristics for nymph identification of related species and for further taxonomic and phylogenetic analysis of Cicadoidea. PMID:25147447
Macroevolutionary developmental biology: Embryos, fossils, and phylogenies.
Organ, Chris L; Cooper, Lisa Noelle; Hieronymus, Tobin L
2015-10-01
The field of evolutionary developmental biology is broadly focused on identifying the genetic and developmental mechanisms underlying morphological diversity. Connecting the genotype with the phenotype means that evo-devo research often considers a wide range of evidence, from genetics and morphology to fossils. In this commentary, we provide an overview and framework for integrating fossil ontogenetic data with developmental data using phylogenetic comparative methods to test macroevolutionary hypotheses. We survey the vertebrate fossil record of preserved embryos and discuss how phylogenetic comparative methods can integrate data from developmental genetics and paleontology. Fossil embryos provide limited, yet critical, developmental data from deep time. They help constrain when developmental innovations first appeared during the history of life and also reveal the order in which related morphologies evolved. Phylogenetic comparative methods provide a powerful statistical approach that allows evo-devo researchers to infer the presence of nonpreserved developmental traits in fossil species and to detect discordant evolutionary patterns and processes across levels of biological organization. © 2015 Wiley Periodicals, Inc.
Sequencing and comparing whole mitochondrial genomes ofanimals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boore, Jeffrey L.; Macey, J. Robert; Medina, Monica
2005-04-22
Comparing complete animal mitochondrial genome sequences is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. Not only are they much more informative than shorter sequences of individual genes for inferring evolutionary relatedness, but these data also provide sets of genome-level characters, such as the relative arrangements of genes, that can be especially powerful. We describe here the protocols commonly used for physically isolating mtDNA, for amplifying these by PCR or RCA, for cloning,sequencing, assembly, validation, and gene annotation, and for comparing both sequences and gene arrangements. On several topics, we offer general observations based onmore » our experiences to date with determining and comparing complete mtDNA sequences.« less
Functional & phylogenetic diversity of copepod communities
NASA Astrophysics Data System (ADS)
Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.
2016-02-01
The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.
Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.
Barr, W Andrew; Scott, Robert S
2014-04-01
In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, <5.6% of DFAs and <8.26% of PGLS analyses were significant. When characters contained phylogenetic signal and actual habitats were used, 33.27 to 45.07% of DFAs and <13.09% of PGLS analyses were significant. False Discovery Rate (FDR) corrections for multiple PGLS analyses reduced the rate of significance to <4.64%. In all cases using actual habitats and characters with phylogenetic signal, correct classification rates of DFAs exceeded random chance. In simulations involving phylogenetic signal in both predictor variables and predicted categories, PGLS with FDR was rarely significant, while DFA often was. In short, DFA offered no indication that differences between categories might be explained by phylogenetic signal, while PGLS did. As such, PGLS provides a valuable tool for testing the functional hypotheses at the heart of ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.
Baca, Stephen M; Toussaint, Emmanuel F A; Miller, Kelly B; Short, Andrew E Z
2017-02-01
The first molecular phylogenetic hypothesis for the aquatic beetle family Noteridae is inferred using DNA sequence data from five gene fragments (mitochondrial and nuclear): COI, H3, 16S, 18S, and 28S. Our analysis is the most comprehensive phylogenetic reconstruction of Noteridae to date, and includes 53 species representing all subfamilies, tribes and 16 of the 17 genera within the family. We examine the impact of data partitioning on phylogenetic inference by comparing two different algorithm-based partitioning strategies: one using predefined subsets of the dataset, and another recently introduced method, which uses the k-means algorithm to iteratively divide the dataset into clusters of sites evolving at similar rates across sampled loci. We conducted both maximum likelihood and Bayesian inference analyses using these different partitioning schemes. Resulting trees are strongly incongruent with prior classifications of Noteridae. We recover variant tree topologies and support values among the implemented partitioning schemes. Bayes factors calculated with marginal likelihoods of Bayesian analyses support a priori partitioning over k-means and unpartitioned data strategies. Our study substantiates the importance of data partitioning in phylogenetic inference, and underscores the use of comparative analyses to determine optimal analytical strategies. Our analyses recover Noterini Thomson to be paraphyletic with respect to three other tribes. The genera Suphisellus Crotch and Hydrocanthus Say are also recovered as paraphyletic. Following the results of the preferred partitioning scheme, we here propose a revised classification of Noteridae, comprising two subfamilies, three tribes and 18 genera. The following taxonomic changes are made: Notomicrinae sensu n. (= Phreatodytinae syn. n.) is expanded to include the tribe Phreatodytini; Noterini sensu n. (= Neohydrocoptini syn. n., Pronoterini syn. n., Tonerini syn. n.) is expanded to include all genera of the Noterinae; The genus Suphisellus Crotch is expanded to include species of Pronoterus Sharp syn. n.; and the former subgenus Sternocanthus Guignot stat. rev. is resurrected from synonymy and elevated to genus rank. Copyright © 2016 Elsevier Inc. All rights reserved.
Chen, Yuan
2017-01-01
Abstract In this study, we sequenced fragments of cytochrome oxidase subunit 1 (CO1), internal transcribed spacer 1 (ITS1), and internal transcribed spacer 2 (ITS2) genes from 150 specimens belonging to 16 species of the ant genus Formica from China. Odontoponera transversa from Ponerinae and Polyergus samurai from Formicinae were added as distant relative and close relative outgroups, respectively. Neighbor-joining, maximum parsimony, and Bayesian interference methods were used to analyze their phylogenetic relationships based on CO1 gene sequence as well as combined sequence data of CO1 + ITS1, CO1 + ITS2, and CO1 + ITS1 + ITS2. The results showed that nine Formica species (i.e., Formica sinensis, Formica manchu, Formica uralensis, Formica sanguinea, Formica gagatoides, Formica candida, Formica fusca, Formica glauca, and Formica sp.) formed monophyletic clades, which in agreement with the results based on morphological taxonomy. By comparing the results of DNA barcoding and morphological taxonomy, we propose that Formica aquilonia maybe a junior synonym of F. polyctena and that cryptic species could likely existed in Formica sinae. Further studies on morphology, biology, and geography are needed to confirm this notion.
Dees, Jonathan; Bussard, Caitlin; Momsen, Jennifer L
2018-06-01
Phylogenetic trees have become increasingly important across the life sciences, and as a result, learning to interpret and reason from these diagrams is now an essential component of biology education. Unfortunately, students often struggle to understand phylogenetic trees. Style (i.e., diagonal or bracket) is one factor that has been observed to impact how students interpret phylogenetic trees, and one goal of this research was to investigate these style effects across an introductory biology course. In addition, we investigated the impact of instruction that integrated diagonal and bracket phylogenetic trees equally. Before instruction, students were significantly more accurate with the bracket style for a variety of interpretation and construction tasks. After instruction, however, students were significantly more accurate only for construction tasks and interpretations involving taxa relatedness when using the bracket style. Thus, instruction that used both styles equally mitigated some, but not all, style effects. These results inform the development of research-based instruction that best supports student understanding of phylogenetic trees.
Shavit Grievink, Liat; Penny, David; Holland, Barbara R.
2013-01-01
Phylogenetic studies based on molecular sequence alignments are expected to become more accurate as the number of sites in the alignments increases. With the advent of genomic-scale data, where alignments have very large numbers of sites, bootstrap values close to 100% and posterior probabilities close to 1 are the norm, suggesting that the number of sites is now seldom a limiting factor on phylogenetic accuracy. This provokes the question, should we be fussy about the sites we choose to include in a genomic-scale phylogenetic analysis? If some sites contain missing data, ambiguous character states, or gaps, then why not just throw them away before conducting the phylogenetic analysis? Indeed, this is exactly the approach taken in many phylogenetic studies. Here, we present an example where the decision on how to treat sites with missing data is of equal importance to decisions on taxon sampling and model choice, and we introduce a graphical method for illustrating this. PMID:23471508
Larmuseau, Maarten H D; Van Geystelen, Anneleen; Kayser, Manfred; van Oven, Mannis; Decorte, Ronny
2015-03-01
Currently, several different Y-chromosomal phylogenies and haplogroup nomenclatures are presented in scientific literature and at conferences demonstrating the present diversity in Y-chromosomal phylogenetic trees and Y-SNP sets used within forensic and anthropological research. This situation can be ascribed to the exponential growth of the number of Y-SNPs discovered due to mostly next-generation sequencing (NGS) studies. As Y-SNPs and their respective phylogenetic positions are important in forensics, such as for male lineage characterization and paternal bio-geographic ancestry inference, there is a need for forensic geneticists to know how to deal with these newly identified Y-SNPs and phylogenies, especially since these phylogenies are often created with other aims than to carry out forensic genetic research. Therefore, we give here an overview of four categories of currently used Y-chromosomal phylogenies and the associated Y-SNP sets in scientific research in the current NGS era. We compare these categories based on the construction method, their advantages and disadvantages, the disciplines wherein the phylogenetic tree can be used, and their specific relevance for forensic geneticists. Based on this overview, it is clear that an up-to-date reduced tree with a consensus Y-SNP set and a stable nomenclature will be the most appropriate reference resource for forensic research. Initiatives to reach such an international consensus are therefore highly recommended. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Methylobacterium platani sp. nov., isolated from a leaf of the tree Platanus orientalis.
Kang, Yoon-Suk; Kim, Juhyun; Shin, Hyeon-Dong; Nam, Young-Do; Bae, Jin-Woo; Jeon, Che Ok; Park, Woojun
2007-12-01
A novel bacterial strain, designated PMB02(T), was isolated from a leaf of the tree Platanus orientalis. Colonies grown on TYG agar plates were circular, pink-pigmented and slow-growing, being 0.2-1.5 mm in diameter after 3 days growth. The cells of strain PMB02(T) were Gram-negative, aerobic, motile rods that possessed oxidase and catalase activities and grew at 20-30 degrees C, pH 6-8 and in media containing less than 1 % NaCl. The major respiratory quinone was identified as Q-10. A phylogenetic analysis based on 16S rRNA gene sequence comparisons indicated that strain PMB02(T) was related to members of the genus Methylobacterium. A comparative 16S rRNA gene sequence-based phylogenetic analysis placed the strain in a clade with the species Methylobacterium aquaticum and Methylobacterium variabile, with which it showed sequence similarities of 97.7 and 97.4 %, respectively. The values for DNA-DNA hybridization between strain PMB02(T) and M. aquaticum CCM 7218(T) and M. variabile GR3(T) were less than 32 %. On the basis of the phenotypic characterization, the phylogenetic analysis and the DNA-DNA relatedness data, strain PMB02(T) is considered to represent a novel species of the genus Methylobacterium, for which the name Methylobacterium platani sp. nov. is proposed. The type strain is PMB02(T) (=KCTC 12901(T)=JCM 14648(T)).
Gao, Feng; Gao, Shan; Wang, Pu; Katz, Laura A; Song, Weibo
2014-06-01
Cyclidiids and thigmotrichids are two diverse groups of scuticociliates, a diverse clade of ciliates that is often difficult to investigate due to the small size and conserved morphology among its members. Compared to other groups (e.g. hypotrichs and oligotrichs), the scuticociliates have received relatively little attention and their phylogenetic relationships are largely unresolved. To contribute to our understanding of their evolutionary history, we characterized 26 sequences for three linked genes (SSU-rDNA, 5.8S and LSU-rDNA) from 14 isolates of cyclidiids and thigmotrichids. Phylogenetic analyses reveal the following: (1) traditional cyclidiids are associated with thigmotrichs rather than pleuronematids as expected; (2) the validity of the newly-reported genus Falcicyclidium is confirmed by the molecular data and we suggest to transfer this genus to the family Ctedoctematidae; (3) both the genera Cyclidium and Protocyclidium are not monophyletic and the separation of Protocyclidium from Cyclidium is not supported; (4) the genus Cristigera is a well supported monophyletic group and may stand for a new family; (5) according to both morphological and molecular information, Cyclidium plouneouriDragesco, 1963 should be assigned in the genus Falcicyclidium and thus a new combination is suggested: Falcicyclidium plouneouri (Dragesco, 1963) n. comb.; and (6) based on the data available, a new genus is suggested: Acucyclidium gen. nov. with the type species, Acucyclidium atractodes (Fan et al., 2011a) n. comb. Copyright © 2014 Elsevier Inc. All rights reserved.
Phylogenomic Insights into Mouse Evolution Using a Pseudoreference Approach
Sarver, Brice A.J.; Keeble, Sara; Cosart, Ted; Tucker, Priscilla K.; Dean, Matthew D.
2017-01-01
Comparative genomic studies are now possible across a broad range of evolutionary timescales, but the generation and analysis of genomic data across many different species still present a number of challenges. The most sophisticated genotyping and down-stream analytical frameworks are still predominantly based on comparisons to high-quality reference genomes. However, established genomic resources are often limited within a given group of species, necessitating comparisons to divergent reference genomes that could restrict or bias comparisons across a phylogenetic sample. Here, we develop a scalable pseudoreference approach to iteratively incorporate sample-specific variation into a genome reference and reduce the effects of systematic mapping bias in downstream analyses. To characterize this framework, we used targeted capture to sequence whole exomes (∼54 Mbp) in 12 lineages (ten species) of mice spanning the Mus radiation. We generated whole exome pseudoreferences for all species and show that this iterative reference-based approach improved basic genomic analyses that depend on mapping accuracy while preserving the associated annotations of the mouse reference genome. We then use these pseudoreferences to resolve evolutionary relationships among these lineages while accounting for phylogenetic discordance across the genome, contributing an important resource for comparative studies in the mouse system. We also describe patterns of genomic introgression among lineages and compare our results to previous studies. Our general approach can be applied to whole or partitioned genomic data and is easily portable to any system with sufficient genomic resources, providing a useful framework for phylogenomic studies in mice and other taxa. PMID:28338821
Prioritizing Populations for Conservation Using Phylogenetic Networks
Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.
2014-01-01
In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451
Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
McDonald, Daniel; Gonzalez, Antonio; Navas-Molina, Jose A.; Jiang, Lingjing; Xu, Zhenjiang Zech; Winker, Kevin; Kado, Deborah M.; Orwoll, Eric; Manary, Mark; Mirarab, Siavash
2018-01-01
ABSTRACT Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods. PMID:29719869
Dessimoz, Christophe; Boeckmann, Brigitte; Roth, Alexander C J; Gonnet, Gaston H
2006-01-01
Correct orthology assignment is a critical prerequisite of numerous comparative genomics procedures, such as function prediction, construction of phylogenetic species trees and genome rearrangement analysis. We present an algorithm for the detection of non-orthologs that arise by mistake in current orthology classification methods based on genome-specific best hits, such as the COGs database. The algorithm works with pairwise distance estimates, rather than computationally expensive and error-prone tree-building methods. The accuracy of the algorithm is evaluated through verification of the distribution of predicted cases, case-by-case phylogenetic analysis and comparisons with predictions from other projects using independent methods. Our results show that a very significant fraction of the COG groups include non-orthologs: using conservative parameters, the algorithm detects non-orthology in a third of all COG groups. Consequently, sequence analysis sensitive to correct orthology assignments will greatly benefit from these findings.
Aldous, E W; Fuller, C M; Ridgeon, J H; Irvine, R M; Alexander, D J; Brown, I H
2014-04-01
Newcastle disease (ND), caused by virulent strains of avian paramyxovirus type 1 (APMV-1), is considered throughout the world as one of the most important animal diseases. For over three decades now, there has been a continuing panzootic caused by a variant virulent APMV-1 strain, so-called pigeon paramyxovirus type 1 (PPMV-1), primarily in racing pigeons, which has also spread to wild birds and poultry. PPMV-1 isolations have been made in Great Britain every year since 1983. In this study, we have completed a comparative phylogenetic analysis based on a 374 nucleotide section of the fusion protein gene of 63 isolates of PPMV-1 that were isolated over a 26-year period; 43 of these were sequenced for this study. Phylogenetic analysis of these sequences revealed that all were closely related and placed in the genetic sublineage 4b (VIb), subdivision 4biif. © 2012 Crown copyright.
The Deinococcus-Thermus phylum and the effect of rRNA composition on phylogenetic tree construction
NASA Technical Reports Server (NTRS)
Weisburg, W. G.; Giovannoni, S. J.; Woese, C. R.
1989-01-01
Through comparative analysis of 16S ribosomal RNA sequences, it can be shown that two seemingly dissimilar types of eubacteria Deinococcus and the ubiquitous hot spring organism Thermus are distantly but specifically related to one another. This confirms an earlier report based upon 16S rRNA oligonucleotide cataloging studies (Hensel et al., 1986). Their two lineages form a distinctive grouping within the eubacteria that deserved the taxonomic status of a phylum. The (partial) sequence of T. aquaticus rRNA appears relatively close to those of other thermophilic eubacteria. e.g. Thermotoga maritima and Thermomicrobium roseum. However, this closeness does not reflect a true evolutionary closeness; rather it is due to a "thermophilic convergence", the result of unusually high G+C composition in the rRNAs of thermophilic bacteria. Unless such compositional biases are taken into account, the branching order and root of phylogenetic trees can be incorrectly inferred.
The Use of Weighted Graphs for Large-Scale Genome Analysis
Zhou, Fang; Toivonen, Hannu; King, Ross D.
2014-01-01
There is an acute need for better tools to extract knowledge from the growing flood of sequence data. For example, thousands of complete genomes have been sequenced, and their metabolic networks inferred. Such data should enable a better understanding of evolution. However, most existing network analysis methods are based on pair-wise comparisons, and these do not scale to thousands of genomes. Here we propose the use of weighted graphs as a data structure to enable large-scale phylogenetic analysis of networks. We have developed three types of weighted graph for enzymes: taxonomic (these summarize phylogenetic importance), isoenzymatic (these summarize enzymatic variety/redundancy), and sequence-similarity (these summarize sequence conservation); and we applied these types of weighted graph to survey prokaryotic metabolism. To demonstrate the utility of this approach we have compared and contrasted the large-scale evolution of metabolism in Archaea and Eubacteria. Our results provide evidence for limits to the contingency of evolution. PMID:24619061
Mobilomics in Saccharomyces cerevisiae strains
2013-01-01
Background Mobile Genetic Elements (MGEs) are selfish DNA integrated in the genomes. Their detection is mainly based on consensus–like searches by scanning the investigated genome against the sequence of an already identified MGE. Mobilomics aims at discovering all the MGEs in a genome and understanding their dynamic behavior: The data for this kind of investigation can be provided by comparative genomics of closely related organisms. The amount of data thus involved requires a strong computational effort, which should be alleviated. Results Our approach proposes to exploit the high similarity among homologous chromosomes of different strains of the same species, following a progressive comparative genomics philosophy. We introduce a software tool based on our new fast algorithm, called regender, which is able to identify the conserved regions between chromosomes. Our case study is represented by a unique recently available dataset of 39 different strains of S.cerevisiae, which regender is able to compare in few minutes. By exploring the non–conserved regions, where MGEs are mainly retrotransposons called Tys, and marking the candidate Tys based on their length, we are able to locate a priori and automatically all the already known Tys and map all the putative Tys in all the strains. The remaining putative mobile elements (PMEs) emerging from this intra–specific comparison are sharp markers of inter–specific evolution: indeed, many events of non–conservation among different yeast strains correspond to PMEs. A clustering based on the presence/absence of the candidate Tys in the strains suggests an evolutionary interconnection that is very similar to classic phylogenetic trees based on SNPs analysis, even though it is computed without using phylogenetic information. Conclusions The case study indicates that the proposed methodology brings two major advantages: (a) it does not require any template sequence for the wanted MGEs and (b) it can be applied to infer MGEs also for low coverage genomes with unresolved bases, where traditional approaches are largely ineffective. PMID:23514613
Li, De-Zhu
2011-01-01
Background Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies. Methodology/Principal Findings Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae. Conclusions/Significance The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. The difficulty in resolving the diversification among three clades of temperate woody bamboos, even with complete cp genome sequences, suggests that these lineages may have diverged very rapidly. PMID:21655229
Groth, J G
1998-12-01
The complete mitochondrial cytochrome b genes of 53 genera of oscine passerine birds representing the major groups of finches and some allies were compared. Phylogenetic trees resulting from three levels of character partition removal (no data removed, transitions at third positions of codons removed, and all transitions removed [transversion parsimony]) were generally concordant, and all supported several basic statements regarding relationships of finches and finch-like birds, including: (1) larks (Alaudidae) show no close relationship to any finch group; (2) Peucedramus (olive warbler) is phylogenetically far removed from true wood warblers; (3) a clade consisting of fringillids, passerids, motacillids, and emberizids is supported, and this clade is characterized by evolution of a vestigial 10th wing primary; and (4) Hawaiian honeycreepers are derived from within the cardueline finches. Excluding transition substitutions at third positions of codons resulted in phylogenetic trees similar to, but with greater bootstrap nodal support than, trees derived using either all data (equally weighted) or transversion parsimony. Relative to the shortest trees obtained using all data, the topologies obtained after elimination of third-position transitions showed only slight increases in realized treelength and homoplasy. These increases were negligable compared to increases in overall nodal support; therefore, this partition removal scheme may enhance recovery of deep phylogenetic signal in protein-coding DNA datasets. Copyright 1998 Academic Press.
On Determining if Tree-based Networks Contain Fixed Trees.
Anaya, Maria; Anipchenko-Ulaj, Olga; Ashfaq, Aisha; Chiu, Joyce; Kaiser, Mahedi; Ohsawa, Max Shoji; Owen, Megan; Pavlechko, Ella; St John, Katherine; Suleria, Shivam; Thompson, Keith; Yap, Corrine
2016-05-01
We address an open question of Francis and Steel about phylogenetic networks and trees. They give a polynomial time algorithm to decide if a phylogenetic network, N, is tree-based and pose the problem: given a fixed tree T and network N, is N based on T? We show that it is [Formula: see text]-hard to decide, by reduction from 3-Dimensional Matching (3DM) and further that the problem is fixed-parameter tractable.
Steudel, Bastian; Hallmann, Christine; Lorenz, Maike; Abrahamczyk, Stefan; Prinz, Kathleen; Herrfurth, Cornelia; Feussner, Ivo; Martini, Johannes W R; Kessler, Michael
2016-10-01
It is well known that ecosystem functioning is positively influenced by biodiversity. Most biodiversity-ecosystem functioning experiments have measured biodiversity based on species richness or phylogenetic relationships. However, theoretical and empirical evidence suggests that ecosystem functioning should be more closely related to functional diversity than to species richness. We applied different metrics of biodiversity in an artificial biodiversity-ecosystem functioning experiment using 64 species of green microalgae in combinations of two to 16 species. We found that phylogenetic and functional diversity were positively correlated with biomass overyield, driven by their strong correlation with species richness. At low species richness, no significant correlation between overyield and functional and phylogenetic diversity was found. However, at high species richness (16 species), we found a positive relationship of overyield with functional diversity and a negative relationship with phylogenetic diversity. We show that negative phylogenetic diversity-ecosystem functioning relationships can result from interspecific growth inhibition. The opposing performances of facilitation (functional diversity) and inhibition (phylogenetic diversity) we observed at the 16 species level suggest that phylogenetic diversity is not always a good proxy for functional diversity and that results from experiments with low species numbers may underestimate negative species interactions. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
USDA-ARS?s Scientific Manuscript database
We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results to prior phylogenetic results using plastid, nuclear, and mitochondrial DNA sequences. We obtained, using Illumina sequencing, full plastid sequences of 37 accessions of 20 Daucus taxa and outgrou...
P-type ATPase superfamily: evidence for critical roles for kingdom evolution.
Okamura, Hideyuki; Denawa, Masatsugu; Ohniwa, Ryosuke; Takeyasu, Kunio
2003-04-01
The P-type ATPase has become a protein superfamily. On the basis of sequence similarities, the phylogenetic analyses, and substrate specificities, this superfamily can be classified into 5 families and 11 subfamilies. A comparative phylogenetic analysis demonstrates the relationship between the molecular evolution of these subfamilies and the establishment of the kingdoms of living things.
The space of ultrametric phylogenetic trees.
Gavryushkin, Alex; Drummond, Alexei J
2016-08-21
The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
2013-01-01
Background Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. Results We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. Conclusion CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. PMID:23617892
Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus
2017-06-01
Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Yang, Fang; Chia, Nicholas; White, Bryan A; Schook, Lawrence B
2013-04-23
Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets.
KinFin: Software for Taxon-Aware Analysis of Clustered Protein Sequences.
Laetsch, Dominik R; Blaxter, Mark L
2017-10-05
The field of comparative genomics is concerned with the study of similarities and differences between the information encoded in the genomes of organisms. A common approach is to define gene families by clustering protein sequences based on sequence similarity, and analyze protein cluster presence and absence in different species groups as a guide to biology. Due to the high dimensionality of these data, downstream analysis of protein clusters inferred from large numbers of species, or species with many genes, is nontrivial, and few solutions exist for transparent, reproducible, and customizable analyses. We present KinFin, a streamlined software solution capable of integrating data from common file formats and delivering aggregative annotation of protein clusters. KinFin delivers analyses based on systematic taxonomy of the species analyzed, or on user-defined, groupings of taxa, for example, sets based on attributes such as life history traits, organismal phenotypes, or competing phylogenetic hypotheses. Results are reported through graphical and detailed text output files. We illustrate the utility of the KinFin pipeline by addressing questions regarding the biology of filarial nematodes, which include parasites of veterinary and medical importance. We resolve the phylogenetic relationships between the species and explore functional annotation of proteins in clusters in key lineages and between custom taxon sets, identifying gene families of interest. KinFin can easily be integrated into existing comparative genomic workflows, and promotes transparent and reproducible analysis of clustered protein data. Copyright © 2017 Laetsch and Blaxter.
A Pan-Genomic Approach to Understand the Basis of Host Adaptation in Achromobacter
Jeukens, Julie; Freschi, Luca; Vincent, Antony T.; Emond-Rheault, Jean-Guillaume; Kukavica-Ibrulj, Irena; Charette, Steve J.
2017-01-01
Over the past decade, there has been a rising interest in Achromobacter sp., an emerging opportunistic pathogen responsible for nosocomial and cystic fibrosis lung infections. Species of this genus are ubiquitous in the environment, can outcompete resident microbiota, and are resistant to commonly used disinfectants as well as antibiotics. Nevertheless, the Achromobacter genus suffers from difficulties in diagnosis, unresolved taxonomy and limited understanding of how it adapts to the cystic fibrosis lung, not to mention other host environments. The goals of this first genus-wide comparative genomics study were to clarify the taxonomy of this genus and identify genomic features associated with pathogenicity and host adaptation. This was done with a widely applicable approach based on pan-genome analysis. First, using all publicly available genomes, a combination of phylogenetic analysis based on 1,780 conserved genes with average nucleotide identity and accessory genome composition allowed the identification of a largely clinical lineage composed of Achromobacter xylosoxidans, Achromobacter insuavis, Achromobacter dolens, and Achromobacter ruhlandii. Within this lineage, we identified 35 positively selected genes involved in metabolism, regulation and efflux-mediated antibiotic resistance. Second, resistome analysis showed that this clinical lineage carried additional antibiotic resistance genes compared with other isolates. Finally, we identified putative mobile elements that contribute 53% of the genus’s resistome and support horizontal gene transfer between Achromobacter and other ecologically similar genera. This study provides strong phylogenetic and pan-genomic bases to motivate further research on Achromobacter, and contributes to the understanding of opportunistic pathogen evolution. PMID:28383665
A phylogeny and revised classification of Squamata, including 4161 species of lizards and snakes
2013-01-01
Background The extant squamates (>9400 known species of lizards and snakes) are one of the most diverse and conspicuous radiations of terrestrial vertebrates, but no studies have attempted to reconstruct a phylogeny for the group with large-scale taxon sampling. Such an estimate is invaluable for comparative evolutionary studies, and to address their classification. Here, we present the first large-scale phylogenetic estimate for Squamata. Results The estimated phylogeny contains 4161 species, representing all currently recognized families and subfamilies. The analysis is based on up to 12896 base pairs of sequence data per species (average = 2497 bp) from 12 genes, including seven nuclear loci (BDNF, c-mos, NT3, PDC, R35, RAG-1, and RAG-2), and five mitochondrial genes (12S, 16S, cytochrome b, ND2, and ND4). The tree provides important confirmation for recent estimates of higher-level squamate phylogeny based on molecular data (but with more limited taxon sampling), estimates that are very different from previous morphology-based hypotheses. The tree also includes many relationships that differ from previous molecular estimates and many that differ from traditional taxonomy. Conclusions We present a new large-scale phylogeny of squamate reptiles that should be a valuable resource for future comparative studies. We also present a revised classification of squamates at the family and subfamily level to bring the taxonomy more in line with the new phylogenetic hypothesis. This classification includes new, resurrected, and modified subfamilies within gymnophthalmid and scincid lizards, and boid, colubrid, and lamprophiid snakes. PMID:23627680
Synthesis of phylogeny and taxonomy into a comprehensive tree of life
Hinchliff, Cody E.; Smith, Stephen A.; Allman, James F.; Burleigh, J. Gordon; Chaudhary, Ruchi; Coghill, Lyndon M.; Crandall, Keith A.; Deng, Jiabin; Drew, Bryan T.; Gazis, Romina; Gude, Karl; Hibbett, David S.; Katz, Laura A.; Laughinghouse, H. Dail; McTavish, Emily Jane; Midford, Peter E.; Owen, Christopher L.; Ree, Richard H.; Rees, Jonathan A.; Soltis, Douglas E.; Williams, Tiffani; Cranston, Karen A.
2015-01-01
Reconstructing the phylogenetic relationships that unite all lineages (the tree of life) is a grand challenge. The paucity of homologous character data across disparately related lineages currently renders direct phylogenetic inference untenable. To reconstruct a comprehensive tree of life, we therefore synthesized published phylogenies, together with taxonomic classifications for taxa never incorporated into a phylogeny. We present a draft tree containing 2.3 million tips—the Open Tree of Life. Realization of this tree required the assembly of two additional community resources: (i) a comprehensive global reference taxonomy and (ii) a database of published phylogenetic trees mapped to this taxonomy. Our open source framework facilitates community comment and contribution, enabling the tree to be continuously updated when new phylogenetic and taxonomic data become digitally available. Although data coverage and phylogenetic conflict across the Open Tree of Life illuminate gaps in both the underlying data available for phylogenetic reconstruction and the publication of trees as digital objects, the tree provides a compelling starting point for community contribution. This comprehensive tree will fuel fundamental research on the nature of biological diversity, ultimately providing up-to-date phylogenies for downstream applications in comparative biology, ecology, conservation biology, climate change, agriculture, and genomics. PMID:26385966
Allen, Benjamin; Kon, Mark; Bar-Yam, Yaneer
2009-08-01
Protecting biodiversity involves preserving the maximum number and abundance of species while giving special attention to species with unique genetic or morphological characteristics. In balancing different priorities, conservation policymakers may consider quantitative measures that compare diversity across ecological communities. To serve this purpose, a measure should increase or decrease with changes in community composition in a way that reflects what is valued, including species richness, evenness, and distinctness. However, counterintuitively, studies have shown that established indices, including those that emphasize average interspecies phylogenetic distance, may increase with the elimination of species. We introduce a new diversity index, the phylogenetic entropy, which generalizes in a natural way the Shannon index to incorporate species relatedness. Phylogenetic entropy favors communities in which highly distinct species are more abundant, but it does not advocate decreasing any species proportion below a community structure-dependent threshold. We contrast the behavior of multiple indices on a community of phyllostomid bats in the Selva Lacandona. The optimal genus distribution for phylogenetic entropy populates all genera in a linear relationship to their total phylogenetic distance to other genera. Two other indices favor eliminating 12 out of the 23 genera.
Synthesis of phylogeny and taxonomy into a comprehensive tree of life.
Hinchliff, Cody E; Smith, Stephen A; Allman, James F; Burleigh, J Gordon; Chaudhary, Ruchi; Coghill, Lyndon M; Crandall, Keith A; Deng, Jiabin; Drew, Bryan T; Gazis, Romina; Gude, Karl; Hibbett, David S; Katz, Laura A; Laughinghouse, H Dail; McTavish, Emily Jane; Midford, Peter E; Owen, Christopher L; Ree, Richard H; Rees, Jonathan A; Soltis, Douglas E; Williams, Tiffani; Cranston, Karen A
2015-10-13
Reconstructing the phylogenetic relationships that unite all lineages (the tree of life) is a grand challenge. The paucity of homologous character data across disparately related lineages currently renders direct phylogenetic inference untenable. To reconstruct a comprehensive tree of life, we therefore synthesized published phylogenies, together with taxonomic classifications for taxa never incorporated into a phylogeny. We present a draft tree containing 2.3 million tips-the Open Tree of Life. Realization of this tree required the assembly of two additional community resources: (i) a comprehensive global reference taxonomy and (ii) a database of published phylogenetic trees mapped to this taxonomy. Our open source framework facilitates community comment and contribution, enabling the tree to be continuously updated when new phylogenetic and taxonomic data become digitally available. Although data coverage and phylogenetic conflict across the Open Tree of Life illuminate gaps in both the underlying data available for phylogenetic reconstruction and the publication of trees as digital objects, the tree provides a compelling starting point for community contribution. This comprehensive tree will fuel fundamental research on the nature of biological diversity, ultimately providing up-to-date phylogenies for downstream applications in comparative biology, ecology, conservation biology, climate change, agriculture, and genomics.
Hurtado, Luis A; Santamaria, Carlos A; Fitzgerald, Lee A
2014-05-06
The phylogenetic position of the critically endangered Saint Croix ground lizard Ameiva polops is presently unknown and several hypotheses have been proposed. We investigated the phylogenetic position of this species using molecular phylogenetic methods. We obtained sequences of DNA fragments of the mitochondrial ribosomal genes 12S rDNA and 16S rDNA for this species. We aligned these sequences with published sequences of other Ameiva species, which include most of the Ameiva species from the West Indies, three Ameiva species from Central America and South America, and one from the teiid lizard Tupinambis teguixin, which was used as outgroup. We conducted Maximum Likelihood and Bayesian phylogenetic analyses. The phylogenetic reconstructions among the different methods were very similar, supporting the monophyly of West Indian Ameiva and showing within this lineage, a basal polytomy of four clades that are separated geographically. Ameiva polops grouped in a cluster that included the other two Ameiva species found in the Puerto Rican Bank: A. wetmorei and A. exsul. A sister relationship between A. polops and A. wetmorei is suggested by our analyses. We compare our results with a previous study on molecular systematics of West Indian Ameiva.
Using Genotype Abundance to Improve Phylogenetic Inference
Mesin, Luka; Victora, Gabriel D; Minin, Vladimir N; Matsen, Frederick A
2018-01-01
Abstract Modern biological techniques enable very dense genetic sampling of unfolding evolutionary histories, and thus frequently sample some genotypes multiple times. This motivates strategies to incorporate genotype abundance information in phylogenetic inference. In this article, we synthesize a stochastic process model with standard sequence-based phylogenetic optimality, and show that tree estimation is substantially improved by doing so. Our method is validated with extensive simulations and an experimental single-cell lineage tracing study of germinal center B cell receptor affinity maturation. PMID:29474671
flyDIVaS: A Comparative Genomics Resource for Drosophila Divergence and Selection
Stanley, Craig E.; Kulathinal, Rob J.
2016-01-01
With arguably the best finished and expertly annotated genome assembly, Drosophila melanogaster is a formidable genetics model to study all aspects of biology. Nearly a decade ago, the 12 Drosophila genomes project expanded D. melanogaster’s breadth as a comparative model through the community-development of an unprecedented genus- and genome-wide comparative resource. However, since its inception, these datasets for evolutionary inference and biological discovery have become increasingly outdated, outmoded, and inaccessible. Here, we provide an updated and upgradable comparative genomics resource of Drosophila divergence and selection, flyDIVaS, based on the latest genomic assemblies, curated FlyBase annotations, and recent OrthoDB orthology calls. flyDIVaS is an online database containing D. melanogaster-centric orthologous gene sets, CDS and protein alignments, divergence statistics (% gaps, dN, dS, dN/dS), and codon-based tests of positive Darwinian selection. Out of 13,920 protein-coding D. melanogaster genes, ∼80% have one aligned ortholog in the closely related species, D. simulans, and ∼50% have 1–1 12-way alignments in the original 12 sequenced species that span over 80 million yr of divergence. Genes and their orthologs can be chosen from four different taxonomic datasets differing in phylogenetic depth and coverage density, and visualized via interactive alignments and phylogenetic trees. Users can also batch download entire comparative datasets. A functional survey finds conserved mitotic and neural genes, highly diverged immune and reproduction-related genes, more conspicuous signals of divergence across tissue-specific genes, and an enrichment of positive selection among highly diverged genes. flyDIVaS will be regularly updated and can be freely accessed at www.flydivas.info. We encourage researchers to regularly use this resource as a tool for biological inference and discovery, and in their classrooms to help train the next generation of biologists to creatively use such genomic big data resources in an integrative manner. PMID:27226167
flyDIVaS: A Comparative Genomics Resource for Drosophila Divergence and Selection.
Stanley, Craig E; Kulathinal, Rob J
2016-08-09
With arguably the best finished and expertly annotated genome assembly, Drosophila melanogaster is a formidable genetics model to study all aspects of biology. Nearly a decade ago, the 12 Drosophila genomes project expanded D. melanogaster's breadth as a comparative model through the community-development of an unprecedented genus- and genome-wide comparative resource. However, since its inception, these datasets for evolutionary inference and biological discovery have become increasingly outdated, outmoded, and inaccessible. Here, we provide an updated and upgradable comparative genomics resource of Drosophila divergence and selection, flyDIVaS, based on the latest genomic assemblies, curated FlyBase annotations, and recent OrthoDB orthology calls. flyDIVaS is an online database containing D. melanogaster-centric orthologous gene sets, CDS and protein alignments, divergence statistics (% gaps, dN, dS, dN/dS), and codon-based tests of positive Darwinian selection. Out of 13,920 protein-coding D. melanogaster genes, ∼80% have one aligned ortholog in the closely related species, D. simulans, and ∼50% have 1-1 12-way alignments in the original 12 sequenced species that span over 80 million yr of divergence. Genes and their orthologs can be chosen from four different taxonomic datasets differing in phylogenetic depth and coverage density, and visualized via interactive alignments and phylogenetic trees. Users can also batch download entire comparative datasets. A functional survey finds conserved mitotic and neural genes, highly diverged immune and reproduction-related genes, more conspicuous signals of divergence across tissue-specific genes, and an enrichment of positive selection among highly diverged genes. flyDIVaS will be regularly updated and can be freely accessed at www.flydivas.info We encourage researchers to regularly use this resource as a tool for biological inference and discovery, and in their classrooms to help train the next generation of biologists to creatively use such genomic big data resources in an integrative manner. Copyright © 2016 Stanley and Kulathinal.
Aust, Shelly K; Ahrendsen, Dakota L; Kellar, P Roxanne
2015-01-01
Conservation of the evolutionary diversity among organisms should be included in the selection of priority regions for preservation of Earth's biodiversity. Traditionally, biodiversity has been determined from an assessment of species richness (S), abundance, evenness, rarity, etc. of organisms but not from variation in species' evolutionary histories. Phylogenetic diversity (PD) measures evolutionary differences between taxa in a community and is gaining acceptance as a biodiversity assessment tool. However, with the increase in the number of ways to calculate PD, end-users and decision-makers are left wondering how metrics compare and what data are needed to calculate various metrics. In this study, we used massively parallel sequencing to generate over 65,000 DNA characters from three cellular compartments for over 60 species in the asterid clade of flowering plants. We estimated asterid phylogenies from character datasets of varying nucleotide quantities, and then assessed the effect of varying character datasets on resulting PD metric values. We also compared multiple PD metrics with traditional diversity indices (including S) among two endangered grassland prairies in Nebraska (U.S.A.). Our results revealed that PD metrics varied based on the quantity of genes used to infer the phylogenies; therefore, when comparing PD metrics between sites, it is vital to use comparable datasets. Additionally, various PD metrics and traditional diversity indices characterize biodiversity differently and should be chosen depending on the research question. Our study provides empirical results that reveal the value of measuring PD when considering sites for conservation, and it highlights the usefulness of using PD metrics in combination with other diversity indices when studying community assembly and ecosystem functioning. Ours is just one example of the types of investigations that need to be conducted across the tree of life and across varying ecosystems in order to build a database of phylogenetic diversity assessments that lead to a pool of results upon which a guide through the plethora of PD metrics may be prepared for use by ecologists and conservation planners.
A Phylogenetic Perspective on Biogeographical Divergence of the Flora in Yunnan, Southwestern China.
Liu, Shuiyin; Zhu, Hua; Yang, Jie
2017-02-21
In recent years, an increasing number of studies incorporated biogeography with phylogenetic analyses to reveal the origin and evolutionary history of specific floras. In this study, we constructed the mega-phylogeny of the floras of three representative regions across Yunnan, southwestern China. We analyzed the phylogenetic structure and beta diversity based on the presence/absence of species (genus or family) data to investigate the phylogenetic patterns of regional floras. We found conspicuous divergence at the genus and species level in the pattern of phylogenetic structures, which most likely related to historical biogeography. The flora of southern Yunnan was shaped by the strike-slip extrusion of Indochina and the regional climatic stability, while the flora of northwestern Yunnan was shaped by the uplift of the Himalaya-Tibetan Plateau and the oscillations of the glacial-interglacial periods. The flora of central Yunnan had nearly equal proportions of the northern and southern floras that may be derived from a common Tertiary tropical or subtropical flora. Geological events fit well with the floristic and phylogenetic patterns across Yunnan. This study highlighted the importance of linking phylogenetic analyses to biogeographic interpretations to improve our understanding of the origin, evolution and divergence of regional floras.
Bào, Yīmíng; Kuhn, Jens H
2018-01-01
During the last decade, genome sequence-based classification of viruses has become increasingly prominent. Viruses can be even classified based on coding-complete genome sequence data alone. Nevertheless, classification remains arduous as experts are required to establish phylogenetic trees to depict the evolutionary relationships of such sequences for preliminary taxonomic placement. Pairwise sequence comparison (PASC) of genomes is one of several novel methods for establishing relationships among viruses. This method, provided by the US National Center for Biotechnology Information as an open-access tool, circumvents phylogenetics, and yet PASC results are often in agreement with those of phylogenetic analyses. Computationally inexpensive, PASC can be easily performed by non-taxonomists. Here we describe how to use the PASC tool for the preliminary classification of novel viral hemorrhagic fever-causing viruses.
Pennell, Matthew W; Harmon, Luke J
2013-06-01
Recent innovations in phylogenetic comparative methods (PCMs) have spurred a renaissance of research into the causes and consequences of large-scale patterns of biodiversity. In this paper, we review these advances. We also highlight the potential of comparative methods to integrate across fields and focus on three examples where such integration might be particularly valuable: quantitative genetics, community ecology, and paleobiology. We argue that PCMs will continue to be a key set of tools in evolutionary biology, shedding new light on how evolutionary processes have shaped patterns of biodiversity through deep time. © 2013 New York Academy of Sciences.
de la Rúa, Nicholas M.; Bustamante, Dulce M.; Menes, Marianela; Stevens, Lori; Monroy, Carlota; Kilpatrick, William; Rizzo, Donna; Klotz, Stephen A.; Schmidt, Justin; Axen, Heather J.; Dorn, Patricia L.
2014-01-01
Phylogenetic relationships of insect vectors of parasitic diseases are important for understanding the evolution of epidemiologically relevant traits, and may be useful in vector control. The subfamily Triatominae (Hemiptera:Reduviidae) includes ~140 extant species arranged in five tribes comprised of 15 genera. The genus Triatoma is the most species-rich and contains important vectors of Trypanosoma cruzi, the causative agent of Chagas disease. Triatoma species were grouped into complexes originally by morphology and more recently with the addition of information from molecular phylogenetics (the four-complex hypothesis); however, without a strict adherence to monophyly. To date, the validity of proposed species complexes has not been tested by statistical tests of topology. The goal of this study was to clarify the systematics of 19 Triatoma species from North and Central America. We inferred their evolutionary relatedness using two independent data sets: the complete nuclear Internal Transcribed Spacer-2 ribosomal DNA (ITS-2 rDNA) and head morphometrics. In addition, we used the Shimodaira-Hasegawa statistical test of topology to assess the fit of the data to a set of competing systematic hypotheses (topologies). An unconstrained topology inferred from the ITS-2 data was compared to topologies constrained based on the four-complex hypothesis or one inferred from our morphometry results. The unconstrained topology represents a statistically significant better fit of the molecular data than either the four-complex or the morphometric topology. We propose an update to the composition of species complexes in the North and Central American Triatoma, based on a phylogeny inferred from ITS-2 as a first step towards updating the phylogeny of the complexes based on monophyly and statistical tests of topologies. PMID:24681261
Assugeni, Camila de O; Magalhães, Tatiana; Bolaños, Juan A; Tudge, Christopher C; Mantelatto, Fernando L; Zara, Fernando J
2017-12-01
Recent studies based on morphological and molecular data provide a new perspective concerning taxonomic aspects of the brachyuran family Mithracidae. These studies proposed a series of nominal changes and indicated that the family is actually represented by a different number and representatives of genera than previously thought. Here, we provide a comparative description of the ultrastructure of spermatozoa and spermatophores of some species of Mithracidae in a phylogenetic context. The ultrastructure of the spermatozoa and spermatophore was observed by scanning and transmission electron microscopy. The most informative morphological characters analysed were thickness of the operculum, shape of the perforatorial chamber and shape and thickness of the inner acrosomal zone. As a framework, we used a topology based on a phylogenetic analysis using mitochondrial data obtained here and from previous studies. Our results indicate that closely related species share a series of morphological characteristics of the spermatozoa. A thick operculum, for example, is a feature observed in species of the genera Amphithrax, Teleophrys, and Omalacantha in contrast to the slender operculum observed in Mithraculus and Mithrax. Amphithrax and Teleophrys have a rhomboid perforatorial chamber, while Mithraculus, Mithrax, and Omalacantha show a wider, deltoid morphology. Furthermore, our results are in agreement with recently proposed taxonomic changes including the separation of the genera Mithrax (previously Damithrax), Amphithrax (previously Mithrax) and Mithraculus, and the synonymy of Mithrax caribbaeus with Mithrax hispidus. Overall, the spermiotaxonomy of these species of Mithracidae represent a novel set of data that corroborates the most recent taxonomic revision of the family and can be used in future taxonomic and phylogenetic studies within this family. © 2017 Wiley Periodicals, Inc.
Kaden, Jennifer; Joshi, Jyoti; Bhattarai, Susmita; Kusi, Naresh; Sillero-Zubiri, Claudio; Macdonald, David W.
2017-01-01
Wolves in the Himalayan region form a monophyletic lineage distinct from the present-day Holarctic grey wolf Canis lupus spp. (Linnaeus 1758) found across Eurasia and North America. Here, we analyse phylogenetic relationships and the geographic distribution of mitochondrial DNA haplotypes of the contemporary Himalayan wolf (proposed in previous studies as Canis himalayensis) found in Central Asia. We combine genetic data from a living Himalayan wolf population collected in northwestern Nepal in this study with already published genetic data, and confirm the Himalayan wolf lineage based on mitochondrial genomic data (508 bp cytochrome b and 242 bp D-loop), and X- and Y-linked zinc-finger protein gene (ZFX and ZFY) sequences. We then compare the genetic profile of the Himalayan wolf lineage found in northwestern Nepal with canid reference sequences from around the globe with maximum likelihood and Bayesian phylogeny building methods to demonstrate that the Himalayan wolf forms a distinct monophyletic clade supported by posterior probabilities/bootstrap for D-loop of greater than 0.92/85 and cytochrome b greater than 0.99/93. The Himalayan wolf shows a unique Y-chromosome (ZFY) haplotype, and shares an X-chromosome haplotype (ZFX) with the newly postulated African wolf. Our results imply that the Himalayan wolf distribution range extends from the Himalayan range north across the Tibetan Plateau up to the Qinghai Lakes region in Qinghai Province in the People's Republic of China. Based on its phylogenetic distinction and its older age of divergence relative to the Holarctic grey wolf, the Himalayan wolf merits formal classification as a distinct taxon of special conservation concern. PMID:28680672
Tang, Ying; Yukawa, Tomohisa; Bateman, Richard M; Jiang, Hong; Peng, Hua
2015-05-26
Tribe Orchideae dominates the orchid flora of the temperate Northern Hemisphere but its representatives in East Asia had been subject to less intensive phylogenetic study than those in Eurasia and North America. Although this situation was improved recently by the molecular phylogenetic study of Jin et al., comparatively few species were analyzed from the species-rich and taxonomically controversial East Asian Amitostigma alliance. Here, we present a framework nrITS tree of 235 accessions of Orchideae plus an in-depth analysis of 110 representative accessions, encompassing most widely recognized species within the alliance, to elucidate their relationships. We used parsimony, likelihood and Bayesian approaches to generate trees from data for two nuclear (nrITS, low-copy Xdh) and four chloroplast (matK, psbA-trnH, trnL-F, trnS-trnG) markers. Nuclear and plastid data were analyzed separately due to a few hard incongruences that most likely reflect chloroplast capture. Our results suggest key phylogenetic placements for Sirindhornia and Brachycorythis, and confirm previous assertions that the Amitostigma alliance is monophyletic and sister to the Eurasian plus European clades of subtribe Orchidinae. Seven robust clades are evident within the alliance, but none corresponds precisely with any of the traditional genera; the smaller and more morphologically distinct genera Tsaiorchis, Hemipilia, Neottianthe and Hemipiliopsis are monophyletic but each is nested within a polyphyletic plexus of species attributed to either Ponerorchis or the most plesiomorphic genus, Amitostigma. Two early-divergent clades that escaped analysis by Jin et al. undermine their attempt to circumscribe an expanded monophyletic genus Ponerorchis. We provide a new framework on the complex phylogenetic relationships between Amitostigma and other genera traditionally included in its alliance; based on which, we combine the entire Amitostigma alliance into a morphologically and molecularly circumscribed Amitostigma sensu latissimo that also contains seven molecularly circumscribed sections. Our molecular trees imply unusually high levels of morphological homoplasy, but these will need to be quantified via a future group-wide review of the alliance based on living plants if morphology is to be fully integrated into our classification.
Smith, Adam R.; Proffitt, Melissa R.; Ho, Winnie W.; Mullaney, Claire B.; Maldonado-Ocampo, Javier A.; Lovejoy, Nathan R.; Alves-Gomes, José A.; Smith, G. Troy
2018-01-01
The electric communication signals of weakly electric ghost knifefishes (Gymnotiformes: Apteronotidae) provide a valuable model system for understanding the evolution and physiology of behavior. Apteronotids produce continuous wave-type electric organ discharges (EODs) that are used for electrolocation and communication. The frequency and waveform of EODs, as well as the structure of transient EOD modulations (chirps), vary substantially across species. Understanding how these signals have evolved, however, has been hampered by the lack of a well-supported phylogeny for this family. We constructed a molecular phylogeny for the Apteronotidae by using sequence data from three genes (cytochrome c oxidase subunit 1, recombination activating gene 2, and cytochrome oxidase B) in 32 species representing 13 apteronotid genera. This phylogeny and an extensive database of apteronotid signals allowed us to examine signal evolution by using ancestral state reconstruction (ASR) and phylogenetic generalized least squares (PGLS) models. Our molecular phylogeny largely agrees with another recent sequence-based phylogeny and identified five robust apteronotid clades: (i) Sternarchorhamphus + Orthosternarchus, (ii) Adontosternarchus, (iii) Apteronotus + Parapteronotus, (iv) Sternarchorhynchus, and (v) a large clade including Porotergus, ‘Apteronotus’, Compsaraia, Sternarchogiton, Sternarchella, and Magosternarchus. We analyzed novel chirp recordings from two apteronotid species (Orthosternarchus tamandua and Sternarchorhynchus mormyrus), and combined data from these species with that from previously recorded species in our phylogenetic analyses. Some signal parameters in O. tamandua were plesiomorphic (e.g., low frequency EODs and chirps with little frequency modulation that nevertheless interrupt the EOD), suggesting that ultra-high frequency EODs and ‘‘big” chirps evolved after apteronotids diverged from other gymnotiforms. In contrast to previous studies, our PGLS analyses using the new phylogeny indicated the presence of phylogenetic signals in the relationships between some EOD and chirp parameters. The ASR demonstrated that most EOD and chirp parameters are evolutionarily labile and have often diversified even among closely related species. PMID:27769924
Smith, Adam R; Proffitt, Melissa R; Ho, Winnie W; Mullaney, Claire B; Maldonado-Ocampo, Javier A; Lovejoy, Nathan R; Alves-Gomes, José A; Smith, G Troy
2016-10-01
The electric communication signals of weakly electric ghost knifefishes (Gymnotiformes: Apteronotidae) provide a valuable model system for understanding the evolution and physiology of behavior. Apteronotids produce continuous wave-type electric organ discharges (EODs) that are used for electrolocation and communication. The frequency and waveform of EODs, as well as the structure of transient EOD modulations (chirps), vary substantially across species. Understanding how these signals have evolved, however, has been hampered by the lack of a well-supported phylogeny for this family. We constructed a molecular phylogeny for the Apteronotidae by using sequence data from three genes (cytochrome c oxidase subunit 1, recombination activating gene 2, and cytochrome oxidase B) in 32 species representing 13 apteronotid genera. This phylogeny and an extensive database of apteronotid signals allowed us to examine signal evolution by using ancestral state reconstruction (ASR) and phylogenetic generalized least squares (PGLS) models. Our molecular phylogeny largely agrees with another recent sequence-based phylogeny and identified five robust apteronotid clades: (i) Sternarchorhamphus+Orthosternarchus, (ii) Adontosternarchus, (iii) Apteronotus+Parapteronotus, (iv) Sternarchorhynchus, and (v) a large clade including Porotergus, 'Apteronotus', Compsaraia, Sternarchogiton, Sternarchella, and Magosternarchus. We analyzed novel chirp recordings from two apteronotid species (Orthosternarchus tamandua and Sternarchorhynchus mormyrus), and combined data from these species with that from previously recorded species in our phylogenetic analyses. Some signal parameters in O. tamandua were plesiomorphic (e.g., low frequency EODs and chirps with little frequency modulation that nevertheless interrupt the EOD), suggesting that ultra-high frequency EODs and "big" chirps evolved after apteronotids diverged from other gymnotiforms. In contrast to previous studies, our PGLS analyses using the new phylogeny indicated the presence of phylogenetic signals in the relationships between some EOD and chirp parameters. The ASR demonstrated that most EOD and chirp parameters are evolutionarily labile and have often diversified even among closely related species. Published by Elsevier Ltd.
Rainford, James L; Hofreiter, Michael; Mayhew, Peter J
2016-01-08
Skewed body size distributions and the high relative richness of small-bodied taxa are a fundamental property of a wide range of animal clades. The evolutionary processes responsible for generating these distributions are well described in vertebrate model systems but have yet to be explored in detail for other major terrestrial clades. In this study, we explore the macro-evolutionary patterns of body size variation across families of Hexapoda (insects and their close relatives), using recent advances in phylogenetic understanding, with an aim to investigate the link between size and diversity within this ancient and highly diverse lineage. The maximum, minimum and mean-log body lengths of hexapod families are all approximately log-normally distributed, consistent with previous studies at lower taxonomic levels, and contrasting with skewed distributions typical of vertebrate groups. After taking phylogeny and within-tip variation into account, we find no evidence for a negative relationship between diversification rate and body size, suggesting decoupling of the forces controlling these two traits. Likelihood-based modeling of the log-mean body size identifies distinct processes operating within Holometabola and Diptera compared with other hexapod groups, consistent with accelerating rates of size evolution within these clades, while as a whole, hexapod body size evolution is found to be dominated by neutral processes including significant phylogenetic conservatism. Based on our findings we suggest that the use of models derived from well-studied but atypical clades, such as vertebrates may lead to misleading conclusions when applied to other major terrestrial lineages. Our results indicate that within hexapods, and within the limits of current systematic and phylogenetic knowledge, insect diversification is generally unfettered by size-biased macro-evolutionary processes, and that these processes over large timescales tend to converge on apparently neutral evolutionary processes. We also identify limitations on available data within the clade and modeling approaches for the resolution of trees of higher taxa, the resolution of which may collectively enhance our understanding of this key component of terrestrial ecosystems.
galaxie--CGI scripts for sequence identification through automated phylogenetic analysis.
Nilsson, R Henrik; Larsson, Karl-Henrik; Ursing, Björn M
2004-06-12
The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se
MaxAlign: maximizing usable data in an alignment.
Gouveia-Oliveira, Rodrigo; Sackett, Peter W; Pedersen, Anders G
2007-08-28
The presence of gaps in an alignment of nucleotide or protein sequences is often an inconvenience for bioinformatical studies. In phylogenetic and other analyses, for instance, gapped columns are often discarded entirely from the alignment. MaxAlign is a program that optimizes the alignment prior to such analyses. Specifically, it maximizes the number of nucleotide (or amino acid) symbols that are present in gap-free columns - the alignment area - by selecting the optimal subset of sequences to exclude from the alignment. MaxAlign can be used prior to phylogenetic and bioinformatical analyses as well as in other situations where this form of alignment improvement is useful. In this work we test MaxAlign's performance in these tasks and compare the accuracy of phylogenetic estimates including and excluding gapped columns from the analysis, with and without processing with MaxAlign. In this paper we also introduce a new simple measure of tree similarity, Normalized Symmetric Similarity (NSS) that we consider useful for comparing tree topologies. We demonstrate how MaxAlign is helpful in detecting misaligned or defective sequences without requiring manual inspection. We also show that it is not advisable to exclude gapped columns from phylogenetic analyses unless MaxAlign is used first. Finally, we find that the sequences removed by MaxAlign from an alignment tend to be those that would otherwise be associated with low phylogenetic accuracy, and that the presence of gaps in any given sequence does not seem to disturb the phylogenetic estimates of other sequences. The MaxAlign web-server is freely available online at http://www.cbs.dtu.dk/services/MaxAlign where supplementary information can also be found. The program is also freely available as a Perl stand-alone package.
Hoyal Cuthill, Jennifer F.
2015-01-01
Biological variety and major evolutionary transitions suggest that the space of possible morphologies may have varied among lineages and through time. However, most models of phylogenetic character evolution assume that the potential state space is finite. Here, I explore what the morphological state space might be like, by analysing trends in homoplasy (repeated derivation of the same character state). Analyses of ten published character matrices are compared against computer simulations with different state space models: infinite states, finite states, ordered states and an ‘inertial' model, simulating phylogenetic constraints. Of these, only the infinite states model results in evolution without homoplasy, a prediction which is not generally met by real phylogenies. Many authors have interpreted the ubiquity of homoplasy as evidence that the number of evolutionary alternatives is finite. However, homoplasy is also predicted by phylogenetic constraints on the morphological distance that can be traversed between ancestor and descendent. Phylogenetic rarefaction (sub-sampling) shows that finite and inertial state spaces do produce contrasting trends in the distribution of homoplasy. Two clades show trends characteristic of phylogenetic inertia, with decreasing homoplasy (increasing consistency index) as we sub-sample more distantly related taxa. One clade shows increasing homoplasy, suggesting exhaustion of finite states. Different clades may, therefore, show different patterns of character evolution. However, when parsimony uninformative characters are excluded (which may occur without documentation in cladistic studies), it may no longer be possible to distinguish inertial and finite state spaces. Interestingly, inertial models predict that homoplasy should be clustered among comparatively close relatives (parallel evolution), whereas finite state models do not. If morphological evolution is often inertial in nature, then homoplasy (false homology) may primarily occur between close relatives, perhaps being replaced by functional analogy at higher taxonomic scales. PMID:26640650
Vanhommerig, Joost W; Bezemer, Daniela; Molenkamp, Richard; Van Sighem, Ard I; Smit, Colette; Arends, Joop E; Lauw, Fanny N; Brinkman, Kees; Rijnders, Bart J; Newsum, Astrid M; Bruisten, Sylvia M; Prins, Maria; Van Der Meer, Jan T; Van De Laar, Thijs J; Schinkel, Janke
2017-09-24
MSM are at increased risk for infection with HIV-1 and hepatitis C virus (HCV). Is HIV/HCV coinfection confined to specific HIV transmission networks? A HIV phylogenetic tree was constructed for 5038 HIV-1 subtype B polymerase (pol) sequences obtained from MSM in the AIDS therapy evaluation in the Netherlands cohort. We investigated the existence of HIV clusters with increased HCV prevalence, the HIV phylogenetic density (i.e. the number of potential HIV transmission partners) of HIV/HCV-coinfected MSM compared with HIV-infected MSM without HCV, and the overlap in HIV and HCV phylogenies using HCV nonstructural protein 5B sequences from 183 HIV-infected MSM with acute HCV infection. Five hundred and sixty-three of 5038 (11.2%) HIV-infected MSM tested HCV positive. Phylogenetic analysis revealed 93 large HIV clusters (≥10 MSM), 370 small HIV clusters (2-9 MSM), and 867 singletons with a median HCV prevalence of 11.5, 11.6, and 9.3%, respectively. We identified six large HIV clusters with elevated HCV prevalence (range 23.5-46.2%). Median HIV phylogenetic densities for MSM with HCV (3, interquartile range 1-7) and without HCV (3, interquartile range 1-8) were similar. HCV phylogeny showed 12 MSM-specific HCV clusters (clustersize: 2-39 HCV sequences); 12.7% of HCV infections were part of the same HIV and HCV cluster. We observed few HIV clusters with elevated HCV prevalence, no increase in the HIV phylogenetic density of HIV/HCV-coinfected MSM compared to HIV-infected MSM without HCV, and limited overlap between HIV and HCV phylogenies among HIV/HCV-coinfected MSM. Our data do not support the existence of MSM-specific sexual networks that fuel both the HIV and HCV epidemic.
Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis.
Gibbs, S; Collard, M; Wood, B
2002-01-01
This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence.
Genetic and evolutionary characterization of RABVs from China using the phosphoprotein gene.
Wang, Lihua; Wu, Hui; Tao, Xiaoyan; Li, Hao; Rayner, Simon; Liang, Guodong; Tang, Qing
2013-01-07
While the function of the phosphoprotein (P) gene of the rabies virus (RABV) has been well studied in laboratory adapted RABVs, the genetic diversity and evolution characteristics of the P gene of street RABVs remain unclear. The objective of the present study was to investigate the mutation and evolution of P genes in Chinese street RABVs. The P gene of 77 RABVs from brain samples of dogs and wild animals collected in eight Chinese provinces through 2003 to 2008 were sequenced. The open reading frame (ORF) of the P genes was 894 nucleotides (nt) in length, with 85-99% (80-89%) amino acid (nucleotide) identity compared with the laboratory RABVs and vaccine strains. Phylogenetic analysis based on the P gene revealed that Chinese RABVs strains could be divided into two distinct clades, and several RABV variants were found to co circulating in the same province. Two conserved (CD1, 2) and two variable (VD1, 2) domains were identified by comparing the deduced primary sequences of the encoded P proteins. Two sequence motifs, one believed to confer binding to the cytoplasmic dynein light chain LC8 and a lysine-rich sequence were conserved throughout the Chinese RABVs. In contrast, the isolates exhibited lower conservation of one phosphate acceptor and one internal translation initiation site identified in the P protein of the rabies challenge virus standard (CVS) strain. Bayesian coalescent analysis showed that the P gene in Chinese RABVs have a substitution rate (3.305x10(-4) substitutions per site per year) and evolution history (592 years ago) similar to values for the glycoprotein (G) and nucleoprotein (N) reported previously. Several substitutions were found in the P gene of Chinese RABVs strains compared to the laboratory adapted and vaccine strains, whether these variations could affect the biological characteristics of Chinese RABVs need to be further investigated. The substitution rate and evolution history of P gene is similar to G and N gene, combine the topology of phylogenetic tree based on the P gene is similar to the G and N gene trees, indicate that the P, G and N genes are equally valid for examining the phylogenetics of RABVs.
On the information content of discrete phylogenetic characters.
Bordewich, Magnus; Deutschmann, Ina Maria; Fischer, Mareike; Kasbohm, Elisa; Semple, Charles; Steel, Mike
2017-12-16
Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to each species groups all of them together and so provides no insight into the relationships of the species considered. At the other extreme, a character that assigns a different state to each species also conveys no phylogenetic signal. In this manuscript, we study a natural combinatorial measure of the information content of an individual character and analyse properties of characters that provide the maximum phylogenetic information, particularly, the number of states such a character uses and how the different states have to be distributed among the species or taxa of the phylogenetic tree.
Rajakumaran, P; Vaseeharan, B; Jayakumar, R; Chidambara, R
2014-01-01
Understanding of accurate phylogenetic relationship among Penaeidae shrimp is important for academic and fisheries industry. The Morphometric and Randomly amplified polymorphic DNA (RAPD) analysis was used to make the phylogenetic relationsip among 13 Penaeidae shrimp. For morphometric analysis forty variables and total lengths of shrimp were measured for each species, and removed the effect of size variation. The size normalized values obtained was subjected to UPGMA (Unweighted Pair-Group Method with Arithmetic Mean) cluster analysis. For RAPD analysis, the four primers showed reliable differentiation between species, and used correlation coefficient between the DNA banding patterns of 13 Penaeidae species to construct UPGMA dendrogram. Phylogenetic relationship from morphometric and molecular analysis for Penaeidae species found to be congruent. We concluded that as the results from morphometry investigations concur with molecular one, phylogenetic relationship obtained for the studied Penaeidae are considered to be reliable.
Bellanger, J-M; Moreau, P-A; Corriol, G; Bidaud, A; Chalange, R; Dudova, Z; Richard, F
2015-04-01
During the last two decades, the unprecedented development of molecular phylogenetic tools has propelled an opportunity to revisit the fungal kingdom under an evolutionary perspective. Mycology has been profoundly changed but a sustained effort to elucidate large sections of the astonishing fungal diversity is still needed. Here we fill this gap in the case of Lyophyllaceae, a species-rich and ecologically diversified family of mushrooms. Assembly and genealogical concordance multigene phylogenetic analysis of a large dataset that includes original, vouchered material from expert field mycologists reveal the phylogenetic topology of the family, from higher (generic) to lower (species) levels. A comparative analysis of the most widely used phylogenetic markers in Fungi indicates that the nuc rDNA region encompassing the internal transcribed spacers 1 and 2, along with the 5.8S rDNA (ITS) and portions of the genes for RNA polymerase II second largest subunit (RPB2) is the most performing combination to resolve the broadest range of taxa within Lyophyllaceae. Eleven distinct evolutionary lineages are identified, that display partial overlap with traditional genera as well as with the phylogenetic framework previously proposed for the family. Eighty phylogenetic species are delineated, which shed light on a large number of morphological concepts, including rare and poorly documented ones. Probing these novel phylogenetic species to the barcoding method of species limit delineation, indicates that the latter method fully resolves Lyophyllaceae species, except in one clade. This case study provides the first comprehensive phylogenetic overview of Lyophyllaceae, a necessary step towards a taxonomical, ecological and nomenclatural revision of this family of mushrooms. It also proposes a set of methodological guidelines that may be of relevance for future taxonomic works in other groups of Fungi.
Comparative inflorescence development in selected Andean Santalales.
Suaza-Gaviria, Vanessa; González, Favio; Pabón-Mora, Natalia
2017-01-01
Loranthaceae, Santalaceae, and Viscaceae are the most diversified hemiparasitic families of Santalales in the Andes. Their partial inflorescences (PIs) vary from solitary flowers, or dichasia in most Santalales, to congested floral groups along articles in most Viscaceae. The atypical articled inflorescences in Phoradendreae (Viscaceae), a phylogenetic novelty restricted to this tribe, have been variously described as racemes, spikes, fascicles, or as intercalary inflorescences, but no developmental studies have been performed to compare them with the construction of PIs across Santalales. We used standard light microscopy and scanning electron microscopy to record the inflorescence development in members of Phoradendreae (Viscaceae) in comparison to those in species of Aetanthus, Gaiadendron, Oryctanthus, Passovia, and Peristethium (Loranthaceae) and Antidaphne (Santalaceae). Morphological and developmental comparisons as well as optimization onto a phylogenetic framework indicate that individual inflorescences in Santalales are indeterminate and are formed by axillary cymose PIs. The latter correspond to dichasia, either simple, compound, or variously reduced by abortion of lateral flowers, abortion of the terminal flower, or loss of bracteoles. Dichasia are plesiomorphic in Santalales. These results favor the interpretation that inflorescences in Phoradendreae are formed by the fusion of serial dichasia (=floral rows) with the main inflorescence axis via syndesmy. We compared this interpretation with the competing one based on the co-occurrence of collateral and serial floral buds. © 2017 Botanical Society of America.
Phylogenetic diversity anomaly in angiosperms between eastern Asia and eastern North America.
Qian, Hong; Jin, Yi; Ricklefs, Robert E
2017-10-24
Although eastern Asia (EAS) and eastern North America (ENA) have similar climates, plant species richness in EAS greatly exceeds that in ENA. The degree to which this diversity difference reflects the ages of the floras or their rates of evolutionary diversification has not been quantified. Measures of species diversity that do not incorporate the ages of lineages disregard the evolutionary distinctiveness of species. In contrast, phylogenetic diversity integrates both the number of species and their history of evolutionary diversification. Here we compared species diversity and phylogenetic diversity in a large number of flowering plant (angiosperm) floras distributed across EAS and ENA, two regions with similar contemporary environments and broadly shared floristic history. After accounting for climate and sample area, we found both species diversity and phylogenetic diversity to be significantly higher in EAS than in ENA. When we controlled the number of species statistically, we found that phylogenetic diversity remained substantially higher in EAS than in ENA, although it tended to converge at high latitude. This pattern held independently for herbs, shrubs, and trees. The anomaly in species and phylogenetic diversity likely resulted from differences in regional processes, related in part to high climatic and topographic heterogeneity, and a strong monsoon climate, in EAS. The broad connection between tropical and temperate floras in southern Asia also might have played a role in creating the phylogenetic diversity anomaly.
Stability and phylogenetic correlation in gut microbiota: lessons from ants and apes.
Sanders, Jon G; Powell, Scott; Kronauer, Daniel J C; Vasconcelos, Heraldo L; Frederickson, Megan E; Pierce, Naomi E
2014-03-01
Correlation between gut microbiota and host phylogeny could reflect codiversification over shared evolutionary history or a selective environment that is more similar in related hosts. These alternatives imply substantial differences in the relationship between host and symbiont, but can they be distinguished based on patterns in the community data themselves? We explored patterns of phylogenetic correlation in the distribution of gut bacteria among species of turtle ants (genus Cephalotes), which host a dense gut microbial community. We used 16S rRNA pyrosequencing from 25 Cephalotes species to show that their gut community is remarkably stable, from the colony to the genus level. Despite this overall similarity, the existing differences among species' microbiota significantly correlated with host phylogeny. We introduced a novel analytical technique to test whether these phylogenetic correlations are derived from recent bacterial evolution, as would be expected in the case of codiversification, or from broader shifts more likely to reflect environmental filters imposed by factors such as diet or habitat. We also tested this technique on a published data set of ape microbiota, confirming earlier results while revealing previously undescribed patterns of phylogenetic correlation. Our results indicated a high degree of partner fidelity in the Cephalotes microbiota, suggesting that vertical transmission of the entire community could play an important role in the evolution and maintenance of the association. As additional comparative microbiota data become available, the techniques presented here can be used to explore trends in the evolution of host-associated microbial communities. © 2014 John Wiley & Sons Ltd.
Tosh, J.; Dessein, S.; Buerki, S.; Groeninckx, I.; Mouly, A.; Bremer, B.; Smets, E. F.; De Block, P.
2013-01-01
Background and Aims Previous work on the pantropical genus Ixora has revealed an Afro-Madagascan clade, but as yet no study has focused in detail on the evolutionary history and morphological trends in this group. Here the evolutionary history of Afro-Madagascan Ixora spp. (a clade of approx. 80 taxa) is investigated and the phylogenetic trees compared with several key morphological traits in taxa occurring in Madagascar. Methods Phylogenetic relationships of Afro-Madagascan Ixora are assessed using sequence data from four plastid regions (petD, rps16, rpoB-trnC and trnL-trnF) and nuclear ribosomal external transcribed spacer (ETS) and internal transcribed spacer (ITS) regions. The phylogenetic distribution of key morphological characters is assessed. Bayesian inference (implemented in BEAST) is used to estimate the temporal origin of Ixora based on fossil evidence. Key Results Two separate lineages of Madagascan taxa are recovered, one of which is nested in a group of East African taxa. Divergence in Ixora is estimated to have commenced during the mid Miocene, with extensive cladogenesis occurring in the Afro-Madagascan clade during the Pliocene onwards. Conclusions Both lineages of Madagascan Ixora exhibit morphological innovations that are rare throughout the rest of the genus, including a trend towards pauciflorous inflorescences and a trend towards extreme corolla tube length, suggesting that the same ecological and selective pressures are acting upon taxa from both Madagascan lineages. Novel ecological opportunities resulting from climate-induced habitat fragmentation and corolla tube length diversification are likely to have facilitated species radiation on Madagascar. PMID:24142919
Phylogeny of economically important insect pests that infesting several crops species in Malaysia
NASA Astrophysics Data System (ADS)
Ghazali, Siti Zafirah; Zain, Badrul Munir Md.; Yaakop, Salmah
2014-09-01
This paper reported molecular data on insect pests of commercial crops in Peninsular Malaysia. Fifteen insect pests (Metisa plana, Calliteara horsefeldii, Cotesia vestalis, Bactrocera papayae, Bactrocera carambolae, Bactrocera latifrons, Conopomorpha cramella, Sesamia inferens, Chilo polychrysa, Rhynchophorus vulneratus, and Rhynchophorus ferrugineus) of nine crops were sampled (oil palm, coconut, paddy, cocoa, starfruit, angled loofah, guava, chili and mustard) and also four species that belong to the fern's pest (Herpetogramma platycapna) and storage and rice pests (Tribolium castaneum, Oryzaephilus surinamensis and Cadra cautella). The presented phylogeny summarized the initial phylogenetic hypothesis, which concerning by implementation of the economically important insect pests. In this paper, phylogenetic relationships among 39 individuals of 15 species that belonging to three orders under 12 genera were inferred from DNA sequences of mitochondrial marker, cytochrome oxidase subunit I (COI) and nuclear marker, ribosomal DNA 28S D2 region. The phylogenies resulted from the phylogenetic analyses of both genes are relatively similar, but differ in the sequence of evolution. Interestingly, this most recent molecular data of COI sequences data by using Bayesian Inference analysis resulted a more-resolved phylogeny that corroborated with traditional hypotheses of holometabolan relationships based on traditional hypotheses of holometabolan relationships and most of recently molecular study compared to 28S sequences. This finding provides the information on relationships of pests species, which infested several crops in Malaysia and also estimation on Holometabola's order relationships. The identification of the larval stages of insect pests could be done accurately, without waiting the emergence of adults and supported by the phylogenetic tree.
Nie, Xiaojun; Lv, Shuzuo; Zhang, Yingxin; Du, Xianghong; Wang, Le; Biradar, Siddanagouda S; Tan, Xiufang; Wan, Fanghao; Weining, Song
2012-01-01
Crofton weed (Ageratina adenophora) is one of the most hazardous invasive plant species, which causes serious economic losses and environmental damages worldwide. However, the sequence resource and genome information of A. adenophora are rather limited, making phylogenetic identification and evolutionary studies very difficult. Here, we report the complete sequence of the A. adenophora chloroplast (cp) genome based on Illumina sequencing. The A. adenophora cp genome is 150, 689 bp in length including a small single-copy (SSC) region of 18, 358 bp and a large single-copy (LSC) region of 84, 815 bp separated by a pair of inverted repeats (IRs) of 23, 755 bp. The genome contains 130 unique genes and 18 duplicated in the IR regions, with the gene content and organization similar to other Asteraceae cp genomes. Comparative analysis identified five DNA regions (ndhD-ccsA, psbI-trnS, ndhF-ycf1, ndhI-ndhG and atpA-trnR) containing parsimony-informative characters higher than 2%, which may be potential informative markers for barcoding and phylogenetic analysis. Repeat structure, codon usage and contraction of the IR were also investigated to reveal the pattern of evolution. Phylogenetic analysis demonstrated a sister relationship between A. adenophora and Guizotia abyssinica and supported a monophyly of the Asterales. We have assembled and analyzed the chloroplast genome of A. adenophora in this study, which was the first sequenced plastome in the Eupatorieae tribe. The complete chloroplast genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family.
Onto-phylogenetic aspect of myotomal myogenesis in Chordata.
Kiełbówna, Leokadia; Daczewska, Małgorzata
2004-01-01
This paper presents an onto- and phylogenetic aspect of myotoamal myogenesis in Chordata. A comparative analysis of early stages of myotomal myogenesis in Chordata indicates that the myogenic process in this phylum underwent evolutionary changes. The first stage of the process is myogenesis leading to development of mononucleate mature muscle cells, the most advanced stage is formation of multinucleate muscle fibres.
USDA-ARS?s Scientific Manuscript database
The present study was conducted to better understand how the phylogenetic diversity of true morels (Morchella) in Turkey compares with species found in other regions of the world. The current research builds on our recently published survey of 10 Turkish provinces and another of the world in which D...
ERIC Educational Resources Information Center
Julius, Matthew L.; Schoenfuss, Heiko L.
2006-01-01
This laboratory exercise introduces students to a fundamental tool in evolutionary biology--phylogenetic inference. Students are required to create a data set via observation and through mining preexisting data sets. These student data sets are then used to develop and compare competing hypotheses of vertebrate phylogeny. The exercise uses readily…
Alternaria section Alternaria: Species, formae speciales or pathotypes?
Woudenberg, J.H.C.; Seidl, M.F.; Groenewald, J.Z.; de Vries, M.; Stielow, J.B.; Thomma, B.P.H.J.; Crous, P.W.
2015-01-01
The cosmopolitan fungal genus Alternaria consists of multiple saprophytic and pathogenic species. Based on phylogenetic and morphological studies, the genus is currently divided into 26 sections. Alternaria sect. Alternaria contains most of the small-spored Alternaria species with concatenated conidia, including important plant, human and postharvest pathogens. Species within sect. Alternaria have been mostly described based on morphology and / or host-specificity, yet molecular variation between them is minimal. To investigate whether the described morphospecies within sect. Alternaria are supported by molecular data, whole-genome sequencing of nine Alternaria morphospecies supplemented with transcriptome sequencing of 12 Alternaria morphospecies as well as multi-gene sequencing of 168 Alternaria isolates was performed. The assembled genomes ranged in size from 33.3–35.2 Mb within sect. Alternaria and from 32.0–39.1 Mb for all Alternaria genomes. The number of repetitive sequences differed significantly between the different Alternaria genomes; ranging from 1.4–16.5 %. The repeat content within sect. Alternaria was relatively low with only 1.4–2.7 % of repeats. Whole-genome alignments revealed 96.7–98.2 % genome identity between sect. Alternaria isolates, compared to 85.1–89.3 % genome identity for isolates from other sections to the A. alternata reference genome. Similarly, 1.4–2.8 % and 0.8–1.8 % single nucleotide polymorphisms (SNPs) were observed in genomic and transcriptomic sequences, respectively, between isolates from sect. Alternaria, while the percentage of SNPs found in isolates from different sections compared to the A. alternata reference genome was considerably higher; 8.0–10.3 % and 6.1–8.5 %. The topology of a phylogenetic tree based on the whole-genome and transcriptome reads was congruent with multi-gene phylogenies based on commonly used gene regions. Based on the genome and transcriptome data, a set of core proteins was extracted, and primers were designed on two gene regions with a relatively low degree of conservation within sect. Alternaria (96.8 and 97.3 % conservation). Their potential discriminatory power within sect. Alternaria was tested next to nine commonly used gene regions in sect. Alternaria, namely the SSU, LSU, ITS, gapdh, rpb2, tef1, Alt a 1, endoPG and OPA10-2 gene regions. The phylogenies from the two gene regions with a relatively low conservation, KOG1058 and KOG1077, could not distinguish the described morphospecies within sect. Alternaria more effectively than the phylogenies based on the commonly used gene regions for Alternaria. Based on genome and transcriptome comparisons and molecular phylogenies, Alternaria sect. Alternaria consists of only 11 phylogenetic species and one species complex. Thirty-five morphospecies, which cannot be distinguished based on the multi-gene phylogeny, are synonymised under A. alternata. By providing guidelines for the naming and identification of phylogenetic species in Alternaria sect. Alternaria, this manuscript provides a clear and stable species classification in this section. PMID:26951037
Hesamizadeh, Khashayar; Alavian, Seyed Moayed; Najafi Tireh Shabankareh, Azar; Sharafi, Heidar
2016-12-01
Hepatitis C virus (HCV) is characterized by a high degree of genetic heterogeneity and classified into 7 genotypes and different subtypes. It heterogeneously distributed through various risk groups and geographical regions. A well-established phylogenetic relationship can simplify the tracing of HCV hierarchical strata into geographical regions. The current study aimed to find genetic phylogeny of subtypes 1a and 1b of HCV isolates based on NS5B nucleotide sequences in Iran and other members of Eastern Mediterranean regional office of world health organization, as well as other Middle Eastern countries, with a systematic review of available published and unpublished studies. The phylogenetic analyses were performed based on the nucleotide sequences of NS5B gene of HCV genotype 1 (HCV-1), which were registered in the GenBank database. The literature review was performed in two steps: 1) searching studies evaluating the NS5B sequences of HCV-1, on PubMed, Scopus, and Web of Science, and 2) Searching sequences of unpublished studies registered in the GenBank database. In this study, 442 sequences from HCV-1a and 232 from HCV-1b underwent phylogenetic analysis. Phylogenetic analysis of all sequences revealed different clusters in the phylogenetic trees. The results showed that the proportion of HCV-1a and -1b isolates from Iranian patients probably originated from domestic sources. Moreover, the HCV-1b isolates from Iranian patients may have similarities with the European ones. In this study, phylogenetic reconstruction of HCV-1 sequences clearly indicated for molecular tracing and ancestral relationships of the HCV genotypes in Iran, and showed the likelihood of domestic origin for HCV-1a and various origin for HCV-1b.
Tang, Cuong Q; Humphreys, Aelys M; Fontaneto, Diego; Barraclough, Timothy G; Paradis, Emmanuel
2014-01-01
Coalescent-based species delimitation methods combine population genetic and phylogenetic theory to provide an objective means for delineating evolutionarily significant units of diversity. The generalised mixed Yule coalescent (GMYC) and the Poisson tree process (PTP) are methods that use ultrametric (GMYC or PTP) or non-ultrametric (PTP) gene trees as input, intended for use mostly with single-locus data such as DNA barcodes. Here, we assess how robust the GMYC and PTP are to different phylogenetic reconstruction and branch smoothing methods. We reconstruct over 400 ultrametric trees using up to 30 different combinations of phylogenetic and smoothing methods and perform over 2000 separate species delimitation analyses across 16 empirical data sets. We then assess how variable diversity estimates are, in terms of richness and identity, with respect to species delimitation, phylogenetic and smoothing methods. The PTP method generally generates diversity estimates that are more robust to different phylogenetic methods. The GMYC is more sensitive, but provides consistent estimates for BEAST trees. The lower consistency of GMYC estimates is likely a result of differences among gene trees introduced by the smoothing step. Unresolved nodes (real anomalies or methodological artefacts) affect both GMYC and PTP estimates, but have a greater effect on GMYC estimates. Branch smoothing is a difficult step and perhaps an underappreciated source of bias that may be widespread among studies of diversity and diversification. Nevertheless, careful choice of phylogenetic method does produce equivalent PTP and GMYC diversity estimates. We recommend simultaneous use of the PTP model with any model-based gene tree (e.g. RAxML) and GMYC approaches with BEAST trees for obtaining species hypotheses. PMID:25821577
Garamszegi, László Zsolt
2011-02-01
Plasmodium parasites, the causative agents of malaria, are generally considered as harmful parasites, but many of them cause mild symptoms. Little is known about the evolutionary history and phylogenetic constraints that generate this interspecific variation in virulence due to uncertainties about the phylogenetic associations of parasites. Here, to account for such phylogenetic uncertainty, phylogenetic methods based on Bayesian statistics were followed in combination with sequence data from five genes to estimate the ancestral state of virulence in primate Plasmodium parasites. When recent parasites were categorised according to the damage caused to the host, Bayesian estimates of ancestral states indicated that the acquisition of a harmful host exploitation strategy is more likely to be a recent evolutionary event than a result of an ancient change in a character state altering virulence. On the contrary, there was more evidence for moderate host exploitation having a deep origin along the phylogenetic tree. Moreover, the evolution of host severity is determined by the phylogenetic relationships of parasites, as severity gains did not appear randomly on the evolutionary tree. Such phylogenetic constraints can be mediated by the acquisition of virulence genes. As the impact of a parasite on a host is the result of both the parasite's investment in reproduction and host sensitivity, virulence was also estimated by calculating peak parasitemia after eliminating host effects. A directional random-walk evolutionary model showed that the ancestral primate malarias reproduced at very low parasitemia in their hosts. Consequently, the extreme variation in the outcome of malaria infection in different host species can be better understood in light of the phylogeny of parasites. Copyright © 2010 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Dornburg, Alex; Friedman, Matt; Near, Thomas J
2015-08-01
Elopomorpha is one of the three main clades of living teleost fishes and includes a range of disparate lineages including eels, tarpons, bonefishes, and halosaurs. Elopomorphs were among the first groups of fishes investigated using Hennigian phylogenetic methods and continue to be the object of intense phylogenetic scrutiny due to their economic significance, diversity, and crucial evolutionary status as the sister group of all other teleosts. While portions of the phylogenetic backbone for Elopomorpha are consistent between studies, the relationships among Albula, Pterothrissus, Notacanthiformes, and Anguilliformes remain contentious and difficult to evaluate. This lack of phylogenetic resolution is problematic as fossil lineages are often described and placed taxonomically based on an assumed sister group relationship between Albula and Pterothrissus. In addition, phylogenetic studies using morphological data that sample elopomorph fossil lineages often do not include notacanthiform or anguilliform lineages, potentially introducing a bias toward interpreting fossils as members of the common stem of Pterothrissus and Albula. Here we provide a phylogenetic analysis of DNA sequences sampled from multiple nuclear genes that include representative taxa from Albula, Pterothrissus, Notacanthiformes and Anguilliformes. We integrate our molecular dataset with a morphological character matrix that spans both living and fossil elopomorph lineages. Our results reveal substantial uncertainty in the placement of Pterothrissus as well as all sampled fossil lineages, questioning the stability of the taxonomy of fossil Elopomorpha. However, despite topological uncertainty, our integration of fossil lineages into a Bayesian time calibrated framework provides divergence time estimates for the clade that are consistent with previously published age estimates based on the elopomorph fossil record and molecular estimates resulting from traditional node-dating methods. Copyright © 2015 Elsevier Inc. All rights reserved.
Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum
DOE Office of Scientific and Technical Information (OSTI.GOV)
Riley, Robert; Salamov, Asaf; Otillar, Robert
Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes revealsmore » that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has liginolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.« less
Belskii, Eugen A; Mikryukov, Vladimir S
2018-05-07
The effects of industrial pollution on bird diversity have been widely studied using traditional diversity measures, which assume all species to be equivalent. We compared species richness and Shannon index with distance-based measures of taxonomic, functional, and phylogenetic diversity (the abundance-weighted mean nearest taxon distances), which describe within-community dissimilarity at terminal branches. Analysis of dissimilarity can shed light on the processes underlying community assembly, i.e., environmental filtering decreases dissimilarity whereas competitive exclusion increases it. In the 2-year study near Karabash and Revda copper smelters in Russia, point counts of nesting birds and habitat descriptions were taken at 10 sites (40 plots) along each pollution gradient. The abundance and diversity of birds showed good repeatability in both regions. The total density of birds, number of species per plot, and Shannon diversity decreased at high toxic load in both regions. The taxonomic, functional, and phylogenetic nearest taxon distances showed the same pattern within regions. Species dissimilarity within communities increased with pollution in Karabash (due to loss of functionally similar species), but did not change in Revda (due to mass replacement of forest species by species of open habitats). Pollution-induced changes in bird communities near Karabash were greater due to the stronger deterioration of the forest ecosystems and less favorable natural conditions (more arid climate, lower diversity and vitality of the tree stand and understorey) compared to Revda. This study emphasizes the need for a multi-level approach to the analysis of bird communities using traditional indices of diversity, functional, taxonomic, or phylogenetic distances between species and environmental variables.
Pollinator shifts as triggers of speciation in painted petal irises (Lapeirousia: Iridaceae)
Forest, Félix; Goldblatt, Peter; Manning, John C.; Baker, David; Colville, Jonathan F.; Devey, Dion S.; Jose, Sarah; Kaye, Maria; Buerki, Sven
2014-01-01
Background and Aims Adaptation to different pollinators has been hypothesized as one of the main factors promoting the formation of new species in the Cape region of South Africa. Other researchers favour alternative causes such as shifts in edaphic preferences. Using a phylogenetic framework and taking into consideration the biogeographical scenario explaining the distribution of the group as well as the distribution of pollinators, this study compares pollination strategies with substrate adaptations to develop hypotheses of the primary factors leading to speciation in Lapeirousia (Iridaceae), a genus of corm-bearing geophytes well represented in the Cape and presenting an important diversity of pollination syndromes and edaphic preferences. Methods Phylogenetic relationships are reconstructed within Lapeirousia using nuclear and plastid DNA sequence data. State-of-the-art methods in biogeography, divergence time estimation, character optimization and diversification rate assessments are used to examine the evolution of pollination syndromes and substrate shifts in the history of the group. Based on the phylogenetic results, ecological factors are compared for nine sister species pairs in Lapeirousia. Key Results Seventeen pollinator shifts and ten changes in substrate types were inferred during the evolution of the genus Lapeirousia. Of the nine species pairs examined, all show divergence in pollination syndromes, while only four pairs present different substrate types. Conclusions The available evidence points to a predominant influence of pollinator shifts over substrate types on the speciation process within Lapeirousia, contrary to previous studies that favoured a more important role for edaphic factors in these processes. This work also highlights the importance of biogeographical patterns in the study of pollination syndromes. PMID:24323246
Cenozoic climate change shaped the evolutionary ecophysiology of the Cupressaceae conifers
Pittermann, Jarmila; Stuart, Stephanie A.; Dawson, Todd E.; Moreau, Astrid
2012-01-01
The Cupressaceae clade has the broadest diversity in habitat and morphology of any conifer family. This clade is characterized by highly divergent physiological strategies, with deciduous swamp-adapted genera-like Taxodium at one extreme, and evergreen desert genera-like Cupressus at the other. The size disparity within the Cupressaceae is equally impressive, with members ranging from 5-m-tall juniper shrubs to 100-m-tall redwood trees. Phylogenetic studies demonstrate that despite this variation, these taxa all share a single common ancestor; by extension, they also share a common ancestral habitat. Here, we use a common-garden approach to compare xylem and leaf-level physiology in this family. We then apply comparative phylogenetic methods to infer how Cenozoic climatic change shaped the morphological and physiological differences between modern-day members of the Cupressaceae. Our data show that drought-resistant crown clades (the Cupressoid and Callitroid clades) most likely evolved from drought-intolerant Mesozoic ancestors, and that this pattern is consistent with proposed shifts in post-Eocene paleoclimates. We also provide evidence that within the Cupressaceae, the evolution of drought-resistant xylem is coupled to increased carbon investment in xylem tissue, reduced xylem transport efficiency, and at the leaf level, reduced photosynthetic capacity. Phylogenetically based analyses suggest that the ancestors of the Cupressaceae were dependent upon moist habitats, and that drought-resistant physiology developed along with increasing habitat aridity from the Oligocene onward. We conclude that the modern biogeography of the Cupressaceae conifers was shaped in large part by their capacity to adapt to drought. PMID:22628565
Cenozoic climate change shaped the evolutionary ecophysiology of the Cupressaceae conifers.
Pittermann, Jarmila; Stuart, Stephanie A; Dawson, Todd E; Moreau, Astrid
2012-06-12
The Cupressaceae clade has the broadest diversity in habitat and morphology of any conifer family. This clade is characterized by highly divergent physiological strategies, with deciduous swamp-adapted genera-like Taxodium at one extreme, and evergreen desert genera-like Cupressus at the other. The size disparity within the Cupressaceae is equally impressive, with members ranging from 5-m-tall juniper shrubs to 100-m-tall redwood trees. Phylogenetic studies demonstrate that despite this variation, these taxa all share a single common ancestor; by extension, they also share a common ancestral habitat. Here, we use a common-garden approach to compare xylem and leaf-level physiology in this family. We then apply comparative phylogenetic methods to infer how Cenozoic climatic change shaped the morphological and physiological differences between modern-day members of the Cupressaceae. Our data show that drought-resistant crown clades (the Cupressoid and Callitroid clades) most likely evolved from drought-intolerant Mesozoic ancestors, and that this pattern is consistent with proposed shifts in post-Eocene paleoclimates. We also provide evidence that within the Cupressaceae, the evolution of drought-resistant xylem is coupled to increased carbon investment in xylem tissue, reduced xylem transport efficiency, and at the leaf level, reduced photosynthetic capacity. Phylogenetically based analyses suggest that the ancestors of the Cupressaceae were dependent upon moist habitats, and that drought-resistant physiology developed along with increasing habitat aridity from the Oligocene onward. We conclude that the modern biogeography of the Cupressaceae conifers was shaped in large part by their capacity to adapt to drought.
Asakura, H; Makino, S; Kobori, H; Watarai, M; Shirahata, T; Ikeda, T; Takeshi, K
2001-08-01
Nucleotide sequences of Shiga toxin (Stx) genes in STEC from various origins were determined and characterized by phylogenetic analysis based on Shiga toxin (Stx) with those deposited in GenBank. The phylogenetic trees placed Stx1 and Stx2 into two and five groups respectively, and indicated that Stx1 in sheep-origin STEC were placed into a different group from those in other STEC, and that Stx2 of deer-origin STEC also belonged to the unique group and appeared to be distantly related to human-origin STEC. On the other hand, Stx of STEC isolated from cattle, seagulls and flies were closely related to those of human-origin STEC. Such a diversity of Stx suggested that STEC might be widely disseminated in many animal species, and be dependent on their host species or their habitat. In addition, the active sites in both toxins were compared; the active sites in both subunits of Stx in all the animal-origin STEC were identical to those in human-origin STEC, suggesting that all the toxin of STEC from animals might be also cytotoxic, and therefore, such animal-origin STEC might have potential pathogenicity for humans.
Asakura, H.; Makino, S.; Kobori, H.; Watarai, M.; Shirahata, T.; Ikeda, T.; Takeshi, K.
2001-01-01
Nucleotide sequences of Shiga toxin (Stx) genes in STEC from various origins were determined and characterized by phylogenetic analysis based on Shiga toxin (Stx) with those deposited in GenBank. The phylogenetic trees placed Stx1 and Stx2 into two and five groups respectively, and indicated that Stx1 in sheep-origin STEC were placed into a different group from those in other STEC, and that Stx2 of deer-origin STEC also belonged to the unique group and appeared to be distantly related to human-origin STEC. On the other hand, Stx of STEC isolated from cattle, seagulls and flies were closely related to those of human-origin STEC. Such a diversity of Stx suggested that STEC might be widely disseminated in many animal species, and be dependent on their host species or their habitat. In addition, the active sites in both toxins were compared; the active sites in both subunits of Stx in all the animal-origin STEC were identical to those in human-origin STEC, suggesting that all the toxin of STEC from animals might be also cytotoxic, and therefore, such animal-origin STEC might have potential pathogenicity for humans. PMID:11561972
Lentendu, Guillaume; Mahé, Frédéric; Bass, David; Rueckert, Sonja; Stoeck, Thorsten; Dunthorn, Micah
2018-05-30
Tropical animals and plants are known to have high alpha diversity within forests, but low beta diversity between forests. By contrast, it is unknown whether microbes inhabiting the same ecosystems exhibit similar biogeographic patterns. To evaluate the biogeographies of tropical protists, we used metabarcoding data of species sampled in the soils of three lowland Neotropical rainforests. Taxa-area and distance-decay relationships for three of the dominant protist taxa and their subtaxa were estimated at both the OTU and phylogenetic levels, with presence-absence and abundance-based measures. These estimates were compared to null models. High local alpha and low regional beta diversity patterns were consistently found for both the parasitic Apicomplexa and the largely free-living Cercozoa and Ciliophora. Similar to animals and plants, the protists showed spatial structures between forests at the OTU and phylogenetic levels, and only at the phylogenetic level within forests. These results suggest that the biogeographies of macro- and micro-organismal eukaryotes in lowland Neotropical rainforests are partially structured by the same general processes. However, and unlike the animals and plants, the protist OTUs did not exhibit spatial structures within forests, which hinders our ability to estimate the local and regional diversity of protists in tropical forests. © 2018 John Wiley & Sons Ltd.
Rezadehbashi, Maryam; Baldwin, Susan A.
2018-01-01
Biochemical reactors (BCRs) based on the stimulation of sulphate-reducing microorganisms (SRM) are emerging semi-passive remediation technologies for treatment of mine-influenced water. Their successful removal of metals and sulphate has been proven at the pilot-scale, but little is known about the types of SRM that grow in these systems and whether they are diverse or restricted to particular phylogenetic or taxonomic groups. A phylogenetic study of four established pilot-scale BCRs on three different mine sites compared the diversity of SRM growing in them. The mine sites were geographically distant from each other, nevertheless the BCRs selected for similar SRM types. Clostridia SRM related to Desulfosporosinus spp. known to be tolerant to high concentrations of copper were members of the core microbial community. Members of the SRM family Desulfobacteraceae were dominant, particularly those related to Desulfatirhabdium butyrativorans. Methanogens were dominant archaea and possibly were present at higher relative abundances than SRM in some BCRs. Both hydrogenotrophic and acetoclastic types were present. There were no strong negative or positive co-occurrence correlations of methanogen and SRM taxa. Knowing which SRM inhabit successfully operating BCRs allows practitioners to target these phylogenetic groups when selecting inoculum for future operations. PMID:29473875
Phylogenetic convolutional neural networks in metagenomics.
Fioravanti, Diego; Giarratano, Ylenia; Maggio, Valerio; Agostinelli, Claudio; Chierici, Marco; Jurman, Giuseppe; Furlanello, Cesare
2018-03-08
Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user.
Diversification of Rosaceae since the Late Cretaceous based on plastid phylogenomics.
Zhang, Shu-Dong; Jin, Jian-Jun; Chen, Si-Yun; Chase, Mark W; Soltis, Douglas E; Li, Hong-Tao; Yang, Jun-Bo; Li, De-Zhu; Yi, Ting-Shuang
2017-05-01
Phylogenetic relationships in Rosaceae have long been problematic because of frequent hybridisation, apomixis and presumed rapid radiation, and their historical diversification has not been clarified. With 87 genera representing all subfamilies and tribes of Rosaceae and six of the other eight families of Rosales (outgroups), we analysed 130 newly sequenced plastomes together with 12 from GenBank in an attempt to reconstruct deep relationships and reveal temporal diversification of this family. Our results highlight the importance of improving sequence alignment and the use of appropriate substitution models in plastid phylogenomics. Three subfamilies and 16 tribes (as previously delimited) were strongly supported as monophyletic, and their relationships were fully resolved and strongly supported at most nodes. Rosaceae were estimated to have originated during the Late Cretaceous with evidence for rapid diversification events during several geological periods. The major lineages rapidly diversified in warm and wet habits during the Late Cretaceous, and the rapid diversification of genera from the early Oligocene onwards occurred in colder and drier environments. Plastid phylogenomics offers new and important insights into deep phylogenetic relationships and the diversification history of Rosaceae. The robust phylogenetic backbone and time estimates we provide establish a framework for future comparative studies on rosaceous evolution. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Krause, Cornelia; Garnica, Sigisfredo; Bauer, Robert; Nebel, Martin
2011-09-01
A total of 35 population samples of the liverwort genera Aneura (A. pinguis) and Riccardia (R. latifrons, R. multifida, and R. palmata) were sampled from diverse habitats and geographical provenances in Germany, Austria, and Switzerland. Light and transmission electron microscopy were used to characterise the morphological features of the associations, and phylogenetic analyses based on internal transcribed spacers (ITS) and the D1/D2 regions of the fungal 28S rDNA were used to address diversity and phylogenetic relationships. By comparing the cellular structures of the plant-fungus interactions, we recognised the following states of fungal colonisation within the thalli: fungus-free, epiphytic, intercellular, and intracellular. Colonising hyphae showed dolipores with imperforate parenthesomes, slime bodies, and multilayered walls. Colonised liverwort cells had pleomorphic nuclei and elongated starch-free chloroplasts with distinctive grana. Our analyses revealed six phylogenetic groups of tulasnelloid fungi associated with liverworts, where major lineages mostly share similar host and/or ecological specialisations. The mode of colonisation of the tulasnelloid mycobionts in Aneura and Riccardia sharing identical fungal sequences is different. Consequently, the mode of colonisation may be host-dependent. Finally, our findings demonstrate that Aneuraceae are model organisms for evolutionary studies of symbiotic associations between liverworts and fungi. Copyright © 2011 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
McGovern, Emily; McCabe, Matthew S; Cormican, Paul; Popova, Milka; Keogh, Kate; Kelly, Alan K; Kenny, David A; Waters, Sinead M
2017-10-12
Methane generated during enteric fermentation in ruminant livestock species is a major contributor to global anthropogenic greenhouse gas emissions. A period of moderate feed restriction followed by ad libitum access to feed is widely applied in cattle management to exploit the animal's compensatory growth potential and reduce feed costs. In the present study, we utilised microbial RNA from rumen digesta samples to assess the phylogenetic diversity of transcriptionally active methanogens from feed-restricted and non-restricted animals. To determine the contribution of different rumen methanogens to methanogenesis during dietary restriction of cattle, we conducted high-throughput mcrA cDNA amplicon sequencing on an Illumina MiSeq and analysed both the abundance and phylogenetic origin of different mcrA cDNA sequences. When compared to their unrestricted contemporaries, in feed-restricted animals, the methanogenic activity, based on mcrA transcript abundance, of Methanobrevibacter gottschalkii clade increased while the methanogenic activity of the Methanobrevibacter ruminantium clade and members of the Methanomassiliicoccaceae family decreased. This study shows that the quantity of feed consumed can evoke large effects on the composition of methanogenically active species in the rumen of cattle. These data potentially have major implications for targeted CH 4 mitigation approaches such as anti-methanogen vaccines and/or tailored dietary management.
Logacheva, Maria D; Samigullin, Tahir H; Dhingra, Amit; Penin, Aleksey A
2008-01-01
Background Chloroplast genome sequences are extremely informative about species-interrelationships owing to its non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent. Results We report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in buckwheat chloroplast genome is similar to Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the bayesian phylogeny inference based on first two codon positions Amborella united with Nymphaeales. Conclusion Comparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences. Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales supports the sister relationship between Caryophyllales and asterids. PMID:18492277
Wang, Juan; Zhang, Li; Zhang, Qi-Lin; Zhou, Min-Qiang; Wang, Xiao-Tong; Yang, Xing-Zhuo; Yuan, Ming-Long
2017-01-01
The family Miridae is one of the most species-rich families of insects. To better understand the diversity and evolution of mirids, we determined the mitogenome of Lygus pratenszs and re-sequenced the mitogenomes of four mirids (i.e., Apolygus lucorum , Adelphocoris suturalis , Ade. fasciaticollis and Ade. lineolatus ). We performed a comparative analysis for 15 mitogenomic sequences representing 11 species of five genera within Miridae and evaluated the potential of these mitochondrial genes as molecular markers. Our results showed that the general mitogenomic features (gene content, gene arrangement, base composition and codon usage) were well conserved among these mirids. Four protein-coding genes (PCGs) ( cox1 , cox3 , nad1 and nad3 ) had no length variability, where nad5 showed the largest size variation; no intraspecific length variation was found in PCGs. Two PCGs ( nad4 and nad5 ) showed relatively high substitution rates at the nucleotide and amino acid levels, where cox1 had the lowest substitution rate. The Ka/Ks values for all PCGs were far lower than 1 (<0.59), but the Ka/Ks values of cox1 -barcode sequences were always larger than 1 (1.34 -15.20), indicating that the 658 bp sequences of cox1 may be not the appropriate marker due to positive selection or selection relaxation. Phylogenetic analyses based on two concatenated mitogenomic datasets consistently supported the relationship of Nesidiocoris + ( Trigonotylus + ( Adelphocoris + ( Apolygus + Lygus ))), as revealed by nad4 , nad5 , rrnL and the combined 22 transfer RNA genes (tRNAs), respectively. Taken sequence length, substitution rate and phylogenetic signal together, the individual genes ( nad4 , nad5 and rrnL ) and the combined 22 tRNAs could been used as potential molecular markers for Miridae at various taxonomic levels. Our results suggest that it is essential to evaluate and select suitable markers for different taxa groups when performing phylogenetic, population genetic and species identification studies.
A phylogenetic analysis of the megadiverse Chalcidoidea (Hymenoptera)
USDA-ARS?s Scientific Manuscript database
Chalcidoidea (Hymenoptera) are extremely diverse with an estimated 500,000 species. We present the first phylogenetic analysis of the superfamily based on a cladistic analysis of both morphological and molecular data. A total of 233 morphological characters were scored for 300 taxa and 265 genera, a...