Sample records for phylogenetically informed comparative

  1. Comparative Analysis of Begonia Plastid Genomes and Their Utility for Species-Level Phylogenetics

    PubMed Central

    Harrison, Nicola; Harrison, Richard J.

    2016-01-01

    Recent, rapid radiations make species-level phylogenetics difficult to resolve. We used a multiplexed, high-throughput sequencing approach to identify informative genomic regions to resolve phylogenetic relationships at low taxonomic levels in Begonia from a survey of sixteen species. A long-range PCR method was used to generate draft plastid genomes to provide a strong phylogenetic backbone, identify fast evolving regions and provide informative molecular markers for species-level phylogenetic studies in Begonia. PMID:27058864

  2. Taking the First Steps towards a Standard for Reporting on Phylogenies: Minimal Information about a Phylogenetic Analysis (MIAPA)

    PubMed Central

    LEEBENS-MACK, JIM; VISION, TODD; BRENNER, ERIC; BOWERS, JOHN E.; CANNON, STEVEN; CLEMENT, MARK J.; CUNNINGHAM, CLIFFORD W.; dePAMPHILIS, CLAUDE; deSALLE, ROB; DOYLE, JEFF J.; EISEN, JONATHAN A.; GU, XUN; HARSHMAN, JOHN; JANSEN, ROBERT K.; KELLOGG, ELIZABETH A.; KOONIN, EUGENE V.; MISHLER, BRENT D.; PHILIPPE, HERVÉ; PIRES, J. CHRIS; QIU, YIN-LONG; RHEE, SEUNG Y.; SJÖLANDER, KIMMEN; SOLTIS, DOUGLAS E.; SOLTIS, PAMELA S.; STEVENSON, DENNIS W.; WALL, KERR; WARNOW, TANDY; ZMASEK, CHRISTIAN

    2011-01-01

    In the eight years since phylogenomics was introduced as the intersection of genomics and phylogenetics, the field has provided fundamental insights into gene function, genome history and organismal relationships. The utility of phylogenomics is growing with the increase in the number and diversity of taxa for which whole genome and large transcriptome sequence sets are being generated. We assert that the synergy between genomic and phylogenetic perspectives in comparative biology would be enhanced by the development and refinement of minimal reporting standards for phylogenetic analyses. Encouraged by the development of the Minimum Information About a Microarray Experiment (MIAME) standard, we propose a similar roadmap for the development of a Minimal Information About a Phylogenetic Analysis (MIAPA) standard. Key in the successful development and implementation of such a standard will be broad participation by developers of phylogenetic analysis software, phylogenetic database developers, practitioners of phylogenomics, and journal editors. PMID:16901231

  3. Random sampling of constrained phylogenies: conducting phylogenetic analyses when the phylogeny is partially known.

    PubMed

    Housworth, E A; Martins, E P

    2001-01-01

    Statistical randomization tests in evolutionary biology often require a set of random, computer-generated trees. For example, earlier studies have shown how large numbers of computer-generated trees can be used to conduct phylogenetic comparative analyses even when the phylogeny is uncertain or unknown. These methods were limited, however, in that (in the absence of molecular sequence or other data) they allowed users to assume that no phylogenetic information was available or that all possible trees were known. Intermediate situations where only a taxonomy or other limited phylogenetic information (e.g., polytomies) are available are technically more difficult. The current study describes a procedure for generating random samples of phylogenies while incorporating limited phylogenetic information (e.g., four taxa belong together in a subclade). The procedure can be used to conduct comparative analyses when the phylogeny is only partially resolved or can be used in other randomization tests in which large numbers of possible phylogenies are needed.
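
    The idea of sampling random phylogenies that respect a known subclade can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes trees are built by random pairwise joining (which does not sample all topologies with equal probability), and the function names `random_join`, `random_constrained_tree`, `leaves` and `has_clade` are hypothetical.

```python
import random


def random_join(nodes, rng):
    """Join randomly chosen pairs of nodes until one nested-tuple tree remains."""
    nodes = list(nodes)
    while len(nodes) > 1:
        i, j = rng.sample(range(len(nodes)), 2)
        a, b = nodes[i], nodes[j]
        # Delete the higher index first so the lower index stays valid.
        for k in sorted((i, j), reverse=True):
            del nodes[k]
        nodes.append((a, b))
    return nodes[0]


def random_constrained_tree(taxa, clade, rng):
    """Random tree in which the taxa in `clade` (non-empty) form a subclade."""
    clade = set(clade)
    inside = [t for t in taxa if t in clade]
    outside = [t for t in taxa if t not in clade]
    # Resolve the constrained group first, then treat it as a single unit.
    subtree = random_join(inside, rng)
    return random_join(outside + [subtree], rng)


def leaves(tree):
    """Set of tip labels below a nested-tuple tree."""
    if isinstance(tree, tuple):
        return leaves(tree[0]) | leaves(tree[1])
    return {tree}


def has_clade(tree, clade):
    """True if some subtree has exactly `clade` as its set of tips."""
    if leaves(tree) == set(clade):
        return True
    if isinstance(tree, tuple):
        return has_clade(tree[0], clade) or has_clade(tree[1], clade)
    return False


# Example: eight taxa, of which four are known to belong together.
rng = random.Random(0)
taxa = list("ABCDEFGH")
known_clade = {"A", "B", "C", "D"}
sample = [random_constrained_tree(taxa, known_clade, rng) for _ in range(200)]
print(sample[0])
```

    Each sampled tree could then be passed to a comparative analysis, and the spread of results across trees read as the effect of the unresolved part of the phylogeny.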

  4. SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations

    PubMed Central

    2013-01-01

    Background Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408
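
    The expanded-tree approach described above (add a missing species at a random location within its clade, then record pairwise distances) can be sketched in a few lines. This is an illustrative sketch and not the SUNPLIN implementation: it assumes an ultrametric tree stored as a child-to-parent map with node ages, and the names `expand`, `patristic` and `distance_matrix` are hypothetical.

```python
import random


def ancestors(parent, node):
    """Path from `node` up to the root, including `node` itself."""
    path = [node]
    while node in parent:
        node = parent[node]
        path.append(node)
    return path


def expand(parent, age, clade_root, new_tip, rng):
    """Return copies of the tree with `new_tip` attached on a random edge
    strictly inside the clade rooted at `clade_root`."""
    parent, age = dict(parent), dict(age)
    candidates = [c for c in parent if clade_root in ancestors(parent, c)[1:]]
    child = rng.choice(candidates)
    old_parent = parent[child]
    new_node = "anc_" + new_tip
    # Split the chosen edge at a random age between its two ends.
    age[new_node] = rng.uniform(age[child], age[old_parent])
    age[new_tip] = 0.0
    parent[new_node] = old_parent
    parent[child] = new_node
    parent[new_tip] = new_node
    return parent, age


def patristic(parent, age, a, b):
    """Path length between tips a and b through their most recent common ancestor."""
    anc_a = set(ancestors(parent, a))
    for node in ancestors(parent, b):
        if node in anc_a:
            return (age[node] - age[a]) + (age[node] - age[b])
    raise ValueError("tips are not in the same tree")


def distance_matrix(parent, age):
    """Pairwise patristic distances between all tips, keyed by (tip, tip)."""
    tips = sorted(n for n in age if n not in parent.values())
    return {(a, b): patristic(parent, age, a, b) for a in tips for b in tips}


# Toy tree ((A,B)n1,(C,D)n2)root; species E is known only to belong to clade n2.
parent = {"A": "n1", "B": "n1", "C": "n2", "D": "n2", "n1": "root", "n2": "root"}
age = {"A": 0.0, "B": 0.0, "C": 0.0, "D": 0.0, "n1": 1.0, "n2": 2.0, "root": 3.0}
rng = random.Random(1)
matrices = []
for _ in range(100):
    p, a = expand(parent, age, "n2", "E", rng)
    matrices.append(distance_matrix(p, a))
```

    The variation of an entry such as `("E", "C")` across `matrices` is one way to express the uncertainty introduced by the unknown placement of E.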

  5. SUNPLIN: simulation with uncertainty for phylogenetic investigations.

    PubMed

    Martins, Wellington S; Carmo, Welton C; Longo, Humberto J; Rosa, Thierson C; Rangel, Thiago F

    2013-11-15

    Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.

  6. Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks.

    PubMed

    Oh, S June; Joung, Je-Gun; Chang, Jeong-Ho; Zhang, Byoung-Tak

    2006-06-06

    To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees. To compare the structures of metabolic networks in organisms, we adopted the exponential graph kernel, which is a kernel-based approach with a labeled graph that includes a label matrix and an adjacency matrix. To construct the phylogenetic trees, we used an unweighted pair-group method with arithmetic mean, i.e., a hierarchical clustering algorithm. We applied the kernel-based network profiling method in a comparative analysis of nine carbohydrate metabolic networks from 81 biological species encompassing Archaea, Eukaryota, and Eubacteria. The resulting phylogenetic hierarchies generally support the tripartite scheme of three domains rather than the two domains of prokaryotes and eukaryotes. By combining the kernel machines with metabolic information, the method infers the context of biosphere development that covers physiological events required for adaptation by genetic reconstruction. 
    The results show that one may obtain a global view of the tree of life by comparing the metabolic pathway structures using meta-level information rather than sequence information. This method may yield further information about biological evolution, such as the history of horizontal transfer of each gene, by studying the detailed structure of the phylogenetic tree constructed by the kernel-based method.
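
    The tree-building step named in this abstract, the unweighted pair-group method with arithmetic mean (UPGMA), can be sketched as follows. The sketch assumes the kernel similarities have already been converted to a symmetric distance matrix (the abstract does not say how); the function name `upgma` and the toy matrix are hypothetical.

```python
def upgma(names, matrix):
    """Build a nested-tuple tree from a symmetric distance matrix with UPGMA."""
    # Active clusters: id -> (subtree, number of tips).
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    n = len(names)
    dist = {(i, j): matrix[i][j] for i in range(n) for j in range(i + 1, n)}
    next_id = n
    while len(clusters) > 1:
        # Pick the closest pair of active clusters (keys always have i < j).
        i, j = min(dist, key=dist.get)
        (tree_i, size_i), (tree_j, size_j) = clusters.pop(i), clusters.pop(j)
        # Distance from the merged cluster to each remaining cluster is the
        # size-weighted mean of the two old distances.
        for k in clusters:
            d_ik = dist.pop((min(i, k), max(i, k)))
            d_jk = dist.pop((min(j, k), max(j, k)))
            dist[(k, next_id)] = (size_i * d_ik + size_j * d_jk) / (size_i + size_j)
        del dist[(i, j)]
        clusters[next_id] = ((tree_i, tree_j), size_i + size_j)
        next_id += 1
    return next(iter(clusters.values()))[0]


# Toy distances between four hypothetical organisms.
names = ["A", "B", "C", "D"]
matrix = [
    [0, 2, 6, 10],
    [2, 0, 6, 10],
    [6, 6, 0, 10],
    [10, 10, 10, 0],
]
tree = upgma(names, matrix)
print(tree)  # ('D', ('C', ('A', 'B')))
```

    A and B join first because they are closest; C joins that pair next, and D attaches last.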

  7. Phylogenetic diversity of macromycetes and woody plants along an elevational gradient in Eastern Mexico

    Treesearch

    Marko Gomez-Hernandez; Guadalupe Williams-Linera; D. Jean Lodge; Roger Guevara; Eduardo Ruiz-Sanchez; Etelvina Gandara

    2016-01-01

    Phylogenetic information provides insight into the ecological and evolutionary processes that organize species assemblages. We compared patterns of phylogenetic diversity among macromycete and woody plant communities along a steep elevational gradient in eastern Mexico to better understand the evolutionary processes that structure their communities. Macrofungi and...

  8. Cophenetic correlation analysis as a strategy to select phylogenetically informative proteins: an example from the fungal kingdom

    PubMed Central

    Kuramae, Eiko E; Robert, Vincent; Echavarri-Erasun, Carlos; Boekhout, Teun

    2007-01-01

    Background The construction of robust and well resolved phylogenetic trees is important for our understanding of many, if not all, biological processes, including speciation and origin of higher taxa, genome evolution, metabolic diversification, multicellularity, origin of life styles, pathogenicity and so on. Many older phylogenies were not well supported due to insufficient phylogenetic signal present in the single or few genes used in phylogenetic reconstructions. Importantly, single gene phylogenies were not always found to be congruent. The phylogenetic signal may, therefore, be increased by enlarging the number of genes included in phylogenetic studies. Unfortunately, concatenation of many genes does not take into consideration the evolutionary history of each individual gene. Here, we describe an approach to select informative phylogenetic proteins to be used in the Tree of Life (TOL) and barcoding projects by comparing the cophenetic correlation coefficients (CCC) among individual protein distance matrices, using the fungi as an example. The method demonstrated that the quality and number of concatenated proteins are important for a reliable estimation of TOL. Approximately 40–45 concatenated proteins seem needed to resolve fungal TOL. Results In total 4852 orthologous proteins (KOGs) were assigned among 33 fungal genomes from the Asco- and Basidiomycota and 70 of these represented single copy proteins. The individual protein distance matrices based on 531 concatenated proteins that have been used for phylogeny reconstruction before [14] were compared one with another in order to select those with the highest CCC, which then was used as a reference. This reference distance matrix was compared with those of the 70 single copy proteins selected and their CCC values were calculated. Sixty four KOGs showed a CCC above 0.50 and these were further considered for their phylogenetic potential.
    Proteins belonging to the cellular processes and signaling KOG category seem more informative than those belonging to the other three categories: information storage and processing; metabolism; and the poorly characterized category. After concatenation of 40 proteins the topology of the phylogenetic tree remained stable, but after concatenation of 60 or more proteins the bootstrap support values of some branches decreased, most likely due to the inclusion of proteins with lower CCC values. The selection of protein sequences to be used in various TOL projects remains a critical and important process. The method described in this paper will contribute to a more objective selection of phylogenetically informative protein sequences. Conclusion This study provides candidate protein sequences to be considered as phylogenetic markers in different branches of fungal TOL. The selection procedure described here will be useful to select informative protein sequences to resolve branches of TOL that contain few or no species with completely sequenced genomes. The robust phylogenetic trees resulting from this method may contribute to our understanding of organismal diversification processes. The method proposed can be extended easily to other branches of TOL. PMID:17688684
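
    The comparison of a reference distance matrix with per-protein distance matrices can be illustrated with a small sketch. It assumes, as a simplification, that the correlation is a Pearson correlation over the off-diagonal entries of the two matrices; the function names and the toy matrices are hypothetical, and only the 0.50 cutoff comes from the abstract.

```python
import math


def upper_triangle(m):
    """Off-diagonal entries above the diagonal, row by row."""
    n = len(m)
    return [m[i][j] for i in range(n) for j in range(i + 1, n)]


def matrix_correlation(m1, m2):
    """Pearson correlation between the pairwise distances of two matrices."""
    x, y = upper_triangle(m1), upper_triangle(m2)
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# Toy reference matrix and two hypothetical single-protein matrices.
reference = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
candidates = {
    "protein_1": [[0, 2, 4], [2, 0, 6], [4, 6, 0]],  # same pattern, scaled
    "protein_2": [[0, 3, 2], [3, 0, 1], [2, 1, 0]],  # reversed pattern
}
scores = {name: matrix_correlation(reference, m) for name, m in candidates.items()}
# Keep proteins whose correlation with the reference exceeds 0.50.
selected = [name for name, r in scores.items() if r > 0.50]
print(scores, selected)
```

    Here `protein_1` is retained and `protein_2` is rejected, mirroring the filtering step in which only proteins with a CCC above 0.50 were considered further.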

  9. One tree to link them all: a phylogenetic dataset for the European tetrapoda.

    PubMed

    Roquet, Cristina; Lavergne, Sébastien; Thuiller, Wilfried

    2014-08-08

    With the ever-increasing availability of phylogenetically informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies, which are necessary for comprehensive large-scale eco-phylogenetic analyses, are still lacking for many large groups and regions. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of a supermatrix, applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available in the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data were available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty.

  10. Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree

    NASA Astrophysics Data System (ADS)

    Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.

    2017-02-01

    Representation is an important tool for communicating scientific concepts. Biologists produce phylogenetic representations to express their understanding of evolutionary relationships. A phylogenetic tree is a visual representation that depicts a hypothesis about evolutionary relationships and is widely used in the biological sciences. The use of phylogenetic trees is growing in many disciplines of biology. Consequently, learning about phylogenetic trees has become an important part of biological education and an interesting area for biology education research. However, research has shown that many students struggle to interpret the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The study used a descriptive method, with data collected through questionnaires, interviews, multiple-choice and open-ended questions, reflective journals and observations. The findings showed that students experienced difficulties, especially in constructing a phylogenetic tree. The students’ responses indicated that the main difficulty in constructing a phylogenetic tree was placing taxa in the tree based on the data provided, so that the constructed tree did not describe the actual evolutionary relationships (incorrect relatedness). Students also had difficulty determining sister groups and synapomorphic and autapomorphic characters from the data provided (a character table), and comparing phylogenetic trees. According to the students, building a phylogenetic tree is more difficult than reading one. The findings of this study provide information to help undergraduate instructors and students overcome learning difficulties in reading and constructing phylogenetic trees.

  11. The phylogenetic utility of acetyltransferase (ARD1) and glutaminyl tRNA synthetase (QtRNA) for reconstructing Cenozoic relationships as exemplified by the large Australian cicada Pauropsalta generic complex.

    PubMed

    Owen, Christopher L; Marshall, David C; Hill, Kathy B R; Simon, Chris

    2015-02-01

    The Pauropsalta generic complex is a large group of cicadas (72 described spp.; >82 undescribed spp.) endemic to Australia. No previous molecular work on deep level relationships within this complex has been conducted, but a recent morphological revision and phylogenetic analysis proposed relationships among the 11 genera. We present here the first comprehensive molecular phylogeny of the complex using five loci (1 mtDNA, 4 nDNA), two of which are from nuclear genes new to cicada systematics. We compare the molecular phylogeny to the morphological phylogeny. We evaluate the phylogenetic informativeness of the new loci relative to traditional cicada systematics loci to generate a baseline of performance and behavior to aid in gene choice decisions in future systematic and phylogenomic studies. Our maximum likelihood and Bayesian inference phylogenies strongly support the monophyly of most of the newly described genera; however, relationships among genera differ from the morphological phylogeny. A comparison of phylogenetic informativeness among all loci revealed that COI 3rd positions dominate the informativeness profiles relative to all other loci but exhibit some among-taxon nucleotide bias. After removing COI 3rd positions, COI 1st positions dominate near the terminals, while the period intron has the most phylogenetic informativeness near the root. Among the nuclear loci, ARD1 and QtRNA have lower phylogenetic informativeness than period intron and elongation factor 1 alpha intron, but the informativeness increases as you move from the tips to the root. The increase in phylogenetic informativeness deeper in the tree suggests these loci may be useful for resolving older relationships. Copyright © 2015. Published by Elsevier Inc.

  12. Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: many species, little divergence

    PubMed Central

    Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W.; Samuel, Rosabelle

    2016-01-01

    Background and Aims Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Methods Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. Key Results The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. Conclusions In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. 
    Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with plastid DNA did not help to resolve the phylogenetic tree because the total number of variable sites was much lower than in the entire plastid genome. The geographical clustering of the individuals against a background of overall low sequence divergence could indicate transfer of plastid genomes due to hybridization and introgression following secondary contact. PMID:27098088

  13. The Development of Three Long Universal Nuclear Protein-Coding Locus Markers and Their Application to Osteichthyan Phylogenetics with Nested PCR

    PubMed Central

    Zhang, Peng

    2012-01-01

    Background Universal nuclear protein-coding locus (NPCL) markers that are applicable across diverse taxa and show good phylogenetic discrimination have broad applications in molecular phylogenetic studies. For example, RAG1, a representative NPCL marker, has been successfully used to make phylogenetic inferences within all major osteichthyan groups. However, such markers with broad working range and high phylogenetic performance are still scarce. It is necessary to develop more universal NPCL markers comparable to RAG1 for osteichthyan phylogenetics. Methodology/Principal Findings We developed three long universal NPCL markers (>1.6 kb each) based on single-copy nuclear genes (KIAA1239, SACS and TTN) that possess large exons and exhibit the appropriate evolutionary rates. We then compared their phylogenetic utilities with that of the reference marker RAG1 in 47 jawed vertebrate species. In comparison with RAG1, each of the three long universal markers yielded similar topologies and branch supports, all in congruence with the currently accepted osteichthyan phylogeny. To compare their phylogenetic performance visually, we also estimated the phylogenetic informativeness (PI) profile for each of the four long universal NPCL markers. The PI curves indicated that SACS performed best over the whole timescale, while RAG1, KIAA1239 and TTN exhibited similar phylogenetic performances. In addition, we compared the success of nested PCR and standard PCR when amplifying NPCL marker fragments. The amplification success rate and efficiency of the nested PCR were overwhelmingly higher than those of standard PCR. Conclusions/Significance Our work clearly demonstrates the superiority of nested PCR over the conventional PCR in phylogenetic studies and develops three long universal NPCL markers (KIAA1239, SACS and TTN) with the nested PCR strategy. 
    The three markers exhibit high phylogenetic utilities in osteichthyan phylogenetics and can be widely used as pilot genes for phylogenetic questions of osteichthyans at different taxonomic levels. PMID:22720083

  14. Mitogenome Phylogenetics: The Impact of Using Single Regions and Partitioning Schemes on Topology, Substitution Rate and Divergence Time Estimation

    PubMed Central

    Duchêne, Sebastián; Archer, Frederick I.; Vilstrup, Julia; Caballero, Susana; Morin, Phillip A.

    2011-01-01

    The availability of mitochondrial genome sequences is growing as a result of recent technological advances in molecular biology. In phylogenetic analyses, the complete mitogenome is increasingly becoming the marker of choice, usually providing better phylogenetic resolution and precision relative to traditional markers such as cytochrome b (CYTB) and the control region (CR). In some cases, the differences in phylogenetic estimates between mitogenomic and single-gene markers have yielded incongruent conclusions. By comparing phylogenetic estimates made from different genes, we identified the most informative mitochondrial regions and evaluated the minimum amount of data necessary to reproduce the same results as the mitogenome. We compared results among individual genes and the mitogenome for recently published complete mitogenome datasets of selected delphinids (Delphinidae) and killer whales (genus Orcinus). Using Bayesian phylogenetic methods, we investigated differences in estimation of topologies, divergence dates, and clock-like behavior among genes for both datasets. Although the most informative regions were not the same for each taxonomic group (COX1, CYTB, ND3 and ATP6 for Orcinus, and ND1, COX1 and ND4 for Delphinidae), in both cases they were equivalent to less than a quarter of the complete mitogenome. This suggests that gene information content can vary among groups, but can be adequately represented by a portion of the complete sequence. Although our results indicate that complete mitogenomes provide the highest phylogenetic resolution and most precise date estimates, a minimum amount of data can be selected using our approach when the complete sequence is unavailable. Studies based on single genes can benefit from the addition of a few more mitochondrial markers, producing topologies and date estimates similar to those obtained using the entire mitogenome. PMID:22073275

  15. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  16. Phylogenetic overdispersion of plant species in southern Brazilian savannas.

    PubMed

    Silva, I A; Batalha, M A

    2009-08-01

    Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutionary processes. Based on these assumptions, we may infer mechanisms of contemporary coexistence from the phylogenetic relationships of the species in a community. We studied the phylogenetic structure of plant communities in four cerrado sites, in southeastern Brazil. We calculated two raw phylogenetic distances among the species sampled. We estimated the phylogenetic structure by comparing the observed phylogenetic distances to the distribution of phylogenetic distances in null communities. We obtained null communities by randomizing the phylogenetic relationships of the regional pool of species. We found a phylogenetic overdispersion of the cerrado species. Phylogenetic overdispersion has several explanations, depending on the phylogenetic history of traits and contemporary ecological interactions. However, based on coexistence models between grasses and trees, density-dependent ecological forces, and the evolutionary history of the cerrado flora, we argue that the phylogenetic overdispersion of cerrado species is predominantly due to competitive interactions, herbivores and pathogen attacks, and ecological speciation. Future studies will need to include information on the phylogenetic history of plant traits.
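
    The comparison of observed phylogenetic distances with those of null communities can be sketched as a standardized effect size. This is a minimal sketch under stated assumptions: it uses mean pairwise distance as the raw measure (the abstract does not name the two distances used), builds null communities by shuffling species labels across the regional pool, and the names `mpd` and `ses_mpd` and the toy distances are hypothetical.

```python
import random
import statistics
from itertools import combinations


def mpd(dist, community):
    """Mean pairwise phylogenetic distance among the species of a community."""
    pairs = list(combinations(sorted(community), 2))
    return sum(dist[frozenset(p)] for p in pairs) / len(pairs)


def ses_mpd(dist, pool, community, n_null, rng):
    """(observed - mean of null) / sd of null; positive values suggest
    overdispersion, negative values suggest clustering."""
    observed = mpd(dist, community)
    null = []
    for _ in range(n_null):
        # Shuffle which species sits at which tip of the regional phylogeny.
        shuffled = pool[:]
        rng.shuffle(shuffled)
        relabel = dict(zip(pool, shuffled))
        null.append(mpd(dist, [relabel[s] for s in community]))
    return (observed - statistics.mean(null)) / statistics.stdev(null)


# Toy regional pool: two clades of three species each.
clade = {"A": 1, "B": 1, "C": 1, "D": 2, "E": 2, "F": 2}
pool = sorted(clade)
dist = {
    frozenset((a, b)): (2.0 if clade[a] == clade[b] else 10.0)
    for a, b in combinations(pool, 2)
}
rng = random.Random(42)
overdispersed = ses_mpd(dist, pool, ["A", "B", "D", "E"], 999, rng)
clustered = ses_mpd(dist, pool, ["A", "B", "C", "D"], 999, rng)
print(overdispersed, clustered)
```

    A community drawn evenly from both clades scores above zero, while one dominated by a single clade scores below zero.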

  17. Estimating Bayesian Phylogenetic Information Content

    PubMed Central

    Lewis, Paul O.; Chen, Ming-Hui; Kuo, Lynn; Lewis, Louise A.; Fučíková, Karolina; Neupane, Suman; Wang, Yu-Bo; Shi, Daoyuan

    2016-01-01

    Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.] PMID:27155008
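
    The comparison of prior and posterior entropy can be made concrete with a short sketch. It assumes a uniform prior over unrooted binary topologies and estimates the posterior from raw topology frequencies in a sample, which is the simpler estimator rather than the conditional clade distribution discussed in the abstract; the function names and topology labels are hypothetical.

```python
import math
from collections import Counter


def n_unrooted_topologies(n_taxa):
    """Number of unrooted binary topologies, (2n - 5)!!, for n >= 3 taxa."""
    count = 1
    for k in range(3, 2 * n_taxa - 4, 2):
        count *= k
    return count


def entropy(probs):
    """Shannon entropy in nats."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def information_content(sampled_topologies, n_taxa):
    """Return (information in nats, fraction of the maximum possible).
    Requires n_taxa >= 4 so that the prior entropy is non-zero."""
    counts = Counter(sampled_topologies)
    total = sum(counts.values())
    posterior = [c / total for c in counts.values()]
    prior_entropy = math.log(n_unrooted_topologies(n_taxa))
    info = prior_entropy - entropy(posterior)
    return info, info / prior_entropy


# Four taxa have three possible unrooted topologies, labelled here by their split.
concentrated = ["AB|CD"] * 300
flat = ["AB|CD"] * 100 + ["AC|BD"] * 100 + ["AD|BC"] * 100
print(information_content(concentrated, 4))
print(information_content(flat, 4))
```

    A sample concentrated on one topology yields the maximum information, while a sample spread evenly over all three topologies is indistinguishable from the prior and yields essentially none.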

  18. Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis.

    PubMed

    Gibbs, S; Collard, M; Wood, B

    2002-01-01

    This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence.

  19. Chromosomal evolution in Rodentia

    PubMed Central

    Romanenko, S A; Perelman, P L; Trifonov, V A; Graphodatsky, A S

    2012-01-01

    Rodentia is the most species-rich mammalian order and includes several important laboratory model species. The amount of new information on karyotypic and phylogenetic relations within and among rodent taxa is rapidly increasing, but a synthesis of these data is currently lacking. Here, we have integrated information drawn from conventional banding studies, recent comparative painting investigations and molecular phylogenetic reconstructions of different rodent taxa. This permitted a revision of several ancestral karyotypic reconstructions, and a more accurate depiction of rodent chromosomal evolution. PMID:22086076

  20. Phylogenetic effective sample size.

    PubMed

    Bartoszek, Krzysztof

    2016-10-21

    In this paper I address the question: how large is a phylogenetic sample? I propose a definition of a phylogenetic effective sample size for Brownian motion and Ornstein-Uhlenbeck processes: the regression effective sample size. I discuss how mutual information can be used to define an effective sample size in the non-normal process case and compare these two definitions to an already present concept of effective sample size (the mean effective sample size). Through a simulation study I find that the AICc is robust if one corrects for the number of species or effective number of species. Lastly I discuss how the concept of the phylogenetic effective sample size can be useful for biodiversity quantification, identification of interesting clades and deciding on the importance of phylogenetic correlations. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach

    PubMed Central

    Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John

    2014-01-01

    Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. 
Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723
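
    As a rough illustration of two of the metrics named above, the sketch below computes MPD and MNTD for a single plot from a table of pairwise phylogenetic distances. The three-species distance table is hypothetical, and the functions are a plain restatement of the standard definitions rather than the authors' pipeline.

```python
from itertools import combinations
from statistics import mean


def mpd(dist, community):
    """Mean Phylogenetic Distance: average distance over all species pairs."""
    return mean(dist[a][b] for a, b in combinations(community, 2))


def mntd(dist, community):
    """Mean Nearest Taxon Distance: average distance from each species
    to its closest relative within the same community."""
    return mean(
        min(dist[a][b] for b in community if b != a) for a in community
    )


# Hypothetical distances: A and B are close relatives, C is distant from both.
dist = {
    "A": {"A": 0.0, "B": 2.0, "C": 10.0},
    "B": {"A": 2.0, "B": 0.0, "C": 10.0},
    "C": {"A": 10.0, "B": 10.0, "C": 0.0},
}
plot = ["A", "B", "C"]

plot_mpd = mpd(dist, plot)
plot_mntd = mntd(dist, plot)
print(plot_mpd, plot_mntd)
```

    MPD is sensitive to deep splits in the tree, whereas MNTD reflects relatedness near the tips, which is why the two are usually reported together.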

  2. Model selection and model averaging in phylogenetics: advantages of Akaike information criterion and Bayesian approaches over likelihood ratio tests.

    PubMed

    Posada, David; Buckley, Thomas R

    2004-10-01

    Model selection is a topic of special relevance in molecular phylogenetics that affects many, if not all, stages of phylogenetic inference. Here we discuss some fundamental concepts and techniques of model selection in the context of phylogenetics. We start by reviewing different aspects of the selection of substitution models in phylogenetics from a theoretical, philosophical and practical point of view, and summarize this comparison in table format. We argue that the most commonly implemented model selection approach, the hierarchical likelihood ratio test, is not the optimal strategy for model selection in phylogenetics, and that approaches like the Akaike Information Criterion (AIC) and Bayesian methods offer important advantages. In particular, the latter two methods are able to simultaneously compare multiple nested or nonnested models, assess model selection uncertainty, and allow for the estimation of phylogenies and model parameters using all available models (model-averaged inference or multimodel inference). We also describe how the relative importance of the different parameters included in substitution models can be depicted. To illustrate some of these points, we have applied AIC-based model averaging to 37 mitochondrial DNA sequences from the subgenus Ohomopterus (genus Carabus) ground beetles described by Sota and Vogler (2001).
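
    AIC-based comparison of several candidate models at once can be sketched with Akaike weights. The log-likelihoods and parameter counts below are invented for illustration and do not come from the Ohomopterus data; the formulas used are the standard ones, AIC = 2k - 2 ln L and w_i proportional to exp(-delta_i / 2).

```python
import math


def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 ln L."""
    return 2 * n_params - 2 * log_likelihood


def akaike_weights(aic_by_model):
    """Relative support for each model, normalized to sum to 1."""
    best = min(aic_by_model.values())
    raw = {m: math.exp(-(a - best) / 2) for m, a in aic_by_model.items()}
    total = sum(raw.values())
    return {m: r / total for m, r in raw.items()}


# Hypothetical fits: (maximized log-likelihood, number of free parameters).
fits = {
    "JC69": (-1050.0, 10),
    "HKY85": (-1030.0, 14),
    "GTR": (-1028.0, 18),
}

aic_scores = {model: aic(lnl, k) for model, (lnl, k) in fits.items()}
weights = akaike_weights(aic_scores)
best_model = max(weights, key=weights.get)
print(best_model, weights)
```

    The weights quantify model selection uncertainty and can serve as the coefficients in a model-averaged estimate: multiply each model's estimate of a shared parameter by its weight and sum.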

  3. Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis

    PubMed Central

    Gibbs, S; Collard, M; Wood, B

    2002-01-01

    This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence. PMID:11833653

  4. Symbiosis between hydra and chlorella: molecular phylogenetic analysis and experimental study provide insight into its origin and evolution.

    PubMed

    Kawaida, Hitomi; Ohba, Kohki; Koutake, Yuhki; Shimizu, Hiroshi; Tachida, Hidenori; Kobayakawa, Yoshitaka

    2013-03-01

    Although many physiological studies have been reported on the symbiosis between hydra and green algae, very little information from a molecular phylogenetic aspect of symbiosis is available. In order to understand the origin and evolution of symbiosis between the two organisms, we compared the phylogenetic relationships among symbiotic green algae with the phylogenetic relationships among host hydra strains. To do so, we reconstructed molecular phylogenetic trees of several strains of symbiotic chlorella harbored in the endodermal epithelial cells of viridissima group hydra strains and investigated their congruence with the molecular phylogenetic trees of the host hydra strains. To examine the species specificity between the host and the symbiont with respect to the genetic distance, we also tried to introduce chlorella strains into two aposymbiotic strains of viridissima group hydra in which symbiotic chlorella had been eliminated in advance. We discussed the origin and history of symbiosis between hydra and green algae based on the analysis. Copyright © 2012 Elsevier Inc. All rights reserved.

  5. Including Fossils in Phylogenetic Climate Reconstructions: A Deep Time Perspective on the Climatic Niche Evolution and Diversification of Spiny Lizards (Sceloporus).

    PubMed

    Lawing, A Michelle; Polly, P David; Hews, Diana K; Martins, Emília P

    2016-08-01

    Fossils and other paleontological information can improve phylogenetic comparative method estimates of phenotypic evolution and generate hypotheses related to species diversification. Here, we use fossil information to calibrate ancestral reconstructions of suitable climate for Sceloporus lizards in North America. Integrating data from the fossil record, general circulation models of paleoclimate during the Miocene, climate envelope modeling, and phylogenetic comparative methods provides a geographically and temporally explicit species distribution model of Sceloporus-suitable habitat through time. We provide evidence to support the historic biogeographic hypothesis of Sceloporus diversification in warm North American deserts and suggest a relatively recent Sceloporus invasion into Mexico around 6 Ma. We use a physiological model to map extinction risk. We suggest that the number of hours of restriction to a thermal refuge limited Sceloporus from inhabiting Mexico until the climate cooled enough to provide suitable habitat at approximately 6 Ma. If the future climate returns to the hotter climates of the past, Mexico, the place of highest modern Sceloporus richness, will no longer provide suitable habitats for Sceloporus to survive and reproduce.

  6. Kakusan4 and Aminosan: two programs for comparing nonpartitioned, proportional and separate models for combined molecular phylogenetic analyses of multilocus sequence data.

    PubMed

    Tanabe, Akifumi S

    2011-09-01

    Proportional and separate models, which can apply a different combination of substitution rate matrix (SRM) and among-site rate variation model (ASRVM) to each locus, are frequently used in phylogenetic studies of multilocus data. A proportional model assumes that branch lengths are proportional among partitions and a separate model assumes that each partition has an independent set of branch lengths. However, the selection from among nonpartitioned (i.e., a common combination of models is applied to all-loci concatenated sequences), proportional and separate models is usually based on the researcher's preference rather than on any information criteria. This study describes two programs, 'Kakusan4' (for DNA sequences) and 'Aminosan' (for amino-acid sequences), which allow the selection of evolutionary models based on several types of information criteria. The programs can handle both multilocus and single-locus data, in addition to providing an easy-to-use wizard interface and a noninteractive command line interface. In the case of multilocus data, SRMs and ASRVMs are compared at each locus and at all-loci concatenated sequences, after which nonpartitioned, proportional and separate models are compared based on information criteria. The programs also provide model configuration files for MrBayes, PAUP*, PhyML, RAxML and Treefinder to support further phylogenetic analysis using a selected model. When likelihoods were optimized by Treefinder, the best-fit models were found to differ depending on the data set. Furthermore, differences in the information criteria among nonpartitioned, proportional and separate models were much larger than those among the nonpartitioned models. These findings suggest that selecting from nonpartitioned, proportional and separate models results in a better phylogenetic tree. Kakusan4 and Aminosan are available at http://www.fifthdimension.jp/. They are licensed under GNU GPL Ver. 2 and run on Windows, MacOS X and Linux. © 2011 Blackwell Publishing Ltd.

  7. Multivariate Phylogenetic Comparative Methods: Evaluations, Comparisons, and Recommendations.

    PubMed

    Adams, Dean C; Collyer, Michael L

    2018-01-01

    Recent years have seen increased interest in phylogenetic comparative analyses of multivariate data sets, but to date the varied proposed approaches have not been extensively examined. Here we review the mathematical properties required of any multivariate method, and specifically evaluate existing multivariate phylogenetic comparative methods in this context. Phylogenetic comparative methods based on the full multivariate likelihood are robust to levels of covariation among trait dimensions and are insensitive to the orientation of the data set, but display increasing model misspecification as the number of trait dimensions increases. This is because the expected evolutionary covariance matrix (V) used in the likelihood calculations becomes more ill-conditioned as trait dimensionality increases, and as evolutionary models become more complex. Thus, these approaches are only appropriate for data sets with few traits and many species. Methods that summarize patterns across trait dimensions treated separately (e.g., SURFACE) incorrectly assume independence among trait dimensions, resulting in nearly a 100% model misspecification rate. Methods using pairwise composite likelihood are highly sensitive to levels of trait covariation, the orientation of the data set, and the number of trait dimensions. The consequences of these debilitating deficiencies are that a user can arrive at differing statistical conclusions, and therefore biological inferences, simply from a dataspace rotation, like principal component analysis. By contrast, algebraic generalizations of the standard phylogenetic comparative toolkit that use the trace of covariance matrices are insensitive to levels of trait covariation, the number of trait dimensions, and the orientation of the data set. Further, when appropriate permutation tests are used, these approaches display acceptable Type I error and statistical power. 
We conclude that methods summarizing information across trait dimensions, as well as pairwise composite likelihood methods should be avoided, whereas algebraic generalizations of the phylogenetic comparative toolkit provide a useful means of assessing macroevolutionary patterns in multivariate data. Finally, we discuss areas in which multivariate phylogenetic comparative methods are still in need of future development; namely highly multivariate Ornstein-Uhlenbeck models and approaches for multivariate evolutionary model comparisons. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  8. Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility.

    PubMed

    Buchwalter, David B; Cain, Daniel J; Martin, Caitrin A; Xie, Lingtian; Luoma, Samuel N; Garland, Theodore

    2008-06-17

    We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature.

  9. Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility

    PubMed Central

    Buchwalter, David B.; Cain, Daniel J.; Martin, Caitrin A.; Xie, Lingtian; Luoma, Samuel N.; Garland, Theodore

    2008-01-01

    We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. PMID:18559853

  10. Treelink: data integration, clustering and visualization of phylogenetic trees.

    PubMed

    Allende, Christian; Sohn, Erik; Little, Cedric

    2015-12-29

    Phylogenetic trees are central to a wide range of biological studies. In many of these studies, tree nodes need to be associated with a variety of attributes. For example, in studies concerned with viral relationships, tree nodes are associated with epidemiological information, such as location, age and subtype. Gene trees used in comparative genomics are usually linked with taxonomic information, such as functional annotations and events. A wide variety of tree visualization and annotation tools have been developed in the past; however, none of them is intended for integrative and comparative analysis. Treelink is platform-independent software for linking datasets and sequence files to phylogenetic trees. The application allows automated integration of datasets with trees for operations such as classifying a tree based on a field or showing the distribution of selected data attributes in branches and leaves. Genomic and proteomic sequences can also be linked to the tree and extracted from internal and external nodes. A novel clustering algorithm to simplify trees and display the most divergent clades was also developed, where validation can be achieved using the data integration and classification function. Integrated geographical information allows ancestral character reconstruction for phylogeographic plotting based on parsimony and likelihood algorithms. Our software can successfully integrate phylogenetic trees with different data sources, and perform operations to differentiate and visualize those differences within a tree. File support includes the most popular formats such as newick and csv. Exporting visualizations as images, cluster outputs and genomic sequences is supported. Treelink is available as a web and desktop application at http://www.treelinkapp.com.

  11. Phylogeny and species traits predict bird detectability

    USGS Publications Warehouse

    Solymos, Peter; Matsuoka, Steven M.; Stralberg, Diana; Barker, Nicole K. S.; Bayne, Erin M.

    2018-01-01

    Avian acoustic communication has resulted from evolutionary pressures and ecological constraints. We therefore expect that auditory detectability in birds might be predictable by species traits and phylogenetic relatedness. We evaluated the relationship between phylogeny, species traits, and field‐based estimates of the two processes that determine species detectability (singing rate and detection distance) for 141 bird species breeding in boreal North America. We used phylogenetic mixed models and cross‐validation to compare the relative merits of using trait data only, phylogeny only, or the combination of both to predict detectability. We found a strong phylogenetic signal in both singing rates and detection distances; however the strength of phylogenetic effects was less than expected under Brownian motion evolution. The evolution of behavioural traits that determine singing rates was found to be more labile, leaving more room for species to evolve independently, whereas detection distance was mostly determined by anatomy (i.e. body size) and thus the laws of physics. Our findings can help in disentangling how complex ecological and evolutionary mechanisms have shaped different aspects of detectability in boreal birds. Such information can greatly inform single‐ and multi‐species models but more work is required to better understand how to best correct possible biases in phylogenetic diversity and other community metrics.

  12. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    PubMed

    Kelly, Steven; Maini, Philip K

    2013-01-01

    The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  13. Undergraduate Students’ Initial Ability in Understanding Phylogenetic Tree

    NASA Astrophysics Data System (ADS)

    Sa'adah, S.; Hidayat, T.; Sudargo, Fransisca

    2017-04-01

    A phylogenetic tree is a visual representation that depicts a hypothesis about the evolutionary relationships among taxa. Evolutionary experts use this representation to evaluate the evidence for evolution. Use of the phylogenetic tree is currently growing in many disciplines of biology. Consequently, learning about phylogenetic trees has become an important part of biological education and an interesting area of biology education research. The skill of understanding and reasoning with phylogenetic trees (called tree thinking) is important for biology students. However, research has shown that many students have difficulty interpreting, constructing, and comparing phylogenetic trees, and hold misconceptions about them. Students are often not taught how to reason about the evolutionary relationships depicted in the diagram, nor are they provided with information about the underlying theory and process of phylogenetics. This study aims to investigate the initial ability of undergraduate students to understand and reason with phylogenetic trees. The research method is descriptive. Students were given multiple-choice questions and an essay representing the elements of tree thinking, and correct answers were expressed as percentages. Each student was also given a questionnaire. The results showed that the undergraduate students’ initial ability to understand and reason with phylogenetic trees is low. Many students were unable to answer questions about the phylogenetic tree. Only 19% of undergraduate students answered correctly on the indicator of evaluating the evolutionary relationships among taxa, 25% on the indicator of applying the concept of the clade, and 17% on the indicator of determining character evolution; only a few could construct a phylogenetic tree.

  14. The relationship between energy expenditure and speed during pedestrian locomotion in birds: a morphological basis for the elevated y-intercept?

    PubMed

    Halsey, Lewis G

    2013-06-01

    The slope of the typically linear relationship between metabolic rate and walking speed represents the net cost of transport (NCOT). The extrapolated y-intercept is often greater than resting metabolic rate, thus representing a fixed cost associated with pedestrian transport including body maintenance costs. The full cause of the elevated y-intercept remains elusive and it could simply represent experimental stresses. The present literature-based study compares the mass-independent energetic cost of pedestrian locomotion in birds (excluding those with an upright posture, i.e. penguins), represented by the y-intercept, to a known predictor of cost of transport, hip height. Both phylogenetically informed and non-phylogenetically informed analyses were undertaken to determine if patterns of association between hip height, body mass, and the y-intercept are robust with respect to the method of analysis. Body mass and hip height were significant predictors of the y-intercept in the best phylogenetically-informed and non-phylogenetically informed models. Thus there is evidence that, in birds at least, the elevated y-intercept is a legitimate component of locomotion energy expenditure. Hip height is probably a good proxy of effective limb length and thus perhaps birds with greater hip heights have lower y-intercepts because their longer legs more efficiently accommodate body motion and/or because their limbs are more aligned with the ground reaction forces. Copyright © 2013 Elsevier Inc. All rights reserved.
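
    The two quantities discussed above, the slope (NCOT) and the extrapolated y-intercept, come from an ordinary least-squares line through metabolic rate against speed. The sketch below uses invented measurements and an invented resting rate, with units assumed only for illustration; it is not the phylogenetically informed analysis used in the study.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical treadmill data for one species.
speed = [0.5, 1.0, 1.5, 2.0]            # walking speed (m/s, assumed unit)
metabolic_rate = [12.0, 14.0, 16.0, 18.0]  # metabolic rate (W/kg, assumed unit)
resting_rate = 8.0                       # hypothetical resting metabolic rate

ncot, y_intercept = fit_line(speed, metabolic_rate)
elevation = y_intercept - resting_rate   # the "elevated y-intercept"
print(ncot, y_intercept, elevation)
```

    Here the slope is the net cost of transport, and the gap between the extrapolated intercept and the resting rate is the fixed cost whose cause the study examines.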

  15. Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility

    USGS Publications Warehouse

    Buchwalter, D.B.; Cain, D.J.; Martin, C.A.; Xie, Lingtian; Luoma, S.N.; Garland, T.

    2008-01-01

    We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. © 2008 by The National Academy of Sciences of the USA.

  16. The impact of phenotypic and molecular data on the inference of Colletotrichum diversity associated with Musa.

    PubMed

    Vieira, Willie A S; Lima, Waléria G; Nascimento, Eduardo S; Michereff, Sami J; Câmara, Marcos P S; Doyle, Vinson P

    2017-01-01

    Developing a comprehensive and reliable taxonomy for the Colletotrichum gloeosporioides species complex will require adopting data standards on the basis of an understanding of how methodological choices impact morphological evaluations and phylogenetic inference. We explored the impact of methodological choices in a morphological and molecular evaluation of Colletotrichum species associated with banana in Brazil. The choice of alignment filtering algorithm has a significant impact on topological inference and the retention of phylogenetically informative sites. Similarly, the choice of phylogenetic marker affects the delimitation of species boundaries, particularly if low phylogenetic signal is confounded with strong discordance, and inference of the species tree from multiple-gene trees. According to both phylogenetic informativeness profiling and Bayesian concordance analyses, the most informative loci are DNA lyase (APN2), intergenic spacer (IGS) between DNA lyase and the mating-type locus MAT1-2-1 (APN2/MAT-IGS), calmodulin (CAL), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), glutamine synthetase (GS), β-tubulin (TUB2), and a new marker, the intergenic spacer between GAPDH and an hypothetical protein (GAP2-IGS). Cornmeal agar minimizes the variance in conidial dimensions compared with potato dextrose agar and synthetic nutrient-poor agar, such that species are more readily distinguishable based on phenotypic differences. We apply these insights to investigate the diversity of Colletotrichum species associated with banana anthracnose in Brazil and report C. musae, C. tropicale, C. theobromicola, and C. siamense in association with banana anthracnose. One lineage did not cluster with any previously described species and is described here as C. chrysophilum.

  17. On the information content of discrete phylogenetic characters.

    PubMed

    Bordewich, Magnus; Deutschmann, Ina Maria; Fischer, Mareike; Kasbohm, Elisa; Semple, Charles; Steel, Mike

    2017-12-16

    Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to each species groups all of them together and so provides no insight into the relationships of the species considered. At the other extreme, a character that assigns a different state to each species also conveys no phylogenetic signal. In this manuscript, we study a natural combinatorial measure of the information content of an individual character and analyse properties of characters that provide the maximum phylogenetic information, particularly, the number of states such a character uses and how the different states have to be distributed among the species or taxa of the phylogenetic tree.
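    The two extremes described in this abstract can be made concrete with a small check. The sketch below does not implement the combinatorial measure studied in the paper, which the abstract does not spell out; it substitutes the standard parsimony-informative criterion (at least two states, each shared by at least two taxa) as a simpler stand-in. The function name and the example characters are hypothetical.

```python
from collections import Counter

def is_parsimony_informative(states):
    """Return True if at least two states each occur in at least two taxa.

    `states` holds one character state per taxon, e.g. "AACC" or ["A", "A", "C", "C"].
    This is the usual parsimony-informative test, used here as a stand-in;
    it is not the information measure analysed in the paper.
    """
    counts = Counter(states)
    shared_states = [state for state, count in counts.items() if count >= 2]
    return len(shared_states) >= 2

# Same state for every taxon: groups everything together, so no signal.
print(is_parsimony_informative("AAAA"))
# A different state for every taxon: groups nothing, so no signal either.
print(is_parsimony_informative("ACGT"))
# Two states, each shared by two taxa: supports a split of the taxa.
print(is_parsimony_informative("AACC"))
```

    Under this criterion a character with a single aberrant taxon, such as "AAAC", is also uninformative, because it does not favour any one grouping of the remaining taxa.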

  18. New substitution models for rooting phylogenetic trees.

    PubMed

    Williams, Tom A; Heaps, Sarah E; Cherlin, Svetlana; Nye, Tom M W; Boys, Richard J; Embley, T Martin

    2015-09-26

    The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made. © 2015 The Authors.

  19. Improving phylogenetic analyses by incorporating additional information from genetic sequence databases.

    PubMed

    Liang, Li-Jung; Weiss, Robert E; Redelings, Benjamin; Suchard, Marc A

    2009-10-01

    Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest. We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.

  20. Karyotype Evolution in Birds: From Conventional Staining to Chromosome Painting

    PubMed Central

    Ferguson-Smith, Malcolm A.

    2018-01-01

    In the last few decades, there have been great efforts to reconstruct the phylogeny of Neoaves based mainly on DNA sequencing. Despite the importance of karyotype data in phylogenetic studies, especially with the advent of fluorescence in situ hybridization (FISH) techniques using different types of probes, the use of chromosomal data to clarify phylogenetic proposals is still minimal. Additionally, comparative chromosome painting in birds is restricted to a few orders, while in mammals, for example, virtually all orders have already been analyzed using this method. Most reports are based on comparisons using Gallus gallus probes, and only a small number of species have been analyzed with more informative sets of probes, such as those from Leucopternis albicollis and Gyps fulvus, which show ancestral macrochromosomes rearranged in alternative patterns. Despite this, it is appropriate to review the available cytogenetic information and possible phylogenetic conclusions. In this report, the authors gather both classical and molecular cytogenetic data and describe some interesting and unique characteristics of karyotype evolution in birds. PMID:29584697

  1. Karyotype Evolution in Birds: From Conventional Staining to Chromosome Painting.

    PubMed

    Kretschmer, Rafael; Ferguson-Smith, Malcolm A; de Oliveira, Edivaldo Herculano Correa

    2018-03-27

    In the last few decades, there have been great efforts to reconstruct the phylogeny of Neoaves based mainly on DNA sequencing. Despite the importance of karyotype data in phylogenetic studies, especially with the advent of fluorescence in situ hybridization (FISH) techniques using different types of probes, the use of chromosomal data to clarify phylogenetic proposals is still minimal. Additionally, comparative chromosome painting in birds is restricted to a few orders, while in mammals, for example, virtually all orders have already been analyzed using this method. Most reports are based on comparisons using Gallus gallus probes, and only a small number of species have been analyzed with more informative sets of probes, such as those from Leucopternis albicollis and Gyps fulvus, which show ancestral macrochromosomes rearranged in alternative patterns. Despite this, it is appropriate to review the available cytogenetic information and possible phylogenetic conclusions. In this report, the authors gather both classical and molecular cytogenetic data and describe some interesting and unique characteristics of karyotype evolution in birds.

  2. Use of phylogenetical analysis to predict susceptibility of pathogenic Candida spp. to antifungal drugs.

    PubMed

    Maheux, Andrée F; Sellam, Adnane; Piché, Yves; Boissinot, Maurice; Pelletier, René; Boudreau, Dominique K; Picard, François J; Trépanier, Hélène; Boily, Marie-Josée; Ouellette, Marc; Roy, Paul H; Bergeron, Michel G

    2016-12-01

    Successful treatment of a Candida infection relies on 1) an accurate identification of the pathogenic fungus and 2) on its susceptibility to antifungal drugs. In the present study we investigated the level of correlation between phylogenetical evolution and susceptibility of pathogenic Candida spp. to antifungal drugs. For this, we compared a phylogenetic tree, assembled with the concatenated sequences (2475-bp) of the ATP2, TEF1, and TUF1 genes from 20 representative Candida species, with published minimal inhibitory concentrations (MIC) of the four principal antifungal drug classes commonly used in the treatment of candidiasis: polyenes, triazoles, nucleoside analogues, and echinocandins. The phylogenetic tree revealed three distinct phylogenetic clusters among Candida species. Species within a given phylogenetic cluster have generally similar susceptibility profiles to antifungal drugs and species within Clusters II and III were less sensitive to antifungal drugs than Cluster I species. These results showed that phylogenetical relationship between clusters and susceptibility to several antifungal drugs could be used to guide therapy when only species identification is available prior to information pertaining to its resistance profile. An extended study comprising a large panel of clinical samples should be conducted to confirm the efficiency of this approach in the treatment of candidiasis. Copyright © 2016. Published by Elsevier B.V.

  3. Mitochondrial genomes reveal recombination in the presumed asexual Fusarium oxysporum species complex.

    PubMed

    Brankovics, Balázs; van Dam, Peter; Rep, Martijn; de Hoog, G Sybren; J van der Lee, Theo A; Waalwijk, Cees; van Diepeningen, Anne D

    2017-09-18

    The Fusarium oxysporum species complex (FOSC) contains several phylogenetic lineages. Phylogenetic studies identified two to three major clades within the FOSC. The mitochondrial sequences are highly informative phylogenetic markers, but have been mostly neglected due to technical difficulties. A total of 61 complete mitogenomes of FOSC strains were de novo assembled and annotated. Length variations and intron patterns support the separation of three phylogenetic species. The variable region of the mitogenome that is typical for the genus Fusarium shows two new variants in the FOSC. The variant typical for Fusarium is found in members of all three clades, while variant 2 is found in clades 2 and 3 and variant 3 only in clade 2. The extended set of loci analyzed using a new implementation of the genealogical concordance species recognition method support the identification of three phylogenetic species within the FOSC. Comparative analysis of the mitogenomes in the FOSC revealed ongoing mitochondrial recombination within, but not between phylogenetic species. The recombination indicates the presence of a parasexual cycle in F. oxysporum. The obstacles hindering the usage of the mitogenomes are resolved by using next generation sequencing and selective genome assemblers, such as GRAbB. Complete mitogenome sequences offer a stable basis and reference point for phylogenetic and population genetic studies.

  4. Comparative analysis of the complete mitochondrial genomes of two types of ducks, peking duck (Anas platyrhychos) and muscovy duck (Cairina moschata).

    PubMed

    Tu, Jianfeng; Yang, Ying; Yang, Fuhe; Xing, Xiumei

    2017-03-01

    Peking duck (Anas platyrhynchos) and Muscovy duck (Cairina moschata) are two types of domestic ducks and the most popular meat breeds in the world. In this study, we sequenced and compared the complete mitochondrial genomes of both breeds. To investigate the phylogeny of both breeds within Anseriformes, the concatenated sequences of 12 protein-coding genes were used for phylogenetic analysis. The result was consistent with most previous morphological and molecular studies. Our complete mitochondrial genome sequences of both breeds will provide useful information for phylogenetics and will be available as basic data for breeding and genetics.

  5. Complete chloroplast genome sequence of a major invasive species, crofton weed (Ageratina adenophora).

    PubMed

    Nie, Xiaojun; Lv, Shuzuo; Zhang, Yingxin; Du, Xianghong; Wang, Le; Biradar, Siddanagouda S; Tan, Xiufang; Wan, Fanghao; Weining, Song

    2012-01-01

    Crofton weed (Ageratina adenophora) is one of the most hazardous invasive plant species, which causes serious economic losses and environmental damages worldwide. However, the sequence resource and genome information of A. adenophora are rather limited, making phylogenetic identification and evolutionary studies very difficult. Here, we report the complete sequence of the A. adenophora chloroplast (cp) genome based on Illumina sequencing. The A. adenophora cp genome is 150,689 bp in length including a small single-copy (SSC) region of 18,358 bp and a large single-copy (LSC) region of 84,815 bp separated by a pair of inverted repeats (IRs) of 23,755 bp. The genome contains 130 unique genes and 18 duplicated in the IR regions, with the gene content and organization similar to other Asteraceae cp genomes. Comparative analysis identified five DNA regions (ndhD-ccsA, psbI-trnS, ndhF-ycf1, ndhI-ndhG and atpA-trnR) containing parsimony-informative characters higher than 2%, which may be potential informative markers for barcoding and phylogenetic analysis. Repeat structure, codon usage and contraction of the IR were also investigated to reveal the pattern of evolution. Phylogenetic analysis demonstrated a sister relationship between A. adenophora and Guizotia abyssinica and supported a monophyly of the Asterales. We have assembled and analyzed the chloroplast genome of A. adenophora in this study, which was the first sequenced plastome in the Eupatorieae tribe. The complete chloroplast genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family.

  6. Misconceptions on Missing Data in RAD-seq Phylogenetics with a Deep-scale Example from Flowering Plants.

    PubMed

    Eaton, Deren A R; Spriggs, Elizabeth L; Park, Brian; Donoghue, Michael J

    2017-05-01

    Restriction-site associated DNA (RAD) sequencing and related methods rely on the conservation of enzyme recognition sites to isolate homologous DNA fragments for sequencing, with the consequence that mutations disrupting these sites lead to missing information. There is thus a clear expectation for how missing data should be distributed, with fewer loci recovered between more distantly related samples. This observation has led to a related expectation: that RAD-seq data are insufficiently informative for resolving deeper scale phylogenetic relationships. Here we investigate the relationship between missing information among samples at the tips of a tree and information at edges within it. We re-analyze and review the distribution of missing data across ten RAD-seq data sets and carry out simulations to determine expected patterns of missing information. We also present new empirical results for the angiosperm clade Viburnum (Adoxaceae, with a crown age >50 Ma) for which we examine phylogenetic information at different depths in the tree and with varied sequencing effort. The total number of loci, the proportion that are shared, and phylogenetic informativeness varied dramatically across the examined RAD-seq data sets. Insufficient or uneven sequencing coverage accounted for similar proportions of missing data as dropout from mutation-disruption. Simulations reveal that mutation-disruption, which results in phylogenetically distributed missing data, can be distinguished from the more stochastic patterns of missing data caused by low sequencing coverage. In Viburnum, doubling sequencing coverage nearly doubled the number of parsimony informative sites, and increased by >10X the number of loci with data shared across >40 taxa. Our analysis leads to a set of practical recommendations for maximizing phylogenetic information in RAD-seq studies. 
[hierarchical redundancy; phylogenetic informativeness; quartet informativeness; Restriction-site associated DNA (RAD) sequencing; sequencing coverage; Viburnum.]. © The authors 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved.

  7. Rostral horn evolution among agamid lizards of the genus Ceratophora endemic to Sri Lanka

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schulte II, James A.; Macey, J. Robert; Pethiyagoda, Rohan

    2001-07-10

    The first phylogenetic hypothesis for the Sri Lankan agamid lizard genus Ceratophora is presented based on 1670 aligned base positions (472 parsimony informative) of mitochondrial DNA sequences, representing coding regions for eight tRNAs, ND2, and portions of ND1 and COI. Phylogenetic analysis reveals multiple origins and possibly losses of rostral horns in the evolutionary history of Ceratophora. Our data suggest a middle Miocene origin of Ceratophora with the most recent branching of recognized species occurring at the Pliocene/Pleistocene boundary. Haplotype divergence suggests that an outgroup species, Lyriocephalus scutatus, dates at least to the Pliocene. These phylogenetic results provide a framework for comparative studies of the behavioral ecological importance of horn evolution in this group.

  8. The giant zooxanthellae-bearing ciliate Maristentor dinoferus (Heterotrichea) is closely related to folliculinidae.

    PubMed

    Miao, Wei; Simpson, Alastair G B; Fu, Chengjie; Lobban, Christopher S

    2005-01-01

    The small subunit rDNA sequence of Maristentor dinoferus (Lobban, Schefter, Simpson, Pochon, Pawlowski, and Foissner, 2002) was determined and compared with sequences from other Heterotrichea and Karyorelictea. Maristentor resembles Stentor in basic morphology and had been provisionally assigned to Stentoridae. However, our phylogenetic analyses show that Maristentor is more closely related to Folliculinidae. Our results support the creation of a separate family for Maristentor, Maristentoridae n. fam., and also confirm the phylogenetic grouping of Folliculinidae, Stentoridae, Blepharismidae, and Maristentoridae, which we informally call 'stentorids'. Maristentor, rather than Stentor itself, appears to be most significant in understanding the origins of folliculinids from their aloricate ancestors. Our analyses suggest continued uncertainty in the exact placement of the root of heterotrichs with this phylogenetic marker.

  9. Comparison of natural and nonnative two-species communities of Anolis lizards.

    PubMed

    Poe, Steven

    2014-07-01

    Human-mediated colonizations present an informative model system for understanding assembly of organismal communities. However, it is unclear whether communities including naturalized species are accurate analogs of natural communities or unique combinations not present in nature. I compared morphology and phylogenetic structure of natural and naturalized two-species communities of Anolis lizards. Natural communities are phylogenetically clustered, whereas naturalized communities show no significant phylogenetic structure. This result likely reflects differences in colonization pools for these communities-that is, invasion from anywhere for naturalized communities but from proximal and thus phylogenetically close lineages in natural communities. Both natural and naturalized communities each include pairs of species that are significantly similar to each other in morphology, and both sets of communities are composed of species that possess traits of good colonizers. These similarities suggest that the formation of natural and naturalized communities may be at least partially governed by similar processes. Human-mediated invasions may be credibly viewed as modern incarnations of natural colonizations in this case.

  10. New partial sequences of phosphoenolpyruvate carboxylase as molecular phylogenetic markers.

    PubMed

    Gehrig, H; Heute, V; Kluge, M

    2001-08-01

    To better understand the evolution of the enzyme phosphoenolpyruvate carboxylase (PEPC) and to test its versatility as a molecular character in phylogenetic and taxonomic studies, we have characterized and compared 70 new partial PEPC nucleotide and amino acid sequences (about 1100 bp of the 3' side of the gene) from 50 plant species (24 species of Bryophyta, 1 of Pteridophyta, and 25 of Spermatophyta). Together with previously published data, the new set of sequences allowed us to construct the most complete phylogenetic tree of PEPC to date, in which the PEPC sequences cluster according to both the taxonomic positions of the donor plants and the assumed specific function of the PEPC isoforms. Altogether, the study further strengthens the view that PEPC sequences can provide interesting information for the reconstruction of phylogenetic relations between organisms and metabolic pathways. To avoid confusion in future discussion, we propose a new nomenclature for the denotation of PEPC isoforms. Copyright 2001 Academic Press.

  11. Predicting rates of interspecific interaction from phylogenetic trees.

    PubMed

    Nuismer, Scott L; Harmon, Luke J

    2015-01-01

    Integrating phylogenetic information can potentially improve our ability to explain species' traits, patterns of community assembly, the network structure of communities, and ecosystem function. In this study, we use mathematical models to explore the ecological and evolutionary factors that modulate the explanatory power of phylogenetic information for communities of species that interact within a single trophic level. We find that phylogenetic relationships among species can influence trait evolution and rates of interaction among species, but only under particular models of species interaction. For example, when interactions within communities are mediated by a mechanism of phenotype matching, phylogenetic trees make specific predictions about trait evolution and rates of interaction. In contrast, if interactions within a community depend on a mechanism of phenotype differences, phylogenetic information has little, if any, predictive power for trait evolution and interaction rate. Together, these results make clear and testable predictions for when and how evolutionary history is expected to influence contemporary rates of species interaction. © 2014 John Wiley & Sons Ltd/CNRS.

  12. dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees.

    PubMed

    Wise, Michael J

    2016-01-01

    Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. 
Finally, sequences with high cladistic information produce more consistent trees for the same taxa.

  13. dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees

    PubMed Central

    2016-01-01

    Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. 
Finally, sequences with high cladistic information produce more consistent trees for the same taxa. PMID:27898695

  14. Assigning protein functions by comparative genome analysis protein phylogenetic profiles

    DOEpatents

    Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

    2003-05-13

    A computational method, system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B'. Another method compares the genomic sequences of two or more organisms to create a phylogenetic profile for each protein, indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method, a combination of the above two methods is used to predict functional links.
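    The phylogenetic-profile idea in this record can be sketched in a few lines: each protein gets a presence/absence vector across genomes, and proteins with matching vectors become candidate functional links. The genome contents, protein names, and the Hamming-distance threshold below are hypothetical choices made for illustration; the patent abstract does not specify how profiles are compared, so this is only one plausible reading.

```python
def build_profiles(genomes):
    """Map each protein to a tuple of 0/1 flags, one per genome (sorted by genome name)."""
    genome_names = sorted(genomes)
    proteins = sorted(set().union(*genomes.values()))
    return {
        protein: tuple(int(protein in genomes[name]) for name in genome_names)
        for protein in proteins
    }

def hamming(a, b):
    """Number of genomes in which two profiles disagree."""
    return sum(x != y for x, y in zip(a, b))

def linked_pairs(profiles, max_distance=0):
    """Return protein pairs whose profiles differ in at most `max_distance` genomes."""
    names = sorted(profiles)
    pairs = []
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if hamming(profiles[first], profiles[second]) <= max_distance:
                pairs.append((first, second))
    return pairs

# Hypothetical genomes and the protein families found in each.
genomes = {
    "g1": {"P1", "P2", "P3"},
    "g2": {"P1", "P2"},
    "g3": {"P3"},
    "g4": {"P1", "P2", "P3"},
}

profiles = build_profiles(genomes)
print(profiles)
print(linked_pairs(profiles))
```

    Here P1 and P2 are present and absent in exactly the same genomes, so they are the only pair reported at a threshold of zero mismatches.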

  15. Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.

    PubMed

    Sferra, Gabriella; Fratini, Federica; Ponzi, Marta; Pizzi, Elisabetta

    2017-09-05

    Elaboration of powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks in the post-genomic era. Phylogenetic profiling allows the prediction of protein-protein interactions at a whole genome level in both Prokaryotes and Eukaryotes. For this reason it is considered one of the most promising methods. Here, we propose an improvement of phylogenetic profiling that enables the handling of large genomic datasets and the inference of global protein-protein interactions. This method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation that makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms previously described phylogenetic profiling methods based on mutual information and Pearson's correlation as measures of profile similarity. In this work, we constructed and assessed robust reference sets and propose the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.
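    For readers unfamiliar with the similarity measure named here, the following is a minimal, unoptimized sketch of the sample distance correlation between two one-dimensional profiles, using double-centred pairwise distance matrices. It is not the Phylo-dCor implementation, which the abstract describes as parallelized R scripts; the function names and example profiles are assumptions made for illustration.

```python
import math

def _centered_distances(values):
    """Pairwise absolute distances, double-centred by row, column, and grand means."""
    n = len(values)
    dist = [[abs(a - b) for b in values] for a in values]
    row_means = [sum(row) / n for row in dist]  # the matrix is symmetric, so these are also column means
    grand_mean = sum(row_means) / n
    return [
        [dist[i][j] - row_means[i] - row_means[j] + grand_mean for j in range(n)]
        for i in range(n)
    ]

def distance_correlation(x, y):
    """Sample distance correlation of two equal-length numeric sequences (range 0 to 1)."""
    if len(x) != len(y):
        raise ValueError("profiles must have the same length")
    n = len(x)
    a = _centered_distances(x)
    b = _centered_distances(y)
    dcov2 = sum(a[i][j] * b[i][j] for i in range(n) for j in range(n)) / n ** 2
    dvar_x2 = sum(a[i][j] ** 2 for i in range(n) for j in range(n)) / n ** 2
    dvar_y2 = sum(b[i][j] ** 2 for i in range(n) for j in range(n)) / n ** 2
    denom = math.sqrt(dvar_x2 * dvar_y2)
    if denom == 0:
        return 0.0  # a constant profile carries no information about co-occurrence
    return math.sqrt(max(dcov2, 0.0) / denom)

# Hypothetical presence/absence profiles across five genomes.
profile_a = [1, 1, 0, 1, 0]
profile_b = [1, 1, 0, 1, 0]
profile_c = [1, 1, 1, 1, 1]

print(distance_correlation(profile_a, profile_b))
print(distance_correlation(profile_a, profile_c))
```

    This direct version builds full n-by-n distance matrices for every pair of profiles, which is the cost that a parallelized implementation would aim to spread across processors.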

  16. Molecular evolution of ependymin and the phylogenetic resolution of early divergences among euteleost fishes.

    PubMed

    Ortí, G; Meyer, A

    1996-04-01

    The rate and pattern of DNA evolution of ependymin, a single-copy gene coding for a highly expressed glycoprotein in the brain matrix of teleost fishes, is characterized and its phylogenetic utility for fish systematics is assessed. DNA sequences were determined from catfish, electric fish, and characiforms and compared with published ependymin sequences from cyprinids, salmon, pike, and herring. Among these groups, ependymin amino acid sequences were highly divergent (up to 60% sequence difference), but had surprisingly similar hydropathy profiles and invariant glycosylation sites, suggesting that functional properties of the proteins are conserved. Comparison of base composition at third codon positions and introns revealed AT-rich introns and GC-rich third codon positions, suggesting that the biased codon usage observed might not be due to mutational bias. Phylogenetic information content of third codon positions was surprisingly high and sufficient to recover the most basal nodes of the tree, in spite of the observation that pairwise distances (at third codon positions) were well above the presumed saturation level. This finding can be explained by the high proportion of phylogenetically informative nonsynonymous changes at third codon positions among these highly divergent proteins. Ependymin DNA sequences have established the first molecular evidence for the monophyly of a group containing salmonids and esociforms. In addition, ependymin suggests a sister group relationship of electric fish (Gymnotiformes) and Characiformes, constituting a significant departure from currently accepted classifications. However, relationships among characiform lineages were not completely resolved by ependymin sequences in spite of seemingly appropriate levels of variation among taxa and considerably low levels of homoplasy in the data (consistency index = 0.7). 
    If the diversification of Characiformes took place in an "explosive" manner, over a relatively short period of time, this pattern should also be observed using other phylogenetic markers. Poor conservation of ependymin's primary structure hinders the design of efficient primers for PCR that could be used in wide-ranging fish systematic studies. However, alternative methods like PCR amplification from cDNA used here should provide promising comparative sequence data for the resolution of phylogenetic relationships among other basal lineages of teleost fishes.

  17. A Penalized Likelihood Framework For High-Dimensional Phylogenetic Comparative Methods And An Application To New-World Monkeys Brain Evolution.

    PubMed

    Clavel, Julien; Aristide, Leandro; Morlon, Hélène

    2018-06-19

    Working with high-dimensional phylogenetic comparative datasets is challenging because likelihood-based multivariate methods suffer from low statistical performance as the number of traits p approaches the number of species n and because some computational complications occur when p exceeds n. Alternative phylogenetic comparative methods have recently been proposed to deal with the large p small n scenario but their use and performance are limited. Here we develop a penalized likelihood framework to deal with high-dimensional comparative datasets. We propose various penalizations and methods for selecting the intensity of the penalties. We apply this general framework to the estimation of parameters (the evolutionary trait covariance matrix and parameters of the evolutionary model) and model comparison for the high-dimensional multivariate Brownian (BM), Early-burst (EB), Ornstein-Uhlenbeck (OU) and Pagel's lambda models. We show using simulations that our penalized likelihood approach dramatically improves the estimation of evolutionary trait covariance matrices and model parameters when p approaches n, and allows for their accurate estimation when p equals or exceeds n. In addition, we show that penalized likelihood models can be efficiently compared using the Generalized Information Criterion (GIC). We implement these methods, as well as the related estimation of ancestral states and the computation of phylogenetic PCA, in the R packages RPANDA and mvMORPH. Finally, we illustrate the utility of the new proposed framework by evaluating evolutionary model fit, analyzing integration patterns, and reconstructing evolutionary trajectories for a high-dimensional 3-D dataset of brain shape in the New World monkeys. We find clear support for an Early-burst model suggesting an early diversification of brain morphology during the ecological radiation of the clade. Penalized likelihood offers an efficient way to deal with high-dimensional multivariate comparative data.

  18. PHYLOGEOrec: A QGIS plugin for spatial phylogeographic reconstruction from phylogenetic tree and geographical information data

    NASA Astrophysics Data System (ADS)

    Nashrulloh, Maulana Malik; Kurniawan, Nia; Rahardi, Brian

    2017-11-01

    The increasing availability of genetic sequence data associated with explicit geographic and environment (including biotic and abiotic components) information offers new opportunities to study the processes that shape biodiversity and its patterns. Developing phylogeography reconstruction, by integrating phylogenetic and biogeographic knowledge, provides richer and deeper visualization and information on diversification events than ever before. Geographical information systems such as QGIS provide an environment for spatial modeling, analysis, and dissemination by which phylogenetic models can be explicitly linked with their associated spatial data, and subsequently, they will be integrated with other related georeferenced datasets describing the biotic and abiotic environment. We introduce PHYLOGEOrec, a QGIS plugin for building spatial phylogeographic reconstructions constructed from phylogenetic tree and geographical information data based on QGIS2threejs. By using PHYLOGEOrec, researchers can integrate existing phylogeny and geographical information data, resulting in three-dimensional geographic visualizations of phylogenetic trees in the Keyhole Markup Language (KML) format. Such formats can be overlaid on a map using QGIS and finally, spatially viewed in QGIS by means of a QGIS2threejs engine for further analysis. KML can also be viewed in reputable geobrowsers with KML-support (e.g., Google Earth).

  19. Complete chloroplast genome sequences of Praxelis (Eupatorium catarium Veldkamp), an important invasive species.

    PubMed

    Zhang, Ying; Li, Lei; Yan, Ting Liang; Liu, Qiang

    2014-10-01

    Praxelis (Eupatorium catarium Veldkamp) is a new hazardous invasive plant species that has caused serious economic losses and environmental damage in the Northern hemisphere tropical and subtropical regions. Although previous studies focused on detecting the biological characteristics of this plant to prevent its expansion, little effort has been made to understand the impact of Praxelis on the ecosystem in an evolutionary process. The genetic information of Praxelis is required for further phylogenetic identification and evolutionary studies. Here, we report the complete Praxelis chloroplast (cp) genome sequence. The Praxelis chloroplast genome is 151,410 bp in length including a small single-copy region (18,547 bp) and a large single-copy region (85,311 bp) separated by a pair of inverted repeats (IRs; 23,776 bp). The genome contains 85 unique and 18 duplicated genes in the IR region. The gene content and organization are similar to other Asteraceae tribe cp genomes. We also analyzed the whole cp genome sequence, repeat structure, codon usage, contraction of the IR and gene structure/organization features between native and invasive Asteraceae plants, in order to understand the evolution of organelle genomes between native and invasive Asteraceae. Comparative analysis identified 14 markers containing greater than 2% parsimony-informative characters, indicating that they are potential informative markers for barcoding and phylogenetic analysis. Moreover, a sister relationship between Praxelis and seven other species in Asteraceae was found based on phylogenetic analysis of 28 protein-coding sequences. Complete cp genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family. Copyright © 2014 Elsevier B.V. All rights reserved.
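
    The reported lengths are internally consistent if the 23,776 bp figure is read as the length of each inverted repeat copy, as is usual for a quadripartite chloroplast genome. A quick check, with variable names chosen here for illustration:

```python
# Region lengths in base pairs, taken from the abstract above.
lsc = 85_311   # large single-copy region
ssc = 18_547   # small single-copy region
ir = 23_776    # one inverted repeat copy (assumed to be the per-copy length)

# Quadripartite structure: LSC + SSC + two IR copies.
total = lsc + ssc + 2 * ir
print(total)   # 151410, matching the reported genome length
```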

  20. PLAZA 3.0: an access point for plant comparative genomics

    PubMed Central

    Proost, Sebastian; Van Bel, Michiel; Vaneechoutte, Dries; Van de Peer, Yves; Inzé, Dirk; Mueller-Roeber, Bernd; Vandepoele, Klaas

    2015-01-01

    Comparative sequence analysis has significantly altered our view on the complexity of genome organization and gene functions in different kingdoms. PLAZA 3.0 is designed to make comparative genomics data for plants available through a user-friendly web interface. Structural and functional annotation, gene families, protein domains, phylogenetic trees and detailed information about genome organization can easily be queried and visualized. Compared with the first version released in 2009, which featured nine organisms, the number of integrated genomes is more than four times higher, and now covers 37 plant species. The new species provide a wider phylogenetic range as well as a more in-depth sampling of specific clades, and genomes of additional crop species are present. The functional annotation has been expanded and now comprises data from Gene Ontology, MapMan, UniProtKB/Swiss-Prot, PlnTFDB and PlantTFDB. Furthermore, we improved the algorithms to transfer functional annotation from well-characterized plant genomes to other species. The additional data and new features make PLAZA 3.0 (http://bioinformatics.psb.ugent.be/plaza/) a versatile and comprehensible resource for users wanting to explore genome information to study different aspects of plant biology, both in model and non-model organisms. PMID:25324309

  1. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing.

    PubMed

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education.

  2. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing

    PubMed Central

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J.; O'Donnell, Kerry; Geiser, David M.; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  3. Comparative study of notoungulate (Placentalia, Mammalia) bony labyrinths and new phylogenetically informative inner ear characters

    PubMed Central

    Macrini, Thomas E; Flynn, John J; Ni, Xijun; Croft, Darin A; Wyss, André R

    2013-01-01

    The phylogenetic relationships of notoungulates, an extinct group of predominantly South American herbivores, remain poorly resolved with respect to both other placental mammals and among one another. Most previous phylogenetic analyses of notoungulates have not included characters of the internal cranium, not least because few such features, including the bony labyrinth, have been described for members of the group. Here we describe the inner ears of the notoungulates Altitypotherium chucalensis (Mesotheriidae), Pachyrukhos moyani (Hegetotheriidae) and Cochilius sp. (Interatheriidae) based on reconstructions of bony labyrinths obtained from computed tomography imagery. Comparisons of the bony labyrinths of these taxa with the basally diverging notoungulate Notostylops murinus (Notostylopidae), an isolated petrosal from Itaboraí, Brazil, referred to Notoungulata, and six therian outgroups, yielded an inner ear character matrix of 25 potentially phylogenetically informative characters, 14 of them novel to this study. Two equivocally optimized character states potentially support a pairing of Mesotheriidae and Hegetotheriidae, whereas four others may be diagnostic of Notoungulata. Three additional characters are potentially informative for diagnosing more inclusive clades: one for crown Placentalia; another for a clade containing Kulbeckia, Zalambdalestes, and Placentalia; and a third for Eutheria (crown Placentalia plus stem taxa). Several other characters are apomorphic for at least one notoungulate in our study and are of potential interest for broader taxonomic sampling within Notoungulata to clarify currently enigmatic interrelationships. Measures of the semicircular canals were used to infer agility (e.g. capable of quick movements vs. lethargic movements) of these taxa. Agility scores calculated from these data generally corroborate interpretations based on postcranial remains of these or closely related species. 
    We provide estimates of the low-frequency hearing limits in notoungulates based on the ratio of radii of the apical and basal turns of the cochlea. These limits range from 15 Hz in Notostylops to 149 Hz in Pachyrukhos, values comparable to the Asian elephant (Elephas maximus) and the California sea lion (Zalophus californianus) when hearing in air, respectively. PMID:24102069

  4. MaxAlign: maximizing usable data in an alignment.

    PubMed

    Gouveia-Oliveira, Rodrigo; Sackett, Peter W; Pedersen, Anders G

    2007-08-28

    The presence of gaps in an alignment of nucleotide or protein sequences is often an inconvenience for bioinformatical studies. In phylogenetic and other analyses, for instance, gapped columns are often discarded entirely from the alignment. MaxAlign is a program that optimizes the alignment prior to such analyses. Specifically, it maximizes the number of nucleotide (or amino acid) symbols that are present in gap-free columns - the alignment area - by selecting the optimal subset of sequences to exclude from the alignment. MaxAlign can be used prior to phylogenetic and bioinformatical analyses as well as in other situations where this form of alignment improvement is useful. In this work we test MaxAlign's performance in these tasks and compare the accuracy of phylogenetic estimates including and excluding gapped columns from the analysis, with and without processing with MaxAlign. In this paper we also introduce a new simple measure of tree similarity, Normalized Symmetric Similarity (NSS) that we consider useful for comparing tree topologies. We demonstrate how MaxAlign is helpful in detecting misaligned or defective sequences without requiring manual inspection. We also show that it is not advisable to exclude gapped columns from phylogenetic analyses unless MaxAlign is used first. Finally, we find that the sequences removed by MaxAlign from an alignment tend to be those that would otherwise be associated with low phylogenetic accuracy, and that the presence of gaps in any given sequence does not seem to disturb the phylogenetic estimates of other sequences. The MaxAlign web-server is freely available online at http://www.cbs.dtu.dk/services/MaxAlign where supplementary information can also be found. The program is also freely available as a Perl stand-alone package.
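
    The "alignment area" being maximized can be made concrete with a small sketch. The area function follows the definition in the abstract (symbols in gap-free columns); the greedy removal loop is only an assumed heuristic for illustration and is not claimed to be MaxAlign's actual optimization procedure. Sequence names and data are invented.

```python
def alignment_area(seqs, gap="-"):
    """Number of symbols in gap-free columns: gap-free columns times sequences."""
    if not seqs:
        return 0
    gap_free = sum(all(c != gap for c in col) for col in zip(*seqs))
    return gap_free * len(seqs)

def greedy_prune(alignment):
    """Repeatedly drop the single sequence whose removal most increases the area."""
    kept = dict(alignment)
    while len(kept) > 1:
        best_name, best_area = None, alignment_area(list(kept.values()))
        for name in kept:
            rest = [s for n, s in kept.items() if n != name]
            area = alignment_area(rest)
            if area > best_area:
                best_name, best_area = name, area
        if best_name is None:   # no single removal helps any more
            break
        del kept[best_name]
    return kept

# Hypothetical alignment: s3 introduces gaps in four of the eight columns.
aln = {
    "s1": "ACGTACGT",
    "s2": "ACGTACGA",
    "s3": "AC--AC--",
    "s4": "ACGTTCGT",
}
before = alignment_area(list(aln.values()))   # 4 gap-free columns x 4 sequences
pruned = greedy_prune(aln)
after = alignment_area(list(pruned.values())) # 8 gap-free columns x 3 sequences
```

    Dropping one gappy sequence raises the usable area from 16 to 24 symbols here, which is the trade-off the abstract describes.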

  5. Comparative Skull Morphology of Uropeltid Snakes (Alethinophidia: Uropeltidae) with Special Reference to Disarticulated Elements and Variation

    PubMed Central

    Olori, Jennifer C.; Bell, Christopher J.

    2012-01-01

    Uropeltids form a diverse clade of highly derived, fossorial snakes that, because of their phylogenetic position among other alethinophidian lineages, may play a key role in understanding the early evolution of cranial morphology in snakes. We include detailed osteological descriptions of crania and mandibles for eight uropeltid species from three nominal genera (Uropeltis, Rhinophis, and Brachyophidium) and emphasize disarticulated elements and the impact of intraspecific variation on previously proposed morphological characters used for phylogenetic analysis. Preliminary analysis of phylogenetic relationships strongly supports a clade composed exclusively of species of Plectrurus, Uropeltis, and Rhinophis. However, monophyly of each of those genera and Melanophidium is not upheld. There is moderate support that Sri Lankan species (e.g., Rhinophis and Uropeltis melanogaster) are monophyletic with respect to Indian uropeltids. Previously proposed characters that are phylogenetically informative include the shape of the nasals, length of the occipital condyle, level of development of the posteroventral process of the dentary, and participation of the parietal in the optic foramen. Additionally, thirty new features that may be systematically informative are identified and described, but were not verified for their utility. Such verification must await availability of additional disarticulated cranial material from a larger sample of taxa. All characters require further testing through increased focus on sources and patterns of intraspecific variation, inclusion of broader taxonomic samples in comparative studies, and exploration of skeletal development, sexual dimorphism, and biogeographic patterns. 
    Additionally, trends in the relative enlargement of the sensory capsules, reduction in cranial ossification and dentition, fusion of elements, and the appearance of novel morphological conditions, such as the structure and location of the suspensorium, may be related to fossoriality and miniaturization in some uropeltid taxa, and may complicate analysis of relationships within Uropeltidae and among alethinophidian snakes. PMID:22412874

  6. Influenza Virus Database (IVDB): an integrated information resource and analysis platform for influenza virus research.

    PubMed

    Chang, Suhua; Zhang, Jiajie; Liao, Xiaoyun; Zhu, Xinxing; Wang, Dahai; Zhu, Jiang; Feng, Tao; Zhu, Baoli; Gao, George F; Wang, Jian; Yang, Huanming; Yu, Jun; Wang, Jing

    2007-01-01

    Frequent outbreaks of highly pathogenic avian influenza and the increasing data available for comparative analysis require a central database specialized in influenza viruses (IVs). We have established the Influenza Virus Database (IVDB) to integrate information and create an analysis platform for genetic, genomic, and phylogenetic studies of the virus. IVDB hosts complete genome sequences of influenza A virus generated by Beijing Institute of Genomics (BIG) and curates all other published IV sequences after expert annotation. Our Q-Filter system classifies and ranks all nucleotide sequences into seven categories according to sequence content and integrity. IVDB provides a series of tools and viewers for comparative analysis of the viral genomes, genes, genetic polymorphisms and phylogenetic relationships. A search system has been developed for users to retrieve a combination of different data types by setting search options. To facilitate analysis of global viral transmission and evolution, the IV Sequence Distribution Tool (IVDT) has been developed to display the worldwide geographic distribution of chosen viral genotypes and to couple genomic data with epidemiological data. The BLAST, multiple sequence alignment and phylogenetic analysis tools were integrated for online data analysis. Furthermore, IVDB offers instant access to pre-computed alignments and polymorphisms of IV genes and proteins, and presents the results as SNP distribution plots and minor allele distributions. IVDB is publicly available at http://influenza.genomics.org.cn.

  7. Radiating despite a Lack of Character: Ecological Divergence among Closely Related, Morphologically Similar Honeyeaters (Aves: Meliphagidae) Co-occurring in Arid Australian Environments.

    PubMed

    Miller, Eliot T; Wagner, Sarah K; Harmon, Luke J; Ricklefs, Robert E

    2017-02-01

    Quantifying the relationship between form and function can inform use of morphology as a surrogate for ecology. How the strength of this relationship varies continentally can inform understanding of evolutionary radiations; for example, does the relationship break down when certain lineages invade and diversify in novel habitats? The 75 species of Australian honeyeaters (Meliphagidae) are morphologically and ecologically diverse, with species feeding on nectar, insects, fruit, and other resources. We investigated Meliphagidae ecomorphology and community structure by (1) quantifying the concordance between morphology and ecology (foraging behavior), (2) estimating rates of trait evolution in relation to the packing of ecological space, and (3) comparing phylogenetic and trait community structure across the broad environmental gradients of the continent. We found that morphology explained 37% of the variance in ecology (and 62% vice versa), and we uncovered well-known bivariate relationships among the multivariate ecomorphological data. Ecological trait diversity declined less rapidly than phylogenetic diversity along a gradient of decreasing precipitation. We employ a new method (trait fields) and extend another (phylogenetic fields) to show that while species in phylogenetically clustered, arid-environment assemblages are similar morphologically, they are as varied in foraging behavior as those from more diverse assemblages. Thus, although closely related and similar morphologically, these arid-adapted species have diverged in ecological space to a similar degree as their mesic counterparts.

  8. Tetrapods on the EDGE: Overcoming data limitations to identify phylogenetic conservation priorities

    PubMed Central

    Gray, Claudia L.; Wearn, Oliver R.; Owen, Nisha R.

    2018-01-01

    The scale of the ongoing biodiversity crisis requires both effective conservation prioritisation and urgent action. As extinction is non-random across the tree of life, it is important to prioritise threatened species which represent large amounts of evolutionary history. The EDGE metric prioritises species based on their Evolutionary Distinctiveness (ED), which measures the relative contribution of a species to the total evolutionary history of their taxonomic group, and Global Endangerment (GE), or extinction risk. EDGE prioritisations rely on adequate phylogenetic and extinction risk data to generate meaningful priorities for conservation. However, comprehensive phylogenetic trees of large taxonomic groups are extremely rare and, even when available, become quickly out-of-date due to the rapid rate of species descriptions and taxonomic revisions. Thus, it is important that conservationists can use the available data to incorporate evolutionary history into conservation prioritisation. We compared published and new methods to estimate missing ED scores for species absent from a phylogenetic tree whilst simultaneously correcting the ED scores of their close taxonomic relatives. We found that following artificial removal of species from a phylogenetic tree, the new method provided the closest estimates of their “true” ED score, differing from the true ED score by an average of less than 1%, compared to the 31% and 38% difference of the previous methods. The previous methods also substantially under- and over-estimated scores as more species were artificially removed from a phylogenetic tree. We therefore used the new method to estimate ED scores for all tetrapods. From these scores we updated EDGE prioritisation rankings for all tetrapod species with IUCN Red List assessments, including the first EDGE prioritisation for reptiles. 
    Further, we identified criteria to identify robust priority species in an effort to further inform conservation action whilst limiting uncertainty and anticipating future phylogenetic advances. PMID:29641585
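
    As background, ED and EDGE scores can be computed on a toy tree as follows. The abstract does not give formulas, so this sketch assumes the commonly cited formulation: "fair proportion" ED (each branch length shared equally among its descendant tips) and EDGE = ln(1 + ED) + GE x ln(2), with GE coded 0 (Least Concern) to 4 (Critically Endangered). The tree, tip names and the child-to-parent representation are invented; the imputation method for missing species described in the abstract is not reproduced.

```python
from math import log

# Hypothetical rooted tree: child -> (parent, branch length). The root has no entry.
tree = {
    "A": ("n1", 2.0),
    "B": ("n1", 2.0),
    "n1": ("root", 1.0),
    "C": ("root", 3.0),
}
tips = ["A", "B", "C"]

def path_to_root(node, tree):
    """Yield (node, branch length above node) from a tip up to the root."""
    while node in tree:
        parent, length = tree[node]
        yield node, length
        node = parent

def fair_proportion_ed(tree, tips):
    """ED per tip: each branch length divided by the number of tips descending from it."""
    n_desc = {}
    for tip in tips:
        for node, _ in path_to_root(tip, tree):
            n_desc[node] = n_desc.get(node, 0) + 1
    return {tip: sum(length / n_desc[node] for node, length in path_to_root(tip, tree))
            for tip in tips}

def edge_score(ed, ge):
    """EDGE under the assumed formulation; ge is the Red List category coded 0-4."""
    return log(1 + ed) + ge * log(2)

ed = fair_proportion_ed(tree, tips)   # {'A': 2.5, 'B': 2.5, 'C': 3.0}
edge_c = edge_score(ed["C"], 4)       # C treated as Critically Endangered
```

    The ED scores sum to the total branch length of the tree (8.0 here), which is a convenient sanity check on any implementation.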

  9. Phylogenetic versus functional signals in the evolution of form-function relationships in terrestrial vision.

    PubMed

    Motani, Ryosuke; Schmitz, Lars

    2011-08-01

    Phylogeny is deeply pertinent to evolutionary studies. Traits that perform a body function are expected to be strongly influenced by physical "requirements" of the function. We investigated if such traits exhibit phylogenetic signals, and, if so, how phylogenetic noises bias quantification of form-function relationships. A form-function system that is strongly influenced by physics, namely the relationship between eye morphology and visual optics in amniotes, was used. We quantified the correlation between form (i.e., eye morphology) and function (i.e., ocular optics) while varying the level of phylogenetic bias removal through adjusting Pagel's λ. Ocular soft-tissue dimensions exhibited the highest correlation with ocular optics when 1% of phylogenetic bias expected from Brownian motion was removed (i.e., λ = 0.01); the value for hard-tissue data was 8%. A small degree of phylogenetic bias therefore exists in morphology despite the stringent functional constraints. We also devised a phylogenetically informed discriminant analysis and recorded the effects of phylogenetic bias on this method using the same data. Use of proper λ values during phylogenetic bias removal improved misidentification rates in resulting classifications when prior probabilities were assumed to be equal. Even a small degree of phylogenetic bias affected the classification resulting from phylogenetically informed discriminant analysis. © 2011 The Author(s). Evolution © 2011 The Society for the Study of Evolution.
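
    In its usual formulation, Pagel's λ rescales the off-diagonal (shared-history) entries of the phylogenetic variance-covariance matrix while leaving the diagonal untouched. The sketch below shows only that transformation; the three-taxon matrix is invented, and the correlation and discriminant analyses from the study are not reproduced.

```python
def pagel_lambda(vcv, lam):
    """Multiply off-diagonal covariances by lam; keep the variances on the diagonal."""
    n = len(vcv)
    return [[vcv[i][j] if i == j else lam * vcv[i][j] for j in range(n)]
            for i in range(n)]

# Hypothetical Brownian-motion covariance matrix for three taxa:
# diagonal = root-to-tip distance, off-diagonal = shared branch length.
vcv = [
    [3.0, 1.0, 0.0],
    [1.0, 3.0, 0.0],
    [0.0, 0.0, 3.0],
]

full = pagel_lambda(vcv, 1.0)    # unchanged: full Brownian-motion covariance
star = pagel_lambda(vcv, 0.0)    # diagonal only: taxa treated as independent
weak = pagel_lambda(vcv, 0.01)   # the λ value reported for soft-tissue data
```

    Values of λ between 0 and 1 therefore interpolate between treating species as independent and assuming the covariance implied by the tree.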

  10. Spatial phylogenetics of the vascular flora of Chile.

    PubMed

    Scherson, Rosa A; Thornhill, Andrew H; Urbina-Casanova, Rafael; Freyman, William A; Pliscoff, Patricio A; Mishler, Brent D

    2017-07-01

    Current geographic patterns of biodiversity are a consequence of the evolutionary history of the lineages that comprise them. This study was aimed at exploring how evolutionary features of the vascular flora of Chile are distributed across the landscape. Using a phylogeny at the genus level for 87% of the Chilean vascular flora, and a geographic database of sample localities, we calculated phylogenetic diversity (PD), phylogenetic endemism (PE), relative PD (RPD), and relative PE (RPE). Categorical Analyses of Neo- and Paleo-Endemism (CANAPE) were also performed, using a spatial randomization to assess statistical significance. A cluster analysis using range-weighted phylogenetic turnover was used to compare among grid cells, and with known Chilean bioclimates. PD patterns were concordant with known centers of high taxon richness and the Chilean biodiversity hotspot. In addition, several other interesting areas of concentration of evolutionary history were revealed as potential conservation targets. The south of the country shows areas of significantly high RPD and a concentration of paleo-endemism, and the north shows areas of significantly low PD and RPD, and a concentration of neo-endemism. Range-weighted phylogenetic turnover shows high congruence with the main macrobioclimates of Chile. Even though the study was done at the genus level, the outcome provides an accurate outline of phylogenetic patterns that can be filled in as more fine-scaled information becomes available. Copyright © 2017 Elsevier Inc. All rights reserved.
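
    Of the metrics listed, phylogenetic diversity (PD) is the simplest to illustrate: it sums the branch lengths of the subtree connecting a set of taxa. The sketch below assumes the convention that paths are measured back to the root, and uses an invented tree in a child-to-parent representation; PE, RPD, RPE and CANAPE involve range sizes and randomizations that are not shown.

```python
# Hypothetical rooted tree: child -> (parent, branch length). The root has no entry.
tree = {
    "A": ("n1", 2.0),
    "B": ("n1", 2.0),
    "n1": ("root", 1.0),
    "C": ("root", 3.0),
}

def phylogenetic_diversity(tree, taxa):
    """Sum of branch lengths on the paths from the given taxa to the root,
    counting each branch once."""
    seen = set()
    total = 0.0
    for tip in taxa:
        node = tip
        while node in tree and node not in seen:
            parent, length = tree[node]
            seen.add(node)
            total += length
            node = parent
    return total

# PD for two hypothetical grid cells and for the whole flora.
pd_cell_1 = phylogenetic_diversity(tree, ["A", "B"])   # 2 + 2 + 1
pd_cell_2 = phylogenetic_diversity(tree, ["A", "C"])   # 2 + 1 + 3
pd_all = phylogenetic_diversity(tree, ["A", "B", "C"])
```

    Two cells with the same taxon count can differ in PD, as here, because closely related taxa share more of their evolutionary history.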

  11. A Phylogenetic, Biogeographic, and Taxonomic study of all Extant Species of Anolis (Squamata; Iguanidae).

    PubMed

    Poe, Steven; Nieto-Montes de Oca, Adrián; Torres-Carvajal, Omar; De Queiroz, Kevin; Velasco, Julián A; Truett, Brad; Gray, Levi N; Ryan, Mason J; Köhler, Gunther; Ayala-Varela, Fernando; Latella, Ian

    2017-09-01

    Anolis lizards (anoles) are textbook study organisms in evolution and ecology. Although several topics in evolutionary biology have been elucidated by the study of anoles, progress in some areas has been hampered by limited phylogenetic information on this group. Here, we present a phylogenetic analysis of all 379 extant species of Anolis, with new phylogenetic data for 139 species including new DNA data for 101 species. We use the resulting estimates as a basis for defining anole clade names under the principles of phylogenetic nomenclature and to examine the biogeographic history of anoles. Our new taxonomic treatment achieves the supposed advantages of recent subdivisions of anoles that employed ranked Linnaean-based nomenclature while avoiding the pitfalls of those approaches regarding artificial constraints imposed by ranks. Our biogeographic analyses demonstrate complexity in the dispersal history of anoles, including multiple crossings of the Isthmus of Panama, two invasions of the Caribbean, single invasions to Jamaica and Cuba, and a single evolutionary dispersal from the Caribbean to the mainland that resulted in substantial anole diversity. Our comprehensive phylogenetic estimate of anoles should prove useful for rigorous testing of many comparative evolutionary hypotheses. [Anoles; biogeography; lizards; Neotropics; phylogeny; taxonomy]. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  12. Partial 16S rRNA primary structure of five Actinomyces species: phylogenetic implications and development of an Actinomyces israelii-specific oligonucleotide probe.

    PubMed

    Stackebrandt, E; Charfreitag, O

    1990-01-01

    The intra- and intergeneric relationships of the genus Actinomyces were determined by comparing long 16S rRNA sequences, generated by reverse transcriptase. All species formed a phylogenetically coherent cluster in which Actinomyces bovis, A. viscosus, A. naeslundii, A. odontolyticus and A. israelii constituted genetically well defined species. A. israelii DSM 43322 (serotype 2) was not closely related to three other strains of this species (serotype 1) and, as judged from phylogenetic distances, could be accommodated within A. naeslundii, or represent a new species. In contrast to previous findings, members of the genus Actinomyces appear to be related to Bifidobacterium bifidum. Sequence information was used to develop an oligonucleotide probe for the A. israelii serotype 1 strains, which did not react with the serotype 2 strain or with rRNA from strains of eight Actinomyces species.

  13. Enhanced use of phylogenetic data to inform public health approaches to HIV among MSM

    PubMed Central

    German, Danielle; Grabowski, Mary Kate; Beyrer, Chris

    2017-01-01

    The multi-dimensional nature and continued evolution of HIV epidemics among men who have sex with men (MSM) require innovative intervention approaches. Strategies are needed that recognize the individual, social, and structural factors driving HIV transmission; that can pinpoint networks with heightened transmission risk; and that can help target intervention in real-time. HIV phylogenetics is a rapidly evolving field with strong promise for informing innovative responses to the HIV epidemic among MSM. Currently, HIV phylogenetic insights are providing new understandings of characteristics of HIV epidemics involving MSM, social networks influencing transmission, characteristics of HIV transmission clusters involving MSM, targets for antiretroviral and other prevention strategies, and dynamics of emergent epidemics. Maximizing the potential of HIV phylogenetics for HIV responses among MSM will require attention to key methodological challenges and ethical considerations, as well as resolving key implementation and scientific questions. Enhanced and integrated use of HIV surveillance, socio-behavioral, and phylogenetic data resources is becoming increasingly critical for informing public health approaches to HIV among MSM. PMID:27584826

  14. Transmission clustering among newly diagnosed HIV patients in Chicago, 2008 to 2011: using phylogenetics to expand knowledge of regional HIV transmission patterns

    PubMed Central

    Lubelchek, Ronald J.; Hoehnen, Sarah C.; Hotton, Anna L.; Kincaid, Stacey L.; Barker, David E.; French, Audrey L.

    2014-01-01

    Introduction: HIV transmission cluster analyses can inform HIV prevention efforts. We describe the first such assessment for transmission clustering among HIV patients in Chicago. Methods: We performed transmission cluster analyses using HIV pol sequences from newly diagnosed patients presenting to Chicago’s largest HIV clinic between 2008 and 2011. We compared sequences via progressive pairwise alignment, using neighbor joining to construct an un-rooted phylogenetic tree. We defined clusters as >2 sequences among which each sequence had at least one partner within a genetic distance of ≤ 1.5%. We used multivariable regression to examine factors associated with clustering and used geospatial analysis to assess geographic proximity of phylogenetically clustered patients. Results: We compared sequences from 920 patients; median age 35 years; 75% male; 67% Black, 23% Hispanic; 8% had a Rapid Plasma Reagin (RPR) titer ≥ 1:16 concurrent with their HIV diagnosis. We had HIV transmission risk data for 54%; 43% identified as men who have sex with men (MSM). Phylogenetic analysis demonstrated 123 patients (13%) grouped into 26 clusters, the largest having 20 members. In multivariable regression, age < 25, Black race, MSM status, male gender, higher HIV viral load, and RPR ≥ 1:16 were associated with clustering. We did not observe geographic grouping of genetically clustered patients. Discussion: Our results demonstrate high rates of HIV transmission clustering, without local geographic foci, among young Black MSM in Chicago. Applied prospectively, phylogenetic analyses could guide prevention efforts and help break the cycle of transmission. PMID:25321182
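
    The cluster definition in the Methods above (more than two sequences, each with at least one partner within a genetic distance of 1.5%) can be sketched in Python as connected components of a distance-threshold graph. This is only one reading of that definition. The distance used here is a plain proportion of differing sites on already aligned sequences, which is an assumption and not necessarily the distance measure the authors used; the sequences and function names are hypothetical.

```python
from itertools import combinations


def hamming_fraction(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def threshold_clusters(seqs, max_dist=0.015, min_size=3):
    """Group sequences linked by any chain of pairwise distances <= max_dist."""
    ids = list(seqs)
    parent = {i: i for i in ids}

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in combinations(ids, 2):
        if hamming_fraction(seqs[a], seqs[b]) <= max_dist:
            parent[find(a)] = find(b)

    groups = {}
    for i in ids:
        groups.setdefault(find(i), set()).add(i)
    # Keep only groups with more than two members (min_size = 3).
    return [g for g in groups.values() if len(g) >= min_size]


# Hypothetical 200-site sequences.
base = "A" * 200
seqs = {
    "s1": base,
    "s2": "CC" + base[2:],        # 1.0% from s1
    "s3": "CCCC" + base[4:],      # 1.0% from s2, 2.0% from s1
    "s4": "G" * 200,              # close only to s5
    "s5": "G" * 199 + "T",        # 0.5% from s4
}
clusters = threshold_clusters(seqs)
```

    In this toy input s1 and s3 differ by 2% yet fall in the same cluster because both are within 1.5% of s2, while the s4/s5 pair is dropped for having only two members.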

  15. Phylobetadiversity among forest types in the Brazilian Atlantic Forest complex.

    PubMed

    Duarte, Leandro Da Silva; Bergamin, Rodrigo Scarton; Marcilio-Silva, Vinícius; Seger, Guilherme Dubal Dos Santos; Marques, Márcia Cristina Mendes

    2014-01-01

    Phylobetadiversity is defined as the phylogenetic resemblance between communities or biomes. Analyzing phylobetadiversity patterns among different vegetation physiognomies within a single biome is crucial to understand the historical affinities between them. Based on the widely accepted idea that different forest physiognomies within the Southern Brazilian Atlantic Forest constitute different facies of a single biome, we hypothesize that more recent phylogenetic nodes should drive phylobetadiversity gradients between the different forest types within the Atlantic Forest, as the phylogenetic divergence among those forest types is biogeographically recent. We compiled information from 206 checklists describing the occurrence of shrub/tree species across three different forest physiognomies within the Southern Brazilian Atlantic Forest (Dense, Mixed and Seasonal forests). We analyzed intra-site phylogenetic structure (phylogenetic diversity, net relatedness index and nearest taxon index) and phylobetadiversity between plots located at different forest types, using five different methods differing in sensitivity to either basal or terminal nodes (phylogenetic fuzzy weighting, COMDIST, COMDISTNT, UniFrac and Rao's H). Mixed forests showed higher phylogenetic diversity and overdispersion than the other forest types. Furthermore, all forest types differed from each other in phylobetadiversity patterns, particularly when phylobetadiversity methods more sensitive to terminal nodes were employed. Mixed forests tended to show higher phylogenetic differentiation from Dense and Seasonal forests than the latter two showed from each other. The higher phylogenetic diversity and phylobetadiversity levels found in Mixed forests when compared to the others likely result from the biogeographical origin of several taxa occurring in these forests. On one hand, Mixed forests shelter several temperate taxa, like the conifers Araucaria and Podocarpus. On the other hand, tropical groups, like Myrtaceae, are also very representative of this forest type. We point to the need for more attention to Mixed forests as a conservation target within the Brazilian Atlantic Forest given their high phylogenetic uniqueness.
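
    Of the five phylobetadiversity methods listed in the abstract above, UniFrac is perhaps the simplest to sketch. The Python example below assumes the unweighted form: the fraction of branch length, among branches leading to taxa in either community, that leads to taxa in only one of them. The toy tree, the communities, and the function names are hypothetical and are not taken from the study.

```python
# Hypothetical toy tree: node -> (parent, length of the branch to the parent).
TREE = {
    "root": (None, 0.0),
    "n1": ("root", 1.0),
    "A": ("n1", 2.0),
    "B": ("n1", 2.0),
    "n2": ("root", 1.0),
    "C": ("n2", 1.0),
    "D": ("n2", 3.0),
}


def community_branches(tree, taxa):
    """Branches (named by their child node) on any path from the taxa to the root."""
    used = set()
    for tip in taxa:
        node = tip
        while tree[node][0] is not None:
            used.add(node)
            node = tree[node][0]
    return used


def unweighted_unifrac(tree, comm_a, comm_b):
    """Unique branch length divided by total branch length spanned by both communities."""
    a = community_branches(tree, comm_a)
    b = community_branches(tree, comm_b)
    total = sum(tree[n][1] for n in a | b)
    unique = sum(tree[n][1] for n in a ^ b)  # branches found in exactly one community
    return unique / total if total else 0.0


# Shared branches: B and n1 (length 3); unique branches: A, C, n2 (length 4).
d_ab = unweighted_unifrac(TREE, {"A", "B"}, {"B", "C"})
```

    Two identical communities give 0 and two communities that share no branch give 1, so in this form a single shared deep branch already lowers the distance.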

  16. Phylobetadiversity among Forest Types in the Brazilian Atlantic Forest Complex

    PubMed Central

    Duarte, Leandro Da Silva; Bergamin, Rodrigo Scarton; Marcilio-Silva, Vinícius; Seger, Guilherme Dubal Dos Santos; Marques, Márcia Cristina Mendes

    2014-01-01

    Phylobetadiversity is defined as the phylogenetic resemblance between communities or biomes. Analyzing phylobetadiversity patterns among different vegetation physiognomies within a single biome is crucial to understand the historical affinities between them. Based on the widely accepted idea that different forest physiognomies within the Southern Brazilian Atlantic Forest constitute different facies of a single biome, we hypothesize that more recent phylogenetic nodes should drive phylobetadiversity gradients between the different forest types within the Atlantic Forest, as the phylogenetic divergence among those forest types is biogeographically recent. We compiled information from 206 checklists describing the occurrence of shrub/tree species across three different forest physiognomies within the Southern Brazilian Atlantic Forest (Dense, Mixed and Seasonal forests). We analyzed intra-site phylogenetic structure (phylogenetic diversity, net relatedness index and nearest taxon index) and phylobetadiversity between plots located at different forest types, using five different methods differing in sensitivity to either basal or terminal nodes (phylogenetic fuzzy weighting, COMDIST, COMDISTNT, UniFrac and Rao’s H). Mixed forests showed higher phylogenetic diversity and overdispersion than the other forest types. Furthermore, all forest types differed from each other in phylobetadiversity patterns, particularly when phylobetadiversity methods more sensitive to terminal nodes were employed. Mixed forests tended to show higher phylogenetic differentiation from Dense and Seasonal forests than the latter two showed from each other. The higher phylogenetic diversity and phylobetadiversity levels found in Mixed forests when compared to the others likely result from the biogeographical origin of several taxa occurring in these forests. On one hand, Mixed forests shelter several temperate taxa, like the conifers Araucaria and Podocarpus. On the other hand, tropical groups, like Myrtaceae, are also very representative of this forest type. We point to the need for more attention to Mixed forests as a conservation target within the Brazilian Atlantic Forest given their high phylogenetic uniqueness. PMID:25121495

  17. GenomicusPlants: a web resource to study genome evolution in flowering plants.

    PubMed

    Louis, Alexandra; Murat, Florent; Salse, Jérôme; Crollius, Hugues Roest

    2015-01-01

    Comparative genomics combined with phylogenetic reconstructions are powerful approaches to study the evolution of genes and genomes. However, the current rapid expansion of the volume of genomic information makes it increasingly difficult to interrogate, integrate and synthesize comparative genome data while taking into account the maximum breadth of information available. GenomicusPlants (http://www.genomicus.biologie.ens.fr/genomicus-plants) is an extension of the Genomicus webserver that addresses this issue by allowing users to explore flowering plant genomes in an intuitive way, across the broadest evolutionary scales. Extant genomes of 26 flowering plants can be analyzed, as well as 23 ancestral reconstructed genomes. Ancestral gene order provides a long-term chronological view of gene order evolution, greatly facilitating comparative genomics and evolutionary studies. Four main interfaces ('views') are available where: (i) PhyloView combines phylogenetic trees with comparisons of genomic loci across any number of genomes; (ii) AlignView projects loci of interest against all other genomes to visualize its topological conservation; (iii) MatrixView compares two genomes in a classical dotplot representation; and (iv) Karyoview visualizes chromosome karyotypes 'painted' with colours of another genome of interest. All four views are interconnected and benefit from many customizable features. © The Author 2014. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.

  18. PLAZA 3.0: an access point for plant comparative genomics.

    PubMed

    Proost, Sebastian; Van Bel, Michiel; Vaneechoutte, Dries; Van de Peer, Yves; Inzé, Dirk; Mueller-Roeber, Bernd; Vandepoele, Klaas

    2015-01-01

    Comparative sequence analysis has significantly altered our view on the complexity of genome organization and gene functions in different kingdoms. PLAZA 3.0 is designed to make comparative genomics data for plants available through a user-friendly web interface. Structural and functional annotation, gene families, protein domains, phylogenetic trees and detailed information about genome organization can easily be queried and visualized. Compared with the first version released in 2009, which featured nine organisms, the number of integrated genomes is more than four times higher, and now covers 37 plant species. The new species provide a wider phylogenetic range as well as a more in-depth sampling of specific clades, and genomes of additional crop species are present. The functional annotation has been expanded and now comprises data from Gene Ontology, MapMan, UniProtKB/Swiss-Prot, PlnTFDB and PlantTFDB. Furthermore, we improved the algorithms to transfer functional annotation from well-characterized plant genomes to other species. The additional data and new features make PLAZA 3.0 (http://bioinformatics.psb.ugent.be/plaza/) a versatile and comprehensible resource for users wanting to explore genome information to study different aspects of plant biology, both in model and non-model organisms. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. A Well-Resolved Phylogeny of the Trees of Puerto Rico Based on DNA Barcode Sequence Data

    PubMed Central

    Muscarella, Robert; Uriarte, María; Erickson, David L.; Swenson, Nathan G.; Zimmerman, Jess K.; Kress, W. John

    2014-01-01

    Background: The use of phylogenetic information in community ecology and conservation has grown in recent years. Two key issues for community phylogenetics studies, however, are (i) low terminal phylogenetic resolution and (ii) arbitrarily defined species pools. Methodology/principal findings: We used three DNA barcodes (plastid DNA regions rbcL, matK, and trnH-psbA) to infer a phylogeny for 527 native and naturalized trees of Puerto Rico, representing the vast majority of the entire tree flora of the island (89%). We used a maximum likelihood (ML) approach with and without a constraint tree that enforced monophyly of recognized plant orders. Based on 50% consensus trees, the ML analyses improved phylogenetic resolution relative to a comparable phylogeny generated with Phylomatic (proportion of internal nodes resolved: constrained ML = 74%, unconstrained ML = 68%, Phylomatic = 52%). We quantified the phylogenetic composition of 15 protected forests in Puerto Rico using the constrained ML and Phylomatic phylogenies. We found some evidence that tree communities in areas of high water stress were relatively phylogenetically clustered. Reducing the scale at which the species pool was defined (from island to soil types) changed some of our results depending on which phylogeny (ML vs. Phylomatic) was used. Overall, the increased terminal resolution provided by the ML phylogeny revealed additional patterns that were not observed with a less-resolved phylogeny. Conclusions/significance: With the DNA barcode phylogeny presented here (based on an island-wide species pool), we show that a more fully resolved phylogeny increases power to detect nonrandom patterns of community composition in several Puerto Rican tree communities. Especially if combined with additional information on species functional traits and geographic distributions, this phylogeny will (i) facilitate stronger inferences about the role of historical processes in governing the assembly and composition of Puerto Rican forests, (ii) provide insight into Caribbean biogeography, and (iii) aid in incorporating evolutionary history into conservation planning. PMID:25386879

  20. The 'temporal effect' in hominids: Reinvestigating the nature of support for a chimp-human clade in bone morphology.

    PubMed

    Pearson, Alannah; Groves, Colin; Cardini, Andrea

    2015-11-01

    In 2004, an analysis by Lockwood and colleagues of hard-tissue morphology, using geometric morphometrics on the temporal bone, succeeded in recovering the correct phylogeny of living hominids without resorting to potentially problematic methods for transforming continuous shape variables into meristic characters. That work has increased hope that by using modern analytical methods and phylogenetically informative anatomical data we might one day be able to accurately infer the relationships of hominins, including the closest extinct relatives of modern humans. In the present study, using 3D virtually generated models of the hominid temporal bone and a larger suite of geometric morphometric and comparative techniques, we have re-examined the evidence for a Pan-Homo clade. Despite differences in samples and in the type of raw data, the effect of measurement error (especially landmark digitization by a different operator), and the broader perspective brought in by our diverse set of approaches, our reanalysis largely supports Lockwood and colleagues' original results. However, by focusing not only on shape (the main focus of the original 2004 analysis) but also on size and 'size-corrected' (non-allometric) shape, we demonstrate that the strong phylogenetic signal in the temporal bone is largely related to similarities in size. Thus, with this study, we are not suggesting the use of a single 'character', such as size, for phylogenetic inference, but we do challenge the common view that shape, with its highly complex and multivariate nature, is necessarily more phylogenetically informative than size, and the view that size and size-related shape variation (i.e., allometry) confound phylogenetic inference based on morphology. This perspective may in fact be less generalizable than often believed. Thus, while we confirm the original findings by Lockwood et al., we provide a deep reinterpretation of their nature and potential implications for hominid phylogenetics and we show how crucial it is not to overlook size in geometric morphometric analyses. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. A well-resolved phylogeny of the trees of Puerto Rico based on DNA barcode sequence data.

    PubMed

    Muscarella, Robert; Uriarte, María; Erickson, David L; Swenson, Nathan G; Zimmerman, Jess K; Kress, W John

    2014-01-01

    The use of phylogenetic information in community ecology and conservation has grown in recent years. Two key issues for community phylogenetics studies, however, are (i) low terminal phylogenetic resolution and (ii) arbitrarily defined species pools. We used three DNA barcodes (plastid DNA regions rbcL, matK, and trnH-psbA) to infer a phylogeny for 527 native and naturalized trees of Puerto Rico, representing the vast majority of the entire tree flora of the island (89%). We used a maximum likelihood (ML) approach with and without a constraint tree that enforced monophyly of recognized plant orders. Based on 50% consensus trees, the ML analyses improved phylogenetic resolution relative to a comparable phylogeny generated with Phylomatic (proportion of internal nodes resolved: constrained ML = 74%, unconstrained ML = 68%, Phylomatic = 52%). We quantified the phylogenetic composition of 15 protected forests in Puerto Rico using the constrained ML and Phylomatic phylogenies. We found some evidence that tree communities in areas of high water stress were relatively phylogenetically clustered. Reducing the scale at which the species pool was defined (from island to soil types) changed some of our results depending on which phylogeny (ML vs. Phylomatic) was used. Overall, the increased terminal resolution provided by the ML phylogeny revealed additional patterns that were not observed with a less-resolved phylogeny. With the DNA barcode phylogeny presented here (based on an island-wide species pool), we show that a more fully resolved phylogeny increases power to detect nonrandom patterns of community composition in several Puerto Rican tree communities. Especially if combined with additional information on species functional traits and geographic distributions, this phylogeny will (i) facilitate stronger inferences about the role of historical processes in governing the assembly and composition of Puerto Rican forests, (ii) provide insight into Caribbean biogeography, and (iii) aid in incorporating evolutionary history into conservation planning.

  2. Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution

    PubMed Central

    Kendall, Michelle; Colijn, Caroline

    2016-01-01

    Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Key words: phylogenetics, evolution, tree metrics, genetics, sequencing. PMID:27343287
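
    The abstract above does not spell out the metric, so the following Python sketch shows only one simple way a metric-based tree comparison can work, and it should not be read as the authors' exact definition. Each rooted tree is summarized by a vector holding, for every pair of tips, the number of edges between the root and the pair's most recent common ancestor (MRCA), and two trees on the same tips are compared by Euclidean distance between their vectors. Branch lengths are ignored, and the nested-tuple tree encoding and function names are hypothetical.

```python
from itertools import combinations
from math import sqrt


def tip_names(tree):
    """Tips of a rooted tree written as nested tuples of tip-label strings."""
    if isinstance(tree, str):
        return [tree]
    return [t for child in tree for t in tip_names(child)]


def mrca_depths(tree, depth=0, out=None):
    """Map each unordered tip pair to the number of edges from the root to its MRCA."""
    if out is None:
        out = {}
    if isinstance(tree, str):
        return out
    child_tips = [tip_names(c) for c in tree]
    # Tips that sit in different children of this node have this node as MRCA.
    for left, right in combinations(child_tips, 2):
        for a in left:
            for b in right:
                out[frozenset((a, b))] = depth
    for child in tree:
        mrca_depths(child, depth + 1, out)
    return out


def topological_distance(t1, t2):
    """Euclidean distance between the MRCA-depth vectors of two trees."""
    d1, d2 = mrca_depths(t1), mrca_depths(t2)
    if d1.keys() != d2.keys():
        raise ValueError("trees must share the same tip labels")
    return sqrt(sum((d1[k] - d2[k]) ** 2 for k in d1))


# Two hypothetical four-tip trees that pair the tips differently.
T1 = (("A", "B"), ("C", "D"))
T2 = (("A", "C"), ("B", "D"))
dist = topological_distance(T1, T2)
```

    With a pairwise distance of this kind in hand, a collection of trees (for example, one per gene) can be embedded and clustered to look for distinct groups of topologies.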

  3. Evaluation of sequence alignments and oligonucleotide probes with respect to three-dimensional structure of ribosomal RNA using ARB software package

    PubMed Central

    Kumar, Yadhu; Westram, Ralf; Kipfer, Peter; Meier, Harald; Ludwig, Wolfgang

    2006-01-01

    Background: The availability of high-resolution RNA crystal structures for the 30S and 50S ribosomal subunits and the subsequent validation of comparative secondary structure models have prompted biologists to use the three-dimensional structure of ribosomal RNA (rRNA) for evaluating sequence alignments of rRNA genes. Furthermore, the secondary and tertiary structural features of rRNA are highly useful and successfully employed in designing rRNA-targeted oligonucleotide probes intended for in situ hybridization experiments. RNA3D, a program to combine sequence alignment information with the three-dimensional structure of rRNA, was developed. Integration into the ARB software package, which is used extensively by the scientific community for phylogenetic analysis and molecular probe design, has substantially extended the functionality of the ARB software suite with a 3D environment. Results: The three-dimensional structure of rRNA is visualized in an OpenGL 3D environment with the ability to change the display and overlay information onto the molecule dynamically. Phylogenetic information derived from the multiple sequence alignments can be overlaid onto the molecule structure in real time. Superimposition of both statistical and non-statistical sequence-associated information onto the rRNA 3D structure can be done using a customizable color scheme, which is also applied to a textual sequence alignment for reference. Oligonucleotide probes designed by ARB probe design tools can be mapped onto the 3D structure along with the probe accessibility models for evaluation with respect to secondary and tertiary structural conformations of rRNA. Conclusion: Visualization of the three-dimensional structure of rRNA in an intuitive display provides biologists with greater possibilities to carry out structure-based phylogenetic analysis. Coupled with secondary structure models of rRNA, the RNA3D program aids in validating the sequence alignments of rRNA genes and evaluating probe target sites. Superimposition of the information derived from the multiple sequence alignment onto the molecule dynamically allows researchers to observe any sequence-inherited characteristics (phylogenetic information) in a real-time environment. The extended ARB software package is made freely available for the scientific community via . PMID:16672074

  4. Increased phylogenetic resolution within the ecologically important Rhizopogon subgenus Amylopogon using 10 anonymous nuclear loci.

    PubMed

    Dowie, Nicholas J; Grubisha, Lisa C; Burton, Brent A; Klooster, Matthew R; Miller, Steven L

    2017-01-01

    Rhizopogon species are ecologically significant ectomycorrhizal fungi in conifer ecosystems. The importance of this system merits the development and utilization of a more robust set of molecular markers specifically designed to evaluate their evolutionary ecology. Anonymous nuclear loci (ANL) were developed for R. subgenus Amylopogon. Members of this subgenus occur throughout the United States and are exclusive fungal symbionts associated with Pterospora andromedea, a threatened mycoheterotrophic plant endemic to disjunct eastern and western regions of North America. Candidate ANL were developed from 454 shotgun pyrosequencing and assessed for positive amplification across targeted species, sequencing success, and recovery of phylogenetically informative sites. Ten ANL were successfully developed and were subsequently used to sequence representative taxa, herbaria holotype and paratype specimens in R. subgenus Amylopogon. Phylogenetic reconstructions were performed on individual and concatenated data sets by Bayesian inference and maximum likelihood methods. Phylogenetic analyses of these 10 ANL were compared with a phylogeny traditionally constructed using the universal fungal barcode nuc rDNA ITS1-5.8S-ITS2 region (ITS). The resulting ANL phylogeny was consistent with most of the species designations delineated by ITS. However, the ANL phylogeny provided much greater phylogenetic resolution, yielding new evidence for cryptic species within previously defined species of R. subgenus Amylopogon. Additionally, the rooted ANL phylogeny provided an alternate topology to the ITS phylogeny, which inferred a novel set of evolutionary relationships not identified in prior phylogenetic studies.

  5. Inferring phylogenetic trees from the knowledge of rare evolutionary events.

    PubMed

    Hellmuth, Marc; Hernandez-Rosales, Maribel; Long, Yangjing; Stadler, Peter F

    2018-06-01

    Rare events have played an increasing role in molecular phylogenetics as potentially homoplasy-poor characters. In this contribution we analyze the phylogenetic information content from a combinatorial point of view by considering the binary relation on the set of taxa defined by the existence of a single event separating two taxa. We show that the graph-representation of this relation must be a tree. Moreover, we characterize completely the relationship between the tree of such relations and the underlying phylogenetic tree. With directed operations such as tandem-duplication-random-loss events in mind we demonstrate how non-symmetric information constrains the position of the root in the partially reconstructed phylogeny.

  6. Enhanced use of phylogenetic data to inform public health approaches to HIV among men who have sex with men.

    PubMed

    German, Danielle; Grabowski, Mary Kate; Beyrer, Chris

    2017-02-01

    The multidimensional nature and continued evolution of HIV epidemics among men who have sex with men (MSM) require innovative intervention approaches. Strategies are needed that recognise the individual, social and structural factors driving HIV transmission; that can pinpoint networks with heightened transmission risk; and that can help target intervention in real time. HIV phylogenetics is a rapidly evolving field with strong promise for informing innovative responses to the HIV epidemic among MSM. Currently, HIV phylogenetic insights are providing new understandings of characteristics of HIV epidemics involving MSM, social networks influencing transmission, characteristics of HIV transmission clusters involving MSM, targets for antiretroviral and other prevention strategies and dynamics of emergent epidemics. Maximising the potential of HIV phylogenetics for HIV responses among MSM will require attention to key methodological challenges and ethical considerations, as well as resolving key implementation and scientific questions. Enhanced and integrated use of HIV surveillance, sociobehavioural and phylogenetic data resources is becoming increasingly critical for informing public health approaches to HIV among MSM.

  7. Selecting Species Traits for Biomonitoring Applications in light of Phylogenetic Relationships among Lotic Insects

    NASA Astrophysics Data System (ADS)

    Poff, N.; Vieira, N. K.; Simmons, M. P.; Olden, J. D.; Kondratieff, B. C.; Finn, D. S.

    2005-05-01

    The use of species traits as indicators of environmental disturbance is being considered for biomonitoring programs globally. As such, methods to select relevant and informative traits for inclusion in biometrics need to be developed. In this research, we identified 20 traits of aquatic insects within six trait groups: morphology, mobility, life-history strategy, thermal tolerance, feeding guild and ecology (e.g., habitat preference). We constructed phylogenetic trees for 1) all lotic insect species of North America and 2) all Ephemeroptera, Plecoptera and Trichoptera species based on morphology- and molecular-based analyses and classifications. We then measured variability (i.e., plasticity) of the 20 traits and six trait groups across the two phylogenetic trees. Traits with higher degrees of plasticity indicated traits that were less phylogenetically constrained, and were considered informative for biomonitoring purposes. Thermal tolerance, rheophily, body size at maturity and feeding guild showed the highest plasticity across both phylogenetic trees. Two mobility traits, occurrence in drift and adult dispersal distance, showed moderate plasticity. By contrast, adult exiting ability, degree of attachment, adult lifespan and body shape showed low variability and were thus less informative. Plastic species traits that are less phylogenetically constrained may be most useful in detecting community change along environmental gradients.

  8. Comparative analyses of plastid genomes from fourteen Cornales species: inferences for phylogenetic relationships and genome evolution.

    PubMed

    Fu, Chao-Nan; Li, Hong-Tao; Milne, Richard; Zhang, Ting; Ma, Peng-Fei; Yang, Jing; Li, De-Zhu; Gao, Lian-Ming

    2017-12-08

    The Cornales is the basal lineage of the asterids, the largest angiosperm clade. Phylogenetic relationships within the order were previously not fully resolved. Fifteen plastid genomes representing 14 species, ten genera and seven families of Cornales were newly sequenced for comparative analyses of genome features, evolution, and phylogenomics based on different partitioning schemes and filtering strategies. All plastomes of the 14 Cornales species had the typical quadripartite structure with a genome size ranging from 156,567 bp to 158,715 bp, which included two inverted repeats (25,859-26,451 bp) separated by a large single-copy region (86,089-87,835 bp) and a small single-copy region (18,250-18,856 bp). These plastomes encoded the same set of 114 unique genes including 31 transfer RNA, 4 ribosomal RNA and 79 coding genes, with an identical gene order across all examined Cornales species. Two genes (rpl22 and ycf15) contained premature stop codons in seven and five species respectively. The phylogenetic relationships among all sampled species were fully resolved with maximum support. Different filtering strategies (none, light and strict) of sequence alignment did not have an effect on these relationships. The topology recovered from coding and noncoding data sets was the same as for the whole plastome, regardless of filtering strategy. Moreover, mutational hotspots and highly informative regions were identified. Phylogenetic relationships among families and intergeneric relationships within families of Cornales were well resolved. Different filtering strategies and partitioning schemes do not influence the relationships. Plastid genomes have great potential to resolve deep phylogenetic relationships of plants.

  9. A congruent phylogenomic signal places eukaryotes within the Archaea.

    PubMed

    Williams, Tom A; Foster, Peter G; Nye, Tom M W; Cox, Cymon J; Embley, T Martin

    2012-12-22

    Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook 'three domains' tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation.

  10. Not a simple case - A first comprehensive phylogenetic hypothesis for the Midas cichlid complex in Nicaragua (Teleostei: Cichlidae: Amphilophus).

    PubMed

    Geiger, Matthias F; McCrary, Jeffrey K; Schliewen, Ulrich K

    2010-09-01

    Nicaraguan Midas cichlids from crater lakes have recently attracted attention as potential model systems for speciation research, but no attempt has been made to comprehensively reconstruct phylogenetic relationships of this highly diverse and recently evolved species complex. We present a first AFLP (2793 loci) and mtDNA based phylogenetic hypothesis including all described and several undescribed species from six crater lakes (Apoyeque, Apoyo, Asososca Leon, Masaya, Tiscapa and Xiloá), the two great Lakes Managua and Nicaragua and the San Juan River. Our analyses demonstrate that the relationships between the Midas cichlid members are complex, and that phylogenetic information from different markers and methods does not always yield congruent results. Nevertheless, monophyly support for crater lake assemblages from Lakes Apoyeque, Apoyo and A. Leon is high compared to that for L. Xiloá, indicating the occurrence of sympatric speciation. Further, we demonstrate that a 'three species' concept for the Midas cichlid complex is inapplicable and consequently that an individualized and voucher based approach in speciation research of the Midas cichlid complex is necessary, at least as long as there is no comprehensive revision of the species complex available.

  11. A Guide to the PLAZA 3.0 Plant Comparative Genomic Database.

    PubMed

    Vandepoele, Klaas

    2017-01-01

    PLAZA 3.0 is an online resource for comparative genomics and offers a versatile platform to study gene functions and gene families or to analyze genome organization and evolution in the green plant lineage. Starting from genome sequence information for over 35 plant species, precomputed comparative genomic data sets cover homologous gene families, multiple sequence alignments, phylogenetic trees, and genomic colinearity information within and between species. Complementary functional data sets, a Workbench, and interactive visualization tools are available through a user-friendly web interface, making PLAZA an excellent starting point to translate sequence or omics data sets into biological knowledge. PLAZA is available at http://bioinformatics.psb.ugent.be/plaza/ .

  12. Complete mitogenome of Asiatic lion resolves phylogenetic status within Panthera.

    PubMed

    Bagatharia, Snehal B; Joshi, Madhvi N; Pandya, Rohan V; Pandit, Aanal S; Patel, Riddhi P; Desai, Shivangi M; Sharma, Anu; Panchal, Omkar; Jasmani, Falguni P; Saxena, Akshay K

    2013-08-23

    The origin, evolution and speciation of the lion have been a subject of interest, debate and study. The surviving lions of the genus Panthera comprise eight subspecies, including the Asiatic lion Panthera leo persica of India's Gir forest. Except for the Asiatic lion, the other seven subspecies are found in different parts of Africa. There have been different opinions regarding the phylogenetic status of Panthera leo, as well as on classifying lions of different geographic regions into subspecies and races. In the present study, the mitogenome sequence of P. leo persica was deduced using Ion Torrent PGM to assess phylogeny and evolution, which may play an increasingly important role in conservation biology. The mtDNA sequence of P. leo persica is 17,057 bp in length with 40.8% GC content. Annotation of the mitogenome revealed a total of 37 genes, including 13 protein coding, 2 rRNA and 22 tRNA. Phylogenetic analysis based on the whole mitogenome suggests Panthera pardus as a neighbouring species to P. leo, with species divergence at ~2.96 mya. This work presents the first report on the complete mitogenome of Panthera leo persica. It sheds light on the phylogenetic and evolutionary status within and across Felidae members. The results, compared and evaluated against earlier reports on Felidae, show an alteration of phylogenetic status and species evolution. This study may provide information on genetic diversity and population stability.

  13. Global DNA cytosine methylation as an evolving trait: phylogenetic signal and correlated evolution with genome size in angiosperms

    PubMed Central

    Alonso, Conchita; Pérez, Ricardo; Bazaga, Pilar; Herrera, Carlos M.

    2015-01-01

    DNA cytosine methylation is a widespread epigenetic mechanism in eukaryotes, and plant genomes commonly are densely methylated. Genomic methylation can be associated with functional consequences such as mutational events, genomic instability or altered gene expression, but little is known on interspecific variation in global cytosine methylation in plants. In this paper, we compare global cytosine methylation estimates obtained by HPLC and use a phylogenetically-informed analytical approach to test for significance of evolutionary signatures of this trait across 54 angiosperm species in 25 families. We evaluate whether interspecific variation in global cytosine methylation is statistically related to phylogenetic distance and also whether it is evolutionarily correlated with genome size (C-value). Global cytosine methylation varied widely between species, ranging between 5.3% (Arabidopsis) and 39.2% (Narcissus). Differences between species were related to their evolutionary trajectories, as denoted by the strong phylogenetic signal underlying interspecific variation. Global cytosine methylation and genome size were evolutionarily correlated, as revealed by the significant relationship between the corresponding phylogenetically independent contrasts. On average, a ten-fold increase in genome size entailed an increase of about 10% in global cytosine methylation. Results show that global cytosine methylation is an evolving trait in angiosperms whose evolutionary trajectory is significantly linked to changes in genome size, and suggest that the evolutionary implications of epigenetic mechanisms are likely to vary between plant lineages. PMID:25688257

  14. Complete mitogenome of asiatic lion resolves phylogenetic status within Panthera

    PubMed Central

    2013-01-01

    Background The origin, evolution and speciation of the lion have been a subject of interest, debate and study. The surviving lions of the genus Panthera comprise eight subspecies, including the Asiatic lion Panthera leo persica of India's Gir forest. Except for the Asiatic lion, the other seven subspecies are found in different parts of Africa. There have been different opinions regarding the phylogenetic status of Panthera leo, as well as on classifying lions of different geographic regions into subspecies and races. In the present study, the mitogenome sequence of P. leo persica was deduced using Ion Torrent PGM to assess phylogeny and evolution, which may play an increasingly important role in conservation biology. Results The mtDNA sequence of P. leo persica is 17,057 bp in length with 40.8% GC content. Annotation of the mitogenome revealed a total of 37 genes, including 13 protein coding, 2 rRNA and 22 tRNA. Phylogenetic analysis based on the whole mitogenome suggests Panthera pardus as a neighbouring species to P. leo, with species divergence at ~2.96 mya. Conclusion This work presents the first report on the complete mitogenome of Panthera leo persica. It sheds light on the phylogenetic and evolutionary status within and across Felidae members. The results, compared and evaluated against earlier reports on Felidae, show an alteration of phylogenetic status and species evolution. This study may provide information on genetic diversity and population stability. PMID:23968279

  15. Bayesian models for comparative analysis integrating phylogenetic uncertainty.

    PubMed

    de Villemereuil, Pierre; Wells, Jessie A; Edwards, Robert D; Blomberg, Simon P

    2012-06-28

    Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. 
    We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language.

  16. Bayesian models for comparative analysis integrating phylogenetic uncertainty

    PubMed Central

    2012-01-01

    Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. 
    We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language. PMID:22741602

  17. GeoPhyloBuilder 1.0: an ArcGIS extension for creating 'geophylogenies'.

    PubMed

    Kidd, David M; Liu, Xianhua

    2008-01-01

    Evolution is inherently a spatiotemporal process; despite this, phylogenetic and geographical data and models remain largely isolated from one another. Geographical information systems provide a ready-made spatial modelling, analysis and dissemination environment within which phylogenetic models can be explicitly linked with their associated spatial data and subsequently integrated with other georeferenced data sets describing the biotic and abiotic environment. GeoPhyloBuilder 1.0 is an extension for the ArcGIS geographical information system that builds a 'geophylogenetic' data model from a phylogenetic tree and associated geographical data. Geophylogenetic database objects can subsequently be queried, spatially analysed and visualized in both 2D and 3D within a geographical information system.

  18. Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.

    PubMed

    Kendall, Michelle; Colijn, Caroline

    2016-10-01

    Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Keywords: phylogenetics, evolution, tree metrics, genetics, sequencing.

  19. How does cognition evolve? Phylogenetic comparative psychology

    PubMed Central

    Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria

    2014-01-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850

  20. How does cognition evolve? Phylogenetic comparative psychology.

    PubMed

    MacLean, Evan L; Matthews, Luke J; Hare, Brian A; Nunn, Charles L; Anderson, Rindy C; Aureli, Filippo; Brannon, Elizabeth M; Call, Josep; Drea, Christine M; Emery, Nathan J; Haun, Daniel B M; Herrmann, Esther; Jacobs, Lucia F; Platt, Michael L; Rosati, Alexandra G; Sandel, Aaron A; Schroepfer, Kara K; Seed, Amanda M; Tan, Jingzhi; van Schaik, Carel P; Wobber, Victoria

    2012-03-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.

  1. Markov model plus k-word distributions: a synergy that produces novel statistical measures for sequence comparison.

    PubMed

    Dai, Qi; Yang, Yanchun; Wang, Tianming

    2008-10-15

    Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use the information on the k-word distributions, the Markov model or both. Motivated by adding k-word distributions to the Markov model directly, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r. The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences and phylogenetic analysis. This offers a systematic and quantitative experimental assessment of our measures. Moreover, we compared our results with those of alignment-based and alignment-free methods. We grouped our experiments into two sets. The first one, performed via ROC (receiver operating characteristic) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences from a database and discriminate functionally related regulatory sequences from unrelated sequences. The second one aims at assessing how well our statistical measures perform in phylogenetic analysis. The experimental assessment demonstrates that our similarity measures, which incorporate k-word distributions into the Markov model, are more efficient.
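
    To make the idea of comparing sequences through k-word distributions concrete, the following is a minimal sketch of a generic alignment-free distance. It is not the wre.k.r or S2.k.r measure from the abstract: it simply takes the Euclidean distance between k-word frequency vectors, and the function names `kword_freqs` and `kword_distance` are hypothetical.

```python
from collections import Counter
from math import sqrt


def kword_freqs(seq, k):
    """Relative frequencies of the overlapping k-words (k-mers) in seq."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def kword_distance(a, b, k=2):
    """Euclidean distance between the k-word frequency vectors of a and b.

    Illustrative baseline only; no Markov model is involved here.
    """
    fa, fb = kword_freqs(a, k), kword_freqs(b, k)
    words = set(fa) | set(fb)
    return sqrt(sum((fa.get(w, 0.0) - fb.get(w, 0.0)) ** 2 for w in words))
```

    Identical sequences have distance zero, and two sequences that share no k-words are maximally separated under this baseline. A measure of the kind described in the abstract would additionally weight the k-word counts by their expectation under a Markov model.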

  2. Phylogeny and evolutionary histories of Pyrus L. revealed by phylogenetic trees and networks based on data from multiple DNA sequences

    USDA-ARS's Scientific Manuscript database

    Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence d...

  3. Genotypic and Phylogenetic Insights on Prevention of the Spread of HIV-1 and Drug Resistance in “Real-World” Settings

    PubMed Central

    Brenner, Bluma G.; Ibanescu, Ruxandra-Ilinca; Hardy, Isabelle; Roger, Michel

    2017-01-01

    HIV continues to spread among vulnerable heterosexual (HET), men who have sex with men (MSM) and intravenous drug user (IDU) populations, influenced by a complex array of biological, behavioral and societal factors. Phylogenetic analyses of large sequence datasets from national drug resistance testing programs reveal the evolutionary interrelationships of viral strains implicated in the dynamic spread of HIV in different regional settings. Viral phylogenetics can be combined with demographic and behavioral information to gain insights on epidemiological processes shaping transmission networks at the population level. Drug resistance testing programs also reveal emergent mutational pathways leading to resistance to the 23 antiretroviral drugs used in HIV-1 management in low-, middle- and high-income settings. This article describes how genotypic and phylogenetic information from Quebec and elsewhere provides critical information on HIV transmission and resistance. Cumulative findings can be used to optimize public health strategies to tackle the challenges of HIV in “real-world” settings. PMID:29283390

  4. A comparative test of phylogenetic diversity indices.

    PubMed

    Schweiger, Oliver; Klotz, Stefan; Durka, Walter; Kühn, Ingolf

    2008-09-01

    Traditional measures of biodiversity, such as species richness, usually treat species as being equal. As this is obviously not the case, measuring diversity in terms of features accumulated over evolutionary history provides additional value to theoretical and applied ecology. Several phylogenetic diversity indices exist, but their behaviour has not yet been tested in a comparative framework. We provide a test of ten commonly used phylogenetic diversity indices based on 40 simulated phylogenies of varying topology. We restrict our analysis to a topological fully resolved tree without information on branch lengths and species lists with presence-absence data. A total of 38,000 artificial communities varying in species richness covering 5-95% of the phylogenies were created by random resampling. The indices were evaluated based on their ability to meet a priori defined requirements. No index meets all requirements, but three indices turned out to be more suitable than others under particular conditions. Average taxonomic distinctness (AvTD) and intensive quadratic entropy (J) are calculated by averaging and are, therefore, unbiased by species richness while reflecting phylogeny per se well. However, averaging leads to the violation of set monotonicity, which requires that species extinction cannot increase the index. Total taxonomic distinctness (TTD) sums up distinctiveness values for particular species across the community. It is therefore strongly linked to species richness and reflects phylogeny per se weakly but satisfies set monotonicity. We suggest that AvTD and J are best applied to studies that compare spatially or temporally rather independent communities that potentially vary strongly in their phylogenetic composition-i.e. where set monotonicity is a more negligible issue, but independence of species richness is desired. 
    In contrast, we suggest that TTD be used in studies that compare rather interdependent communities where changes occur more gradually by species extinction or introduction. Calculating AvTD or TTD, depending on the research question, in addition to species richness is strongly recommended.
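
    The contrast between an averaged index and a summed one can be illustrated with a small sketch. It assumes the usual definitions, which the abstract does not spell out: AvTD is the mean pairwise distance between the species of a community, and TTD is the sum over species of each one's mean distance to the others, which equals the species count times AvTD. The three-species distance table is hypothetical toy data.

```python
from itertools import combinations


def avtd(community, dist):
    """Average taxonomic distinctness: mean distance over all species pairs."""
    pairs = list(combinations(sorted(community), 2))
    if not pairs:
        raise ValueError("need at least two species")
    return sum(dist[frozenset(p)] for p in pairs) / len(pairs)


def ttd(community, dist):
    """Total taxonomic distinctness: species count times AvTD."""
    return len(community) * avtd(community, dist)


# Hypothetical pairwise distances: A and B are close, C is distant from both.
dist = {
    frozenset("AB"): 1.0,
    frozenset("AC"): 3.0,
    frozenset("BC"): 3.0,
}

full = {"A", "B", "C"}
after_extinction = {"B", "C"}  # species A is lost
```

    With these toy distances, losing species A raises AvTD from about 2.33 to 3.0, which is the violation of set monotonicity described in the abstract, while TTD falls from about 7 to 6 and therefore satisfies it.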

  5. The Forest behind the Tree: Phylogenetic Exploration of a Dominant Mycobacterium tuberculosis Strain Lineage from a High Tuberculosis Burden Country

    PubMed Central

    Cardoso Oelemann, Maranibia; Gomes, Harrison M.; Willery, Eve; Possuelo, Lia; Batista Lima, Karla Valéria; Allix-Béguec, Caroline; Locht, Camille; Goguet de la Salmonière, Yves-Olivier L.; Gutierrez, Maria Cristina; Suffys, Philip; Supply, Philip

    2011-01-01

    Background Genotyping of Mycobacterium tuberculosis isolates is a powerful tool for epidemiological control of tuberculosis (TB) and phylogenetic exploration of the pathogen. Standardized PCR-based typing, based on 15 to 24 mycobacterial interspersed repetitive unit-variable number of tandem repeat (MIRU-VNTR) loci combined with spoligotyping, has been shown to have adequate resolution power for tracing TB transmission and to be useful for predicting diverse strain lineages in European settings. Its informative value needs to be tested in high TB-burden countries, where the use of genotyping is often complicated by dominance of geographically specific, genetically homogeneous strain lineages. Methodology/Principal Findings We tested this genotyping system for molecular epidemiological analysis of 369 M. tuberculosis isolates from 3 regions of Brazil, a high TB-burden country. Deligotyping, targeting 43 large sequence polymorphisms (LSPs), and the MIRU-VNTRplus identification database were used to assess phylogenetic predictions. High congruence between the different typing results consistently revealed the countrywide supremacy of the Latin-American-Mediterranean (LAM) lineage, comprised of three main branches. In addition to an already known RDRio branch, at least one other branch characterized by a phylogenetically informative LAM3 spoligo-signature seems to be globally distributed beyond Brazil. Nevertheless, by distinguishing 321 genotypes in this strain population, combined MIRU-VNTR typing and spoligotyping demonstrated the presence of multiple distinct clones. The use of 15 to 24 loci discriminated 21 to 25% more strains within the LAM lineage, compared to a restricted lineage-specific locus set suggested to be used after SNP analysis. Noteworthy, 23 of the 28 molecular clusters identified were exclusively composed of patient isolates from a same region, consistent with expected patterns of mostly local TB transmission. 
    Conclusions/Significance Standard MIRU-VNTR typing combined with spoligotyping can reveal epidemiologically meaningful clonal diversity behind a dominant M. tuberculosis strain lineage in a high TB-burden country and is useful to explore international phylogenetical ramifications. PMID:21464915

  6. Host range and community structure of avian nest parasites in the genus Philornis (Diptera: Muscidae) on the island of Trinidad.

    PubMed

    Bulgarella, Mariana; Heimpel, George E

    2015-09-01

    Parasite host range can be influenced by physiological, behavioral, and ecological factors. Combining data sets on host-parasite associations with phylogenetic information of the hosts and the parasites involved can generate evolutionary hypotheses about the selective forces shaping host range. Here, we analyzed associations between the nest-parasitic flies in the genus Philornis and their host birds on Trinidad. Four of ten Philornis species were only reared from one species of bird. Of the parasite species with more than one host bird species, P. falsificus was the least specific and P. deceptivus the most specific, attacking only Passeriformes. Philornis flies in Trinidad thus include both specialists and generalists, with varying degrees of specificity within the generalists. We used three quantities to more formally compare the host range of Philornis flies: the number of bird species attacked by each species of Philornis, a phylogenetically informed host specificity index (Poulin and Mouillot's S_TD), and a branch length-based S_TD. We then assessed the phylogenetic signal of these measures of host range for 29 bird species. None of these measures showed significant phylogenetic signal, suggesting that clades of Philornis did not differ significantly in their ability to exploit hosts. We also calculated two quantities of parasite species load for the birds - the parasite species richness, and a variant of the S_TD index based on nodes rather than on taxonomic levels - and assessed the signal of these measures on the bird phylogeny. We did not find significant phylogenetic signal for the parasite species load or the node-based S_TD index. Finally, we calculated the parasite associations for all bird pairs using the Jaccard index and regressed these similarity values against the number of nodes in the phylogeny separating bird pairs. This analysis showed that Philornis on Trinidad tend to feed on closely related bird species more often than expected by chance.
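
    The Jaccard step in the final analysis can be sketched as follows. The index itself is standard (size of the intersection divided by size of the union); the host and parasite labels are hypothetical placeholders, not associations reported in the study, and the helper name `pairwise_jaccard` is an assumption for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b| (0.0 if both empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pairwise_jaccard(parasites_by_host):
    """Jaccard similarity of parasite sets for every pair of hosts."""
    return {
        (h1, h2): jaccard(parasites_by_host[h1], parasites_by_host[h2])
        for h1, h2 in combinations(sorted(parasites_by_host), 2)
    }


# Hypothetical toy data: which parasite species were reared from which bird.
parasites_by_host = {
    "bird_1": {"p1", "p2"},
    "bird_2": {"p2", "p3"},
    "bird_3": {"p4"},
}

similarities = pairwise_jaccard(parasites_by_host)
```

    Each similarity value would then be paired with the number of nodes separating the two birds in the phylogeny before the regression described above.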

  7. Comparative Analysis of Four Buckwheat Species Based on Morphology and Complete Chloroplast Genome Sequences.

    PubMed

    Wang, Cheng-Long; Ding, Meng-Qi; Zou, Chen-Yan; Zhu, Xue-Mei; Tang, Yu; Zhou, Mei-Liang; Shao, Ji-Rong

    2017-07-26

    Buckwheat is a nutritionally and economically important crop belonging to the genus Fagopyrum (Polygonaceae). To better understand the mutation patterns and evolutionary trends in the chloroplast (cp) genome of buckwheat, and to find a sufficient number of variable regions for exploring the phylogenetic relationships of this genus, two complete cp genomes of buckwheat, Fagopyrum dibotrys (F. dibotrys) and Fagopyrum luojishanense (F. luojishanense), were sequenced, and two other Fagopyrum cp genomes were used for comparative analysis. In the morphological analysis, the main differences among these buckwheat species were height, leaf shape, seeds and flower type. F. luojishanense was easily distinguishable from the cultivated species. Although F. dibotrys and the two cultivated species have some similarities, they differ in habit and component contents. The cp genome of F. dibotrys was 159,320 bp, while that of F. luojishanense was 159,265 bp. 48 and 61 SSRs were found in F. dibotrys and F. luojishanense, respectively. Meanwhile, 10 highly variable regions among these buckwheat species were located precisely. The phylogenetic relationships among the four Fagopyrum species based on complete cp genomes were shown. The results suggested that F. dibotrys is more closely related to Fagopyrum tataricum. These data provide valuable genetic information for Fagopyrum species identification, taxonomy, phylogenetic study and molecular breeding.

  8. Computing prokaryotic gene ubiquity: rescuing the core from extinction.

    PubMed

    Charlebois, Robert L; Doolittle, W Ford

    2004-12-01

    The genomic core concept has found several uses in comparative and evolutionary genomics. Defined as the set of all genes common to (ubiquitous among) all genomes in a phylogenetically coherent group, core size decreases as the number and phylogenetic diversity of the relevant group increases. Here, we focus on methods for defining the size and composition of the core of all genes shared by sequenced genomes of prokaryotes (Bacteria and Archaea). There are few (almost certainly less than 50) genes shared by all of the 147 genomes compared, surely insufficient to conduct all essential functions. Sequencing and annotation errors are responsible for the apparent absence of some genes, while very limited but genuine disappearances (from just one or a few genomes) can account for several others. Core size will continue to decrease as more genome sequences appear, unless the requirement for ubiquity is relaxed. Such relaxation seems consistent with any reasonable biological purpose for seeking a core, but it renders the problem of definition more problematic. We propose an alternative approach (the phylogenetically balanced core), which preserves some of the biological utility of the core concept. Cores, however delimited, preferentially contain informational rather than operational genes; we present a new hypothesis for why this might be so.
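
    The effect of relaxing the ubiquity requirement can be sketched in a few lines. This is only an illustration of a strict core versus a threshold-relaxed core; it is not the phylogenetically balanced core the authors propose, and the function name, genome labels and gene labels are all hypothetical.

```python
from collections import Counter


def relaxed_core(genomes, min_fraction=1.0):
    """Genes present in at least min_fraction of the genomes.

    min_fraction=1.0 gives the strict core (genes shared by every genome).
    """
    counts = Counter(gene for genes in genomes.values() for gene in set(genes))
    needed = min_fraction * len(genomes)
    return {gene for gene, n in counts.items() if n >= needed}


# Hypothetical toy gene content for four genomes.
genomes = {
    "genome_1": {"geneA", "geneB", "geneC"},
    "genome_2": {"geneA", "geneB"},
    "genome_3": {"geneA", "geneB", "geneC"},
    "genome_4": {"geneA", "geneC"},
}

strict = relaxed_core(genomes)          # ubiquity required in all genomes
relaxed = relaxed_core(genomes, 0.75)   # allow absence from one genome in four
```

    In this toy set only geneA is in the strict core, while a 75% threshold also admits geneB and geneC, each missing from a single genome. This mirrors the abstract's point that a few genuine losses or annotation errors can shrink a strictly defined core.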

  9. TreSpEx—Detection of Misleading Signal in Phylogenetic Reconstructions Based on Tree Information

    PubMed Central

    Struck, Torsten H

    2014-01-01

    Phylogenies of species or genes are commonplace nowadays in many areas of comparative biological studies. However, for phylogenetic reconstructions one must refer to artificial signals such as paralogy, long-branch attraction, saturation, or conflict between different datasets. These signals might eventually mislead the reconstruction even in phylogenomic studies employing hundreds of genes. Unfortunately, there has been no program allowing the detection of such effects in combination with an implementation into automatic process pipelines. TreSpEx (Tree Space Explorer) now combines different approaches (including statistical tests), which utilize tree-based information like nodal support or patristic distances (PDs) to identify misleading signals. The program enables the parallel analysis of hundreds of trees and/or predefined gene partitions, and being command-line driven, it can be integrated into automatic process pipelines. TreSpEx is implemented in Perl and supported on Linux, Mac OS X, and MS Windows. Source code, binaries, and additional material are freely available at http://www.annelida.de/research/bioinformatics/software.html. PMID:24701118

  10. False discovery rate control incorporating phylogenetic tree increases detection power in microbiome-wide multiple testing.

    PubMed

    Xiao, Jian; Cao, Hongyuan; Chen, Jun

    2017-09-15

    Next generation sequencing technologies have enabled the study of the human microbiome through direct sequencing of microbial DNA, resulting in an enormous amount of microbiome sequencing data. One unique characteristic of microbiome data is the phylogenetic tree that relates all the bacterial species. Closely related bacterial species have a tendency to exhibit a similar relationship with the environment or disease. Thus, incorporating the phylogenetic tree information can potentially improve the detection power for microbiome-wide association studies, where hundreds or thousands of tests are conducted simultaneously to identify bacterial species associated with a phenotype of interest. Despite much progress in multiple testing procedures such as false discovery rate (FDR) control, methods that take into account the phylogenetic tree are largely limited. We propose a new FDR control procedure that incorporates the prior structure information and apply it to microbiome data. The proposed procedure is based on a hierarchical model, where a structure-based prior distribution is designed to utilize the phylogenetic tree. By borrowing information from neighboring bacterial species, we are able to improve the statistical power of detecting associated bacterial species while controlling the FDR at desired levels. When the phylogenetic tree is mis-specified or non-informative, our procedure achieves a similar power as traditional procedures that do not take into account the tree structure. We demonstrate the performance of our method through extensive simulations and real microbiome datasets. We identified far more alcohol-drinking associated bacterial species than traditional methods. The R package StructFDR is available from CRAN. Contact: chen.jun2@mayo.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved.
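
    For orientation, the kind of "traditional" FDR procedure that ignores the tree can be sketched as follows. This is a plain Benjamini-Hochberg step-up procedure in Python, chosen here as an assumed baseline; it is not the StructFDR method and does not use its API, and the p-values are made up for illustration.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Return a list of booleans: True where the hypothesis is rejected.

    Step-up rule: find the largest rank k with p_(k) <= (k / m) * alpha
    and reject the k hypotheses with the smallest p-values.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # indices by ascending p
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * alpha:
            k_max = rank
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected

# Hypothetical p-values, one per bacterial species.
pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
decisions = benjamini_hochberg(pvals, alpha=0.05)
print(decisions)
```

    Each test is treated as exchangeable here; the procedure described in the abstract differs precisely in that it borrows information between neighboring species on the phylogenetic tree.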

  11. Linear programming model to construct phylogenetic network for 16S rRNA sequences of photosynthetic organisms and influenza viruses.

    PubMed

    Mathur, Rinku; Adlakha, Neeru

    2014-06-01

    Phylogenetic trees give information about the vertical relationships of ancestors and descendants, whereas phylogenetic networks are used to visualize horizontal relationships among different organisms. To predict reticulate events, phylogenetic networks need to be constructed. Here, a Linear Programming (LP) model has been developed for the construction of phylogenetic networks. The model is validated using data sets of chloroplast 16S rRNA sequences of photosynthetic organisms and Influenza A/H5N1 viruses. The results obtained are in agreement with those obtained by earlier researchers.

  12. Dynamically heterogenous partitions and phylogenetic inference: an evaluation of analytical strategies with cytochrome b and ND6 gene sequences in cranes.

    PubMed

    Krajewski, C; Fain, M G; Buckley, L; King, D G

    1999-11-01

    Debates over whether molecular sequence data should be partitioned for phylogenetic analysis often confound two types of heterogeneity among partitions. We distinguish historical heterogeneity (i.e., different partitions have different evolutionary relationships) from dynamic heterogeneity (i.e., different partitions show different patterns of sequence evolution) and explore the impact of the latter on phylogenetic accuracy and precision with a two-gene, mitochondrial data set for cranes. The well-established phylogeny of cranes allows us to contrast tree-based estimates of relevant parameter values with estimates based on pairwise comparisons and to ascertain the effects of incorporating different amounts of process information into phylogenetic estimates. We show that codon positions in the cytochrome b and NADH dehydrogenase subunit 6 genes are dynamically heterogenous under both Poisson and invariable-sites + gamma-rates versions of the F84 model and that heterogeneity includes variation in base composition and transition bias as well as substitution rate. Estimates of transition-bias and relative-rate parameters from pairwise sequence comparisons were comparable to those obtained as tree-based maximum likelihood estimates. Neither rate-category nor mixed-model partitioning strategies resulted in a loss of phylogenetic precision relative to unpartitioned analyses. We suggest that weighted-average distances provide a computationally feasible alternative to direct maximum likelihood estimates of phylogeny for mixed-model analyses of large, dynamically heterogenous data sets. Copyright 1999 Academic Press.

  13. Genome wide in silico characterization of Dof gene families of pigeonpea (Cajanus cajan (L) Millsp.).

    PubMed

    Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D

    2015-02-01

    The DNA binding with One Finger (Dof) protein is a plant-specific transcription factor involved in the regulation of a wide range of processes. Analysis of the whole genome sequence of pigeonpea identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of the CcDof gene family, including gene structure, chromosome location, protein motifs, phylogeny, gene duplication and functional divergence, has been attempted. The phylogenetic analysis resulted in 3 major clusters, and closely related members in the phylogenetic tree showed a common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity, with a predominance of light-responsive and stress-responsive elements, indicating that these CcDof genes may be associated with photoperiodic control and with biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on the phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.

  14. Inferring epidemiological parameters from phylogenetic information for the HIV-1 epidemic among MSM

    NASA Astrophysics Data System (ADS)

    Quax, Rick; van de Vijver, David A. M. C.; Frentz, Dineke; Sloot, Peter M. A.

    2013-09-01

    The HIV-1 epidemic in Europe is primarily sustained by a dynamic topology of sexual interactions among MSM who have individual immune systems and behavior. This epidemiological process shapes the phylogeny of the virus population. Both fields of epidemic modeling and phylogenetics have a long history; however, it remains difficult to use phylogenetic data to infer epidemiological parameters such as the structure of the sexual network and the per-act infectiousness. This is because phylogenetic data is necessarily incomplete and ambiguous. Here we show that the cluster-size distribution indeed contains information about epidemiological parameters using detailed numerical experiments. We simulate the HIV epidemic among MSM many times using the Monte Carlo method with all parameter values and their ranges taken from literature. For each simulation and the corresponding set of parameter values we calculate the likelihood of reproducing an observed cluster-size distribution. The result is an estimated likelihood distribution of all parameters from the phylogenetic data, in particular the structure of the sexual network, the per-act infectiousness, and the risk behavior reduction upon diagnosis. These likelihood distributions encode the knowledge provided by the observed cluster-size distribution, which we quantify using information theory. Our work suggests that the growing body of genetic data of patients can be exploited to understand the underlying epidemiological process.

  15. Phylogenetic diversity and ecological pattern of ammonia-oxidizing archaea in the surface sediments of the western Pacific.

    PubMed

    Cao, Huiluo; Hong, Yiguo; Li, Meng; Gu, Ji-Dong

    2011-11-01

    The phylogenetic diversity of ammonia-oxidizing archaea (AOA) was surveyed in the surface sediments from the northern part of the South China Sea (SCS). The distribution pattern of AOA in the western Pacific was discussed by comparing the SCS with other areas in the western Pacific, including the Changjiang Estuary and the adjacent East China Sea, where high input of anthropogenic nitrogen was evident, the tropical West Pacific Continental Margins close to the Philippines, the deep-sea methane seep sediments in the Okhotsk Sea, the cold deep sea of the Northeastern Japan Sea, and the hydrothermal field in the Southern Okinawa Trough. These various environments provide a wide spectrum of physical and chemical conditions for a better understanding of the distribution pattern and diversities of AOA in the western Pacific. Under these different conditions, the distinct community composition between shallow and deep-sea sediments was clearly delineated based on the UniFrac PCoA and Jackknife Environmental Cluster analyses. Phylogenetic analyses showed that a few ammonia-oxidizing archaeal subclades in the marine water column/sediment clade and endemic lineages were indicative phylotypes for some environments. Higher phylogenetic diversity was observed in the Philippines, and lower diversity in the hydrothermal vent habitat. Water depth, possibly together with other environmental factors, could be the main driving force shaping the phylogenetic diversity of AOA observed, not only in the SCS but also in the whole western Pacific. The multivariate regression tree analysis also consistently supported this observation. Moreover, the functions of currents and other climate factors were also discussed in the comparison of phylogenetic diversity. The information collectively provides important insights into the ecophysiological requirements of uncultured ammonia-oxidizing archaeal lineages in the western Pacific Ocean.

  16. Specimen-level phylogenetics in paleontology using the Fossilized Birth-Death model with sampled ancestors.

    PubMed

    Cau, Andrea

    2017-01-01

    Bayesian phylogenetic methods integrating simultaneously morphological and stratigraphic information have been applied increasingly among paleontologists. Most of these studies have used Bayesian methods as an alternative to the widely-used parsimony analysis, to infer macroevolutionary patterns and relationships among species-level or higher taxa. Among recently introduced Bayesian methodologies, the Fossilized Birth-Death (FBD) model allows incorporation of hypotheses on ancestor-descendant relationships in phylogenetic analyses including fossil taxa. Here, the FBD model is used to infer the relationships among an ingroup formed exclusively by fossil individuals, i.e., dipnoan tooth plates from four localities in the Ain el Guettar Formation of Tunisia. Previous analyses of this sample compared the results of phylogenetic analysis using parsimony with stratigraphic methods, inferred a high diversity (five or more genera) in the Ain el Guettar Formation, and interpreted it as an artifact inflated by depositional factors. In the analysis performed here, the uncertainty on the chronostratigraphic relationships among the specimens was included among the prior settings. The results of the analysis confirm the referral of most of the specimens to the taxa Asiatoceratodus, Equinoxiodus, Lavocatodus and Neoceratodus, but reject those to Ceratodus and Ferganoceratodus. The resulting phylogeny constrained the evolution of the Tunisian sample exclusively in the Early Cretaceous, contrasting with the previous scenario inferred by the stratigraphically-calibrated topology resulting from parsimony analysis. The phylogenetic framework also suggests that (1) the sampled localities are laterally equivalent, but (2) three localities are restricted to the youngest part of the section; both results are in agreement with previous stratigraphic analyses of these localities. The FBD model of specimen-level units provides a novel tool for phylogenetic inference among fossils but also for independent tests of stratigraphic scenarios.

  17. Entire plastid phylogeny of the carrot genus (Daucus, Apiaceae): Concordance with nuclear data and mitochondrial and nuclear DNA insertions to the plastid.

    PubMed

    Spooner, David M; Ruess, Holly; Iorizzo, Massimo; Senalik, Douglas; Simon, Philipp

    2017-02-01

    We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results with prior phylogenetic results using plastid and nuclear DNA sequences. We used Illumina sequencing to obtain full plastid sequences of 37 accessions of 20 Daucus taxa and outgroups, analyzed the data with phylogenetic methods, and examined evidence for mitochondrial DNA transfer to the plastid (DcMP). Our phylogenetic trees of the entire data set were highly resolved, with 100% bootstrap support for most of the external and many of the internal clades, except for the clade of D. carota and its most closely related species D. syrticus. Subsets of the data, including regions traditionally used as phylogenetically informative regions, provide various degrees of soft congruence with the entire data set. There are areas of hard incongruence, however, with phylogenies using nuclear data. We extended knowledge of a mitochondrial to plastid DNA insertion sequence previously named DcMP and identified the first instance in flowering plants of a sequence of potential nuclear genome origin inserted into the plastid genome. There is a relationship of inverted repeat junction classes and repeat DNA to phylogeny, but no such relationship with nonsynonymous mutations. Our data have allowed us to (1) produce a well-resolved plastid phylogeny of Daucus, (2) evaluate subsets of the entire plastid data for phylogeny, (3) examine evidence for plastid and nuclear DNA phylogenetic incongruence, and (4) examine mitochondrial and nuclear DNA insertion into the plastid. © 2017 Spooner et al. Published by the Botanical Society of America. This work is licensed under a Creative Commons public domain license (CC0 1.0).

  18. Phylogenetic tree construction using trinucleotide usage profile (TUP).

    PubMed

    Chen, Si; Deng, Lih-Yuan; Bowman, Dale; Shiau, Jyh-Jen Horng; Wong, Tit-Yee; Madahian, Behrouz; Lu, Henry Horng-Shing

    2016-10-06

    It has been a challenging task to build a genome-wide phylogenetic tree for a large group of species containing a large number of genes with long nucleotide sequences. The most popular method, called feature frequency profile (FFP-k), finds the frequency distribution for all words of certain length k over the whole genome sequence using (overlapping) windows of the same length. For a satisfactory result, the recommended word length (k) ranges from 6 to 15 and it may not be a multiple of 3 (codon length). The total number of possible words needed for FFP-k can range from 4^6 = 4096 to 4^15. We propose a simple improvement over the popular FFP method using only a typical word length of 3. A new method, called Trinucleotide Usage Profile (TUP), is proposed based only on the (relative) frequency distribution using non-overlapping windows of length 3. The total number of possible words needed for TUP is 4^3 = 64, which is much less than the total count for the recommended optimal "resolution" for FFP. To build a phylogenetic tree, we propose first representing each of the species by a TUP vector and then using an appropriate distance measure between pairs of the TUP vectors for the tree construction. In particular, we propose summarizing a DNA sequence by a matrix of three rows corresponding to three reading frames, recording the frequency distribution of the non-overlapping words of length 3 in each of the reading frames. We also provide a numerical measure for comparing trees constructed with various methods. Compared to the FFP method, our empirical study showed that the proposed TUP method is more capable of building phylogenetic trees with a stronger biological support. We further provide some justifications on this from the information theory viewpoint. Unlike the FFP method, the TUP method takes advantage of the fact that the start of the first reading frame is (usually) known. Without this information, the FFP method could only rely on the frequency distribution of overlapping words, which is the average (or mixture) of the frequency distributions of three possible reading frames. Consequently, we show (from the entropy viewpoint) that the FFP procedure could dilute important gene information and therefore provides less accurate classification.
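
    The three-row frequency matrix described in this abstract can be sketched in a few lines of Python. This is an illustrative reading of the description, not the authors' implementation: the function names, the handling of ambiguous bases, the toy sequence and the use of Euclidean distance as the "appropriate distance measure" are all assumptions.

```python
from collections import Counter
from itertools import product
from math import sqrt

# All 4^3 = 64 possible trinucleotide words, in a fixed order.
CODONS = ["".join(p) for p in product("ACGT", repeat=3)]

def tup_matrix(seq):
    """Return a 3 x 64 matrix of relative trinucleotide frequencies.

    Row f holds the frequencies of non-overlapping words of length 3
    read in reading frame f (starting at offset 0, 1 or 2).
    """
    seq = seq.upper()
    rows = []
    for frame in range(3):
        words = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
        # Assumption: words containing ambiguous bases (e.g. N) are skipped.
        counts = Counter(w for w in words if set(w) <= set("ACGT"))
        total = sum(counts.values())
        rows.append([counts[c] / total if total else 0.0 for c in CODONS])
    return rows

def euclidean(a, b):
    """Euclidean distance between two TUP matrices (assumed distance measure)."""
    return sqrt(sum((x - y) ** 2
                    for row_a, row_b in zip(a, b)
                    for x, y in zip(row_a, row_b)))

# Hypothetical toy sequence; frame 0 reads ATG GCC ATG GCC.
m = tup_matrix("ATGGCCATGGCC")
print(m[0][CODONS.index("ATG")], m[1][CODONS.index("TGG")])
```

    A pairwise distance matrix built this way could then be passed to any distance-based tree-building method; which method and distance the authors used is not stated in the abstract.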

  19. Visualizing phylogenetic tree landscapes.

    PubMed

    Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A

    2017-02-02

    Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods used to display at once in 2 or 3 dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2 and 3 dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2 and 3 dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient descent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. We demonstrate for all three mtDNA alignments that 3D projections significantly increase the fit between the tree-to-tree distances and can facilitate the interpretation of the relationship among phylogenetic trees. We demonstrate that the choice of dimensionality reduction method can significantly influence the spatial relationship among a large set of competing phylogenetic trees. We highlight the importance of selecting a dimensionality reduction method to visualize large multi-locus phylogenetic landscapes and demonstrate that 3D projections of mitochondrial tree landscapes better capture the relationship among the trees being compared.

  20. The effect of orthology and coregulation on detecting regulatory motifs.

    PubMed

    Storms, Valerie; Claeys, Marleen; Sanchez, Aminael; De Moor, Bart; Verstuyf, Annemieke; Marchal, Kathleen

    2010-02-03

    Computational de novo discovery of transcription factor binding sites is still a challenging problem. The growing number of sequenced genomes allows integrating orthology evidence with coregulation information when searching for motifs. Moreover, the more advanced motif detection algorithms explicitly model the phylogenetic relatedness between the orthologous input sequences and thus should be well adapted towards using orthologous information. In this study, we evaluated the conditions under which complementing coregulation with orthologous information improves motif detection for the class of probabilistic motif detection algorithms with an explicit evolutionary model. We designed datasets (real and synthetic) covering different degrees of coregulation and orthologous information to test how well Phylogibbs and Phylogenetic sampler, as representatives of the motif detection algorithms with evolutionary model performed as compared to MEME, a more classical motif detection algorithm that treats orthologs independently. Under certain conditions detecting motifs in the combined coregulation-orthology space is indeed more efficient than using each space separately, but this is not always the case. Moreover, the difference in success rate between the advanced algorithms and MEME is still marginal. The success rate of motif detection depends on the complex interplay between the added information and the specificities of the applied algorithms. Insights in this relation provide information useful to both developers and users. All benchmark datasets are available at http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Storms_Valerie_PlosONE.

  1. The Effect of Orthology and Coregulation on Detecting Regulatory Motifs

    PubMed Central

    Storms, Valerie; Claeys, Marleen; Sanchez, Aminael; De Moor, Bart; Verstuyf, Annemieke; Marchal, Kathleen

    2010-01-01

    Background: Computational de novo discovery of transcription factor binding sites is still a challenging problem. The growing number of sequenced genomes allows integrating orthology evidence with coregulation information when searching for motifs. Moreover, the more advanced motif detection algorithms explicitly model the phylogenetic relatedness between the orthologous input sequences and thus should be well adapted towards using orthologous information. In this study, we evaluated the conditions under which complementing coregulation with orthologous information improves motif detection for the class of probabilistic motif detection algorithms with an explicit evolutionary model. Methodology: We designed datasets (real and synthetic) covering different degrees of coregulation and orthologous information to test how well Phylogibbs and Phylogenetic sampler, as representatives of the motif detection algorithms with evolutionary model, performed as compared to MEME, a more classical motif detection algorithm that treats orthologs independently. Results and Conclusions: Under certain conditions detecting motifs in the combined coregulation-orthology space is indeed more efficient than using each space separately, but this is not always the case. Moreover, the difference in success rate between the advanced algorithms and MEME is still marginal. The success rate of motif detection depends on the complex interplay between the added information and the specificities of the applied algorithms. Insights in this relation provide information useful to both developers and users. All benchmark datasets are available at http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Storms_Valerie_PlosONE. PMID:20140085

  2. A deer (subfamily Cervinae) genetic linkage map and the evolution of ruminant genomes.

    PubMed Central

    Slate, Jon; Van Stijn, Tracey C; Anderson, Rayna M; McEwan, K Mary; Maqbool, Nauman J; Mathias, Helen C; Bixley, Matthew J; Stevens, Deirdre R; Molenaar, Adrian J; Beever, Jonathan E; Galloway, Susan M; Tate, Michael L

    2002-01-01

    Comparative maps between ruminant species and humans are increasingly important tools for the discovery of genes underlying economically important traits. In this article we present a primary linkage map of the deer genome derived from an interspecies hybrid between red deer (Cervus elaphus) and Père David's deer (Elaphurus davidianus). The map is approximately 2500 cM long and contains >600 markers including both evolutionary conserved type I markers and highly polymorphic type II markers (microsatellites). Comparative mapping by annotation and sequence similarity (COMPASS) was demonstrated to be a useful tool for mapping bovine and ovine ESTs in deer. Using marker order as a phylogenetic character and comparative map information from human, mouse, deer, cattle, and sheep, we reconstructed the karyotype of the ancestral Pecoran mammal and identified the chromosome rearrangements that have occurred in the sheep, cattle, and deer lineages. The deer map and interspecies hybrid pedigrees described here are a valuable resource for (1) predicting the location of orthologs to human genes in ruminants, (2) mapping QTL in farmed and wild deer populations, and (3) ruminant phylogenetic studies. PMID:11973312

  3. The problem and promise of scale dependency in community phylogenetics.

    PubMed

    Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K

    2006-10-01

    The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.
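
    The dependence of phylogenetic structure on the chosen species pool can be made concrete with a small Python sketch. The abstract does not name its metrics, so the mean pairwise distance (MPD) and its standardized effect size against random draws from the pool are assumptions made here for illustration; the species labels and distances are hypothetical.

```python
import random
from itertools import combinations
from statistics import mean, pstdev

def mpd(species, dist):
    """Mean pairwise phylogenetic distance among the given species."""
    pairs = combinations(sorted(species), 2)
    return mean(dist[frozenset(p)] for p in pairs)

def ses_mpd(community, pool, dist, n_null=999, seed=0):
    """Standardized effect size of MPD against random draws from the pool.

    Negative values indicate phylogenetic clustering, positive values
    overdispersion, relative to the chosen species pool.
    """
    rng = random.Random(seed)
    observed = mpd(community, dist)
    null = [mpd(rng.sample(sorted(pool), len(community)), dist)
            for _ in range(n_null)]
    sd = pstdev(null)
    return (observed - mean(null)) / sd if sd > 0 else float("nan")

# Hypothetical pool of four species forming two close pairs (a,b) and (c,d).
dist = {frozenset(p): 10.0 for p in combinations("abcd", 2)}
dist[frozenset("ab")] = 2.0
dist[frozenset("cd")] = 2.0

community = {"a", "b"}
print(mpd(community, dist), ses_mpd(community, set("abcd"), dist))
```

    Enlarging the pool with more distantly related species raises the null expectation of MPD, so the same local assemblage tends to look more clustered, which is the scale effect the abstract reports.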

  4. Phylogenetic marker development for target enrichment from transcriptome and genome skim data: the pipeline and its application in southern African Oxalis (Oxalidaceae)

    Treesearch

    Roswitha Schmickl; Aaron Liston; Vojtěch Zeisek; Kenneth Oberlander; Kevin Weitemier; Shannon C. K. Straub; Richard C. Cronn; Léanne L. Dreyer; Jan Suda

    2016-01-01

    Phylogenetics benefits from using a large number of putatively independent nuclear loci and their combination with other sources of information, such as the plastid and mitochondrial genomes. To facilitate the selection of orthologous low-copy nuclear (LCN) loci for phylogenetics in nonmodel organisms, we created an automated and interactive script to select hundreds...

  5. Phylogenetic patterns of climatic, habitat and trophic niches in a European avian assemblage

    PubMed Central

    Pearman, Peter B; Lavergne, Sébastien; Roquet, Cristina; Wüest, Rafael; Zimmermann, Niklaus E; Thuiller, Wilfried

    2014-01-01

    Aim: The origins of ecological diversity in continental species assemblages have long intrigued biogeographers. We apply phylogenetic comparative analyses to disentangle the evolutionary patterns of ecological niches in an assemblage of European birds. We compare phylogenetic patterns in trophic, habitat and climatic niche components. Location: Europe. Methods: From polygon range maps and handbook data we inferred the realized climatic, habitat and trophic niches of 405 species of breeding birds in Europe. We fitted Pagel's lambda and kappa statistics, and conducted analyses of disparity through time to compare temporal patterns of ecological diversification on all niche axes together. All observed patterns were compared with expectations based on neutral (Brownian) models of niche divergence. Results: In this assemblage, patterns of phylogenetic signal (lambda) suggest that related species resemble each other less in regard to their climatic and habitat niches than they do in their trophic niche. Kappa estimates show that ecological divergence does not gradually increase with divergence time, and that this punctualism is stronger in climatic niches than in habitat and trophic niches. Observed niche disparity markedly exceeds levels expected from a Brownian model of ecological diversification, thus providing no evidence for past phylogenetic niche conservatism in these multivariate niches. Levels of multivariate disparity are greatest for the climatic niche, followed by disparity of the habitat and the trophic niches. Main conclusions: Phylogenetic patterns in the three niche components differ within this avian assemblage. Variation in evolutionary rates (degree of gradualism, constancy through the tree) and/or non-random macroecological sampling probably lead here to differences in the phylogenetic structure of niche components. Testing hypotheses on the origin of these patterns requires more complete phylogenetic trees of the birds, and extended ecological data on different niche components for all bird species. PMID:24790525
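
    As background for the lambda and kappa statistics mentioned in this abstract, the two tree transformations can be sketched in Python. This shows only the standard transformations themselves (lambda rescales the shared-history covariances, kappa raises branch lengths to a power), not the model fitting the authors performed; the three-taxon covariance matrix and branch lengths are hypothetical.

```python
def pagel_lambda_transform(vcv, lam):
    """Multiply off-diagonal covariances by lambda, leaving variances unchanged.

    lam = 1 keeps the Brownian expectation; lam = 0 removes all shared
    history, i.e. no phylogenetic signal.
    """
    n = len(vcv)
    return [[vcv[i][j] if i == j else lam * vcv[i][j] for j in range(n)]
            for i in range(n)]

def pagel_kappa_transform(branch_lengths, kappa):
    """Raise each branch length to the power kappa.

    kappa = 1 keeps the original lengths (gradual change); kappa = 0 sets
    every branch to length 1 (change associated with speciation events).
    """
    return [b ** kappa for b in branch_lengths]

# Hypothetical tree ((A:1,B:1):1,C:2): A and B share one unit of history.
vcv = [[2.0, 1.0, 0.0],
       [1.0, 2.0, 0.0],
       [0.0, 0.0, 2.0]]

half_signal = pagel_lambda_transform(vcv, 0.5)
punctuated = pagel_kappa_transform([1.0, 1.0, 1.0, 2.0], 0)
print(half_signal, punctuated)
```

    In practice lambda and kappa are estimated by maximizing the likelihood of the trait data under the transformed tree, which this sketch does not attempt.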

  6. Comparative Chloroplast Genomes of Photosynthetic Orchids: Insights into Evolution of the Orchidaceae and Development of Molecular Markers for Phylogenetic Applications

    PubMed Central

    Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu

    2014-01-01

    The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family. PMID:24911363

  7. Comparative chloroplast genomes of photosynthetic orchids: insights into evolution of the Orchidaceae and development of molecular markers for phylogenetic applications.

    PubMed

    Luo, Jing; Hou, Bei-Wei; Niu, Zhi-Tao; Liu, Wei; Xue, Qing-Yun; Ding, Xiao-Yu

    2014-01-01

    The orchid family Orchidaceae is one of the largest angiosperm families, including many species of important economic value. While chloroplast genomes are very informative for systematics and species identification, there is very limited information available on chloroplast genomes in the Orchidaceae. Here, we report the complete chloroplast genomes of the medicinal plant Dendrobium officinale and the ornamental orchid Cypripedium macranthos, demonstrating their gene content and order and potential RNA editing sites. The chloroplast genomes of the above two species and five known photosynthetic orchids showed similarities in structure as well as gene order and content, but differences in the organization of the inverted repeat/small single-copy junction and ndh genes. The organization of the inverted repeat/small single-copy junctions in the chloroplast genomes of these orchids was classified into four types; we propose that inverted repeats flanking the small single-copy region underwent expansion or contraction among Orchidaceae. The AT-rich regions of the ycf1 gene in orchids could be linked to the recombination of inverted repeat/small single-copy junctions. Relative species in orchids displayed similar patterns of variation in ndh gene contents. Furthermore, fifteen highly divergent protein-coding genes were identified, which are useful for phylogenetic analyses in orchids. To test the efficiency of these genes serving as markers in phylogenetic analyses, coding regions of four genes (accD, ccsA, matK, and ycf1) were used as a case study to construct phylogenetic trees in the subfamily Epidendroideae. High support was obtained for placement of previously unlocated subtribes Collabiinae and Dendrobiinae in the subfamily Epidendroideae. Our findings expand understanding of the diversity of orchid chloroplast genomes and provide a reference for study of the molecular systematics of this family.

  8. The use of phylogeny to interpret cross-cultural patterns in plant use and guide medicinal plant discovery: an example from Pterocarpus (Leguminosae).

    PubMed

    Saslis-Lagoudakis, C Haris; Klitgaard, Bente B; Forest, Félix; Francis, Louise; Savolainen, Vincent; Williamson, Elizabeth M; Hawkins, Julie A

    2011-01-01

    The study of traditional knowledge of medicinal plants has led to discoveries that have helped combat diseases and improve healthcare. However, the development of quantitative measures that can assist our quest for new medicinal plants has not greatly advanced in recent years. Phylogenetic tools have entered many scientific fields in the last two decades to provide explanatory power, but have been overlooked in ethnomedicinal studies. Several studies show that medicinal properties are not randomly distributed in plant phylogenies, suggesting that phylogeny shapes ethnobotanical use. Nevertheless, empirical studies that explicitly combine ethnobotanical and phylogenetic information are scarce. In this study, we borrowed tools from community ecology phylogenetics to quantify significance of phylogenetic signal in medicinal properties in plants and identify nodes on phylogenies with high bioscreening potential. To do this, we produced an ethnomedicinal review from extensive literature research and a multi-locus phylogenetic hypothesis for the pantropical genus Pterocarpus (Leguminosae: Papilionoideae). We demonstrate that species used to treat certain conditions, such as malaria, are significantly phylogenetically clumped and we highlight nodes in the phylogeny that are significantly overabundant in species used to treat certain conditions. These cross-cultural patterns in ethnomedicinal usage in Pterocarpus are interpreted in the light of phylogenetic relationships. This study provides techniques that enable the application of phylogenies in bioscreening, but also sheds light on the processes that shape cross-cultural ethnomedicinal patterns. This community phylogenetic approach demonstrates that similar ethnobotanical uses can arise in parallel in different areas where related plants are available. With a vast amount of ethnomedicinal and phylogenetic information available, we predict that this field, after further refinement of the techniques, will expand into similar research areas, such as pest management or the search for bioactive plant-based compounds.

  9. Towards a general framework for predicting threat status of data-deficient species from phylogenetic, spatial and environmental information.

    PubMed

    Jetz, Walter; Freckleton, Robert P

    2015-02-19

    In taxon-wide assessments of threat status many species remain excluded owing to lack of data. Here, we present a novel spatial-phylogenetic statistical framework that uses a small set of readily available or derivable characteristics, including phylogenetically imputed body mass and remotely sensed human encroachment, to provide initial baseline predictions of threat status for data-deficient species. Applied to assessed mammal species worldwide, the approach effectively identifies threatened species and predicts the geographical variation in threat. For the 483 data-deficient species, the models predict highly elevated threat, with 69% 'at-risk' species in this set, compared with 22% among assessed species. This results in 331 additional potentially threatened mammals, with elevated conservation importance in rodents, bats and shrews, and countries like Colombia, Sulawesi and the Philippines. These findings demonstrate the future potential for combining phylogenies and remotely sensed data with species distributions to identify species and regions of conservation concern. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  10. Constructing phylogenetic trees using interacting pathways.

    PubMed

    Wan, Peng; Che, Dongsheng

    2013-01-01

    Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches to constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to the accumulation of huge amounts of biological data, which in turn has changed the way biological studies are conducted in various respects. In this paper, we report our approach of building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering on two domains of organisms: eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.

  11. Comprehensive Genetic Characterization of Intraprostatic Chronic Inflammation and Prostate Cancer in African American Men

    DTIC Science & Technology

    2017-09-01

    with new methodologies of intratumoral phylogenetic analyses, will yield pivotal information in elucidating the key genes involved evolution of PCa...combined with both clinical and experimental genetic data produced by this study may empower patients and doctors to make personalized treatment decisions...sequencing, paired with new methodologies of intratumoral phylogenetic analyses, will yield pivotal information in elucidating the key genes involved

  12. Phylogenomics with paralogs

    PubMed Central

    Hellmuth, Marc; Wieseke, Nicolas; Lechner, Marcus; Lenhof, Hans-Peter; Middendorf, Martin; Stadler, Peter F.

    2015-01-01

    Phylogenomics heavily relies on well-curated sequence data sets that comprise, for each gene, exclusively 1:1 orthologs. Paralogs are treated as a dangerous nuisance that has to be detected and removed. We show here that this severe restriction of the data sets is not necessary. Building upon recent advances in mathematical phylogenetics, we demonstrate that gene duplications convey meaningful phylogenetic information and allow the inference of plausible phylogenetic trees, provided orthologs and paralogs can be distinguished with a degree of certainty. Starting from tree-free estimates of orthology, cograph editing can sufficiently reduce the noise to find correct event-annotated gene trees. The information of gene trees can then directly be translated into constraints on the species trees. Although the resolution is very poor for individual gene families, we show that genome-wide data sets are sufficient to generate fully resolved phylogenetic trees, even in the presence of horizontal gene transfer. PMID:25646426

  13. Welcome to pandoraviruses at the ‘Fourth TRUC’ club

    PubMed Central

    Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier

    2015-01-01

    Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9–2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the ‘Fourth TRUC’ club, encompassing distinct life forms compared with cellular organisms. PMID:26042093

  14. Welcome to pandoraviruses at the 'Fourth TRUC' club.

    PubMed

    Sharma, Vikas; Colson, Philippe; Chabrol, Olivier; Scheid, Patrick; Pontarotti, Pierre; Raoult, Didier

    2015-01-01

    Nucleocytoplasmic large DNA viruses, or representatives of the proposed order Megavirales, belong to families of giant viruses that infect a broad range of eukaryotic hosts. Megaviruses have been previously described to comprise a fourth monophylogenetic TRUC (things resisting uncompleted classification) together with cellular domains in the universal tree of life. Recently described pandoraviruses have large (1.9-2.5 MB) and highly divergent genomes. In the present study, we updated the classification of pandoraviruses and other reported giant viruses. Phylogenetic trees were constructed based on six informational genes. Hierarchical clustering was performed based on a set of informational genes from Megavirales members and cellular organisms. Homologous sequences were selected from cellular organisms using TimeTree software, comprising comprehensive, and representative sets of members from Bacteria, Archaea, and Eukarya. Phylogenetic analyses based on three conserved core genes clustered pandoraviruses with phycodnaviruses, exhibiting their close relatedness. Additionally, hierarchical clustering analyses based on informational genes grouped pandoraviruses with Megavirales members as a super group distinct from cellular organisms. Thus, the analyses based on core conserved genes revealed that pandoraviruses are new genuine members of the 'Fourth TRUC' club, encompassing distinct life forms compared with cellular organisms.

  15. Species divergence and phylogenetic variation of ecophysiological traits in lianas and trees.

    PubMed

    Rios, Rodrigo S; Salgado-Luarte, Cristian; Gianoli, Ernesto

    2014-01-01

    The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [A(max)], dark respiration rate [R(d)], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that R(d) evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for R(d), while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. R(d) followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for A(max). R(d) may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades.

  16. Species Divergence and Phylogenetic Variation of Ecophysiological Traits in Lianas and Trees

    PubMed Central

    Rios, Rodrigo S.; Salgado-Luarte, Cristian; Gianoli, Ernesto

    2014-01-01

    The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [Amax], dark respiration rate [Rd], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that Rd evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for Rd, while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. Rd followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for Amax. Rd may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades. PMID:24914958
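    The comparison of observed phylogenetic distance with a chance expectation, as described in the abstract above, can be illustrated with a generic null-model sketch. This is not the authors' code: the toy distance table, the function names, and the choice of null model (random same-size draws from a species pool) are all assumptions made for illustration.

```python
import itertools
import random


def mean_pairwise_distance(species, dist):
    """Mean phylogenetic distance over all unordered pairs in `species`."""
    pairs = list(itertools.combinations(sorted(species), 2))
    return sum(dist[frozenset(p)] for p in pairs) / len(pairs)


def mpd_null_test(sample, pool, dist, n_draws=999, seed=0):
    """Compare the observed MPD with the MPD of random same-size draws.

    Returns (observed MPD, fraction of random draws with MPD <= observed).
    A small fraction points towards clustering, a large one towards evenness.
    """
    rng = random.Random(seed)
    observed = mean_pairwise_distance(sample, dist)
    candidates = sorted(pool)
    null = [
        mean_pairwise_distance(rng.sample(candidates, len(sample)), dist)
        for _ in range(n_draws)
    ]
    at_or_below = sum(1 for value in null if value <= observed)
    return observed, at_or_below / n_draws


# Hypothetical toy data: two clades, short distances within a clade,
# long distances between clades.
clade = {"A1": "A", "A2": "A", "A3": "A", "B1": "B", "B2": "B", "B3": "B"}
dist = {
    frozenset((a, b)): (2.0 if clade[a] == clade[b] else 10.0)
    for a, b in itertools.combinations(clade, 2)
}

if __name__ == "__main__":
    print(mpd_null_test({"A1", "A2", "A3"}, set(clade), dist))
```

    A sample drawn from a single clade yields a low observed distance relative to the random draws, which is the pattern the abstract calls phylogenetic clustering.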

  17. GENOME-WIDE COMPARATIVE ANALYSIS OF PHYLOGENETIC TREES: THE PROKARYOTIC FOREST OF LIFE

    PubMed Central

    Puigbò, Pere; Wolf, Yuri I.; Koonin, Eugene V.

    2013-01-01

    Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance (SD) method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a ‘species tree’. PMID:22399455

  18. Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life.

    PubMed

    Puigbò, Pere; Wolf, Yuri I; Koonin, Eugene V

    2012-01-01

    Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."

  19. Keeping All the PIECES: Phylogenetically Informed Ex Situ Conservation of Endangered Species.

    PubMed

    Larkin, Daniel J; Jacobi, Sarah K; Hipp, Andrew L; Kramer, Andrea T

    2016-01-01

    Ex situ conservation in germplasm and living collections is a major focus of global plant conservation strategies. Prioritizing species for ex situ collection is a necessary component of this effort for which sound strategies are needed. Phylogenetic considerations can play an important role in prioritization. Collections that are more phylogenetically diverse are likely to encompass more ecological and trait variation, and thus provide stronger conservation insurance and richer resources for future restoration efforts. However, phylogenetic criteria need to be weighed against other, potentially competing objectives. We used ex situ collection and threat rank data for North American angiosperms to investigate gaps in ex situ coverage and phylogenetic diversity of collections and to develop a flexible framework for prioritizing species across multiple objectives. We found that ex situ coverage of 18,766 North American angiosperm taxa was low with respect to the most vulnerable taxa: just 43% of vulnerable to critically imperiled taxa were in ex situ collections, far short of a year-2020 goal of 75%. In addition, species held in ex situ collections were phylogenetically clustered (P < 0.001), i.e., collections comprised less phylogenetic diversity than would be expected had species been drawn at random. These patterns support incorporating phylogenetic considerations into ex situ prioritization in a manner balanced with other criteria, such as vulnerability. To meet this need, we present the 'PIECES' index (Phylogenetically Informed Ex situ Conservation of Endangered Species). PIECES integrates phylogenetic considerations into a flexible framework for prioritizing species across competing objectives using multi-criteria decision analysis. Applying PIECES to prioritizing ex situ conservation of North American angiosperms, we show strong return on investment across multiple objectives, some of which are negatively correlated with each other. A spreadsheet-based decision support tool for North American angiosperms is provided; this tool can be customized to align with different conservation objectives.

  20. Keeping All the PIECES: Phylogenetically Informed Ex Situ Conservation of Endangered Species

    PubMed Central

    Larkin, Daniel J.; Jacobi, Sarah K.; Hipp, Andrew L.; Kramer, Andrea T.

    2016-01-01

    Ex situ conservation in germplasm and living collections is a major focus of global plant conservation strategies. Prioritizing species for ex situ collection is a necessary component of this effort for which sound strategies are needed. Phylogenetic considerations can play an important role in prioritization. Collections that are more phylogenetically diverse are likely to encompass more ecological and trait variation, and thus provide stronger conservation insurance and richer resources for future restoration efforts. However, phylogenetic criteria need to be weighed against other, potentially competing objectives. We used ex situ collection and threat rank data for North American angiosperms to investigate gaps in ex situ coverage and phylogenetic diversity of collections and to develop a flexible framework for prioritizing species across multiple objectives. We found that ex situ coverage of 18,766 North American angiosperm taxa was low with respect to the most vulnerable taxa: just 43% of vulnerable to critically imperiled taxa were in ex situ collections, far short of a year-2020 goal of 75%. In addition, species held in ex situ collections were phylogenetically clustered (P < 0.001), i.e., collections comprised less phylogenetic diversity than would be expected had species been drawn at random. These patterns support incorporating phylogenetic considerations into ex situ prioritization in a manner balanced with other criteria, such as vulnerability. To meet this need, we present the ‘PIECES’ index (Phylogenetically Informed Ex situ Conservation of Endangered Species). PIECES integrates phylogenetic considerations into a flexible framework for prioritizing species across competing objectives using multi-criteria decision analysis. Applying PIECES to prioritizing ex situ conservation of North American angiosperms, we show strong return on investment across multiple objectives, some of which are negatively correlated with each other. A spreadsheet-based decision support tool for North American angiosperms is provided; this tool can be customized to align with different conservation objectives. PMID:27257671
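    The abstract does not give the PIECES formula, so the following is only a generic weighted-sum sketch of multi-criteria prioritization, not the published index. The criterion names, species labels, scores, and weights are hypothetical.

```python
def normalize(values):
    """Rescale a {species: score} mapping to the range 0..1 (min-max)."""
    lo, hi = min(values.values()), max(values.values())
    if hi == lo:
        return {name: 0.0 for name in values}
    return {name: (v - lo) / (hi - lo) for name, v in values.items()}


def rank_species(criteria, weights):
    """Rank species by the weighted mean of their normalized criterion scores.

    `criteria` maps criterion name -> {species: raw score};
    `weights` maps criterion name -> non-negative weight.
    Returns a list of (species, score) sorted from highest to lowest score.
    """
    scaled = {name: normalize(vals) for name, vals in criteria.items()}
    species = list(next(iter(criteria.values())))
    total_weight = sum(weights[name] for name in criteria)
    scores = {
        sp: sum(weights[name] * scaled[name][sp] for name in criteria) / total_weight
        for sp in species
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical inputs: higher values mean higher priority on each criterion.
criteria = {
    "phylo_distinctness": {"sp1": 0.9, "sp2": 0.2, "sp3": 0.5},
    "vulnerability": {"sp1": 1, "sp2": 5, "sp3": 3},
}

if __name__ == "__main__":
    print(rank_species(criteria, {"phylo_distinctness": 2, "vulnerability": 1}))
    print(rank_species(criteria, {"phylo_distinctness": 1, "vulnerability": 3}))
```

    Changing the weights reorders the ranking, which is one simple way competing objectives can be traded off against each other.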

  1. Transforming phylogenetic networks: Moving beyond tree space.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

    2016-09-07

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.

  2. SERAPHIM: studying environmental rasters and phylogenetically informed movements.

    PubMed

    Dellicour, Simon; Rose, Rebecca; Faria, Nuno R; Lemey, Philippe; Pybus, Oliver G

    2016-10-15

    SERAPHIM ("Studying Environmental Rasters and PHylogenetically Informed Movements") is a suite of computational methods developed to study phylogenetic reconstructions of spatial movement in an environmental context. SERAPHIM extracts the spatio-temporal information contained in estimated phylogenetic trees and uses this information to calculate summary statistics of spatial spread and to visualize dispersal history. Most importantly, SERAPHIM enables users to study the impact of customized environmental variables on the spread of the study organism. Specifically, given an environmental raster, SERAPHIM computes environmental "weights" for each phylogeny branch, which represent the degree to which the environmental variable impedes (or facilitates) lineage movement. Correlations between movement duration and these environmental weights are then assessed, and the statistical significances of these correlations are evaluated using null distributions generated by a randomization procedure. SERAPHIM can be applied to any phylogeny whose nodes are annotated with spatial and temporal information. At present, such phylogenies are most often found in the field of emerging infectious diseases, but will become increasingly common in other biological disciplines as population genomic data grows. SERAPHIM 1.0 is freely available from http://evolve.zoo.ox.ac.uk/. R package, source code, example files, tutorials and a manual are also available from this website. Contact: simon.dellicour@kuleuven.be or oliver.pybus@zoo.ox.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
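    The idea of judging a correlation against a null distribution built by randomization can be sketched generically as below. This is not SERAPHIM's procedure or API (SERAPHIM is an R package and its randomization scheme is not described in the abstract); the simple shuffling of weights, the function names, and the toy numbers are assumptions for illustration only.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def randomization_p_value(durations, weights, n_perm=999, seed=0):
    """One-sided permutation test for a positive correlation.

    The null distribution is built by shuffling `weights` across branches.
    Returns (observed correlation, p-value).
    """
    rng = random.Random(seed)
    observed = pearson(durations, weights)
    shuffled = list(weights)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if pearson(durations, shuffled) >= observed:
            hits += 1
    # Add one to numerator and denominator so the p-value is never zero.
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical per-branch values: movement duration and environmental weight.
durations = [1, 2, 3, 4, 5, 6, 7, 8]
weights = [1.1, 2.0, 2.9, 4.2, 5.1, 5.8, 7.2, 8.1]

if __name__ == "__main__":
    print(randomization_p_value(durations, weights))
```

    With strongly associated toy values, almost no shuffled arrangement matches the observed correlation, so the resulting p-value is small.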

  3. BrassiBase: introduction to a novel knowledge database on Brassicaceae evolution.

    PubMed

    Kiefer, Markus; Schmickl, Roswitha; German, Dmitry A; Mandáková, Terezie; Lysak, Martin A; Al-Shehbaz, Ihsan A; Franzke, Andreas; Mummenhoff, Klaus; Stamatakis, Alexandros; Koch, Marcus A

    2014-01-01

    The Brassicaceae family (mustards or crucifers) includes Arabidopsis thaliana as one of the most important model species in plant biology and a number of important crop plants such as the various Brassica species (e.g. cabbage, canola and mustard). Moreover, the family comprises an increasing number of species that serve as study systems in many fields of plant science and evolutionary research. However, the systematics and taxonomy of the family are very complex and access to scientifically valuable and reliable information linked to species and genus names and its interpretation are often difficult. BrassiBase is a continuously developing and growing knowledge database (http://brassibase.cos.uni-heidelberg.de) that aims at providing direct access to many different types of information ranging from taxonomy and systematics to phylo- and cytogenetics. Providing critically revised key information, the database intends to optimize comparative evolutionary research in this family and supports the introduction of the Brassicaceae as the model family for evolutionary biology and plant sciences. Some features that should help to accomplish these goals within a comprehensive taxonomic framework have now been implemented in the new version 1.1.9. A 'Phylogenetic Placement Tool' should help to identify critical accessions and germplasm and provide a first visualization of phylogenetic relationships. The 'Cytogenetics Tool' provides in-depth information on genome sizes, chromosome numbers and polyploidy, and sets this information into a Brassicaceae-wide context.

  4. MASTtreedist: visualization of tree space based on maximum agreement subtree.

    PubMed

    Huang, Hong; Li, Yongji

    2013-01-01

    The phylogenetic tree construction process might produce many candidate trees as the "best estimates." As the number of constructed phylogenetic trees grows, the need to efficiently compare their topological or physical structures arises. One tree-comparison software tool, Mesquite's Tree Set Viz module, allows the rapid and efficient visualization of the tree comparison distances using multidimensional scaling (MDS). Tree-distance measures, such as Robinson-Foulds (RF), for the topological distance among different trees have been implemented in Tree Set Viz. New and sophisticated measures such as Maximum Agreement Subtree (MAST) can be continuously built upon Tree Set Viz. MAST can detect the common substructures among trees and provide more precise information on the similarity of the trees, but it is NP-hard and difficult to implement. In this article, we present a practical tree-distance metric: MASTtreedist, a MAST-based comparison metric in Mesquite's Tree Set Viz module. In this metric, the efficient optimizations for the maximum weight clique problem are applied. The results suggest that the proposed method can efficiently compute the MAST distances among trees, and such tree topological differences can be translated as a scatter of points in two-dimensional (2D) space. We also provide a statistical evaluation of the provided measures with respect to RF using experimental data sets. This new comparison module provides a new tree-tree pairwise comparison metric based on the differences of the number of MAST leaves among constructed phylogenetic trees. Such a new phylogenetic tree comparison metric improves the visualization of taxa differences by discriminating small divergences of subtree structures for phylogenetic tree reconstruction.
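    For orientation, the Robinson-Foulds (RF) baseline mentioned in the abstract above can be sketched as the number of non-trivial splits found in one tree but not the other. This is a minimal illustration, not Mesquite's or MASTtreedist's implementation, and it does not compute MAST; the nested-tuple tree encoding and the function names are assumptions.

```python
def leaf_set(tree):
    """All leaf labels of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def splits(tree):
    """Non-trivial bipartitions of the leaf set induced by internal edges."""
    all_leaves = leaf_set(tree)
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(visit(child) for child in node))
        other = all_leaves - below
        # Skip trivial splits that separate a single leaf (or nothing).
        if len(below) > 1 and len(other) > 1:
            found.add(frozenset((below, other)))
        return below

    visit(tree)
    return found


def robinson_foulds(tree_a, tree_b):
    """Count splits present in exactly one of the two trees."""
    if leaf_set(tree_a) != leaf_set(tree_b):
        raise ValueError("trees must share the same leaf labels")
    return len(splits(tree_a) ^ splits(tree_b))


# Hypothetical five-taxon trees that differ in how A, B and C are grouped.
tree_1 = ((("A", "B"), "C"), ("D", "E"))
tree_2 = ((("A", "C"), "B"), ("D", "E"))

if __name__ == "__main__":
    print(robinson_foulds(tree_1, tree_2))
```

    A matrix of such pairwise distances is the kind of input that a multidimensional scaling step would then project into two dimensions.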

  5. Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

    PubMed Central

    Gao, Beile

    2012-01-01

    Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their suprageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973

  6. Phytoplasma phylogenetics based on analysis of secA and 23S rRNA gene sequences for improved resolution of candidate species of 'Candidatus Phytoplasma'.

    PubMed

    Hodgetts, Jennifer; Boonham, Neil; Mumford, Rick; Harrison, Nigel; Dickinson, Matthew

    2008-08-01

    Phytoplasma phylogenetics has focused primarily on sequences of the non-coding 16S rRNA gene and the 16S-23S rRNA intergenic spacer region (16-23S ISR), and primers that enable amplification of these regions from all phytoplasmas by PCR are well established. In this study, primers based on the secA gene have been developed into a semi-nested PCR assay that results in a sequence of the expected size (about 480 bp) from all 34 phytoplasmas examined, including strains representative of 12 16Sr groups. Phylogenetic analysis of secA gene sequences showed similar clustering of phytoplasmas when compared with clusters resolved by similar sequence analyses of a 16-23S ISR-23S rRNA gene contig or of the 16S rRNA gene alone. The main differences between trees were in the branch lengths, which were elongated in the 16-23S ISR-23S rRNA gene tree when compared with the 16S rRNA gene tree and elongated still further in the secA gene tree, despite this being a shorter sequence. The improved resolution in the secA gene-derived phylogenetic tree resulted in the 16SrII group splitting into two distinct clusters, while phytoplasmas associated with coconut lethal yellowing-type diseases split into three distinct groups, thereby supporting past proposals that they represent different candidate species within 'Candidatus Phytoplasma'. The ability to differentiate 16Sr groups and subgroups by virtual RFLP analysis of secA gene sequences suggests that this gene may provide an informative alternative molecular marker for pathogen identification and diagnosis of phytoplasma diseases.

  7. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Coleman-Derr, Devin; Bowman, Brett

    2014-03-17

    The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance our knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.

  8. The Complete Mitochondrial Genome of Corizus tetraspilus (Hemiptera: Rhopalidae) and Phylogenetic Analysis of Pentatomomorpha

    PubMed Central

    Guo, Zhong-Long; Wang, Juan; Shen, Yu-Ying

    2015-01-01

    Insect mitochondrial genomes (mitogenomes) are the most extensively used source of genetic information for molecular evolution, phylogenetics and population genetics. Pentatomomorpha (>14,000 species) is the second largest infraorder of Heteroptera and of great economic importance. To better understand the diversity and phylogeny within Pentatomomorpha, we sequenced and annotated the complete mitogenome of Corizus tetraspilus (Hemiptera: Rhopalidae), an important pest of alfalfa in China. We analyzed the main features of the C. tetraspilus mitogenome, and provided a comparative analysis with four other Coreoidea species. Our results reveal that gene content, gene arrangement, nucleotide composition, codon usage, rRNA structures and sequences of mitochondrial transcription termination factor are conserved in Coreoidea. Comparative analysis shows that different protein-coding genes have been subject to different evolutionary rates correlated with the G+C content. All the transfer RNA genes found in Coreoidea have the typical clover leaf secondary structure, except for trnS1 (AGN) which lacks the dihydrouridine (DHU) arm and possesses an unusual anticodon stem (9 bp vs. the normal 5 bp). The control regions (CRs) among Coreoidea are highly variable in size, of which the CR of C. tetraspilus is the smallest (440 bp), making the C. tetraspilus mitogenome the smallest (14,989 bp) within all completely sequenced Coreoidea mitogenomes. No conserved motifs are found in the CRs of Coreoidea. In addition, the A+T content (60.68%) of the CR of C. tetraspilus is much lower than that of the entire mitogenome (74.88%), and is lowest among Coreoidea. Phylogenetic analyses based on mitogenomic data support the monophyly of each superfamily within Pentatomomorpha, and recognize a phylogenetic relationship of (Aradoidea + (Pentatomoidea + (Lygaeoidea + (Pyrrhocoroidea + Coreoidea)))). PMID:26042898

  9. Do chromosome numbers reflect phylogeny? New counts for Bombacoideae and a review of Malvaceae s.l.

    PubMed

    Marinho, Rafaela C; Mendes-Rodrigues, Clesnan; Balao, Francisco; Ortiz, Pedro L; Yamagishi-Costa, Júlia; Bonetti, Ana M; Oliveira, Paulo E

    2014-09-01

    Whole genome duplication (WGD) and specific polyploidy events marked turning points for angiosperm genome structure and evolution. Therefore, cytogenetic studies of polyploidy-prone groups such as the tropical Malvaceae and plant formations such as the Brazilian Cerrado have gained further importance. We present new chromosome counts for Cerrado Bombacoideae and revised chromosome numbers for the Malvaceae s.l., compare these between subfamilies, and relate them to phylogenetic signal. We studied the chromosome number of Eriotheca candolleana, E. gracilipes, E. pubescens, Pachira glabra, Pseudobombax longiflorum, and P. tomentosum. We also compared Eriotheca species ploidy levels using flow cytometry. We compiled chromosome numbers for 557 species of Malvaceae s.l., including 37 Bombacoideae species. We included this information in a phylogenetic reconstruction based on chloroplast matK-trnK DNA to evaluate chromosome evolution of the Malvaceae s.l. and the Bombacoideae in particular. The Cerrado Bombacoideae presented consistently high chromosome numbers. Numbers for Eriotheca species were among the highest and varied among populations. Flow cytometry analyses showed similar 1Cx DNA for all cytotypes and indicated neopolyploidy. Chromosome numbers differed between subfamilies, with the lowest numbers in the Malvoideae and Byttnerioideae and the highest in Tilioideae. Chromosome numbers had significant phylogenetic signal for Bombacoideae but not for Malvoideae or Malvaceae s.l. Clearly distinct chromosome numbers allied to monophyly provide some support for a circumscription of the Bombacoideae and distinction within the Malvaceae. The phylogenetic signal for chromosome number supports the idea of an ancient WGD and further neopolyploidy events as important evolutionary trends for the Bombacoideae. © 2014 Botanical Society of America, Inc.

  10. The comparative osteology of the petrotympanic complex (ear region) of extant baleen whales (Cetacea: Mysticeti).

    PubMed

    Ekdale, Eric G; Berta, Annalisa; Deméré, Thomas A

    2011-01-01

    Anatomical comparisons of the ear region of baleen whales (Mysticeti) are provided through detailed osteological descriptions and high-resolution photographs of the petrotympanic complex (tympanic bulla and petrosal bone) of all extant species of mysticete cetaceans. Salient morphological features are illustrated and identified, including overall shape of the bulla, size of the conical process of the bulla, morphology of the promontorium, and the size and shape of the anterior process of the petrosal. We place our comparative osteological observations into a phylogenetic context in order to initiate an exploration into petrotympanic evolution within Mysticeti. The morphology of the petrotympanic complex is diagnostic for individual species of baleen whale (e.g., sigmoid and conical processes positioned at midline of bulla in Balaenoptera musculus; confluence of fenestra cochleae and perilymphatic foramen in Eschrichtius robustus), and several mysticete clades are united by derived characteristics. Balaenids and neobalaenids share derived features of the bulla, such as a rhomboid shape and a reduced anterior lobe (swelling) in ventral aspect, and eschrichtiids share derived morphologies of the petrosal with balaenopterids, including loss of a medial promontory groove and dorsomedial elongation of the promontorium. Monophyly of Balaenoidea (Balaenidae and Neobalaenidae) and Balaenopteroidea (Balaenopteridae and Eschrichtiidae) was recovered in phylogenetic analyses utilizing data exclusively from the petrotympanic complex. This study fills a major gap in our knowledge of the complex structures of the mysticete petrotympanic complex, which is an important anatomical region for the interpretation of the evolutionary history of mammals. In addition, we introduce a novel body of phylogenetically informative characters from the ear region of mysticetes. Our detailed anatomical descriptions, illustrations, and comparisons provide valuable data for current and future studies on the phylogenetic relationships, evolution, and auditory physiology of mysticetes and other cetaceans throughout Earth's history.

  11. Edge-related loss of tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.

    PubMed

    Santos, Bráulio A; Arroyo-Rodríguez, Víctor; Moreno, Claudia E; Tabarelli, Marcelo

    2010-09-08

    Deforestation and forest fragmentation are known major causes of nonrandom extinction, but there is no information about their impact on the phylogenetic diversity of the remaining species assemblages. Using a large vegetation dataset from an old hyper-fragmented landscape in the Brazilian Atlantic rainforest we assess whether the local extirpation of tree species and functional impoverishment of tree assemblages reduce the phylogenetic diversity of the remaining tree assemblages. We detected a significant loss of tree phylogenetic diversity in forest edges, but not in core areas of small (<80 ha) forest fragments. This was attributed to a reduction of 11% in the average phylogenetic distance between any two randomly chosen individuals from forest edges; an increase of 17% in the average phylogenetic distance to closest non-conspecific relative for each individual in forest edges; and to the potential manifestation of late edge effects in the core areas of small forest remnants. We found no evidence supporting fragmentation-induced phylogenetic clustering or evenness. This could be explained by the low phylogenetic conservatism of key life-history traits corresponding to vulnerable species. Edge effects must be reduced to effectively protect tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.
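
    The two distance summaries described in this abstract resemble the metrics commonly called mean pairwise distance (MPD) and mean nearest taxon distance (MNTD). The Python sketch below shows how such summaries could be computed from a table of pairwise phylogenetic distances. It is a simplification under stated assumptions: it works on species presence only, whereas the study reports distances between individuals, and the distance values and function names are made up for illustration.

```python
from itertools import combinations


def mean_pairwise_distance(species, dist):
    """Average phylogenetic distance over all unordered pairs of species."""
    pairs = list(combinations(sorted(species), 2))
    if not pairs:
        raise ValueError("need at least two species")
    return sum(dist[frozenset(pair)] for pair in pairs) / len(pairs)


def mean_nearest_taxon_distance(species, dist):
    """Average, over species, of the distance to the closest other species."""
    ordered = sorted(species)
    if len(ordered) < 2:
        raise ValueError("need at least two species")
    nearest = [
        min(dist[frozenset((focal, other))] for other in ordered if other != focal)
        for focal in ordered
    ]
    return sum(nearest) / len(nearest)


# Hypothetical pairwise distances (arbitrary units) among four species.
example_dist = {
    frozenset(("A", "B")): 2.0,
    frozenset(("A", "C")): 6.0,
    frozenset(("B", "C")): 6.0,
    frozenset(("A", "D")): 10.0,
    frozenset(("B", "D")): 10.0,
    frozenset(("C", "D")): 10.0,
}

core = {"A", "B", "C", "D"}   # assemblage with a distantly related species
edge = {"A", "B", "C"}        # same assemblage after losing species D

print(mean_pairwise_distance(core, example_dist),
      mean_pairwise_distance(edge, example_dist))
print(mean_nearest_taxon_distance(core, example_dist),
      mean_nearest_taxon_distance(edge, example_dist))
```

    In this toy data set, dropping the distantly related species D lowers both summaries, which illustrates why the two metrics are reported separately: they respond to different parts of the tree and need not move in the same direction in real assemblages.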

  12. Phylogenetic incongruence in the Drosophila melanogaster species group

    PubMed Central

    Wong, Alex; Jensen, Jeffrey D.; Pool, John E.; Aquadro, Charles F.

    2007-01-01

    Drosophila melanogaster and its close relatives are used extensively in comparative biology. Despite the importance of phylogenetic information for such studies, relationships between some melanogaster species group members are unclear due to conflicting phylogenetic signals at different loci. In this study, we use twelve nuclear loci (eleven coding and one non-coding) to assess the degree of phylogenetic incongruence in this model system. We focus on two nodes: (1) The node joining the D. erecta-D. orena, D. melanogaster-D. simulans, and D. yakuba-D. teissieri lineages, and (2) The node joining the lineages leading to the melanogaster, takahashii, and eugracilis subgroups. We find limited evidence for incongruence at the first node; our data, as well as those of several previous studies, strongly support monophyly of a clade consisting of D. erecta-D. orena and D. yakuba-D. teissieri. By contrast, using likelihood based tests of congruence, we find robust evidence for topological incongruence at the second node. Different loci support different relationships among the melanogaster, takahashii and eugracilis subgroups, and the observed incongruence is not easily attributable to homoplasy, non-equilibrium base composition, or positive selection on a subset of loci. We argue that lineage sorting in the common ancestor of these three subgroups is the most plausible explanation for our observations. Such lineage sorting may lead to biased estimation of tree topology and evolutionary rates, and may confound inferences of positive selection. PMID:17071113

  13. A program to compute the soft Robinson-Foulds distance between phylogenetic networks.

    PubMed

    Lu, Bingxin; Zhang, Louxin; Leong, Hon Wai

    2017-03-14

    Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both problems are NP-complete. A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson-Foulds distance between phylogenetic networks. Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, our simulation data suggest that the distribution of the Soft Robinson-Foulds distance between phylogenetic networks is unlikely to be normal.

  14. Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.

    PubMed

    Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M

    2012-02-01

    The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).

  15. Simultaneously estimating evolutionary history and repeated traits phylogenetic signal: applications to viral and host phenotypic evolution

    PubMed Central

    Vrancken, Bram; Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Longdon, Ben; Günthard, Huldrych F.; Suchard, Marc A.

    2014-01-01

    Phylogenetic signal quantifies the degree to which resemblance in continuously-valued traits reflects phylogenetic relatedness. Measures of phylogenetic signal are widely used in ecological and evolutionary research, and are recently gaining traction in viral evolutionary studies. Standard estimators of phylogenetic signal frequently condition on data summary statistics of the repeated trait observations and fixed phylogenetic trees, resulting in information loss and potential bias. To incorporate the observation process and phylogenetic uncertainty in a model-based approach, we develop a novel Bayesian inference method to simultaneously estimate the evolutionary history and phylogenetic signal from molecular sequence data and repeated multivariate traits. Our approach builds upon a phylogenetic diffusion framework that models continuous trait evolution as a Brownian motion process and incorporates Pagel’s λ transformation parameter to estimate dependence among traits. We provide a computationally efficient inference implementation in the BEAST software package. We evaluate the synthetic performance of the Bayesian estimator of phylogenetic signal against standard estimators, and demonstrate the use of our coherent framework to address several virus-host evolutionary questions, including virulence heritability for HIV, antigenic evolution in influenza and HIV, and Drosophila sensitivity to sigma virus infection. Finally, we discuss model extensions that will make useful contributions to our flexible framework for simultaneously studying sequence and trait evolution. PMID:25780554
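
    As a rough illustration of Pagel's λ, the Python sketch below applies the transformation in its usual matrix form: under Brownian motion the covariance between two tips is proportional to their shared branch length, and λ rescales those off-diagonal covariances while leaving the tip variances on the diagonal unchanged. This is not the BEAST implementation described in the abstract. The function name, the restriction of λ to the interval [0, 1], and the example matrix are assumptions made for illustration.

```python
def pagel_lambda_transform(vcv, lam):
    """Scale off-diagonal entries of a phylogenetic covariance matrix by lam.

    vcv is a square list-of-lists whose entry [i][j] is the branch length
    shared by tips i and j (the diagonal holds root-to-tip depths).
    lam = 1 leaves the matrix unchanged; lam = 0 removes all covariance,
    which corresponds to traits evolving independently of the tree.
    """
    if not 0.0 <= lam <= 1.0:
        # Assumption of this sketch: lam is restricted to [0, 1].
        raise ValueError("lam is expected to lie in [0, 1] in this sketch")
    size = len(vcv)
    if any(len(row) != size for row in vcv):
        raise ValueError("vcv must be a square matrix")
    return [
        [vcv[i][j] if i == j else lam * vcv[i][j] for j in range(size)]
        for i in range(size)
    ]


# Hypothetical three-tip tree: tips 0 and 1 share 2 units of history,
# tip 2 branches off at the root; all tips are at depth 3.
example_vcv = [
    [3.0, 2.0, 0.0],
    [2.0, 3.0, 0.0],
    [0.0, 0.0, 3.0],
]

for row in pagel_lambda_transform(example_vcv, 0.5):
    print(row)
```

    With λ = 0.5 the covariance between tips 0 and 1 drops from 2.0 to 1.0 while every variance stays at 3.0, so intermediate values of λ express partial phylogenetic signal.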

  16. RPS8—a New Informative DNA Marker for Phylogeny of Babesia and Theileria Parasites in China

    PubMed Central

    Tian, Zhan-Cheng; Liu, Guang-Yuan; Yin, Hong; Luo, Jian-Xun; Guan, Gui-Quan; Luo, Jin; Xie, Jun-Ren; Shen, Hui; Tian, Mei-Yuan; Zheng, Jin-feng; Yuan, Xiao-song; Wang, Fang-fang

    2013-01-01

    Piroplasmosis is a serious debilitating and sometimes fatal disease. Phylogenetic relationships within piroplasmida are complex and remain unclear. We compared the intron–exon structure and DNA sequences of the RPS8 gene from Babesia and Theileria spp. isolates in China. Similar to 18S rDNA, the 40S ribosomal protein S8 gene, RPS8, including both coding and non-coding regions is a useful and novel genetic marker for defining species boundaries and for inferring phylogenies because it tends to have little intra-specific variation but considerable inter-specific difference. However, more samples are needed to verify the usefulness of the RPS8 (coding and non-coding regions) gene as a marker for the phylogenetic position and detection of most Babesia and Theileria species, particularly for some closely related species. PMID:24244571

  17. rbcL gene sequences provide evidence for the evolutionary lineages of leptosporangiate ferns.

    PubMed

    Hasebe, M; Omori, T; Nakazawa, M; Sano, T; Kato, M; Iwatsuki, K

    1994-06-07

    Pteridophytes have a longer evolutionary history than any other vascular land plant and, therefore, have endured greater loss of phylogenetically informative information. This factor has resulted in substantial disagreements in evaluating characters and, thus, controversy in establishing a stable classification. To compare competing classifications, we obtained DNA sequences of a chloroplast gene. The sequence of 1206 nt of the large subunit of the ribulose-bisphosphate carboxylase gene (rbcL) was determined from 58 species, representing almost all families of leptosporangiate ferns. Phylogenetic trees were inferred by the neighbor-joining and the parsimony methods. The two methods produced almost identical phylogenetic trees that provided insights concerning major general evolutionary trends in the leptosporangiate ferns. Interesting findings were as follows: (i) two morphologically distinct heterosporous water ferns, Marsilea and Salvinia, are sister genera; (ii) the tree ferns (Cyatheaceae, Dicksoniaceae, and Metaxyaceae) are monophyletic; and (iii) polypodioids are distantly related to the gleichenioids in spite of the similarity of their exindusiate soral morphology and are close to the higher indusiate ferns. In addition, the affinities of several "problematic genera" were assessed.

  18. A worldview of root traits: the influence of ancestry, growth form, climate and mycorrhizal association on the functional trait variation of fine-root tissues in seed plants.

    PubMed

    Valverde-Barrantes, Oscar J; Freschet, Grégoire T; Roumet, Catherine; Blackwood, Christopher B

    2017-09-01

    Fine-root traits play key roles in ecosystem processes, but the drivers of fine-root trait diversity remain poorly understood. The plant economic spectrum (PES) hypothesis predicts that leaf and root traits evolved in coordination. Mycorrhizal association type, plant growth form and climate may also affect root traits. However, the extent to which these controls are confounded with phylogenetic structuring remains unclear. Here we compiled information about root and leaf traits for > 600 species. Using phylogenetic relatedness, climatic ranges, growth form and mycorrhizal associations, we quantified the importance of these factors in the global distribution of fine-root traits. Phylogenetic structuring accounts for most of the variation for all traits excepting root tissue density, with root diameter and nitrogen concentration showing the strongest phylogenetic signal and specific root length showing intermediate values. Climate was the second most important factor, whereas mycorrhizal type had little effect. Substantial trait coordination occurred between leaves and roots, but the strength varied between growth forms and clades. Our analyses provide evidence that the integration of roots and leaves in the PES requires better accounting of the variation in traits across phylogenetic clades. Inclusion of phylogenetic information provides a powerful framework for predictions of belowground functional traits at global scales. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  19. Phylogenetic and Evolutionary Patterns in Microbial Carotenoid Biosynthesis Are Revealed by Comparative Genomics

    PubMed Central

    Klassen, Jonathan L.

    2010-01-01

    Background: Carotenoids are multifunctional, taxonomically widespread and biotechnologically important pigments. Their biosynthesis serves as a model system for understanding the evolution of secondary metabolism. Microbial carotenoid diversity and evolution have hitherto been analyzed primarily from structural and biosynthetic perspectives, with the few phylogenetic analyses of microbial carotenoid biosynthetic proteins either using limited datasets or lacking methodological rigor. Given the recent accumulation of microbial genome sequences, a reappraisal of microbial carotenoid biosynthetic diversity and evolution from the perspective of comparative genomics is warranted to validate and complement models of microbial carotenoid diversity and evolution based upon structural and biosynthetic data. Methodology/Principal Findings: Comparative genomics were used to identify and analyze in silico microbial carotenoid biosynthetic pathways. Four major phylogenetic lineages of carotenoid biosynthesis are suggested, composed of: (i) Proteobacteria; (ii) Firmicutes; (iii) Chlorobi, Cyanobacteria and photosynthetic eukaryotes; and (iv) Archaea, Bacteroidetes and two separate sub-lineages of Actinobacteria. Using this phylogenetic framework, specific evolutionary mechanisms are proposed for carotenoid desaturase CrtI-family enzymes and carotenoid cyclases. Several phylogenetic lineage-specific evolutionary mechanisms are also suggested, including: (i) horizontal gene transfer; (ii) gene acquisition followed by differential gene loss; (iii) co-evolution with other biochemical structures such as proteorhodopsins; and (iv) positive selection. Conclusions/Significance: Comparative genomics analyses of microbial carotenoid biosynthetic proteins indicate a much greater taxonomic diversity than that identified based on structural and biosynthetic data, and divide microbial carotenoid biosynthesis into several, well-supported phylogenetic lineages not evident previously. This phylogenetic framework is applicable to understanding the evolution of specific carotenoid biosynthetic proteins or the unique characteristics of carotenoid biosynthetic evolution in a specific phylogenetic lineage. Together, these analyses suggest a “bramble” model for microbial carotenoid biosynthesis whereby later biosynthetic steps exhibit greater evolutionary plasticity and reticulation compared to those closer to the biosynthetic “root”. Structural diversification may be constrained (“trimmed”) where selection is strong, but less so where selection is weaker. These analyses also highlight likely productive avenues for future research and bioprospecting by identifying both gaps in current knowledge and taxa which may particularly facilitate carotenoid diversification. PMID:20582313

  20. Phylogenetic rooting using minimal ancestor deviation.

    PubMed

    Tria, Fernando Domingues Kümmel; Landan, Giddy; Dagan, Tal

    2017-06-19

    Ancestor-descendent relations play a cardinal role in evolutionary theory. Those relations are determined by rooting phylogenetic trees. Existing rooting methods are hampered by evolutionary rate heterogeneity or the unavailability of auxiliary phylogenetic information. Here we present a rooting approach, the minimal ancestor deviation (MAD) method, which accommodates heterotachy by using all pairwise topological and metric information in unrooted trees. We demonstrate the performance of the method, in comparison to existing rooting methods, by the analysis of phylogenies from eukaryotes and prokaryotes. MAD correctly recovers the known root of eukaryotes and uncovers evidence for the origin of cyanobacteria in the ocean. MAD is more robust and consistent than existing methods, provides measures of the root inference quality and is applicable to any tree with branch lengths.

  1. The Use of Phylogeny to Interpret Cross-Cultural Patterns in Plant Use and Guide Medicinal Plant Discovery: An Example from Pterocarpus (Leguminosae)

    PubMed Central

    Saslis-Lagoudakis, C. Haris; Klitgaard, Bente B.; Forest, Félix; Francis, Louise; Savolainen, Vincent; Williamson, Elizabeth M.; Hawkins, Julie A.

    2011-01-01

    Background: The study of traditional knowledge of medicinal plants has led to discoveries that have helped combat diseases and improve healthcare. However, the development of quantitative measures that can assist our quest for new medicinal plants has not greatly advanced in recent years. Phylogenetic tools have entered many scientific fields in the last two decades to provide explanatory power, but have been overlooked in ethnomedicinal studies. Several studies show that medicinal properties are not randomly distributed in plant phylogenies, suggesting that phylogeny shapes ethnobotanical use. Nevertheless, empirical studies that explicitly combine ethnobotanical and phylogenetic information are scarce. Methodology/Principal Findings: In this study, we borrowed tools from community ecology phylogenetics to quantify the significance of phylogenetic signal in medicinal properties in plants and identify nodes on phylogenies with high bioscreening potential. To do this, we produced an ethnomedicinal review from extensive literature research and a multi-locus phylogenetic hypothesis for the pantropical genus Pterocarpus (Leguminosae: Papilionoideae). We demonstrate that species used to treat certain conditions, such as malaria, are significantly phylogenetically clumped and we highlight nodes in the phylogeny that are significantly overabundant in species used to treat certain conditions. These cross-cultural patterns in ethnomedicinal usage in Pterocarpus are interpreted in the light of phylogenetic relationships. Conclusions/Significance: This study provides techniques that enable the application of phylogenies in bioscreening, but also sheds light on the processes that shape cross-cultural ethnomedicinal patterns. This community phylogenetic approach demonstrates that similar ethnobotanical uses can arise in parallel in different areas where related plants are available. With a vast amount of ethnomedicinal and phylogenetic information available, we predict that this field, after further refinement of the techniques, will expand into similar research areas, such as pest management or the search for bioactive plant-based compounds. PMID:21789247

  2. Calibrated birth-death phylogenetic time-tree priors for Bayesian inference.

    PubMed

    Heled, Joseph; Drummond, Alexei J

    2015-05-01

    Here we introduce a general class of multiple calibration birth-death tree priors for use in Bayesian phylogenetic inference. All tree priors in this class separate ancestral node heights into a set of "calibrated nodes" and "uncalibrated nodes" such that the marginal distribution of the calibrated nodes is user-specified whereas the density ratio of the birth-death prior is retained for trees with equal values for the calibrated nodes. We describe two formulations, one in which the calibration information informs the prior on ranked tree topologies, through the (conditional) prior, and the other which factorizes the prior on divergence times and ranked topologies, thus allowing uniform, or any arbitrary prior distribution on ranked topologies. Although the first of these formulations has some attractive properties, the algorithm we present for computing its prior density is computationally intensive. However, the second formulation is always faster and computationally efficient for up to six calibrations. We demonstrate the utility of the new class of multiple-calibration tree priors using both small simulations and a real-world analysis and compare the results to existing schemes. The two new calibrated tree priors described in this article offer greater flexibility and control of prior specification in calibrated time-tree inference and divergence time dating, and will remove the need for indirect approaches to the assessment of the combined effect of calibration densities and tree priors in Bayesian phylogenetic inference. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  3. Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene

    PubMed Central

    2017-01-01

    The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts of data are analyzed. In the recent decade, alignment-free approaches have enabled genome-scale phylogenetic inference. Here we evaluate three alignment-free methods: ACS, CVTree, and Kr for phylogenetic inference with 16s rRNA gene data. We use a taxonomic gold standard to compare the accuracy of alignment-free phylogenetic inference with that of common microbiome-wide phylogenetic inference pipelines based on PyNAST and MUSCLE alignments with FastTree and RAxML. We re-simulate fecal communities from Human Microbiome Project data to evaluate the performance of the methods on datasets with properties of real data. Our comparisons show that alignment-free methods are not inferior to alignment-based methods in giving accurate and robust phylogenic trees. Moreover, consensus ensembles of alignment-free phylogenies are superior to those built from alignment-based methods in their ability to highlight community differences in low power settings. In addition, the overall running times of alignment-based and alignment-free phylogenetic inference are comparable. Taken together our empirical results suggest that alignment-free methods provide a viable approach for microbiome-wide phylogenetic inference. PMID:29136663
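    The general idea behind alignment-free comparison can be sketched with a simple k-mer frequency distance. This is only an illustration under stated assumptions: it is not an implementation of ACS, CVTree, or Kr, and the function names (`kmer_profile`, `profile_distance`, `distance_matrix`) and the default `k=3` are hypothetical choices made for the example.

```python
from collections import Counter
from itertools import combinations
import math


def kmer_profile(seq, k=3):
    """Return normalized k-mer frequencies for one sequence."""
    seq = seq.upper()
    if len(seq) < k:
        raise ValueError("sequence is shorter than k")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def profile_distance(p, q):
    """Euclidean distance between two k-mer frequency profiles."""
    keys = set(p) | set(q)
    return math.sqrt(sum((p.get(key, 0.0) - q.get(key, 0.0)) ** 2 for key in keys))


def distance_matrix(seqs, k=3):
    """Pairwise distances for a dict of name -> sequence (no alignment needed)."""
    profiles = {name: kmer_profile(s, k) for name, s in seqs.items()}
    return {
        (a, b): profile_distance(profiles[a], profiles[b])
        for a, b in combinations(sorted(profiles), 2)
    }


if __name__ == "__main__":
    # Toy sequences, not real 16S data.
    toy = {"a": "ACGTACGT", "b": "ACGTACGT", "c": "TTTTTTTT"}
    for pair, d in distance_matrix(toy).items():
        print(pair, round(d, 4))
```

    A distance matrix of this kind could then be passed to any distance-based tree builder (for example neighbor joining); that step is left out here.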

  4. Comparing Ontogenetic and Phylogenetic Stages of Human Development

    ERIC Educational Resources Information Center

    Clarken, Rodney H.

    2005-01-01

    This paper will present evidence to support ontogenetic and phylogenetic parallels and draw from these comparisons to further illuminate our understanding of micro and macro human development. Individual and collective stages of physical, psychological and spiritual development will be compared and their homologous structures examined.…

  5. Patterns and effects of GC3 heterogeneity and parsimony informative sites on the phylogenetic tree of genes.

    PubMed

    Ma, Shuai; Wu, Qi; Hu, Yibo; Wei, Fuwen

    2018-05-20

    The explosive growth in genomic data has provided novel insights into the conflicting signals hidden in phylogenetic trees. Although some studies have explored the effects of the GC content and parsimony informative sites (PIS) on the phylogenetic tree, the effect of the heterogeneity of the GC content at the first/second/third codon position on parsimony informative sites (GC1/2/3 PIS) among different species and the effect of PIS on phylogenetic tree construction remain largely unexplored. Here, we used two different mammal genomic datasets to explore the patterns of GC1/2/3 PIS heterogeneity and the effect of PIS on the phylogenetic tree of genes: (i) all GC1/2/3 PIS have obvious heterogeneity between different mammals, and the levels of heterogeneity are GC3 PIS > GC2 PIS > GC1 PIS; (ii) the number of PIS is positively correlated with the metrics of "good" gene tree topologies, and excluding the third codon position (C3) decreases the quality of gene trees by removing too many PIS. These results provide novel insights into the heterogeneity pattern of GC1/2/3 PIS in mammals and the relationship between GC3/PIS and gene trees. Additionally, it is necessary to carefully consider whether to exclude C3 to improve the quality of gene trees, especially in the super-tree method. Copyright © 2018 Elsevier B.V. All rights reserved.
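    The two quantities discussed above can be illustrated with a minimal sketch. It uses the standard definition of a parsimony informative site (at least two character states, each present in at least two sequences) and computes GC content at third codon positions. It assumes an in-frame coding alignment that starts at codon position 1; the function names are hypothetical, and it does not reproduce the study's own GC1/2/3 PIS pipeline.

```python
from collections import Counter


def is_parsimony_informative(column):
    """True if at least two nucleotides each occur in at least two sequences."""
    counts = Counter(c for c in column.upper() if c in "ACGT")  # ignore gaps/ambiguity
    return sum(1 for n in counts.values() if n >= 2) >= 2


def count_pis(alignment, codon_position=None):
    """Count parsimony informative sites; optionally restrict to codon position 1, 2 or 3."""
    length = len(alignment[0])
    if any(len(s) != length for s in alignment):
        raise ValueError("all aligned sequences must have the same length")
    start = 0 if codon_position is None else codon_position - 1
    step = 1 if codon_position is None else 3
    return sum(
        is_parsimony_informative("".join(s[i] for s in alignment))
        for i in range(start, length, step)
    )


def gc3(seq):
    """Fraction of G/C among unambiguous third codon positions."""
    third = [c for c in seq.upper()[2::3] if c in "ACGT"]
    return sum(c in "GC" for c in third) / len(third) if third else 0.0


if __name__ == "__main__":
    # Toy in-frame alignment of four sequences, two codons each.
    aln = ["ATGGCA", "ATGGCC", "ATGACA", "ATGACC"]
    print("PIS (all sites):", count_pis(aln))
    print("PIS (third positions):", count_pis(aln, codon_position=3))
    print("GC3 per sequence:", [gc3(s) for s in aln])
```

    Comparing `count_pis(aln)` with the count after dropping third positions gives a rough sense of how many informative sites are lost when C3 is excluded.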

  6. EvoDB: a database of evolutionary rate profiles, associated protein domains and phylogenetic trees for PFAM-A

    PubMed Central

    Ndhlovu, Andrew; Durand, Pierre M.; Hazelhurst, Scott

    2015-01-01

    The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. Database URL: http://www.bioinf.wits.ac.za/software/fire/evodb PMID:26140928

  8. Sequencing and comparing whole mitochondrial genomes of animals

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Boore, Jeffrey L.; Macey, J. Robert; Medina, Monica

    2005-04-22

    Comparing complete animal mitochondrial genome sequences is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. Not only are they much more informative than shorter sequences of individual genes for inferring evolutionary relatedness, but these data also provide sets of genome-level characters, such as the relative arrangements of genes, that can be especially powerful. We describe here the protocols commonly used for physically isolating mtDNA, for amplifying these by PCR or RCA, for cloning, sequencing, assembly, validation, and gene annotation, and for comparing both sequences and gene arrangements. On several topics, we offer general observations based on our experiences to date with determining and comparing complete mtDNA sequences.

  9. The spatial sensitivity of the spectral diversity-biodiversity relationship: an experimental test in a prairie grassland.

    PubMed

    Wang, Ran; Gamon, John A; Cavender-Bares, Jeannine; Townsend, Philip A; Zygielbaum, Arthur I

    2018-03-01

    Remote sensing has been used to detect plant biodiversity in a range of ecosystems based on the varying spectral properties of different species or functional groups. However, the most appropriate spatial resolution necessary to detect diversity remains unclear. At coarse resolution, differences among spectral patterns may be too weak to detect. In contrast, at fine resolution, redundant information may be introduced. To explore the effect of spatial resolution, we studied the scale dependence of spectral diversity in a prairie ecosystem experiment at Cedar Creek Ecosystem Science Reserve, Minnesota, USA. Our study involved a scaling exercise comparing synthetic pixels resampled from high-resolution images within manipulated diversity treatments. Hyperspectral data were collected using several instruments on both ground and airborne platforms. We used the coefficient of variation (CV) of spectral reflectance in space as the indicator of spectral diversity and then compared CV at different scales ranging from 1 mm² to 1 m² to conventional biodiversity metrics, including species richness, Shannon's index, Simpson's index, phylogenetic species variation, and phylogenetic species evenness. In this study, higher species richness plots generally had higher CV. CV showed higher correlations with Shannon's index and Simpson's index than did species richness alone, indicating evenness contributed to the spectral diversity. Correlations with species richness and Simpson's index were generally higher than with phylogenetic species variation and evenness measured at comparable spatial scales, indicating weaker relationships between spectral diversity and phylogenetic diversity metrics than with species diversity metrics. High resolution imaging spectrometer data (1 mm² pixels) showed the highest sensitivity to diversity level. With decreasing spatial resolution, the difference in CV between diversity levels decreased and greatly reduced the optical detectability of biodiversity. The optimal pixel size for distinguishing α diversity in these prairie plots appeared to be around 1 mm to 10 cm, a spatial scale similar to the size of an individual herbaceous plant. These results indicate a strong scale-dependence of the spectral diversity-biodiversity relationships, with spectral diversity best able to detect a combination of species richness and evenness, and more weakly detecting phylogenetic diversity. These findings can be used to guide airborne studies of biodiversity and develop more effective large-scale biodiversity sampling methods. ©2018 The Authors Ecological Applications published by Wiley Periodicals, Inc. on behalf of Ecological Society of America.
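    The metrics named in this abstract can be sketched in a few lines. This is an illustrative sketch, not the authors' processing chain: it assumes that spectral CV is the per-band coefficient of variation across pixels averaged over bands, that Simpson's index is reported in its 1 - sum(p²) form, and that natural logarithms are used for Shannon's index. All function names are hypothetical.

```python
import math
from statistics import mean, pstdev


def coefficient_of_variation(values):
    """Population standard deviation divided by the mean."""
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined for a zero mean")
    return pstdev(values) / m


def spectral_cv(pixels):
    """Mean across bands of the spatial CV; pixels is a list of equal-length spectra."""
    bands = list(zip(*pixels))  # regroup reflectance values band by band
    return mean(coefficient_of_variation(band) for band in bands)


def shannon_index(abundances):
    """Shannon's H' = -sum(p * ln p) over species with non-zero abundance."""
    total = sum(abundances)
    return -sum((n / total) * math.log(n / total) for n in abundances if n > 0)


def simpson_index(abundances):
    """Simpson's diversity in the 1 - sum(p^2) form (assumed variant)."""
    total = sum(abundances)
    return 1 - sum((n / total) ** 2 for n in abundances)


if __name__ == "__main__":
    # Two toy pixels with two bands each, and a toy plot with four equally abundant species.
    print("spectral CV:", spectral_cv([[1.0, 2.0], [3.0, 2.0]]))
    print("Shannon:", shannon_index([1, 1, 1, 1]))
    print("Simpson:", simpson_index([1, 1, 1, 1]))
```

    Resampling the same pixels to coarser grids before calling `spectral_cv` would mimic the scaling exercise described above, under the same assumptions.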

  10. Using Genotype Abundance to Improve Phylogenetic Inference

    PubMed Central

    Mesin, Luka; Victora, Gabriel D; Minin, Vladimir N; Matsen, Frederick A

    2018-01-01

    Modern biological techniques enable very dense genetic sampling of unfolding evolutionary histories, and thus frequently sample some genotypes multiple times. This motivates strategies to incorporate genotype abundance information in phylogenetic inference. In this article, we synthesize a stochastic process model with standard sequence-based phylogenetic optimality, and show that tree estimation is substantially improved by doing so. Our method is validated with extensive simulations and an experimental single-cell lineage tracing study of germinal center B cell receptor affinity maturation. PMID:29474671

  11. Genetic Identification of Orientobilharzia turkestanicum from Sheep Isolates in Iran.

    PubMed

    Tabaripour, Reza; Youssefi, Mohammad Reza; Tabaripour, Rabeeh

    2015-01-01

    Adult worms of Orientobilharzia turkestanicum live in the portal veins or intestinal veins of cattle, sheep, goats and many other mammals, causing orientobilharziasis. Orientobilharziasis causes significant economic losses to the livestock industry of Iran. However, there is limited information about the genotypes of O. turkestanicum in Iran. In this study, 30 isolates of O. turkestanicum obtained from sheep were characterized by sequencing the mitochondrial cytochrome c oxidase subunit 1 (cox1) and nicotinamide adenine dinucleotide dehydrogenase subunit 1 (nad1) genes. The mitochondrial cox1 and nad1 DNA was amplified by polymerase chain reaction (PCR), then sequenced and compared with sequences of O. turkestanicum and of other members of the Schistosomatidae available in GenBank. Phylogenetic relationships among them were reconstructed using the maximum parsimony method. The phylogenetic analyses in the present study placed O. turkestanicum within the genus Schistosoma and indicated that O. turkestanicum was phylogenetically closer to the African schistosome group than to the Asian schistosome group. Comparison of the nad1 and cox1 sequences of O. turkestanicum obtained in this study with the corresponding sequences available in GenBank revealed some sequence variations and provided evidence for the presence of microvariants in Iran.

  12. Complete mitochondrial DNA genome of bonnethead shark, Sphyrna tiburo, and phylogenetic relationships among main superorders of modern elasmobranchs

    PubMed Central

    Díaz-Jaimes, Píndaro; Bayona-Vásquez, Natalia J.; Adams, Douglas H.; Uribe-Alcocer, Manuel

    2015-01-01

    Elasmobranchs are one of the most diverse groups in the marine realm, represented by 18 orders, 55 families and about 1200 reported species, but also one of the most vulnerable to exploitation and to climate change. Phylogenetic relationships among the main orders have been controversial since the emergence of the Hypnosqualean hypothesis by Shirai (1992), which considered batoids as a sister group of sharks. The use of the complete mitochondrial DNA (mtDNA) may shed light on the validity of this hypothesis by increasing the number of informative characters. We report the mtDNA genome of the bonnethead shark Sphyrna tiburo, and compare it with the mitogenomes of 48 other species to assess phylogenetic relationships. The mtDNA genome of S. tiburo is quite similar in size to that of congeneric species and also to the reported mtDNA genomes of other Carcharhinidae species. Like most vertebrate mitochondrial genomes, it contained 13 protein-coding genes, two rRNA genes, 22 tRNA genes and a control region (D-loop) of 1086 bp. The Bayesian analysis of the 49 mitogenomes supported the view that sharks and batoids are separate groups. PMID:27014583

  13. Integrated analyses using RNA-Seq data reveal viral genomes, single nucleotide variations, the phylogenetic relationship, and recombination for Apple stem grooving virus.

    PubMed

    Jo, Yeonhwa; Choi, Hoseong; Kim, Sang-Min; Kim, Sun-Lim; Lee, Bong Choon; Cho, Won Kyong

    2016-08-09

    Next-generation sequencing (NGS) provides many possibilities for plant virology research. In this study, we performed integrated analyses using plant transcriptome data for plant virus identification using Apple stem grooving virus (ASGV) as an exemplar virus. We used 15 publicly available transcriptome libraries from three different studies, two mRNA-Seq studies and a small RNA-Seq study. We de novo assembled nearly complete genomes of ASGV isolates Fuji and Cuiguan from apple and pear transcriptomes, respectively, and identified single nucleotide variations (SNVs) of ASGV within the transcriptomes. We demonstrated the application of NGS raw data to confirm viral infections in the plant transcriptomes. In addition, we compared the usability of two de novo assemblers, Trinity and Velvet, for virus identification and genome assembly. A phylogenetic tree revealed that ASGV and Citrus tatter leaf virus (CTLV) are the same virus, which was divided into two clades. Recombination analyses identified six recombination events from 21 viral genomes. Taken together, our in silico analyses using NGS data provide a successful application of plant transcriptomes to reveal extensive information associated with viral genome assembly, SNVs, phylogenetic relationships, and genetic recombination.

  14. Effects of Phylogenetic Tree Style on Student Comprehension

    NASA Astrophysics Data System (ADS)

    Dees, Jonathan Andrew

    Phylogenetic trees are powerful tools of evolutionary biology that have become prominent across the life sciences. Consequently, learning to interpret and reason from phylogenetic trees is now an essential component of biology education. However, students often struggle to understand these diagrams, even after explicit instruction. One factor that has been observed to affect student understanding of phylogenetic trees is style (i.e., diagonal or bracket). The goal of this dissertation research was to systematically explore effects of style on student interpretations and construction of phylogenetic trees in the context of an introductory biology course. Before instruction, students were significantly more accurate with bracket phylogenetic trees for a variety of interpretation and construction tasks. Explicit instruction that balanced the use of diagonal and bracket phylogenetic trees mitigated some, but not all, style effects. After instruction, students were significantly more accurate for interpretation tasks involving taxa relatedness and construction exercises when using the bracket style. Based on this dissertation research and prior studies on style effects, I advocate for introductory biology instructors to use only the bracket style. Future research should examine causes of style effects and variables other than style to inform the development of research-based instruction that best supports student understanding of phylogenetic trees.

  15. Relationships among pest flour beetles of the genus Tribolium (Tenebrionidae) inferred from multiple molecular markers

    PubMed Central

    Angelini, David R.; Jockusch, Elizabeth L.

    2008-01-01

    Model species often provide initial hypotheses and tools for studies of development, genetics, and molecular evolution in closely related species. Flour beetles of the genus Tribolium MacLeay (1825) are one group with potential for such comparative studies. Tribolium castaneum (Herbst 1797) is an increasingly useful developmental genetic system. The convenience with which congeneric and other species of tenebrionid flour beetles can be reared in the laboratory makes this group attractive for comparative studies on a small phylogenetic scale. Here we present the results of phylogenetic analyses of relationships among the major pest species of Tribolium based on two mitochondrial and three nuclear markers (cytochrome oxidase 1, 16S ribosomal DNA, wingless, 28S ribosomal DNA, histone H3). The utility of partitioning the dataset in a manner informed by biological structure and function is demonstrated by comparing various partitioning strategies. In parsimony and partitioned Bayesian analyses of the combined dataset, the castaneum and confusum species groups are supported as monophyletic and as each other’s closest relatives. However, a sister group relationship between this clade and Tribolium brevicornis (Leconte 1859) is not supported. Therefore, we suggest transferring brevicornis group species to the genus Aphanotus Leconte (1862). The inferred phylogeny provides an evolutionary framework for comparative studies using flour beetles. PMID:18024090

  16. High School Students' Learning and Perceptions of Phylogenetics of Flowering Plants

    ERIC Educational Resources Information Center

    Bokor, Julie R.; Landis, Jacob B.; Crippen, Kent J.

    2014-01-01

    Basic phylogenetics and associated "tree thinking" are often minimized or excluded in formal school curricula. Informal settings provide an opportunity to extend the K-12 school curriculum, introducing learners to new ideas, piquing interest in science, and fostering scientific literacy. Similarly, university researchers participating in…

  17. Comparative endocrinology of leptin: Assessing function in a phylogenetic context

    PubMed Central

    Londraville, Richard L.; Macotela, Yazmin; Duff, Robert J.; Easterling, Marietta R.; Liu, Qin; Crespi, Erica J.

    2014-01-01

    As we approach the end of two decades of leptin research, the comparative biology of leptin is just beginning. We now have several leptin orthologs described from nearly every major clade among vertebrates, and are moving beyond gene descriptions to functional studies. Even at this early stage, it is clear that non-mammals display clear functional similarities and differences with their better-studied mammalian counterparts. This review assesses what we know about leptin function in mammals and non-mammals, and gives examples of how these data can inform leptin biology in humans. PMID:24525452

  18. A phylogenetic comparative study of flowering phenology along an elevational gradient in the Canadian subarctic.

    PubMed

    Lessard-Therrien, Malie; Davies, T Jonathan; Bolmgren, Kjell

    2014-05-01

    Climate change is affecting high-altitude and high-latitude communities in significant ways. In the short growing season of subarctic habitats, it is essential that the timing and duration of phenological phases match favorable environmental conditions. We explored the time of the first appearance of flowers (first flowering day, FFD) and flowering duration across subarctic species composing different communities, from boreal forest to tundra, along an elevational gradient (600-800 m). The study was conducted on Mount Irony (856 m), North-East Canada (54°90'N, 67°16'W) during summer 2012. First, we quantified phylogenetic signal in FFD at different spatial scales. Second, we used phylogenetic comparative methods to explore the relationship between FFD, flowering duration, and elevation. We found that the phylogenetic signal for FFD was stronger at finer spatial scales and at lower elevations, indicating that closely related species tend to flower at similar times when the local environment is less harsh. The comparatively weaker phylogenetic signal at higher elevation may be indicative of convergent evolution for FFD. Flowering duration was correlated significantly with mean FFD, with later-flowering species having a longer flowering duration, but only at the lowest elevation. Our results indicate significant evolutionary conservatism in responses to phenological cues, but high phenotypic plasticity in flowering times. We suggest that phylogenetic relationships should be considered in the search for predictions and drivers of flowering time in comparative analyses, because species cannot be considered as statistically independent. Further, phenological drivers should be measured at spatial scales such that variation in flowering matches variation in environment.

  19. Phylogenetic shadowing of primate sequences to find functional regions of the human genome.

    PubMed

    Boffelli, Dario; McAuliffe, Jon; Ovcharenko, Dmitriy; Lewis, Keith D; Ovcharenko, Ivan; Pachter, Lior; Rubin, Edward M

    2003-02-28

    Nonhuman primates represent the most relevant model organisms to understand the biology of Homo sapiens. The recent divergence and associated overall sequence conservation between individual members of this taxon have nonetheless largely precluded the use of primates in comparative sequence studies. We used sequence comparisons of an extensive set of Old World and New World monkeys and hominoids to identify functional regions in the human genome. Analysis of these data enabled the discovery of primate-specific gene regulatory elements and the demarcation of the exons of multiple genes. Much of the information content of the comprehensive primate sequence comparisons could be captured with a small subset of phylogenetically close primates. These results demonstrate the utility of intraprimate sequence comparisons to discover common mammalian as well as primate-specific functional elements in the human genome, which are unattainable through the evaluation of more evolutionarily distant species.

  20. A gharial from the Oligocene of Puerto Rico: transoceanic dispersal in the history of a non-marine reptile

    PubMed Central

    Vélez-Juarbe, Jorge; Brochu, Christopher A; Santos, Hernán

    2007-01-01

    The Indian gharial (Gavialis gangeticus) is not found in saltwater, but the geographical distribution of fossil relatives suggests a derivation from ancestors that lived in, or were at least able to withstand, saline conditions. Here, we describe a new Oligocene gharial, Aktiogavialis puertoricensis, from deltaic–coastal deposits of northern Puerto Rico. It is related to a clade of Neogene gharials otherwise restricted to South America. Its geological and geographical settings, along with its phylogenetic relationships, are consistent with two scenarios: (i) that a single trans-Atlantic dispersal event during the Tertiary explains the South American Neogene gharial assemblage and (ii) that stem gharials were coastal animals and their current restriction to freshwater settings is a comparatively recent environmental shift for the group. This discovery highlights the importance of including fossil information in a phylogenetic context when assessing the ecological history of modern organisms. PMID:17341454

  1. Physiological mechanisms of thermoregulation in reptiles: a review.

    PubMed

    Seebacher, Frank; Franklin, Craig E

    2005-11-01

    The thermal dependence of biochemical reaction rates means that many animals regulate their body temperature so that fluctuations in body temperature are small compared to environmental temperature fluctuations. Thermoregulation is a complex process that involves sensing of the environment, and subsequent processing of the environmental information. We suggest that the physiological mechanisms that facilitate thermoregulation transcend phylogenetic boundaries. Reptiles are primarily used as model organisms for ecological and evolutionary research and, unlike in mammals, the physiological basis of many aspects in thermoregulation remains obscure. Here, we review recent research on regulation of body temperature, thermoreception, body temperature set-points, and cardiovascular control of heating and cooling in reptiles. The aim of this review is to place physiological thermoregulation of reptiles in a wider phylogenetic context. Future research on reptilian thermoregulation should focus on the pathways that connect peripheral sensing to central processing which will ultimately lead to the thermoregulatory response.

  2. Autoregressive models for estimating phylogenetic and environmental effects: accounting for within-species variations.

    PubMed

    Cornillon, P A; Pontier, D; Rochet, M J

    2000-02-21

    Comparative methods are used to investigate the attributes of present species or higher taxa. Difficulties arise from the phylogenetic heritage: taxa are not independent and neglecting phylogenetic inertia can lead to inaccurate results. Within-species variations in life-history traits are also not negligible, but most comparative methods are not designed to take them into account. Taxa are generally described by a single value for each trait. We have developed a new model which permits the incorporation of both the phylogenetic relationships among populations and within-species variations. This is an extension of classical autoregressive models. This family of models was used to study the effect of fishing on six demographic traits measured on 77 populations of teleost fishes. Copyright 2000 Academic Press.

  3. Phylogenetic system and zoogeography of the Plecoptera.

    PubMed

    Zwick, P

    2000-01-01

    Information about the phylogenetic relationships of Plecoptera is summarized. The few characters supporting monophyly of the order are outlined. Several characters of possible significance for the search for the closest relatives of the stoneflies are discussed, but the sister-group of the order remains unknown. Numerous characters supporting the presently recognized phylogenetic system of Plecoptera are presented, alternative classifications are discussed, and suggestions for future studies are made. Notes on zoogeography are appended. The order as such is old (Permian fossils), but phylogenetic relationships and global distribution patterns suggest that evolution of the extant suborders started with the breakup of Pangaea. There is evidence of extensive recent speciation in all parts of the world.

  4. Further Effects of Phylogenetic Tree Style on Student Comprehension in an Introductory Biology Course.

    PubMed

    Dees, Jonathan; Bussard, Caitlin; Momsen, Jennifer L

    2018-06-01

    Phylogenetic trees have become increasingly important across the life sciences, and as a result, learning to interpret and reason from these diagrams is now an essential component of biology education. Unfortunately, students often struggle to understand phylogenetic trees. Style (i.e., diagonal or bracket) is one factor that has been observed to impact how students interpret phylogenetic trees, and one goal of this research was to investigate these style effects across an introductory biology course. In addition, we investigated the impact of instruction that integrated diagonal and bracket phylogenetic trees equally. Before instruction, students were significantly more accurate with the bracket style for a variety of interpretation and construction tasks. After instruction, however, students were significantly more accurate only for construction tasks and interpretations involving taxa relatedness when using the bracket style. Thus, instruction that used both styles equally mitigated some, but not all, style effects. These results inform the development of research-based instruction that best supports student understanding of phylogenetic trees.

  5. An ordination of life histories using morphological proxies: capital vs. income breeding in insects.

    PubMed

    Davis, Robert B; Javoiš, Juhan; Kaasik, Ants; Õunap, Erki; Tammaru, Toomas

    2016-08-01

    Predictive classifications of life histories are essential for evolutionary ecology. While attempts to apply a single approach to all organisms may be overambitious, recent advances suggest that more narrow ordination schemes can be useful. However, these schemes mostly lack easily observable proxies of the position of a species on respective axes. It has been proposed that, in insects, the degree of capital (vs. income) breeding, reflecting the importance of adult feeding for reproduction, correlates with various ecological traits at the level of among-species comparison. We sought to prove these ideas via rigorous phylogenetic comparative analyses. We used experimentally derived life-history data for 57 species of European Geometridae (Lepidoptera), and an original phylogenetic reconstruction. The degree of capital breeding was estimated based on morphological proxies, including relative abdomen size of females. Applying Brownian-motion-based comparative analyses (with an original update to include error estimates), we demonstrated the associations between the degree of capital breeding and larval diet breadth, sexual size dimorphism, and reproductive season. Ornstein-Uhlenbeck model based phylogenetic analysis suggested a causal relationship between the degree of capital breeding and diet breadth. Our study indicates that the gradation from capital to income breeding is an informative axis to ordinate life-history strategies in flying insects which are affected by the fecundity vs. mobility trade off, with the availability of easy to record proxies contributing to its predictive power in practical contexts. © 2016 by the Ecological Society of America.

  6. The origin and evolution of Basigin(BSG) gene: A comparative genomic and phylogenetic analysis.

    PubMed

    Zhu, Xinyan; Wang, Shenglan; Shao, Mingjie; Yan, Jie; Liu, Fei

    2017-07-01

    Basigin (BSG), also known as extracellular matrix metalloproteinase inducer (EMMPRIN) or cluster of differentiation 147 (CD147), plays various fundamental roles in the intercellular recognition involved in immunologic phenomena, differentiation, and development. In this study, we aimed to compare the similarities and differences of BSG among organisms and explore possible evolutionary relationships based on the comparison result. We used the extensive BLAST tool to search the metazoan genomes, N-glycosylation sites, the transmembrane region and other functional sites. We then identified BSG homologs from genomic sequences and analyzed their phylogenetic relationships. We identified that BSG genes exist not only in the vertebrate metazoans but also in the invertebrate metazoans such as Amphioxus B. floridae, D. melanogaster, A. mellifera, S. japonicum, C. gigas, and T. patagoniensis. After sequence analysis, we confirmed that only vertebrate metazoans and Cephalochordate (amphioxus B. floridae) have the classic structure (a signal peptide, two Ig-like domains (IgC2 and IgI), a transmembrane region, and an intracellular domain). The invertebrate metazoans (excluding amphioxus B. floridae) lack the N-terminal signal peptides and IgC2 domain. We then generated a phylogenetic tree, genome organization comparison, and chromosomal disposition analysis based on the biological information obtained from the NCBI and Ensembl databases. Finally, we established the possible evolutionary scenario of the BSG gene, which showed the restricted exon rearrangement that has occurred during evolution, forming the present-day BSG gene. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Phylogenetic Variation in the Silicon Composition of Plants

    PubMed Central

    HODSON, M. J.; WHITE, P. J.; MEAD, A.; BROADLEY, M. R.

    2005-01-01

    • Background and Aims Silicon (Si) in plants provides structural support and improves tolerance to diseases, drought and metal toxicity. Shoot Si concentrations are generally considered to be greater in monocotyledonous than in non-monocot plant species. The phylogenetic variation in the shoot Si concentration of plants reported in the primary literature has been quantified. • Methods Studies were identified which reported Si concentrations in leaf or non-woody shoot tissues from at least two plant species growing in the same environment. Each study contained at least one species in common with another study. • Key Results Meta-analysis of the data revealed that, in general, ferns, gymnosperms and angiosperms accumulated less Si in their shoots than non-vascular plant species and horsetails. Within angiosperms and ferns, differences in shoot Si concentration between species grouped by their higher-level phylogenetic position were identified. Within the angiosperms, species from the commelinoid monocot orders Poales and Arecales accumulated substantially more Si in their shoots than species from other monocot clades. • Conclusions A high shoot Si concentration is not a general feature of monocot species. Information on the phylogenetic variation in shoot Si concentration may provide useful palaeoecological and archaeological information, and inform studies of the biogeochemical cycling of Si and those of the molecular genetics of Si uptake and transport in plants. PMID:16176944

  8. Phylogenetic Information Content of Copepoda Ribosomal DNA Repeat Units: ITS1 and ITS2 Impact

    PubMed Central

    Zagoskin, Maxim V.; Lazareva, Valentina I.; Grishanin, Andrey K.; Mukha, Dmitry V.

    2014-01-01

    The utility of various regions of the ribosomal repeat unit for phylogenetic analysis was examined in 16 species representing four families, nine genera, and two orders of the subclass Copepoda (Crustacea). Fragments approximately 2000 bp in length containing the ribosomal DNA (rDNA) 18S and 28S gene fragments, the 5.8S gene, and the internal transcribed spacer regions I and II (ITS1 and ITS2) were amplified and analyzed. The DAMBE (Data Analysis in Molecular Biology and Evolution) software was used to analyze the saturation of nucleotide substitutions; this test revealed the suitability of both the 28S gene fragment and the ITS1/ITS2 rDNA regions for the reconstruction of phylogenetic trees. Distance (minimum evolution) and probabilistic (maximum likelihood, Bayesian) analyses of the data revealed that the 28S rDNA and the ITS1 and ITS2 regions are informative markers for inferring phylogenetic relationships among families of copepods and within the Cyclopidae family and associated genera. Split-graph analysis of concatenated ITS1/ITS2 rDNA regions of cyclopoid copepods suggested that the Mesocyclops, Thermocyclops, and Macrocyclops genera share complex evolutionary relationships. This study revealed that the ITS1 and ITS2 regions potentially represent different phylogenetic signals. PMID:25215300

  9. Phylogenetic analysis of different breeds of domestic chickens in selected area of Peninsular Malaysia inferred from partial cytochrome b gene information and RAPD markers.

    PubMed

    Yap, Fook Choy; Yan, Yap Jin; Loon, Kiung Teh; Zhen, Justina Lee Ning; Kamau, Nelly Warau; Kumaran, Jayaraj Vijaya

    2010-10-01

    The present investigation was carried out to study the phylogenetic relationships of different breeds of domestic chickens in Peninsular Malaysia, inferred from partial cytochrome b gene information and random amplified polymorphic DNA (RAPD) markers. Phylogenetic analysis using both neighbor-joining (NJ) and maximum parsimony (MP) methods produced three clusters that encompassed Type-I village chickens, the red jungle fowl subspecies and the Japanese Chunky broilers. The phylogenetic analysis also revealed that the majority of the Malaysian commercial chickens were randomly assembled with the Type-II village chickens. In the RAPD assay, phylogenetic analysis using neighbor-joining produced six clusters that were completely distinguished based on the locality of chickens. High levels of genetic variation were observed among the village chickens, the commercial broilers, and between the commercial broilers and layer chickens. In this study, it was found that Type-I village chickens could be distinguished from the commercial chickens and Type-II village chickens at the position of the 27th nucleotide of the 351 bp cytochrome b gene. This study also revealed that RAPD markers were unable to differentiate the type of chickens, but it showed the effectiveness of RAPD in evaluating the genetic variation and the genetic relationships between chicken lines and populations.

  10. Tooth development and histology patterns in lamniform sharks (Elasmobranchii, Lamniformes) revisited.

    PubMed

    Schnetz, Lisa; Pfaff, Cathrin; Kriwet, Jürgen

    2016-12-01

    The dentition of lamniform sharks exhibits several characters that have been used extensively to resolve the phylogenetic relationships of extant taxa, yet some uncertainties remain. Also, the development of different teeth of a tooth file within the jaws of most extant lamniforms has not been documented to date. High-resolution micro-computed tomography is used here to re-evaluate the importance of two dental characters within the order Lamniformes, which were considered not to be phylogenetically informative, the histotype and the number of teeth per tooth file. Additionally, the development and mineralization patterns of the teeth of the two osteodont lamniforms Lamna nasus and Alopias superciliosus were compared. We discuss the importance of these dental characters for phylogenetic interpretations to assess the quality of these characters in resolving lamniform relationships. The dental characters suggest that (1) Lamniformes are the only modern-level sharks exhibiting the osteodont histotype, (2) the osteodont histotype in lamniform sharks is a derived state in modern-level sharks (Elasmobranchii), (3) the osteodont type, conversely, is convergently achieved when the clade Chondrichthyes is considered and thus might comprise a functional rather than a phylogenetic signal, and (4) there is an increase in the number of teeth per file throughout lamniform phylogeny. Structural development of the teeth of L. nasus and A. superciliosus is congruent with a previous investigation of the lamniform shark Carcharodon carcharias. J. Morphol. 277:1584-1598, 2016. © 2016 Wiley Periodicals, Inc.

  11. Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales.

    PubMed

    Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter

    2017-12-01

    Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.

  12. Ignoring heterozygous sites biases phylogenomic estimates of divergence times: implications for the evolutionary history of Microtus voles.

    PubMed

    Lischer, Heidi E L; Excoffier, Laurent; Heckel, Gerald

    2014-04-01

    Phylogenetic reconstruction of the evolutionary history of closely related organisms may be difficult because of the presence of unsorted lineages and of a relatively high proportion of heterozygous sites that are usually not handled well by phylogenetic programs. Genomic data may provide enough fixed polymorphisms to resolve phylogenetic trees, but the diploid nature of sequence data remains analytically challenging. Here, we performed a phylogenomic reconstruction of the evolutionary history of the common vole (Microtus arvalis) with a focus on the influence of heterozygosity on the estimation of intraspecific divergence times. We used genome-wide sequence information from 15 voles distributed across the European range. We provide a novel approach to integrate heterozygous information in existing phylogenetic programs by repeated random haplotype sampling from sequences with multiple unphased heterozygous sites. We evaluated the impact of the use of full, partial, or no heterozygous information for tree reconstructions on divergence time estimates. All results consistently showed four deep and strongly supported evolutionary lineages in the vole data. These lineages undergoing divergence processes split only at the end or after the last glacial maximum based on calibration with radiocarbon-dated paleontological material. However, the incorporation of information from heterozygous sites had a significant impact on absolute and relative branch length estimations. Ignoring heterozygous information led to an overestimation of divergence times between the evolutionary lineages of M. arvalis. We conclude that the exclusion of heterozygous sites from evolutionary analyses may cause biased and misleading divergence time estimates in closely related taxa.
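
    The repeated random haplotype sampling described in this abstract can be illustrated with a minimal sketch. It assumes that unphased heterozygous sites are encoded as two-base IUPAC ambiguity codes; the function names sample_haplotype and sample_replicates are hypothetical and are not taken from the authors' software. Each replicate could then be passed to an existing phylogenetic program that expects haploid sequences.

```python
import random

# Two-base IUPAC ambiguity codes; assumed here to mark unphased heterozygous sites.
IUPAC_HET = {
    "R": "AG", "Y": "CT", "S": "CG",
    "W": "AT", "K": "GT", "M": "AC",
}


def sample_haplotype(seq, rng):
    """Resolve every heterozygous site to one of its two alleles at random."""
    return "".join(rng.choice(IUPAC_HET[b]) if b in IUPAC_HET else b for b in seq)


def sample_replicates(seq, n, seed=0):
    """Draw n random haplotypes from one unphased diploid sequence."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return [sample_haplotype(seq, rng) for _ in range(n)]


# Toy sequence with heterozygous sites at positions 2 (R), 4 (Y) and 7 (W).
replicates = sample_replicates("ACRTYGGW", 5)
```

    Homozygous positions are copied unchanged, so the replicates differ only at the heterozygous sites; variation in tree estimates across replicates then reflects the phasing uncertainty.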

  13. The phylogenetic roots of human lethal violence.

    PubMed

    Gómez, José María; Verdú, Miguel; González-Megías, Adela; Méndez, Marcos

    2016-10-13

    The psychological, sociological and evolutionary roots of conspecific violence in humans are still debated, despite attracting the attention of intellectuals for over two millennia. Here we propose a conceptual approach towards understanding these roots based on the assumption that aggression in mammals, including humans, has a significant phylogenetic component. By compiling sources of mortality from a comprehensive sample of mammals, we assessed the percentage of deaths due to conspecifics and, using phylogenetic comparative tools, predicted this value for humans. The proportion of human deaths phylogenetically predicted to be caused by interpersonal violence stood at 2%. This value was similar to the one phylogenetically inferred for the evolutionary ancestor of primates and apes, indicating that a certain level of lethal violence arises owing to our position within the phylogeny of mammals. It was also similar to the percentage seen in prehistoric bands and tribes, indicating that we were as lethally violent then as common mammalian evolutionary history would predict. However, the level of lethal violence has changed through human history and can be associated with changes in the socio-political organization of human populations. Our study provides a detailed phylogenetic and historical context against which to compare levels of lethal violence observed throughout our history.

  14. Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology.

    PubMed

    Soltis, P S; Soltis, D E; Chase, M W

    1999-11-25

    Comparative biology requires a firm phylogenetic foundation to uncover and understand patterns of diversification and evaluate hypotheses of the processes responsible for these patterns. In the angiosperms, studies of diversification in floral form, stamen organization, reproductive biology, photosynthetic pathway, nitrogen-fixing symbioses and life histories have relied on either explicit or implied phylogenetic trees. Furthermore, to understand the evolution of specific genes and gene families, evaluate the extent of conservation of plant genomes and make proper sense of the huge volume of molecular genetic data available for model organisms such as Arabidopsis, Antirrhinum, maize, rice and wheat, a phylogenetic perspective is necessary. Here we report the results of parsimony analyses of DNA sequences of the plastid genes rbcL and atpB and the nuclear 18S rDNA for 560 species of angiosperms and seven non-flowering seed plants and show a well-resolved and well-supported phylogenetic tree for the angiosperms for use in comparative biology.

  15. Genetic Comparison of B. Anthracis and its Close Relatives Using AFLP and PCR Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, P.J.; Hill, K.K.; Laker, M.T.

    1999-02-01

    Amplified Fragment Length Polymorphism (AFLP) analysis allows a rapid, relatively simple analysis of a large portion of a microbial genome, providing information about the species and its phylogenetic relationship to other microbes (Vos, et al., 1995). The method simply surveys the genome for length and sequence polymorphisms. The pattern identified can be used for comparison to the genomes of other species. Unlike other methods, it does not rely on analysis of a single genetic locus that may bias the interpretation of results and it does not require any prior knowledge of the targeted organism. Moreover, a standard set of reagents can be applied to any species without using species-specific information or molecular probes. The authors are using AFLPs to rapidly identify different bacterial species. A comparison of AFLP profiles generated from a large battery of B. anthracis strains shows very little variability among different isolates (Keim, et al., 1997). By contrast, there is a significant difference between AFLP profiles generated for any B. anthracis strain and even the most closely related Bacillus species. Sufficient variability is apparent among all known microbial species to allow phylogenetic analysis based on large numbers of genetically unlinked loci. These striking differences among AFLP profiles allow unambiguous identification of previously identified species and phylogenetic placement of newly characterized isolates relative to known species based on a large number of independent genetic loci. Data generated thus far show that the method provides phylogenetic analyses that are consistent with other widely accepted phylogenetic methods. However, AFLP analysis provides a more detailed analysis of the targets and samples a much larger portion of the genome. Consequently, it provides an inexpensive, rapid means of characterizing microbial isolates to further differentiate among strains and closely related microbial species. Such information cannot be rapidly generated by other means. AFLP sample analysis quickly generates a very large amount of molecular information about microbial genomes. However, this information cannot be analyzed rapidly using manual methods. The authors are developing a large archive of electronic AFLP signatures that is being used to identify isolates collected from medical, veterinary, forensic and environmental samples. They are also developing the computational packages necessary to rapidly and unambiguously analyze the AFLP profiles and conduct a phylogenetic comparison of these data relative to information already in the database. They will use this archive and the associated algorithms to determine the species identity of previously uncharacterized isolates and place them phylogenetically relative to other microbes based on their AFLP signatures. This study provides significant new information about microbes with environmental, veterinary and medical significance. This information can be used in further studies to understand the relationships among these species and the factors that distinguish them from one another. It should also allow identification of unique factors that contribute to important microbial traits including pathogenicity and virulence. They are also using AFLP data to identify, isolate and sequence DNA fragments that are unique to particular microbial species and strains. The fragment patterns and sequence information provide insights into the complexity and organization of bacterial genomes relative to one another. They also provide the information necessary for development of species-specific PCR primers that can be used to interrogate complex samples for the presence of B. anthracis, other microbial pathogens or their remnants.

  16. Phylogenetic diversity of plants alters the effect of species richness on invertebrate herbivory

    PubMed Central

    2013-01-01

    Long-standing ecological theory proposes that diverse communities of plants should experience a decrease in herbivory. Yet previous empirical examinations of this hypothesis have revealed that plant species richness increases herbivory in just as many systems as it decreases it. In this study, I ask whether more insight into the role of plant diversity in promoting or suppressing herbivory can be gained by incorporating information about the evolutionary history of species in a community. In an old field system in southern Ontario, I surveyed communities of plants and measured levels of leaf damage on 27 species in 38 plots. I calculated a measure of phylogenetic diversity (PSE) that encapsulates information about the amount of evolutionary history represented in each of the plots and looked for a relationship between levels of herbivory and both species richness and phylogenetic diversity using a generalized linear mixed model (GLMM) that could account for variation in herbivory levels between species. I found that species richness was positively associated with herbivore damage at the plot-level, in keeping with the results from several other recent studies on this question. On the other hand, phylogenetic diversity was associated with decreased herbivory. Importantly, there was also an interaction between species richness and phylogenetic diversity, such that plots with the highest levels of herbivory were plots which had many species but only if those species tended to be closely related to one another. I propose that these results are the consequence of interactions with herbivores whose diets are phylogenetically specialized (for which I introduce the term cladophage), and how phylogenetic diversity may alter their realized host ranges. These results suggest that incorporating a phylogenetic perspective can add valuable additional insight into the role of plant diversity in explaining or predicting levels of herbivory at a whole-community scale. PMID:23825795

  17. Mammalian phylogenetic diversity-area relationships at a continental scale

    PubMed Central

    Mazel, Florent; Renaud, Julien; Guilhaumon, François; Mouillot, David; Gravel, Dominique; Thuiller, Wilfried

    2015-01-01

    In analogy to the species-area relationship (SAR), one of the few laws in ecology, the phylogenetic diversity-area relationship (PDAR) describes the tendency of phylogenetic diversity (PD) to increase with area. Although investigating PDAR has the potential to unravel the underlying processes shaping assemblages across spatial scales and to predict PD loss through habitat reduction, it has been little investigated so far. Focusing on PD has noticeable advantages compared to species richness (SR) since PD also gives insights into processes such as speciation/extinction, assembly rules and ecosystem functioning. Here we investigate the universality and pervasiveness of the PDAR at continental scale using terrestrial mammals as a study case. We define the relative robustness of PD (compared to SR) to habitat loss as the area between the standardized PDAR and standardized SAR (i.e. standardized by the diversity of the largest spatial window) divided by the area under the standardized SAR only. This metric quantifies the relative increase of PD robustness compared to SR robustness. We show that PD robustness is higher than SR robustness but that it varies among continents. We further use a null model approach to disentangle the relative effect of phylogenetic tree shape and non-random spatial distribution of evolutionary history on the PDAR. We find that for most spatial scales and for all continents except Eurasia, PDARs are not different from those expected under a model using only the observed SAR and the shape of the phylogenetic tree at continental scale. Interestingly, we detect a strong phylogenetic structure of the Eurasian PDAR that can be predicted by a model that specifically accounts for a finer biogeographical delineation of this continent. In conclusion, the relative robustness of PD to habitat loss compared to species richness is determined by the phylogenetic tree shape but also depends on the spatial structure of PD. PMID:26649401
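
    The relative robustness metric defined in this abstract can be sketched numerically. This is a minimal illustration under stated assumptions, not the authors' implementation: the curves are sampled at increasing window areas with the largest window last, areas are integrated with the trapezoid rule on an untransformed area axis, and the function names and toy values are hypothetical.

```python
def trapezoid_area(x, y):
    """Area under a piecewise-linear curve, by the trapezoid rule."""
    return sum((x[i + 1] - x[i]) * (y[i] + y[i + 1]) / 2 for i in range(len(x) - 1))


def relative_robustness(areas, pd_values, sr_values):
    """(Area between standardized PDAR and SAR) / (area under standardized SAR).

    Assumes `areas` is sorted ascending, so the last entry of each curve is
    the diversity of the largest spatial window used for standardization.
    """
    pd_std = [v / pd_values[-1] for v in pd_values]
    sr_std = [v / sr_values[-1] for v in sr_values]
    gap = [p - s for p, s in zip(pd_std, sr_std)]  # signed PDAR minus SAR
    return trapezoid_area(areas, gap) / trapezoid_area(areas, sr_std)


# Toy curves (made-up numbers): PD saturates faster with area than SR does.
robustness = relative_robustness([1, 2, 4], [20, 30, 40], [10, 20, 40])
```

    A positive value means the standardized PDAR lies above the standardized SAR, i.e. proportionally more PD than SR is retained in small windows; identical standardized curves give zero.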

  18. Phylogenetic placement of two species known only from resting spores: Zoophthora independentia sp. nov. and Z. porteri comb. nov. (Entomophthorales: Entomophthoraceae)

    USDA-ARS's Scientific Manuscript database

    Molecular methods were used to determine the generic placement of two species of Entomophthorales known only from resting spores. Historically, these species would belong in the form-genus Tarichium, but this classification provides no information about phylogenetic relationships. Using DNA from res...

  19. Mitochondrial DNA haplogroup phylogeny of the dog: Proposal for a cladistic nomenclature.

    PubMed

    Fregel, Rosa; Suárez, Nicolás M; Betancor, Eva; González, Ana M; Cabrera, Vicente M; Pestano, José

    2015-05-01

    Canis lupus familiaris mitochondrial DNA analysis has increased in recent years, not only for the purpose of deciphering dog domestication but also for forensic genetic studies or breed characterization. The resultant accumulation of data has increased the need for a normalized and phylogenetic-based nomenclature like those provided for human maternal lineages. Although a standardized classification has been proposed, haplotype names within clades have been assigned gradually without considering the evolutionary history of dog mtDNA. Moreover, this classification is based only on the D-loop region, proven to be insufficient for phylogenetic purposes due to its high number of recurrent mutations and the lack of relevant information present in the coding region. In this study, we design 1) a refined mtDNA cladistic nomenclature from a phylogenetic tree based on complete sequences, classifying dog maternal lineages into haplogroups defined by specific diagnostic mutations, and 2) a coding region SNP analysis that allows a more accurate classification into haplogroups when combined with D-loop sequencing, thus improving the phylogenetic information obtained in dog mitochondrial DNA studies. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. Phylogenetic analysis reveals a scattered distribution of autumn colours

    PubMed Central

    Archetti, Marco

    2009-01-01

    Background and Aims Leaf colour in autumn is rarely considered informative for taxonomy, but there is now growing interest in the evolution of autumn colours and different hypotheses are debated. Research efforts are hindered by the lack of basic information: the phylogenetic distribution of autumn colours. It is not known when and how autumn colours evolved. Methods Data are reported on the autumn colours of 2368 tree species belonging to 400 genera of the temperate regions of the world, and an analysis is made of their phylogenetic relationships in order to reconstruct the evolutionary origin of red and yellow in autumn leaves. Key Results Red autumn colours are present in at least 290 species (70 genera), and evolved independently at least 25 times. Yellow is present independently from red in at least 378 species (97 genera) and evolved at least 28 times. Conclusions The phylogenetic reconstruction suggests that autumn colours have been acquired and lost many times during evolution. This scattered distribution could be explained by hypotheses involving some kind of coevolutionary interaction or by hypotheses that rely on the need for photoprotection. PMID:19126636

  1. Comparative transcriptomics of early dipteran development

    PubMed Central

    2013-01-01

    Background Modern sequencing technologies have massively increased the amount of data available for comparative genomics. Whole-transcriptome shotgun sequencing (RNA-seq) provides a powerful basis for comparative studies. In particular, this approach holds great promise for emerging model species in fields such as evolutionary developmental biology (evo-devo). Results We have sequenced early embryonic transcriptomes of two non-drosophilid dipteran species: the moth midge Clogmia albipunctata, and the scuttle fly Megaselia abdita. Our analysis includes a third, published, transcriptome for the hoverfly Episyrphus balteatus. These emerging models for comparative developmental studies close an important phylogenetic gap between Drosophila melanogaster and other insect model systems. In this paper, we provide a comparative analysis of early embryonic transcriptomes across species, and use our data for a phylogenomic re-evaluation of dipteran phylogenetic relationships. Conclusions We show how comparative transcriptomics can be used to create useful resources for evo-devo, and to investigate phylogenetic relationships. Our results demonstrate that de novo assembly of short (Illumina) reads yields high-quality, high-coverage transcriptomic data sets. We use these data to investigate deep dipteran phylogenetic relationships. Our results, based on a concatenation of 160 orthologous genes, provide support for the traditional view of Clogmia being the sister group of Brachycera (Megaselia, Episyrphus, Drosophila), rather than that of Culicomorpha (which includes mosquitoes and blackflies). PMID:23432914

  2. Phylogenetics of modern birds in the era of genomics

    PubMed Central

    Edwards, Scott V; Bryan Jennings, W; Shedlock, Andrew M

    2005-01-01

    In the 14 years since the first higher-level bird phylogenies based on DNA sequence data, avian phylogenetics has witnessed the advent and maturation of the genomics era, the completion of the chicken genome and a suite of technologies that promise to add considerably to the agenda of avian phylogenetics. In this review, we summarize current approaches and data characteristics of recent higher-level bird studies and suggest a number of as yet untested molecular and analytical approaches for the unfolding tree of life for birds. A variety of comparative genomics strategies, including adoption of objective quality scores for sequence data, analysis of contiguous DNA sequences provided by large-insert genomic libraries, and the systematic use of retroposon insertions and other rare genomic changes all promise an integrated phylogenetics that is solidly grounded in genome evolution. The avian genome is an excellent testing ground for such approaches because of the more balanced representation of single-copy and repetitive DNA regions than in mammals. Although comparative genomics has a number of obvious uses in avian phylogenetics, its application to large numbers of taxa poses a number of methodological and infrastructural challenges, and can be greatly facilitated by a ‘community genomics’ approach in which the modest sequencing throughputs of single PI laboratories are pooled to produce larger, complementary datasets. Although the polymerase chain reaction era of avian phylogenetics is far from complete, the comparative genomics era—with its ability to vastly increase the number and type of molecular characters and to provide a genomic context for these characters—will usher in a host of new perspectives and opportunities for integrating genome evolution and avian phylogenetics. PMID:16024355

  3. Morphological, molecular and phylogenetic analyses of Diplotriaena bargusinica Skrjabin, 1917 (Nematoda: Diplotriaenidae).

    PubMed

    Dutra Vieira, Thainá; Pegoraro de Macedo, Marcia Raquel; Fedatto Bernardon, Fabiana; Müller, Gertrud

    2017-10-01

    The nematode Diplotriaena bargusinica is a bird air sac parasite, and its taxonomy is based mainly on morphological and morphometric characteristics. Increasing knowledge of genetic information variability has spurred the use of DNA markers in conjunction with morphological data for inferring phylogenetic relationships in different taxa. Considering the potential of molecular biology in taxonomy, this study presents the morphological and molecular characterization of D. bargusinica, and establishes the phylogenetic position of the nematode in Spirurina. Twenty partial sequences of the 18S region of D. bargusinica rDNA were generated. Phylogenetic trees were obtained through the Maximum Likelihood and Bayesian Inference methods, both of which yielded similar topologies. The group Diplotriaenoidea is monophyletic and the topologies generated corroborate the phylogenetic studies based on traditional and previously performed molecular taxonomy. This study is the first to generate molecular data associated with the morphology of the species. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Phylogenetic structure of soil bacterial communities predicts ecosystem functioning.

    PubMed

    Pérez-Valera, Eduardo; Goberna, Marta; Verdú, Miguel

    2015-05-01

    Quantifying diversity with phylogeny-informed metrics helps understand the effects of diversity on ecosystem functioning (EF). The sign of these effects remains controversial because phylogenetic diversity and taxonomic identity may interactively influence EF. Positive relationships, traditionally attributed to complementarity effects, seem unimportant in natural soil bacterial communities. Negative relationships could be attributed to fitness differences leading to the overrepresentation of a few productive clades, a mechanism recently invoked to assemble soil bacteria communities. We tested in two ecosystems contrasting in terms of environmental heterogeneity whether two metrics of phylogenetic community structure, a simpler measure of phylogenetic diversity (NRI) and a more complex metric incorporating taxonomic identity (PCPS), correctly predict microbially mediated EF. We show that the relationship between phylogenetic diversity and EF depends on the taxonomic identity of the main coexisting lineages. Phylogenetic diversity was negatively related to EF in soils where a marked fertility gradient exists and a single productive clade (Proteobacteria) outcompetes other clades in the most fertile plots. However, phylogenetic diversity was unrelated to EF in soils where the fertility gradient is less marked and Proteobacteria coexist with other abundant lineages. Including the taxonomic identity of bacterial lineages in metrics of phylogenetic community structure allows the prediction of EF in both ecosystems. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  5. Phylogenetic relationships among amphisbaenian reptiles based on complete mitochondrial genomic sequences.

    PubMed

    Macey, J Robert; Papenfuss, Theodore J; Kuehl, Jennifer V; Fourcade, H Mathew; Boore, Jeffrey L

    2004-10-01

    Complete mitochondrial genomic sequences are reported from 12 members in the four families of the reptile group Amphisbaenia. Analysis of 11,946 aligned nucleotide positions (5797 informative) produces a robust phylogenetic hypothesis. The family Rhineuridae is basal and Bipedidae is the sister taxon to the Amphisbaenidae plus Trogonophidae. Amphisbaenian reptiles are surprisingly old, predating the breakup of Pangaea 200 million years before present, because successive basal taxa (Rhineuridae and Bipedidae) are situated in tectonic regions of Laurasia and nested taxa (Amphisbaenidae and Trogonophidae) are found in Gondwanan regions. Thorough sampling within the Bipedidae shows that it is not tectonic movement of Baja California away from the Mexican mainland that is primary in isolating Bipes species, but rather that primary vicariance occurred between northern and southern groups. Amphisbaenian families show parallel reduction in number of limbs and Bipes species exhibit parallel reduction in number of digits. A measure is developed for comparing the phylogenetic information content of various genes. A synapomorphic trait defining the Bipedidae is a shift from the typical vertebrate mitochondrial gene arrangement to the derived state of trnE and nad6. In addition, a tandem duplication of trnT and trnP is observed in Bipes biporus with a pattern of pseudogene formation that varies among populations. The first case of convergent rearrangement of the mitochondrial genome among animals demonstrated by complete genomic sequences is reported. Relative to most vertebrates, the Rhineuridae has the block nad6, trnE switched in order with the block cob, trnT, trnP, as they are in birds.

  6. Phylogenetic relationships among amphisbaenian reptiles based on complete mitochondrial genomic sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Macey, J. Robert; Papenfuss, Theodore J.; Kuehl, Jennifer V.

    2004-05-19

    Complete mitochondrial genomic sequences are reported from 12 members in the four families of the reptile group Amphisbaenia. Analysis of 11,946 aligned nucleotide positions (5,797 informative) produces a robust phylogenetic hypothesis. The family Rhineuridae is basal and Bipedidae is the sister taxon to the Amphisbaenidae plus Trogonophidae. Amphisbaenian reptiles are surprisingly old, predating the breakup of Pangaea 200 million years before present, because successive basal taxa (Rhineuridae and Bipedidae) are situated in tectonic regions of Laurasia and nested taxa (Amphisbaenidae and Trogonophidae) are found in Gondwanan regions. Thorough sampling within the Bipedidae shows that it is not tectonic movement of Baja California away from the Mexican mainland that is primary in isolating Bipes species, but rather that primary vicariance occurred between northern and southern groups. Amphisbaenian families show parallel reduction in number of limbs and Bipes species exhibit parallel reduction in number of digits. A measure is developed for comparing the phylogenetic information content of various genes. A synapomorphic trait defining the Bipedidae is a shift from the typical vertebrate mitochondrial gene arrangement to the derived state of trnE and nad6. In addition, a tandem duplication of trnT and trnP is observed in B. biporus with a pattern of pseudogene formation that varies among populations. The first case of convergent rearrangement of the mitochondrial genome among animals demonstrated by complete genomic sequences is reported. Relative to most vertebrates, the Rhineuridae has the block nad6, trnE switched in order with cob, trnT, trnP, as they are in birds.

  7. Evidence for a close phylogenetic relationship between Melissococcus pluton, the causative agent of European foulbrood disease, and the genus Enterococcus.

    PubMed

    Cai, J; Collins, M D

    1994-04-01

    The 16S rRNA gene sequence of Melissococcus pluton, the causative agent of European foulbrood disease, was determined in order to investigate the phylogenetic relationships between this organism and other low-G + C-content gram-positive bacteria. A comparative sequence analysis revealed that M. pluton is a close phylogenetic relative of the genus Enterococcus.

  8. Development of phylogenetic markers for Sebacina (Sebacinaceae) mycorrhizal fungi associated with Australian orchids.

    PubMed

    Ruibal, Monica P; Peakall, Rod; Foret, Sylvain; Linde, Celeste C

    2014-06-01

    To investigate fungal species identity and diversity in mycorrhizal fungi of order Sebacinales, we developed phylogenetic markers. These new markers will enable future studies investigating species delineation and phylogenetic relationships of the fungal symbionts and facilitate investigations into evolutionary interactions among Sebacina species and their orchid hosts. • We generated partial genome sequences for a Sebacina symbiont originating from Caladenia huegelii with 454 genome sequencing and from three symbionts from Eriochilus dilatatus and one from E. pulchellus using Illumina sequencing. Six nuclear and two mitochondrial loci showed high variability (10-31% parsimony informative sites) for Sebacinales mycorrhizal fungi across four genera of Australian orchids (Caladenia, Eriochilus, Elythranthera, and Glossodia). • We obtained highly informative DNA markers that will allow investigation of mycorrhizal diversity of Sebacinaceae fungi associated with terrestrial orchids in Australia and worldwide.

  9. On the distribution of interspecies correlation for Markov models of character evolution on Yule trees.

    PubMed

    Mulder, Willem H; Crawford, Forrest W

    2015-01-07

    Efforts to reconstruct phylogenetic trees and understand evolutionary processes depend fundamentally on stochastic models of speciation and mutation. The simplest continuous-time model for speciation in phylogenetic trees is the Yule process, in which new species are "born" from existing lineages at a constant rate. Recent work has illuminated some of the structural properties of Yule trees, but it remains mostly unknown how these properties affect sequence and trait patterns observed at the tips of the phylogenetic tree. Understanding the interplay between speciation and mutation under simple models of evolution is essential for deriving valid phylogenetic inference methods and gives insight into the optimal design of phylogenetic studies. In this work, we derive the probability distribution of interspecies covariance under Brownian motion and Ornstein-Uhlenbeck models of phenotypic change on a Yule tree. We compute the probability distribution of the number of mutations shared between two randomly chosen taxa in a Yule tree under discrete Markov mutation models. Our results suggest summary measures of phylogenetic information content, illuminate the correlation between site patterns in sequences or traits of related organisms, and provide heuristics for experimental design and reconstruction of phylogenetic trees. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. Phylogenetic comparative methods on phylogenetic networks with reticulations.

    PubMed

    Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile

    2018-04-25

    The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. 
    All the methods developed here are available in the Julia package PhyloNetworks.
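
    The recursion summarized in this abstract can be illustrated with a minimal Python sketch. It assumes unit-rate Brownian motion, a fixed root value, and a hybrid trait equal to the inheritance-weighted average of its parents' values. The function name and input layout are hypothetical and are not the PhyloNetworks API.

```python
import numpy as np

def network_variance(nodes):
    """Trait covariance matrix under unit-rate Brownian motion on a network.

    nodes: list in preorder (every parent listed before its children). Each
    entry is (name, parents), where parents is a list of
    (parent_name, branch_length, inheritance_weight) tuples: empty for the
    root, one tuple for a tree node, two or more for a hybrid node whose
    weights sum to 1. This input layout is an assumption of this sketch.
    """
    names = [name for name, _ in nodes]
    index = {name: i for i, name in enumerate(names)}
    V = np.zeros((len(nodes), len(nodes)))
    for i, (_, parents) in enumerate(nodes):
        if not parents:
            continue  # root value treated as fixed, so its variance is 0
        # Covariance with every earlier node: weighted average over parents.
        for p_name, _, gamma in parents:
            V[i, :i] += gamma * V[index[p_name], :i]
        V[:i, i] = V[i, :i]
        # Variance: (parent variance + branch length) weighted by gamma^2,
        # plus cross terms between distinct parents of a hybrid.
        var = sum(g * g * (V[index[p], index[p]] + length)
                  for p, length, g in parents)
        for a in range(len(parents)):
            for b in range(a + 1, len(parents)):
                pa, _, ga = parents[a]
                pb, _, gb = parents[b]
                var += 2 * ga * gb * V[index[pa], index[pb]]
        V[i, i] = var
    return names, V

# Toy network: H is a hybrid of A and B with equal inheritance weights.
nodes = [
    ("R", []),
    ("A", [("R", 1.0, 1.0)]),
    ("B", [("R", 1.0, 1.0)]),
    ("H", [("A", 0.5, 0.5), ("B", 0.5, 0.5)]),
]
names, V = network_variance(nodes)
print(names)
print(V)
```

    Each node is visited once, and its row is filled from rows that are already complete, which is why a single preorder pass is enough in this sketch.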

  11. Invasions but not extinctions change phylogenetic diversity of angiosperm assemblage on southeastern Pacific Oceanic islands

    PubMed Central

    2017-01-01

    We assessed changes in phylogenetic diversity of angiosperm flora on six oceanic islands located in the southeastern Pacific Ocean, by comparing flora from two periods: the pre-European colonization of islands and current times. We hypothesize that, in the time between these periods, extinction of local plant species and addition of exotic plants modified phylogenetic-α-diversity at different levels (deeper and terminal phylogeny) and increased phylo-β-diversity among islands. Based on floristic studies, we assembled a phylogenetic tree from occurrence data that includes 921 species, of which 165 and 756 were native or exotic in origin, respectively. Then, we studied change in the phylo-α-diversity and phylo-β-diversity (1 – PhyloSor) by comparing pre-European and current times. Despite extinction of 18 native angiosperm species, an increase in species richness and phylo-α-diversity was observed for all islands studied, attributed to introduction of exotic plants (between 6 and 477 species per island). We did not observe significant variation of mean phylogenetic distance (MPD), a measure of the ‘deeper’ phylogenetic diversity of assemblages (e.g., orders, families), suggesting that neither extinctions nor introductions altered phylogenetic structure of the angiosperms of these islands. In regard to phylo-β-diversity, we detected temporal turnover (variation in phylogenetic composition) in the flora between periods (0.38 ± 0.11). However, when analyses were performed only considering native plants, we did not observe significant temporal turnover between periods (0.07 ± 0.06). These results indicate that introduction of exotic angiosperms has contributed more notably than extinctions to the configuration of plant assemblages and phylogenetic diversity on the studied islands.
    Because phylogenetic diversity is closely related to functional diversity (species trait variations and roles performed by organisms), our results suggest that the introduction of exotic plants to these islands could have detrimental impacts for ecosystem functions and ecosystem services that islands provide (e.g. productivity). PMID:28763508
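
    As a rough illustration of the turnover measure used here, the sketch below computes 1 – PhyloSor for two species lists on a small rooted tree. It assumes the common definition of PhyloSor as twice the shared branch length divided by the summed branch lengths of the two assemblages, with branches counted down to the root. The tree, species names, and function names are hypothetical.

```python
def branches_to_root(tips, parent):
    """Branches (named by their child node) linking the given tips to the root."""
    found = set()
    for node in tips:
        while node in parent:
            found.add(node)
            node = parent[node][0]
    return found

def phylo_beta(community_a, community_b, parent):
    """1 - PhyloSor: phylogenetic turnover between two species lists."""
    a = branches_to_root(community_a, parent)
    b = branches_to_root(community_b, parent)
    total = lambda branches: sum(parent[n][1] for n in branches)
    return 1.0 - 2.0 * total(a & b) / (total(a) + total(b))

# Hypothetical toy tree: child -> (parent, branch length).
parent = {"A": ("X", 1.0), "B": ("X", 1.0), "X": ("root", 1.0), "C": ("root", 2.0)}
pre_european = {"A", "B"}
current = {"A", "B", "C"}  # C stands in for an introduced species
print(phylo_beta(pre_european, current, parent))  # 0.25
```

    In this toy case the introduced species adds a long branch that the earlier assemblage lacks, so turnover is nonzero even though no native species is lost.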

  12. The Comparative Osteology of the Petrotympanic Complex (Ear Region) of Extant Baleen Whales (Cetacea: Mysticeti)

    PubMed Central

    Ekdale, Eric G.; Berta, Annalisa; Deméré, Thomas A.

    2011-01-01

    Background: Anatomical comparisons of the ear region of baleen whales (Mysticeti) are provided through detailed osteological descriptions and high-resolution photographs of the petrotympanic complex (tympanic bulla and petrosal bone) of all extant species of mysticete cetaceans. Salient morphological features are illustrated and identified, including overall shape of the bulla, size of the conical process of the bulla, morphology of the promontorium, and the size and shape of the anterior process of the petrosal. We place our comparative osteological observations into a phylogenetic context in order to initiate an exploration into petrotympanic evolution within Mysticeti. Principal Findings: The morphology of the petrotympanic complex is diagnostic for individual species of baleen whale (e.g., sigmoid and conical processes positioned at midline of bulla in Balaenoptera musculus; confluence of fenestra cochleae and perilymphatic foramen in Eschrichtius robustus), and several mysticete clades are united by derived characteristics. Balaenids and neobalaenids share derived features of the bulla, such as a rhomboid shape and a reduced anterior lobe (swelling) in ventral aspect, and eschrichtiids share derived morphologies of the petrosal with balaenopterids, including loss of a medial promontory groove and dorsomedial elongation of the promontorium. Monophyly of Balaenoidea (Balaenidae and Neobalaenidae) and Balaenopteroidea (Balaenopteridae and Eschrichtiidae) was recovered in phylogenetic analyses utilizing data exclusively from the petrotympanic complex. Significance: This study fills a major gap in our knowledge of the complex structures of the mysticete petrotympanic complex, which is an important anatomical region for the interpretation of the evolutionary history of mammals. In addition, we introduce a novel body of phylogenetically informative characters from the ear region of mysticetes. 
    Our detailed anatomical descriptions, illustrations, and comparisons provide valuable data for current and future studies on the phylogenetic relationships, evolution, and auditory physiology of mysticetes and other cetaceans throughout Earth's history. PMID:21731700

  13. Polyphasic characterization of Trichocoleus desertorum sp. nov. (Pseudanabaenales, Cyanobacteria) from desert soils and phylogenetic placement of the genus Trichocoleus

    Treesearch

    Radka Muhlsteinova; Jeffrey R. Johansen; Nicole Pietrasiak; Michael P. Martin; Karina Osorio-Santos; Steven D. Warren

    2014-01-01

    Little is known about the taxonomic diversity of cyanobacteria in deserts, despite their important ecological roles in these ecosystems. In this study, cyanobacterial strains from the Atacama, Colorado, and Mojave Deserts were isolated and characterized using molecular, morphological, and ecological information. Phylogenetic placement of these strains was revealed...

  14. Nodal distances for rooted phylogenetic trees.

    PubMed

    Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel

    2010-08-01

    Dissimilarity measures for (possibly weighted) phylogenetic trees, based on comparing their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem by suitably splitting each path length between two taxa into two lengths. We prove that the resulting split path length matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces $M_n(\mathbb{R})$ of real-valued $n \times n$ matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using $L^p$ metrics on $M_n(\mathbb{R})$, with $p \in \mathbb{R}_{>0}$.
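
    One way to read the splitting idea is sketched below in Python: for each ordered pair of taxa (i, j), the matrix entry is taken to be the distance from their lowest common ancestor to i, and two trees are compared through an entrywise L^p norm of the difference. This reading of the abstract, the tree encoding, and all function names are assumptions of the sketch rather than the authors' code.

```python
def depths_above(tip, parent):
    """Map each ancestor of `tip` (and the tip itself) to its distance from `tip`."""
    dist, node, d = {tip: 0.0}, tip, 0.0
    while node in parent:
        node, length = parent[node]
        d += length
        dist[node] = d
    return dist

def split_path_lengths(parent, taxa):
    """Entry (i, j): distance from the lowest common ancestor of i and j to i."""
    up = {t: depths_above(t, parent) for t in taxa}
    M = {}
    for i in taxa:
        for j in taxa:
            # The lowest common ancestor is the shared ancestor closest to i.
            lca = min((n for n in up[i] if n in up[j]), key=up[i].get)
            M[i, j] = up[i][lca]
    return M

def nodal_distance(parent1, parent2, taxa, p=1):
    """Entrywise L^p distance between the two split path length matrices."""
    M1 = split_path_lengths(parent1, taxa)
    M2 = split_path_lengths(parent2, taxa)
    return sum(abs(M1[k] - M2[k]) ** p for k in M1) ** (1.0 / p)

# Hypothetical trees, child -> (parent, branch length): ((A,B),C) and (A,(B,C)).
t1 = {"A": ("x", 1.0), "B": ("x", 1.0), "x": ("r", 1.0), "C": ("r", 1.0)}
t2 = {"B": ("y", 1.0), "C": ("y", 1.0), "y": ("r", 1.0), "A": ("r", 1.0)}
taxa = ["A", "B", "C"]
print(nodal_distance(t1, t2, taxa, p=1))  # 4.0
print(nodal_distance(t1, t2, taxa, p=2))  # 2.0
```

    Because entry (i, j) and entry (j, i) are stored separately, the matrix keeps the two halves of each path apart instead of only their sum.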

  15. Phylodiversity to inform conservation policy: An Australian example.

    PubMed

    Laity, Tania; Laffan, Shawn W; González-Orozco, Carlos E; Faith, Daniel P; Rosauer, Dan F; Byrne, Margaret; Miller, Joseph T; Crayn, Darren; Costion, Craig; Moritz, Craig C; Newport, Karl

    2015-11-15

    Phylodiversity measures summarise the phylogenetic diversity patterns of groups of organisms. By using branches of the tree of life, rather than its tips (e.g., species), phylodiversity measures provide important additional information about biodiversity that can improve conservation policy and outcomes. As a biodiverse nation with a strong legislative and policy framework, Australia provides an opportunity to use phylogenetic information to inform conservation decision-making. We explored the application of phylodiversity measures across Australia with a focus on two highly biodiverse regions, the south west of Western Australia (SWWA) and the South East Queensland bioregion (SEQ). We analysed seven diverse groups of organisms spanning five separate phyla on the evolutionary tree of life, the plant genera Acacia and Daviesia, mammals, hylid frogs, myobatrachid frogs, passerine birds, and camaenid land snails. We measured species richness, weighted species endemism (WE) and two phylodiversity measures, phylogenetic diversity (PD) and phylogenetic endemism (PE), as well as their respective complementarity scores (a measure of gains and losses) at 20 km resolution. Higher PD was identified within SEQ for all fauna groups, whereas more PD was found in SWWA for both plant groups. PD and PD complementarity were strongly correlated with species richness and species complementarity for most groups but less so for plants. PD and PE were found to complement traditional species-based measures for all groups studied: PD and PE follow similar spatial patterns to richness and WE, but highlighted different areas that would not be identified by conventional species-based biodiversity analyses alone. The application of phylodiversity measures, particularly the novel weighted complementary measures considered here, in conservation can enhance protection of the evolutionary history that contributes to present day biodiversity values of areas. 
    Phylogenetic measures in conservation can include important elements of biodiversity in conservation planning, such as evolutionary potential and feature diversity that will improve decision-making and lead to better biodiversity conservation outcomes. Crown Copyright © 2015. Published by Elsevier B.V. All rights reserved.
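
    A small Python sketch can show how PD and PE differ per grid cell. It assumes the usual definitions: PD as the summed length of branches connecting a cell's taxa to the root, and PE as the same sum with each branch divided by the number of cells in which it occurs. The abstract does not state the software used, so the function name, tree, and cell data below are hypothetical.

```python
from collections import Counter

def pd_and_pe(cells, parent):
    """Return {cell: (PD, PE)} for cells given as {cell: set of tip names}.

    parent maps each non-root node to (its parent, branch length); a branch
    is identified by its child node.
    """
    def branches(tips):
        found = set()
        for node in tips:
            while node in parent:
                found.add(node)
                node = parent[node][0]
        return found

    per_cell = {cell: branches(tips) for cell, tips in cells.items()}
    # Range of a branch: number of cells containing any of its descendants.
    range_size = Counter(b for bs in per_cell.values() for b in bs)
    result = {}
    for cell, bs in per_cell.items():
        pd = sum(parent[b][1] for b in bs)
        pe = sum(parent[b][1] / range_size[b] for b in bs)
        result[cell] = (pd, pe)
    return result

# Hypothetical toy tree and two grid cells.
parent = {"A": ("X", 1.0), "B": ("X", 1.0), "X": ("root", 1.0), "C": ("root", 2.0)}
cells = {"cell1": {"A", "B"}, "cell2": {"A", "C"}}
scores = pd_and_pe(cells, parent)
print(scores["cell1"])  # (3.0, 2.0)
print(scores["cell2"])  # (4.0, 3.0)
```

    Branches shared by both cells count in full toward PD but only by half toward PE, so PE rewards cells that hold evolutionary history found nowhere else.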

  16. Explosive radiation or uninformative genes? Origin and early diversification of tachinid flies (Diptera: Tachinidae).

    PubMed

    Winkler, Isaac S; Blaschke, Jeremy D; Davis, Daniel J; Stireman, John O; O'Hara, James E; Cerretti, Pierfilippo; Moulton, John K

    2015-07-01

    Molecular phylogenetic studies at all taxonomic levels often infer rapid radiation events based on short, poorly resolved internodes. While such rapid episodes of diversification are an important and widespread evolutionary phenomenon, much of this poor phylogenetic resolution may be attributed to the continuing widespread use of "traditional" markers (mitochondrial, ribosomal, and some nuclear protein-coding genes) that are often poorly suited to resolve difficult, higher-level phylogenetic problems. Here we reconstruct phylogenetic relationships among a representative set of taxa of the parasitoid fly family Tachinidae and related outgroups of the superfamily Oestroidea. The Tachinidae are one of the most species rich, yet evolutionarily recent families of Diptera, providing an ideal case study for examining the differential performance of loci in resolving phylogenetic relationships and the benefits of adding more loci to phylogenetic analyses. We assess the phylogenetic utility of nine genes including both traditional genes (e.g., CO1 mtDNA, 28S rDNA) and nuclear protein-coding genes newly developed for phylogenetic analysis. Our phylogenetic findings, based on a limited set of taxa, include: a close relationship between Tachinidae and the calliphorid subfamily Polleninae, monophyly of Tachinidae and the subfamilies Exoristinae and Dexiinae, subfamily groupings of Dexiinae+Phasiinae and Tachininae+Exoristinae, and robust phylogenetic placement of the somewhat enigmatic genera Strongygaster, Euthera, and Ceracia. In contrast to poor resolution and phylogenetic incongruence of "traditional genes," we find that a more selective set of highly informative genes is able to more precisely identify regions of the phylogeny that experienced rapid radiation of lineages, while more accurately depicting their phylogenetic context. 
    Although much expanded taxon sampling is necessary to effectively assess the monophyly of and relationships among major tachinid lineages and their relatives, we show that a small number of well-chosen nuclear protein-coding genes can successfully resolve even difficult phylogenetic problems. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Picante: R tools for integrating phylogenies and ecology.

    PubMed

    Kembel, Steven W; Cowan, Peter D; Helmus, Matthew R; Cornwell, William K; Morlon, Helene; Ackerly, David D; Blomberg, Simon P; Webb, Campbell O

    2010-06-01

    Picante is a software package that provides a comprehensive set of tools for analyzing the phylogenetic and trait diversity of ecological communities. The package calculates phylogenetic diversity metrics, performs trait comparative analyses, manipulates phenotypic and phylogenetic data, and performs tests for phylogenetic signal in trait distributions, community structure and species interactions. Picante is a package for the R statistical language and environment written in R and C, released under a GPL v2 open-source license, and freely available on the web (http://picante.r-forge.r-project.org) and from CRAN (http://cran.r-project.org).

  18. Analyzing the relationship between sequence divergence and nodal support using Bayesian phylogenetic analyses.

    PubMed

    Makowsky, Robert; Cox, Christian L; Roelke, Corey; Chippindale, Paul T

    2010-11-01

    Determining the appropriate gene for phylogeny reconstruction can be a difficult process. Rapidly evolving genes tend to resolve recent relationships, but suffer from alignment issues and increased homoplasy among distantly related species. Conversely, slowly evolving genes generally perform best for deeper relationships, but lack sufficient variation to resolve recent relationships. We determine the relationship between sequence divergence and Bayesian phylogenetic reconstruction ability using both natural and simulated datasets. The natural data are based on 28 well-supported relationships within the subphylum Vertebrata. Sequences of 12 genes were acquired and Bayesian analyses were used to determine phylogenetic support for correct relationships. Simulated datasets were designed to determine whether an optimal range of sequence divergence exists across extreme phylogenetic conditions. Across all genes we found that an optimal range of divergence for resolving the correct relationships does exist, although this level of divergence expectedly depends on the distance metric. Simulated datasets show that an optimal range of sequence divergence exists across diverse topologies and models of evolution. We determine that a simple-to-measure property of genetic sequences (genetic distance) is related to phylogenetic reconstruction ability in Bayesian analyses. This information should be useful for selecting the most informative gene to resolve any relationships, especially those that are difficult to resolve, as well as minimizing both cost and confounding information during project design. Copyright © 2010. Published by Elsevier Inc.

  19. Informational Gene Phylogenies Do Not Support a Fourth Domain of Life for Nucleocytoplasmic Large DNA Viruses

    PubMed Central

    Williams, Tom A.; Embley, T. Martin; Heinz, Eva

    2011-01-01

    Mimivirus is a nucleocytoplasmic large DNA virus (NCLDV) with a genome size (1.2 Mb) and coding capacity (~1000 genes) comparable to that of some cellular organisms. Unlike other viruses, Mimivirus and its NCLDV relatives encode homologs of broadly conserved informational genes found in Bacteria, Archaea, and Eukaryotes, raising the possibility that they could be placed on the tree of life. A recent phylogenetic analysis of these genes showed the NCLDVs emerging as a monophyletic group branching between Eukaryotes and Archaea. These trees were interpreted as evidence for an independent “fourth domain” of life that may have contributed DNA processing genes to the ancestral eukaryote. However, the analysis of ancient evolutionary events is challenging, and tree reconstruction is susceptible to bias resulting from non-phylogenetic signals in the data. These include compositional heterogeneity and homoplasy, which can lead to the spurious grouping of compositionally-similar or fast-evolving sequences. Here, we show that these informational gene alignments contain both significant compositional heterogeneity and homoplasy, which were not adequately modelled in the original analysis. When we use more realistic evolutionary models that better fit the data, the resulting trees are unable to reject a simple null hypothesis in which these informational genes, like many other NCLDV genes, were acquired by horizontal transfer from eukaryotic hosts. Our results suggest that a fourth domain is not required to explain the available sequence data. PMID:21698163

  20. Comparing Phylogenetic Trees by Matching Nodes Using the Transfer Distance Between Partitions

    PubMed Central

    Giaro, Krzysztof

    2017-01-01

    Ability to quantify dissimilarity of different phylogenetic trees describing the relationship between the same group of taxa is required in various types of phylogenetic studies. For example, such metrics are used to assess the quality of phylogeny construction methods, to define optimization criteria in supertree building algorithms, or to find horizontal gene transfer (HGT) events. Among the set of metrics described so far in the literature, the most commonly used seems to be the Robinson–Foulds distance. In this article, we define a new metric for rooted trees—the Matching Pair (MP) distance. The MP metric uses the concept of the minimum-weight perfect matching in a complete bipartite graph constructed from partitions of all pairs of leaves of the compared phylogenetic trees. We analyze the properties of the MP metric and present computational experiments showing its potential applicability in tasks related to finding the HGT events. PMID:28177699

  1. Comparing Phylogenetic Trees by Matching Nodes Using the Transfer Distance Between Partitions.

    PubMed

    Bogdanowicz, Damian; Giaro, Krzysztof

    2017-05-01

    Ability to quantify dissimilarity of different phylogenetic trees describing the relationship between the same group of taxa is required in various types of phylogenetic studies. For example, such metrics are used to assess the quality of phylogeny construction methods, to define optimization criteria in supertree building algorithms, or to find horizontal gene transfer (HGT) events. Among the set of metrics described so far in the literature, the most commonly used seems to be the Robinson-Foulds distance. In this article, we define a new metric for rooted trees, the Matching Pair (MP) distance. The MP metric uses the concept of the minimum-weight perfect matching in a complete bipartite graph constructed from partitions of all pairs of leaves of the compared phylogenetic trees. We analyze the properties of the MP metric and present computational experiments showing its potential applicability in tasks related to finding the HGT events.
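
    For orientation, the baseline metric named in this abstract, the Robinson-Foulds distance, can be sketched for rooted trees in a few lines of Python. The sketch counts the clusters (leaf sets below internal nodes) found in exactly one of the two trees; some authors halve this count. It does not implement the MP distance itself, and the nested-tuple tree encoding and function names are assumptions.

```python
def clusters(tree):
    """Set of leaf sets (clusters) below each internal node of a nested-tuple tree."""
    found = set()

    def walk(node):
        if isinstance(node, str):  # a leaf is represented by its name
            return frozenset([node])
        leaves = frozenset().union(*(walk(child) for child in node))
        found.add(leaves)
        return leaves

    walk(tree)
    return found

def robinson_foulds(tree1, tree2):
    """Number of clusters present in exactly one of the two rooted trees."""
    return len(clusters(tree1) ^ clusters(tree2))

# Hypothetical four-taxon trees.
balanced = (("A", "B"), ("C", "D"))
swapped = (("A", "C"), ("B", "D"))
ladder = ((("A", "B"), "C"), "D")
print(robinson_foulds(balanced, swapped))  # 4
print(robinson_foulds(balanced, ladder))   # 2
```

    A count like this treats every mismatched cluster the same however similar it is to a cluster in the other tree, which is the kind of coarseness that matching-based metrics such as the MP distance aim to reduce.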

  2. Biodiversity comparison among phylogenetic diversity metrics and between three North American prairies

    PubMed Central

    Kellar, P. Roxanne (Steele); Ahrendsen, Dakota L.; Aust, Shelly K.; Jones, Amanda R.; Pires, J. Chris

    2015-01-01

    Protection of Earth’s ecosystems requires identification of geographical areas of greatest biodiversity. Assessment of biodiversity begins with knowledge of the evolutionary histories of species in a geographic area. Multiple phylogenetic diversity (PD) metrics have been developed to describe biodiversity beyond species counts, but sufficient empirical studies, particularly at fine phylogenetic scales, have not been conducted to provide conservation planners with evidence for incorporating PD metrics into selection of priority regions. We review notable studies that are contributing to a growing database of empirical results, we report on the effect of using high-throughput sequencing to estimate the phylogenies used to calculate PD metrics, and we discuss difficulties in selecting appropriate diversity indices. We focused on two of the most speciose angiosperm families in prairies—Asteraceae and Fabaceae—and compared 12 PD metrics and four traditional measures of biodiversity between three North American prairie sites. The varying results from the literature and from the current data reveal the wide range of applications of PD metrics and the necessity for many more empirical studies. The accumulation of results from further investigations will eventually lead to a scientific understanding upon which conservation planners can make informed decisions about where to apply limited preservation funds. PMID:26191461

  3. Epidemiological and Phylogenetic Characteristics of Influenza B Infection in Severe Acute Respiratory Infection Cases in Beijing, 2014 to 2015.

    PubMed

    Pan, Yang; Zhang, Yi; Yang, Peng; Qian, Haiqun; Shi, Weixian; Wu, Shuangsheng; Cui, Shujuan; Zhang, Daitao; Wang, Quanyi

    2015-12-01

    Influenza B viral infection is of great importance, but the epidemiological and phylogenetic characteristics of influenza B infection in severe acute respiratory infection (SARI) cases are still unclear. The clinical information of 2816 SARI cases and 467,737 influenza-like illness (ILI) cases in Beijing area from September 2014 to April 2015 were collected and analyzed. Among them, 91 influenza B viruses isolated from SARI cases were sequenced. The overall yield rate of influenza A/B infection was 14.21% and 27.77% in sampled SARI and ILI cases, respectively. Compared with influenza A infection, the frequency of influenza B infection in SARI cases was higher in younger patients. Phylogenetic analysis suggested that most tested hemagglutination genes belonged to Yamagata lineage Clade 3, which were similar to currently circulating viruses but different from the 2014 to 2015 influenza season vaccine strain (Clade 2). Importantly, HA-Y3/NA-V4 intralineage reassorting was identified in Beijing area for the first time, which can act as a possible risk factor of SARIs. The influenza activity and virus types/subtypes/lineages among SARI patients were well correlated with those of ILI cases. Furthermore, the potential risk of reassorted influenza B virus infection should not be overlooked.

  4. Systematics of marine brown alga Sargassum from Thailand: A preliminary study based on morphological data and nuclear ribosomal internal transcribed spacer 2 (ITS2) sequences

    NASA Astrophysics Data System (ADS)

    Kantachumpoo, Attachai; Uwai, Shinya; Noiraksar, Thidarat; Komatsu, Teruhisa

    2015-06-01

    The marine brown algal genus Sargassum has been investigated extensively based on genetic information. In this report, we performed the first comparative study of morphological and molecular data among common species of Sargassum found in Thailand and explored the phylogenetic diversity within the genus. Our results revealed an incongruent pattern for species classification in Thai Sargassum. Morphologically, our Sargassum specimens were distinguishable and represented 8 species, namely, S. aquifolium (Turner) C. Agardh, S. baccularia (Mertens) C. Agardh, S. cinereum J. Agardh, S. ilicifolium (Turner) C. Agardh, S. oligocystum Montagne, S. plagiophyllum C. Agardh, S. polycystum C. Agardh and S. swartzii (Turner) C. Agardh. In contrast, using three different methods, phylogenetic analysis of nuclear ribosomal internal transcribed spacer 2 (ITS2) revealed six distinct clades, including the S. baccularia/S. oligocystum clade, S. aquifolium/S. swartzii clade, S. cinereum clade, S. aquifolium/S. ilicifolium clade, S. polycystum clade, and S. plagiophyllum clade, which was suggestive of a phenotypic plasticity species complex. Our molecular data also confirmed the paraphyletic relationship in the section Binderianae and suggested that this section requires reassessment. Overall, further studies are required to increase our understanding of the taxonomy, phylogenetic relationships and species boundaries among Sargassum species in Thailand.

  5. Phylogenetic inference under varying proportions of indel-induced alignment gaps

    PubMed Central

    Dwivedi, Bhakti; Gadagkar, Sudhindra R

    2009-01-01

    Background: The effect of alignment gaps on phylogenetic accuracy has been the subject of numerous studies. In this study, we investigated the relationship between the total number of gapped sites and phylogenetic accuracy, when the gaps were introduced (by means of computer simulation) to reflect indel (insertion/deletion) events during the evolution of DNA sequences. The resulting (true) alignments were subjected to commonly used gap treatment and phylogenetic inference methods. Results: (1) In general, there was a strong – almost deterministic – relationship between the amount of gap in the data and the level of phylogenetic accuracy when the alignments were very "gappy", (2) gaps resulting from deletions (as opposed to insertions) contributed more to the inaccuracy of phylogenetic inference, (3) the probabilistic methods (Bayesian, PhyML & "MLε," a method implemented in DNAML in PHYLIP) performed better at most levels of gap percentage when compared to parsimony (MP) and distance (NJ) methods, with Bayesian analysis being clearly the best, (4) methods that treat gapped sites as missing data yielded less accurate trees when compared to those that attribute phylogenetic signal to the gapped sites (by coding them as binary character data – presence/absence, or as in the MLε method), and (5) in general, the accuracy of phylogenetic inference depended upon the amount of available data when the gaps resulted from mainly deletion events, and the amount of missing data when insertion events were equally likely to have caused the alignment gaps.
Conclusion: When gaps in an alignment are a consequence of indel events in the evolution of the sequences, the accuracy of phylogenetic analysis is likely to improve if: (1) alignment gaps are categorized as arising from insertion events or deletion events and then treated separately in the analysis, (2) the evolutionary signal provided by indels is harnessed in the phylogenetic analysis, and (3) methods that utilize the phylogenetic signal in indels are developed for distance methods too. When the true homology is known and the amount of gaps is 20 percent of the alignment length or less, the methods used in this study are likely to yield trees with 90–100 percent accuracy. PMID:19698168
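
The abstract above mentions coding gapped sites as binary presence/absence characters. The sketch below illustrates one simple way this could be done and is not the procedure used in the study: it codes each alignment column separately (rather than each inferred indel event), and the function name `code_gaps_as_binary` and the toy alignment are hypothetical.

```python
from typing import Dict


def code_gaps_as_binary(alignment: Dict[str, str], gap: str = "-") -> Dict[str, str]:
    """Code gap-containing alignment columns as presence/absence characters.

    Assumption: every column is treated independently. A column is kept only
    if it has a gap in at least one, but not all, sequences; in that column a
    residue is coded '1' (present) and a gap '0' (absent).
    """
    names = list(alignment)
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned sequences must have the same length")
    n_cols = lengths.pop()

    coded = {name: [] for name in names}
    for col in range(n_cols):
        column = [alignment[name][col] for name in names]
        n_gaps = column.count(gap)
        # Skip columns without gaps and columns that are entirely gaps.
        if 0 < n_gaps < len(names):
            for name, ch in zip(names, column):
                coded[name].append("0" if ch == gap else "1")
    return {name: "".join(chars) for name, chars in coded.items()}


if __name__ == "__main__":
    toy = {"A": "AC-GT", "B": "ACGGT", "C": "A--GT"}
    print(code_gaps_as_binary(toy))
```

The resulting binary matrix could be appended to the nucleotide data as a separate partition, so that the gapped sites contribute signal instead of being treated as missing data.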

  6. The riddle of Tasmanian languages

    PubMed Central

    Bowern, Claire

    2012-01-01

    Recent work which combines methods from linguistics and evolutionary biology has been fruitful in discovering the history of major language families because of similarities in evolutionary processes. Such work opens up new possibilities for language research on previously unsolvable problems, especially in areas where information from other sources may be lacking. I use phylogenetic methods to investigate Tasmanian languages. Existing materials are so fragmentary that scholars have been unable to discover how many languages are represented in the sources. Using a clustering algorithm which identifies admixture, source materials representing more than one language are identified. Using the Neighbor-Net algorithm, 12 languages are identified in five clusters. Bayesian phylogenetic methods reveal that the families are not demonstrably related; an important result, given the importance of Tasmanian Aborigines for information about how societies have responded to population collapse in prehistory. This work provides insight into the societies of prehistoric Tasmania and illustrates a new utility of phylogenetics in reconstructing linguistic history. PMID:23015621

  7. Optimal rates for phylogenetic inference and experimental design in the era of genome-scale datasets.

    PubMed

    Dornburg, Alex; Su, Zhuo; Townsend, Jeffrey P

    2018-06-25

    With the rise of genome-scale datasets there has been a call for increased data scrutiny and careful selection of loci appropriate for attempting the resolution of a phylogenetic problem. Such loci are desired to maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve under such an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Here we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters, not genes, are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogenetic experimental design.

  8. Bridging meta-analysis and the comparative method: a test of seed size effect on germination after frugivores' gut passage.

    PubMed

    Verdú, Miguel; Traveset, Anna

    2004-02-01

    Most studies using meta-analysis try to establish relationships between traits across taxa from interspecific databases and, thus, the phylogenetic relatedness among these taxa should be taken into account to avoid pseudoreplication derived from common ancestry. This paper illustrates, with a representative example of the relationship between seed size and the effect of frugivores' gut on seed germination, that meta-analytic procedures can also be phylogenetically corrected by means of the comparative method. The conclusions obtained in the meta-analytical and phylogenetic approaches are very different. The meta-analysis revealed that the positive effects that gut passage had on seed germination increased with seed size in the case of gut passage through birds, whereas they decreased in the case of gut passage through non-flying mammals. However, once the phylogenetic relatedness among plant species was taken into account, the effects of gut passage on seed germination did not depend on seed size and were similar between birds and non-flying mammals. Some methodological considerations are given to improve the bridge between the meta-analysis and the comparative method.

  9. DNA Sequence Analyses Reveal Abundant Diversity, Endemism and Evidence for Asian Origin of the Porcini Mushrooms

    PubMed Central

    Feng, Bang; Xu, Jianping; Wu, Gang; Zeng, Nian-Kai; Li, Yan-Chun; Tolgor, Bau; Kost, Gerhard W.; Yang, Zhu L.

    2012-01-01

    The wild gourmet mushroom Boletus edulis and its close allies are of significant ecological and economic importance. They are found throughout the Northern Hemisphere, but despite their ubiquity there are still many unresolved issues with regard to the taxonomy, systematics and biogeography of this group of mushrooms. Most phylogenetic studies of Boletus so far have characterized samples from North America and Europe and little information is available on samples from other areas, including the ecologically and geographically diverse regions of China. Here we analyzed DNA sequence variation in three gene markers from samples of these mushrooms from across China and compared our findings with those from other representative regions. Our results revealed fifteen novel phylogenetic species (about one-third of the known species) and a newly identified lineage represented by Boletus sp. HKAS71346 from tropical Asia. The phylogenetic analyses support eastern Asia as the center of diversity for the porcini sensu stricto clade. Within this clade, B. edulis is the only known holarctic species. The majority of the other phylogenetic species are geographically restricted in their distributions. Furthermore, molecular dating and geological evidence suggest that this group of mushrooms originated during the Eocene in eastern Asia, followed by dispersal to and subsequent speciation in other parts of Asia, Europe, and the Americas from the middle Miocene through the early Pliocene. In contrast to the ancient dispersal of porcini in the strict sense in the Northern Hemisphere, the occurrence of B. reticulatus and B. edulis sensu lato in the Southern Hemisphere was probably due to recent human-mediated introductions. PMID:22629418

  10. Phylogeny of economically important insect pests that infesting several crops species in Malaysia

    NASA Astrophysics Data System (ADS)

    Ghazali, Siti Zafirah; Zain, Badrul Munir Md.; Yaakop, Salmah

    2014-09-01

    This paper reports molecular data on insect pests of commercial crops in Peninsular Malaysia. Fifteen insect pests were sampled: eleven species (Metisa plana, Calliteara horsefeldii, Cotesia vestalis, Bactrocera papayae, Bactrocera carambolae, Bactrocera latifrons, Conopomorpha cramella, Sesamia inferens, Chilo polychrysa, Rhynchophorus vulneratus, and Rhynchophorus ferrugineus) from nine crops (oil palm, coconut, paddy, cocoa, starfruit, angled loofah, guava, chili and mustard), and four further species comprising a fern pest (Herpetogramma platycapna) and storage and rice pests (Tribolium castaneum, Oryzaephilus surinamensis and Cadra cautella). The presented phylogeny summarizes an initial phylogenetic hypothesis for these economically important insect pests. Phylogenetic relationships among 39 individuals of 15 species belonging to 12 genera in three orders were inferred from DNA sequences of a mitochondrial marker, cytochrome oxidase subunit I (COI), and a nuclear marker, the ribosomal DNA 28S D2 region. The phylogenies resulting from the analyses of both genes are relatively similar, but differ in the sequence of evolution. Interestingly, Bayesian inference analysis of the COI sequence data produced a more resolved phylogeny than the 28S sequences, one that corroborates traditional hypotheses of holometabolan relationships and most recent molecular studies. This finding provides information on the relationships of pest species that infest several crops in Malaysia and an estimate of the ordinal relationships within Holometabola. With the support of the phylogenetic tree, the larval stages of insect pests could be identified accurately without waiting for the emergence of adults.

  11. Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Riley, Robert; Salamov, Asaf; Otillar, Robert

    Fungi of the phylum Basidiomycota (basidiomycetes) make up some 37 percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes reveals that some 48 percent of basidiomycete proteins are unique to the phylum with nearly half of those (22 percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has ligninolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.

  12. Sex Determination, Sex Chromosomes, and Karyotype Evolution in Insects.

    PubMed

    Blackmon, Heath; Ross, Laura; Bachtrog, Doris

    2017-01-01

    Insects harbor a tremendous diversity of sex determining mechanisms both within and between groups. For example, in some orders such as Hymenoptera, all members are haplodiploid, whereas Diptera contain species with homomorphic as well as male and female heterogametic sex chromosome systems or paternal genome elimination. We have established a large database on karyotypes and sex chromosomes in insects, containing information on over 13000 species covering 29 orders of insects. This database constitutes a unique starting point to report phylogenetic patterns on the distribution of sex determination mechanisms, sex chromosomes, and karyotypes among insects and allows us to test general theories on the evolutionary dynamics of karyotypes, sex chromosomes, and sex determination systems in a comparative framework. Phylogenetic analysis reveals that male heterogamety is the ancestral mode of sex determination in insects, and transitions to female heterogamety are extremely rare. Many insect orders harbor species with complex sex chromosomes, and gains and losses of the sex-limited chromosome are frequent in some groups. Haplodiploidy originated several times within insects, and parthenogenesis is rare but evolves frequently. Providing a single source to electronically access data previously distributed among more than 500 articles and books will not only accelerate analyses of the assembled data, but also provide a unique resource to guide research on which taxa are likely to be informative to address specific questions, for example, for genome sequencing projects or large-scale comparative studies.

  13. Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase).

    PubMed

    Odronitz, Florian; Kollmar, Martin

    2006-11-29

    Annotation of protein sequences of eukaryotic organisms is crucial for the understanding of their function in the cell. Manual annotation is still by far the most accurate way to correctly predict genes. The classification of protein sequences, their phylogenetic relation and the assignment of function involves information from various sources. This often leads to a collection of heterogeneous data, which is hard to track. Cytoskeletal and motor proteins consist of large and diverse superfamilies comprising up to several dozen members per organism. To date there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families. Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database driven online working environment for the analysis of manually annotated protein sequences and their relationship. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations and sequencing projects as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows users to conveniently browse the database and to compile tabular and graphical summaries of its content. We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein.

  14. Component identification of electron transport chains in curdlan-producing Agrobacterium sp. ATCC 31749 and its genome-specific prediction using comparative genome and phylogenetic trees analysis.

    PubMed

    Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang

    2011-06-01

    Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. Partial DNA sequences of electron transport chain (ETC) protein genes were obtained in order to understand the components of the ETC and genomic specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. Partial DNA sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb3-type terminal oxidase and cytochrome caa3-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.

  15. Insights into the phylogeny of Northern Hemisphere Armillaria: Neighbor-net and Bayesian analyses of translation elongation factor 1-α gene sequences

    Treesearch

    Ned B. Klopfenstein; Jane E. Stewart; Yuko Ota; John W. Hanna; Bryce A. Richardson; Amy L. Ross-Davis; Ruben D. Elias-Roman; Kari Korhonen; Nenad Keca; Eugenia Iturritxa; Dionicio Alvarado-Rosales; Halvor Solheim; Nicholas J. Brazee; Piotr Lakomy; Michelle R. Cleary; Eri Hasegawa; Taisei Kikuchi; Fortunato Garza-Ocanas; Panaghiotis Tsopelas; Daniel Rigling; Simone Prospero; Tetyana Tsykun; Jean A. Berube; Franck O. P. Stefani; Saeideh Jafarpour; Vladimir Antonin; Michal Tomsovsky; Geral I. McDonald; Stephen Woodward; Mee-Sook Kim

    2017-01-01

    Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence–based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation...

  16. The Role of Edaphic Environment and Climate in Structuring Phylogenetic Pattern in Seasonally Dry Tropical Plant Communities

    PubMed Central

    Moro, Marcelo Freire; Silva, Igor Aurélio; de Araújo, Francisca Soares; Nic Lughadha, Eimear; Meagher, Thomas R.; Martins, Fernando Roberto

    2015-01-01

    Seasonally dry tropical plant formations (SDTF) are likely to exhibit phylogenetic clustering owing to niche conservatism driven by a strong environmental filter (water stress), but heterogeneous edaphic environments and life histories may result in heterogeneity in degree of phylogenetic clustering. We investigated phylogenetic patterns across ecological gradients related to water availability (edaphic environment and climate) in the Caatinga, a SDTF in Brazil. Caatinga is characterized by semiarid climate and three distinct edaphic environments – sedimentary, crystalline, and inselberg – representing a decreasing gradient in soil water availability. We used two measures of phylogenetic diversity: Net Relatedness Index based on the entire phylogeny among species present in a site, reflecting long-term diversification; and Nearest Taxon Index based on the tips of the phylogeny, reflecting more recent diversification. We also evaluated woody species in contrast to herbaceous species. The main climatic variable influencing phylogenetic pattern was precipitation in the driest quarter, particularly for herbaceous species, suggesting that environmental filtering related to minimal periods of precipitation is an important driver of Caatinga biodiversity, as one might expect for a SDTF. Woody species tended to show phylogenetic clustering whereas herbaceous species tended towards phylogenetic overdispersion. We also found phylogenetic clustering in two edaphic environments (sedimentary and crystalline) in contrast to phylogenetic overdispersion in the third (inselberg). We conclude that while niche conservatism is evident in phylogenetic clustering in the Caatinga, this is not a universal pattern likely due to heterogeneity in the degree of realized environmental filtering across edaphic environments. Thus, SDTF, in spite of a strong shared environmental filter, are potentially heterogeneous in phylogenetic structuring.
Our results support the need for scientifically informed conservation strategies in the Caatinga and other SDTF regions that have not previously been prioritized for conservation in order to take into account this heterogeneity. PMID:25798584
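
The two indices named in the abstract above are conventionally computed as standardized effect sizes of the mean pairwise distance (MPD, for the Net Relatedness Index) and the mean nearest taxon distance (MNTD, for the Nearest Taxon Index). The sketch below illustrates that calculation under stated assumptions and is not the authors' pipeline: the null model simply draws the same number of species at random from the pool, distances come from a user-supplied dictionary of pairwise phylogenetic distances, and all function names are hypothetical.

```python
import random
import statistics
from itertools import combinations
from typing import Callable, Dict, FrozenSet, Sequence

# Pairwise phylogenetic distances keyed by an unordered species pair.
Dist = Dict[FrozenSet[str], float]


def mpd(species: Sequence[str], dist: Dist) -> float:
    """Mean pairwise distance among all species in a community."""
    pairs = list(combinations(species, 2))
    return sum(dist[frozenset(p)] for p in pairs) / len(pairs)


def mntd(species: Sequence[str], dist: Dist) -> float:
    """Mean distance from each species to its nearest relative in the community."""
    nearest = [
        min(dist[frozenset((s, t))] for t in species if t != s) for s in species
    ]
    return sum(nearest) / len(nearest)


def standardized_index(
    metric: Callable[[Sequence[str], Dist], float],
    community: Sequence[str],
    pool: Sequence[str],
    dist: Dist,
    n_null: int = 999,
    seed: int = 0,
) -> float:
    """Return -(observed - null mean) / null sd.

    With metric=mpd this corresponds to NRI, with metric=mntd to NTI.
    Positive values indicate phylogenetic clustering, negative values
    overdispersion. Assumes the null values are not all identical.
    """
    rng = random.Random(seed)
    observed = metric(community, dist)
    null = [metric(rng.sample(list(pool), len(community)), dist) for _ in range(n_null)]
    return -(observed - statistics.mean(null)) / statistics.stdev(null)
```

As a usage sketch, `standardized_index(mpd, site_species, regional_pool, dist)` would give an NRI-style value for one site, and passing `mntd` instead would give the NTI-style value; other null models (for example, shuffling tip labels) would change the numbers.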

  17. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

    PubMed

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-09-02

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal, in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, along with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals.
These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal and could be useful in guiding the choice of phylogenetic markers.

  18. Change in phylogenetic community structure during succession of traditionally managed tropical rainforest in southwest China.

    PubMed

    Mo, Xiao-Xue; Shi, Ling-Ling; Zhang, Yong-Jiang; Zhu, Hua; Slik, J W Ferry

    2013-01-01

    Tropical rainforests in Southeast Asia are facing increasing and ever more intense human disturbance that often negatively affects biodiversity. The aim of this study was to determine how tree species phylogenetic diversity is affected by traditional forest management types and to understand the change in community phylogenetic structure during succession. Four types of forests with different management histories were selected for this purpose: old growth forests, understorey planted old growth forests, old secondary forests (∼200-years after slash and burn), and young secondary forests (15-50-years after slash and burn). We found that tree phylogenetic community structure changed from clustering to over-dispersion from early to late successional forests and finally became random in old-growth forest. We also found that the phylogenetic structure of the tree overstorey and understorey responded differentially to change in environmental conditions during succession. In addition, we show that slash and burn agriculture (swidden cultivation) can increase landscape level plant community evolutionary information content.

  19. Change in Phylogenetic Community Structure during Succession of Traditionally Managed Tropical Rainforest in Southwest China

    PubMed Central

    Mo, Xiao-Xue; Shi, Ling-Ling; Zhang, Yong-Jiang; Zhu, Hua; Slik, J. W. Ferry

    2013-01-01

    Tropical rainforests in Southeast Asia are facing increasing and ever more intense human disturbance that often negatively affects biodiversity. The aim of this study was to determine how tree species phylogenetic diversity is affected by traditional forest management types and to understand the change in community phylogenetic structure during succession. Four types of forests with different management histories were selected for this purpose: old growth forests, understorey planted old growth forests, old secondary forests (∼200-years after slash and burn), and young secondary forests (15–50-years after slash and burn). We found that tree phylogenetic community structure changed from clustering to over-dispersion from early to late successional forests and finally became random in old-growth forest. We also found that the phylogenetic structure of the tree overstorey and understorey responded differentially to change in environmental conditions during succession. In addition, we show that slash and burn agriculture (swidden cultivation) can increase landscape level plant community evolutionary information content. PMID:23936268

  20. Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences.

    PubMed

    Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd

    2017-01-26

    The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.

  1. Phylogenetic affinity of tree shrews to Glires is attributed to fast evolution rate.

    PubMed

    Lin, Jiannan; Chen, Guangfeng; Gu, Liang; Shen, Yuefeng; Zheng, Meizhu; Zheng, Weisheng; Hu, Xinjie; Zhang, Xiaobai; Qiu, Yu; Liu, Xiaoqing; Jiang, Cizhong

    2014-02-01

    Previous phylogenetic analyses have led to incongruent evolutionary relationships between tree shrews and other suborders of Euarchontoglires. What caused the incongruence remains elusive. In this study, we identified 6845 orthologous genes among seventeen placental mammals. Tree shrews and Primates were monophyletic in the phylogenetic trees derived from the first and/or second codon positions whereas tree shrews and Glires formed a monophyly in the trees derived from the third or all codon positions. The same topology was obtained in the phylogeny inference using the slowly and fast evolving genes, respectively. This incongruence was likely attributed to the fast substitution rate in tree shrews and Glires. Notably, sequence GC content alone was not informative to resolve the controversial phylogenetic relationships between tree shrews, Glires, and Primates. Finally, estimation in the confidence of the tree selection strongly supported the phylogenetic affiliation of tree shrews to Primates as a monophyly.

  2. Phylogenetic relatedness and leaf functional traits, not introduced status, influence community assembly.

    PubMed

    Lemoine, Nathan P; Shue, Jessica; Verrico, Brittany; Erickson, David; Kress, W John; Parker, John D

    2015-10-01

    Considerable debate focuses on whether invasive species establish and become abundant by being functionally and phylogenetically distinct from native species, leading to a host of invasion-specific hypotheses of community assembly. Few studies, however, have quantitatively assessed whether similar patterns of phylogenetic and functional similarity explain local abundance of both native and introduced species, which would suggest similar assembly mechanisms regardless of origin. Using a chronosequence of invaded temperate forest stands, we tested whether the occurrence and abundance of both introduced and native species were predicted by phylogenetic relatedness, functional overlap, and key environmental characteristics including forest age. Environmental filtering against functionally and phylogenetically distinct species strongly dictated the occurrence and abundance of both introduced and native species, with slight modifications of these patterns according to forest age. Thus, once functional and evolutionary novelty were quantified, introduced status provided little information about species' presence or abundance, indicating largely similar sorting mechanisms for both native and introduced species.

  3. The Independent Evolution Method Is Not a Viable Phylogenetic Comparative Method

    PubMed Central

    2015-01-01

    Phylogenetic comparative methods (PCMs) use data on species traits and phylogenetic relationships to shed light on evolutionary questions. Recently, Smaers and Vinicius suggested a new PCM, Independent Evolution (IE), which purportedly employs a novel model of evolution based on Felsenstein’s Adaptive Peak Model. The authors found that IE improves upon previous PCMs by producing more accurate estimates of ancestral states, as well as separate estimates of evolutionary rates for each branch of a phylogenetic tree. Here, we document substantial theoretical and computational issues with IE. When data are simulated under a simple Brownian motion model of evolution, IE produces severely biased estimates of ancestral states and changes along individual branches. We show that these branch-specific changes are essentially ancestor-descendant or “directional” contrasts, and draw parallels between IE and previous PCMs such as “minimum evolution”. Additionally, while comparisons of branch-specific changes between variables have been interpreted as reflecting the relative strength of selection on those traits, we demonstrate through simulations that regressing IE estimated branch-specific changes against one another gives a biased estimate of the scaling relationship between these variables, and provides no advantages or insights beyond established PCMs such as phylogenetically independent contrasts. In light of our findings, we discuss the results of previous papers that employed IE. We conclude that Independent Evolution is not a viable PCM, and should not be used in comparative analyses. PMID:26683838
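
    The established method that this abstract sets against IE, phylogenetically independent contrasts, can be sketched in a few lines. The following is a minimal illustration of Felsenstein's standard pruning recursion, not the authors' simulation code. The nested-tuple tree encoding, the function name `independent_contrasts`, and the example trait values are assumptions made here for illustration, and every branch below an internal node is assumed to have positive length.

```python
import math


def independent_contrasts(tree, tip_values):
    """Standardized independent contrasts on a bifurcating tree.

    Hypothetical encoding: a tip is (name, branch_length); an internal
    node is ((left, right), branch_length). Returns (contrasts, root_estimate).
    """
    contrasts = []

    def visit(node):
        label, length = node
        if isinstance(label, str):  # tip: observed trait value
            return tip_values[label], length
        (x1, v1), (x2, v2) = visit(label[0]), visit(label[1])
        # Contrast between the two daughters, scaled by its expected
        # standard deviation under Brownian motion.
        contrasts.append((x1 - x2) / math.sqrt(v1 + v2))
        # Weighted average of the daughters estimates the node value.
        estimate = (x1 / v1 + x2 / v2) / (1 / v1 + 1 / v2)
        # Lengthen the node's own branch to reflect estimation error.
        return estimate, length + v1 * v2 / (v1 + v2)

    root_value, _ = visit(tree)
    return contrasts, root_value


# Illustrative three-taxon tree ((A:1, B:1):1, C:2) with made-up trait values.
example_tree = ((((("A", 1.0), ("B", 1.0)), 1.0), ("C", 2.0)), 0.0)
example_contrasts, example_root = independent_contrasts(
    example_tree, {"A": 1.0, "B": 3.0, "C": 6.0}
)
print(example_contrasts, example_root)
```

    A tree with n tips yields n - 1 contrasts, which is why regressions on contrasts are usually forced through the origin.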

  4. Assessing the influence of biogeographical region and phylogenetic history on chemical defences and herbivory in Quercus species.

    PubMed

    Moreira, Xoaquín; Abdala-Roberts, Luis; Galmán, Andrea; Francisco, Marta; Fuente, María de la; Butrón, Ana; Rasmann, Sergio

    2018-06-07

    Biogeographical factors and phylogenetic history are key determinants of inter-specific variation in plant defences. However, few studies have conducted broad-scale geographical comparisons of plant defences while controlling for phylogenetic relationships, and, in doing so, none have separated constitutive from induced defences. This gap has limited our understanding of how historical or large-scale processes mediate biogeographical patterns in plant defences since these may be contingent upon shared evolutionary history and phylogenetic constraints. We conducted a phylogenetically-controlled experiment testing for differences in constitutive leaf chemical defences and their inducibility between Palearctic and Nearctic oak species (Quercus, total 18 species). We induced defences in one-year old plants by inflicting damage by gypsy moth larvae (Lymantria dispar), estimated the amount of leaf area consumed, and quantified various groups of phenolic compounds. There was no detectable phylogenetic signal for constitutive or induced levels of most defensive traits except for constitutive condensed tannins, as well as no phylogenetic signal in leaf herbivory. We did, however, find marked differences in defence levels between oak species from each region: Palearctic species had higher levels of constitutive condensed tannins, but less constitutive lignins and less constitutive and induced hydrolysable tannins compared with Nearctic species. Additionally, Palearctic species had lower levels of leaf damage compared with Nearctic species. These differences in leaf damage, lignins and hydrolysable (but not condensed) tannins were lost after accounting for phylogeny, suggesting that geographical structuring of phylogenetic relationships mediated biogeographical differences in defences and herbivore resistance. 
    Together, these findings suggest that historical processes and large-scale drivers have shaped differences in allocation to constitutive defences (and in turn resistance) between Palearctic and Nearctic oaks. Moreover, although evidence of phylogenetic conservatism in the studied traits is rather weak, shared evolutionary history appears to mediate some of these biogeographical patterns in allocation to chemical defences. Copyright © 2018 Elsevier Ltd. All rights reserved.

  5. Phylogeny of the Genus Drosophila

    PubMed Central

    O’Grady, Patrick M.; DeSalle, Rob

    2018-01-01

    Understanding phylogenetic relationships among taxa is key to designing and implementing comparative analyses. The genus Drosophila, which contains over 1600 species, is one of the most important model systems in the biological sciences. For over a century, one species in this group, Drosophila melanogaster, has been key to studies of animal development and genetics, genome organization and evolution, and human disease. As whole-genome sequencing becomes more cost-effective, there is increasing interest in other members of this morphologically, ecologically, and behaviorally diverse genus. Phylogenetic relationships within Drosophila are complicated, and the goal of this paper is to provide a review of the recent taxonomic changes and phylogenetic relationships in this genus to aid in further comparative studies. PMID:29716983

  6. Recalcitrant deep and shallow nodes in Aristolochia (Aristolochiaceae) illuminated using anchored hybrid enrichment.

    PubMed

    Wanke, Stefan; Granados Mendoza, Carolina; Müller, Sebastian; Paizanni Guillén, Anna; Neinhuis, Christoph; Lemmon, Alan R; Lemmon, Emily Moriarty; Samain, Marie-Stéphanie

    2017-12-01

    Recalcitrant relationships are characterized by very short internodes that can be found among shallow and deep phylogenetic scales all over the tree of life. Adding large amounts of presumably informative sequences, while decreasing systematic error, has been suggested as a possible approach to increase phylogenetic resolution. The development of enrichment strategies, coupled with next generation sequencing, resulted in a cost-effective way to facilitate the reconstruction of recalcitrant relationships. By applying the anchored hybrid enrichment (AHE) genome partitioning strategy to Aristolochia using a universal angiosperm probe set, we obtained 231-233 out of 517 single or low copy nuclear loci originally contained in the enrichment kit, resulting in a total alignment length of 154,756 bp to 160,150 bp. Since Aristolochia (Piperales; magnoliids) is distantly related to any angiosperm species whose genome has been used for the plant AHE probe design (Amborella trichopoda being the closest), it serves as a proof of universality for this probe set. Aristolochia comprises approximately 500 species grouped in several clades (OTUs), whose relationships to each other are partially unknown. Previous phylogenetic studies have shown that these lineages branched deep in time and in quick succession, seen as short-deep internodes. Short-shallow internodes are also characteristic of some Aristolochia lineages such as Aristolochia subsection Pentandrae, a clade of presumably recent diversification. This subsection is here included to test the performance of AHE at species level. Filtering and subsampling loci using the phylogenetic informativeness method resolves several recalcitrant phylogenetic relationships within Aristolochia. By assuming different ploidy levels during bioinformatics processing of raw data, first hints are obtained that polyploidization contributed to the evolution of Aristolochia.
    Phylogenetic results are discussed in the light of current systematics and morphology. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  7. Phenotypic Microdiversity and Phylogenetic Signal Analysis of Traits Related to Social Interaction in Bacillus spp. from Sediment Communities.

    PubMed

    Rodríguez-Torres, María Dolores; Islas-Robles, África; Gómez-Lunar, Zulema; Delaye, Luis; Hernández-González, Ismael; Souza, Valeria; Travisano, Michael; Olmedo-Álvarez, Gabriela

    2017-01-01

    Understanding the relationship between phylogeny and predicted traits is important to uncover the dimension of the predictive power of a microbial composition approach. Numerous works have addressed the taxonomic composition of bacteria in communities, but little is known about trait heterogeneity in closely related bacteria that co-occur in communities. We evaluated a sample of 467 isolates from the Churince water system of the Cuatro Cienegas Basin (CCB), enriched for Bacillus spp. The 16S rRNA gene revealed a random distribution of taxonomic groups within this genus among 11 sampling sites. A subsample of 141 Bacillus spp. isolates from sediment, with seven well-represented species, was chosen to evaluate the heterogeneity and the phylogenetic signal of phenotypic traits that are known to diverge within small clades, such as substrate utilization, and traits that are conserved deep in the lineage, such as prototrophy, swarming and biofilm formation. We were especially interested in evaluating social traits, such as swarming and biofilm formation, for which cooperation is needed to accomplish a multicellular behavior and for which there is little information from natural communities. The phylogenetic distribution of traits, evaluated with Purvis and Fritz's D statistic, approached a Brownian model of evolution. Analysis of the phylogenetic relatedness of the clusters of members sharing the trait using the consenTRAIT algorithm revealed more clustering and deeper phylogenetic signal for prototrophy, biofilm and swimming compared to the data obtained for substrate utilization. The explanation for the observed Brownian evolution of social traits could be either loss due to complete dispensability or compensated trait loss due to the availability of public goods.
    Since many of the evaluated traits can be considered to be collective action traits, such as swarming, motility and biofilm formation, the observed microdiversity within taxonomic groups might be explained by distributed functions in structured communities.

  8. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA.

    PubMed

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R

    2009-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.

  9. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA

    PubMed Central

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R.

    2010-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure. PMID:20502534

  10. Utilizing novel diversity estimators to quantify multiple dimensions of microbial biodiversity across domains

    PubMed Central

    2013-01-01

    Background: Microbial ecologists often employ methods from classical community ecology to analyze microbial community diversity. However, these methods have limitations because microbial communities differ from macro-organismal communities in key ways. This study sought to quantify microbial diversity using methods that are better suited for data spanning multiple domains of life and dimensions of diversity. Diversity profiles are one novel, promising way to analyze microbial datasets. Diversity profiles encompass many other indices, provide effective numbers of diversity (mathematical generalizations of previous indices that better convey the magnitude of differences in diversity), and can incorporate taxa similarity information. To explore whether these profiles change interpretations of microbial datasets, diversity profiles were calculated for four microbial datasets from different environments spanning all domains of life as well as viruses. Both similarity-based profiles that incorporated phylogenetic relatedness and naïve (not similarity-based) profiles were calculated. Simulated datasets were used to examine the robustness of diversity profiles to varying phylogenetic topology and community composition. Results: Diversity profiles provided insights into microbial datasets that were not detectable with classical univariate diversity metrics. For all datasets analyzed, there were key distinctions between calculations that incorporated phylogenetic diversity as a measure of taxa similarity and naïve calculations. The profiles also provided information about the effects of rare species on diversity calculations. Additionally, diversity profiles were used to examine thousands of simulated microbial communities, showing that similarity-based and naïve diversity profiles only agreed approximately 50% of the time in their classification of which sample was most diverse.
    This is a strong argument for incorporating similarity information and calculating diversity with a range of emphases on rare and abundant species when quantifying microbial community diversity. Conclusions: For many datasets, diversity profiles provided a different view of microbial community diversity compared to analyses that did not take into account taxa similarity information, effective diversity, or multiple diversity metrics. These findings are a valuable contribution to data analysis methodology in microbial ecology. PMID:24238386
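
    The idea of a diversity profile can be illustrated with a short sketch. It assumes the Hill-number formulation of effective diversity and its similarity-sensitive extension in the style of Leinster and Cobbold, which the abstract does not name explicitly. The function name `diversity`, the similarity matrix `Z`, and the example community are hypothetical.

```python
import math


def diversity(p, q, Z=None):
    """Effective number of taxa of order q for relative abundances p.

    Z is an optional taxon-by-taxon similarity matrix (1.0 on the
    diagonal). With Z omitted, taxa are treated as fully distinct,
    which gives the naive (not similarity-based) Hill number.
    """
    n = len(p)
    if Z is None:
        Z = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    # "Ordinariness" of each taxon: abundance of everything similar to it.
    ordinariness = [sum(Z[i][j] * p[j] for j in range(n)) for i in range(n)]
    present = [(pi, zi) for pi, zi in zip(p, ordinariness) if pi > 0]
    if abs(q - 1.0) < 1e-12:  # q = 1 is defined as a limit
        return math.exp(-sum(pi * math.log(zi) for pi, zi in present))
    return sum(pi * zi ** (q - 1) for pi, zi in present) ** (1 / (1 - q))


# A profile is the diversity evaluated over a range of q values:
# low q emphasizes rare taxa, high q emphasizes abundant ones.
abundances = [0.25, 0.25, 0.5]
similarity = [[1.0, 1.0, 0.0],  # taxa 0 and 1 treated as identical
              [1.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]]
for q in (0, 1, 2):
    print(q, diversity(abundances, q), diversity(abundances, q, similarity))
```

    In this toy community the naive profile counts three taxa at q = 0, whereas the similarity-based profile counts two, since the first two taxa are declared identical.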

  11. Tanglegrams for rooted phylogenetic trees and networks

    PubMed Central

    Scornavacca, Celine; Zickmann, Franziska; Huson, Daniel H.

    2011-01-01

    Motivation: In systematic biology, one is often faced with the task of comparing different phylogenetic trees, in particular in multi-gene analysis or cospeciation studies. One approach is to use a tanglegram in which two rooted phylogenetic trees are drawn opposite each other, using auxiliary lines to connect matching taxa. There is an increasing interest in using rooted phylogenetic networks to represent evolutionary history, so as to explicitly represent reticulate events, such as horizontal gene transfer, hybridization or reassortment. Thus, the question arises how to define and compute a tanglegram for such networks. Results: In this article, we present the first formal definition of a tanglegram for rooted phylogenetic networks and present a heuristic approach for computing one, called the NN-tanglegram method. We compare the performance of our method with existing tree tanglegram algorithms and also show a typical application to real biological datasets. For maximum usability, the algorithm does not require that the trees or networks are bifurcating or bicombining, or that they are on identical taxon sets. Availability: The algorithm is implemented in our program Dendroscope 3, which is freely available from www.dendroscope.org. Contact: scornava@informatik.uni-tuebingen.de; huson@informatik.uni-tuebingen.de PMID:21685078
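
    The quantity that tanglegram layouts typically try to keep small, the number of crossings among connecting lines, can be sketched as follows. This is not the NN-tanglegram heuristic and not Dendroscope code. It assumes two fixed leaf orderings with at most one connector per taxon, and the function name `count_crossings` is hypothetical.

```python
def count_crossings(left_order, right_order):
    """Count crossing connectors between two drawn leaf orderings.

    Two connectors cross exactly when their taxa appear in opposite
    relative order on the two sides, so the count is the number of
    inversions. Taxa present on only one side have no connector and
    are ignored.
    """
    position = {taxon: i for i, taxon in enumerate(right_order)}
    ranks = [position[t] for t in left_order if t in position]
    return sum(
        1
        for i in range(len(ranks))
        for j in range(i + 1, len(ranks))
        if ranks[i] > ranks[j]
    )


print(count_crossings(["a", "b", "c", "d"], ["a", "c", "b", "d"]))
```

    A layout heuristic would search over the leaf orderings that the two trees or networks permit (for trees, by rotating subtrees at internal nodes) and keep the pair with the fewest crossings.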

  12. From symmetry to asymmetry: Phylogenetic patterns of asymmetry variation in animals and their evolutionary significance

    PubMed Central

    Palmer, A. Richard

    1996-01-01

    Phylogenetic analyses of asymmetry variation offer a powerful tool for exploring the interplay between ontogeny and evolution because (i) conspicuous asymmetries exist in many higher metazoans with widely varying modes of development, (ii) patterns of bilateral variation within species may identify genetically and environmentally triggered asymmetries, and (iii) asymmetries arising at different times during development may be more sensitive to internal cytoplasmic inhomogeneities compared to external environmental stimuli. Using four broadly comparable asymmetry states (symmetry, antisymmetry, dextral, and sinistral), and two stages at which asymmetry appears developmentally (larval and postlarval), I evaluated relations between ontogenetic and phylogenetic patterns of asymmetry variation. Among 140 inferred phylogenetic transitions between asymmetry states, recorded from 11 classes in five phyla, directional asymmetry (dextral or sinistral) evolved directly from symmetrical ancestors proportionally more frequently among larval asymmetries. In contrast, antisymmetry, either as an end state or as a transitional stage preceding directional asymmetry, was confined primarily to postlarval asymmetries. The ontogenetic origin of asymmetry thus significantly influences its subsequent evolution. Furthermore, because antisymmetry typically signals an environmentally triggered asymmetry, the phylogenetic transition from antisymmetry to directional asymmetry suggests that many cases of laterally fixed asymmetries evolved via genetic assimilation. PMID:8962039

  13. Cophenetic metrics for phylogenetic trees, after Sokal and Rohlf.

    PubMed

    Cardona, Gabriel; Mir, Arnau; Rosselló, Francesc; Rotger, Lucía; Sánchez, David

    2013-01-16

    Phylogenetic tree comparison metrics are an important tool in the study of evolution, and hence the definition of such metrics is an interesting problem in phylogenetics. In a paper in Taxon fifty years ago, Sokal and Rohlf proposed to measure quantitatively the difference between a pair of phylogenetic trees by first encoding them by means of their half-matrices of cophenetic values, and then comparing these matrices. This idea has been used several times since then to define dissimilarity measures between phylogenetic trees but, to our knowledge, no proper metric on weighted phylogenetic trees with nested taxa based on this idea has been formally defined and studied yet. Actually, the cophenetic values of pairs of different taxa alone are not enough to single out phylogenetic trees with weighted arcs or nested taxa. For every (rooted) phylogenetic tree $T$, let its cophenetic vector $\varphi(T)$ consist of all pairs of cophenetic values between pairs of taxa in $T$ and all depths of taxa in $T$. It turns out that these cophenetic vectors single out weighted phylogenetic trees with nested taxa. We then define a family of cophenetic metrics $d_{\varphi,p}$ by comparing these cophenetic vectors by means of $L^p$ norms, and we study, either analytically or numerically, some of their basic properties: neighbors, diameter, distribution, and their rank correlation with each other and with other metrics. The cophenetic metrics can be safely used on weighted phylogenetic trees with nested taxa and no restriction on degrees, and they can be computed in $O(n^2)$ time, where $n$ stands for the number of taxa. The metrics $d_{\varphi,1}$ and $d_{\varphi,2}$ have positive skewed distributions, and they show a low rank correlation with the Robinson-Foulds metric and the nodal metrics, and a very high correlation with each other and with the splitted nodal metrics. The diameter of $d_{\varphi,p}$, for $p \geq 1$, is in $O(n^{(p+2)/p})$, and thus for low $p$ they are more discriminative, having a wider range of values.
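
    The construction described in this abstract can be sketched directly from its definitions: the cophenetic value of a pair of taxa is taken as the depth of their lowest common ancestor, the cophenetic vector collects these values together with the depths of the taxa themselves, and two trees are compared by the p-norm of the difference of their vectors. The parent-map tree encoding and the function names below are assumptions for illustration. The sketch favors clarity over the quadratic running time mentioned in the abstract.

```python
from itertools import combinations_with_replacement


def cophenetic_vector(parent, taxa):
    """Cophenetic vector of a rooted tree.

    Hypothetical encoding: `parent` maps every non-root node to a pair
    (its parent, weight of the connecting arc).
    """
    def path_to_root(node):
        path = [node]
        while path[-1] in parent:
            path.append(parent[path[-1]][0])
        return path

    def depth(node):
        # Sum of arc weights from the root down to `node`.
        return sum(parent[n][1] for n in path_to_root(node) if n in parent)

    def lowest_common_ancestor(a, b):
        ancestors_of_a = set(path_to_root(a))
        return next(n for n in path_to_root(b) if n in ancestors_of_a)

    # Pairs (a, a) contribute the depth of taxon a itself.
    return [
        depth(lowest_common_ancestor(a, b))
        for a, b in combinations_with_replacement(sorted(taxa), 2)
    ]


def cophenetic_distance(parent1, parent2, taxa, p=1):
    """p-norm of the difference between two cophenetic vectors."""
    v1 = cophenetic_vector(parent1, taxa)
    v2 = cophenetic_vector(parent2, taxa)
    return sum(abs(x - y) ** p for x, y in zip(v1, v2)) ** (1 / p)


# Two unit-weight trees on the same taxa: ((a,b),c) and (a,(b,c)).
tree1 = {"a": ("x", 1), "b": ("x", 1), "x": ("r", 1), "c": ("r", 1)}
tree2 = {"b": ("y", 1), "c": ("y", 1), "y": ("r", 1), "a": ("r", 1)}
print(cophenetic_distance(tree1, tree2, "abc", p=1))
print(cophenetic_distance(tree1, tree2, "abc", p=2))
```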

  14. Comparative genomic and phylogenetic investigation of the xenobiotic metabolizing arylamine N-acetyltransferase enzyme family

    USDA-ARS's Scientific Manuscript database

    Arylamine N-acetyltransferases (NATs) are xenobiotic metabolizing enzymes characterized in several bacteria and eukaryotic organisms. We report a comprehensive phylogenetic analysis employing an exhaustive dataset of NAT-homologous sequences recovered through inspection of 2445 genomes. We describe ...

  15. Measures of phylogenetic differentiation provide robust and complementary insights into microbial communities.

    PubMed

    Parks, Donovan H; Beiko, Robert G

    2013-01-01

    High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.

  16. Characterization and phylogenetic analysis of the swine leukocyte antigen 3 gene from Korean native pigs.

    PubMed

    Chung, H Y; Choi, Y C; Park, H N

    2015-05-18

    We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.

  17. The comparative ecology and biogeography of parasites

    PubMed Central

    Poulin, Robert; Krasnov, Boris R.; Mouillot, David; Thieltges, David W.

    2011-01-01

    Comparative ecology uses interspecific relationships among traits, while accounting for the phylogenetic non-independence of species, to uncover general evolutionary processes. Applied to biogeographic questions, it can be a powerful tool to explain the spatial distribution of organisms. Here, we review how comparative methods can elucidate biogeographic patterns and processes, using analyses of distributional data on parasites (fleas and helminths) as case studies. Methods exist to detect phylogenetic signals, i.e. the degree of phylogenetic dependence of a given character, and either to control for these signals in statistical analyses of interspecific data, or to measure their contribution to variance. Parasite–host interactions present a special case, as a given trait may be a parasite trait, a host trait or a property of the coevolved association rather than of one participant only. For some analyses, it is therefore necessary to correct simultaneously for both parasite phylogeny and host phylogeny, or to evaluate which has the greatest influence on trait expression. Using comparative approaches, we show that two fundamental properties of parasites, their niche breadth, i.e. host specificity, and the nature of their life cycle, can explain interspecific and latitudinal variation in the sizes of their geographical ranges, or rates of distance decay in the similarity of parasite communities. These findings illustrate the ways in which phylogenetically based comparative methods can contribute to biogeographic research. PMID:21768153

  18. Do Branch Lengths Help to Locate a Tree in a Phylogenetic Network?

    PubMed

    Gambette, Philippe; van Iersel, Leo; Kelk, Steven; Pardi, Fabio; Scornavacca, Celine

    2016-09-01

    Phylogenetic networks are increasingly used in evolutionary biology to represent the history of species that have undergone reticulate events such as horizontal gene transfer, hybrid speciation and recombination. One of the most fundamental questions that arise in this context is whether the evolution of a gene with one copy in all species can be explained by a given network. In mathematical terms, this is often translated in the following way: is a given phylogenetic tree contained in a given phylogenetic network? Recently this tree containment problem has been widely investigated from a computational perspective, but most studies have only focused on the topology of the phylogenies, ignoring a piece of information that, in the case of phylogenetic trees, is routinely inferred by evolutionary analyses: branch lengths. These measure the amount of change (e.g., nucleotide substitutions) that has occurred along each branch of the phylogeny. Here, we study a number of versions of the tree containment problem that explicitly account for branch lengths. We show that, although length information has the potential to locate more precisely a tree within a network, the problem is computationally hard in its most general form. On a positive note, for a number of special cases of biological relevance, we provide algorithms that solve this problem efficiently. This includes the case of networks of limited complexity, for which it is possible to recover, among the trees contained by the network with the same topology as the input tree, the closest one in terms of branch lengths.

  19. Plastid Phylogenomics Resolve Deep Relationships among Eupolypod II Ferns with Rapid Radiation and Rate Heterogeneity

    PubMed Central

    Wei, Ran; Yan, Yue-Hong; Harris, AJ; Kang, Jong-Soo; Shen, Hui; Zhang, Xian-Chun

    2017-01-01

    The eupolypods II ferns represent a classic case of evolutionary radiation and, simultaneously, exhibit high substitution rate heterogeneity. These factors have been proposed to contribute to the contentious resolutions among clades within this fern group in multilocus phylogenetic studies. We investigated the deep phylogenetic relationships of eupolypod II ferns by sampling all major families and using 40 plastid genomes, or plastomes, of which 33 were newly sequenced with next-generation sequencing technology. We performed model-based analyses to evaluate the diversity of molecular evolutionary rates for these ferns. Our plastome data, with more than 26,000 informative characters, yielded good resolution for deep relationships within eupolypods II and unambiguously clarified the position of Rhachidosoraceae and the monophyly of Athyriaceae. Results of rate heterogeneity analysis revealed approximately 33 significant rate shifts in eupolypod II ferns, with the most heterogeneous rates (both accelerations and decelerations) occurring in two phylogenetically difficult lineages, that is, the Rhachidosoraceae–Aspleniaceae and Athyriaceae clades. These observations support the hypothesis that rate heterogeneity has previously constrained the deep phylogenetic resolution in eupolypods II. According to the plastome data, we propose that 14 chloroplast markers are particularly phylogenetically informative for eupolypods II both at the familial and generic levels. Our study demonstrates the power of a character-rich plastome data set and high-throughput sequencing for resolving the recalcitrant lineages, which have undergone rapid evolutionary radiation and dramatic changes in substitution rates. PMID:28854625

  20. Phylogenetic analysis of human immunodeficiency virus type 2 isolated from Cuban individuals.

    PubMed

    Machado, Liuber Y; Díaz, Héctor M; Noa, Enrique; Martín, Dayamí; Blanco, Madeline; Díaz, Dervel F; Sánchez, Yordank R; Nibot, Carmen; Sánchez, Lourdes; Dubed, Marta

    2014-08-01

    The presence of infection by human immunodeficiency virus type 2 (HIV-2) in Cuba has been previously documented. However, genetic information on the strains that circulate in the Cuban people is still unknown. The present work constitutes the first study concerning the phylogenetic relationship of HIV-2 Cuban isolates conducted on 13 Cuban patients who were diagnosed with HIV-2. The env sequences were analyzed for the construction of a phylogenetic tree with reference sequences of HIV-2. Phylogenetic analysis of the env gene showed that all the Cuban sequences clustered in group A of HIV-2. The analysis indicated several independent introductions of HIV-2 into Cuba. The results of the study will reinforce the program on the epidemiological surveillance of the infection in Cuba and make possible further molecular evolutionary studies.

  1. More on the Best Evolutionary Rate for Phylogenetic Analysis

    PubMed Central

    Massingham, Tim; Goldman, Nick

    2017-01-01

    Abstract The accumulation of genome-scale molecular data sets for nonmodel taxa brings us ever closer to resolving the tree of life of all living organisms. However, despite the depth of data available, a number of studies that each used thousands of genes have reported conflicting results. The focus of phylogenomic projects must thus shift to more careful experimental design. Even though we still have a limited understanding of what are the best predictors of the phylogenetic informativeness of a gene, there is wide agreement that one key factor is its evolutionary rate; but there is no consensus as to whether the rates derived as optimal in various analytical, empirical, and simulation approaches have any general applicability. We here use simulations to infer optimal rates in a set of realistic phylogenetic scenarios with varying tree sizes, numbers of terminals, and tree shapes. Furthermore, we study the relationship between the optimal rate and rate variation among sites and among lineages. Finally, we examine how well the predictions made by a range of experimental design methods correlate with the observed performance in our simulations. We find that the optimal level of divergence is surprisingly robust to differences in taxon sampling and even to among-site and among-lineage rate variation as often encountered in empirical data sets. This finding encourages the use of methods that rely on a single optimal rate to predict a gene’s utility. Focusing on correct recovery either of the most basal node in the phylogeny or of the entire topology, the optimal rate is about 0.45 substitutions from root to tip in average Yule trees and about 0.2 in difficult trees with short basal and long-apical branches, but all rates leading to divergence levels between about 0.1 and 0.5 perform reasonably well. 
Testing the performance of six methods that can be used to predict a gene’s utility against our simulation results, we find that the probability of resolution, signal-noise analysis, and Fisher information are good predictors of phylogenetic informativeness, but they require specification of at least part of a model tree. Likelihood quartet mapping also shows very good performance but only requires sequence alignments and is thus applicable without making assumptions about the phylogeny. Despite them being the most commonly used methods for experimental design, geometric quartet mapping and the integration of phylogenetic informativeness curves perform rather poorly in our comparison. Instead of derived predictors of phylogenetic informativeness, we suggest that the number of sites in a gene that evolve at near-optimal rates (as inferred here) could be used directly to prioritize genes for phylogenetic inference. In combination with measures of model fit, especially with respect to compositional biases and among-site and among-lineage rate variation, such an approach has the potential to greatly improve marker choice and should be tested on empirical data. PMID:28595363
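
    The suggestion above, to prioritize genes directly by the number of sites evolving at near-optimal rates, can be sketched as follows. This is a minimal illustration and not the authors' procedure: it assumes per-site divergence estimates (expected substitutions from root to tip) are already available from some other tool, and it borrows the 0.1 to 0.5 range quoted in the abstract as the "near-optimal" window. The function names and the toy values are hypothetical.

```python
def count_near_optimal_sites(site_divergences, low=0.1, high=0.5):
    """Count sites whose root-to-tip divergence falls inside [low, high].

    The default window is the range the abstract reports as performing
    reasonably well; treating it as a hard cutoff is an assumption.
    """
    return sum(low <= d <= high for d in site_divergences)


def rank_genes(genes, low=0.1, high=0.5):
    """Return gene names sorted by their number of near-optimal sites.

    `genes` maps a gene name to a list of per-site divergence estimates
    (hypothetical input format).
    """
    return sorted(
        genes,
        key=lambda name: count_near_optimal_sites(genes[name], low, high),
        reverse=True,
    )


# Toy data, invented for illustration only.
toy_genes = {
    "geneA": [0.05, 0.20, 0.45, 0.90],
    "geneB": [0.30, 0.40, 0.20, 0.15],
    "geneC": [1.20, 0.01, 0.02, 0.60],
}
ranking = rank_genes(toy_genes)
```

    In practice such a ranking would be combined with measures of model fit, as the abstract notes; the count alone says nothing about compositional bias.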

  2. COI (cytochrome oxidase-I) sequence based studies of Carangid fishes from Kakinada coast, India.

    PubMed

    Persis, M; Chandra Sekhar Reddy, A; Rao, L M; Khedkar, G D; Ravinder, K; Nasruddin, K

    2009-09-01

    Mitochondrial DNA, cytochrome oxidase-1 gene sequences were analyzed for species identification and phylogenetic relationship among the very high food value and commercially important Indian carangid fish species. Sequence analysis of COI gene very clearly indicated that all the 28 fish species fell into five distinct groups, which are genetically distant from each other and exhibited identical phylogenetic reservation. All the COI gene sequences from 28 fishes provide sufficient phylogenetic information and evolutionary relationship to distinguish the carangid species unambiguously. This study proves the utility of mtDNA COI gene sequence based approach in identifying fish species at a faster pace.

  3. Ribosomal RNA: a key to phylogeny

    NASA Technical Reports Server (NTRS)

    Olsen, G. J.; Woese, C. R.

    1993-01-01

    As molecular phylogeny increasingly shapes our understanding of organismal relationships, no molecule has been applied to more questions than have ribosomal RNAs. We review this role of the rRNAs and some of the insights that have been gained from them. We also offer some of the practical considerations in extracting the phylogenetic information from the sequences. Finally, we stress the importance of comparing results from multiple molecules, both as a method for testing the overall reliability of the organismal phylogeny and as a method for more broadly exploring the history of the genome.

  4. Phylogenetic Structure of Tree Species across Different Life Stages from Seedlings to Canopy Trees in a Subtropical Evergreen Broad-Leaved Forest.

    PubMed

    Jin, Yi; Qian, Hong; Yu, Mingjian

    2015-01-01

    Investigating patterns of phylogenetic structure across different life stages of tree species in forests is crucial to understanding forest community assembly, and investigating forest gap influence on the phylogenetic structure of forest regeneration is necessary for understanding forest community assembly. Here, we examine the phylogenetic structure of tree species across life stages from seedlings to canopy trees, as well as forest gap influence on the phylogenetic structure of forest regeneration in a forest of the subtropical region in China. We investigate changes in phylogenetic relatedness (measured as NRI) of tree species from seedlings, saplings, treelets to canopy trees; we compare the phylogenetic turnover (measured as βNRI) between canopy trees and seedlings in forest understory with that between canopy trees and seedlings in forest gaps. We found that phylogenetic relatedness generally increases from seedlings through saplings and treelets up to canopy trees, and that phylogenetic relatedness does not differ between seedlings in forest understory and those in forest gaps, but phylogenetic turnover between canopy trees and seedlings in forest understory is lower than that between canopy trees and seedlings in forest gaps. We conclude that tree species tend to be more closely related from seedling to canopy layers, and that forest gaps alter the seedling phylogenetic turnover of the studied forest. It is likely that the increasing trend of phylogenetic clustering as tree stem size increases observed in this subtropical forest is primarily driven by abiotic filtering processes, which select a set of closely related evergreen broad-leaved tree species whose regeneration has adapted to the closed canopy environments of the subtropical forest developed under the regional monsoon climate.
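
    For readers unfamiliar with the metric, NRI (net relatedness index) is commonly computed as the standardized effect size of the mean pairwise phylogenetic distance (MPD) among co-occurring species, with the sign flipped so that positive values indicate phylogenetic clustering. The sketch below illustrates that calculation under stated assumptions: the null model draws species at random from the pool, which may differ from the null model used in the study, and the distance matrix and species names are invented.

```python
import random
from itertools import combinations
from statistics import mean, stdev


def mean_pairwise_distance(taxa, dist):
    """Mean phylogenetic distance over all pairs of taxa in a community."""
    return mean(dist[a][b] for a, b in combinations(sorted(taxa), 2))


def net_relatedness_index(community, pool, dist, n_null=999, seed=0):
    """NRI = -(MPD_obs - mean(MPD_null)) / sd(MPD_null).

    Null communities are random draws of the same richness from `pool`
    (an assumed null model, chosen here for simplicity).
    """
    rng = random.Random(seed)
    observed = mean_pairwise_distance(community, dist)
    null = [
        mean_pairwise_distance(rng.sample(pool, len(community)), dist)
        for _ in range(n_null)
    ]
    spread = stdev(null)
    if spread == 0:
        raise ValueError("null distribution has zero variance")
    return -(observed - mean(null)) / spread


def toy_distances():
    """Two clades of three species: distance 2 within a clade, 10 between."""
    pool = ["A1", "A2", "A3", "B1", "B2", "B3"]
    dist = {
        a: {b: 0 if a == b else (2 if a[0] == b[0] else 10) for b in pool}
        for a in pool
    }
    return pool, dist


pool, dist = toy_distances()
nri_clustered = net_relatedness_index(["A1", "A2", "A3"], pool, dist)
nri_spread = net_relatedness_index(["A1", "B1"], pool, dist)
```

    With this toy matrix the single-clade community yields a positive NRI and the community spanning both clades a negative one, matching the usual reading of the index.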

  5. Phylogenetic Structure of Tree Species across Different Life Stages from Seedlings to Canopy Trees in a Subtropical Evergreen Broad-Leaved Forest

    PubMed Central

    Jin, Yi; Qian, Hong; Yu, Mingjian

    2015-01-01

    Investigating patterns of phylogenetic structure across different life stages of tree species in forests is crucial to understanding forest community assembly, and investigating forest gap influence on the phylogenetic structure of forest regeneration is necessary for understanding forest community assembly. Here, we examine the phylogenetic structure of tree species across life stages from seedlings to canopy trees, as well as forest gap influence on the phylogenetic structure of forest regeneration in a forest of the subtropical region in China. We investigate changes in phylogenetic relatedness (measured as NRI) of tree species from seedlings, saplings, treelets to canopy trees; we compare the phylogenetic turnover (measured as βNRI) between canopy trees and seedlings in forest understory with that between canopy trees and seedlings in forest gaps. We found that phylogenetic relatedness generally increases from seedlings through saplings and treelets up to canopy trees, and that phylogenetic relatedness does not differ between seedlings in forest understory and those in forest gaps, but phylogenetic turnover between canopy trees and seedlings in forest understory is lower than that between canopy trees and seedlings in forest gaps. We conclude that tree species tend to be more closely related from seedling to canopy layers, and that forest gaps alter the seedling phylogenetic turnover of the studied forest. It is likely that the increasing trend of phylogenetic clustering as tree stem size increases observed in this subtropical forest is primarily driven by abiotic filtering processes, which select a set of closely related evergreen broad-leaved tree species whose regeneration has adapted to the closed canopy environments of the subtropical forest developed under the regional monsoon climate. PMID:26098916

  6. The power and pitfalls of HIV phylogenetics in public health.

    PubMed

    Brooks, James I; Sandstrom, Paul A

    2013-07-25

    Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.

  7. Molecular characterization of measles viruses in Turkey (2010-2011): first report of genotype D9 involved in an outbreak in 2011.

    PubMed

    Kalaycioglu, Atila T; Baykal, Atakan; Guldemir, Dilek; Bakkaloglu, Zekiye; Korukluoglu, Gulay; Coskun, Aslihan; Torunoglu, Mehmet Ali; Ertek, Mustafa; Durmaz, Riza

    2013-12-01

    Genetic characterization of measles viruses (MVs) combined with acquisition of epidemiologic information is essential for measles surveillance programs used in determining transmission pathways. This study describes the molecular characterization of 26 MV strains (3 from 2010, 23 from 2011) obtained from urine or throat swabs harvested from patients in Turkey. MV RNA samples (n = 26) were subjected to sequence analysis of 450 nucleotides comprising the most variable C-terminal region of the nucleoprotein (N) gene. Phylogenetic analysis revealed 20 strains from 2011 belonged to genotype D9, 3 to D4, 2 strains from 2010 to genotype D4 and 1 to genotype B3. This study represents the first report describing the involvement of MV genotype D9 in an outbreak in Turkey. The sequence of the majority of genotype D9 strains was identical to those identified in Russia, Malaysia, Japan, and the UK. Despite lack of sufficient epidemiologic information, the presence of variants observed following phylogenetic analysis suggested that exposure to genotype D9 might have occurred due to importation more than once. Phylogenetic analysis of five genotype D4 strains revealed the presence of four variants. Epidemiological information and phylogenetic analysis suggested that three genotype D4 strains and one genotype B3 strain were associated with importation. This study suggests the presence of pockets of unimmunized individuals making Turkey susceptible to outbreaks. Continuing molecular surveillance of measles strains in Turkey is essential as a means of acquiring epidemiologic information to define viral transmission patterns and determine the effectiveness of measles vaccination programs designed to eliminate this virus. © 2013 Wiley Periodicals, Inc.

  8. Phylogenetic search through partial tree mixing

    PubMed Central

    2012-01-01

    Background Recent advances in sequencing technology have created large data sets upon which phylogenetic inference can be performed. Current research is limited by the prohibitive time necessary to perform tree search on a reasonable number of individuals. This research develops new phylogenetic algorithms that can operate on tens of thousands of species in a reasonable amount of time through several innovative search techniques. Results When compared to popular phylogenetic search algorithms, better trees are found much more quickly for large data sets. These algorithms are incorporated in the PSODA application available at http://dna.cs.byu.edu/psoda Conclusions The use of Partial Tree Mixing in a partition based tree space allows the algorithm to quickly converge on near optimal tree regions. These regions can then be searched in a methodical way to determine the overall optimal phylogenetic solution. PMID:23320449

  9. Distance-Based Phylogenetic Methods Around a Polytomy.

    PubMed

    Davidson, Ruth; Sullivant, Seth

    2014-01-01

    Distance-based phylogenetic algorithms attempt to solve the NP-hard least-squares phylogeny problem by mapping an arbitrary dissimilarity map representing biological data to a tree metric. The set of all dissimilarity maps is a Euclidean space properly containing the space of all tree metrics as a polyhedral fan. Outputs of distance-based tree reconstruction algorithms such as UPGMA and neighbor-joining are points in the maximal cones in the fan. Tree metrics with polytomies lie at the intersections of maximal cones. A phylogenetic algorithm divides the space of all dissimilarity maps into regions based upon which combinatorial tree is reconstructed by the algorithm. Comparison of phylogenetic methods can be done by comparing the geometry of these regions. We use polyhedral geometry to compare the local nature of the subdivisions induced by least-squares phylogeny, UPGMA, and neighbor-joining when the true tree has a single polytomy with exactly four neighbors. Our results suggest that in some circumstances, UPGMA and neighbor-joining poorly match least-squares phylogeny.
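
    As background for the comparison above, UPGMA can be sketched in a few lines: repeatedly merge the two closest clusters and set the distance from the merged cluster to every other cluster to the size-weighted average of the two old distances. The code below is a textbook illustration only, not the polyhedral analysis of the paper; the input format (a dict keyed by frozenset pairs), the internal cluster labels and the toy distances are assumptions made for the example.

```python
from itertools import combinations


def upgma(names, dist):
    """Build a UPGMA topology as nested tuples.

    names: list of leaf labels (assumed not to start with '#').
    dist:  dict mapping frozenset({a, b}) to the dissimilarity of a and b.
    """
    clusters = {name: (name, 1) for name in names}  # label -> (subtree, size)
    d = {frozenset(p): dist[frozenset(p)] for p in combinations(names, 2)}
    next_id = 0
    while len(clusters) > 1:
        # Pick the closest pair of current clusters (first one wins ties).
        a, b = min(combinations(clusters, 2), key=lambda p: d[frozenset(p)])
        tree_a, size_a = clusters.pop(a)
        tree_b, size_b = clusters.pop(b)
        merged = f"#{next_id}"
        next_id += 1
        # Size-weighted average distance from the merged cluster to the rest.
        for c in clusters:
            d[frozenset((merged, c))] = (
                size_a * d[frozenset((a, c))] + size_b * d[frozenset((b, c))]
            ) / (size_a + size_b)
        clusters[merged] = ((tree_a, tree_b), size_a + size_b)
    (tree, _size), = clusters.values()
    return tree


# Toy dissimilarity map, invented for illustration.
toy = {
    frozenset(("A", "B")): 2.0,
    frozenset(("C", "D")): 4.0,
    frozenset(("A", "C")): 10.0,
    frozenset(("A", "D")): 10.0,
    frozenset(("B", "C")): 10.0,
    frozenset(("B", "D")): 10.0,
}
topology = upgma(["A", "B", "C", "D"], toy)
```

    Near a polytomy, small perturbations of the input distances change which pair is merged first, which is the kind of region boundary the abstract describes.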

  10. Measuring the distance between multiple sequence alignments.

    PubMed

    Blackburne, Benjamin P; Whelan, Simon

    2012-02-15

    Multiple sequence alignment (MSA) is a core method in bioinformatics. The accuracy of such alignments may influence the success of downstream analyses such as phylogenetic inference, protein structure prediction, and functional prediction. The importance of MSA has led to the proliferation of MSA methods, with different objective functions and heuristics to search for the optimal MSA. Different methods of inferring MSAs produce different results in all but the most trivial cases. By measuring the differences between inferred alignments, we may be able to develop an understanding of how these differences (i) relate to the objective functions and heuristics used in MSA methods, and (ii) affect downstream analyses. We introduce four metrics to compare MSAs, which include the position in a sequence where a gap occurs or the location on a phylogenetic tree where an insertion or deletion (indel) event occurs. We use both real and synthetic data to explore the information given by these metrics and demonstrate how the different metrics in combination can yield more information about MSA methods and the differences between them. MetAl is a free software implementation of these metrics in Haskell. Source and binaries for Windows, Linux and Mac OS X are available from http://kumiho.smith.man.ac.uk/whelan/software/metal/.

  11. Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes.

    PubMed

    Ye, Fei; Lan, Xu-E; Zhu, Wen-Bo; You, Ping

    2016-05-09

    Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects.

  12. Mitochondrial genomes of praying mantises (Dictyoptera, Mantodea): rearrangement, duplication, and reassignment of tRNA genes

    PubMed Central

    Ye, Fei; Lan, Xu-e; Zhu, Wen-bo; You, Ping

    2016-01-01

    Insect mitochondrial genomes (mitogenomes) contain a conserved set of 37 genes for an extensive diversity of lineages. Previously reported dictyopteran mitogenomes share this conserved mitochondrial gene arrangement, although surprisingly little is known about the mitogenome of Mantodea. We sequenced eight mantodean mitogenomes including the first representatives of two families: Hymenopodidae and Liturgusidae. Only two of these genomes retain the typical insect gene arrangement. In three Liturgusidae species, the trnM genes have translocated. Four species of mantis (Creobroter gemmata, Mantis religiosa, Statilia sp., and Theopompa sp.-HN) have multiple identical tandem duplication of trnR, and Statilia sp. additionally includes five extra duplicate trnW. These extra trnR and trnW in Statilia sp. are erratically arranged and form another novel gene order. Interestingly, the extra trnW is converted from trnR by the process of point mutation at anticodon, which is the first case of tRNA reassignment for an insect. Furthermore, no significant differences were observed amongst mantodean mitogenomes with variable copies of tRNA according to comparative analysis of codon usage. Combined with phylogenetic analysis, the characteristics of tRNA only possess limited phylogenetic information in this research. Nevertheless, these features of gene rearrangement, duplication, and reassignment provide valuable information toward understanding mitogenome evolution in insects. PMID:27157299

  13. Molecular identification and phylogenetic analysis of important medicinal plant species in genus Paeonia based on rDNA-ITS, matK, and rbcL DNA barcode sequences.

    PubMed

    Kim, W J; Ji, Y; Choi, G; Kang, Y M; Yang, S; Moon, B C

    2016-08-05

    This study was performed to identify and analyze the phylogenetic relationship among four herbaceous species of the genus Paeonia, P. lactiflora, P. japonica, P. veitchii, and P. suffruticosa, using DNA barcodes. These four species, which are commonly used in traditional medicine as Paeoniae Radix and Moutan Radicis Cortex, are pharmaceutically defined in different ways in the national pharmacopoeias in Korea, Japan, and China. To authenticate the different species used in these medicines, we evaluated rDNA-internal transcribed spacers (ITS), matK and rbcL regions, which provide information capable of effectively distinguishing each species from one another. Seventeen samples were collected from different geographic regions in Korea and China, and DNA barcode regions were amplified using universal primers. Comparative analyses of these DNA barcode sequences revealed species-specific nucleotide sequences capable of discriminating the four Paeonia species. Among the entire sequences of three barcodes, marker nucleotides were identified at three positions in P. lactiflora, eleven in P. japonica, five in P. veitchii, and 25 in P. suffruticosa. Phylogenetic analyses also revealed four distinct clusters showing homogeneous clades with high resolution at the species level. The results demonstrate that the analysis of these three DNA barcode sequences is a reliable method for identifying the four Paeonia species and can be used to authenticate Paeoniae Radix and Moutan Radicis Cortex at the species level. Furthermore, based on the assessment of amplicon sizes, inter/intra-specific distances, marker nucleotides, and phylogenetic analysis, rDNA-ITS was the most suitable DNA barcode for identification of these species.

  14. A multi-locus analysis of phylogenetic relationships within grass subfamily Pooideae (Poaceae) inferred from sequences of nuclear single copy gene regions compared with plastid DNA.

    PubMed

    Hochbach, Anne; Schneider, Julia; Röser, Martin

    2015-06-01

    To investigate phylogenetic relationships within the grass subfamily Pooideae we studied about 50 taxa covering all recognized tribes, using one plastid DNA (cpDNA) marker (matK gene-3'trnK exon) and for the first time four nuclear single copy gene loci. DNA sequence information from two parts of the nuclear genes topoisomerase 6 (Topo6) spanning the exons 8-13 and 17-19, the exons 9-13 encoding plastid acetyl-CoA-carboxylase (Acc1) and the partial exon 1 of phytochrome B (PhyB) were generated. Individual and nuclear combined data were evaluated using maximum parsimony, maximum likelihood and Bayesian methods. All of the phylogenetic results show Brachyelytrum and the tribe Nardeae as earliest diverging lineages within the subfamily. The 'core' Pooideae (Hordeeae and the Aveneae/Poeae tribe complex) are also strongly supported, as well as the monophyly of the tribes Brachypodieae, Meliceae and Stipeae (except PhyB). The beak grass tribe Diarrheneae and the tribe Duthieeae are not monophyletic in some of the analyses. However, the combined nuclear DNA (nDNA) tree yields the highest resolution and the best delimitation of the tribes, and provides the following evolutionary hypothesis for the tribes: Brachyelytrum, Nardeae, Duthieeae, Meliceae, Stipeae, Diarrheneae, Brachypodieae and the 'core' Pooideae. Within the individual datasets, the phylogenetic trees obtained from Topo6 exon 8-13 show the most interesting results. The divergent positions of some clone sequences of Ampelodesmos mauritanicus and Trikeraia pappiformis, for instance, may indicate a hybrid origin of these stipoid taxa. Copyright © 2015 Elsevier Inc. All rights reserved.

  15. Ontogenetic development of intestinal length and relationships to diet in an Australasian fish family (Terapontidae)

    PubMed Central

    2013-01-01

    Background One of the most widely accepted ecomorphological relationships in vertebrates is the negative correlation between intestinal length and proportion of animal prey in diet. While many fish groups exhibit this general pattern, other clades demonstrate minimal, and in some cases contrasting, associations between diet and intestinal length. Moreover, this relationship and its evolutionary derivation have received little attention from a phylogenetic perspective. This study documents the phylogenetic development of intestinal length variability, and resultant correlation with dietary habits, within a molecular phylogeny of 28 species of terapontid fishes. The Terapontidae (grunters), an ancestrally euryhaline-marine group, is the most trophically diverse of Australia’s freshwater fish families, with widespread shifts away from animal-prey-dominated diets occurring since their invasion of fresh waters. Results Description of ontogenetic development of intestinal complexity of terapontid fishes, in combination with ancestral character state reconstruction, demonstrated that complex intestinal looping (convolution) has evolved independently on multiple occasions within the family. This modification of ontogenetic development drives much of the associated interspecific variability in intestinal length evident in terapontids. Phylogenetically informed comparative analyses (phylogenetic independent contrasts) showed that the interspecific differences in intestinal length resulting from these ontogenetic developmental mechanisms explained ~65% of the variability in the proportion of animal material in terapontid diets. 
Conclusions The ontogenetic development of intestinal complexity appears to represent an important functional innovation underlying the extensive trophic differentiation seen in Australia’s freshwater terapontids, specifically facilitating the pronounced shifts away from carnivorous (including invertebrates and vertebrates) diets evident across the family. The capacity to modify intestinal morphology and physiology may also be an important facilitator of trophic diversification during other phyletic radiations. PMID:23441994
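
    The phylogenetic independent contrasts mentioned in the Results can be illustrated with a short sketch of Felsenstein's algorithm: at each internal node, take the difference between the two descendant values, standardize it by the square root of the summed branch lengths, then pass a weighted average up the tree along a lengthened branch. The contrasts of two traits are then related by a regression through the origin. This is a generic illustration, not the study's analysis; the tree encoding, the function names and the trait values are all hypothetical.

```python
from math import sqrt


def _contrasts(node, trait, out):
    """Post-order pass; returns (node value, adjusted branch length).

    A node is (label, length) for a tip or ((left, right), length) for an
    internal node. Tip branch lengths must be positive.
    """
    label, length = node
    if isinstance(label, str):
        return trait[label], length
    left, right = label
    x1, v1 = _contrasts(left, trait, out)
    x2, v2 = _contrasts(right, trait, out)
    out.append((x1 - x2) / sqrt(v1 + v2))  # standardized contrast
    value = (x1 / v1 + x2 / v2) / (1 / v1 + 1 / v2)  # weighted ancestral value
    return value, length + v1 * v2 / (v1 + v2)  # lengthen branch for uncertainty


def independent_contrasts(tree, trait):
    """Return the list of standardized contrasts for one trait."""
    out = []
    _contrasts(tree, trait, out)
    return out


def regression_through_origin(cx, cy):
    """Slope and r-squared of cy on cx with the intercept fixed at zero."""
    sxx = sum(x * x for x in cx)
    syy = sum(y * y for y in cy)
    sxy = sum(x * y for x, y in zip(cx, cy))
    return sxy / sxx, sxy ** 2 / (sxx * syy)


# Toy four-species tree ((A:1,B:1):1,(C:1,D:1):1) with invented trait values.
toy_tree = (
    (
        ((("A", 1.0), ("B", 1.0)), 1.0),
        ((("C", 1.0), ("D", 1.0)), 1.0),
    ),
    0.0,
)
gut_length = {"A": 1.0, "B": 3.0, "C": 5.0, "D": 9.0}
plant_diet = {k: 2.0 * v + 1.0 for k, v in gut_length.items()}

cx = independent_contrasts(toy_tree, gut_length)
cy = independent_contrasts(toy_tree, plant_diet)
slope, r_squared = regression_through_origin(cx, cy)
```

    A tree with n tips yields n - 1 contrasts per trait, and the r-squared from the origin-constrained regression is the kind of "variability explained" figure reported in the abstract.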

  16. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree

    PubMed Central

    2010-01-01

    Background Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. Results This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Conclusions Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service. PMID:21034504

  17. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.

  18. High-resolution phylogenetic microbial community profiling

    DOE PAGES

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; ...

    2016-02-09

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.

  19. Phylogenetic analysis of Common Garter Snake (Thamnophis sirtalis) stomach contents detects cryptic range of a secretive salamander (Ensatina eschscholtzii oregonensis). Herpetological Conservation and Biology 5(3):395–402

    Treesearch

    Sean B. Reilly; Andrew D Gottsho; Justin M. Garwood; Bryan Jennings

    2010-01-01

    Given the current global amphibian decline, it is crucial to obtain accurate and current information regarding species distributions. Secretive amphibians such as plethodontid salamanders can be difficult to detect in many cases, especially in remote, high elevation areas. We used molecular phylogenetic analyses to identify three partially digested salamanders palped...

  20. A curated database of cyanobacterial strains relevant for modern taxonomy and phylogenetic studies.

    PubMed

    Ramos, Vitor; Morais, João; Vasconcelos, Vitor M

    2017-04-25

    The dataset herein described lays the groundwork for an online database of relevant cyanobacterial strains, named CyanoType (http://lege.ciimar.up.pt/cyanotype). It is a database that includes categorized cyanobacterial strains useful for taxonomic, phylogenetic or genomic purposes, with associated information obtained by means of a literature-based curation. The dataset lists 371 strains and represents the first version of the database (CyanoType v.1). Information for each strain includes strain synonymy and/or co-identity, strain categorization, habitat, accession numbers for molecular data, taxonomy and nomenclature notes according to three different classification schemes, hierarchical automatic classification, phylogenetic placement according to a selection of relevant studies (including this), and important bibliographic references. The database will be updated periodically, namely by adding new strains meeting the criteria for inclusion and by revising and adding up-to-date metadata for strains already listed. A global 16S rDNA-based phylogeny is provided in order to assist users when choosing the appropriate strains for their studies.

  1. A curated database of cyanobacterial strains relevant for modern taxonomy and phylogenetic studies

    PubMed Central

    Ramos, Vitor; Morais, João; Vasconcelos, Vitor M.

    2017-01-01

    The dataset herein described lays the groundwork for an online database of relevant cyanobacterial strains, named CyanoType (http://lege.ciimar.up.pt/cyanotype). It is a database that includes categorized cyanobacterial strains useful for taxonomic, phylogenetic or genomic purposes, with associated information obtained by means of a literature-based curation. The dataset lists 371 strains and represents the first version of the database (CyanoType v.1). Information for each strain includes strain synonymy and/or co-identity, strain categorization, habitat, accession numbers for molecular data, taxonomy and nomenclature notes according to three different classification schemes, hierarchical automatic classification, phylogenetic placement according to a selection of relevant studies (including this), and important bibliographic references. The database will be updated periodically, namely by adding new strains meeting the criteria for inclusion and by revising and adding up-to-date metadata for strains already listed. A global 16S rDNA-based phylogeny is provided in order to assist users when choosing the appropriate strains for their studies. PMID:28440791

  2. Evolutionary characterization of the West Nile Virus complete genome.

    PubMed

    Gray, R R; Veras, N M C; Santos, L A; Salemi, M

    2010-07-01

    The spatial dynamics of the West Nile Virus epidemic in North America are largely unknown. Previous studies that investigated the evolutionary history of the virus used sequence data from the structural genes (prM and E); however, these regions may lack phylogenetic information and obscure true evolutionary relationships. This study systematically evaluated the evolutionary patterns in the eleven genes of the WNV genome in order to determine which region(s) were most phylogenetically informative. We found that while the E region lacks resolution and can potentially result in misleading conclusions, the full NS3 or NS5 regions have strong phylogenetic signal. Furthermore, we show that geographic structure of WNV infection within the US is more pronounced than previously reported in studies that used the structural genes. We conclude that future evolutionary studies should focus on NS3 and NS5 in order to maximize the available sequences while retaining maximal interpretative power to infer temporal and geographic trends among WNV strains. Copyright 2010 Elsevier Inc. All rights reserved.

  3. Evolutionary anatomy of the Neandertal ulna and radius in the light of the new El Sidrón sample.

    PubMed

    Pérez-Criado, Laura; Rosas, Antonio

    2017-05-01

    This paper aims to improve our understanding of the phylogenetic trait polarity related to hominin forearm evolution, in particular those traits traditionally defined as "Neandertal features." To this aim, twelve adult and adolescent fragmented forelimb elements (including ulnae and radii) of Homo neanderthalensis recovered from the site of El Sidrón (Asturias, Spain) were examined comparatively using three-dimensional geometric and traditional morphometrics. Mean centroid size and shape comparisons, principal components analysis, and phylogenetic signal analysis were undertaken. Our investigations revealed that the proximal region of the ulna discriminated best between Neandertals and modern humans, with fewer taxonomically-informative features in the distal ulna and radius. Compared to modern humans, the divergent features in the Neandertal ulna are an increase in olecranon breadth (a derived trait), lower coronoid length (primitive), and anterior orientation of the trochlear notch (primitive). In the Neandertal radius, we observe a larger neck length (primitive), medial orientation of the radial tubercle (secondarily primitive), and a curved diaphysis (secondarily primitive). Anatomically, we identified three units of evolutionary change: 1) the olecranon and its fossa, 2) the coronoid-radius neck complex, and 3) the tubercle and radial diaphysis. Based on our data, forearm evolution followed a mosaic pattern in which some features were inherited from a pre-Homo ancestor, others originated in some post-ergaster and pre-antecessor populations, and other characters emerged in the specific Homo sapiens and H. neanderthalensis lineages, sometimes appearing as secondarily primitive. Future investigations might consider the diverse phylogenetic origin of apomorphies while at the same time seeking to elucidate their functional meaning. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Structural phylogeny by profile extraction and multiple superimposition using electrostatic congruence as a discriminator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chakraborty, Sandeep; Rao, Basuthkar J.; Baker, Nathan A.

    2013-04-01

    Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evolutionary relationship in these proteins which occasionally remains undetected due to considerable sequence divergence. Structural alignment programs have been developed to unravel such fuzzy relationships. However, none of these structure based methods have used electrostatic properties to discriminate between spatially equivalent residues. We present a methodology for MSA of a set of related proteins with known structures using electrostatic properties as an additional discriminator (STEEP). STEEP first extracts a profile, then generates a multiple structural superimposition providing a consolidated spatial framework for comparing residues and finally emits the MSA. Residues that are aligned differently by including or excluding electrostatic properties can be targeted by directed evolution experiments to transform the enzymatic properties of one protein into another. We have compared STEEP results to those obtained from an MSA program (ClustalW) and a structural alignment method (MUSTANG) for chymotrypsin serine proteases. Subsequently, we used PhyML to generate phylogenetic trees for the serine and metallo-β-lactamase superfamilies from the STEEP generated MSA, and corroborated the accepted relationships in these superfamilies. We have observed that STEEP acts as a functional classifier when electrostatic congruence is used as a discriminator, and thus identifies potential targets for directed evolution experiments. In summary, STEEP is unique among phylogenetic methods for its ability to use electrostatic congruence to specify mutations that might be the source of the functional divergence in a protein family. Based on our results, we also hypothesize that the active site and its close vicinity contains enough information to infer the correct phylogeny for related proteins.

  5. Studying the evolutionary significance of thermal adaptation in ectotherms: The diversification of amphibians' energetics.

    PubMed

    Nespolo, Roberto F; Figueroa, Julio; Solano-Iguaran, Jaiber J

    2017-08-01

    A fundamental problem in evolutionary biology is the understanding of the factors that promote or constrain adaptive evolution, and assessing the role of natural selection in this process. Here, comparative phylogenetics, that is, using phylogenetic information and traits to infer evolutionary processes, has been a major paradigm. In this study, we discuss Ornstein-Uhlenbeck models (OU) in the context of thermal adaptation in ectotherms. We specifically applied this approach to study amphibians' evolution and energy metabolism. It has been hypothesized that amphibians exploit adaptive zones characterized by low energy expenditure, which generate specific predictions in terms of the patterns of diversification in standard metabolic rate (SMR). We compiled whole-animal metabolic rates for 122 species of amphibians, and fitted several models of diversification. According to the adaptive zone hypothesis, we expected: (1) to find "accelerated evolution" in SMR (i.e., diversification above Brownian Motion expectations, BM), (2) that a model assuming evolutionary optima (i.e., an OU model) fits better than a white-noise model and (3) that a model assuming multiple optima (according to the three amphibian orders) fits better than a model assuming a single optimum. As predicted, we found that the diversification of SMR occurred, most of the time, above BM expectations. Also, we found that a model assuming an optimum explained the data better than a white-noise model. However, we did not find evidence that an OU model with multiple optima fits the data better, suggesting a single optimum in SMR for Anura, Caudata and Gymnophiona. These results show how comparative phylogenetics could be applied for testing adaptive hypotheses regarding history and physiological performance in ectotherms. Copyright © 2016 Elsevier Ltd. All rights reserved.

  6. CtGEM typing: Discrimination of Chlamydia trachomatis ocular and urogenital strains and major evolutionary lineages by high resolution melting analysis of two amplified DNA fragments.

    PubMed

    Giffard, Philip M; Andersson, Patiyan; Wilson, Judith; Buckley, Cameron; Lilliebridge, Rachael; Harris, Tegan M; Kleinecke, Mariana; O'Grady, Kerry-Ann F; Huston, Wilhelmina M; Lambert, Stephen B; Whiley, David M; Holt, Deborah C

    2018-01-01

    Chlamydia trachomatis infects the urogenital tract (UGT) and eyes. Anatomical tropism is correlated with variation in the major outer membrane protein encoded by ompA. Strains possessing the ocular ompA variants A, B, Ba and C are typically found within the phylogenetically coherent "classical ocular lineage". However, variants B, Ba and C have also been found within three distinct strains in Australia, all associated with ocular disease in children and outside the classical ocular lineage. CtGEM genotyping is a method for detecting and discriminating ocular strains and also the major phylogenetic lineages. The rationale was facilitation of surveillance to inform responses to C. trachomatis detection in UGT specimens from young children. CtGEM typing is based on high resolution melting analysis (HRMA) of two PCR amplified fragments with high combinatorial resolving power, as defined by computerised comparison of 65 whole genomes. One fragment is from the hypothetical gene defined by Jali-1891 in the C. trachomatis B_Jali20 genome, while the other is from ompA. Twenty combinatorial CtGEM types have been shown to exist, and these encompass unique genotypes for all known ocular strains, and also delineate the T1 and T2 major phylogenetic lineages, identify LGV strains and provide additional resolution beyond this. CtGEM typing and Sanger sequencing were compared with 42 C. trachomatis positive clinical specimens, and there were no disjunctions. CtGEM typing is a highly efficient method designed and tested using large scale comparative genomics. It divides C. trachomatis into clinically and biologically meaningful groups, and may have broad application in surveillance.

  7. Identification of characteristic oligonucleotides in the bacterial 16S ribosomal RNA sequence dataset

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; Willson, Richard C.; Fox, George E.

    2002-01-01

    MOTIVATION: The phylogenetic structure of the bacterial world has been intensively studied by comparing sequences of 16S ribosomal RNA (16S rRNA). This database of sequences is now widely used to design probes for the detection of specific bacteria or groups of bacteria one at a time. The success of such methods reflects the fact that there are local sequence segments that are highly characteristic of particular organisms or groups of organisms. It is not clear, however, the extent to which such signature sequences exist in the 16S rRNA dataset. A better understanding of the numbers and distribution of highly informative oligonucleotide sequences may facilitate the design of hybridization arrays that can characterize the phylogenetic position of an unknown organism or serve as the basis for the development of novel approaches for use in bacterial identification. RESULTS: A computer-based algorithm that characterizes the extent to which any individual oligonucleotide sequence in 16S rRNA is characteristic of any particular bacterial grouping was developed. A measure of signature quality, Q(s), was formulated and subsequently calculated for every individual oligonucleotide sequence in the size range of 5-11 nucleotides and for 15mers with reference to each cluster and subcluster in a 929 organism representative phylogenetic tree. Subsequently, the perfect signature sequences were compared to the full set of 7322 sequences to see how common false positives were. The work completed here establishes beyond any doubt that highly characteristic oligonucleotides exist in the bacterial 16S rRNA sequence dataset in large numbers. Over 16,000 15mers were identified that might be useful as signatures. Signature oligonucleotides are available for over 80% of the nodes in the representative tree.
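
    The idea of a "perfect" signature oligonucleotide described above can be illustrated with a minimal sketch. It assumes a perfect signature is a k-mer present in every sequence of a target group and in no sequence outside it; the abstract does not give the formula for Q(s), so the graded quality measure is not reproduced here. The function names and the toy sequences are hypothetical.

```python
from typing import Dict, List, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def perfect_signatures(groups: Dict[str, List[str]], target: str, k: int) -> Set[str]:
    """k-mers found in every sequence of `target` and in no sequence outside it.

    This is an assumed, all-or-nothing definition for illustration only;
    it is not the Q(s) measure from the abstract.
    """
    shared = set.intersection(*(kmers(s, k) for s in groups[target]))
    outside: Set[str] = set()
    for name, seqs in groups.items():
        if name != target:
            for s in seqs:
                outside |= kmers(s, k)
    return shared - outside


# Toy sequences invented for illustration (not real 16S rRNA data).
toy_groups = {
    "cluster_A": ["ACGUACGGA", "UACGGAUUC"],
    "cluster_B": ["GGAUUCCAA", "CCAAGGAUU"],
}
signatures_a = perfect_signatures(toy_groups, "cluster_A", k=5)
signatures_b = perfect_signatures(toy_groups, "cluster_B", k=5)
print(sorted(signatures_a), sorted(signatures_b))
```

    In this toy input, the only 5-mer shared by both cluster_B sequences also occurs in cluster_A, so cluster_B has no perfect signature; this is the kind of false positive the abstract checks for against the full sequence set.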

  8. Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare) WRKY transcription factor family reveals putatively retained functions between monocots and dicots

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.

    2008-02-01

    WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 to 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities between orthologs with Arabidopsis, but also relatedness in their expression patterns. This correlative expression is indicative for a putative conserved function of related WRKY proteins in mono- and dicot species.

  9. Partial gene sequences for the A subunit of methyl-coenzyme M reductase (mcrI) as a phylogenetic tool for the family Methanosarcinaceae

    NASA Technical Reports Server (NTRS)

    Springer, E.; Sachs, M. S.; Woese, C. R.; Boone, D. R.

    1995-01-01

    Representatives of the family Methanosarcinaceae were analyzed phylogenetically by comparing partial sequences of their methyl-coenzyme M reductase (mcrI) genes. A 490-bp fragment from the A subunit of the gene was selected, amplified by the PCR, cloned, and sequenced for each of 25 strains belonging to the Methanosarcinaceae. The sequences obtained were aligned with the corresponding portions of five previously published sequences, and all of the sequences were compared to determine phylogenetic distances by Fitch distance matrix methods. We prepared analogous trees based on 16S rRNA sequences; these trees corresponded closely to the mcrI trees, although the mcrI sequences of pairs of organisms had 3.01 +/- 0.541 times more changes than the respective pairs of 16S rRNA sequences, suggesting that the mcrI fragment evolved about three times more rapidly than the 16S rRNA gene. The qualitative similarity of the mcrI and 16S rRNA trees suggests that transfer of genetic information between dissimilar organisms has not significantly affected these sequences, although we found inconsistencies between some mcrI distances that we measured and previously published DNA reassociation data. It is unlikely that multiple mcrI isogenes were present in the organisms that we examined, because we found no major discrepancies in multiple determinations of mcrI sequences from the same organism. Our primers for the PCR also match analogous sites in the previously published mcrII sequences, but all of the sequences that we obtained from members of the Methanosarcinaceae were more closely related to mcrI sequences than to mcrII sequences, suggesting that members of the Methanosarcinaceae do not have distinct mcrII genes.

  10. Physiological, behavioral and biochemical adaptations of intertidal fishes to hypoxia.

    PubMed

    Richards, Jeffrey G

    2011-01-15

    Hypoxia survival in fish requires a well-coordinated response to either secure more O(2) from the hypoxic environment or to limit the metabolic consequences of an O(2) restriction at the mitochondria. Although there is a considerable amount of information available on the physiological, behavioral, biochemical and molecular responses of fish to hypoxia, very little research has attempted to determine the adaptive value of these responses. This article will review current attempts to use the phylogenetically corrected comparative method to define physiological and behavioral adaptations to hypoxia in intertidal fish and further identify putatively adaptive biochemical traits that should be investigated in the future. In a group of marine fishes known as sculpins, from the family Cottidae, variation in hypoxia tolerance, measured as a critical O(2) tension (P(crit)), is primarily explained by variation in mass-specific gill surface area, red blood cell hemoglobin-O(2) binding affinity, and to a lesser extent variation in routine O(2) consumption rate (M(O(2))). The most hypoxia-tolerant sculpins consistently show aquatic surface respiration (ASR) and aerial emergence behavior during hypoxia exposure, but no phylogenetically independent relationship has been found between the thresholds for initiating these behaviors and P(crit). At O(2) levels below P(crit), hypoxia survival requires a rapid reorganization of cellular metabolism to suppress ATP consumption to match the limited capacity for O(2)-independent ATP production. Thus, it is reasonable to speculate that the degree of metabolic rate suppression and the quantity of stored fermentable fuel is strongly selected for in hypoxia-tolerant fishes; however, these assertions have not been tested in a phylogenetic comparative model.

  11. Phylogenetic and microsatellite markers for Tulasnella (Tulasnellaceae) mycorrhizal fungi associated with Australian orchids.

    PubMed

    Ruibal, Monica P; Peakall, Rod; Smith, Leon M; Linde, Celeste C

    2013-03-01

    Phylogenetic and microsatellite markers were developed for Tulasnella mycorrhizal fungi to investigate fungal species identity and diversity. These markers will be useful in future studies investigating the phylogenetic relationship of the fungal symbionts, specificity of orchid-mycorrhizal associations, and the role of mycorrhizae in orchid speciation within several orchid genera. We generated partial genome sequences of two Tulasnella symbionts originating from Chiloglottis and Drakaea orchid species with 454 genome sequencing. Cross-genus transferability across mycorrhizal symbionts associated with multiple genera of Australian orchids (Arthrochilus, Chiloglottis, Drakaea, and Paracaleana) was found for seven phylogenetic loci. Five loci showed cross-transferability to Tulasnella from other orchid genera, and two to Sebacina. Furthermore, 11 polymorphic microsatellite loci were developed for Tulasnella from Chiloglottis. Highly informative markers were obtained, allowing investigation of mycorrhizal diversity of Tulasnellaceae associated with a wide variety of terrestrial orchids in Australia and potentially worldwide.

  12. IcyTree: rapid browser-based visualization for phylogenetic trees and networks

    PubMed Central

    2017-01-01

    Summary: IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. Availability and Implementation: IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree. Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. Contact: tgvaughan@gmail.com. PMID:28407035

  13. Repeated evolution of camouflage in speciose desert rodents.

    PubMed

    Boratyński, Zbyszek; Brito, José C; Campos, João C; Cunha, José L; Granjon, Laurent; Mappes, Tapio; Ndiaye, Arame; Rzebik-Kowalska, Barbara; Serén, Nina

    2017-06-14

    There are two main factors explaining variation among species and the evolution of characters along phylogeny: adaptive change, including phenotypic and genetic responses to selective pressures, and phylogenetic inertia, or the resemblance between species due to shared phylogenetic history. Phenotype-habitat colour match, a classic Darwinian example of the evolution of camouflage (crypsis), offers the opportunity to test the importance of historical versus ecological mechanisms in shaping phenotypes among phylogenetically closely related taxa. To assess it, we investigated fur (phenotypic data) and habitat (remote sensing data) colourations, along with phylogenetic information, in the species-rich Gerbillus genus. Overall, we found a strong phenotype-habitat match, once the phylogenetic signal is taken into account. We found that camouflage has been acquired and lost repeatedly in the course of the evolutionary history of Gerbillus. Our results suggest that fur colouration and its covariation with habitat is a relatively labile character in mammals, potentially responding quickly to selection. Relatively unconstrained and substantial genetic basis, as well as structural and functional independence from other fitness traits of mammalian colouration might be responsible for that observation.

  14. IcyTree: rapid browser-based visualization for phylogenetic trees and networks.

    PubMed

    Vaughan, Timothy G

    2017-08-01

    IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree. Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. Contact: tgvaughan@gmail.com. © The Author(s) 2017. Published by Oxford University Press.

  15. SICLE: a high-throughput tool for extracting evolutionary relationships from phylogenetic trees.

    PubMed

    DeBlasio, Dan F; Wisecaver, Jennifer H

    2016-01-01

    We present the phylogeny analysis software SICLE (Sister Clade Extractor), an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteroidetes, identifying 13 unique sister relationships to S. ruber across the 4,589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteroidetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.
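
    The notion of a sister clade used above can be sketched in a few lines. This is not SICLE's implementation or interface (SICLE is a C++ command line utility and also reports support values, which are omitted here); it is a minimal illustration that assumes a rooted tree written as nested tuples with string leaves. All names and the toy tree are hypothetical.

```python
from typing import Optional, Set, Tuple, Union

# A tree is either a leaf name or a tuple of subtrees (assumed representation).
Tree = Union[str, Tuple["Tree", ...]]


def leaves(tree: Tree) -> Set[str]:
    """Collect the leaf names below a node."""
    if isinstance(tree, str):
        return {tree}
    result: Set[str] = set()
    for child in tree:
        result |= leaves(child)
    return result


def sister_leaves(tree: Tree, target: str) -> Optional[Set[str]]:
    """Leaves of the clade(s) sharing a parent with leaf `target`.

    Returns None when `target` is not a leaf of the tree.
    """
    if isinstance(tree, str):
        return None
    if target in tree:  # target is a direct child of this node
        sisters: Set[str] = set()
        for child in tree:
            if child != target:
                sisters |= leaves(child)
        return sisters
    for child in tree:
        found = sister_leaves(child, target)
        if found is not None:
            return found
    return None


# Toy gene tree invented for illustration.
toy_tree: Tree = (
    (("S_ruber", "Bacteroidetes_1"), "Bacteroidetes_2"),
    ("Archaea_1", "Archaea_2"),
)
print(sister_leaves(toy_tree, "S_ruber"))
```

    Running such a function over many gene trees and tallying the taxonomic label of each sister set would give the kind of summary the abstract describes, under the stated assumptions.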

  16. Quantifying MCMC exploration of phylogenetic tree space.

    PubMed

    Whidden, Chris; Matsen, Frederick A

    2015-05-01

    In order to gain an understanding of the effectiveness of phylogenetic Markov chain Monte Carlo (MCMC), it is important to understand how quickly the empirical distribution of the MCMC converges to the posterior distribution. In this article, we investigate this problem on phylogenetic tree topologies with a metric that is especially well suited to the task: the subtree prune-and-regraft (SPR) metric. This metric directly corresponds to the minimum number of MCMC rearrangements required to move between trees in common phylogenetic MCMC implementations. We develop a novel graph-based approach to analyze tree posteriors and find that the SPR metric is much more informative than simpler metrics that are unrelated to MCMC moves. In doing so, we show conclusively that topological peaks do occur in Bayesian phylogenetic posteriors from real data sets as sampled with standard MCMC approaches, investigate the efficiency of Metropolis-coupled MCMC (MCMCMC) in traversing the valleys between peaks, and show that conditional clade distribution (CCD) can have systematic problems when there are multiple peaks. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  17. Inferring Phylogenetic Networks Using PhyloNet.

    PubMed

    Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay

    2018-07-01

    PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyses and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.

  18. Likelihood of Tree Topologies with Fossils and Diversification Rate Estimation.

    PubMed

    Didier, Gilles; Fau, Marine; Laurin, Michel

    2017-11-01

    Since the diversification process cannot be directly observed at the human scale, it has to be studied from the information available, namely the extant taxa and the fossil record. In this sense, phylogenetic trees including both extant taxa and fossils are the most complete representations of the diversification process that one can get. Such phylogenetic trees can be reconstructed from molecular and morphological data, to some extent. Among the temporal information of such phylogenetic trees, fossil ages are by far the most precisely known (divergence times are inferences calibrated mostly with fossils). We propose here a method to compute the likelihood of a phylogenetic tree with fossils in which the only considered time information is the fossil ages, and apply it to the estimation of the diversification rates from such data. Since it is required in our computation, we provide a method for determining the probability of a tree topology under the standard diversification model. Testing our approach on simulated data shows that the maximum likelihood rate estimates from the phylogenetic tree topology and the fossil dates are almost as accurate as those obtained by taking into account all the data, including the divergence times. Moreover, they are substantially more accurate than the estimates obtained only from the exact divergence times (without taking into account the fossil record). We also provide an empirical example composed of 50 Permo-Carboniferous eupelycosaur (early synapsid) taxa ranging in age from about 315 Ma (Late Carboniferous) to 270 Ma (shortly after the end of the Early Permian). Our analyses suggest a speciation (cladogenesis, or birth) rate of about 0.1 per lineage and per myr, a marginally lower extinction rate, and a considerable hidden paleobiodiversity of early synapsids. [Extinction rate; fossil ages; maximum likelihood estimation; speciation rate.]. © The Author(s) 2017. 
Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  19. Relationships among genera of the Saccharomycotina (Ascomycota) from multigene phylogenetic analysis of type species

    USDA-ARS's Scientific Manuscript database

    Phylogenetic relatedness among ascomycetous yeast genera (subphylum Saccharomycotina, phylum Ascomycota) has been uncertain. In the present study, type species of 70 currently recognized genera are compared from divergence in the nearly entire nuclear gene sequences for large subunit rRNA, small sub...

  20. Constructing Student Problems in Phylogenetic Tree Construction.

    ERIC Educational Resources Information Center

    Brewer, Steven D.

    Evolution is often equated with natural selection and is taught from a primarily functional perspective while comparative and historical approaches, which are critical for developing an appreciation of the power of evolutionary theory, are often neglected. This report describes a study of expert problem-solving in phylogenetic tree construction.…

  1. Cyber-infrastructure for Fusarium (CiF): Three integrated platforms supporting strain identification, phylogenetics, comparative genomics, and knowledge sharing

    USDA-ARS's Scientific Manuscript database

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on ...

  2. Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B

    PubMed Central

    2011-01-01

    Background Human Rhinoviruses (HRVs) are well recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. Result To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with the most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as nonstructural proteins 3C and 3D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated to their replication and capability to adapt to the high temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211) with a crossing over having taken place at the boundary of VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. Conclusion This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution. PMID:21214911

  3. Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B.

    PubMed

    Linsuwanon, Piyada; Payungporn, Sunchai; Suwannakarn, Kamol; Chieochansin, Thaweesak; Theamboonlers, Apiradee; Poovorawan, Yong

    2011-01-07

    Human Rhinoviruses (HRVs) are well recognized viral pathogens associated with acute respiratory tract illnesses (RTIs) abundant worldwide. Although recent studies have phylogenetically identified the new HRV species (HRV-C), data on molecular epidemiology, genetic diversity, and clinical manifestation have been limited. To gain new insight into HRV genetic diversity, we determined the complete coding sequences of putative new members of HRV species C (HRV-CU072 with 1% prevalence) and HRV-B (HRV-CU211) identified from clinical specimens collected from pediatric patients diagnosed with a symptom of acute lower RTI. Complete coding sequence and phylogenetic analysis revealed that the HRV-CU072 strain shared a recent common ancestor with the most closely related Chinese strain (N4). Comparative analysis at the protein level showed that HRV-CU072 might accumulate substitutional mutations in structural proteins, as well as nonstructural proteins 3C and 3D. Comparative analysis of all available HRVs and HEVs indicated that HRV-C contains a relatively high G+C content and is more closely related to HEV-D. This might be correlated to their replication and capability to adapt to the high temperature environment of the human lower respiratory tract. We herein report an infrequently occurring intra-species recombination event in HRV-B species (HRV-CU211) with a crossing over having taken place at the boundary of VP2 and VP3 genes. Moreover, we observed phylogenetic compatibility in all HRV species and suggest that dynamic mechanisms for HRV evolution seem to be related to recombination events. These findings indicated that the elementary units shaping the genetic diversity of HRV-C could be found in the nonstructural 2A and 3D genes. This study provides information for understanding HRV genetic diversity and insight into the role of selection pressure and recombination mechanisms influencing HRV evolution.

  4. Generalization of Entropy Based Divergence Measures for Symbolic Sequence Analysis

    PubMed Central

    Ré, Miguel A.; Azad, Rajeev K.

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms. PMID:24728338
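
    The weighted Jensen-Shannon divergence described in this abstract can be sketched in its standard Shannon form, JSD = H(sum_i w_i P_i) - sum_i w_i H(P_i). This is a minimal illustration only: it does not implement the Tsallis or Markovian generalizations, and the function names and the four-letter DNA alphabet are assumptions made for the example.

```python
import math
from collections import Counter

def shannon_entropy(p):
    """Shannon entropy (in bits) of a probability distribution given as a list."""
    return -sum(x * math.log2(x) for x in p if x > 0)

def symbol_distribution(seq, alphabet="ACGT"):
    """Relative frequency of each alphabet symbol in a sequence (hypothetical helper)."""
    counts = Counter(seq)
    total = sum(counts[a] for a in alphabet)
    return [counts[a] / total for a in alphabet]

def jensen_shannon_divergence(dists, weights=None):
    """Weighted JSD of any number of distributions; equal weights by default."""
    n = len(dists)
    if weights is None:
        weights = [1.0 / n] * n
    # Mixture distribution: weighted average of the inputs, symbol by symbol.
    mixture = [sum(w * p[k] for w, p in zip(weights, dists))
               for k in range(len(dists[0]))]
    return shannon_entropy(mixture) - sum(
        w * shannon_entropy(p) for w, p in zip(weights, dists))

# Usage: compare two short segments, weighting each by its length
# (length-proportional weights are an assumption of this sketch).
left, right = "AACCAACC", "GGTT"
total = len(left) + len(right)
jsd = jensen_shannon_divergence(
    [symbol_distribution(left), symbol_distribution(right)],
    weights=[len(left) / total, len(right) / total])
print(jsd)
```

    With base-2 logarithms and two equally weighted distributions, the value lies between 0 (identical composition) and 1 bit (no shared symbols).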

  5. Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase)

    PubMed Central

    Odronitz, Florian; Kollmar, Martin

    2006-01-01

    Background Annotation of protein sequences of eukaryotic organisms is crucial for the understanding of their function in the cell. Manual annotation is still by far the most accurate way to correctly predict genes. The classification of protein sequences, their phylogenetic relation and the assignment of function involves information from various sources. This often leads to a collection of heterogeneous data, which is hard to track. Cytoskeletal and motor proteins consist of large and diverse superfamilies comprising up to several dozen members per organism. To date there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families. Description Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database driven online working environment for the analysis of manually annotated protein sequences and their relationship. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations and sequencing projects as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows users to conveniently browse the database and to compile tabular and graphical summaries of its content. Conclusion We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein. PMID:17134497

  6. Generalization of entropy based divergence measures for symbolic sequence analysis.

    PubMed

    Ré, Miguel A; Azad, Rajeev K

    2014-01-01

    Entropy based measures have been frequently used in symbolic sequence analysis. A symmetrized and smoothed form of Kullback-Leibler divergence or relative entropy, the Jensen-Shannon divergence (JSD), is of particular interest because of its sharing properties with families of other divergence measures and its interpretability in different domains including statistical physics, information theory and mathematical statistics. The uniqueness and versatility of this measure arise because of a number of attributes including generalization to any number of probability distributions and association of weights to the distributions. Furthermore, its entropic formulation allows its generalization in different statistical frameworks, such as, non-extensive Tsallis statistics and higher order Markovian statistics. We revisit these generalizations and propose a new generalization of JSD in the integrated Tsallis and Markovian statistical framework. We show that this generalization can be interpreted in terms of mutual information. We also investigate the performance of different JSD generalizations in deconstructing chimeric DNA sequences assembled from bacterial genomes including that of E. coli, S. enterica typhi, Y. pestis and H. influenzae. Our results show that the JSD generalizations bring in more pronounced improvements when the sequences being compared are from phylogenetically proximal organisms, which are often difficult to distinguish because of their compositional similarity. While small but noticeable improvements were observed with the Tsallis statistical JSD generalization, relatively large improvements were observed with the Markovian generalization. In contrast, the proposed Tsallis-Markovian generalization yielded more pronounced improvements relative to the Tsallis and Markovian generalizations, specifically when the sequences being compared arose from phylogenetically proximal organisms.

  7. Multilocus variable-number tandem repeat analysis for molecular typing and phylogenetic analysis of Shigella flexneri

    PubMed Central

    2009-01-01

    Background Shigella flexneri is one of the causative agents of shigellosis, a major cause of childhood mortality in developing countries. Multilocus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related bacterial isolates for investigation of disease outbreaks and provide information for establishing phylogenetic patterns among isolates. The present study aimed to develop an MLVA method for S. flexneri and the VNTR loci identified were tested on 242 S. flexneri isolates to evaluate their variability in various serotypes. The isolates were also analyzed by pulsed-field gel electrophoresis (PFGE) to compare the discriminatory power and to evaluate the usefulness of MLVA as a tool for phylogenetic analysis of S. flexneri. Results Thirty-six VNTR loci were identified by exploring the repeat sequence loci in genomic sequences of Shigella species and by testing the loci on nine isolates of different subserotypes. The VNTR loci in different serotype groups differed greatly in their variability. The discriminatory power of an MLVA assay based on four most variable VNTR loci was higher, though not significantly, than PFGE for the total isolates, a panel of 2a isolates, which were relatively diverse, and a panel of 4a/Y isolates, which were closely-related. Phylogenetic groupings based on PFGE patterns and MLVA profiles were considerably concordant. The genetic relationships among the isolates were correlated with serotypes. The phylogenetic trees constructed using PFGE patterns and MLVA profiles presented two distinct clusters for the isolates of serotype 3 and one distinct cluster for each of the serotype groups, 1a/1b/NT, 2a/2b/X/NT, 4a/Y, and 6. Isolates that had different serotypes but had closer genetic relatedness than those with the same serotype were observed between serotype Y and subserotype 4a, serotype X and subserotype 2b, subserotype 1a and 1b, and subserotype 3a and 3b. 
Conclusions The 36 VNTR loci identified exhibited considerably different degrees of variability among S. flexneri serotype groups. VNTR locus could be highly variable in a serotype but invariable in others. MLVA assay based on four highly variable loci could display a comparable resolving power to PFGE in discriminating isolates. MLVA is also a prominent molecular tool for phylogenetic analysis of S. flexneri; the resulting data are beneficial to establish clear clonal patterns among different serotype groups and to discern clonal groups among isolates within the same serotype. As highly variable VNTR loci could be serotype-specific, a common MLVA protocol that consists of only a small set of loci, for example four to eight loci, and that provides high resolving power to all S. flexneri serotypes may not be obtainable. PMID:20042119

  8. Phylogenetic Conflict in Bears Identified by Automated Discovery of Transposable Element Insertions in Low-Coverage Genomes

    PubMed Central

    Gallus, Susanne; Janke, Axel

    2017-01-01

    Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. PMID:28985298

  9. Disentangling the phylogenetic and ecological components of spider phenotypic variation.

    PubMed

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out closely related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenetic signal is present at both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affects spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance of considering multiple scales when studying phylogenetic community structure.

  10. Disentangling the Phylogenetic and Ecological Components of Spider Phenotypic Variation

    PubMed Central

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out closely related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenetic signal is present at both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affects spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance of considering multiple scales when studying phylogenetic community structure. PMID:24651264

  11. Phylogenetic signal, feeding behaviour and brain volume in Neotropical bats.

    PubMed

    Rojas, D; Mancina, C A; Flores-Martínez, J J; Navarro, L

    2013-09-01

    Comparative correlational studies of brain size and ecological traits (e.g. feeding habits and habitat complexity) have increased our knowledge about the selective pressures on brain evolution. Studies conducted in bats as a model system assume that shared evolutionary history has a maximum effect on the traits. However, this effect has not been quantified. In addition, the effect of levels of diet specialization on brain size remains unclear. We examined the role of diet on the evolution of brain size in Mormoopidae and Phyllostomidae using two comparative methods. Body mass explained 89% of the variance in brain volume. The effect of feeding behaviour (either characterized as feeding habits, as levels of specialization on a type of item or as handling behaviour) on brain volume was also significant albeit not consistent after controlling for body mass and the strength of the phylogenetic signal (λ). Although the strength of the phylogenetic signal of brain volume and body mass was high when tested individually, λ values in phylogenetic generalized least squares models were significantly different from 1. This suggests that phylogenetic independent contrasts models are not always the best approach for the study of ecological correlates of brain size in New World bats. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.
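
    The role of λ in this abstract can be illustrated with the usual definition of Pagel's λ as a multiplier on the off-diagonal (shared-history) entries of the covariance matrix implied by the tree. At λ = 1 the matrix is the Brownian-motion expectation that independent contrasts assume, and at λ = 0 the species are treated as independent. The sketch below shows only this transformation, not a model fit; the three-species matrix is invented for the example.

```python
def lambda_transform(cov, lam):
    """Scale the off-diagonal entries of a phylogenetic covariance matrix by lam.

    Diagonal entries (root-to-tip distances) are left unchanged.
    """
    n = len(cov)
    return [[cov[i][j] if i == j else lam * cov[i][j] for j in range(n)]
            for i in range(n)]

# Hypothetical covariance matrix for three species: species 0 and 1 share
# 0.8 units of branch length, species 2 shares none with either.
cov = [
    [1.0, 0.8, 0.0],
    [0.8, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

brownian = lambda_transform(cov, 1.0)      # unchanged: full phylogenetic signal
independent = lambda_transform(cov, 0.0)   # diagonal: no phylogenetic signal
intermediate = lambda_transform(cov, 0.5)  # shared history down-weighted by half
```

    A fitted λ that differs significantly from 1 means the data depart from the `brownian` case, which is the situation the abstract reports.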

  12. Breakdown of Phylogenetic Signal: A Survey of Microsatellite Densities in 454 Shotgun Sequences from 154 Non Model Eukaryote Species

    PubMed Central

    Meglécz, Emese; Nève, Gabriel; Biffin, Ed; Gardner, Michael G.

    2012-01-01

    Microsatellites are ubiquitous in Eukaryotic genomes. A more complete understanding of their origin and spread can be gained from a comparison of their distribution within a phylogenetic context. Although information for model species is accumulating rapidly, it is insufficient due to a lack of species depth, thus intragroup variation is necessarily ignored. As such, apparent differences between groups may be overinflated and generalizations cannot be inferred until an analysis of the variation that exists within groups has been conducted. In this study, we examined microsatellite coverage and motif patterns from 454 shotgun sequences of 154 Eukaryote species from eight distantly related phyla (Cnidaria, Arthropoda, Onychophora, Bryozoa, Mollusca, Echinodermata, Chordata and Streptophyta) to test if a consistent phylogenetic pattern emerges from the microsatellite composition of these species. It is clear from our results that data from model species provide incomplete information regarding the existing microsatellite variability within the Eukaryotes. A very strong heterogeneity of microsatellite composition was found within most phyla, classes and even orders. Autocorrelation analyses indicated that while microsatellite contents of species within clades more recent than 200 Mya tend to be similar, the autocorrelation breaks down and becomes negative or non-significant with increasing divergence time. Therefore, the age of the taxon seems to be a primary factor in degrading the phylogenetic pattern present among related groups. The most recent classes or orders of Chordates still retain the pattern of their common ancestor. However, within older groups, such as classes of Arthropods, the phylogenetic pattern has been scrambled by the long independent evolution of the lineages. PMID:22815847

  13. Phylogenetically informed logic relationships improve detection of biological network organization

    PubMed Central

    2011-01-01

    Background A "phylogenetic profile" refers to the presence or absence of a gene across a set of organisms, and it has been proven valuable for understanding gene functional relationships and network organization. Despite this success, few studies have attempted to search beyond just pairwise relationships among genes. Here we search for logic relationships involving three genes, and explore its potential application in gene network analyses. Results Taking advantage of a phylogenetic matrix constructed from the large orthologs database Roundup, we invented a method to create balanced profiles for individual triplets of genes that guarantee equal weight on the different phylogenetic scenarios of coevolution between genes. When we applied this idea to LAPP, the method to search for logic triplets of genes, the balanced profiles resulted in significant performance improvement and the discovery of hundreds of thousands more putative triplets than unadjusted profiles. We found that logic triplets detected biological network organization and identified key proteins and their functions, ranging from neighbouring proteins in local pathways, to well separated proteins in the whole pathway, and to the interactions among different pathways at the system level. Finally, our case study suggested that the directionality in a logic relationship and the profile of a triplet could disclose the connectivity between the triplet and surrounding networks. Conclusion Balanced profiles are superior to the raw profiles employed by traditional methods of phylogenetic profiling in searching for high order gene sets. Gene triplets can provide valuable information in detection of biological network organization and identification of key genes at different levels of cellular interaction. PMID:22172058
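
    A phylogenetic profile and a three-gene logic relationship, as defined in this abstract, can be illustrated with a toy example. This is a simplified sketch: it tests one relationship (C present exactly when A and B are both present) by exact matching, whereas a real analysis would score relationships statistically. It does not implement the balanced profiles, and the organism and gene names are hypothetical.

```python
def profile(gene, presence, organisms):
    """Binary presence/absence vector of a gene across a list of organisms."""
    return [1 if gene in presence[org] else 0 for org in organisms]

def satisfies_and(a, b, c):
    """True if c is present in exactly those organisms where a and b both are."""
    return all(ci == (ai & bi) for ai, bi, ci in zip(a, b, c))

# Hypothetical gene content of four organisms.
organisms = ["org1", "org2", "org3", "org4"]
presence = {
    "org1": {"geneA", "geneB", "geneC"},
    "org2": {"geneA"},
    "org3": {"geneB"},
    "org4": set(),
}

a = profile("geneA", presence, organisms)
b = profile("geneB", presence, organisms)
c = profile("geneC", presence, organisms)
print(a, b, c, satisfies_and(a, b, c))
```

    In this toy data set neither geneA nor geneB alone predicts geneC, but the pair does, which is the kind of signal that pairwise profile comparison misses.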

  14. Biological pattern and transcriptomic exploration and phylogenetic analysis in the odd floral architecture tree: Helwingia willd.

    PubMed

    Sun, Cheng; Yu, Guoliang; Bao, Manzhu; Zheng, Bo; Ning, Guogui

    2014-06-27

    Unusual traits found in only a few plant species often point to potential biological significance in plant evolution. The genus Helwingia Willd, a dioecious medicinal shrub in the order Aquifoliales, has an odd floral architecture: an epiphyllous inflorescence. The potential significance and possible evolutionary origin of this species are not well understood because few biological and genetic data are available. In addition, the advent of genomics-based technologies has transformed the study of plant species with unknown genomic information. Morphological and biological patterns were detailed via anatomical and pollination analyses. An RNA sequencing based transcriptomic analysis was undertaken and a high-resolution phylogenetic analysis was conducted based on single-copy genes in more than 80 species of seed plants, including H. japonica. It is verified that a potential fusion of rachis to the leaf midvein facilitates insect pollination. RNA sequencing yielded a total of 111450 unigenes; half of them had significant similarity with proteins in the public database, and 20281 unigenes were mapped to 119 pathways. According to the phylogenetic analysis based on single-copy genes, the group of Helwingia is closer to Euasterids II rather than Euasterids, congruent with previous reports using plastid sequences. The odd floral architecture adapts H. Willd to insect pollination by letting the leaf host insects larger than the flower, a feature that has little in common with other insect-pollinated plants. Furthermore, the present transcriptome greatly enriches the genomic information available for Helwingia species, and the phylogenetic analysis based on nuclear genes greatly improves the resolution and robustness of phylogenetic reconstruction in H. japonica.

  15. An improved model for whole genome phylogenetic analysis by Fourier transform.

    PubMed

    Yin, Changchuan; Yau, Stephen S-T

    2015-10-07

    DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences have undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. 
The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
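
    The general pipeline outlined in this abstract (numerical mapping, DFT power spectrum, scaling to a common length, Euclidean distance) can be sketched as follows. The abstract does not give the details of the 2D mapping or of the even scaling algorithm, so both are replaced here by simple stand-ins: a hypothetical mapping of nucleotides to points in the complex plane, and a nearest-entry stretch of the shorter spectrum. The naive DFT is quadratic in sequence length and is meant for short illustrative inputs only.

```python
import cmath
import math

# Hypothetical 2D mapping (an assumption, not taken from the abstract):
# each nucleotide becomes a point in the complex plane.
MAPPING = {"A": 1 + 0j, "T": -1 + 0j, "C": 0 + 1j, "G": 0 - 1j}

def power_spectrum(seq):
    """Power spectrum |X[k]|^2 of the naive DFT of the mapped sequence."""
    x = [MAPPING[base] for base in seq]
    n = len(x)
    spectrum = []
    for k in range(n):
        xk = sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        spectrum.append(abs(xk) ** 2)
    return spectrum

def stretch(spectrum, length):
    """Stretch a spectrum to `length` points by repeating the nearest entries.

    A simple stand-in for the even scaling step, not the published algorithm.
    """
    n = len(spectrum)
    return [spectrum[i * n // length] for i in range(length)]

def dft_distance(seq_a, seq_b):
    """Euclidean distance between length-matched power spectra."""
    pa, pb = power_spectrum(seq_a), power_spectrum(seq_b)
    m = max(len(pa), len(pb))
    pa, pb = stretch(pa, m), stretch(pb, m)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(pa, pb)))

# Usage: pairwise distances like this one could feed a hierarchical
# clustering step to build a tree.
print(dft_distance("ACGTACGT", "ACGTTCGA"))
```

    Because the distance is computed on power spectra rather than aligned positions, no alignment step is needed, which is the property the abstract emphasizes.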

  16. Phylogeny of Marsileaceous Ferns and Relationships of the Fossil Hydropteris pinnata Reconsidered.

    PubMed

    Pryer

    1999-09-01

    Recent phylogenetic studies have provided compelling evidence that confirms the once disputed hypothesis of monophyly for heterosporous leptosporangiate ferns (Marsileaceae and Salviniaceae). Hypotheses for relationships among the three genera of Marsileaceae (Marsilea, Regnellidium, and Pilularia), however, have continued to be in conflict. The phylogeny of Marsileaceae is investigated here using information from morphology and rbcL sequence data. In addition, relationships among all heterosporous ferns, including the whole-plant fossil Hydropteris pinnata are reconsidered. Data sets of 71 morphological and 1239 rbcL characters for 23 leptosporangiate ferns, including eight heterosporous ingroup taxa and 15 homosporous outgroup taxa, were subjected to maximum parsimony analysis. Morphological analyses were carried out both with and without the fossil Hydropteris, and it was excluded from all analyses with rbcL data. An annotated list of the 71 morphological characters is provided in the appendix. For comparative purposes, the Rothwell and Stockey (1994) data set was also reanalyzed here. The best estimate of phylogenetic relationships for Marsileaceae in all analyses is that Pilularia and Regnellidium are sister taxa and Marsilea is sister to that clade. Morphological synapomorphies for various nodes are discussed. Analyses that included Hydropteris resulted in two most-parsimonious trees that differ only in the placement of the fossil. One topology is identical to the relationship found by Rothwell and Stockey (1994), placing the fossil sister to the Azolla plus Salvinia clade. The alternative topology places Hydropteris as the most basal member of the heterosporous fern clade. Equivocal interpretations for character evolution in heterosporous ferns are discussed in the context of these two most-parsimonious trees. 
Because of the observed degree of character ambiguity, the phylogenetic placement of Hydropteris is best viewed as unresolved, and recognition of the suborder Hydropteridineae, as circumscribed by Rothwell and Stockey (1994), is regarded as premature. The two competing hypotheses of relationships for heterosporous ferns are also compared with the known temporal distribution of relevant taxa. Stratigraphic fit of the phylogenetic estimates is measured by using the Stratigraphic Consistency Index and by comparison with minimum divergence times.

  17. cDNA identification, comparison and phylogenetic aspects of lombricine kinase from two oligochaete species.

    PubMed

    Doumen, Chris

    2010-06-01

    Creatine kinase and arginine kinase are the typical representatives of an eight-member phosphagen kinase family, which play important roles in the cellular energy metabolism of animals. The phylum Annelida underwent a series of evolutionary processes that resulted in rapid divergence and radiation of these enzymes, producing the greatest diversity of the phosphagen kinases within this phylum. Lombricine kinase (EC 2.7.3.5) is one such enzyme, and its sequence information is rather limited compared to other phosphagen kinases. This study presents data on the cDNA sequences of lombricine kinase from two oligochaete species, the California blackworm (Lumbriculus variegatus) and the sludge worm (Tubifex tubifex). The deduced amino acid sequences are analyzed and compared with other selected phosphagen kinases, including two additional lombricine kinase sequences extracted from DNA databases, and provide further insights into the evolution and position of these enzymes within the phosphagen kinase family. The data confirm the presence of a deleted region within the flexible loop (the GS region) of all six examined lombricine kinases. A phylogenetic analysis of these six lombricine kinases clearly positions the enzymes together in a small subcluster within the larger creatine kinase (EC 2.7.3.2) clade. 2010. Published by Elsevier Inc.

  18. Species diversity driven by morphological and ecological disparity: a case study of comparative seed morphology and anatomy across a large monocot order

    PubMed Central

    Benedict, John C.; Smith, Selena Y.; Specht, Chelsea D.; Collinson, Margaret E.; Leong-Škorničková, Jana; Parkinson, Dilworth Y.; Marone, Federica

    2016-01-01

    Phenotypic variation can be attributed to genetic heritability as well as biotic and abiotic factors. Across Zingiberales, there is a high variation in the number of species per clade and in phenotypic diversity. Factors contributing to this phenotypic variation have never been studied in a phylogenetic or ecological context. Seeds of 166 species from all eight families in Zingiberales were analyzed for 51 characters using synchrotron based 3D X-ray tomographic microscopy to determine phylogenetically informative characters and to understand the distribution of morphological disparity within the order. All families are distinguishable based on seed characters. Non-metric multidimensional scaling analyses show Zingiberaceae occupy the largest seed morphospace relative to the other families, and environmental analyses demonstrate that Zingiberaceae inhabit both temperate and tropical regions, while other Zingiberales are almost exclusively tropical. Temperate species do not cluster in morphospace nor do they share a common suite of character states. This suggests that the diversity seen is not driven by adaptation to temperate niches; rather, the morphological disparity seen likely reflects an underlying genetic plasticity that allowed Zingiberaceae to repeatedly colonize temperate environments. The notable morphoanatomical variety in Zingiberaceae seeds may account for their extraordinary ecological success and high species diversity as compared to other Zingiberales. PMID:27594701

  19. EchinoDB, an application for comparative transcriptomics of deeply-sampled clades of echinoderms.

    PubMed

    Janies, Daniel A; Witter, Zach; Linchangco, Gregorio V; Foltz, David W; Miller, Allison K; Kerr, Alexander M; Jay, Jeremy; Reid, Robert W; Wray, Gregory A

    2016-01-22

    One of our goals for the echinoderm tree of life project (http://echinotol.org) is to identify orthologs suitable for phylogenetic analysis from next-generation transcriptome data. The current dataset is the largest assembled for echinoderm phylogeny and transcriptomics. We used RNA-Seq to profile adult tissues from 42 echinoderm specimens from 24 orders and 37 families. To sample members of clades that span key evolutionary divergences, many of our exemplars were collected from deep and polar seas. A small fraction of the transcriptome data we produced is being used for phylogenetic reconstruction. Thus, to make a larger dataset available to researchers with a wide variety of interests, we made a web-based application, EchinoDB (http://echinodb.uncc.edu). EchinoDB is a repository of orthologous transcripts from echinoderms that is searchable via keywords and sequence similarity. From transcripts we identified 749,397 clusters of orthologous loci. We have developed the information technology to manage and search the loci and their annotations with respect to the Sea Urchin (Strongylocentrotus purpuratus) genome. Several users have already taken advantage of these data for spin-off projects in developmental biology, gene family studies, and neuroscience. We hope others will search EchinoDB to discover datasets relevant to a variety of additional questions in comparative biology.

  20. Integrating brain, behavior, and phylogeny to understand the evolution of sensory systems in birds

    PubMed Central

    Wylie, Douglas R.; Gutiérrez-Ibáñez, Cristian; Iwaniuk, Andrew N.

    2015-01-01

    The comparative anatomy of sensory systems has played a major role in developing theories and principles central to evolutionary neuroscience. This includes the central tenet of many comparative studies, the principle of proper mass, which states that the size of a neural structure reflects its processing capacity. The size of structures within the sensory system is not, however, the only salient variable in sensory evolution. Further, the evolution of the brain and behavior are intimately tied to phylogenetic history, requiring studies to integrate neuroanatomy with behavior and phylogeny to gain a more holistic view of brain evolution. Birds have proven to be a useful group for these studies because of widespread interest in their phylogenetic relationships and a wealth of information on the functional organization of most of their sensory pathways. In this review, we examine the principle of proper mass in relation to differences in the sensory capabilities among birds. We discuss how neuroanatomy, behavior, and phylogeny can be integrated to understand the evolution of sensory systems in birds, providing evidence from visual, auditory, and somatosensory systems. We also consider the concept of a “trade-off,” whereby one sensory system (or subpathway within a sensory system) may be expanded in size at the expense of others, which are reduced in size. PMID:26321905

  1. Comparative phylogenetic analyses of Halomonas variabilis and related organisms based on 16S rRNA, gyrB and ectBC gene sequences.

    PubMed

    Okamoto, Takuji; Maruyama, Akihiko; Imura, Satoshi; Takeyama, Haruko; Naganuma, Takeshi

    2004-05-01

    Halomonas variabilis and phylogenetically related organisms were isolated from various habitats such as Antarctic terrain and saline ponds, deep-sea sediment, deep-sea waters affected by hydrothermal plumes, and hydrothermal vent fluids. Ten strains were selected for physiological and phylogenetic characterization in detail. All of those strains were found to be piezotolerant and psychrotolerant, as well as euryhaline halophilic or halotolerant. Their stress tolerance may facilitate their wide occurrence, even in so-called extreme environments. The 16S rDNA-based phylogenetic relationship was complemented by analyses of the DNA gyrase subunit B gene (gyrB) and genes involved in the synthesis of the major compatible solute, ectoine: diaminobutyric acid aminotransferase gene (ectB) and ectoine synthase gene (ectC). The phylogenetic relationships of H. variabilis and related organisms were very similar in terms of 16S rDNA, gyrB, and ectB. The ectC-based tree was inconsistent with the other phylogenetic trees. For that reason, ectC was inferred to derive from horizontal transfer.

  2. Enumerating all maximal frequent subtrees in collections of phylogenetic trees

    PubMed Central

    2014-01-01

    Background: A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results: We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions: Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474
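
    The idea of an agreement subtree can be illustrated with a small sketch. It is not the authors' mfst-miner implementation: it is a brute-force search, exponential in the number of taxa, that assumes rooted trees written as nested Python tuples with string leaf labels. The function names and the two example trees are hypothetical. Two rooted trees agree on a taxon subset when, restricted to that subset, they contain the same set of clades.

```python
from itertools import combinations

def leaves(tree):
    """Return the set of leaf labels of a rooted tree given as nested tuples."""
    if isinstance(tree, str):
        return {tree}
    return set().union(*(leaves(child) for child in tree))

def clusters(tree, keep):
    """Clades (as frozensets) of the tree after restricting it to taxa in `keep`."""
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node]) & keep
        below = frozenset().union(*(visit(child) for child in node))
        if len(below) > 1:  # singletons carry no topological information
            found.add(below)
        return below

    visit(tree)
    return found

def agree_on(trees, taxa):
    """True if all trees have identical clade sets when restricted to `taxa`."""
    keep = frozenset(taxa)
    first = clusters(trees[0], keep)
    return all(clusters(t, keep) == first for t in trees[1:])

def maximum_agreement_taxa(trees):
    """Largest taxon subset on which all trees agree (brute force, small inputs only)."""
    shared = set.intersection(*(leaves(t) for t in trees))
    for size in range(len(shared), 0, -1):
        for subset in combinations(sorted(shared), size):
            if agree_on(trees, subset):
                return set(subset)
    return set()

# Hypothetical example: the trees disagree on where B and C attach.
tree_1 = ((("A", "B"), "C"), "D")
tree_2 = ((("A", "C"), "B"), "D")
mast_taxa = maximum_agreement_taxa([tree_1, tree_2])
print(sorted(mast_taxa))
```

    In this toy case the two trees conflict on all four taxa but agree once C is dropped, so the search returns a three-taxon subset. Several subsets of that size may qualify; the sketch returns the first in sorted order.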

  3. A methodological investigation of hominoid craniodental morphology and phylogenetics.

    PubMed

    Bjarnason, Alexander; Chamberlain, Andrew T; Lockwood, Charles A

    2011-01-01

    The evolutionary relationships of extant great apes and humans have been largely resolved by molecular studies, yet morphology-based phylogenetic analyses continue to provide conflicting results. In order to further investigate this discrepancy we present bootstrap clade support of morphological data based on two quantitative datasets, one dataset consisting of linear measurements of the whole skull from 5 hominoid genera and the second dataset consisting of 3D landmark data from the temporal bone of 5 hominoid genera, including 11 sub-species. Using similar protocols for both datasets, we were able to 1) compare distance-based phylogenetic methods to cladistic parsimony of quantitative data converted into discrete character states, 2) vary outgroup choice to observe its effect on phylogenetic inference, and 3) analyse male and female data separately to observe the effect of sexual dimorphism on phylogenies. Phylogenetic analysis was sensitive to methodological decisions, particularly outgroup selection, where designation of Pongo as an outgroup and removal of Hylobates resulted in greater congruence with the proposed molecular phylogeny. The performance of distance-based methods also justifies their use in phylogenetic analysis of morphological data. It is clear from our analyses that hominoid phylogenetics ought not to be used as an example of conflict between the morphological and molecular, but as an example of how outgroup and methodological choices can affect the outcome of phylogenetic analysis. Copyright © 2010 Elsevier Ltd. All rights reserved.

  4. Enumerating all maximal frequent subtrees in collections of phylogenetic trees.

    PubMed

    Deepak, Akshay; Fernández-Baca, David

    2014-01-01

    A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.

  5. On the phylogenetic placement of human T cell leukemia virus type 1 sequences associated with an Andean mummy.

    PubMed

    Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A

    2006-03-01

    Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.

  6. Microbes on mountainsides: Contrasting elevational patterns of bacterial and plant diversity

    PubMed Central

    Bryant, Jessica A.; Lamanna, Christine; Morlon, Hélène; Kerkhoff, Andrew J.; Enquist, Brian J.; Green, Jessica L.

    2008-01-01

    The study of elevational diversity gradients dates back to the foundation of biogeography. Although elevational patterns of plant and animal diversity have been studied for centuries, such patterns have not been reported for microorganisms and remain poorly understood. Here, in an effort to assess the generality of elevational diversity patterns, we examined soil bacterial and plant diversity along an elevation gradient. To gain insight into the forces that structure these patterns, we adopted a multifaceted approach to incorporate information about the structure, diversity, and spatial turnover of montane communities in a phylogenetic context. We found that observed patterns of plant and bacterial diversity were fundamentally different. While bacterial taxon richness and phylogenetic diversity decreased monotonically from the lowest to highest elevations, plants followed a unimodal pattern, with a peak in richness and phylogenetic diversity at mid-elevations. At all elevations bacterial communities had a tendency to be phylogenetically clustered, containing closely related taxa. In contrast, plant communities did not exhibit a uniform phylogenetic structure across the gradient: they became more overdispersed with increasing elevation, containing distantly related taxa. Finally, a metric of phylogenetic beta-diversity showed that bacterial lineages were not randomly distributed, but rather exhibited significant spatial structure across the gradient, whereas plant lineages did not exhibit a significant phylogenetic signal. Quantifying the influence of sample scale in intertaxonomic comparisons remains a challenge. Nevertheless, our findings suggest that the forces structuring microorganism and macroorganism communities along elevational gradients differ. PMID:18695215

  7. Advances in the floral structural characterization of the major subclades of Malpighiales, one of the largest orders of flowering plants

    PubMed Central

    Endress, Peter K.; Davis, Charles C.; Matthews, Merran L.

    2013-01-01

    Background and Aims: Malpighiales are one of the largest angiosperm orders and have undergone radical systematic restructuring based on molecular phylogenetic studies. The clade has been recalcitrant to molecular phylogenetic reconstruction, but has become much more resolved at the suprafamilial level. It now contains so many newly identified clades that there is an urgent need for comparative studies to understand their structure, biology and evolution. This is especially true because the order contains a disproportionally large diversity of rain forest species and includes numerous agriculturally important plants. This study is a first broad systematic step in this endeavour. It focuses on a comparative structural overview of the flowers across all recently identified suprafamilial clades of Malpighiales, and points towards areas that desperately need attention. Methods: The phylogenetic comparative analysis of floral structure for the order is based on our previously published studies on four suprafamilial clades of Malpighiales, including also four related rosid orders (Celastrales, Crossosomatales, Cucurbitales, Oxalidales). In addition, the results are compiled from a survey of over 3000 publications on macrosystematics, floral structure and embryology across all orders of the core eudicots. Key Results: Most new suprafamilial clades within Malpighiales are well supported by floral structural features. Inner morphological structures of the gynoecium (i.e. stigmatic lobes, inner shape of the locules, placentation, presence of obturators) and ovules (i.e. structure of the nucellus, thickness of the integuments, presence of vascular bundles in the integuments, presence of an endothelium in the inner integument) appear to be especially suitable for characterizing suprafamilial clades within Malpighiales. 
Conclusions: Although the current phylogenetic reconstruction of Malpighiales is much improved compared with earlier versions, it is incomplete, and further focused phylogenetic and morphological studies are needed. Once all major subclades of Malpighiales are elucidated, more in-depth studies on promising structural features can be conducted. In addition, once the phylogenetic tree of Malpighiales, including closely related orders, is more fully resolved, character optimization studies will be possible to reconstruct evolution of structural and biological features within the order. PMID:23486341

  8. Chromhome: A rich internet application for accessing comparative chromosome homology maps

    PubMed Central

    Nagarajan, Sridevi; Rens, Willem; Stalker, James; Cox, Tony; Ferguson-Smith, Malcolm A

    2008-01-01

    Background: Comparative genomics has become a significant research area in recent years, following the availability of a number of sequenced genomes. The comparison of genomes is of great importance in the analysis of functionally important genome regions. It can also be used to understand the phylogenetic relationships of species and the mechanisms leading to rearrangement of karyotypes during evolution. Many species have been studied at the cytogenetic level by cross-species chromosome painting. With the large amount of such information, it has become vital to computerize the data and make them accessible worldwide. Chromhome is a comprehensive web application that is designed to provide cytogenetic comparisons among species and to fulfil this need. Results: The Chromhome application architecture is multi-tiered with an interactive client layer, business logic and database layers. The Enterprise Java platform with the open source framework OpenLaszlo is used to implement the Rich Internet Chromhome Application. Cross-species comparative mapping raw data are collected and the processed information is stored in the MySQL Chromhome database. Chromhome Release 1.0 contains 109 homology maps from 51 species. The data cover species from 14 orders and 30 families. The homology map displays all the chromosomes of the compared species as one image, making comparisons among species easier. Inferred data also provide maps of homologous regions that could serve as a guideline for researchers involved in phylogenetic or evolution-based studies. Conclusion: Chromhome provides a useful resource for comparative genomics, holding graphical homology maps of a wide range of species. It brings together cytogenetic data of many genomes under one roof. Inferred painting can often determine the chromosomal homologous regions between two species, if each has been compared with a common third species. 
Inferred painting greatly reduces the need to map entire genomes and helps focus only on relevant regions of the chromosomes of the species under study. Future releases of Chromhome will accommodate more species and their respective gene and BAC maps, in addition to chromosome painting data. The Chromhome application provides a single-page interface (SPI) with a desktop-style layout, delivering a better and richer user experience. PMID:18366796
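
    The inferred-painting idea (two species are related through a common third species) can be sketched as a composition of two painting maps. This is only an illustration under stated assumptions: the segment and chromosome names are invented, each reference segment is assumed to paint a single chromosome per species, and nothing here reflects Chromhome's actual database schema or code.

```python
from collections import defaultdict

def infer_homology(ref_to_a, ref_to_b):
    """Pair chromosomes of species A and B that are painted by the same
    reference-species segment. Each argument maps a reference segment name
    to the chromosome it paints in that species (a simplifying assumption)."""
    inferred = defaultdict(set)
    for segment, chrom_a in ref_to_a.items():
        if segment in ref_to_b:  # segment must be painted in both species
            inferred[chrom_a].add(ref_to_b[segment])
    return dict(inferred)

# Hypothetical painting data against a shared reference species.
ref_to_a = {"ref1p": "A3", "ref1q": "A3", "ref2": "A1", "ref3": "A7"}
ref_to_b = {"ref1p": "B2", "ref1q": "B5", "ref2": "B1"}

homology = infer_homology(ref_to_a, ref_to_b)
for chrom_a in sorted(homology):
    print(chrom_a, "->", sorted(homology[chrom_a]))
```

    In the made-up data, chromosome A3 is inferred to share regions with both B2 and B5, while A7 gets no inferred partner because its reference segment was never painted onto species B.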

  9. Chromhome: a rich internet application for accessing comparative chromosome homology maps.

    PubMed

    Nagarajan, Sridevi; Rens, Willem; Stalker, James; Cox, Tony; Ferguson-Smith, Malcolm A

    2008-03-26

    Comparative genomics has become a significant research area in recent years, following the availability of a number of sequenced genomes. The comparison of genomes is of great importance in the analysis of functionally important genome regions. It can also be used to understand the phylogenetic relationships of species and the mechanisms leading to rearrangement of karyotypes during evolution. Many species have been studied at the cytogenetic level by cross-species chromosome painting. With the large amount of such information, it has become vital to computerize the data and make them accessible worldwide. Chromhome (http://www.chromhome.org) is a comprehensive web application that is designed to provide cytogenetic comparisons among species and to fulfil this need. The Chromhome application architecture is multi-tiered with an interactive client layer, business logic and database layers. The Enterprise Java platform with the open source framework OpenLaszlo is used to implement the Rich Internet Chromhome Application. Cross-species comparative mapping raw data are collected and the processed information is stored in the MySQL Chromhome database. Chromhome Release 1.0 contains 109 homology maps from 51 species. The data cover species from 14 orders and 30 families. The homology map displays all the chromosomes of the compared species as one image, making comparisons among species easier. Inferred data also provide maps of homologous regions that could serve as a guideline for researchers involved in phylogenetic or evolution-based studies. Chromhome provides a useful resource for comparative genomics, holding graphical homology maps of a wide range of species. It brings together cytogenetic data of many genomes under one roof. Inferred painting can often determine the chromosomal homologous regions between two species, if each has been compared with a common third species. 
Inferred painting greatly reduces the need to map entire genomes and helps focus only on relevant regions of the chromosomes of the species under study. Future releases of Chromhome will accommodate more species and their respective gene and BAC maps, in addition to chromosome painting data. The Chromhome application provides a single-page interface (SPI) with a desktop-style layout, delivering a better and richer user experience.

  10. Biodiversity assessment among two Nebraska prairies: a comparison between traditional and phylogenetic diversity indices

    PubMed Central

    Aust, Shelly K.; Ahrendsen, Dakota L.

    2015-01-01

    Background: Conservation of the evolutionary diversity among organisms should be included in the selection of priority regions for preservation of Earth’s biodiversity. Traditionally, biodiversity has been determined from an assessment of species richness (S), abundance, evenness, rarity, etc. of organisms but not from variation in species’ evolutionary histories. Phylogenetic diversity (PD) measures evolutionary differences between taxa in a community and is gaining acceptance as a biodiversity assessment tool. However, with the increase in the number of ways to calculate PD, end-users and decision-makers are left wondering how metrics compare and what data are needed to calculate various metrics. New information: In this study, we used massively parallel sequencing to generate over 65,000 DNA characters from three cellular compartments for over 60 species in the asterid clade of flowering plants. We estimated asterid phylogenies from character datasets of varying nucleotide quantities, and then assessed the effect of varying character datasets on resulting PD metric values. We also compared multiple PD metrics with traditional diversity indices (including S) among two endangered grassland prairies in Nebraska (U.S.A.). Our results revealed that PD metrics varied based on the quantity of genes used to infer the phylogenies; therefore, when comparing PD metrics between sites, it is vital to use comparable datasets. Additionally, various PD metrics and traditional diversity indices characterize biodiversity differently and should be chosen depending on the research question. Our study provides empirical results that reveal the value of measuring PD when considering sites for conservation, and it highlights the usefulness of using PD metrics in combination with other diversity indices when studying community assembly and ecosystem functioning. 
Ours is just one example of the types of investigations that need to be conducted across the tree of life and across varying ecosystems in order to build a database of phylogenetic diversity assessments that lead to a pool of results upon which a guide through the plethora of PD metrics may be prepared for use by ecologists and conservation planners. PMID:26312052
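
    A toy calculation can show how a PD metric differs from species richness (S). The sketch below uses one common formulation, Faith's PD, computed as the sum of branch lengths on the union of root-to-tip paths for the taxa present at a site. The abstract does not say which PD metrics the study used, so this choice is an assumption, and the tree, branch lengths and site compositions are invented for illustration.

```python
def faith_pd(parent, taxa):
    """Sum the branch lengths on the union of root-to-tip paths for `taxa`.

    `parent` maps each non-root node to (parent_node, branch_length); every
    non-root node therefore identifies the branch directly above it."""
    used = set()
    for taxon in taxa:
        node = taxon
        # Walk towards the root, stopping early on branches already counted.
        while node in parent and node not in used:
            used.add(node)
            node = parent[node][0]
    return sum(parent[node][1] for node in used)

# Hypothetical rooted tree: ((A:2.0,(B:1.5,C:1.5)n2:0.5)n1:1.0,D:3.0)root
parent = {
    "n1": ("root", 1.0),
    "D": ("root", 3.0),
    "A": ("n1", 2.0),
    "n2": ("n1", 0.5),
    "B": ("n2", 1.5),
    "C": ("n2", 1.5),
}

site_1 = {"B", "C"}  # two closely related species
site_2 = {"A", "D"}  # two distantly related species

pd_1 = faith_pd(parent, site_1)
pd_2 = faith_pd(parent, site_2)
print(len(site_1), pd_1)
print(len(site_2), pd_2)
```

    Both invented sites have the same richness (S = 2), yet the site holding distantly related species scores a higher PD, which is the kind of difference between index types that the abstract describes.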

  11. Opposing assembly mechanisms in a neotropical dry forest: implications for phylogenetic and functional community ecology.

    PubMed

    Swenson, Nathan G; Enquist, Brian J

    2009-08-01

    Species diversity is promoted and maintained by ecological and evolutionary processes operating on species attributes through space and time. The degree to which variability in species function regulates distribution and promotes coexistence of species has been debated. Previous work has attempted to quantify the relative importance of species function by using phylogenetic relatedness as a proxy for functional similarity. The key assumption of this approach is that function is phylogenetically conserved. If this assumption is supported, then the phylogenetic dispersion in a community should mirror the functional dispersion. Here we quantify functional trait dispersion along several key axes of tree life-history variation and on multiple spatial scales in a Neotropical dry-forest community. We next compare these results to previously reported patterns of phylogenetic dispersion in this same forest. We find that, at small spatial scales, coexisting species are typically more functionally clustered than expected, but traits related to adult and regeneration niches are overdispersed. This outcome was repeated when the analyses were stratified by size class. Some of the trait dispersion results stand in contrast to the previously reported phylogenetic dispersion results. In order to address this inconsistency we examined the strength of phylogenetic signal in traits at different depths in the phylogeny. We argue that: (1) while phylogenetic relatedness may be a good general multivariate proxy for ecological similarity, it may have a reduced capacity to depict the functional mechanisms behind species coexistence when coexisting species simultaneously converge and diverge in function; and (2) the previously used metric of phylogenetic signal provided erroneous inferences about trait dispersion when married with patterns of phylogenetic dispersion.

  12. Phylogenetic patterns and the adaptive evolution of osmoregulation in fiddler crabs (Brachyura, Uca)

    PubMed Central

    Faria, Samuel Coelho; Provete, Diogo Borges; Thurman, Carl Leo

    2017-01-01

    Salinity is the primary driver of osmoregulatory evolution in decapods, and may have influenced their diversification into different osmotic niches. In semi-terrestrial crabs, hyper-osmoregulatory ability favors sojourns into burrows and dilute media, and provides a safeguard against hemolymph dilution; hypo-osmoregulatory ability underlies emersion capability and a life more removed from water sources. However, most comparative studies have neglected the roles of the phylogenetic and environmental components of inter-specific physiological variation, hindering evaluation of phylogenetic patterns and the adaptive nature of osmoregulatory evolution. Semi-terrestrial fiddler crabs (Uca) inhabit fresh to hyper-saline waters, with species from the Americas occupying higher intertidal habitats than Indo-west Pacific species mainly found in the low intertidal zone. Here, we characterize numerous osmoregulatory traits in all ten fiddler crabs found along the Atlantic coast of Brazil, and we employ phylogenetic comparative methods using 24 species to test for: (i) similarities of osmoregulatory ability among closely related species; (ii) salinity as a driver of osmoregulatory evolution; (iii) correlation between salt uptake and secretion; and (iv) adaptive peaks in osmoregulatory ability in the high intertidal American lineages. Our findings reveal that osmoregulation in Uca exhibits strong phylogenetic patterns in salt uptake traits. Salinity does not correlate with hyper/hypo-regulatory abilities, but drives hemolymph osmolality at ambient salinities. Osmoregulatory traits have evolved towards three adaptive peaks, revealing a significant contribution of hyper/hypo-regulatory ability in the American clades. Thus, during the evolutionary history of fiddler crabs, salinity has driven some of the osmoregulatory transformations that underpin habitat diversification, although others are apparently constrained phylogenetically. PMID:28182764

  13. A review of bioinformatics platforms for comparative genomics. Recent developments of the EDGAR 2.0 platform and its utility for taxonomic and phylogenetic studies.

    PubMed

    Yu, J; Blom, J; Glaeser, S P; Jaenicke, S; Juhre, T; Rupp, O; Schwengers, O; Spänig, S; Goesmann, A

    2017-11-10

    The rapid development of next generation sequencing technology has greatly increased the number of available microbial genomes. As a result of this development, there is a rising demand for fast and automated approaches in analyzing these genomes in a comparative way. Whole genome sequencing also bears a huge potential for obtaining a higher resolution in phylogenetic and taxonomic classification. During the last decade, several software tools and platforms have been developed in the field of comparative genomics. In this manuscript, we review the most commonly used platforms and approaches for ortholog group analyses with a focus on their potential for phylogenetic and taxonomic research. Furthermore, we describe the latest improvements of the EDGAR platform for comparative genome analyses and present recent examples of its application for the phylogenomic analysis of different taxa. Finally, we illustrate the role of the EDGAR platform as part of the BiGi Center for Microbial Bioinformatics within the German network on Bioinformatics Infrastructure (de.NBI). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  14. Evolutionary lineages of marine snails identified using molecular phylogenetics and geometric morphometric analysis of shells.

    PubMed

    Vaux, Felix; Trewick, Steven A; Crampton, James S; Marshall, Bruce A; Beu, Alan G; Hills, Simon F K; Morgan-Richards, Mary

    2018-06-15

    The relationship between morphology and inheritance is of perennial interest in evolutionary biology and palaeontology. Using three marine snail genera Penion, Antarctoneptunea and Kelletia, we investigate whether systematics based on shell morphology accurately reflect evolutionary lineages indicated by molecular phylogenetics. Members of these gastropod genera have been a taxonomic challenge due to substantial variation in shell morphology, conservative radular and soft tissue morphology, few known ecological differences, and geographical overlap between numerous species. Sampling all sixteen putative taxa identified across the three genera, we infer mitochondrial and nuclear ribosomal DNA phylogenetic relationships within the group, and compare this to variation in adult shell shape and size. Results of phylogenetic analysis indicate that each genus is monophyletic, although the status of some phylogenetically derived and likely more recently evolved taxa within Penion is uncertain. The recently described species P. lineatus is supported by genetic evidence. Morphology, captured using geometric morphometric analysis, distinguishes the genera and matches the molecular phylogeny, although using the same dataset, species and phylogenetic subclades are not identified with high accuracy. Overall, despite abundant variation, we find that shell morphology accurately reflects genus-level classification and the corresponding deep phylogenetic splits identified in this group of marine snails. Copyright © 2018 Elsevier Inc. All rights reserved.

  15. Molecular Phylogenetics and Systematics of the Bivalve Family Ostreidae Based on rRNA Sequence-Structure Models and Multilocus Species Tree

    PubMed Central

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663

  16. Molecular phylogenetics and systematics of the bivalve family Ostreidae based on rRNA sequence-structure models and multilocus species tree.

    PubMed

    Salvi, Daniele; Macali, Armando; Mariottini, Paolo

    2014-01-01

    The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.

  17. Phylogenetic Conflict in Bears Identified by Automated Discovery of Transposable Element Insertions in Low-Coverage Genomes.

    PubMed

    Lammers, Fritjof; Gallus, Susanne; Janke, Axel; Nilsson, Maria A

    2017-10-01

    Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  18. Macroevolutionary developmental biology: Embryos, fossils, and phylogenies.

    PubMed

    Organ, Chris L; Cooper, Lisa Noelle; Hieronymus, Tobin L

    2015-10-01

    The field of evolutionary developmental biology is broadly focused on identifying the genetic and developmental mechanisms underlying morphological diversity. Connecting the genotype with the phenotype means that evo-devo research often considers a wide range of evidence, from genetics and morphology to fossils. In this commentary, we provide an overview and framework for integrating fossil ontogenetic data with developmental data using phylogenetic comparative methods to test macroevolutionary hypotheses. We survey the vertebrate fossil record of preserved embryos and discuss how phylogenetic comparative methods can integrate data from developmental genetics and paleontology. Fossil embryos provide limited, yet critical, developmental data from deep time. They help constrain when developmental innovations first appeared during the history of life and also reveal the order in which related morphologies evolved. Phylogenetic comparative methods provide a powerful statistical approach that allows evo-devo researchers to infer the presence of nonpreserved developmental traits in fossil species and to detect discordant evolutionary patterns and processes across levels of biological organization. © 2015 Wiley Periodicals, Inc.

  19. Complete mitochondrial genomes of eleven extinct or possibly extinct bird species.

    PubMed

    Anmarkrud, Jarl A; Lifjeld, Jan T

    2017-03-01

    Natural history museum collections represent a vast source of ancient and historical DNA samples from extinct taxa that can be utilized by high-throughput sequencing tools to reveal novel genetic and phylogenetic information about them. Here, we report on the successful sequencing of complete mitochondrial genome sequences (mitogenomes) from eleven extinct bird species, using de novo assembly of short sequences derived from toepad samples of degraded DNA from museum specimens. For two species (the Passenger Pigeon Ectopistes migratorius and the South Island Piopio Turnagra capensis), whole mitogenomes were already available from recent studies, whereas for five others (the Great Auk Pinguinus impennis, the Imperial Woodpecker Campephilus imperialis, the Huia Heteralocha acutirostris, the Kauai Oo Moho braccatus and the South Island Kokako Callaeas cinereus), there were partial mitochondrial sequences available for comparison. For all seven species, we found sequence similarities of >98%. For the remaining four species (the Kamao Myadestes myadestinus, the Paradise Parrot Psephotellus pulcherrimus, the Ou Psittirostra psittacea and the Lesser Akialoa Akialoa obscura), there was no sequence information available for comparison, so we conducted BLAST searches and phylogenetic analyses to determine their phylogenetic positions and identify their closest extant relatives. These mitogenomes will be valuable for future analyses of avian phylogenetics and illustrate the importance of museum collections as repositories for genomics resources. © 2016 John Wiley & Sons Ltd.

  20. Functional & phylogenetic diversity of copepod communities

    NASA Astrophysics Data System (ADS)

    Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.

    2016-02-01

    The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.
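
    The split of beta diversity into a replacement component and a richness difference component can be illustrated with a minimal sketch. The abstract does not name the index family used, so the Jaccard-based partition below (total = replacement + richness difference) is an assumption, and the species labels and site contents are hypothetical.

```python
def beta_partition(site1, site2):
    """Jaccard-based partition of pairwise beta diversity.

    a = species shared by both sites, b = species only in site1,
    c = species only in site2. Returns (total, replacement, richness_difference).
    Assumed formulas, not taken from the abstract:
      total       = (b + c) / (a + b + c)
      replacement = 2 * min(b, c) / (a + b + c)
      richness    = |b - c| / (a + b + c)
    """
    a = len(site1 & site2)
    b = len(site1 - site2)
    c = len(site2 - site1)
    n = a + b + c
    if n == 0:
        raise ValueError("both sites are empty")
    total = (b + c) / n
    replacement = 2 * min(b, c) / n
    richness = abs(b - c) / n
    return total, replacement, richness


# Hypothetical presence lists for two sampling sites.
site_a = {"sp1", "sp2", "sp3", "sp4", "sp5"}
site_b = {"sp4", "sp5", "sp6"}

total, replacement, richness = beta_partition(site_a, site_b)
print(f"total={total:.3f} replacement={replacement:.3f} richness={richness:.3f}")
```

    Here two species are shared, three occur only at the first site and one only at the second, so one third of the dissimilarity comes from replacement and one third from the difference in richness.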

  1. Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

    PubMed

    Barr, W Andrew; Scott, Robert S

    2014-04-01

    In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, <5.6% of DFAs and <8.26% of PGLS analyses were significant. When characters contained phylogenetic signal and actual habitats were used, 33.27 to 45.07% of DFAs and <13.09% of PGLS analyses were significant. False Discovery Rate (FDR) corrections for multiple PGLS analyses reduced the rate of significance to <4.64%. In all cases using actual habitats and characters with phylogenetic signal, correct classification rates of DFAs exceeded random chance. In simulations involving phylogenetic signal in both predictor variables and predicted categories, PGLS with FDR was rarely significant, while DFA often was. In short, DFA offered no indication that differences between categories might be explained by phylogenetic signal, while PGLS did. As such, PGLS provides a valuable tool for testing the functional hypotheses at the heart of ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.
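
    The effect of a False Discovery Rate correction on a batch of per-character tests can be sketched as follows. The abstract does not state which FDR procedure was applied, so the Benjamini-Hochberg step-up rule is assumed here, and the p-values are hypothetical.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans, True where a test stays significant
    under the Benjamini-Hochberg step-up procedure at level alpha."""
    m = len(p_values)
    # Rank the p-values from smallest to largest, keeping original positions.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose p-value satisfies p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    # Every test ranked at or below the cutoff is declared significant.
    significant = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            significant[idx] = True
    return significant


# Hypothetical p-values from eight separate PGLS fits.
p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
uncorrected = sum(pv < 0.05 for pv in p)
corrected = sum(benjamini_hochberg(p))
print(uncorrected, corrected)
```

    With these made-up values, five tests fall below 0.05 before correction and two remain significant afterwards, which mirrors the direction of the drop in significance rate reported in the abstract.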

  2. Novel Substrates as Sources of Ancient DNA: Prospects and Hurdles

    PubMed Central

    Green, Eleanor Joan

    2017-01-01

    Following the discovery in the late 1980s that hard tissues such as bones and teeth preserve genetic information, the field of ancient DNA analysis has typically concentrated upon these substrates. The onset of high-throughput sequencing, combined with optimized DNA recovery methods, has enabled the analysis of a myriad of ancient species and specimens worldwide, dating back to the Middle Pleistocene. Despite the growing sophistication of analytical techniques, the genetic analysis of substrates other than bone and dentine remains comparatively “novel”. Here, we review analyses of other biological substrates which offer great potential for elucidating phylogenetic relationships, paleoenvironments, and microbial ecosystems including (1) archaeological artifacts and ecofacts; (2) calcified and/or mineralized biological deposits; and (3) biological and cultural archives. We conclude that there is a pressing need for more refined models of DNA preservation and bespoke tools for DNA extraction and analysis to authenticate and maximize the utility of the data obtained. With such tools in place the potential for neglected or underexploited substrates to provide a unique insight into phylogenetics, microbial evolution and evolutionary processes will be realized. PMID:28703741

  3. Pectoral girdle and fin anatomy of Gogonasus andrewsae Long, 1985: implications for tetrapodomorph limb evolution.

    PubMed

    Holland, Timothy

    2013-02-01

    Recently discovered material has yielded new information on the pectoral girdle and fin endoskeleton of Gogonasus andrewsae (Frasnian Gogo Formation, Kimberley Region, Western Australia). These elements permit the first comprehensive description of the anocleithrum, cleithrum, scapulocoracoid, and lepidotrichia. New autapomorphies of Gogonasus include a square exposed region on the supracleithrum, an unusual knob-like process on the scapulocoracoid, a relatively small entepicondyle, and lepidotrichia with I-beam-shaped cross sections. Several poorly ossified regions on the scapulocoracoid and humerus indicate an early ontogenetic state, as with other immature tetrapodomorph fish specimens. A phylogenetic analysis indicates a more stemward position for Gogonasus in a weakly supported clade with other "osteolepidid" taxa, compared to other recent studies placing Gogonasus crownward of osteolepidid fishes and the Tristichopteridae, as the sister taxon to the "Elpistostegalia" + Tetrapoda. A phylogenetic position among megalichthyid fishes is suggested for Sterropterygion, while radiographs of the megalichthyid Cladarosymblema show a scythe-like radius terminating distally with that of the intermedium. New data on the scapulocoracoid of the rhizodontid Barameda reveals a coracoid crest and small supraglenoid foramen. Copyright © 2012 Wiley Periodicals, Inc.

  4. Complete chloroplast genome of Tetragonia tetragonioides: Molecular phylogenetic relationships and evolution in Caryophyllales.

    PubMed

    Choi, Kyoung Su; Kwak, Myounghai; Lee, Byoungyoon; Park, SeonJoo

    2018-01-01

    The chloroplast genome of Tetragonia tetragonioides (Aizoaceae; Caryophyllales) was sequenced to provide information for studies on phylogeny and evolution within Caryophyllales. The chloroplast genome of Tetragonia tetragonioides is 149,506 bp in length and includes a pair of inverted repeats (IRs) of 24,769 bp that separate a large single copy (LSC) region of 82,780 bp and a small single copy (SSC) region of 17,188 bp. Comparative analysis of the chloroplast genome showed that Caryophyllales species have lost many genes. In particular, the rpl2 intron and infA gene were not found in T. tetragonioides, and core Caryophyllales lack the rpl2 intron. Phylogenetic analyses were conducted using 55 genes in 16 complete chloroplast genomes. Caryophyllales was found to divide into two clades; core Caryophyllales and noncore Caryophyllales. The genus Tetragonia is closely related to Mesembryanthemum. Comparisons of the synonymous (Ks), nonsynonymous (Ka), and Ka/Ks substitution rates revealed that nonsynonymous substitution rates were lower than synonymous substitution rates and that Ka/Ks rates were less than 1. The findings of the present study suggest that most genes are under purifying selection.
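
    The region lengths quoted in this abstract can be checked with simple arithmetic, and the Ka/Ks reading can be made explicit. The sketch below uses the lengths given in the abstract; the function name and the Ka and Ks values passed to it are hypothetical.

```python
# Region lengths in bp, as quoted in the abstract.
LSC = 82_780   # large single copy region
SSC = 17_188   # small single copy region
IR = 24_769    # one inverted repeat; the plastome carries two copies

# A quadripartite plastome is LSC + SSC + two IR copies.
total_length = LSC + SSC + 2 * IR
print(total_length)  # 149506, matching the reported genome length


def selection_regime(ka, ks):
    """Classify a gene by its Ka/Ks ratio (conventional interpretation)."""
    if ks == 0:
        raise ValueError("Ks is zero, so the ratio is undefined")
    ratio = ka / ks
    if ratio < 1:
        return "purifying"
    if ratio > 1:
        return "positive"
    return "neutral"


# Hypothetical rates for a single gene, for illustration only.
print(selection_regime(ka=0.02, ks=0.20))
```

    A ratio below 1 means nonsynonymous changes accumulate more slowly than synonymous ones, which is the pattern the abstract reports for most genes.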

  5. Ecological Genomics of Marine Picocyanobacteria

    PubMed Central

    Scanlan, D. J.; Ostrowski, M.; Mazard, S.; Dufresne, A.; Garczarek, L.; Hess, W. R.; Post, A. F.; Hagemann, M.; Paulsen, I.; Partensky, F.

    2009-01-01

    Summary: Marine picocyanobacteria of the genera Prochlorococcus and Synechococcus numerically dominate the picophytoplankton of the world ocean, making a key contribution to global primary production. Prochlorococcus was isolated around 20 years ago and is probably the most abundant photosynthetic organism on Earth. The genus comprises specific ecotypes which are phylogenetically distinct and differ markedly in their photophysiology, allowing growth over a broad range of light and nutrient conditions within the 45°N to 40°S latitudinal belt that they occupy. Synechococcus and Prochlorococcus are closely related, together forming a discrete picophytoplankton clade, but are distinguishable by their possession of dissimilar light-harvesting apparatuses and differences in cell size and elemental composition. Synechococcus strains have a ubiquitous oceanic distribution compared to that of Prochlorococcus strains and are characterized by phylogenetically discrete lineages with a wide range of pigmentation. In this review, we put our current knowledge of marine picocyanobacterial genomics into an environmental context and present previously unpublished genomic information arising from extensive genomic comparisons in order to provide insights into the adaptations of these marine microbes to their environment and how they are reflected at the genomic level. PMID:19487728

  6. A fully resolved consensus between fully resolved phylogenetic trees.

    PubMed

    Quitzau, José Augusto Amgarten; Meidanis, João

    2006-03-31

    Nowadays, there are many phylogeny reconstruction methods, each with advantages and disadvantages. We explored the advantages of each method, putting together the common parts of trees constructed by several methods, by means of a consensus computation. A number of phylogenetic consensus methods are already known. Unfortunately, there is also a taboo concerning consensus methods, because most biologists see them mainly as comparators and not as phylogenetic tree constructors. We challenged this taboo by defining a consensus method that builds a fully resolved phylogenetic tree based on the most common parts of fully resolved trees in a given collection. We also generated results showing that this consensus is in a way a kind of "median" of the input trees; as such it can be closer to the correct tree in many situations.
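
    The idea of collecting the parts that several trees have in common can be sketched by counting how often each clade appears across a collection. This is not the authors' consensus method, which builds a fully resolved tree; it is only an illustration of clade counting, and the nested-tuple tree encoding and taxon labels are assumptions.

```python
from collections import Counter


def clades(tree):
    """Return (leaf_set, clade_set) for a rooted tree written as nested tuples
    whose leaves are strings."""
    if isinstance(tree, str):
        return frozenset([tree]), set()
    leaves = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found |= child_clades
    found.add(leaves)  # the clade rooted at this internal node
    return leaves, found


def clade_support(trees):
    """Count in how many input trees each clade occurs."""
    counts = Counter()
    for tree in trees:
        _, tree_clades = clades(tree)
        counts.update(tree_clades)
    return counts


# Three hypothetical fully resolved trees on the taxa A to E.
trees = [
    ((("A", "B"), "C"), ("D", "E")),
    (("A", "B"), ("C", ("D", "E"))),
    ((("A", "C"), "B"), ("D", "E")),
]

support = clade_support(trees)
strict = {c for c, n in support.items() if n == len(trees)}
majority = {c for c, n in support.items() if n > len(trees) / 2}
print(sorted("".join(sorted(c)) for c in strict))
print(sorted("".join(sorted(c)) for c in majority))
```

    Clades found in every tree form a strict consensus, and those found in more than half form a majority-rule consensus; either can leave polytomies, which is the limitation a fully resolved consensus aims to avoid.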

  7. Molecular phylogenetics of finches and sparrows: consequences of character state removal in cytochrome b sequences.

    PubMed

    Groth, J G

    1998-12-01

    The complete mitochondrial cytochrome b genes of 53 genera of oscine passerine birds representing the major groups of finches and some allies were compared. Phylogenetic trees resulting from three levels of character partition removal (no data removed, transitions at third positions of codons removed, and all transitions removed [transversion parsimony]) were generally concordant, and all supported several basic statements regarding relationships of finches and finch-like birds, including: (1) larks (Alaudidae) show no close relationship to any finch group; (2) Peucedramus (olive warbler) is phylogenetically far removed from true wood warblers; (3) a clade consisting of fringillids, passerids, motacillids, and emberizids is supported, and this clade is characterized by evolution of a vestigial 10th wing primary; and (4) Hawaiian honeycreepers are derived from within the cardueline finches. Excluding transition substitutions at third positions of codons resulted in phylogenetic trees similar to, but with greater bootstrap nodal support than, trees derived using either all data (equally weighted) or transversion parsimony. Relative to the shortest trees obtained using all data, the topologies obtained after elimination of third-position transitions showed only slight increases in realized treelength and homoplasy. These increases were negligible compared to increases in overall nodal support; therefore, this partition removal scheme may enhance recovery of deep phylogenetic signal in protein-coding DNA datasets. Copyright 1998 Academic Press.
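
    The three character-removal schemes compared in this abstract can be illustrated on a pair of aligned coding sequences. The sketch below only counts pairwise differences under each scheme and is not a parsimony analysis; the function names, scheme labels and sequences are hypothetical, and the alignment is assumed to be in frame with no gaps or ambiguity codes.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def is_transition(a, b):
    """True for purine-purine or pyrimidine-pyrimidine substitutions."""
    return a != b and ({a, b} <= PURINES or {a, b} <= PYRIMIDINES)


def count_differences(seq1, seq2, scheme="all"):
    """Count differing sites between two aligned, in-frame sequences.

    scheme: 'all'         counts every difference,
            'no_third_ts' skips transitions at third codon positions,
            'tv_only'     skips all transitions (transversions only).
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    n = 0
    for pos, (a, b) in enumerate(zip(seq1, seq2)):
        if a == b:
            continue
        ts = is_transition(a, b)
        third = pos % 3 == 2  # zero-based index 2, 5, 8, ... is codon position 3
        if scheme == "tv_only" and ts:
            continue
        if scheme == "no_third_ts" and ts and third:
            continue
        n += 1
    return n


# Two hypothetical 12-bp fragments (four codons each).
s1 = "ATGCTAGGACAT"
s2 = "ATACTGGTCTAT"
for scheme in ("all", "no_third_ts", "tv_only"):
    print(scheme, count_differences(s1, s2, scheme))
```

    In this toy pair the intermediate scheme discards the two third-position transitions but keeps the first-position transition that transversion-only counting would also drop, which shows how it retains more characters than transversion parsimony.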

  8. High-resolution SAR11 ecotype dynamics at the Bermuda Atlantic Time-series Study site by phylogenetic placement of pyrosequences

    PubMed Central

    Vergin, Kevin L; Beszteri, Bánk; Monier, Adam; Cameron Thrash, J; Temperton, Ben; Treusch, Alexander H; Kilpert, Fabian; Worden, Alexandra Z; Giovannoni, Stephen J

    2013-01-01

    Advances in next-generation sequencing technologies are providing longer nucleotide sequence reads that contain more information about phylogenetic relationships. We sought to use this information to understand the evolution and ecology of bacterioplankton at our long-term study site in the Western Sargasso Sea. A bioinformatics pipeline called PhyloAssigner was developed to align pyrosequencing reads to a reference multiple sequence alignment of 16S ribosomal RNA (rRNA) genes and assign them phylogenetic positions in a reference tree using a maximum likelihood algorithm. Here, we used this pipeline to investigate the ecologically important SAR11 clade of Alphaproteobacteria. A combined set of 2.7 million pyrosequencing reads from the 16S rRNA V1–V2 regions, representing 9 years at the Bermuda Atlantic Time-series Study (BATS) site, was quality checked and parsed into a comprehensive bacterial tree, yielding 929 036 Alphaproteobacteria reads. Phylogenetic structure within the SAR11 clade was linked to seasonally recurring spatiotemporal patterns. This analysis resolved four new SAR11 ecotypes in addition to five others that had been described previously at BATS. The data support a conclusion reached previously that the SAR11 clade diversified by subdivision of niche space in the ocean water column, but the new data reveal a more complex pattern in which deep branches of the clade diversified repeatedly across depth strata and seasonal regimes. The new data also revealed the presence of an unrecognized clade of Alphaproteobacteria, here named SMA-1 (Sargasso Mesopelagic Alphaproteobacteria, group 1), in the upper mesopelagic zone. The high-resolution phylogenetic analyses performed herein highlight significant, previously unknown patterns of evolutionary diversification within perhaps the most widely distributed heterotrophic marine bacterial clade, and strong links to ecosystem regimes. PMID:23466704

  9. High-resolution SAR11 ecotype dynamics at the Bermuda Atlantic Time-series Study site by phylogenetic placement of pyrosequences.

    PubMed

    Vergin, Kevin L; Beszteri, Bánk; Monier, Adam; Thrash, J Cameron; Temperton, Ben; Treusch, Alexander H; Kilpert, Fabian; Worden, Alexandra Z; Giovannoni, Stephen J

    2013-07-01

    Advances in next-generation sequencing technologies are providing longer nucleotide sequence reads that contain more information about phylogenetic relationships. We sought to use this information to understand the evolution and ecology of bacterioplankton at our long-term study site in the Western Sargasso Sea. A bioinformatics pipeline called PhyloAssigner was developed to align pyrosequencing reads to a reference multiple sequence alignment of 16S ribosomal RNA (rRNA) genes and assign them phylogenetic positions in a reference tree using a maximum likelihood algorithm. Here, we used this pipeline to investigate the ecologically important SAR11 clade of Alphaproteobacteria. A combined set of 2.7 million pyrosequencing reads from the 16S rRNA V1-V2 regions, representing 9 years at the Bermuda Atlantic Time-series Study (BATS) site, was quality checked and parsed into a comprehensive bacterial tree, yielding 929 036 Alphaproteobacteria reads. Phylogenetic structure within the SAR11 clade was linked to seasonally recurring spatiotemporal patterns. This analysis resolved four new SAR11 ecotypes in addition to five others that had been described previously at BATS. The data support a conclusion reached previously that the SAR11 clade diversified by subdivision of niche space in the ocean water column, but the new data reveal a more complex pattern in which deep branches of the clade diversified repeatedly across depth strata and seasonal regimes. The new data also revealed the presence of an unrecognized clade of Alphaproteobacteria, here named SMA-1 (Sargasso Mesopelagic Alphaproteobacteria, group 1), in the upper mesopelagic zone. The high-resolution phylogenetic analyses performed herein highlight significant, previously unknown patterns of evolutionary diversification within perhaps the most widely distributed heterotrophic marine bacterial clade, and strong links to ecosystem regimes.

  10. Thioredoxin and evolution

    NASA Technical Reports Server (NTRS)

    Buchanan, B. B.

    1991-01-01

    Comparisons of primary structure have revealed significant homology between the m type thioredoxins of chloroplasts and the thioredoxins from a variety of bacteria. Chloroplast thioredoxin f, by comparison, remains an enigma: certain residues are invariant with those of the other thioredoxins, but a phylogenetic relationship to bacterial or m thioredoxins seems distant. Knowledge of the evolutionary history of thioredoxin f is, nevertheless, of interest because of its role in photosynthesis. Therefore, we have attempted to gain information on the evolutionary history of chloroplast thioredoxin f, as well as m. Our goal was first to establish the utility of thioredoxin as a phylogenetic marker, and, if found suitable, to deduce the evolutionary histories of the chloroplast thioredoxins. To this end, we have constructed phylogenetic (minimal replacement) trees using computer analysis. The results show that the thioredoxins of bacteria and animals fall into distinct phylogenetic groups: the bacterial group resembling that derived from earlier 16S rRNA analysis and the animal group showing a cluster consistent with known relationships. The chloroplast thioredoxins show a novel type of phylogenetic arrangement: one m type aligns with its counterpart of eukaryotic algae, cyanobacteria and other bacteria, whereas the second type (f type) tracks with animal thioredoxin. The results give new insight into the evolution of photosynthesis.

  11. Single-cell analysis of uncultured magnetotactic bacteria via fluorescence-coupled electron microscopy approach

    NASA Astrophysics Data System (ADS)

    LI, J.; Zhang, H.; Liu, P.; Menguy, N.; Pan, Y.

    2017-12-01

    Magnetotactic bacteria (MTB) are phylogenetically diverse and can biomineralize magnetic nanocrystals of magnetite or greigite in intracellular structures termed magnetosomes. Their remains within sediments or sedimentary rocks, i.e. magnetofossils, have been used to retrieve paleomagnetic and paleoenvironmental information of deposition time, as well as to trace the origin and evolution of life on Earth and even perhaps Mars. A precise identification of magnetofossils heavily depends on our knowledge of phylogenetic diversity and magnetosomal biomineralization within natural MTB. In this paper, we will present a novel method which can rapidly characterize both the phylogenetic and biomineralogical properties of uncultured MTB at the single-cell level by coupling fluorescence and electron microscopy. Using this method, we have successfully identified several uncultured MTB strains from natural environments in China. These MTB are phylogenetically affiliated with the Alphaproteobacteria, Deltaproteobacteria, Gammaproteobacteria and Nitrospirae phylum, and form octahedral, cuboctahedral, prismatic, tooth-like and bullet-shaped magnetite magnetosomes. A corresponding analysis of magnetosome morphology and bacterial phylogenetics on each MTB strain has shown a species/strain-specific magnetosome biomineralization. The new method is not only promising for better understanding the correlation between magnetosome mineral habits and MTB phylogenies, but also crucial for unambiguously identifying magnetofossils.

  12. From learning taxonomies to phylogenetic learning: integration of 16S rRNA gene data into FAME-based bacterial classification.

    PubMed

    Slabbinck, Bram; Waegeman, Willem; Dawyndt, Peter; De Vos, Paul; De Baets, Bernard

    2010-01-30

    Machine learning techniques have been shown to improve bacterial species classification based on fatty acid methyl ester (FAME) data. Nonetheless, FAME analysis has a limited resolution for discrimination of bacteria at the species level. In this paper, we approach the species classification problem from a taxonomic point of view. Such a taxonomy or tree is typically obtained by applying clustering algorithms on FAME data or on 16S rRNA gene data. The knowledge gained from the tree can then be used to evaluate FAME-based classifiers, resulting in a novel framework for bacterial species classification. In view of learning in a taxonomic framework, we consider two types of trees. First, a FAME tree is constructed with a supervised divisive clustering algorithm. Subsequently, based on 16S rRNA gene sequence analysis, phylogenetic trees are inferred by the NJ and UPGMA methods. In this second approach, the species classification problem is based on the combination of two different types of data. Herein, 16S rRNA gene sequence data is used for phylogenetic tree inference and the corresponding binary tree splits are learned based on FAME data. We call this learning approach 'phylogenetic learning'. Supervised Random Forest models are developed to train the classification tasks in a stratified cross-validation setting. In this way, better classification results are obtained for species that are typically hard to distinguish by a single or flat multi-class classification model. FAME-based bacterial species classification is successfully evaluated in a taxonomic framework. Although the proposed approach does not improve the overall accuracy compared to flat multi-class classification, it has some distinct advantages. First, it has better capabilities for distinguishing species on which flat multi-class classification fails. Second, the hierarchical classification structure makes it easy to evaluate and visualize the resolution of FAME data for the discrimination of bacterial species. In summary, phylogenetic learning lets us situate and evaluate FAME-based bacterial species classification in a more informative context.

  13. From learning taxonomies to phylogenetic learning: Integration of 16S rRNA gene data into FAME-based bacterial classification

    PubMed Central

    2010-01-01

    Background: Machine learning techniques have been shown to improve bacterial species classification based on fatty acid methyl ester (FAME) data. Nonetheless, FAME analysis has a limited resolution for discrimination of bacteria at the species level. In this paper, we approach the species classification problem from a taxonomic point of view. Such a taxonomy or tree is typically obtained by applying clustering algorithms on FAME data or on 16S rRNA gene data. The knowledge gained from the tree can then be used to evaluate FAME-based classifiers, resulting in a novel framework for bacterial species classification. Results: In view of learning in a taxonomic framework, we consider two types of trees. First, a FAME tree is constructed with a supervised divisive clustering algorithm. Subsequently, based on 16S rRNA gene sequence analysis, phylogenetic trees are inferred by the NJ and UPGMA methods. In this second approach, the species classification problem is based on the combination of two different types of data. Herein, 16S rRNA gene sequence data is used for phylogenetic tree inference and the corresponding binary tree splits are learned based on FAME data. We call this learning approach 'phylogenetic learning'. Supervised Random Forest models are developed to train the classification tasks in a stratified cross-validation setting. In this way, better classification results are obtained for species that are typically hard to distinguish by a single or flat multi-class classification model. Conclusions: FAME-based bacterial species classification is successfully evaluated in a taxonomic framework. Although the proposed approach does not improve the overall accuracy compared to flat multi-class classification, it has some distinct advantages. First, it has better capabilities for distinguishing species on which flat multi-class classification fails. Second, the hierarchical classification structure makes it easy to evaluate and visualize the resolution of FAME data for the discrimination of bacterial species. In summary, phylogenetic learning lets us situate and evaluate FAME-based bacterial species classification in a more informative context. PMID:20113515

  14. Phylogeny, host-parasite relationship and zoogeography

    PubMed Central

    1999-01-01

    Phylogeny is the evolutionary history of a group or the lineage of organisms and is reconstructed based on morphological, molecular and other characteristics. The genealogical relationship of a group of taxa is often expressed as a phylogenetic tree. The difficulty in categorizing the phylogeny is mainly due to the existence of frequent homoplasies that deceive observers. At the present time, cladistic analysis is believed to be one of the most effective methods of reconstructing a phylogenetic tree. Excellent computer program software for phylogenetic analysis is available. As an example, cladistic analysis was applied for nematode genera of the family Acuariidae, and the phylogenetic tree formed was compared with the system used currently. Nematodes in the genera Nippostrongylus and Heligmonoides were also analyzed, and the validity of the reconstructed phylogenetic trees was observed from a zoogeographical point of view. Some of the theories of parasite evolution were briefly reviewed as well. Coevolution of parasites and humans was discussed with special reference to the evolutionary relationship between Enterobius and primates. PMID:10634036

  15. Entire plastid phylogeny of the carrot genus (Daucus, Apiaceae): Concordance with nuclear data and mitochondrial and nuclear DNA insertions to the plastid

    USDA-ARS's Scientific Manuscript database

    We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results to prior phylogenetic results using plastid, nuclear, and mitochondrial DNA sequences. We obtained, using Illumina sequencing, full plastid sequences of 37 accessions of 20 Daucus taxa and outgrou...

  16. P-type ATPase superfamily: evidence for critical roles for kingdom evolution.

    PubMed

    Okamura, Hideyuki; Denawa, Masatsugu; Ohniwa, Ryosuke; Takeyasu, Kunio

    2003-04-01

    The P-type ATPase has become a protein superfamily. On the basis of sequence similarities, the phylogenetic analyses, and substrate specificities, this superfamily can be classified into 5 families and 11 subfamilies. A comparative phylogenetic analysis demonstrates the relationship between the molecular evolution of these subfamilies and the establishment of the kingdoms of living things.

  17. Prioritizing Populations for Conservation Using Phylogenetic Networks

    PubMed Central

    Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.

    2014-01-01

    In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451
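
    As a rough illustration of the kind of tree-based evolutionary isolation index that the abstract above extends to networks, the sketch below computes a "fair proportion" score, in which each branch length is shared equally among the leaves descending from it. The choice of this particular index, the toy tree, and the names `fair_proportion`, `parent` and `length` are assumptions made for illustration; the network extension described in the paper is not reproduced here.

```python
def fair_proportion(parent, length):
    """Score each leaf by sharing every ancestral branch equally
    among the leaves that descend from it (tree-based index only)."""
    internal = {p for p in parent.values() if p is not None}
    leaves = [n for n in parent if n not in internal]

    def path_to_root(node):
        # Yield every node whose upper branch lies between `node` and the root.
        while parent[node] is not None:
            yield node
            node = parent[node]

    # Count how many leaves sit below each branch.
    n_below = {}
    for leaf in leaves:
        for node in path_to_root(leaf):
            n_below[node] = n_below.get(node, 0) + 1

    return {
        leaf: sum(length[node] / n_below[node] for node in path_to_root(leaf))
        for leaf in leaves
    }


# Hypothetical toy tree: ((B:1, C:1)X:2, A:3)root
parent = {"root": None, "X": "root", "A": "root", "B": "X", "C": "X"}
length = {"X": 2.0, "A": 3.0, "B": 1.0, "C": 1.0}
scores = fair_proportion(parent, length)
```

    In this toy tree the isolated population A scores higher than B or C, and the scores add up to the total branch length of the tree, which is what makes the index usable as a ranking of contributions to the whole set.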

  18. Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees.

    PubMed

    Martínez-Aquino, Andrés

    2016-08-01

    Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host-parasite. In the past 2 decades, algorithms have been developed for cophylogenetic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetic inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, advances in phylogenetic biogeography that apply methods based on parametric process models and combined Bayesian approaches can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a "compass" when "walking" through jungles of tangled phylogenetic trees.

  19. Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees

    PubMed Central

    2016-01-01

    Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host–parasite. In the past 2 decades, algorithms have been developed for cophylogenetic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetic inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, advances in phylogenetic biogeography that apply methods based on parametric process models and combined Bayesian approaches can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a “compass” when “walking” through jungles of tangled phylogenetic trees. PMID:29491928

  20. Optimal network alignment with graphlet degree vectors.

    PubMed

    Milenković, Tijana; Ng, Weng Leong; Hayes, Wayne; Przulj, Natasa

    2010-06-30

    Important biological information is encoded in the topology of biological networks. Comparative analyses of biological networks are proving to be valuable, as they can lead to transfer of knowledge between species and give deeper insights into biological function, disease, and evolution. We introduce a new method that uses the Hungarian algorithm to produce optimal global alignment between two networks using any cost function. We design a cost function based solely on network topology and use it in our network alignment. Our method can be applied to any two networks, not just biological ones, since it is based only on network topology. We use our new method to align protein-protein interaction networks of two eukaryotic species and demonstrate that our alignment exposes large and topologically complex regions of network similarity. At the same time, our alignment is biologically valid, since many of the aligned protein pairs perform the same biological function. From the alignment, we predict function of yet unannotated proteins, many of which we validate in the literature. Also, we apply our method to find topological similarities between metabolic networks of different species and build phylogenetic trees based on our network alignment score. The phylogenetic trees obtained in this way bear a striking resemblance to the ones obtained by sequence alignments. Our method detects topologically similar regions in large networks that are statistically significant. It does this independent of protein sequence or any other information external to network topology.
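
    The abstract above pairs nodes of two networks by solving an assignment problem with the Hungarian algorithm over a topology-based cost. The sketch below is a minimal pure-Python version of that idea. The cost used here (absolute difference in node degree) is only a simple stand-in for the graphlet-degree-based cost the authors describe, and the toy networks and the names `hungarian` and `degree_cost` are assumptions for illustration.

```python
def hungarian(cost):
    """Minimum-cost perfect matching on a square cost matrix.
    Returns (total_cost, assignment) with assignment[row] = column."""
    n = len(cost)
    inf = float("inf")
    u = [0.0] * (n + 1)          # row potentials (1-based)
    v = [0.0] * (n + 1)          # column potentials (1-based)
    match = [0] * (n + 1)        # match[j] = row matched to column j, 0 = free
    way = [0] * (n + 1)          # previous column on the augmenting path
    for i in range(1, n + 1):
        match[0] = i
        j0 = 0
        minv = [inf] * (n + 1)
        used = [False] * (n + 1)
        while True:
            used[j0] = True
            i0 = match[j0]
            delta, j1 = inf, 0
            for j in range(1, n + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(n + 1):
                if used[j]:
                    u[match[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if match[j0] == 0:
                break
        while j0:                # flip the augmenting path
            j1 = way[j0]
            match[j0] = match[j1]
            j0 = j1
    assignment = [0] * n
    for j in range(1, n + 1):
        assignment[match[j] - 1] = j - 1
    return sum(cost[i][assignment[i]] for i in range(n)), assignment


def degree_cost(g1, g2):
    """Stand-in topological cost: absolute difference in node degree."""
    n1, n2 = sorted(g1), sorted(g2)
    return n1, n2, [[abs(len(g1[a]) - len(g2[b])) for b in n2] for a in n1]


# Two hypothetical 3-node path networks given as adjacency lists.
g1 = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
g2 = {"x": ["y"], "y": ["x", "z"], "z": ["y"]}
nodes1, nodes2, cost = degree_cost(g1, g2)
total, assignment = hungarian(cost)
aligned = {nodes1[i]: nodes2[j] for i, j in enumerate(assignment)}
```

    Because the matching is optimal for whatever cost matrix is supplied, swapping in a richer topological cost changes only `degree_cost`, not the alignment step.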

  1. HAL: a hierarchical format for storing and analyzing multiple genome alignments.

    PubMed

    Hickey, Glenn; Paten, Benedict; Earl, Dent; Zerbino, Daniel; Haussler, David

    2013-05-15

    Large multiple genome alignments and inferred ancestral genomes are ideal resources for comparative studies of molecular evolution, and advances in sequencing and computing technology are making them increasingly obtainable. These structures can provide a rich understanding of the genetic relationships between all subsets of species they contain. Current formats for storing genomic alignments, such as XMFA and MAF, are all indexed or ordered using a single reference genome, however, which limits the information that can be queried with respect to other species and clades. This loss of information grows with the number of species under comparison, as well as their phylogenetic distance. We present HAL, a compressed, graph-based hierarchical alignment format for storing multiple genome alignments and ancestral reconstructions. HAL graphs are indexed on all genomes they contain. Furthermore, they are organized phylogenetically, which allows for modular and parallel access to arbitrary subclades without fragmentation because of rearrangements that have occurred in other lineages. HAL graphs can be created or read with a comprehensive C++ API. A set of tools is also provided to perform basic operations, such as importing and exporting data, identifying mutations and coordinate mapping (liftover). All documentation and source code for the HAL API and tools are freely available at http://github.com/glennhickey/hal. Contact: hickey@soe.ucsc.edu or haussler@soe.ucsc.edu. Supplementary data are available at Bioinformatics online.

  2. PolyTB: A genomic variation map for Mycobacterium tuberculosis

    PubMed Central

    Coll, Francesc; Preston, Mark; Guerra-Assunção, José Afonso; Hill-Cawthorn, Grant; Harris, David; Perdigão, João; Viveiros, Miguel; Portugal, Isabel; Drobniewski, Francis; Gagneux, Sebastien; Glynn, Judith R.; Pain, Arnab; Parkhill, Julian; McNerney, Ruth; Martin, Nigel; Clark, Taane G.

    2014-01-01

    Summary: Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. PMID:24637013

  3. 16S and 23S plastid rDNA phylogenies of Prototheca species and their auxanographic phenotypes.

    PubMed

    Ewing, Aren; Brubaker, Shane; Somanchi, Aravind; Yu, Esther; Rudenko, George; Reyes, Nina; Espina, Karen; Grossman, Arthur; Franklin, Scott

    2014-08-01

    Because algae have become more accepted as sources of human nutrition, phylogenetic analysis can help resolve the taxonomy of taxa that have not been well studied. This can help establish algal evolutionary relationships. Here, we compare Auxenochlorella protothecoides and 23 strains of Prototheca based on their complete 16S and partial 23S plastid rDNA sequences along with nutrient utilization (auxanographic) profiles. These data demonstrate that some of the species groupings are not in agreement with the molecular phylogenetic analyses and that auxanographic profiles are poor predictors of phylogenetic relationships.

  4. 16S and 23S plastid rDNA phylogenies of Prototheca species and their auxanographic phenotypes

    PubMed Central

    Ewing, Aren; Brubaker, Shane; Somanchi, Aravind; Yu, Esther; Rudenko, George; Reyes, Nina; Espina, Karen; Grossman, Arthur; Franklin, Scott

    2014-01-01

    Because algae have become more accepted as sources of human nutrition, phylogenetic analysis can help resolve the taxonomy of taxa that have not been well studied. This can help establish algal evolutionary relationships. Here, we compare Auxenochlorella protothecoides and 23 strains of Prototheca based on their complete 16S and partial 23S plastid rDNA sequences along with nutrient utilization (auxanographic) profiles. These data demonstrate that some of the species groupings are not in agreement with the molecular phylogenetic analyses and that auxanographic profiles are poor predictors of phylogenetic relationships. PMID:25937672

  5. Metabolic Pathway Assignment of Plant Genes based on Phylogenetic Profiling–A Feasibility Study

    PubMed Central

    Weißenborn, Sandra; Walther, Dirk

    2017-01-01

    Despite the many experimental and computational approaches that have been developed, functional gene annotation remains challenging. With the rapidly growing number of sequenced genomes, the concept of phylogenetic profiling, which predicts functional links between genes that share a common co-occurrence pattern across different genomes, has gained renewed attention as it promises to annotate gene functions based on presence/absence calls alone. We applied phylogenetic profiling to the problem of metabolic pathway assignments of plant genes with a particular focus on secondary metabolism pathways. We determined phylogenetic profiles for 40,960 metabolic pathway enzyme genes with assigned EC numbers from 24 plant species based on sequence and pathway annotation data from KEGG and Ensembl Plants. For gene sequence family assignments, needed to determine the presence or absence of particular gene functions in the given plant species, we included data of all 39 species available at the Ensembl Plants database and established gene families based on pairwise sequence identities and annotation information. Aside from performing profiling comparisons, we used machine learning approaches to predict pathway associations from phylogenetic profiles alone. Selected metabolic pathways were indeed found to be composed of gene families of greater than expected phylogenetic profile similarity. This was particularly evident for primary metabolism pathways, whereas for secondary pathways, both the available annotation in different species as well as the abstraction of functional association via distinct pathways proved limiting. While phylogenetic profile similarity was generally not found to correlate with gene co-expression, direct physical interactions of proteins were reflected by a significantly increased profile similarity, suggesting an application of phylogenetic profiling methods as a filtering step in the identification of protein-protein interactions. This feasibility study highlights the potential and challenges of phylogenetic profiling for detecting functional relationships between genes, the need to enlarge the set of plant genes with proven involvement in secondary metabolism, and the limitations of distinct pathways as abstractions of relationships between genes. PMID:29163570
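
    To make the notion of profile similarity in the abstract above concrete, the sketch below represents each gene family as a presence/absence vector across genomes and ranks family pairs by Jaccard similarity. The Jaccard measure, the five-genome toy profiles, and the names `jaccard`, `profiles` and `ranked` are assumptions for illustration; the abstract does not state which similarity measure the authors used.

```python
from itertools import combinations


def jaccard(p, q):
    """Jaccard similarity of two presence/absence profiles (sequences of 0/1)."""
    both = sum(1 for a, b in zip(p, q) if a and b)
    either = sum(1 for a, b in zip(p, q) if a or b)
    return both / either if either else 0.0


# Hypothetical profiles: one 0/1 call per genome, in a fixed genome order.
profiles = {
    "famA": (1, 1, 0, 1, 0),
    "famB": (1, 1, 0, 1, 1),
    "famC": (0, 0, 1, 0, 1),
}

# Rank all family pairs from most to least similar co-occurrence pattern.
ranked = sorted(
    ((jaccard(profiles[a], profiles[b]), a, b)
     for a, b in combinations(sorted(profiles), 2)),
    reverse=True,
)
```

    Pairs at the top of `ranked` share a co-occurrence pattern across genomes and would be the candidates for a functional link under the profiling hypothesis.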

  6. An attempt to reconstruct phylogenetic relationships within Caribbean nummulitids: simulating relationships and tracing character evolution

    NASA Astrophysics Data System (ADS)

    Eder, Wolfgang; Ives Torres-Silva, Ana; Hohenegger, Johann

    2017-04-01

    Phylogenetic analyses and trees based on molecular data are widely used to infer genetic and biogeographic relationships in recent larger foraminifera. Molecular phylogenetics is used intensively for recent nummulitids; for fossil representatives, however, these trees are of only minor informational value. Hence, in paleontological studies a phylogenetic approach through morphometric analysis is of much higher value. To tackle phylogenetic relationships within the nummulitid family, many more morphological characters must be measured than are commonly used in biometric studies, which mostly examine parameters describing embryonic size (e.g., proloculus diameter, deuteroloculus diameter) and/or the marginal spiral (e.g., spiral diagrams, spiral indices). For this purpose, 11 growth-independent and/or growth-invariant characters were used to describe the morphological variability of equatorial thin sections of seven Caribbean nummulitid taxa (Nummulites striatoreticulatus, N. macgillavry, Palaeonummulites willcoxi, P. floridensis, P. soldadensis, P. trinitatensis and P. ocalanus) and one outgroup taxon (Ranikothalia bermudezi). Using these characters, phylogenetic trees were calculated with a restricted maximum likelihood (REML) algorithm, and the results were cross-checked by ordination and cluster analysis. Squared-change parsimony was used to reconstruct ancestral states and to simulate the evolution of the chosen characters along the calculated phylogenetic tree, and independent-contrast analysis was used to estimate confidence intervals. Based on these simulations, phylogenetic tendencies proposed for certain nummulitid characters (e.g., Cope's rule or nepionic acceleration) can be tested to determine whether they hold for the whole family or only for certain clades. At least within the Caribbean nummulitids, phylogenetic trends are evident in some growth-independent characters of the embryo (e.g., first chamber length and P/D ratio) and some growth-invariant characters of the chamber sequence (e.g., backbend angle, initial chamber base length and chamber length increase).

  7. Phylogenetic and microsatellite markers for Tulasnella (Tulasnellaceae) mycorrhizal fungi associated with Australian orchids

    PubMed Central

    Ruibal, Monica P.; Peakall, Rod; Smith, Leon M.; Linde, Celeste C.

    2013-01-01

    • Premise of the study: Phylogenetic and microsatellite markers were developed for Tulasnella mycorrhizal fungi to investigate fungal species identity and diversity. These markers will be useful in future studies investigating the phylogenetic relationship of the fungal symbionts, specificity of orchid–mycorrhizal associations, and the role of mycorrhizae in orchid speciation within several orchid genera. • Methods and Results: We generated partial genome sequences of two Tulasnella symbionts originating from Chiloglottis and Drakaea orchid species with 454 genome sequencing. Cross-genus transferability across mycorrhizal symbionts associated with multiple genera of Australian orchids (Arthrochilus, Chiloglottis, Drakaea, and Paracaleana) was found for seven phylogenetic loci. Five loci showed cross-transferability to Tulasnella from other orchid genera, and two to Sebacina. Furthermore, 11 polymorphic microsatellite loci were developed for Tulasnella from Chiloglottis. • Conclusions: Highly informative markers were obtained, allowing investigation of mycorrhizal diversity of Tulasnellaceae associated with a wide variety of terrestrial orchids in Australia and potentially worldwide. PMID:25202528

  8. Phylogenetic studies of transmission dynamics in generalized HIV epidemics: An essential tool where the burden is greatest?

    PubMed Central

    Dennis, Ann M.; Herbeck, Joshua T.; Brown, Andrew Leigh; Kellam, Paul; de Oliveira, Tulio; Pillay, Deenan; Fraser, Christophe; Cohen, Myron S.

    2014-01-01

    Efficient and effective HIV prevention measures for generalized epidemics in sub-Saharan Africa have not yet been validated at the population-level. Design and impact evaluation of such measures requires fine-scale understanding of local HIV transmission dynamics. The novel tools of HIV phylogenetics and molecular epidemiology may elucidate these transmission dynamics. Such methods have been incorporated into studies of concentrated HIV epidemics to identify proximate and determinant traits associated with ongoing transmission. However, applying similar phylogenetic analyses to generalized epidemics, including the design and evaluation of prevention trials, presents additional challenges. Here we review the scope of these methods and present examples of their use in concentrated epidemics in the context of prevention. Next, we describe the current uses for phylogenetics in generalized epidemics, and discuss their promise for elucidating transmission patterns and informing prevention trials. Finally, we review logistic and technical challenges inherent to large-scale molecular epidemiological studies of generalized epidemics, and suggest potential solutions. PMID:24977473

  9. Complete, accurate, mammalian phylogenies aid conservation planning, but not much

    PubMed Central

    Rodrigues, Ana S. L.; Grenyer, Richard; Baillie, Jonathan E. M.; Bininda-Emonds, Olaf R. P.; Gittlemann, John L.; Hoffmann, Michael; Safi, Kamran; Schipper, Jan; Stuart, Simon N.; Brooks, Thomas

    2011-01-01

    In the face of unprecedented global biodiversity loss, conservation planning must balance between refining and deepening knowledge versus acting on current information to preserve species and communities. Phylogenetic diversity (PD), a biodiversity measure that takes into account the evolutionary relationships between species, is arguably a more meaningful measure of biodiversity than species diversity, but cannot yet be applied to conservation planning for the majority of taxa for which phylogenetic trees have not yet been developed. Here, we investigate how the quality of data on the taxonomy and/or phylogeny of species affects the results of spatial conservation planning in terms of the representation of overall mammalian PD. The results show that the better the quality of the biodiversity data the better they can serve as a basis for conservation planning. However, decisions based on incomplete data are remarkably robust across different levels of degrading quality concerning the description of new species and the availability of phylogenetic information. Thus, given the level of urgency and the need for action, conservation planning can safely make use of the best available systematic data, limited as these data may be. PMID:21844044
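
    Phylogenetic diversity, as used in the abstract above, is commonly computed as the total branch length of the subtree connecting a set of species to the root. The sketch below shows that calculation on a small rooted tree. The rooted definition, the toy tree, and the names `faith_pd`, `parent` and `length` are assumptions for illustration; the abstract does not specify the exact PD variant used in the study.

```python
def faith_pd(parent, length, species):
    """Sum the lengths of all branches on the paths from the chosen
    species up to the root, counting each shared branch once."""
    seen = set()
    for node in species:
        # Climb until the root or a branch already counted for another species.
        while parent[node] is not None and node not in seen:
            seen.add(node)
            node = parent[node]
    return sum(length[n] for n in seen)


# Hypothetical toy tree: ((B:1, C:1)X:2, A:3)root
parent = {"root": None, "X": "root", "A": "root", "B": "X", "C": "X"}
length = {"X": 2.0, "A": 3.0, "B": 1.0, "C": 1.0}

pd_sisters = faith_pd(parent, length, ["B", "C"])   # close relatives
pd_distant = faith_pd(parent, length, ["A", "B"])   # distant relatives
pd_all = faith_pd(parent, length, ["A", "B", "C"])
```

    Two species give different PD values depending on how much history they share, which is the sense in which PD carries more information than a simple species count.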

  10. The Cladophora complex (Chlorophyta): new views based on 18S rRNA gene sequences.

    PubMed

    Bakker, F T; Olsen, J L; Stam, W T; van den Hoek, C

    1994-12-01

    Evolutionary relationships among species traditionally ascribed to the Siphonocladales/Cladophorales have remained unclear due to a lack of phylogenetically informative characters and extensive morphological plasticity resulting in morphological convergence. This study explores some of the diversity within the generic complex Cladophora and its siphonocladalean allies. Twelve species of Cladophora representing 6 of the 11 morphological sections recognized by van den Hoek were analyzed along with 8 siphonocladalean species using 18S rRNA gene sequences. The final alignment consisted of 1460 positions containing 92 phylogenetically informative substitutions. Weighting schemes (EOR weighting, combinatorial weighting) were applied in maximum parsimony analysis to correct for substitution bias. Stem characters were weighted 0.66 relative to single-stranded characters to correct for secondary structural constraints. Both weighting approaches resulted in greater phylogenetic resolution. Results confirm that there is no basis for the independent recognition of the Cladophorales and Siphonocladales. The Siphonocladales is polyphyletic, and Cladophora is paraphyletic. All analyses support two principal lineages, of which one contains predominantly tropical members including almost all siphonocladalean taxa, while the other lineage consists of mostly warm- to cold-temperate species of Cladophora.

  11. Description of the first cryptic avian malaria parasite, Plasmodium homocircumflexum n. sp., with experimental data on its virulence and development in avian hosts and mosquitoes.

    PubMed

    Palinauskas, Vaidas; Žiegytė, Rita; Ilgūnas, Mikas; Iezhova, Tatjana A; Bernotienė, Rasa; Bolshakov, Casimir; Valkiūnas, Gediminas

    2015-01-01

    For over 100 years studies on avian haemosporidian parasite species have relied on similarities in their morphology to establish a species concept. Some exceptional cases have also included information about the life cycle and sporogonic development. More than 50 avian Plasmodium spp. have now been described. However, PCR-based studies show a much broader diversity of haemosporidian parasites, indicating the possible existence of a diverse group of cryptic species. In the present study, using both similarity and phylogenetic species definition concepts, we believe that we report the first characterised cryptic speciation case of an avian Plasmodium parasite. We used sequence information on the mitochondrial cytochrome b gene and constructed phylogenies of identified Plasmodium spp. to define their position in the phylogenetic tree. After analysis of blood stages, the morphology of the parasite was shown to be identical to Plasmodium circumflexum. However, the geographic distribution of the new parasite, the phylogenetic information, as well as patterns of development of infection, indicate that this parasite differs from P. circumflexum. Plasmodium homocircumflexum n. sp. was described based on information about genetic differences from described lineages, phylogenetic position and biological characters. This parasite develops parasitemia in experimentally infected birds - the domestic canary Serinus canaria domestica, siskin Carduelis spinus and crossbill Loxia curvirostra. Anaemia caused by high parasitemia, as well as cerebral paralysis caused by exoerythrocytic stages in the brain, are the main reasons for mortality. Exoerythrocytic stages also form in other organs (heart, kidneys, liver, lungs, spleen, intestines and pectoral muscles). DNA amplification was unsuccessful from faecal samples of heavily infected birds. The sporogonic development initiates, but is abortive, at the oocyst stage in two common European mosquito species, Culex pipiens pipiens (forms pipiens and molestus) and Aedes vexans. Vectors of this Plasmodium sp. remain unknown. Copyright © 2014 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.

  12. Detecting Network Communities: An Application to Phylogenetic Analysis

    PubMed Central

    Andrade, Roberto F. S.; Rocha-Neto, Ivan C.; Santos, Leonardo B. L.; de Santana, Charles N.; Diniz, Marcelo V. C.; Lobão, Thierry Petit; Goés-Neto, Aristóteles; Pinho, Suani T. R.; El-Hani, Charbel N.

    2011-01-01

    This paper proposes a new method to identify communities in generally weighted complex networks and applies it to phylogenetic analysis. In this case, weights correspond to the similarity indexes among protein sequences, which can be used for network construction so that the network structure can be analyzed to recover phylogenetically useful information from its properties. The analyses discussed here are mainly based on the modular character of protein similarity networks, explored through the Newman-Girvan algorithm, with the help of the neighborhood matrix. The most relevant networks are found when the network topology changes abruptly revealing distinct modules related to the sets of organisms to which the proteins belong. Sound biological information can be retrieved by the computational routines used in the network approach, without using biological assumptions other than those incorporated by BLAST. Usually, all the main bacterial phyla and, in some cases, also some bacterial classes corresponded totally (100%) or to a great extent (>70%) to the modules. We checked for internal consistency in the obtained results, and we scored close to 84% of matches for community pertinence when comparisons between the results were performed. To illustrate how to use the network-based method, we employed data for enzymes involved in the chitin metabolic pathway that are present in more than 100 organisms from an original data set containing 1,695 organisms, downloaded from GenBank on May 19, 2007. A preliminary comparison between the outcomes of the network-based method and the results of methods based on Bayesian, distance, likelihood, and parsimony criteria suggests that the former is as reliable as these commonly used methods. We conclude that the network-based method can be used as a powerful tool for retrieving modularity information from weighted networks, which is useful for phylogenetic analysis. PMID:21573202
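
    The Newman-Girvan idea named in the abstract can be sketched as follows: repeatedly remove the edge with the highest betweenness until the network falls apart into modules. This is a minimal illustration on an unweighted graph, assuming the similarity network has already been thresholded into plain edges; it does not reproduce the authors' neighborhood-matrix procedure, and the protein labels are hypothetical.

```python
from collections import deque


def edge_betweenness(adj):
    """Brandes-style edge betweenness for an unweighted, undirected graph."""
    bet = {}
    for s in adj:
        # Breadth-first search from s, counting shortest paths (sigma).
        dist = {s: 0}
        sigma = {v: 0 for v in adj}
        sigma[s] = 1
        preds = {v: [] for v in adj}
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Push path dependencies back from the farthest nodes towards s.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                share = sigma[v] / sigma[w] * (1.0 + delta[w])
                edge = frozenset((v, w))
                bet[edge] = bet.get(edge, 0.0) + share
                delta[v] += share
    return bet


def components(adj):
    """Connected components as a list of node sets."""
    seen, comps = set(), []
    for start in adj:
        if start in seen:
            continue
        comp, stack = set(), [start]
        while stack:
            v = stack.pop()
            if v not in comp:
                comp.add(v)
                stack.extend(adj[v] - comp)
        seen |= comp
        comps.append(comp)
    return comps


def girvan_newman_split(edges):
    """Remove highest-betweenness edges until one more component appears."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    start = len(components(adj))
    while len(components(adj)) == start:
        bet = edge_betweenness(adj)
        a, b = max(bet, key=bet.get)
        adj[a].discard(b)
        adj[b].discard(a)
    return components(adj)


# Two tightly linked groups of hypothetical proteins joined by one weak link.
EDGES = [("p1", "p2"), ("p2", "p3"), ("p1", "p3"), ("p3", "q1"),
         ("q1", "q2"), ("q2", "q3"), ("q1", "q3")]
print(sorted(sorted(c) for c in girvan_newman_split(EDGES)))
```

    The bridge between the two triangles carries every shortest path between the groups, so it is removed first and the two modules separate.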

  13. Homoplastic microinversions and the avian tree of life

    PubMed Central

    2011-01-01

    Background: Microinversions are cytologically undetectable inversions of DNA sequences that accumulate slowly in genomes. Like many other rare genomic changes (RGCs), microinversions are thought to be virtually homoplasy-free evolutionary characters, suggesting that they may be very useful for difficult phylogenetic problems such as the avian tree of life. However, few detailed surveys of these genomic rearrangements have been conducted, making it difficult to assess this hypothesis or understand the impact of microinversions upon genome evolution. Results: We surveyed non-coding sequence data from a recent avian phylogenetic study and found substantially more microinversions than expected based upon prior information about vertebrate inversion rates, although this is likely due to underestimation of these rates in previous studies. Most microinversions were lineage-specific or united well-accepted groups. However, some homoplastic microinversions were evident among the informative characters. Hemiplasy, which reflects differences between gene trees and the species tree, did not explain the observed homoplasy. Two specific loci were microinversion hotspots, with high numbers of inversions that included both the homoplastic as well as some overlapping microinversions. Neither stem-loop structures nor detectable sequence motifs were associated with microinversions in the hotspots. Conclusions: Microinversions can provide valuable phylogenetic information, although power analysis indicates that large amounts of sequence data will be necessary to identify enough inversions (and similar RGCs) to resolve short branches in the tree of life. Moreover, microinversions are not perfect characters and should be interpreted with caution, just as with any other character type. Independent of their use for phylogenetic analyses, microinversions are important because they have the potential to complicate alignment of non-coding sequences. Despite their low rate of accumulation, they have clearly contributed to genome evolution, suggesting that active identification of microinversions will prove useful in future phylogenomic studies. PMID:21612607

  14. Bayes Factors Unmask Highly Variable Information Content, Bias, and Extreme Influence in Phylogenomic Analyses.

    PubMed

    Brown, Jeremy M; Thomson, Robert C

    2017-07-01

    As the application of genomic data in phylogenetics has become routine, a number of cases have arisen where alternative data sets strongly support conflicting conclusions. This sensitivity to analytical decisions has prevented firm resolution of some of the most recalcitrant nodes in the tree of life. To better understand the causes and nature of this sensitivity, we analyzed several phylogenomic data sets using an alternative measure of topological support (the Bayes factor) that both demonstrates and averts several limitations of more frequently employed support measures (such as Markov chain Monte Carlo estimates of posterior probabilities). Bayes factors reveal important, previously hidden, differences across six "phylogenomic" data sets collected to resolve the phylogenetic placement of turtles within Amniota. These data sets vary substantially in their support for well-established amniote relationships, particularly in the proportion of genes that contain extreme amounts of information as well as the proportion that strongly reject these uncontroversial relationships. All six data sets contain little information to resolve the phylogenetic placement of turtles relative to other amniotes. Bayes factors also reveal that a very small number of extremely influential genes (less than 1% of genes in a data set) can fundamentally change significant phylogenetic conclusions. In one example, these genes are shown to contain previously unrecognized paralogs. This study demonstrates both that the resolution of difficult phylogenomic problems remains sensitive to seemingly minor analysis details and that Bayes factors are a valuable tool for identifying and solving these challenges. © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved.
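
    A Bayes factor comparing two topological hypotheses is the ratio of their marginal likelihoods, usually reported on the 2 ln BF scale. The sketch below shows how per-gene values might be screened for outliers; the gene names and log marginal likelihoods are invented, and the cut-off of 10 is a widely quoted rule of thumb for very strong support, assumed here rather than taken from the study.

```python
# Hypothetical per-gene log marginal likelihoods under two competing topologies,
# stored as gene -> (ln ML under topology 1, ln ML under topology 2).
GENES = {
    "gene_a": (-1000.0, -1001.0),
    "gene_b": (-2500.0, -2440.0),
    "gene_c": (-800.0, -800.5),
}


def two_ln_bf(log_ml_t1, log_ml_t2):
    """2 ln BF in favour of topology 1 over topology 2."""
    return 2.0 * (log_ml_t1 - log_ml_t2)


def influential_genes(genes, threshold=10.0):
    """Genes whose |2 ln BF| exceeds the threshold, in either direction."""
    return sorted(
        name
        for name, (l1, l2) in genes.items()
        if abs(two_ln_bf(l1, l2)) > threshold
    )


for name, (l1, l2) in GENES.items():
    print(name, two_ln_bf(l1, l2))
print(influential_genes(GENES))  # ['gene_b']
```

    Here a single gene contributes far more to the total than the other two combined, which is the kind of pattern the abstract describes as extreme influence.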

  15. The space of ultrametric phylogenetic trees.

    PubMed

    Gavryushkin, Alex; Drummond, Alexei J

    2016-08-21

    The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. In particular, the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.

  16. The Evolution of Tyrosine-Recombinase Elements in Nematoda

    PubMed Central

    Szitenberg, Amir; Koutsovoulos, Georgios; Blaxter, Mark L.; Lunt, David H.

    2014-01-01

    Transposable elements can be categorised into DNA and RNA elements based on their mechanism of transposition. Tyrosine recombinase elements (YREs) are relatively rare and poorly understood, despite sharing characteristics with both DNA and RNA elements. Previously, the Nematoda have been reported to have a substantially different diversity of YREs compared to other animal phyla: the Dirs1-like YRE retrotransposon was encountered in most animal phyla but not in Nematoda, and a unique Pat1-like YRE retrotransposon has only been recorded from Nematoda. We explored the diversity of YREs in Nematoda by sampling broadly across the phylum and including 34 genomes representing the three classes within Nematoda. We developed a method to isolate and classify YREs based on both feature organization and phylogenetic relationships in an open and reproducible workflow. We also ensured that our phylogenetic approach to YRE classification identified truncated and degenerate elements, informatively increasing the number of elements sampled. We identified Dirs1-like elements (thought to be absent from Nematoda) in the nematode classes Enoplia and Dorylaimia indicating that nematode model species do not adequately represent the diversity of transposable elements in the phylum. Nematode Pat1-like elements were found to be a derived form of another Pat1-like element that is present more widely in animals. Several sequence features used widely for the classification of YREs were found to be homoplasious, highlighting the need for a phylogenetically-based classification scheme. Nematode model species do not represent the diversity of transposable elements in the phylum. PMID:25197791

  17. JAK and STAT members in channel catfish: Identification, phylogenetic analysis and expression profiling after Edwardsiella ictaluri infection.

    PubMed

    Jin, Yulin; Zhou, Tao; Li, Ning; Liu, Shikai; Xu, Xiaoyan; Pan, Ying; Tan, Suxu; Shi, Huitong; Yang, Yujia; Yuan, Zihao; Wang, Wenwen; Luo, Jian; Gao, Dongya; Dunham, Rex; Liu, Zhanjiang

    2018-04-01

    The Janus kinase/signal transducers and activators of transcription (JAK/STAT) signaling pathway is one of the main pleiotropic cascades used to transmit information from extracellular receptors to the nucleus, which results in DNA transcription and expression of genes involved in immunity, proliferation, differentiation, migration, apoptosis, and cell survival. Members of the JAK and STAT families have been extensively studied in different mammalian species because of their important roles in innate and adaptive immune responses. However, they have not been systematically studied among teleost fish species. In this study, five JAK family members and eight STAT family members were identified and characterized from channel catfish. Phylogenetic analysis was conducted to properly annotate these genes. Syntenic analysis was also conducted to establish orthology, and confirm the results from phylogenetic analysis. Compared to mammals, more members of the JAK and STAT families were identified in the channel catfish genome. Expression of JAK and STAT family members was detected in healthy catfish tissues, but was induced in gill, liver, and intestine after bacterial challenge. Notably, the significant upregulation of the STAT1b gene in catfish liver, gill and intestine after Edwardsiella ictaluri infection supported the notion that high STAT1 expression is involved in defense against pathogens. Collectively, the increased expression of JAK and STAT members in tested tissues suggested their crucial function in defending the host against pathogen invasion. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Phylogenetic analyses of cyclidiids (Protista, Ciliophora, Scuticociliatia) based on multiple genes suggest their close relationship with thigmotrichids.

    PubMed

    Gao, Feng; Gao, Shan; Wang, Pu; Katz, Laura A; Song, Weibo

    2014-06-01

    Cyclidiids and thigmotrichids are two diverse groups of scuticociliates, a diverse clade of ciliates that is often difficult to investigate due to the small size and conserved morphology among its members. Compared to other groups (e.g. hypotrichs and oligotrichs), the scuticociliates have received relatively little attention and their phylogenetic relationships are largely unresolved. To contribute to our understanding of their evolutionary history, we characterized 26 sequences for three linked genes (SSU-rDNA, 5.8S and LSU-rDNA) from 14 isolates of cyclidiids and thigmotrichids. Phylogenetic analyses reveal the following: (1) traditional cyclidiids are associated with thigmotrichs rather than pleuronematids as expected; (2) the validity of the newly-reported genus Falcicyclidium is confirmed by the molecular data and we suggest transferring this genus to the family Ctedoctematidae; (3) both the genera Cyclidium and Protocyclidium are not monophyletic and the separation of Protocyclidium from Cyclidium is not supported; (4) the genus Cristigera is a well supported monophyletic group and may stand for a new family; (5) according to both morphological and molecular information, Cyclidium plouneouri Dragesco, 1963 should be assigned to the genus Falcicyclidium and thus a new combination is suggested: Falcicyclidium plouneouri (Dragesco, 1963) n. comb.; and (6) based on the data available, a new genus is suggested: Acucyclidium gen. nov. with the type species, Acucyclidium atractodes (Fan et al., 2011a) n. comb. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. The evolution of tyrosine-recombinase elements in Nematoda.

    PubMed

    Szitenberg, Amir; Koutsovoulos, Georgios; Blaxter, Mark L; Lunt, David H

    2014-01-01

    Transposable elements can be categorised into DNA and RNA elements based on their mechanism of transposition. Tyrosine recombinase elements (YREs) are relatively rare and poorly understood, despite sharing characteristics with both DNA and RNA elements. Previously, the Nematoda have been reported to have a substantially different diversity of YREs compared to other animal phyla: the Dirs1-like YRE retrotransposon was encountered in most animal phyla but not in Nematoda, and a unique Pat1-like YRE retrotransposon has only been recorded from Nematoda. We explored the diversity of YREs in Nematoda by sampling broadly across the phylum and including 34 genomes representing the three classes within Nematoda. We developed a method to isolate and classify YREs based on both feature organization and phylogenetic relationships in an open and reproducible workflow. We also ensured that our phylogenetic approach to YRE classification identified truncated and degenerate elements, informatively increasing the number of elements sampled. We identified Dirs1-like elements (thought to be absent from Nematoda) in the nematode classes Enoplia and Dorylaimia indicating that nematode model species do not adequately represent the diversity of transposable elements in the phylum. Nematode Pat1-like elements were found to be a derived form of another Pat1-like element that is present more widely in animals. Several sequence features used widely for the classification of YREs were found to be homoplasious, highlighting the need for a phylogenetically-based classification scheme. Nematode model species do not represent the diversity of transposable elements in the phylum.

  20. What constitutes an Arabian Helicobacter pylori? Lessons from comparative genomics.

    PubMed

    Kumar, Narender; Albert, M John; Al Abkal, Hanan; Siddique, Iqbal; Ahmed, Niyaz

    2017-02-01

    Helicobacter pylori, the human gastric pathogen, causes a variety of gastric diseases ranging from mild gastritis to gastric cancer. While the studies on H. pylori are dominated by those based on either East Asian or Western strains, information regarding H. pylori strains prevalent in the Middle East remains scarce. Therefore, we carried out whole-genome sequencing and comparative analysis of three H. pylori strains isolated from three native Arab, Kuwaiti patients. H. pylori strains were sequenced using Illumina platform. The sequence reads were filtered and draft genomes were assembled and annotated. Various pathogenicity-associated regions and phages present within the genomes were identified. Phylogenetic analysis was carried out to determine the genetic relatedness of Kuwaiti strains to various lineages of H. pylori. The core genome content and virulence-related genes were analyzed to assess the pathogenic potential. The three genomes clustered along with HpEurope strains in the phylogenetic tree comprising various H. pylori lineages. A total of 1187 genes spread among various functional classes were identified in the core genome analysis. The three genomes possessed a complete cagPAI and also retained most of the known outer membrane proteins as well as virulence-related genes. The cagA gene in all three strains consisted of an AB-C type EPIYA motif. The comparative genomic analysis of Kuwaiti H. pylori strains revealed a European ancestry and a high pathogenic potential. © 2016 John Wiley & Sons Ltd.

  1. Characterization of the complete mitogenomes of two Neoscona spiders (Araneae: Araneidae) and its phylogenetic implications.

    PubMed

    Wang, Zheng-Liang; Li, Chao; Fang, Wen-Yuan; Yu, Xiao-Ping

    2016-09-30

    The complete mitogenomes of two orb-weaving spiders Neoscona doenitzi and Neoscona nautica were determined and a comparative mitogenomic analysis was performed to depict evolutionary trends of spider mitogenomes. The circular mitogenomes are 14,161 bp with A+T content of 74.6% in N. doenitzi and 14,049 bp with A+T content of 78.8% in N. nautica, respectively. Both mitogenomes contain a standard set of 37 genes typically present in metazoans. Gene content and orientation are identical to all previously sequenced spider mitogenomes, while gene order is rearranged by tRNA translocation when compared with the putative ancestral gene arrangement pattern presented by Limulus polyphemus. A comparative mitogenomic analysis reveals that the nucleotide composition bias is clearly divergent between spiders in the suborders Opisthothelae and Mesothelae. The loss of the D-arm in the trnS(UCN) among all Opisthothelae spiders strongly suggests that this common feature is a synapomorphy for the entire suborder Opisthothelae. Moreover, the trnS(AGN) in araneoids preferentially used TCT as an anticodon rather than the typical anticodon GCT. Phylogenetic analysis based on the 13 protein-coding gene sequences consistently yields trees that nest the two Neoscona spiders within Araneidae and recover superfamily Araneoidea as a monophyletic group. The molecular information acquired from the results of this study should be very useful for future research on mitogenomic evolution and genetic diversities in spiders. Copyright © 2016 Elsevier B.V. All rights reserved.

  2. [The gastrulation in Cnidaria: A key to understanding phylogeny or the chaos of secondary modifications?].

    PubMed

    Kraus, Yu A; Markov, A V

    2016-01-01

    The data revealed by comparative embryology of the basal (diploblastic) metazoans are traditionally considered a valuable potential source of information on the origin and early evolution of the animal kingdom and its major clades. Special attention is paid to the fundamental morphogenetic process of gastrulation during which the cells of the early embryo differentiate into the germ layers and the primary body plan is formed. Comparative analysis of gastrulation in different cnidarian taxa reveals a high level of intergroup, intragroup, and individual variation. With few exceptions, there is no robust correlation between the type of gastrulation and the taxon. Current data do not support the idea that morphogenetic processes underlying cnidarian gastrulation can be divided into several distinct types. Rather, there is a continuum of equifinal ontogenetic trajectories. In cnidarians, the mode of gastrulation apparently depends less on the macroevolutionary history of the species than on various evolutionarily plastic features, such as the oocyte size, the amount of yolk, the number of cells at the blastula (or morula) stage, the presence of phototrophic symbionts, or the ecology of the larva. Thus, in cnidarians, the morphogenetic basis of gastrulation contains only a very weak phylogenetic signal and can have only limited application in phylogenetic reconstructions. On the other hand, comparative studies of the ontogeny of the basal metazoans shed light on the general rules of the evolution of morphogenetic processes, which is crucial for understanding the early history of the animal kingdom.

  3. A phylogenetic perspective on species diversity, β-diversity and biogeography for the microbial world.

    PubMed

    Barberán, Albert; Casamayor, Emilio O

    2014-12-01

    There is increasing interest in combining phylogenetic data with distributional and ecological records to assess how natural communities are arranged from an evolutionary perspective. In the microbial world, there is also a need to go beyond the problematic species definition to deeply explore ecological patterns using genetic data. We explored links between evolution/phylogeny and community ecology using bacterial 16S rRNA gene information from a high-altitude lakes district data set. We described phylogenetic community composition, spatial distribution, and β-diversity and biogeographical patterns applying evolutionary relatedness without relying on any particular operational taxonomic unit definition. High-altitude lakes districts usually contain a large mosaic of highly diverse small water bodies and form a fine biogeographical model of spatially close but environmentally heterogeneous ecosystems. We sampled 18 lakes in the Pyrenees with selection criteria focused on capturing the maximum environmental variation within the smallest geographical area. The results showed highly diverse communities nonrandomly distributed with phylogenetic β-diversity patterns mainly shaped by the environment and not by the spatial distance. Community similarity based on both bacterial taxonomic composition and phylogenetic β-diversity shared similar patterns and was primarily structured by similar environmental drivers. We observed a positive relationship between lake area and phylogenetic diversity with a slope consistent with highly dispersive planktonic organisms. The phylogenetic approach incorporated patterns of common ancestry into bacterial community analysis and emerged as a very convenient analytical tool for direct inter- and intrabiome biodiversity comparisons and sorting out microbial habitats with potential application in conservation studies. © 2014 John Wiley & Sons Ltd.

  4. A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.

    PubMed

    Rajan, Vaibhav

    2013-03-01

    Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, automated methods are necessary to handle the much larger data sets being prepared today. In this study, we introduce the concept of subsplits and demonstrate their use in extracting phylogenetic signal from alignments. We design a clustering approach for alignment masking where each cluster contains similar columns, with similarity defined on the basis of compatible subsplits; our approach then identifies noisy clusters and eliminates them. Trees inferred from the columns in the retained clusters are found to be topologically closer to the reference trees. We test our method on numerous standard benchmarks (both synthetic and biological data sets) and compare its performance with other methods of alignment masking. We find that our method can eliminate sites more accurately than other methods, particularly on divergent data, and can improve the topologies of the inferred trees in likelihood-based analyses. Software available upon request from the author.

  5. Synthesis of phylogeny and taxonomy into a comprehensive tree of life

    PubMed Central

    Hinchliff, Cody E.; Smith, Stephen A.; Allman, James F.; Burleigh, J. Gordon; Chaudhary, Ruchi; Coghill, Lyndon M.; Crandall, Keith A.; Deng, Jiabin; Drew, Bryan T.; Gazis, Romina; Gude, Karl; Hibbett, David S.; Katz, Laura A.; Laughinghouse, H. Dail; McTavish, Emily Jane; Midford, Peter E.; Owen, Christopher L.; Ree, Richard H.; Rees, Jonathan A.; Soltis, Douglas E.; Williams, Tiffani; Cranston, Karen A.

    2015-01-01

    Reconstructing the phylogenetic relationships that unite all lineages (the tree of life) is a grand challenge. The paucity of homologous character data across disparately related lineages currently renders direct phylogenetic inference untenable. To reconstruct a comprehensive tree of life, we therefore synthesized published phylogenies, together with taxonomic classifications for taxa never incorporated into a phylogeny. We present a draft tree containing 2.3 million tips—the Open Tree of Life. Realization of this tree required the assembly of two additional community resources: (i) a comprehensive global reference taxonomy and (ii) a database of published phylogenetic trees mapped to this taxonomy. Our open source framework facilitates community comment and contribution, enabling the tree to be continuously updated when new phylogenetic and taxonomic data become digitally available. Although data coverage and phylogenetic conflict across the Open Tree of Life illuminate gaps in both the underlying data available for phylogenetic reconstruction and the publication of trees as digital objects, the tree provides a compelling starting point for community contribution. This comprehensive tree will fuel fundamental research on the nature of biological diversity, ultimately providing up-to-date phylogenies for downstream applications in comparative biology, ecology, conservation biology, climate change, agriculture, and genomics. PMID:26385966

  6. A new phylogenetic diversity measure generalizing the Shannon index and its application to phyllostomid bats.

    PubMed

    Allen, Benjamin; Kon, Mark; Bar-Yam, Yaneer

    2009-08-01

    Protecting biodiversity involves preserving the maximum number and abundance of species while giving special attention to species with unique genetic or morphological characteristics. In balancing different priorities, conservation policymakers may consider quantitative measures that compare diversity across ecological communities. To serve this purpose, a measure should increase or decrease with changes in community composition in a way that reflects what is valued, including species richness, evenness, and distinctness. However, counterintuitively, studies have shown that established indices, including those that emphasize average interspecies phylogenetic distance, may increase with the elimination of species. We introduce a new diversity index, the phylogenetic entropy, which generalizes in a natural way the Shannon index to incorporate species relatedness. Phylogenetic entropy favors communities in which highly distinct species are more abundant, but it does not advocate decreasing any species proportion below a community structure-dependent threshold. We contrast the behavior of multiple indices on a community of phyllostomid bats in the Selva Lacandona. The optimal genus distribution for phylogenetic entropy populates all genera in a linear relationship to their total phylogenetic distance to other genera. Two other indices favor eliminating 12 out of the 23 genera.
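
    To make the generalization concrete: the Shannon index is H = -sum(p_i ln p_i) over species proportions p_i. The sketch below assumes a branch-weighted form, H_P = -sum over branches b of length(b) * p(b) * ln p(b), where p(b) is the total proportion of individuals descending from branch b. This reading of the phylogenetic entropy is an assumption, and the toy trees and abundances are invented for illustration.

```python
import math


def shannon(proportions):
    """Shannon index: -sum(p ln p) over species proportions."""
    return -sum(p * math.log(p) for p in proportions if p > 0)


def phylogenetic_entropy(tree, abundance):
    """Branch-weighted entropy (assumed form).

    tree maps child -> (parent, branch length); abundance maps each tip to
    its proportion of the community.
    """
    below = {}
    for tip, p in abundance.items():
        node = tip
        # Add this tip's proportion to every branch between it and the root.
        while node in tree:
            below[node] = below.get(node, 0.0) + p
            node = tree[node][0]
    return -sum(tree[b][1] * p * math.log(p) for b, p in below.items() if p > 0)


# On a star tree with unit branches the two measures coincide.
STAR = {"x": ("root", 1.0), "y": ("root", 1.0)}
print(shannon([0.5, 0.5]), phylogenetic_entropy(STAR, {"x": 0.5, "y": 0.5}))

# On a nested tree, long and deep branches add weight to distinct lineages.
NESTED = {"a": ("n", 1.0), "b": ("n", 1.0), "n": ("root", 2.0), "c": ("root", 3.0)}
print(phylogenetic_entropy(NESTED, {"a": 0.25, "b": 0.25, "c": 0.5}))
```

    The star-tree case shows why this form counts as a generalization: when all species are equally unrelated, it reduces to the ordinary Shannon index.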

  7. Synthesis of phylogeny and taxonomy into a comprehensive tree of life.

    PubMed

    Hinchliff, Cody E; Smith, Stephen A; Allman, James F; Burleigh, J Gordon; Chaudhary, Ruchi; Coghill, Lyndon M; Crandall, Keith A; Deng, Jiabin; Drew, Bryan T; Gazis, Romina; Gude, Karl; Hibbett, David S; Katz, Laura A; Laughinghouse, H Dail; McTavish, Emily Jane; Midford, Peter E; Owen, Christopher L; Ree, Richard H; Rees, Jonathan A; Soltis, Douglas E; Williams, Tiffani; Cranston, Karen A

    2015-10-13

    Reconstructing the phylogenetic relationships that unite all lineages (the tree of life) is a grand challenge. The paucity of homologous character data across disparately related lineages currently renders direct phylogenetic inference untenable. To reconstruct a comprehensive tree of life, we therefore synthesized published phylogenies, together with taxonomic classifications for taxa never incorporated into a phylogeny. We present a draft tree containing 2.3 million tips, the Open Tree of Life. Realization of this tree required the assembly of two additional community resources: (i) a comprehensive global reference taxonomy and (ii) a database of published phylogenetic trees mapped to this taxonomy. Our open source framework facilitates community comment and contribution, enabling the tree to be continuously updated when new phylogenetic and taxonomic data become digitally available. Although data coverage and phylogenetic conflict across the Open Tree of Life illuminate gaps in both the underlying data available for phylogenetic reconstruction and the publication of trees as digital objects, the tree provides a compelling starting point for community contribution. This comprehensive tree will fuel fundamental research on the nature of biological diversity, ultimately providing up-to-date phylogenies for downstream applications in comparative biology, ecology, conservation biology, climate change, agriculture, and genomics.

  8. The phylogenetic position of the Critically Endangered Saint Croix ground lizard Ameiva polops: revisiting molecular systematics of West Indian Ameiva.

    PubMed

    Hurtado, Luis A; Santamaria, Carlos A; Fitzgerald, Lee A

    2014-05-06

    The phylogenetic position of the critically endangered Saint Croix ground lizard Ameiva polops is presently unknown and several hypotheses have been proposed. We investigated the phylogenetic position of this species using molecular phylogenetic methods. We obtained sequences of DNA fragments of the mitochondrial ribosomal genes 12S rDNA and 16S rDNA for this species. We aligned these sequences with published sequences of other Ameiva species, which include most of the Ameiva species from the West Indies, three Ameiva species from Central America and South America, and one from the teiid lizard Tupinambis teguixin, which was used as outgroup. We conducted Maximum Likelihood and Bayesian phylogenetic analyses. The phylogenetic reconstructions among the different methods were very similar, supporting the monophyly of West Indian Ameiva and showing within this lineage, a basal polytomy of four clades that are separated geographically. Ameiva polops grouped in a cluster that included the other two Ameiva species found in the Puerto Rican Bank: A. wetmorei and A. exsul. A sister relationship between A. polops and A. wetmorei is suggested by our analyses. We compare our results with a previous study on molecular systematics of West Indian Ameiva. 

  9. Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies

    PubMed Central

    Leaché, Adam D.; Banbury, Barbara L.; Felsenstein, Joseph; Nieto-Montes de Oca, Adrián; Stamatakis, Alexandros

    2015-01-01

    Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. 
Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the presence of missing data. Phylogenetic analysis of RAD loci requires careful attention to model assumptions, especially if downstream analyses depend on branch lengths. PMID:26227865
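
    The idea behind the conditional likelihood correction can be illustrated with a deliberately small case. The sketch below assumes a two-taxon comparison under the JC69 substitution model, which is a simplification chosen for illustration and not the setup used in the study; all function names are hypothetical. It divides the likelihood of each variable site pattern by the probability that a site is variable at all.

```python
import math

BASES = "ACGT"

def jc69_prob(x, y, d):
    """JC69 probability of base x changing to base y over distance d."""
    e = math.exp(-4.0 * d / 3.0)
    return 0.25 + 0.75 * e if x == y else 0.25 - 0.25 * e

def pattern_likelihood(x, y, d):
    """Likelihood of observing bases (x, y) in two taxa separated by d."""
    return 0.25 * jc69_prob(x, y, d)  # 0.25 = equal base frequencies

def invariant_probability(d):
    """Total probability of the four constant patterns AA, CC, GG, TT."""
    return sum(pattern_likelihood(b, b, d) for b in BASES)

def conditional_likelihood(x, y, d):
    """Likelihood of a variable pattern, given that the site is variable."""
    if x == y:
        raise ValueError("defined for variable patterns only")
    return pattern_likelihood(x, y, d) / (1.0 - invariant_probability(d))

d = 0.1  # hypothetical distance in expected substitutions per site
variable_patterns = [(x, y) for x in BASES for y in BASES if x != y]
uncorrected_total = sum(pattern_likelihood(x, y, d) for x, y in variable_patterns)
corrected_total = sum(conditional_likelihood(x, y, d) for x, y in variable_patterns)
```

    Without the correction, the probabilities of the variable patterns sum to well below one, because the model still reserves probability for invariant sites that a SNP-only alignment can never contain. After conditioning they sum to one.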

  11. Complete Chloroplast Genome Sequence of an Orchid Model Plant Candidate: Erycina pusilla Apply in Tropical Oncidium Breeding

    PubMed Central

    Pan, I-Chun; Liao, Der-Chih; Wu, Fu-Huei; Daniell, Henry; Singh, Nameirakpam Dolendro; Chang, Chen; Shih, Ming-Che; Chan, Ming-Tsair; Lin, Choun-Sea

    2012-01-01

    Oncidium is an important ornamental plant but the study of its functional genomics is difficult. Erycina pusilla is a fast-growing Oncidiinae species. Several characteristics including low chromosome number, small genome size, short growth period, and its ability to complete its life cycle in vitro make E. pusilla a good model candidate and parent for hybridization for orchids. Although genetic information remains limited, systematic molecular analysis of its chloroplast genome might provide useful genetic information. By combining bacterial artificial chromosome (BAC) clones and next-generation sequencing (NGS), the chloroplast (cp) genome of E. pusilla was sequenced accurately, efficiently and economically. The cp genome of E. pusilla shares 89 and 84% similarity with Oncidium Gower Ramsey and Phalaenopsis aphrodite, respectively. Comparing these 3 cp genomes, 5 regions have been identified as showing diversity. Using PCR analysis of 19 species belonging to the Epidendroideae subfamily, a conserved deletion was found in the rps15-trnN region of the Cymbidieae tribe. Because commercial Oncidium varieties in Taiwan are limited, identification of potential parents using molecular breeding methods has become very important. To demonstrate the relationship between taxonomic position and hybrid compatibility of E. pusilla, 4 DNA regions of 36 tropically adapted Oncidiinae varieties have been analyzed. The results indicated that trnF-ndhJ and trnH-psbA were suitable for phylogenetic analysis. E. pusilla proved to be phylogenetically closer to Rodriguezia and Tolumnia than Oncidium, despite its similar floral appearance to Oncidium. These results indicate the hybrid compatibility of E. pusilla, its cp genome providing important information for Oncidium breeding. PMID:22496851

  12. An integrative view of phylogenetic comparative methods: connections to population genetics, community ecology, and paleobiology.

    PubMed

    Pennell, Matthew W; Harmon, Luke J

    2013-06-01

    Recent innovations in phylogenetic comparative methods (PCMs) have spurred a renaissance of research into the causes and consequences of large-scale patterns of biodiversity. In this paper, we review these advances. We also highlight the potential of comparative methods to integrate across fields and focus on three examples where such integration might be particularly valuable: quantitative genetics, community ecology, and paleobiology. We argue that PCMs will continue to be a key set of tools in evolutionary biology, shedding new light on how evolutionary processes have shaped patterns of biodiversity through deep time. © 2013 New York Academy of Sciences.

  13. An exploration of differences in the scaling of life history traits with body mass within reptiles and between amniotes.

    PubMed

    Hallmann, Konstantin; Griebeler, Eva Maria

    2018-06-01

    Allometric relationships linking species characteristics to body size or mass (scaling) are important in biology. However, studies on the scaling of life history traits in the reptiles (the nonavian Reptilia) are rather scarce, especially for the clades Crocodilia, Testudines, and Rhynchocephalia (single extant species, the tuatara). Previous studies on the scaling of reptilian life history traits indicated that they differ from those seen in the other amniotes (mammals and birds), but so far most comparative studies used small species samples and did not use phylogenetically informed analyses. Here, we analyzed the scaling of nine life history traits with adult body mass for crocodiles (n = 22), squamates (n = 294), turtles (n = 52), and reptiles (n = 369). We used for the first time a phylogenetically informed approach for crocodiles, turtles, and the whole group of reptiles. We explored differences in scaling relationships between the reptilian clades Crocodilia, Squamata, and Testudines as well as differences between reptiles, mammals, and birds. Finally, we applied our scaling relationships to gain new insights into the degree of exceptionality of the tuatara's life history within reptiles. For none of the life history traits studied did we observe any difference in scaling with body mass between squamates, crocodiles, and turtles, except for clutch size and egg weight, which showed small differences between these groups. Compared to birds and mammals, scaling relationships of reptiles were similar for time-related traits, but they differed for reproductive traits. The tuatara's life history is more similar to that of a similar-sized turtle or crocodile than to that of a squamate.
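
    The abstract does not say which phylogenetically informed method was used. As one illustration of the general idea, the sketch below estimates an allometric slope with Felsenstein's phylogenetically independent contrasts, a technique swapped in here for simplicity and not necessarily the authors' approach. The four-species tree, branch lengths, and trait values are hypothetical.

```python
import math

def contrasts(node, trait):
    """Return (value, branch_length, contrasts) for a subtree.

    node is ("name", length) for a tip, or ((left, right), length)
    for an internal node with two children.
    """
    label, length = node
    if isinstance(label, str):
        return trait[label], length, []
    left, right = label
    x1, v1, c1 = contrasts(left, trait)
    x2, v2, c2 = contrasts(right, trait)
    # Standardized contrast between the two daughter lineages
    contrast = (x1 - x2) / math.sqrt(v1 + v2)
    # Ancestral value: average weighted by inverse branch length
    value = (x1 / v1 + x2 / v2) / (1 / v1 + 1 / v2)
    # Lengthen the stem to account for uncertainty in the estimate
    return value, length + v1 * v2 / (v1 + v2), c1 + c2 + [contrast]

def pic_slope(tree, x, y):
    """Slope of y on x from a regression of contrasts through the origin."""
    cx = contrasts(tree, x)[2]
    cy = contrasts(tree, y)[2]
    return sum(a * b for a, b in zip(cx, cy)) / sum(a * a for a in cx)

# Hypothetical tree ((sp1,sp2),(sp3,sp4)) with branch lengths
clade_a = ((("sp1", 1.0), ("sp2", 1.0)), 1.0)
clade_b = ((("sp3", 1.0), ("sp4", 1.0)), 1.0)
tree = ((clade_a, clade_b), 0.0)

# Hypothetical log10 body mass, and a trait scaling with exponent 0.25
log_mass = {"sp1": 1.0, "sp2": 1.4, "sp3": 2.5, "sp4": 3.1}
log_trait = {sp: 0.3 + 0.25 * m for sp, m in log_mass.items()}

slope = pic_slope(tree, log_mass, log_trait)
```

    Because the toy trait is an exact linear function of log body mass, the contrast regression recovers the scaling exponent of 0.25. With real data the contrasts remove the non-independence that shared ancestry introduces into an ordinary log-log regression.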

  14. High time for a roll call: gene duplication and phylogenetic relationships of TCP-like genes in monocots

    PubMed Central

    Mondragón-Palomino, Mariana; Trontin, Charlotte

    2011-01-01

    Background and Aims: The TCP family is an ancient group of plant developmental transcription factors that regulate cell division in vegetative and reproductive structures and are essential in the establishment of flower zygomorphy. In-depth research on eudicot TCPs has documented their evolutionary and developmental role. This has not happened to the same extent in monocots, although zygomorphy has been critical for the diversification of Orchidaceae and Poaceae, the largest families of this group. Investigating the evolution and function of TCP-like genes in a wider group of monocots requires a detailed phylogenetic analysis of all available sequence information and a system that facilitates comparing genetic and functional information. Methods: The phylogenetic relationships of TCP-like genes in monocots were investigated by analysing sequences from the genomes of Zea mays, Brachypodium distachyon, Oryza sativa and Sorghum bicolor, as well as EST data from several other monocot species. Key Results: All available monocot TCP-like sequences are associated in 20 major groups with an average identity ≥64% and most correspond to well-supported clades of the phylogeny. Their sequence motifs and relationships of orthology were documented and it was found that 67% of the TCP-like genes of Sorghum, Oryza, Zea and Brachypodium are in microsyntenic regions. This analysis suggests that two rounds of whole genome duplication drove the expansion of TCP-like genes in these species. Conclusions: A system of classification is proposed where putative or recognized monocot TCP-like genes are assigned to a specific clade of PCF-, CIN- or CYC/tb1-like genes. Specific biases in sequence data of this family that must be tackled when studying its molecular evolution and phylogeny are documented. Finally, the significant retention of duplicated TCP genes from Zea mays is considered in the context of balanced gene drive. PMID:21444336

  15. Phylogenetic Relationships of Citrus and Its Relatives Based on matK Gene Sequences

    PubMed Central

    Penjor, Tshering; Uehara, Miki; Ide, Manami; Matsumoto, Natsumi; Matsumoto, Ryoji

    2013-01-01

    The genus Citrus includes mandarin, orange, lemon, grapefruit and lime, which have high economic and nutritional value. The family Rutaceae can be divided into 7 subfamilies, including Aurantioideae. The genus Citrus belongs to the subfamily Aurantioideae. In this study, we sequenced the chloroplast matK genes of 135 accessions from 22 genera of Aurantioideae and analyzed them phylogenetically. Our study includes many accessions that have not been examined in other studies. The subfamily Aurantioideae has been classified into 2 tribes, Clauseneae and Citreae, and our current molecular analysis clearly discriminates Citreae from Clauseneae by using only 1 chloroplast DNA sequence. Our study confirms previous observations on the molecular phylogeny of Aurantioideae in many aspects. However, we have provided novel information on these genetic relationships. For example, inconsistent with the previous observation, and consistent with our preliminary study using the chloroplast rbcL genes, our analysis showed that Feroniella oblata is not nested in Citrus species and is closely related to Feronia limonia. Furthermore, we have shown that Murraya paniculata is similar to Merrillia caloxylon and is dissimilar to Murraya koenigii. We found that “true citrus fruit trees” could be divided into 2 subclusters. One subcluster included Citrus, Fortunella, and Poncirus, while the other subcluster included Microcitrus and Eremocitrus. Compared to previous studies, our current study is the most extensive phylogenetic study of Citrus species since it includes 93 accessions. The results indicate that Citrus species can be classified into 3 clusters: a citron cluster, a pummelo cluster, and a mandarin cluster. Although most mandarin accessions belonged to the mandarin cluster, we found some exceptions. We also obtained information on the genetic background of various species of acid citrus grown in Japan.
Because the genus Citrus contains many important accessions, we have comprehensively discussed the classification of this genus. PMID:23638116

  16. The morphological state space revisited: what do phylogenetic patterns in homoplasy tell us about the number of possible character states?

    PubMed Central

    Hoyal Cuthill, Jennifer F.

    2015-01-01

    Biological variety and major evolutionary transitions suggest that the space of possible morphologies may have varied among lineages and through time. However, most models of phylogenetic character evolution assume that the potential state space is finite. Here, I explore what the morphological state space might be like, by analysing trends in homoplasy (repeated derivation of the same character state). Analyses of ten published character matrices are compared against computer simulations with different state space models: infinite states, finite states, ordered states and an ‘inertial’ model, simulating phylogenetic constraints. Of these, only the infinite states model results in evolution without homoplasy, a prediction which is not generally met by real phylogenies. Many authors have interpreted the ubiquity of homoplasy as evidence that the number of evolutionary alternatives is finite. However, homoplasy is also predicted by phylogenetic constraints on the morphological distance that can be traversed between ancestor and descendant. Phylogenetic rarefaction (sub-sampling) shows that finite and inertial state spaces do produce contrasting trends in the distribution of homoplasy. Two clades show trends characteristic of phylogenetic inertia, with decreasing homoplasy (increasing consistency index) as we sub-sample more distantly related taxa. One clade shows increasing homoplasy, suggesting exhaustion of finite states. Different clades may, therefore, show different patterns of character evolution. However, when parsimony uninformative characters are excluded (which may occur without documentation in cladistic studies), it may no longer be possible to distinguish inertial and finite state spaces. Interestingly, inertial models predict that homoplasy should be clustered among comparatively close relatives (parallel evolution), whereas finite state models do not.
If morphological evolution is often inertial in nature, then homoplasy (false homology) may primarily occur between close relatives, perhaps being replaced by functional analogy at higher taxonomic scales. PMID:26640650

  17. Advances in the use of DNA barcodes to build a community phylogeny for tropical trees in a Puerto Rican forest dynamics plot.

    PubMed

    Kress, W John; Erickson, David L; Swenson, Nathan G; Thompson, Jill; Uriarte, Maria; Zimmerman, Jess K

    2010-11-09

    Species number, functional traits, and phylogenetic history all contribute to characterizing the biological diversity in plant communities. The phylogenetic component of diversity has been particularly difficult to quantify in species-rich tropical tree assemblages. The compilation of previously published (and often incomplete) data on evolutionary relationships of species into a composite phylogeny of the taxa in a forest, through such programs as Phylomatic, has proven useful in building community phylogenies although often of limited resolution. Recently, DNA barcodes have been used to construct a robust community phylogeny for nearly 300 tree species in a forest dynamics plot in Panama using a supermatrix method. In that study sequence data from three barcode loci were used to generate a well-resolved species-level phylogeny. Here we expand upon this earlier investigation and present results on the use of a phylogenetic constraint tree to generate a community phylogeny for a diverse, tropical forest dynamics plot in Puerto Rico. This enhanced method of phylogenetic reconstruction ensures the congruence of the barcode phylogeny with broadly accepted hypotheses on the phylogeny of flowering plants (i.e., APG III) regardless of the number and taxonomic breadth of the taxa sampled. We also compare maximum parsimony versus maximum likelihood estimates of community phylogenetic relationships as well as evaluate the effectiveness of one- versus two- versus three-gene barcodes in resolving community evolutionary history. As first demonstrated in the Panamanian forest dynamics plot, the results for the Puerto Rican plot illustrate that highly resolved phylogenies derived from DNA barcode sequence data combined with a constraint tree based on APG III are particularly useful in comparative analysis of phylogenetic diversity and will enhance research on the interface between community ecology and evolution.

  18. Limited overlap between phylogenetic HIV and hepatitis C virus clusters illustrates the dynamic sexual network structure of Dutch HIV-infected MSM.

    PubMed

    Vanhommerig, Joost W; Bezemer, Daniela; Molenkamp, Richard; Van Sighem, Ard I; Smit, Colette; Arends, Joop E; Lauw, Fanny N; Brinkman, Kees; Rijnders, Bart J; Newsum, Astrid M; Bruisten, Sylvia M; Prins, Maria; Van Der Meer, Jan T; Van De Laar, Thijs J; Schinkel, Janke

    2017-09-24

    MSM are at increased risk for infection with HIV-1 and hepatitis C virus (HCV). Is HIV/HCV coinfection confined to specific HIV transmission networks? An HIV phylogenetic tree was constructed for 5038 HIV-1 subtype B polymerase (pol) sequences obtained from MSM in the AIDS therapy evaluation in the Netherlands cohort. We investigated the existence of HIV clusters with increased HCV prevalence, the HIV phylogenetic density (i.e. the number of potential HIV transmission partners) of HIV/HCV-coinfected MSM compared with HIV-infected MSM without HCV, and the overlap in HIV and HCV phylogenies using HCV nonstructural protein 5B sequences from 183 HIV-infected MSM with acute HCV infection. Five hundred and sixty-three of 5038 (11.2%) HIV-infected MSM tested HCV positive. Phylogenetic analysis revealed 93 large HIV clusters (≥10 MSM), 370 small HIV clusters (2-9 MSM), and 867 singletons with a median HCV prevalence of 11.5, 11.6, and 9.3%, respectively. We identified six large HIV clusters with elevated HCV prevalence (range 23.5-46.2%). Median HIV phylogenetic densities for MSM with HCV (3, interquartile range 1-7) and without HCV (3, interquartile range 1-8) were similar. HCV phylogeny showed 12 MSM-specific HCV clusters (cluster size: 2-39 HCV sequences); 12.7% of HCV infections were part of the same HIV and HCV cluster. We observed few HIV clusters with elevated HCV prevalence, no increase in the HIV phylogenetic density of HIV/HCV-coinfected MSM compared to HIV-infected MSM without HCV, and limited overlap between HIV and HCV phylogenies among HIV/HCV-coinfected MSM. Our data do not support the existence of MSM-specific sexual networks that fuel both the HIV and HCV epidemic.

  19. Plunging hands into the mushroom jar: a phylogenetic framework for Lyophyllaceae (Agaricales, Basidiomycota).

    PubMed

    Bellanger, J-M; Moreau, P-A; Corriol, G; Bidaud, A; Chalange, R; Dudova, Z; Richard, F

    2015-04-01

    During the last two decades, the unprecedented development of molecular phylogenetic tools has created an opportunity to revisit the fungal kingdom from an evolutionary perspective. Mycology has been profoundly changed but a sustained effort to elucidate large sections of the astonishing fungal diversity is still needed. Here we fill this gap in the case of Lyophyllaceae, a species-rich and ecologically diversified family of mushrooms. Assembly and genealogical concordance multigene phylogenetic analysis of a large dataset that includes original, vouchered material from expert field mycologists reveal the phylogenetic topology of the family, from higher (generic) to lower (species) levels. A comparative analysis of the most widely used phylogenetic markers in Fungi indicates that the nuc rDNA region encompassing the internal transcribed spacers 1 and 2, along with the 5.8S rDNA (ITS), combined with portions of the gene for the RNA polymerase II second largest subunit (RPB2), is the best-performing combination to resolve the broadest range of taxa within Lyophyllaceae. Eleven distinct evolutionary lineages are identified, which display partial overlap with traditional genera as well as with the phylogenetic framework previously proposed for the family. Eighty phylogenetic species are delineated, which shed light on a large number of morphological concepts, including rare and poorly documented ones. Testing these novel phylogenetic species against the barcoding method of species limit delineation indicates that the latter method fully resolves Lyophyllaceae species, except in one clade. This case study provides the first comprehensive phylogenetic overview of Lyophyllaceae, a necessary step towards a taxonomical, ecological and nomenclatural revision of this family of mushrooms. It also proposes a set of methodological guidelines that may be of relevance for future taxonomic works in other groups of Fungi.

  20. Analysing taxonomic structures and local ecological processes in temperate forests in North Eastern China.

    PubMed

    Fan, Chunyu; Tan, Lingzhao; Zhang, Chunyu; Zhao, Xiuhai; von Gadow, Klaus

    2017-10-30

    One of the core issues of forest community ecology is the exploration of how ecological processes affect community structure. The relative importance of different processes is still under debate. This study addresses four questions: (1) how is the taxonomic structure of a forest community affected by spatial scale? (2) does the taxonomic structure reveal effects of local processes such as environmental filtering, dispersal limitation or interspecific competition at a local scale? (3) does the effect of local processes on the taxonomic structure vary with the spatial scale? (4) does the analysis based on taxonomic structures provide similar insights when compared with the use of phylogenetic information? Based on the data collected in two large forest observational field studies, the taxonomic structures of the plant communities were analyzed at different sampling scales using taxonomic ratios (number of genera/number of species, number of families/number of species), and the relationship between the number of higher taxa and the number of species. Two random null models were used and the "standardized effect size" (SES) of taxonomic ratios was calculated, to assess possible differences between the observed and simulated taxonomic structures, which may be caused by specific ecological processes. We further applied a phylogeny-based method to compare results with those of the taxonomic approach. As expected, the taxonomic ratios decline with increasing grain size. The quantitative relationship between genera/families and species, described by a linearized power function, showed a good fit. With the exception of the family-species relationship in the Jiaohe study area, the exponents of the genus/family-species relationships did not show any scale dependent effects. The taxonomic ratios of the observed communities had significantly lower values than those of the simulated random community under the test of two null models at almost all scales. 
Null Model 2, which considered the spatial dispersion of species, generated a taxonomic structure which proved to be more consistent with that in the observed community. As sampling sizes increased from 20 m × 20 m to 50 m × 50 m, the magnitudes of SESs of taxonomic ratios increased. Based on the phylogenetic analysis, we found that the Jiaohe plot was phylogenetically clustered at almost all scales. We detected significant phylogenetic overdispersion at the 20 m × 20 m and 30 m × 30 m scales in the Liangshui plot. The results suggest that the effect of abiotic filtering is greater than the effects of interspecific competition in shaping the local community at almost all scales. Local processes influence the taxonomic structures, but their combined effects vary with the spatial scale. The taxonomic approach provides similar insights as the phylogenetic approach, especially when we applied a more conservative null model. Analysing taxonomic structure may be a useful tool for communities where well-resolved phylogenetic data are not available.
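
    The standardized effect size described above can be sketched as the observed taxonomic ratio minus the mean of the null ratios, divided by their standard deviation. The example below is an illustration under stated assumptions: the null model simply draws the same number of species at random from a regional pool (the abstract does not specify the two null models in detail), and the species pool, genus labels, and function names are hypothetical.

```python
import random
import statistics

def genus_species_ratio(species, genus_of):
    """Number of genera divided by number of species in a sample."""
    return len({genus_of[s] for s in species}) / len(species)

def standardized_effect_size(observed, pool, genus_of, n_sim=999, seed=0):
    """SES = (observed ratio - mean of null ratios) / sd of null ratios.

    Null model (assumed): draw the same number of species at random,
    without replacement, from the regional pool.
    """
    rng = random.Random(seed)
    obs = genus_species_ratio(observed, genus_of)
    null = [
        genus_species_ratio(rng.sample(pool, len(observed)), genus_of)
        for _ in range(n_sim)
    ]
    return (obs - statistics.mean(null)) / statistics.stdev(null)

# Hypothetical pool: 6 genera with 2 species each
genus_of = {f"g{i}_s{j}": f"g{i}" for i in range(6) for j in range(2)}
pool = sorted(genus_of)

# Hypothetical observed plot: 4 species drawn from only 2 genera
observed = ["g0_s0", "g0_s1", "g1_s0", "g1_s1"]

ses = standardized_effect_size(observed, pool, genus_of)
```

    A negative value means the plot holds fewer genera per species than random assembly would give, which matches the direction of the result reported above for the observed communities.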

  1. Phylogenetic diversity anomaly in angiosperms between eastern Asia and eastern North America.

    PubMed

    Qian, Hong; Jin, Yi; Ricklefs, Robert E

    2017-10-24

    Although eastern Asia (EAS) and eastern North America (ENA) have similar climates, plant species richness in EAS greatly exceeds that in ENA. The degree to which this diversity difference reflects the ages of the floras or their rates of evolutionary diversification has not been quantified. Measures of species diversity that do not incorporate the ages of lineages disregard the evolutionary distinctiveness of species. In contrast, phylogenetic diversity integrates both the number of species and their history of evolutionary diversification. Here we compared species diversity and phylogenetic diversity in a large number of flowering plant (angiosperm) floras distributed across EAS and ENA, two regions with similar contemporary environments and broadly shared floristic history. After accounting for climate and sample area, we found both species diversity and phylogenetic diversity to be significantly higher in EAS than in ENA. When we controlled the number of species statistically, we found that phylogenetic diversity remained substantially higher in EAS than in ENA, although it tended to converge at high latitude. This pattern held independently for herbs, shrubs, and trees. The anomaly in species and phylogenetic diversity likely resulted from differences in regional processes, related in part to high climatic and topographic heterogeneity, and a strong monsoon climate, in EAS. The broad connection between tropical and temperate floras in southern Asia also might have played a role in creating the phylogenetic diversity anomaly.

  2. Onto-phylogenetic aspect of myotomal myogenesis in Chordata.

    PubMed

    Kiełbówna, Leokadia; Daczewska, Małgorzata

    2004-01-01

    This paper presents an onto- and phylogenetic aspect of myotomal myogenesis in Chordata. A comparative analysis of early stages of myotomal myogenesis in Chordata indicates that the myogenic process in this phylum underwent evolutionary changes. The first stage of the process is myogenesis leading to development of mononucleate mature muscle cells, the most advanced stage is formation of multinucleate muscle fibres.

  3. Multilocus phylogenetic analysis of true morels (Morchella) reveals high levels of endemics in Turkey relative to other regions of Europe

    USDA-ARS's Scientific Manuscript database

    The present study was conducted to better understand how the phylogenetic diversity of true morels (Morchella) in Turkey compares with species found in other regions of the world. The current research builds on our recently published survey of 10 Turkish provinces and another of the world in which D...

  4. Phylogenetic Reconstruction as a Broadly Applicable Teaching Tool in the Biology Classroom: The Value of Data in Estimating Likely Answers

    ERIC Educational Resources Information Center

    Julius, Matthew L.; Schoenfuss, Heiko L.

    2006-01-01

    This laboratory exercise introduces students to a fundamental tool in evolutionary biology--phylogenetic inference. Students are required to create a data set via observation and through mining preexisting data sets. These student data sets are then used to develop and compare competing hypotheses of vertebrate phylogeny. The exercise uses readily…

  5. The Transporter Classification Database: recent advances.

    PubMed

    Saier, Milton H; Yen, Ming Ren; Noto, Keith; Tamang, Dorjee G; Elkan, Charles

    2009-01-01

    The Transporter Classification Database (TCDB), freely accessible at http://www.tcdb.org, is a relational database containing sequence, structural, functional and evolutionary information about transport systems from a variety of living organisms, based on the International Union of Biochemistry and Molecular Biology-approved transporter classification (TC) system. It is a curated repository for factual information compiled largely from published references. It uses a functional/phylogenetic system of classification, and currently encompasses about 5000 representative transporters and putative transporters in more than 500 families. We here describe novel software designed to support and extend the usefulness of TCDB. Our recent efforts render it more user friendly, incorporate machine learning to input novel data in a semiautomatic fashion, and allow analyses that are more accurate and less time consuming. The availability of these tools has resulted in recognition of distant phylogenetic relationships and tremendous expansion of the information available to TCDB users.

  6. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    PubMed

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository ( http://bitbucket.org/osiris_phylogenetics/pia/ ) and we demonstrate PIA on a publicly-accessible web server ( http://galaxy-dev.cnsi.ucsb.edu/pia/ ). 
Our new trees for LIT genes will be a valuable resource for researchers studying the evolution of eyes or other light-interacting structures. We also introduce PIA, a high throughput method for using phylogenetic relationships to identify LIT genes in transcriptomes from non-model organisms. With simple modifications, our methods may be used to search for different sets of genes or to annotate data sets from taxa outside of Metazoa.

  7. EvolView, an online tool for visualizing, annotating and managing phylogenetic trees.

    PubMed

    Zhang, Huangkai; Gao, Shenghan; Lercher, Martin J; Hu, Songnian; Chen, Wei-Hua

    2012-07-01

    EvolView is a web application for visualizing, annotating and managing phylogenetic trees. First, EvolView is a phylogenetic tree viewer and customization tool; it visualizes trees in various formats, customizes them through built-in functions that can link information from external datasets, and exports the customized results to publication-ready figures. Second, EvolView is a tree and dataset management tool: users can easily organize related trees into distinct projects, add new datasets to trees and edit and manage existing trees and datasets. To make EvolView easy to use, it is equipped with an intuitive user interface. With a free account, users can save data and manipulations on the EvolView server. EvolView is freely available at: http://www.evolgenius.info/evolview.html.

  8. Data set for phylogenetic tree and RAMPAGE Ramachandran plot analysis of SODs in Gossypium raimondii and G. arboreum.

    PubMed

    Wang, Wei; Xia, Minxuan; Chen, Jie; Deng, Fenni; Yuan, Rui; Zhang, Xiaopei; Shen, Fafu

    2016-12-01

    The data presented in this paper support the research article "Genome-Wide Analysis of Superoxide Dismutase Gene Family in Gossypium raimondii and G. arboreum" [1]. In this data article, we present a phylogenetic tree showing a dichotomy with two different clusters of SODs inferred by the Bayesian method of MrBayes (version 3.2.4), "Bayesian phylogenetic inference under mixed models" [2], Ramachandran plots of G. raimondii and G. arboreum SODs, the protein sequence used to generate the 3D structure of proteins and the template accession via the SWISS-MODEL server, "SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information." [3] and motif sequences of SODs identified by InterProScan (version 4.8) with the Pfam database, "Pfam: the protein families database" [4].

  9. EvolView, an online tool for visualizing, annotating and managing phylogenetic trees

    PubMed Central

    Zhang, Huangkai; Gao, Shenghan; Lercher, Martin J.; Hu, Songnian; Chen, Wei-Hua

    2012-01-01

    EvolView is a web application for visualizing, annotating and managing phylogenetic trees. First, EvolView is a phylogenetic tree viewer and customization tool; it visualizes trees in various formats, customizes them through built-in functions that can link information from external datasets, and exports the customized results to publication-ready figures. Second, EvolView is a tree and dataset management tool: users can easily organize related trees into distinct projects, add new datasets to trees and edit and manage existing trees and datasets. To make EvolView easy to use, it is equipped with an intuitive user interface. With a free account, users can save data and manipulations on the EvolView server. EvolView is freely available at: http://www.evolgenius.info/evolview.html. PMID:22695796

  10. RNA Sequencing Analysis of the Gametophyte Transcriptome from the Liverwort, Marchantia polymorpha

    PubMed Central

    Sharma, Niharika; Jung, Chol-Hee; Bhalla, Prem L.; Singh, Mohan B.

    2014-01-01

    The liverwort Marchantia polymorpha is a member of the most basal lineage of land plants (embryophytes) and likely retains many ancestral morphological, physiological and molecular characteristics. Despite its phylogenetic importance and the availability of previous EST studies, M. polymorpha’s lack of economic importance limits accessible genomic resources for this species. We employed Illumina RNA-Seq technology to sequence the gametophyte transcriptome of M. polymorpha. cDNA libraries from 6 different male and female developmental tissues were sequenced to delineate a global view of the M. polymorpha transcriptome. Approximately 80 million short reads were obtained and assembled into a non-redundant set of 46,533 transcripts (≥200 bp) from 46,070 loci. The average length and the N50 length of the transcripts were 757 bp and 471 bp, respectively. Sequence comparison of assembled transcripts with non-redundant proteins from embryophytes resulted in the annotation of 43% of the transcripts. The transcripts were also compared with M. polymorpha expressed sequence tags (ESTs), and approximately 69.5% of the transcripts appeared to be novel. Twenty-one percent of the transcripts were assigned GO terms to improve annotation. In addition, 6,112 simple sequence repeats (SSRs) were identified as potential molecular markers, which may be useful in studies of genetic diversity. A comparative genomics approach revealed that a substantial proportion of the genes (35.5%) expressed in M. polymorpha were conserved across phylogenetically related species, such as Selaginella and Physcomitrella, and identified 580 genes that are potentially unique to liverworts. Our study presents an extensive amount of novel sequence information for M. polymorpha.
This information will serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the isolation and characterization of functional genes that are involved in sex differentiation and sexual reproduction in this liverwort. PMID:24841988
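
    For readers unfamiliar with the N50 statistic quoted in this record, the following is a minimal sketch, assuming the common definition (the length L such that sequences of length L or longer contain at least half of all assembled bases). The function name `n50` and the toy lengths are illustrative assumptions and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the common definition: walk the lengths from longest to
    shortest and report the length at which the running total first
    reaches half of the total number of bases.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical transcript lengths in bp (not data from the record above).
toy_lengths = [200, 300, 400, 500, 1000]
toy_n50 = n50(toy_lengths)
toy_mean = sum(toy_lengths) / len(toy_lengths)
```

    Reporting N50 alongside the mean, as the record does, gives a rough sense of how the assembled bases are distributed across short and long transcripts.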

  11. The State of Phylogenetic Analysis: Narrow Visions and Simple Answers-Examples from the Diptera (flies).

    PubMed

    Borkent, Art

    2018-01-17

    The order Diptera is remarkably diverse, not only in species but in morphological variation in every life stage, making them excellent candidates for phylogenetic analysis. Such analysis has been hampered by methods that have severely restricted character state interpretation. Morphological-based phylogenies should be based on a deep understanding of the morphology, development and function of character states, and have extensive outgroup comparisons made to determine their polarity. Character states clearly vary in their value for determining phylogenetic relationships and this needs to be studied and utilized. Characters themselves need more explicit discussion, including how some may be developmentally or functionally related to other characters (and potentially not independent indicators of genealogical relationship). The current practice by many, of filling a matrix with poorly understood character states and highly limited outgroup comparisons, is unacceptable if the results are to be a valid reflection of the actual history of the group. Parsimony analysis is not an objective interpretation of phylogenetic relationships when all characters are treated as equal in value. Exact mathematical values applied to characters are entirely arbitrary and are generally used to produce a phylogeny that the author considers as reasonable. Mathematical appraisal of a given node is similarly inconsequential because characters do not have an intrinsic mathematical value. Bremer support, for example, provides values that have no biological reality but provide the pretence of objectivity.
Cladists need to focus their attention on testing the validity of each synapomorphy proposed, as the basis for all further phylogenetic interpretation, rather than the testing of differing phylogenies through various comparative programs. Current phylogenetic analyses have come to increasingly depend on DNA sequence-based characters, in spite of their tumultuous history of inconsistent results. Until such time as sequences can be shown to produce predictive phylogenies (i.e., using Hennigian logic), independent of morphological analysis, they should be viewed with caution and certainly not as a panacea as they are commonly portrayed. The purported comprehensive analyses of phylogenetic relationships between families of Diptera by Wiegmann et al. (2011) and Lambkin et al. (2013) have serious flaws and cannot be considered as the "Periodic Table" of such relationships as originally heralded. Systematists working on Diptera have a plethora of complex and informative morphological synapomorphies in every life stage, either described or awaiting study. Many lineages have the potential of providing a wealth of evolutionary stories to share with other biologists if we produce stable phylogenies based on weighted synapomorphies and interpreted to elucidate the zoogeographic and bionomic divergence of the group. Some lineages are devoid of convincing synapomorphies and, in spite of our desires, should be recognized as being largely uninterpretable.

  12. Your place or mine? A phylogenetic comparative analysis of marital residence in Indo-European and Austronesian societies

    PubMed Central

    Fortunato, Laura; Jordan, Fiona

    2010-01-01

    Accurate reconstruction of prehistoric social organization is important if we are to put together satisfactory multidisciplinary scenarios about, for example, the dispersal of human groups. Such considerations apply in the case of Indo-European and Austronesian, two large-scale language families that are thought to represent Neolithic expansions. Ancestral kinship patterns have mostly been inferred through reconstruction of kin terminologies in ancestral proto-languages using the linguistic comparative method, and through geographical or distributional arguments based on the comparative patterns of kin terms and ethnographic kinship ‘facts’. While these approaches are detailed and valuable, the processes through which conclusions have been drawn from the data fail to provide explicit criteria for systematic testing of alternative hypotheses. Here, we use language trees derived using phylogenetic tree-building techniques on Indo-European and Austronesian vocabulary data. With these trees, ethnographic data and Bayesian phylogenetic comparative methods, we statistically reconstruct past marital residence and infer rates of cultural change between different residence forms, showing Proto-Indo-European to be virilocal and Proto-Malayo-Polynesian uxorilocal. The instability of uxorilocality and the rare loss of virilocality once gained emerge as common features of both families. PMID:21041215

  13. Using tree diversity to compare phylogenetic heuristics.

    PubMed

    Sul, Seung-Jin; Matthews, Suzanne; Williams, Tiffani L

    2009-04-29

    Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations that use parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees, especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest.
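
    The Robinson-Foulds distance mentioned in this record can be illustrated with a short sketch. It assumes trees are written as nested Python tuples with string leaf names and counts the non-trivial bipartitions (splits) found in exactly one of the two trees. The representation and the function names are assumptions made for illustration; this is not the authors' implementation.

```python
def leaf_set(tree):
    """Collect the leaf names of a tree given as nested tuples of strings."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def bipartitions(tree):
    """Return the non-trivial splits of a tree, treated as unrooted.

    Each split is stored as the side that does NOT contain a fixed
    anchor leaf, so the same split always has the same representation.
    """
    all_leaves = leaf_set(tree)
    anchor = min(all_leaves)
    splits = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        leaves = frozenset().union(*(visit(child) for child in node))
        side = all_leaves - leaves if anchor in leaves else leaves
        # Keep only splits with at least two leaves on each side.
        if 1 < len(side) < len(all_leaves) - 1:
            splits.add(side)
        return leaves

    visit(tree)
    return splits


def rf_distance(tree_a, tree_b):
    """Number of splits present in exactly one of the two trees."""
    if leaf_set(tree_a) != leaf_set(tree_b):
        raise ValueError("trees must share the same leaf set")
    return len(bipartitions(tree_a) ^ bipartitions(tree_b))


# Two hypothetical five-taxon trees that differ in one internal grouping.
t1 = ((("A", "B"), "C"), ("D", "E"))
t2 = ((("A", "C"), "B"), ("D", "E"))
distance = rf_distance(t1, t2)
```

    Two trees can have identical parsimony scores and still have a non-zero distance under this measure, which is the kind of topological difference the record describes.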

  14. Phylogenetic analysis of anaerobic psychrophilic enrichment cultures obtained from a greenland glacier ice core

    NASA Technical Reports Server (NTRS)

    Sheridan, Peter P.; Miteva, Vanya I.; Brenchley, Jean E.

    2003-01-01

    The examination of microorganisms in glacial ice cores allows the phylogenetic relationships of organisms frozen for thousands of years to be compared with those of current isolates. We developed a method for aseptically sampling a sediment-containing portion of a Greenland ice core that had remained at -9 degrees C for over 100,000 years. Epifluorescence microscopy and flow cytometry results showed that the ice sample contained over 6 × 10^7 cells/ml. Anaerobic enrichment cultures inoculated with melted ice were grown and maintained at -2 degrees C. Genomic DNA extracted from these enrichments was used for the PCR amplification of 16S rRNA genes with bacterial and archaeal primers and the preparation of clone libraries. Approximately 60 bacterial inserts were screened by restriction endonuclease analysis and grouped into 27 unique restriction fragment length polymorphism types, and 24 representative sequences were compared phylogenetically. Diverse sequences representing major phylogenetic groups including alpha, beta, and gamma Proteobacteria as well as relatives of the Thermus, Bacteroides, Eubacterium, and Clostridium groups were found. Sixteen clone sequences were closely related to those from known organisms, with four possibly representing new species. Seven sequences may reflect new genera and were most closely related to sequences obtained only by PCR amplification. One sequence was over 12% distant from its closest relative and may represent a novel order or family. These results show that phylogenetically diverse microorganisms have remained viable within the Greenland ice core for at least 100,000 years.

  15. Phylogenetic Analysis of Anaerobic Psychrophilic Enrichment Cultures Obtained from a Greenland Glacier Ice Core

    PubMed Central

    Sheridan, Peter P.; Miteva, Vanya I.; Brenchley, Jean E.

    2003-01-01

    The examination of microorganisms in glacial ice cores allows the phylogenetic relationships of organisms frozen for thousands of years to be compared with those of current isolates. We developed a method for aseptically sampling a sediment-containing portion of a Greenland ice core that had remained at −9°C for over 100,000 years. Epifluorescence microscopy and flow cytometry results showed that the ice sample contained over 6 × 10^7 cells/ml. Anaerobic enrichment cultures inoculated with melted ice were grown and maintained at −2°C. Genomic DNA extracted from these enrichments was used for the PCR amplification of 16S rRNA genes with bacterial and archaeal primers and the preparation of clone libraries. Approximately 60 bacterial inserts were screened by restriction endonuclease analysis and grouped into 27 unique restriction fragment length polymorphism types, and 24 representative sequences were compared phylogenetically. Diverse sequences representing major phylogenetic groups including alpha, beta, and gamma Proteobacteria as well as relatives of the Thermus, Bacteroides, Eubacterium, and Clostridium groups were found. Sixteen clone sequences were closely related to those from known organisms, with four possibly representing new species. Seven sequences may reflect new genera and were most closely related to sequences obtained only by PCR amplification. One sequence was over 12% distant from its closest relative and may represent a novel order or family. These results show that phylogenetically diverse microorganisms have remained viable within the Greenland ice core for at least 100,000 years. PMID:12676695

  16. Cross-validation to select Bayesian hierarchical models in phylogenetics.

    PubMed

    Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C

    2016-05-26

    Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.

  17. Characterization of the complete mitochondrial genome of the cloacal tapeworm Cloacotaenia megalops (Cestoda: Hymenolepididae).

    PubMed

    Guo, Aijiang

    2016-09-05

    The cloacal tapeworm Cloacotaenia megalops (Hymenolepididae) is one of the most common cestode parasites of domestic and wild ducks worldwide. However, limited information is available regarding its epidemiology, biology, genetics and systematics. This study provides characterisation of the complete mitochondrial (mt) genome of C. megalops. The complete mt genome of C. megalops was obtained by long PCR, sequenced and annotated. The length of the entire mt genome of C. megalops is 13,887 bp; it contains 12 protein-coding, 2 ribosomal RNA and 22 transfer RNA genes, but lacks an atp8 gene. The mt gene arrangement of C. megalops is identical to that observed in Anoplocephala magna and A. perfoliata (Anoplocephalidae), Dipylidium caninum (Dipylidiidae) and Hymenolepis diminuta (Hymenolepididae), but differs from that reported in taeniids owing to the position shift between the tRNA (L1) and tRNA (S2) genes. The phylogenetic position of C. megalops was inferred using Maximum likelihood and Bayesian inference methods based on the concatenated amino acid data for 12 protein-coding genes. Phylogenetic trees showed that C. megalops is sister to Anoplocephala spp. (Anoplocephalidae) + Pseudanoplocephala crawfordi + Hymenolepis spp. (Hymenolepididae) indicating that the family Hymenolepididae is paraphyletic. The complete mt genome of C. megalops is sequenced. Phylogenetic analyses provided an insight into the phylogenetic relationships among the families Anoplocephalidae, Hymenolepididae, Dipylidiidae and Taeniidae. This novel genomic information also provides the opportunity to develop useful genetic markers for studying the molecular epidemiology, biology, genetics and systematics of C. megalops.

  18. Brief communication: Artificial cranial modification in Kow Swamp and Cohuna.

    PubMed

    Durband, Arthur C

    2014-09-01

    The crania from Kow Swamp and Cohuna have been important for a number of debates in Australian paleoanthropology. These crania typically have long, flat foreheads that many workers have cited as evidence of genetic continuity with archaic Indonesian populations, particularly the Ngandong sample. Other scientists have alleged that at least some of the crania from Kow Swamp and the Cohuna skull have been altered through artificial modification, and that the flat foreheads possessed by these individuals are not phylogenetically informative. In this study, several Kow Swamp crania and Cohuna are compared to known modified and unmodified comparative samples. Canonical variates analyses and Mahalanobis distances are generated, and random expectation statistics are used to calculate statistical significance for these tests. The results of this study agree with prior work indicating that a portion of this sample shows evidence for artificial modification of the cranial vault. Many Kow Swamp crania and Cohuna display shape similarities with a population of known modified individuals from New Britain. Kow Swamp 1, 5, and Cohuna show the strongest evidence for modification, but other individuals from this sample also show evidence of culturally manipulated changes in cranial shape. This project provides added support for the argument that at least some Pleistocene Australian groups were practicing artificial cranial modification, and suggests that caution should be used when including these individuals in phylogenetic studies. Copyright © 2014 Wiley Periodicals, Inc.

  19. The spectrum of genomic signatures: from dinucleotides to chaos game representation.

    PubMed

    Wang, Yingwei; Hill, Kathleen; Singh, Shiva; Kari, Lila

    2005-02-14

    In the post genomic era, access to complete genome sequence data for numerous diverse species has opened multiple avenues for examining and comparing primary DNA sequence organization of entire genomes. Previously, the concept of a genomic signature was introduced with the observation of species-type specific Dinucleotide Relative Abundance Profiles (DRAPs); dinucleotides were identified as the subsequences with the greatest bias in representation in a majority of genomes. Herein, we demonstrate that DRAP is one particular genomic signature contained within a broader spectrum of signatures. Within this spectrum, an alternative genomic signature, Chaos Game Representation (CGR), provides a unique visualization of patterns in sequence organization. A genomic signature is associated with a particular integer order or subsequence length that represents a measure of the resolution or granularity in the analysis of primary DNA sequence organization. We quantitatively explore the organizational information provided by genomic signatures of different orders through different distance measures, including a novel Image Distance. The Image Distance and other existing distance measures are evaluated by comparing the phylogenetic trees they generate for 26 complete mitochondrial genomes from a diversity of species. The phylogenetic tree generated by the Image Distance is compatible with the known relatedness of species. Quantitative evaluation of the spectrum of genomic signatures may be used to ultimately gain insight into the determinants and biological relevance of the genome signatures.
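
    As a rough illustration of the Chaos Game Representation named in this record, the sketch below maps a DNA sequence to points in the unit square: starting from the centre, each nucleotide moves the current point halfway towards that nucleotide's corner. The corner assignment (A, C, G, T at (0,0), (0,1), (1,1), (1,0)) is a commonly used convention assumed here, and the function name is hypothetical; the record does not state which layout the authors used.

```python
# Assumed corner layout; other assignments are possible.
CORNERS = {
    "A": (0.0, 0.0),
    "C": (0.0, 1.0),
    "G": (1.0, 1.0),
    "T": (1.0, 0.0),
}


def cgr_points(sequence):
    """Return the CGR coordinates visited while reading a DNA sequence.

    Symbols other than A, C, G and T (for example N) are skipped.
    """
    x, y = 0.5, 0.5  # start at the centre of the unit square
    points = []
    for base in sequence.upper():
        if base not in CORNERS:
            continue
        corner_x, corner_y = CORNERS[base]
        # Move halfway from the current point to the base's corner.
        x, y = (x + corner_x) / 2, (y + corner_y) / 2
        points.append((x, y))
    return points


example_points = cgr_points("ACGT")
```

    Binning these points on a 2^k by 2^k grid yields counts of length-k subsequences, which is one way to relate the picture to the "order" of a genomic signature discussed in the record.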

  20. Effects of ornamentation and phylogeny on the evolution of wing shape in stalk-eyed flies (Diopsidae).

    PubMed

    Husak, J F; Ribak, G; Baker, R H; Rivera, G; Wilkinson, G S; Swallow, J G

    2013-06-01

    Exaggerated male ornaments are predicted to be costly to their bearers, but these negative effects may be offset by the correlated evolution of compensatory traits. However, when locomotor systems, such as wings in flying species, evolve to decrease such costs, it remains unclear whether functional changes across related species are achieved via the same morphological route or via alternate changes that have similar function. We conducted a comparative analysis of wing shape in relation to eye-stalk elongation across 24 species of stalk-eyed flies, using geometric morphometrics to determine how species with increased eye span, a sexually selected trait, have modified wing morphology as a compensatory mechanism. Using traditional and phylogenetically informed multivariate analyses of shape in combination with phenotypic trajectory analysis, we found a strong phylogenetic signal in wing shape. However, dimorphic species possessed shifted wing veins with the result of lengthening and narrowing wings compared to monomorphic species. Dimorphic species also had changes that seem unrelated to wing size, but instead may govern wing flexion. Nevertheless, the lack of a uniform, compensatory pattern suggests that stalk-eyed flies used alternative modifications in wing structure to increase wing area and aspect ratio, thus taking divergent morphological routes to compensate for exaggerated eye stalks. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.

  1. Species diversity driven by morphological and ecological disparity: a case study of comparative seed morphology and anatomy across a large monocot order.

    PubMed

    Benedict, John C; Smith, Selena Y; Specht, Chelsea D; Collinson, Margaret E; Leong-Škorničková, Jana; Parkinson, Dilworth Y; Marone, Federica

    2016-01-01

    Phenotypic variation can be attributed to genetic heritability as well as biotic and abiotic factors. Across Zingiberales, there is a high variation in the number of species per clade and in phenotypic diversity. Factors contributing to this phenotypic variation have never been studied in a phylogenetic or ecological context. Seeds of 166 species from all eight families in Zingiberales were analyzed for 51 characters using synchrotron based 3D X-ray tomographic microscopy to determine phylogenetically informative characters and to understand the distribution of morphological disparity within the order. All families are distinguishable based on seed characters. Non-metric multidimensional scaling analyses show Zingiberaceae occupy the largest seed morphospace relative to the other families, and environmental analyses demonstrate that Zingiberaceae inhabit both temperate and tropical regions, while other Zingiberales are almost exclusively tropical. Temperate species do not cluster in morphospace nor do they share a common suite of character states. This suggests that the diversity seen is not driven by adaptation to temperate niches; rather, the morphological disparity seen likely reflects an underlying genetic plasticity that allowed Zingiberaceae to repeatedly colonize temperate environments. The notable morphoanatomical variety in Zingiberaceae seeds may account for their extraordinary ecological success and high species diversity as compared to other Zingiberales. © The Authors 2016. Published by Oxford University Press on behalf of the Annals of Botany Company.

  2. The chordate proteome history database.

    PubMed

    Levasseur, Anthony; Paganini, Julien; Dainat, Jacques; Thompson, Julie D; Poch, Olivier; Pontarotti, Pierre; Gouret, Philippe

    2012-01-01

    The chordate proteome history database (http://ioda.univ-provence.fr) comprises some 20,000 evolutionary analyses of proteins from chordate species. Our main objective was to characterize and study the evolutionary histories of the chordate proteome, and in particular to detect genomic events and automatic functional searches. Firstly, phylogenetic analyses based on high quality multiple sequence alignments and a robust phylogenetic pipeline were performed for the whole protein and for each individual domain. Novel approaches were developed to identify orthologs/paralogs, and predict gene duplication/gain/loss events and the occurrence of new protein architectures (domain gains, losses and shuffling). These important genetic events were localized on the phylogenetic trees and on the genomic sequence. Secondly, the phylogenetic trees were enhanced by the creation of phylogroups, whereby groups of orthologous sequences created using OrthoMCL were corrected based on the phylogenetic trees; gene family size and gene gain/loss in a given lineage could be deduced from the phylogroups. For each ortholog group obtained from the phylogenetic or the phylogroup analysis, functional information and expression data can be retrieved. Database searches can be performed easily using biological objects: protein identifier, keyword or domain, but can also be based on events; e.g., domain exchange events can be retrieved. To our knowledge, this is the first database that links group clustering, phylogeny and automatic functional searches along with the detection of important events occurring during genome evolution, such as the appearance of a new domain architecture.

  3. Phylogeny and evolutionary histories of Pyrus L. revealed by phylogenetic trees and networks based on data from multiple DNA sequences.

    PubMed

    Zheng, Xiaoyan; Cai, Danying; Potter, Daniel; Postman, Joseph; Liu, Jing; Teng, Yuanwen

    2014-11-01

    Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence datasets. Phylogenetic trees based on both cpDNA and nuclear LFY2int2-N (LN) data resulted in poor resolution; in particular, only five primary species were monophyletic in the LN tree. A phylogenetic network of LN suggested that reticulation caused by hybridization is one of the major evolutionary processes for Pyrus species. Polytomies of the gene trees and star-like structure of cpDNA networks suggested rapid radiation is another major evolutionary process, especially for the occidental species. Pyrus calleryana and P. regelii were the earliest diverged Pyrus species. Two North African species, P. cordata, P. spinosa and P. betulaefolia were descendants of primitive stock Pyrus species and still share some common molecular characters. Southwestern China, where a large number of P. pashia populations are found, is probably the most important diversification center of Pyrus. More accessions and nuclear genes are needed for further understanding the evolutionary histories of Pyrus. Copyright © 2014 Elsevier Inc. All rights reserved.

  4. Diversity of Phylogenetic Information According to the Locus and the Taxonomic Level: An Example from a Parasitic Mesostigmatid Mite Genus

    PubMed Central

    Roy, Lise; Dowling, Ashley P.G.; Chauve, Claude Marie; Buronfosse, Thierry

    2010-01-01

    Molecular markers for cladistic analyses may perform differently according to the taxonomic group considered and the historical level under investigation. Here we evaluate the phylogenetic potential of five different markers for resolving evolutionary relationships within the ectoparasitic genus Dermanyssus at the species level, and their ability to address questions about the evolution of specialization. COI provided 9–18% divergence between species (up to 9% within species), 16S rRNA 10–16% (up to 4% within species), ITS1 and 2 2–9% (up to 1% within species) and Tropomyosin intron n 8–20% (up to 6% within species). EF-1α revealed different non-orthologous copies within individuals of Dermanyssus and Ornithonyssus. Tropomyosin intron n was shown to contain consistent phylogenetic signal at the species level within Dermanyssus and represents a promising marker for future phylogenetic studies of Acari. Phylogenetic analyses revealed that the generalist condition is apomorphic and D. gallinae might represent a complex of hybridized lineages. The split into hirsutus-group and gallinae-group in Dermanyssus does not seem to be appropriate based upon these results and D. longipes appears to be composed of two different entities. PMID:20480038

  5. Phylogenetic mixtures and linear invariants for equal input models.

    PubMed

    Casanellas, Marta; Steel, Mike

    2017-04-01

    The reconstruction of phylogenetic trees from molecular sequence data relies on modelling site substitutions by a Markov process, or a mixture of such processes. In general, allowing mixed processes can result in different tree topologies becoming indistinguishable from the data, even for infinitely long sequences. However, when the underlying Markov process supports linear phylogenetic invariants, then provided these are sufficiently informative, the identifiability of the tree topology can be restored. In this paper, we investigate a class of processes that support linear invariants once the stationary distribution is fixed, the 'equal input model'. This model generalizes the 'Felsenstein 1981' model (and thereby the Jukes-Cantor model) from four states to an arbitrary number of states (finite or infinite), and it can also be described by a 'random cluster' process. We describe the structure and dimension of the vector spaces of phylogenetic mixtures and of linear invariants for any fixed phylogenetic tree (and for all trees-the so called 'model invariants'), on any number n of leaves. We also provide a precise description of the space of mixtures and linear invariants for the special case of [Formula: see text] leaves. By combining techniques from discrete random processes and (multi-) linear algebra, our results build on a classic result that was first established by James Lake (Mol Biol Evol 4:167-191, 1987).
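
    As an illustration of the 'equal input model' described in this abstract, the following is a minimal sketch in which the rate of change from any state i to a different state j is taken to be the stationary frequency of j. The unit rate scale and the function names are assumptions made for the example; they are not taken from the paper. With uniform frequencies on four states the matrix has the Jukes-Cantor form.

```python
import numpy as np

def equal_input_rate_matrix(pi):
    """Rate matrix Q with Q[i, j] = pi[j] for i != j (assumed unit rate scale)."""
    pi = np.asarray(pi, dtype=float)
    if np.any(pi <= 0) or not np.isclose(pi.sum(), 1.0):
        raise ValueError("pi must be a strictly positive probability vector")
    k = len(pi)
    Q = np.tile(pi, (k, 1))              # every row is a copy of pi
    np.fill_diagonal(Q, 0.0)             # no rate from a state to itself
    np.fill_diagonal(Q, -Q.sum(axis=1))  # diagonal makes each row sum to zero
    return Q

def transition_matrix(pi, t):
    """Closed form of exp(tQ): exp(-t) * I + (1 - exp(-t)) * (rows of pi)."""
    pi = np.asarray(pi, dtype=float)
    k = len(pi)
    decay = np.exp(-t)
    return decay * np.eye(k) + (1.0 - decay) * np.tile(pi, (k, 1))

if __name__ == "__main__":
    pi = [0.1, 0.2, 0.3, 0.4]
    print(equal_input_rate_matrix(pi))
    print(transition_matrix(pi, 0.5))
```

    The closed form holds because Q equals a rank-one projection (every row equal to pi) minus the identity, so the matrix exponential reduces to a mixture of "no change" and "resample from pi".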

  6. Effects of species' similarity and dominance on the functional and phylogenetic structure of a plant meta-community.

    PubMed

    Chalmandrier, L; Münkemüller, T; Lavergne, S; Thuiller, W

    2015-01-01

    Different assembly processes drive the spatial structure of meta-communities (beta-diversity). Recently, functional and phylogenetic diversities have been suggested as indicators of these assembly processes. Assuming that diversity is a good proxy for niche overlap, high beta-diversity along environmental gradients should be the result of environmental filtering while low beta-diversity should stem from competitive interactions. So far, studies trying to disentangle the relative importance of these assembly processes have provided mixed results. One reason for this may be that these studies often rely on a single measure of diversity and thus implicitly make a choice on how they account for species relative abundances and how species similarities are captured by functional traits or phylogeny. Here, we tested the effect of gradually scaling the importance of dominance (the weight given to dominant vs. rare species) and species similarity (the weight given to small vs. large similarities) on resulting beta-diversity patterns of an alpine plant meta-community. To this end, we combined recent extensions of the Hill numbers framework with Pagel's phylogenetic tree transformation approach. We included functional (based on the leaf-height-seed spectrum) and phylogenetic facets of beta-diversity in our analysis and explicitly accounted for effects of environmental and spatial covariates. We found that functional beta-diversity was high when the same weight was given to dominant vs. rare species and to large vs. small species' similarities. In contrast, phylogenetic beta-diversity was low when greater weight was given to dominant species and small species' similarities. Those results suggested that different environments along the gradients filtered different species according to their functional traits, while the same competitive lineages dominated communities across the gradients. Our results highlight that functional vs. phylogenetic facets, presence-absence vs. abundance structure and different weights of species' dissimilarity provide complementary and important information on the drivers of meta-community structure. By utilizing the full extent of information provided by the flexible frameworks of Hill numbers and Pagel's tree transformation, we propose a new approach to disentangle the patterns resulting from different assembly processes.
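
    To make the role of the dominance weight concrete, here is a minimal sketch of the basic Hill number of order q computed from species abundances. It covers only the plain abundance-weighted form; the similarity-sensitive extensions and Pagel's tree transformation used in the study are not implemented, and the function name is an assumption made for the example.

```python
import numpy as np

def hill_number(abundances, q):
    """Effective number of species of order q (basic Hill number).

    q = 0 counts species (richness), q = 1 is the exponential of Shannon
    entropy, and q = 2 is the inverse Simpson index. Larger q gives more
    weight to dominant species.
    """
    p = np.asarray(abundances, dtype=float)
    p = p[p > 0]                 # absent species do not contribute
    p = p / p.sum()              # convert counts to relative abundances
    if np.isclose(q, 1.0):       # limit of the general formula as q -> 1
        return float(np.exp(-np.sum(p * np.log(p))))
    return float(np.sum(p ** q) ** (1.0 / (1.0 - q)))

if __name__ == "__main__":
    uneven = [90, 5, 5]
    for q in (0, 1, 2):
        print(q, hill_number(uneven, q))
```

    For a perfectly even community every order returns the species count, so differences between orders reflect how unevenly abundance is distributed.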

  7. Insights into the phylogeny of Northern Hemisphere Armillaria: Neighbor-net and Bayesian analyses of translation elongation factor 1-α gene sequences.

    PubMed

    Klopfenstein, Ned B; Stewart, Jane E; Ota, Yuko; Hanna, John W; Richardson, Bryce A; Ross-Davis, Amy L; Elías-Román, Rubén D; Korhonen, Kari; Keča, Nenad; Iturritxa, Eugenia; Alvarado-Rosales, Dionicio; Solheim, Halvor; Brazee, Nicholas J; Łakomy, Piotr; Cleary, Michelle R; Hasegawa, Eri; Kikuchi, Taisei; Garza-Ocañas, Fortunato; Tsopelas, Panaghiotis; Rigling, Daniel; Prospero, Simone; Tsykun, Tetyana; Bérubé, Jean A; Stefani, Franck O P; Jafarpour, Saeideh; Antonín, Vladimír; Tomšovský, Michal; McDonald, Geral I; Woodward, Stephen; Kim, Mee-Sook

    2017-01-01

    Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence-based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation elongation factor 1-α (tef1) sequences are highly informative for phylogenetic analysis of Armillaria species within diverse global regions. This study used Neighbor-net and coalescence-based Bayesian analyses to examine phylogenetic relationships of newly determined and existing tef1 sequences derived from diverse Armillaria species from across the Northern Hemisphere, with Southern Hemisphere Armillaria species included for reference. Based on the Bayesian analysis of tef1 sequences, Armillaria species from the Northern Hemisphere are generally contained within the following four superclades, which are named according to the specific epithet of the most frequently cited species within the superclade: (i) Socialis/Tabescens (exannulate) superclade including Eurasian A. ectypa, North American A. socialis (A. tabescens), and Eurasian A. socialis (A. tabescens) clades; (ii) Mellea superclade including undescribed annulate North American Armillaria sp. (Mexico) and four separate clades of A. mellea (Europe and Iran, eastern Asia, and two groups from North America); (iii) Gallica superclade including Armillaria Nag E (Japan), multiple clades of A. gallica (Asia and Europe), A. calvescens (eastern North America), A. cepistipes (North America), A. altimontana (western USA), A. nabsnona (North America and Japan), and at least two A. gallica clades (North America); and (iv) Solidipes/Ostoyae superclade including two A. solidipes/ostoyae clades (North America), A. gemina (eastern USA), A. solidipes/ostoyae (Eurasia), A. cepistipes (Europe and Japan), A. sinapina (North America and Japan), and A. borealis (Eurasia) clade 2. Of note is that A. borealis (Eurasia) clade 1 appears basal to the Solidipes/Ostoyae and Gallica superclades. The Neighbor-net analysis showed similar phylogenetic relationships. This study further demonstrates the utility of tef1 for global phylogenetic studies of Armillaria species and provides critical insights into multiple taxonomic issues that warrant further study.

  8. Ribosomal DNA sequence heterogeneity reflects intraspecies phylogenies and predicts genome structure in two contrasting yeast species.

    PubMed

    West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N

    2014-07-01

    The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of closely related organisms, and discuss how it could be extended to future studies of multilocus rDNA systems. [concerted evolution; genome hybridisation; phylogenetic analysis; ribosomal DNA; whole genome sequencing; yeast]. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  9. A guide to phylogenetic metrics for conservation, community ecology and macroecology.

    PubMed

    Tucker, Caroline M; Cadotte, Marc W; Carvalho, Silvia B; Davies, T Jonathan; Ferrier, Simon; Fritz, Susanne A; Grenyer, Rich; Helmus, Matthew R; Jin, Lanna S; Mooers, Arne O; Pavoine, Sandrine; Purschke, Oliver; Redding, David W; Rosauer, Dan F; Winter, Marten; Mazel, Florent

    2017-05-01

    The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub-disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub-disciplines hampers potential meta-analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo-diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo-diversity metrics based on their mathematical form within these three dimensions and identify 'anchor' representatives: for α-diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. © 2016 The Authors. Biological Reviews published by John Wiley & Sons Ltd on behalf of Cambridge Philosophical Society.
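
    The three 'anchor' α-diversity metrics named in this abstract can be sketched on a toy tree as follows. The data layout is hypothetical: branches are given as (length, set of descendant tips) pairs and patristic distances as a dictionary keyed by tip pairs. Two choices are assumptions of this sketch rather than statements about the paper: PD is the rooted version (branches on the path to the root are counted), and VPD is computed as the population variance of the pairwise distances.

```python
from itertools import combinations
from statistics import mean, pvariance

def pairwise_distances(dist, taxa):
    """Patristic distances for every unordered pair of the given taxa."""
    return [dist[frozenset(pair)] for pair in combinations(sorted(taxa), 2)]

def faith_pd(branches, taxa):
    """Richness: total length of branches with at least one descendant in taxa."""
    taxa = set(taxa)
    return sum(length for length, tips in branches if tips & taxa)

def mpd(dist, taxa):
    """Divergence: mean pairwise distance among the taxa."""
    return mean(pairwise_distances(dist, taxa))

def vpd(dist, taxa):
    """Regularity: variance of the pairwise distances (population variance)."""
    return pvariance(pairwise_distances(dist, taxa))

# Toy rooted tree ((A:1,B:1):1,C:2), written out by hand.
BRANCHES = [(1.0, {"A"}), (1.0, {"B"}), (1.0, {"A", "B"}), (2.0, {"C"})]
DIST = {
    frozenset({"A", "B"}): 2.0,
    frozenset({"A", "C"}): 4.0,
    frozenset({"B", "C"}): 4.0,
}

if __name__ == "__main__":
    community = {"A", "B", "C"}
    print(faith_pd(BRANCHES, community), mpd(DIST, community), vpd(DIST, community))
```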

  10. Phylogenomic evidence for a recent and rapid radiation of lizards in the Patagonian Liolaemus fitzingerii species group.

    PubMed

    Grummer, Jared A; Morando, Mariana M; Avila, Luciano J; Sites, Jack W; Leaché, Adam D

    2018-08-01

    Rapid evolutionary radiations are difficult to resolve because divergence events are nearly synchronous and gene flow among nascent species can be high, resulting in a phylogenetic "bush". Large datasets composed of sequence loci from across the genome can potentially help resolve some of these difficult phylogenetic problems. A suitable test case is the Liolaemus fitzingerii species group of lizards, which includes twelve species that are broadly distributed in Argentinean Patagonia. The species in the group have had a complex evolutionary history that has led to high morphological variation and unstable taxonomy. We generated a sequence capture dataset for 28 ingroup individuals of 580 nuclear loci, alongside a mitogenomic dataset, to infer phylogenetic relationships among species in this group. Relationships among species were generally weakly supported with the nuclear data, and along with an inferred age of ∼2.6 million years old, indicate either rapid evolution, hybridization, incomplete lineage sorting, non-informative data, or a combination thereof. We inferred a signal of mito-nuclear discordance, indicating potential hybridization between L. melanops and L. martorii, and phylogenetic network analyses provided support for 5 reticulation events among species. Phasing the nuclear loci did not provide additional insight into relationships or suspected patterns of hybridization. Only one clade, composed of L. camarones, L. fitzingerii, and L. xanthoviridis was recovered across all analyses. Genomic datasets provide molecular systematists with new opportunities to resolve difficult phylogenetic problems, yet the lack of phylogenetic resolution in Patagonian Liolaemus is biologically meaningful and indicative of a recent and rapid evolutionary radiation. The phylogenetic relationships of the Liolaemus fitzingerii group may be best modeled as a reticulated network instead of a bifurcating phylogeny. Copyright © 2018 Elsevier Inc. All rights reserved.

  11. A guide to phylogenetic metrics for conservation, community ecology and macroecology

    PubMed Central

    Cadotte, Marc W.; Carvalho, Silvia B.; Davies, T. Jonathan; Ferrier, Simon; Fritz, Susanne A.; Grenyer, Rich; Helmus, Matthew R.; Jin, Lanna S.; Mooers, Arne O.; Pavoine, Sandrine; Purschke, Oliver; Redding, David W.; Rosauer, Dan F.; Winter, Marten; Mazel, Florent

    2016-01-01

    ABSTRACT The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub‐disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub‐disciplines hampers potential meta‐analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo‐diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo‐diversity metrics based on their mathematical form within these three dimensions and identify ‘anchor’ representatives: for α‐diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. PMID:26785932

  12. YBYRÁ facilitates comparison of large phylogenetic trees.

    PubMed

    Machado, Denis Jacob

    2015-07-01

    The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, especially for large data sets. The programs that comprise this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .
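
    The idea of a topological distance based on shared clades can be sketched as below. This is not YBYRÁ's code and does not reproduce its matching split distance (MSdist); it is a generic clade-difference count on rooted trees, with trees written as nested tuples of tip names purely for the sake of the example.

```python
def clades(tree):
    """Return the set of clades (as frozensets of tip names) in a rooted tree.

    A tree is a tip name (str) or a tuple of subtrees.
    """
    found = set()

    def collect(node):
        if isinstance(node, str):        # a tip contributes only its own name
            return frozenset([node])
        tips = frozenset().union(*(collect(child) for child in node))
        found.add(tips)                  # record the clade below this node
        return tips

    collect(tree)
    return found

def clade_distance(tree_a, tree_b):
    """Number of clades present in exactly one of the two trees."""
    return len(clades(tree_a) ^ clades(tree_b))

if __name__ == "__main__":
    t1 = (("A", "B"), ("C", "D"))
    t2 = (("A", "C"), ("B", "D"))
    print(clade_distance(t1, t2))
```

    Because clades are stored as sets of tip names, the order in which children are written does not affect the result, and the root clade is shared by any two trees on the same tips.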

  13. Phylogenetic study of Geitlerinema and Microcystis (Cyanobacteria) using PC-IGS and 16S-23S ITS as markers: investigation of horizontal gene transfer.

    PubMed

    Piccin-Santos, Viviane; Brandão, Marcelo Mendes; Bittencourt-Oliveira, Maria Do Carmo

    2014-08-01

    Selection of genes that have not been horizontally transferred for prokaryote phylogenetic inferences is regarded as a challenging task. The markers internal transcribed spacer of ribosomal genes (16S-23S ITS) and phycocyanin intergenic spacer (PC-IGS), based on the operons of ribosomal and phycocyanin genes respectively, are among the most used markers in cyanobacteria. The region of the ribosomal genes has been considered stable, whereas the phycocyanin operon may have undergone horizontal transfer. To investigate the occurrence of horizontal transfer of PC-IGS, phylogenetic trees of Geitlerinema and Microcystis strains were generated using PC-IGS and 16S-23S ITS and compared. Phylogenetic trees based on the two markers were mostly congruent for Geitlerinema and Microcystis, indicating a common evolutionary history among ribosomal and phycocyanin genes with no evidence for horizontal transfer of PC-IGS. Thus, PC-IGS is a suitable marker, along with 16S-23S ITS for phylogenetic studies of cyanobacteria. © 2014 Phycological Society of America.

  14. Variance Component Selection With Applications to Microbiome Taxonomic Data.

    PubMed

    Zhai, Jing; Kim, Juhyun; Knox, Kenneth S; Twigg, Homer L; Zhou, Hua; Zhou, Jin J

    2018-01-01

    High-throughput sequencing technology has enabled population-based studies of the role of the human microbiome in disease etiology and exposure response. Microbiome data are summarized as counts or composition of the bacterial taxa at different taxonomic levels. An important problem is to identify the bacterial taxa that are associated with a response. One method is to test the association of a specific taxon with phenotypes in a linear mixed effect model, which incorporates phylogenetic information among bacterial communities. Another type of approach considers all taxa in a joint model and achieves selection via a penalization method, which ignores phylogenetic information. In this paper, we consider regression analysis by treating bacterial taxa at different levels as multiple random effects. For each taxon, a kernel matrix is calculated based on distance measures in the phylogenetic tree and acts as one variance component in the joint model. Then taxonomic selection is achieved by the lasso (least absolute shrinkage and selection operator) penalty on variance components. Our method integrates biological information into the variable selection problem and greatly improves selection accuracies. Simulation studies demonstrate the superiority of our methods versus existing methods, for example, group-lasso. Finally, we apply our method to a longitudinal microbiome study of Human Immunodeficiency Virus (HIV) infected patients. We implement our method using the high performance computing language Julia. Software and detailed documentation are freely available at https://github.com/JingZhai63/VCselection.
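
    One common way to turn a distance matrix into a kernel matrix is double centering of the squared distances, followed by removal of any negative eigenvalues so the result is positive semidefinite. The sketch below shows that construction in Python; whether the paper uses exactly this transformation is an assumption, and the function name is hypothetical. The authors' own Julia software is at the URL given in the abstract.

```python
import numpy as np

def distance_to_kernel(D):
    """Kernel K = -1/2 * J * D^2 * J with J = I - 11'/n, made positive semidefinite."""
    D = np.asarray(D, dtype=float)
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    K = -0.5 * J @ (D ** 2) @ J              # double-centered squared distances
    w, V = np.linalg.eigh((K + K.T) / 2.0)   # symmetrize, then eigendecompose
    w = np.clip(w, 0.0, None)                # drop negative eigenvalues
    return (V * w) @ V.T                     # rebuild the kernel

if __name__ == "__main__":
    x = np.array([0.0, 1.0, 3.0])            # three samples placed on a line
    D = np.abs(x[:, None] - x[None, :])      # their pairwise distances
    print(distance_to_kernel(D))
```

    For distances that come from points in Euclidean space, as in the usage example, the result equals the Gram matrix of the centered points, which gives a simple check on the construction.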

  15. Sampling strategies for improving tree accuracy and phylogenetic analyses: a case study in ciliate protists, with notes on the genus Paramecium.

    PubMed

    Yi, Zhenzhen; Strüder-Kypke, Michaela; Hu, Xiaozhong; Lin, Xiaofeng; Song, Weibo

    2014-02-01

    In order to assess how dataset-selection for multi-gene analyses affects the accuracy of inferred phylogenetic trees in ciliates, we chose five genes and the genus Paramecium, one of the most widely used model protist genera, and compared tree topologies of the single- and multi-gene analyses. Our empirical study shows that: (1) Using multiple genes improves phylogenetic accuracy, even when their one-gene topologies are in conflict with each other. (2) The impact of missing data on phylogenetic accuracy is ambiguous: resolution power and topological similarity, but not number of represented taxa, are the most important criteria of a dataset for inclusion in concatenated analyses. (3) As an example, we tested the three classification models of the genus Paramecium with a multi-gene based approach, and only the monophyly of the subgenus Paramecium is supported. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. The ovary structure and oogenesis in the basal crustaceans and hexapods. Possible phylogenetic significance.

    PubMed

    Jaglarz, Mariusz K; Kubrakiewicz, Janusz; Bilinski, Szczepan M

    2014-07-01

    Recent large-scale phylogenetic analyses of exclusively molecular or combined molecular and morphological characters support a close relationship between Crustacea and Hexapoda. The growing consensus on this phylogenetic link is reflected in uniting both taxa under the name Pancrustacea or Tetraconata. Several recent molecular phylogenies have also indicated that the monophyletic hexapods should be nested within paraphyletic crustaceans. However, it is still contentious exactly which crustacean taxon is the sister group to Hexapoda. Among the favored candidates are Branchiopoda, Malacostraca, Remipedia and Xenocarida (Remipedia + Cephalocarida). In this context, we review morphological and ultrastructural features of the ovary architecture and oogenesis in these crustacean groups in search of traits potentially suitable for phylogenetic considerations. We have identified a suite of morphological characters which may prove useful in further comparative studies. Copyright © 2014 Elsevier Ltd. All rights reserved.

  17. Urbanisation and the loss of phylogenetic diversity in birds.

    PubMed

    Sol, Daniel; Bartomeus, Ignasi; González-Lagos, César; Pavoine, Sandrine

    2017-06-01

    Despite the recognised conservation value of phylogenetic diversity, little is known about how it is affected by the urbanisation process. Combining a complete avian phylogeny with surveys along urbanisation gradients from five continents, we show that highly urbanised environments supported on average 450 million fewer years of evolutionary history than the surrounding natural environments. This loss was primarily caused by species loss and could have been higher had it not been partially compensated by the addition of urban exploiters and some exotic species. Highly urbanised environments also supported fewer evolutionarily distinctive species, implying a disproportionate loss of evolutionary history. Compared with highly urbanised environments, changes in phylogenetic richness and evolutionary distinctiveness were less substantial in moderately urbanised environments. Protecting pristine environments is therefore essential for maintaining phylogenetic diversity, but moderate levels of urbanisation still preserve much of the original diversity. © 2017 John Wiley & Sons Ltd/CNRS.

  18. [Phylogenetic analysis of closely related Leuconostoc citreum species based on partial housekeeping genes].

    PubMed

    Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong

    2013-07-04

    Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationship among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR methods to amplify the dnaA, murC and pyrG gene sequences, which were determined to assess their suitability as phylogenetic markers. Then, we estimated the genetic distances and constructed phylogenetic trees from the 16S rRNA gene and the three above-mentioned housekeeping genes, combined with published corresponding sequences. Comparison of the phylogenetic trees showed that the topologies of the three housekeeping gene trees were consistent with that of the 16S rRNA gene tree. The homology of the dnaA, murC, pyrG and 16S rRNA gene sequences among closely related Leu. citreum species differed, ranging from 75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% to 100%, respectively. The phylogenetic relationships inferred from the three housekeeping gene sequences were highly consistent with the results from the 16S rRNA gene sequence, while the genetic distances of these housekeeping genes were much higher than those of the 16S rRNA gene. Consequently, the dnaA, murC and pyrG genes are suitable for classification and identification of closely related Leu. citreum species.

  19. Phylogenetic congruence between subtropical trees and their associated fungi.

    PubMed

    Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao

    2016-12-01

    Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK, rbcL, atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.

  20. Evolution of specialization: a phylogenetic study of host range in the red milkweed beetle (Tetraopes tetraophthalmus).

    PubMed

    Rasmann, Sergio; Agrawal, Anurag A

    2011-06-01

    Specialization is common in most lineages of insect herbivores, one of the most diverse groups of organisms on earth. To address how and why specialization is maintained over evolutionary time, we hypothesized that plant defense and other ecological attributes of potential host plants would predict the performance of a specialist root-feeding herbivore (the red milkweed beetle, Tetraopes tetraophthalmus). Using a comparative phylogenetic and functional trait approach, we assessed the determinants of insect host range across 18 species of Asclepias. Larval survivorship decreased with increasing phylogenetic distance from the true host, Asclepias syriaca, suggesting that adaptation to plant traits drives specialization. Among several root traits measured, only cardenolides (toxic defense chemicals) correlated with larval survival, and cardenolides also explained the phylogenetic distance effect in phylogenetically controlled multiple regression analyses. Additionally, milkweed species having a known association with other Tetraopes beetles were better hosts than species lacking Tetraopes herbivores, and milkweeds with specific leaf area values (a trait related to leaf function and habitat affiliation) similar to those of A. syriaca were better hosts than species having divergent values. We thus conclude that phylogenetic distance is an integrated measure of phenotypic and ecological attributes of Asclepias species, especially defensive cardenolides, which can be used to explain specialization and constraints on host shifts over evolutionary time.

  1. Genes with minimal phylogenetic information are problematic for coalescent analyses when gene tree estimation is biased.

    PubMed

    Xi, Zhenxiang; Liu, Liang; Davis, Charles C

    2015-11-01

    The development and application of coalescent methods are undergoing rapid changes. One little explored area that bears on the application of gene-tree-based coalescent methods to species tree estimation is gene informativeness. Here, we investigate the accuracy of these coalescent methods when genes have minimal phylogenetic information, including the implementation of the multilocus bootstrap approach. Using simulated DNA sequences, we demonstrate that genes with minimal phylogenetic information can produce unreliable gene trees (i.e., high error in gene tree estimation), which may in turn reduce the accuracy of species tree estimation using gene-tree-based coalescent methods. We demonstrate that this problem can be alleviated by sampling more genes, as is commonly done in large-scale phylogenomic analyses. This applies even when these genes are minimally informative. If gene tree estimation is biased, however, gene-tree-based coalescent analyses will produce inconsistent results, which cannot be remedied by increasing the number of genes. In this case, it is not the gene-tree-based coalescent methods that are flawed, but rather the input data (i.e., estimated gene trees). Along these lines, the commonly used program PhyML has a tendency to infer one particular bifurcating topology even though it is best represented as a polytomy. We additionally corroborate these findings by analyzing the 183-locus mammal data set assembled by McCormack et al. (2012) using ultra-conserved elements (UCEs) and flanking DNA. Lastly, we demonstrate that when employing the multilocus bootstrap approach on this 183-locus data set, there is no strong conflict between species trees estimated from concatenation and gene-tree-based coalescent analyses, as has been previously suggested by Gatesy and Springer (2014). Copyright © 2015 Elsevier Inc. All rights reserved.

  2. Phylogenetics and Differentiation of Salmonella Newport Lineages by Whole Genome Sequencing

    PubMed Central

    Cao, Guojie; Meng, Jianghong; Strain, Errol; Stones, Robert; Pettengill, James; Zhao, Shaohua; McDermott, Patrick; Brown, Eric; Allard, Marc

    2013-01-01

    Salmonella Newport has ranked in the top three Salmonella serotypes associated with foodborne outbreaks from 1995 to 2011 in the United States. In the current study, we selected 26 S. Newport strains isolated from diverse sources and geographic locations and then conducted 454 shotgun pyrosequencing procedures to obtain 16–24× coverage of high quality draft genomes for each strain. Comparative genomic analysis of 28 S. Newport strains (including 2 reference genomes) and 15 outgroup genomes identified more than 140,000 informative SNPs. A resulting phylogenetic tree consisted of four sublineages and indicated that S. Newport had a clear geographic structure. Strains from Asia were divergent from those from the Americas. Our findings demonstrated that analysis using whole genome sequencing data resulted in a more accurate picture of phylogeny compared to that using single genes or small sets of genes. We selected loci around the mutS gene of S. Newport to differentiate distinct lineages, including those between invH and mutS genes at the 3′ end of Salmonella Pathogenicity Island 1 (SPI-1), ste fimbrial operon, and Clustered, Regularly Interspaced, Short Palindromic Repeats (CRISPR) associated-proteins (cas). These genes in the outgroup genomes held high similarity with either S. Newport Lineage II or III at the same loci. S. Newport Lineages II and III have different evolutionary histories in this region and our data demonstrated genetic flow and homologous recombination events around mutS. The findings suggested that S. Newport Lineages II and III diverged early in the serotype evolution and have evolved largely independently. Moreover, we identified genes that could delineate sublineages within the phylogenetic tree and that could be used as potential biomarkers for trace-back investigations during outbreaks. Thus, whole genome sequencing data enabled us to better understand the genetic background of pathogenicity and evolutionary history of S. Newport and also provided additional markers for epidemiological response. PMID:23409020

  3. Comparative chloroplast genomics and phylogenetics of Fagopyrum esculentum ssp. ancestrale – A wild ancestor of cultivated buckwheat

    PubMed Central

    Logacheva, Maria D; Samigullin, Tahir H; Dhingra, Amit; Penin, Aleksey A

    2008-01-01

    Background Chloroplast genome sequences are extremely informative about species interrelationships owing to their non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent. Results We report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in the buckwheat chloroplast genome are similar to those of Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the Bayesian phylogeny inference based on the first two codon positions Amborella united with Nymphaeales. Conclusion Comparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences. Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales, supports the sister relationship between Caryophyllales and asterids. PMID:18492277

  4. A comparison of chloroplast genome sequences in Aconitum (Ranunculaceae): a traditional herbal medicinal genus

    PubMed Central

    Yao, Gang

    2017-01-01

    The herbal medicinal genus Aconitum L., belonging to the Ranunculaceae family, represents the earliest diverging lineage within the eudicots. It currently comprises two subgenera, A. subgenus Lycoctonum and A. subg. Aconitum. The complete chloroplast (cp) genome sequences were characterized in three species: A. angustius, A. finetianum, and A. sinomontanum in subg. Lycoctonum and compared to other Aconitum species to clarify their phylogenetic relationship and provide molecular information for utilization of Aconitum species particularly in Eastern Asia. The lengths of the chloroplast genome sequences were 156,109 bp in A. angustius, 155,625 bp in A. finetianum and 157,215 bp in A. sinomontanum, with each species possessing 126 genes with 84 protein coding genes (PCGs). While genomic rearrangements were absent, structural variation was detected in the LSC/IR/SSC boundaries. Five pseudogenes were identified, among which Ψrps19 and Ψycf1 were in the LSC/IR/SSC boundaries, Ψrps16 and ΨinfA in the LSC region, and Ψycf15 in the IRb region. The nucleotide variability (Pi) of Aconitum was estimated to be 0.00549, with comparatively higher variations in the LSC and SSC than the IR regions. Eight intergenic regions were revealed to be highly variable and a total of 58–62 simple sequence repeats (SSRs) were detected in all three species. More than 80% of SSRs were present in the LSC region. Altogether, 64.41% and 46.81% of SSRs are mononucleotides in subg. Lycoctonum and subg. Aconitum, respectively, while a higher percentage of di-, tri-, tetra-, and penta- SSRs were present in subg. Aconitum. Most species of subg. Aconitum in Eastern Asia were first used for phylogenetic analyses. The availability of the complete cp genome sequences of these species in subg. Lycoctonum will benefit future phylogenetic analyses and aid in germplasm utilization in Aconitum species. PMID:29134154

  5. Phylogeny, ecology, and heart position in snakes.

    PubMed

    Gartner, Gabriel E A; Hicks, James W; Manzani, Paulo R; Andrade, Denis V; Abe, Augusto S; Wang, Tobias; Secor, Stephen M; Garland, Theodore

    2010-01-01

    The cardiovascular system of all animals is affected by gravitational pressure gradients, the intensity of which varies according to organismic features, behavior, and habitat occupied. A previous nonphylogenetic analysis of heart position in snakes (which often assume vertical postures) found the heart located 15%-25% of total body length from the head in terrestrial and arboreal species but 25%-45% in aquatic species. It was hypothesized that a more anterior heart in arboreal species served to reduce the hydrostatic blood pressure when these animals adopt vertical postures during climbing, whereas an anterior heart position would not be needed in aquatic habitats, where the effects of gravity are less pronounced. We analyzed a new data set of 155 species from five major families of Alethinophidia (one of the two major branches of snakes, the other being blind snakes, Scolecophidia) using both conventional and phylogenetically based statistical methods. General linear models regressing log10 snout-heart position on log10 snout-vent length (SVL), as well as dummy variables coding for habitat and/or clade, were compared using likelihood ratio tests and the Akaike Information Criterion. Heart distance to the tip of the snout scaled isometrically with SVL. In all instances, phylogenetic models that incorporated transformation of the branch lengths under an Ornstein-Uhlenbeck model of evolution (to mimic stabilizing selection) better fit the data as compared with their nonphylogenetic counterparts. The best-fit model predicting snake heart position included aspects of both habitat and clade and indicated that arboreal snakes in our study tend to have hearts placed more posteriorly, opposite the trend identified in previous studies. Phylogenetic signal in relative heart position was apparent both within and among clades. Our results suggest that overcoming gravitational pressure gradients in snakes most likely involves the combined action of several cardiovascular and behavioral adaptations in addition to alterations in relative heart location.
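
    The model comparison mentioned in this abstract relies on the Akaike Information Criterion, AIC = 2k - 2 ln L, where k is the number of fitted parameters and L the maximized likelihood. The following is a minimal sketch of that ranking step only; the model labels, log-likelihoods, and parameter counts are hypothetical placeholders, not values from the study.

```python
def aic(log_likelihood, n_params):
    """Akaike Information Criterion: lower values indicate a better trade-off."""
    return 2 * n_params - 2 * log_likelihood

def rank_by_aic(models):
    """models maps a label to (log-likelihood, number of parameters)."""
    scores = {name: aic(lnl, k) for name, (lnl, k) in models.items()}
    best = min(scores.values())
    # Report each model with its AIC and its difference from the best model.
    return sorted(((n, s, s - best) for n, s in scores.items()), key=lambda r: r[1])

# Hypothetical fits: a nonphylogenetic regression and an OU-transformed one.
fits = {"nonphylogenetic": (-120.0, 3), "phylogenetic_OU": (-110.0, 4)}
ranking = rank_by_aic(fits)
```

    In this toy input the extra parameter of the second model is outweighed by its higher likelihood, so it ranks first.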

  6. A comparison of chloroplast genome sequences in Aconitum (Ranunculaceae): a traditional herbal medicinal genus.

    PubMed

    Kong, Hanghui; Liu, Wanzhen; Yao, Gang; Gong, Wei

    2017-01-01

    The herbal medicinal genus Aconitum L., belonging to the Ranunculaceae family, represents the earliest diverging lineage within the eudicots. It currently comprises two subgenera, A. subgenus Lycoctonum and A. subg. Aconitum. The complete chloroplast (cp) genome sequences were characterized in three species: A. angustius, A. finetianum, and A. sinomontanum in subg. Lycoctonum and compared to other Aconitum species to clarify their phylogenetic relationship and provide molecular information for utilization of Aconitum species particularly in Eastern Asia. The lengths of the chloroplast genome sequences were 156,109 bp in A. angustius, 155,625 bp in A. finetianum and 157,215 bp in A. sinomontanum, with each species possessing 126 genes with 84 protein coding genes (PCGs). While genomic rearrangements were absent, structural variation was detected in the LSC/IR/SSC boundaries. Five pseudogenes were identified, among which Ψrps19 and Ψycf1 were in the LSC/IR/SSC boundaries, Ψrps16 and ΨinfA in the LSC region, and Ψycf15 in the IRb region. The nucleotide variability (Pi) of Aconitum was estimated to be 0.00549, with comparatively higher variations in the LSC and SSC than the IR regions. Eight intergenic regions were revealed to be highly variable and a total of 58-62 simple sequence repeats (SSRs) were detected in all three species. More than 80% of SSRs were present in the LSC region. Altogether, 64.41% and 46.81% of SSRs are mononucleotides in subg. Lycoctonum and subg. Aconitum, respectively, while a higher percentage of di-, tri-, tetra-, and penta- SSRs were present in subg. Aconitum. Most species of subg. Aconitum in Eastern Asia were first used for phylogenetic analyses. The availability of the complete cp genome sequences of these species in subg. Lycoctonum will benefit future phylogenetic analyses and aid in germplasm utilization in Aconitum species.
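
    Nucleotide variability (Pi) is commonly defined as the mean number of pairwise differences per site among aligned sequences. The abstract does not say how Pi was computed, so the following is only a minimal sketch of that textbook definition; the function name and the toy alignment are assumptions, and gaps and sliding windows are ignored.

```python
from itertools import combinations

def nucleotide_diversity(seqs):
    """Mean pairwise differences per site over equal-length aligned sequences."""
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    pairs = list(combinations(seqs, 2))
    # Count mismatching columns for every pair of sequences.
    total = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total / (len(pairs) * length)

# Hypothetical three-sequence alignment of four sites.
toy_alignment = ["ACGT", "ACGA", "ACGT"]
pi = nucleotide_diversity(toy_alignment)
```

    For the toy alignment, two of the three pairs differ at one of four sites, so Pi is 2 / 12.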

  7. A comparative study of the inner ear structures of artiodactyls and early cetaceans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Klingshirn, M.A.; Luo, Z.

    1994-12-31

    It has been suggested that the order Cetacea (whales and porpoises) is closely related to artiodactyls, even-hoofed ungulate mammals such as the pig and cow. Paleontological and molecular data strongly support this concept of phylogenetic relationships. In a study of DNA sequences of two mitochondrial ribosomal gene segments of cetaceans, the artiodactyls were found to be the closest relatives of cetaceans. These well-accepted studies on the phylogenetic affinities of artiodactyls and cetaceans led us to conduct a comparative study of the bony structure of the inner ear of these two taxa.

  8. Host susceptibility to snake fungal disease is highly dispersed across phylogenetic and functional trait space

    PubMed Central

    Burbrink, Frank T.; Lorch, Jeffrey M.; Lips, Karen R.

    2017-01-01

    Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occurring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptibility is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen. PMID:29291245

  9. Host susceptibility to snake fungal disease is highly dispersed across phylogenetic and functional trait space.

    PubMed

    Burbrink, Frank T; Lorch, Jeffrey M; Lips, Karen R

    2017-12-01

    Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occurring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptibility is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen.

  10. Host susceptibility to snake fungal disease is highly dispersed across phylogenetic and functional trait space

    USGS Publications Warehouse

    Burbrink, Frank T.; Lorch, Jeffrey M.; Lips, Karen R.

    2017-01-01

    Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occurring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptibility is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen.

  11. SILVA tree viewer: interactive web browsing of the SILVA phylogenetic guide trees.

    PubMed

    Beccati, Alan; Gerken, Jan; Quast, Christian; Yilmaz, Pelin; Glöckner, Frank Oliver

    2017-09-30

    Phylogenetic trees are an important tool to study the evolutionary relationships among organisms. The huge number of available taxa poses difficulties in their interactive visualization. This hampers the interaction with the users to provide feedback for the further improvement of the taxonomic framework. The SILVA Tree Viewer is a web application designed for visualizing large phylogenetic trees without requiring the download of any software tool or data files. The SILVA Tree Viewer is based on Web Geographic Information Systems (Web-GIS) technology with a PostgreSQL backend. It enables zoom and pan functionalities similar to Google Maps. The SILVA Tree Viewer enables access to two phylogenetic (guide) trees provided by the SILVA database: the SSU Ref NR99 inferred from high-quality, full-length small subunit sequences, clustered at 99% sequence identity and the LSU Ref inferred from high-quality, full-length large subunit sequences. The Tree Viewer provides tree navigation, search and browse tools as well as an interactive feedback system to collect any kind of request, ranging from taxonomy to data curation and improving the tool itself.

  12. Phylogenetic screening of a bacterial, metagenomic library using homing endonuclease restriction and marker insertion

    PubMed Central

    Yung, Pui Yi; Burke, Catherine; Lewis, Matt; Egan, Suhelen; Kjelleberg, Staffan; Thomas, Torsten

    2009-01-01

    Metagenomics provides access to the uncultured majority of the microbial world. The approaches employed in this field have, however, had limited success in linking functional genes to the taxonomic or phylogenetic origin of the organism they belong to. Here we present an efficient strategy to recover environmental DNA fragments that contain phylogenetic marker genes from metagenomic libraries. Our method involves the cleavage of 23S ribosomal RNA (rRNA) genes within pooled library clones by the homing endonuclease I-CeuI followed by the insertion and selection of an antibiotic resistance cassette. This approach was applied to screen a library of 6500 fosmid clones derived from the microbial community associated with the sponge Cymbastela concentrica. Several fosmid clones were recovered after the screen and detailed phylogenetic and taxonomic assignment based on the rRNA gene showed that they belong to previously unknown organisms. In addition, compositional features of these fosmid clones were used to classify and taxonomically assign a dataset of environmental shotgun sequences. Our approach represents a valuable tool for the analysis of rapidly increasing, environmental DNA sequencing information. PMID:19767618

  13. Bayesian nonparametric clustering in phylogenetics: modeling antigenic evolution in influenza.

    PubMed

    Cybis, Gabriela B; Sinsheimer, Janet S; Bedford, Trevor; Rambaut, Andrew; Lemey, Philippe; Suchard, Marc A

    2018-01-30

    Influenza is responsible for up to 500,000 deaths every year, and antigenic variability represents much of its epidemiological burden. To visualize antigenic differences across many viral strains, antigenic cartography methods use multidimensional scaling on binding assay data to map influenza antigenicity onto a low-dimensional space. Analysis of such assay data ideally leads to natural clustering of influenza strains of similar antigenicity that correlate with sequence evolution. To understand the dynamics of these antigenic groups, we present a framework that jointly models genetic and antigenic evolution by combining multidimensional scaling of binding assay data, Bayesian phylogenetic machinery and nonparametric clustering methods. We propose a phylogenetic Chinese restaurant process that extends the current process to incorporate the phylogenetic dependency structure between strains in the modeling of antigenic clusters. With this method, we are able to use the genetic information to better understand the evolution of antigenicity throughout epidemics, as shown in applications of this model to H1N1 influenza. Copyright © 2017 John Wiley & Sons, Ltd.
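
    For orientation, the standard Chinese restaurant process that this abstract builds on can be sketched as below. This is only the ordinary, exchangeable process, not the phylogenetic extension the authors propose; the function name and the concentration parameter name alpha are assumptions made for illustration.

```python
import random

def sample_crp(n_items, alpha, seed=0):
    """Draw one partition of n_items from a standard Chinese restaurant process."""
    rng = random.Random(seed)
    tables = []        # tables[k] = number of items seated at table k
    assignments = []   # assignments[i] = table index of item i
    for _ in range(n_items):
        # Existing table k has weight tables[k]; a new table has weight alpha.
        weights = tables + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(tables):
            tables.append(1)   # open a new table (a new cluster)
        else:
            tables[k] += 1     # join an existing cluster
        assignments.append(k)
    return assignments, tables

assignments, tables = sample_crp(50, alpha=1.0)
```

    Larger values of alpha tend to produce more clusters; the number of clusters is not fixed in advance, which is what makes the clustering nonparametric.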

  14. Resolving kangaroo phylogeny and overcoming retrotransposon ascertainment bias.

    PubMed

    Dodt, William G; Gallus, Susanne; Phillips, Matthew J; Nilsson, Maria A

    2017-12-01

    Reconstructing phylogeny from retrotransposon insertions is often limited by access to only a single reference genome, whereby support for clades that do not include the reference taxon cannot be directly observed. Here we have developed a new statistical framework that accounts for this ascertainment bias, allowing us to employ phylogenetically powerful retrotransposon markers to explore the radiation of the largest living marsupials, the kangaroos and wallabies of the genera Macropus and Wallabia. An exhaustive in silico screening of the tammar wallaby (Macropus eugenii) reference genome followed by experimental screening revealed 29 phylogenetically informative retrotransposon markers belonging to a family of endogenous retroviruses. We identified robust support for the enigmatic swamp wallaby (Wallabia bicolor) falling within a paraphyletic genus, Macropus. Our statistical approach provides a means to test for incomplete lineage sorting and introgression/hybridization in the presence of the ascertainment bias. Using retrotransposons as "molecular fossils", we reveal one of the most complex patterns of hemiplasy yet identified, during the rapid diversification of kangaroos and wallabies. Ancestral state reconstruction incorporating the new retrotransposon phylogenetic information reveals multiple independent ecological shifts among kangaroos into more open habitats, coinciding with the Pliocene onset of increased aridification in Australia from ~3.6 million years ago.

  15. Maximizing the phylogenetic diversity of seed banks.

    PubMed

    Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard

    2015-04-01

    Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. © 2014 Society for Conservation Biology.
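
    The idea of improving a collection's phylogenetic diversity by banking a few well-chosen taxa can be illustrated with Faith's PD (the total branch length spanned by a set of taxa and the root) and a greedy selection. This is a minimal sketch under stated assumptions, not the authors' procedure: the toy tree, its branch lengths, and the function names are hypothetical.

```python
# Toy rooted tree: each node maps to (parent, branch length to parent).
# Names and lengths are hypothetical, for illustration only.
TREE = {
    "A": ("n1", 1.0),
    "B": ("n1", 1.0),
    "C": ("n2", 3.0),
    "D": ("root", 5.0),
    "n1": ("n2", 2.0),
    "n2": ("root", 1.0),
}

def path_to_root(tip):
    """Return the branches (named by their child node) from a tip to the root."""
    branches = set()
    node = tip
    while node in TREE:
        branches.add(node)
        node = TREE[node][0]
    return branches

def faith_pd(tips):
    """Faith's PD: total length of branches connecting the tips to the root."""
    covered = set()
    for tip in tips:
        covered |= path_to_root(tip)
    return sum(TREE[b][1] for b in covered)

def greedy_additions(banked, candidates, k):
    """Pick k candidates, each time adding the taxon with the largest PD gain."""
    chosen = list(banked)
    pool = set(candidates)
    picks = []
    for _ in range(min(k, len(pool))):
        best = max(sorted(pool), key=lambda t: faith_pd(chosen + [t]))
        picks.append(best)
        chosen.append(best)
        pool.remove(best)
    return picks

picks = greedy_additions(["A"], ["B", "C", "D"], 2)
```

    With only taxon A banked, the greedy step first adds D, which sits on a long unrepresented branch, and then C, while the close relative B contributes the least new branch length.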

  16. The nuclear 18S ribosomal RNA gene as a source of phylogenetic information in the genus Taenia.

    PubMed

    Yan, Hongbin; Lou, Zhongzi; Li, Li; Ni, Xingwei; Guo, Aijiang; Li, Hongmin; Zheng, Yadong; Dyachenko, Viktor; Jia, Wanzhong

    2013-03-01

    Most species of the genus Taenia are of considerable medical and veterinary significance. In this study, complete nuclear 18S rRNA gene sequences were obtained from seven members of genus Taenia [Taenia multiceps, Taenia saginata, Taenia asiatica, Taenia solium, Taenia pisiformis, Taenia hydatigena, and Taenia taeniaeformis] and a phylogeny inferred using these sequences. Most of the variable sites fall within the variable regions, V1-V5. We show that sequences from the nuclear 18S ribosomal RNA gene have considerable promise as sources of phylogenetic information within the genus Taenia. Furthermore, given that almost all the variable sites lie within defined variable portions of that gene, it will be appropriate and economical to sequence only those regions for additional species of Taenia.

  17. Stable isotope probing in the metagenomics era: a bridge towards improved bioremediation

    PubMed Central

    Uhlik, Ondrej; Leewis, Mary-Cathrine; Strejcek, Michal; Musilova, Lucie; Mackova, Martina; Leigh, Mary Beth; Macek, Tomas

    2012-01-01

    Microbial biodegradation and biotransformation reactions are essential to most bioremediation processes, yet the specific organisms, genes, and mechanisms involved are often not well understood. Stable isotope probing (SIP) enables researchers to directly link microbial metabolic capability to phylogenetic and metagenomic information within a community context by tracking isotopically labeled substances into phylogenetically and functionally informative biomarkers. SIP is thus applicable as a tool for the identification of active members of the microbial community and associated genes integral to the community functional potential, such as biodegradative processes. The rapid evolution of SIP over the last decade and integration with metagenomics provides researchers with a much deeper insight into potential biodegradative genes, processes, and applications, thereby enabling an improved mechanistic understanding that can facilitate advances in the field of bioremediation. PMID:23022353

  18. A taxonomic wish-list for community ecology.

    PubMed Central

    Gotelli, Nicholas J

    2004-01-01

    Community ecology seeks to explain the number and relative abundance of coexisting species. Four research frontiers in community ecology are closely tied to research in systematics and taxonomy: the statistics of species richness estimators, global patterns of biodiversity, the influence of global climate change on community structure, and phylogenetic influences on community structure. The most pressing needs for taxonomic information in community ecology research are usable taxonomic keys, current nomenclature, species occurrence records and resolved phylogenies. These products can best be obtained from Internet-based phylogenetic and taxonomic resources, but the lack of trained professional systematists and taxonomists threatens this effort. Community ecologists will benefit most directly from research in systematics and taxonomy by making better use of resources in museums and herbaria, and by actively seeking training, information and collaborations with taxonomic specialists. PMID:15253346

  19. Head capsule characters in the Hymenoptera and their phylogenetic implications

    PubMed Central

    Vilhelmsen, Lars

    2011-01-01

    Abstract The head capsule of a taxon sample of three outgroup and 86 ingroup taxa is examined for characters of possible phylogenetic significance within Hymenoptera. 21 morphological characters are illustrated and scored, and their character evolution explored by mapping them onto a phylogeny recently produced from a large morphological data set. Many of the characters are informative and display unambiguous changes. Most of the character support demonstrated is supportive at the superfamily or family level. In contrast, only few characters corroborate deeper nodes in the phylogeny of Hymenoptera. PMID:22259288

  20. The relevance of phylogeny to studies of global change.

    PubMed

    Edwards, Erika J; Still, Christopher J; Donoghue, Michael J

    2007-05-01

    Phylogenetic thinking has infiltrated many areas of biological research, but has had little impact on studies of global ecology or climate change. Here, we illustrate how phylogenetic information can be relevant to understanding vegetation-atmosphere dynamics at ecosystem or global scales by re-analyzing a data set of carbonic anhydrase (CA) activity in leaves that was used to estimate terrestrial gross primary productivity. The original calculations relied on what appeared to be low CA activity exclusively in C4 grasses, but our analyses indicate that such activity might instead characterize the PACCAD grass lineage, which includes many widespread C3 species. We outline how phylogenetics can guide better taxon sampling of key physiological traits, and discuss how the emerging field of phyloinformatics presents a promising new framework for scaling from organism physiology to global processes.

  1. Preliminary Classification of Novel Hemorrhagic Fever-Causing Viruses Using Sequence-Based PAirwise Sequence Comparison (PASC) Analysis.

    PubMed

    Bào, Yīmíng; Kuhn, Jens H

    2018-01-01

    During the last decade, genome sequence-based classification of viruses has become increasingly prominent. Viruses can even be classified based on coding-complete genome sequence data alone. Nevertheless, classification remains arduous as experts are required to establish phylogenetic trees to depict the evolutionary relationships of such sequences for preliminary taxonomic placement. Pairwise sequence comparison (PASC) of genomes is one of several novel methods for establishing relationships among viruses. This method, provided by the US National Center for Biotechnology Information as an open-access tool, circumvents phylogenetics, and yet PASC results are often in agreement with those of phylogenetic analyses. Computationally inexpensive, PASC can be easily performed by non-taxonomists. Here we describe how to use the PASC tool for the preliminary classification of novel viral hemorrhagic fever-causing viruses.

  2. Visualizing speciation in artificial cichlid fish.

    PubMed

    Clement, Ross

    2006-01-01

    The Cichlid Speciation Project (CSP) is an ALife simulation system for investigating open problems in the speciation of African cichlid fish. The CSP can be used to perform a wide range of experiments that show that speciation is a natural consequence of certain biological systems. A visualization system capable of extracting the history of speciation from low-level trace data and creating a phylogenetic tree has been implemented. Unlike previous approaches, this visualization system presents a concrete trace of speciation, rather than a summary of low-level information from which the viewer can make subjective decisions on how speciation progressed. The phylogenetic trees are a more objective visualization of speciation, and enable automated collection and summarization of the results of experiments. The visualization system is used to create a phylogenetic tree from an experiment that models sympatric speciation.

  3. Selecting informative subsets of sparse supermatrices increases the chance to find correct trees.

    PubMed

    Misof, Bernhard; Meyer, Benjamin; von Reumont, Björn Marcus; Kück, Patrick; Misof, Katharina; Meusemann, Karen

    2013-12-03

    Character matrices with extensive missing data are frequently used in phylogenomics with potentially detrimental effects on the accuracy and robustness of tree inference. Therefore, many investigators select taxa and genes with high data coverage. The drawback of such selections is their exclusive reliance on data coverage without consideration of the actual signal in the data, so they might not deliver optimal data matrices in terms of potential phylogenetic signal. In order to circumvent this problem, we have developed a heuristic implemented in software called mare which (1) assesses information content of genes in supermatrices using a measure of potential signal combined with data coverage and (2) reduces supermatrices with a simple hill climbing procedure to submatrices with high total information content. We conducted simulation studies using matrices of 50 taxa × 50 genes with heterogeneous phylogenetic signal among genes and data coverage between 10% and 30%. With these matrices, Maximum Likelihood (ML) tree reconstructions failed to recover correct trees. A selection of a data subset with the herein proposed approach increased the chance to recover correct partial trees more than 10-fold. The selection of data subsets with the herein proposed simple hill climbing procedure performed well whether it considered the information content or just simple presence/absence information of genes. We also applied our approach to an empirical data set, addressing questions of vertebrate systematics. With this empirical dataset, selecting a data subset with high information content and supporting a tree with high average bootstrap support was most successful if information content of genes was considered. Our analyses of simulated and empirical data demonstrate that sparse supermatrices can be reduced on a formal basis outperforming the usually used simple selections of taxa and genes with high data coverage.
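
    The hill climbing reduction described above can be sketched as follows. This is an illustration under stated assumptions, not the mare algorithm itself: each cell of the matrix is assumed to hold an information score between 0 and 1 (0 for missing data), the objective is simply the mean score of the retained submatrix, and the names `mean_info`, `reduce_supermatrix`, `min_taxa` and `min_genes` are hypothetical.

```python
from typing import List, Tuple


def mean_info(matrix: List[List[float]], rows: List[int], cols: List[int]) -> float:
    """Mean information score of the submatrix defined by rows x cols."""
    total = sum(matrix[r][c] for r in rows for c in cols)
    return total / (len(rows) * len(cols))


def reduce_supermatrix(
    matrix: List[List[float]], min_taxa: int = 2, min_genes: int = 2
) -> Tuple[List[int], List[int], float]:
    """Greedily drop the taxon (row) or gene (column) whose removal most
    increases the mean score; stop when no single removal improves it."""
    rows = list(range(len(matrix)))
    cols = list(range(len(matrix[0])))
    current = mean_info(matrix, rows, cols)
    while True:
        best_score, best_move = current, None
        if len(rows) > min_taxa:
            for r in rows:
                score = mean_info(matrix, [x for x in rows if x != r], cols)
                if score > best_score:
                    best_score, best_move = score, ("taxon", r)
        if len(cols) > min_genes:
            for c in cols:
                score = mean_info(matrix, rows, [x for x in cols if x != c])
                if score > best_score:
                    best_score, best_move = score, ("gene", c)
        if best_move is None:
            return rows, cols, current  # local optimum reached
        kind, index = best_move
        if kind == "taxon":
            rows.remove(index)
        else:
            cols.remove(index)
        current = best_score
```

    The size floors are needed because maximising a mean alone would shrink the matrix to its single best cell; a real objective would have to balance information content against matrix size.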

  4. Comparative cytogenetic analysis of some species of the Dendropsophus microcephalus group (Anura, Hylidae) in the light of phylogenetic inferences

    PubMed Central

    2013-01-01

    Background: Dendropsophus is a monophyletic anuran genus with a diploid number of 30 chromosomes as an important synapomorphy. However, the internal phylogenetic relationships of this genus are poorly understood. Interestingly, an intriguing interspecific variation in the telocentric chromosome number has been useful in species identification. To address certain uncertainties related to one of the species groups of Dendropsophus, the D. microcephalus group, we carried out a cytogenetic analysis combined with phylogenetic inferences based on mitochondrial sequences, which aimed to aid in the analysis of chromosomal characters. Populations of Dendropsophus nanus, Dendropsophus walfordi, Dendropsophus sanborni, Dendropsophus jimi and Dendropsophus elianeae, ranging from the extreme south to the north of Brazil, were cytogenetically compared. A mitochondrial region of the ribosomal 12S gene from these populations, as well as from 30 other species of Dendropsophus, was used for the phylogenetic inferences. Phylogenetic relationships were inferred using maximum parsimony and Bayesian analyses. Results: The species D. nanus and D. walfordi exhibited identical karyotypes (2n = 30; FN = 52), with four pairs of telocentric chromosomes and a NOR located on metacentric chromosome pair 13. In all of the phylogenetic hypotheses, the paraphyly of D. nanus and D. walfordi was inferred. D. sanborni from Botucatu-SP and Torres-RS showed the same karyotype as D. jimi, with 5 pairs of telocentric chromosomes (2n = 30; FN = 50) and a terminal NOR in the long arm of the telocentric chromosome pair 12. Despite their karyotypic similarity, these species were not found to compose a monophyletic group. Finally, the phylogenetic and cytogenetic analyses did not cluster the specimens of D. elianeae according to their geographical occurrence or recognized morphotypes. Conclusions: We suggest that a taxonomic revision of the taxa D. nanus and D. walfordi is quite necessary. We also observe that the number of telocentric chromosomes is useful to distinguish among valid species in some cases, although it is unchanged in species that are not necessarily closely related phylogenetically. Therefore, inferences based on this chromosomal character must be made with caution; a proper evolutionary analysis of the karyotypic variation in Dendropsophus depends on further characterization of the telocentric chromosomes found in this group. PMID:23822759
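
    The karyotype figures quoted above are consistent with the usual definition of the fundamental number (FN) as the total count of chromosome arms, where a biarmed chromosome contributes two arms and a telocentric chromosome contributes one. The short check below assumes that definition; the function name `fundamental_number` is hypothetical.

```python
def fundamental_number(diploid_number: int, telocentric_pairs: int) -> int:
    """Total chromosome arms: 2 per biarmed chromosome, 1 per telocentric one."""
    telocentric = 2 * telocentric_pairs
    if diploid_number < 0 or telocentric_pairs < 0 or telocentric > diploid_number:
        raise ValueError("inconsistent chromosome counts")
    biarmed = diploid_number - telocentric
    return 2 * biarmed + telocentric


# Karyotypes reported in the abstract (2n = 30 in both cases).
print(fundamental_number(30, 4))  # D. nanus / D. walfordi: four telocentric pairs
print(fundamental_number(30, 5))  # D. sanborni / D. jimi: five telocentric pairs
```

    With 2n = 30, four telocentric pairs give 22 biarmed and 8 telocentric chromosomes (44 + 8 = 52 arms), and five pairs give 20 biarmed and 10 telocentric chromosomes (40 + 10 = 50 arms), matching the reported FN values.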

  5. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    PubMed

    Guo, Yong; Qiu, Li-Juan

    2013-01-01

    The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max). In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs) were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  6. Phylogenetic Analysis Supports the Aerobic-Capacity Model for the Evolution of Endothermy.

    PubMed

    Nespolo, Roberto F; Solano-Iguaran, Jaiber J; Bozinovic, Francisco

    2017-01-01

    The evolution of endothermy is a controversial topic in evolutionary biology, although several hypotheses have been proposed to explain it. To a great extent, the debate has centered on the aerobic-capacity model (AC model), an adaptive hypothesis involving maximum and resting rates of metabolism (MMR and RMR, respectively; hereafter "metabolic traits"). The AC model posits that MMR, a proxy of aerobic capacity and sustained activity, is the target of directional selection and that RMR is also influenced as a correlated response. Associated with this reasoning are the assumptions that (1) factorial aerobic scope (FAS; MMR/RMR) and net aerobic scope (NAS; MMR - RMR), two commonly used indexes of aerobic capacity, show different evolutionary optima and (2) the functional link between MMR and RMR is a basic design feature of vertebrates. To test these assumptions, we performed a comparative phylogenetic analysis in 176 vertebrate species, ranging from fish and amphibians to birds and mammals. Using disparity-through-time analysis, we also explored trait diversification and fitted different evolutionary models to study the evolution of metabolic traits. As predicted, we found (1) a positive phylogenetic correlation between RMR and MMR, (2) diversification of metabolic traits exceeding that of random-walk expectations, (3) that a model assuming selection fits the data better than alternative models, and (4) that a single evolutionary optimum best fits FAS data, whereas a model involving two optima (one for ectotherms and another for endotherms) is the best explanatory model for NAS. These results support the AC model and give novel information concerning the mode and tempo of physiological evolution of vertebrates.
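
    The two indexes of aerobic capacity defined above, FAS = MMR/RMR and NAS = MMR - RMR, can respond differently to the same change in metabolism, which is one reason they may show different evolutionary optima. The sketch below uses made-up values in arbitrary but identical units; the function names are hypothetical.

```python
def factorial_aerobic_scope(mmr: float, rmr: float) -> float:
    """FAS: ratio of maximum to resting metabolic rate (dimensionless)."""
    if rmr <= 0:
        raise ValueError("resting metabolic rate must be positive")
    return mmr / rmr


def net_aerobic_scope(mmr: float, rmr: float) -> float:
    """NAS: difference between maximum and resting metabolic rate."""
    return mmr - rmr


# Hypothetical values: a tenfold rise in both rates leaves FAS unchanged
# but multiplies NAS by ten.
low = (10.0, 2.0)     # (MMR, RMR), illustrative only
high = (100.0, 20.0)  # both rates scaled by 10
print(factorial_aerobic_scope(*low), factorial_aerobic_scope(*high))
print(net_aerobic_scope(*low), net_aerobic_scope(*high))
```

    Because FAS is invariant to proportional scaling of both rates while NAS is not, a shift in overall metabolic level can move NAS without moving FAS.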

  7. Molecular, phylogenetic and comparative genomic analysis of the cytokinin oxidase/dehydrogenase gene family in the Poaceae.

    PubMed

    Mameaux, Sabine; Cockram, James; Thiel, Thomas; Steuernagel, Burkhard; Stein, Nils; Taudien, Stefan; Jack, Peter; Werner, Peter; Gray, John C; Greenland, Andy J; Powell, Wayne

    2012-01-01

    The genomes of cereals such as wheat (Triticum aestivum) and barley (Hordeum vulgare) are large and therefore problematic for the map-based cloning of agronomically important traits. However, comparative approaches within the Poaceae permit transfer of molecular knowledge between species, despite their divergence from a common ancestor sixty million years ago. The finding that null variants of the rice gene cytokinin oxidase/dehydrogenase 2 (OsCKX2) result in large yield increases provides an opportunity to explore whether similar gains could be achieved in other Poaceae members. Here, phylogenetic, molecular and comparative analyses of CKX families in the sequenced grass species rice, brachypodium, sorghum, maize and foxtail millet, as well as members identified from the transcriptomes/genomes of wheat and barley, are presented. Phylogenetic analyses define four Poaceae CKX clades. Comparative analyses showed that CKX phylogenetic groupings can largely be explained by a combination of local gene duplication, and the whole-genome duplication event that predates their speciation. Full-length OsCKX2 homologues in barley (HvCKX2.1, HvCKX2.2) and wheat (TaCKX2.3, TaCKX2.4, TaCKX2.5) are characterized, with comparative analysis at the DNA, protein and genetic/physical map levels suggesting that true CKX2 orthologs have been identified. Furthermore, our analysis shows CKX2 genes in barley and wheat have undergone a Triticeae-specific gene-duplication event. Finally, by identifying ten of the eleven CKX genes predicted to be present in barley by comparative analyses, we show that next-generation sequencing approaches can efficiently determine the gene space of large-genome crops. Together, this work provides the foundation for future functional investigation of CKX family members within the Poaceae. © 2011 National Institute of Agricultural Botany (NIAB). Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.

  8. A phylogenetic analysis of normal modes evolution in enzymes and its relationship to enzyme function

    PubMed Central

    Lai, Jason; Jin, Jing; Kubelka, Jan; Liberles, David A.

    2012-01-01

    Since the dynamic nature of protein structures is essential for enzymatic function, it is expected that the functional evolution can be inferred from the changes in the protein dynamics. However, dynamics can also diverge neutrally with sequence substitution between enzymes without changes of function. In this study, a phylogenetic approach is implemented to explore the relationship between enzyme dynamics and function through evolutionary history. Protein dynamics are described by normal mode analysis based on a simplified harmonic potential force field applied to the reduced Cα representation of the protein structure while enzymatic function is described by Enzyme Commission (EC) numbers. Similarity of the binding pocket dynamics at each branch of the protein family’s phylogeny was analyzed in two ways: 1) explicitly by quantifying the normal mode overlap calculated for the reconstructed ancestral proteins at each end and 2) implicitly using a diffusion model to obtain the reconstructed lineage-specific changes in the normal modes. Both explicit and implicit ancestral reconstruction identified generally faster rates of change in dynamics compared with the expected change from neutral evolution at the branches of potential functional divergences for the alpha-amylase, D-isomer specific 2-hydroxyacid dehydrogenase, and copper-containing amine oxidase protein families. Normal modes analysis added additional information over just comparing the RMSD of static structures. However, the branch-specific changes were not statistically significant compared to background function-independent neutral rates of change of dynamic properties and blind application of the analysis would not enable prediction of changes in enzyme specificity. PMID:22651983
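
    The normal mode overlap mentioned above is commonly computed as the absolute cosine between two displacement vectors. The sketch below assumes that common definition, which may differ in detail from the measure used in the study, and the function name `mode_overlap` is hypothetical.

```python
import math
from typing import Sequence


def mode_overlap(u: Sequence[float], v: Sequence[float]) -> float:
    """Absolute cosine between two mode vectors: 1 = same direction of
    motion (up to sign), 0 = orthogonal motions."""
    if len(u) != len(v):
        raise ValueError("mode vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("mode vectors must be non-zero")
    return abs(dot) / (norm_u * norm_v)
```

    The absolute value is taken because a normal mode and its negation describe the same oscillation, so the sign of the vector carries no information.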

  9. A phylogenetic analysis of normal modes evolution in enzymes and its relationship to enzyme function.

    PubMed

    Lai, Jason; Jin, Jing; Kubelka, Jan; Liberles, David A

    2012-09-21

    Since the dynamic nature of protein structures is essential for enzymatic function, it is expected that functional evolution can be inferred from the changes in protein dynamics. However, dynamics can also diverge neutrally with sequence substitution between enzymes without changes of function. In this study, a phylogenetic approach is implemented to explore the relationship between enzyme dynamics and function through evolutionary history. Protein dynamics are described by normal mode analysis based on a simplified harmonic potential force field applied to the reduced C(α) representation of the protein structure while enzymatic function is described by Enzyme Commission numbers. Similarity of the binding pocket dynamics at each branch of the protein family's phylogeny was analyzed in two ways: (1) explicitly by quantifying the normal mode overlap calculated for the reconstructed ancestral proteins at each end and (2) implicitly using a diffusion model to obtain the reconstructed lineage-specific changes in the normal modes. Both explicit and implicit ancestral reconstruction identified generally faster rates of change in dynamics compared with the expected change from neutral evolution at the branches of potential functional divergences for the α-amylase, D-isomer-specific 2-hydroxyacid dehydrogenase, and copper-containing amine oxidase protein families. Normal mode analysis added additional information over just comparing the RMSD of static structures. However, the branch-specific changes were not statistically significant compared to background function-independent neutral rates of change of dynamic properties and blind application of the analysis would not enable prediction of changes in enzyme specificity. Copyright © 2012 Elsevier Ltd. All rights reserved.

  10. Interspecific Proteomic Comparisons Reveal Ash Phloem Genes Potentially Involved in Constitutive Resistance to the Emerald Ash Borer

    PubMed Central

    Whitehill, Justin G. A.; Popova-Butler, Alexandra; Green-Church, Kari B.; Koch, Jennifer L.; Herms, Daniel A.; Bonello, Pierluigi

    2011-01-01

    The emerald ash borer (Agrilus planipennis) is an invasive wood-boring beetle that has killed millions of ash trees since its accidental introduction to North America. All North American ash species (Fraxinus spp.) that emerald ash borer has encountered so far are susceptible, while an Asian species, Manchurian ash (F. mandshurica), which shares an evolutionary history with emerald ash borer, is resistant. Phylogenetic evidence places North American black ash (F. nigra) and Manchurian ash in the same clade and section, yet black ash is highly susceptible to the emerald ash borer. This contrast provides an opportunity to compare the genetic traits of the two species and identify those with a potential role in defense/resistance. We used Difference Gel Electrophoresis (DIGE) to compare the phloem proteomes of resistant Manchurian to susceptible black, green, and white ash. Differentially expressed proteins associated with the resistant Manchurian ash when compared to the susceptible ash species were identified using nano-LC-MS/MS and putative identities assigned. Proteomic differences were strongly associated with the phylogenetic relationships among the four species. Proteins identified in Manchurian ash potentially associated with its resistance to emerald ash borer include a PR-10 protein, an aspartic protease, a phenylcoumaran benzylic ether reductase (PCBER), and a thylakoid-bound ascorbate peroxidase. Discovery of resistance-related proteins in Asian species will inform approaches in which resistance genes can be introgressed into North American ash species. The generation of resistant North American ash genotypes can be used in forest ecosystem restoration and urban plantings following the wake of the emerald ash borer invasion. PMID:21949771

  11. Interspecific proteomic comparisons reveal ash phloem genes potentially involved in constitutive resistance to the emerald ash borer.

    PubMed

    Whitehill, Justin G A; Popova-Butler, Alexandra; Green-Church, Kari B; Koch, Jennifer L; Herms, Daniel A; Bonello, Pierluigi

    2011-01-01

    The emerald ash borer (Agrilus planipennis) is an invasive wood-boring beetle that has killed millions of ash trees since its accidental introduction to North America. All North American ash species (Fraxinus spp.) that emerald ash borer has encountered so far are susceptible, while an Asian species, Manchurian ash (F. mandshurica), which shares an evolutionary history with emerald ash borer, is resistant. Phylogenetic evidence places North American black ash (F. nigra) and Manchurian ash in the same clade and section, yet black ash is highly susceptible to the emerald ash borer. This contrast provides an opportunity to compare the genetic traits of the two species and identify those with a potential role in defense/resistance. We used Difference Gel Electrophoresis (DIGE) to compare the phloem proteomes of resistant Manchurian to susceptible black, green, and white ash. Differentially expressed proteins associated with the resistant Manchurian ash when compared to the susceptible ash species were identified using nano-LC-MS/MS and putative identities assigned. Proteomic differences were strongly associated with the phylogenetic relationships among the four species. Proteins identified in Manchurian ash potentially associated with its resistance to emerald ash borer include a PR-10 protein, an aspartic protease, a phenylcoumaran benzylic ether reductase (PCBER), and a thylakoid-bound ascorbate peroxidase. Discovery of resistance-related proteins in Asian species will inform approaches in which resistance genes can be introgressed into North American ash species. The generation of resistant North American ash genotypes can be used in forest ecosystem restoration and urban plantings following the wake of the emerald ash borer invasion.

  12. PolyTB: a genomic variation map for Mycobacterium tuberculosis.

    PubMed

    Coll, Francesc; Preston, Mark; Guerra-Assunção, José Afonso; Hill-Cawthorn, Grant; Harris, David; Perdigão, João; Viveiros, Miguel; Portugal, Isabel; Drobniewski, Francis; Gagneux, Sebastien; Glynn, Judith R; Pain, Arnab; Parkhill, Julian; McNerney, Ruth; Martin, Nigel; Clark, Taane G

    2014-05-01

    Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.

  13. Comparative Study of Lectin Domains in Model Species: New Insights into Evolutionary Dynamics

    PubMed Central

    Van Holle, Sofie; De Schutter, Kristof; Eggermont, Lore; Tsaneva, Mariya; Dang, Liuyi; Van Damme, Els J. M.

    2017-01-01

    Lectins are present throughout the plant kingdom and are reported to be involved in diverse biological processes. In this study, we provide a comparative analysis of the lectin families from model species in a phylogenetic framework. The analysis focuses on the different plant lectin domains identified in five representative core angiosperm genomes (Arabidopsis thaliana, Glycine max, Cucumis sativus, Oryza sativa ssp. japonica and Oryza sativa ssp. indica). The genomes were screened for genes encoding lectin domains using a combination of Basic Local Alignment Search Tool (BLAST), hidden Markov models, and InterProScan analysis. Additionally, phylogenetic relationships were investigated by constructing maximum likelihood phylogenetic trees. The results demonstrate that the majority of the lectin families are present in each of the species under study. Domain organization analysis showed that most identified proteins are multi-domain proteins, owing to the modular rearrangement of protein domains during evolution. Most of these multi-domain proteins are widespread, while others display a lineage-specific distribution. Furthermore, the phylogenetic analyses reveal that some lectin families evolved to be similar to the phylogeny of the plant species, while others share a closer evolutionary history based on the corresponding protein domain architecture. Our results yield insights into the evolutionary relationships and functional divergence of plant lectins. PMID:28587095

  14. Is Aquatic Life Correlated with an Increased Hematocrit in Snakes?

    PubMed Central

    Brischoux, François; Gartner, Gabriel E. A.; Garland, Theodore; Bonnet, Xavier

    2011-01-01

    Background: Physiological adaptations that allow air-breathing vertebrates to remain underwater for long periods mainly involve modifications of the respiratory system, essentially through increased oxygen reserves. Physiological constraints on dive duration tend to be less critical for ectotherms than for endotherms because the former have lower mass-specific metabolic rates. Moreover, comparative studies between marine and terrestrial ectotherms have yet to show overall distinct physiological differences specifically associated with oxygen reserves. Methodology/Principal Findings: We used phylogenetically informed statistical models to test if habitat affects hematocrit (an indicator of blood oxygen stores) in snakes, a lineage that varies widely in habitat use. Our results indicate that both phylogenetic position (clade) and especially habitat are significant predictors of hematocrit. Our analysis also confirms the peculiar respiratory physiology of the marine Acrochordus granulatus. Conclusion/Significance: Contrary to previous findings, marine snakes have significantly–albeit slightly–elevated hematocrit, which should facilitate increased aerobic dive times. Longer dives could have consequences for foraging, mate searching, and predation risks. Alternatively, but not exclusively, increased Hct in marine species might also help to fuel other oxygen-demanding physiological adaptations, such as those involved in osmoregulation. PMID:21359216

  15. Genome-Wide Analysis of Mycoplasma bovirhinis GS01 Reveals Potential Virulence Factors and Phylogenetic Relationships.

    PubMed

    Chen, Shengli; Hao, Huafang; Zhao, Ping; Liu, Yongsheng; Chu, Yuefeng

    2018-05-04

    Mycoplasma bovirhinis is a significant etiological agent of bovine pneumonia and mastitis, but our knowledge about the genetic and pathogenic mechanisms of M. bovirhinis is very limited. In this study, we sequenced the complete genome of M. bovirhinis strain GS01 isolated from the nasal swab of pneumonic calves in Gansu, China, and we found that its genome forms an 847,985 bp single circular chromosome with a GC content of 27.57% and with 707 protein-coding genes. The putative virulence determinants of M. bovirhinis were then analyzed. Results showed that three genomic islands and 16 putative virulence genes, including one adhesion gene enolase, seven surface lipoproteins, proteins involved in glycerol metabolism, and cation transporters, might be potential virulence factors. Glycerol and pyruvate metabolic pathways were defective. Comparative analysis revealed remarkable genome variations between GS01 and a recently reported HAZ141_2 strain, and extremely low homology with other Mycoplasma species. Phylogenetic analysis demonstrated that M. bovirhinis was genetically closest to M. canis and distant from other bovine Mycoplasma species. Genomic dissection may provide useful information on the pathogenic mechanisms and genetics of M. bovirhinis. Copyright © 2018 Chen et al.

  16. Toward a comprehensive understanding of phylogenetic relationships among lineages of Acanthaceae s.l. (Lamiales).

    PubMed

    McDade, Lucinda A; Daniel, Thomas F; Kiel, Carrie A

    2008-09-01

    Acanthaceae (Asteridae; Lamiales) include ∼4000 species and encompass a range of morphological diversity, habitats, and biogeographic patterns. Although they are important components of tropical and subtropical habitats worldwide, inadequate knowledge of the family's phylogenetic framework has impeded comparative research. In this study, we sampled all known lineages of Acanthaceae including Andrographideae. Also included were eight of 13 genera whose relationships remain enigmatic. We used sequence data from nrITS and four chloroplast noncoding regions, and parsimony and Bayesian methods of analysis. Results strongly support most aspects of relationships including inclusion of Avicennia in Acanthaceae. Excepting Neuracanthus, newly sampled taxa are placed with strong support; Kudoacanthus is in Justicieae, Tetramerium lineage, and the remaining enigmatic genera are in Whitfieldieae or Barlerieae, and Andrographideae are sister to Barlerieae. This last result is unanticipated, but placement of Andrographideae based on structural characters has been elusive. Neuracanthus is monophyletic but placement relative to (Whitfieldieae (Andrographideae + Barlerieae)) is weakly supported. Many clades have clear morphological synapomorphies, but nonmolecular evidence for some remains elusive. Results suggest an Old World origin with multiple dispersal events to the New World. This study informs future work by clarifying sampling strategy and identifying aspects of relationships that require further study.

  17. Does aquatic foraging impact head shape evolution in snakes?

    PubMed Central

    Cornette, Raphaël; Fabre, Anne-Claire; Godoy-Diana, Ramiro; Herrel, Anthony

    2016-01-01

    Evolutionary trajectories are often biased by developmental and historical factors. However, environmental factors can also impose constraints on the evolutionary trajectories of organisms leading to convergence of morphology in similar ecological contexts. The physical properties of water impose strong constraints on aquatic feeding animals by generating pressure waves that can alert prey and potentially push them away from the mouth. These hydrodynamic constraints have resulted in the independent evolution of suction feeding in most groups of secondarily aquatic tetrapods. Despite the fact that snakes cannot use suction, they have invaded the aquatic milieu many times independently. Here, we test whether the aquatic environment has constrained head shape evolution in snakes and whether shape converges on that predicted by biomechanical models. To do so, we used three-dimensional geometric morphometrics and comparative, phylogenetically informed analyses on a large sample of aquatic snake species. Our results show that aquatic snakes partially conform to our predictions and have a narrower anterior part of the head and dorsally positioned eyes and nostrils. This morphology is observed, irrespective of the phylogenetic relationships among species, suggesting that the aquatic environment does indeed drive the evolution of head shape in snakes, thus biasing the evolutionary trajectory of this group of animals. PMID:27581887

  18. Comparing Mycobacterium tuberculosis genomes using genome topology networks.

    PubMed

    Jiang, Jianping; Gu, Jianlei; Zhang, Liang; Zhang, Chenyi; Deng, Xiao; Dou, Tonghai; Zhao, Guoping; Zhou, Yan

    2015-02-14

    Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene duplication and genome rearrangement, can lead to different phenotypes among strains, and an investigation of genes affected by SVs may extend our knowledge of the relationships between SVs and phenotypes in microbes, especially in pathogenic bacteria. In this work, we introduce a 'Genome Topology Network' (GTN) method based on gene homology and gene locations to analyze genomic SVs and perform phylogenetic analysis. Furthermore, the concept of 'unfixed ortholog' has been proposed, whose members are affected by SVs in genome topology among close species. To improve the precision of 'unfixed ortholog' recognition, a strategy to detect annotation differences and complete gene annotation was applied. To assess the GTN method, a set of thirteen complete M. tuberculosis genomes was analyzed as a case study. GTNs with two different gene homology-assigning methods were built, the Clusters of Orthologous Groups (COG) method and the orthoMCL clustering method, and two phylogenetic trees were constructed accordingly, which may provide additional insights into whole genome-based phylogenetic analysis. We obtained 24 unfixable COG groups, of which most members were related to immunogenicity and drug resistance, such as PPE-repeat proteins (COG5651) and transcriptional regulator TetR gene family members (COG1309). The GTN method has been implemented in PERL and released on our website. The tool can be downloaded from http://homepage.fudan.edu.cn/zhouyan/gtn/, and allows re-annotating the 'lost' genes among closely related genomes, analyzing genes affected by SVs, and performing phylogenetic analysis. With this tool, many immunogenic-related and drug resistance-related genes were found to be affected by SVs in M. tuberculosis genomes. We believe that the GTN method will be suitable for the exploration of genomic SVs in connection with biological features of bacterial strains, and that GTN-based phylogenetic analysis will provide additional insights into whole genome-based phylogenetic analysis.

  19. Are pollen fossils useful for calibrating relaxed molecular clock dating of phylogenies? A comparative study using Myrtaceae.

    PubMed

    Thornhill, Andrew H; Popple, Lindsay W; Carter, Richard J; Ho, Simon Y W; Crisp, Michael D

    2012-04-01

    The identification and application of reliable fossil calibrations represents a key component of many molecular studies of evolutionary timescales. In studies of plants, most paleontological calibrations are associated with macrofossils. However, the pollen record can also inform age calibrations if fossils matching extant pollen groups are found. Recent work has shown that pollen of the myrtle family, Myrtaceae, can be classified into a number of morphological groups that are synapomorphic with molecular groups. By assembling a data matrix of pollen morphological characters from extant and fossil Myrtaceae, we were able to measure the fit of 26 pollen fossils to a molecular phylogenetic tree using parsimony optimisation of characters. We identified eight Myrtaceidites fossils as appropriate for calibration based on the most parsimonious placements of these fossils on the tree. These fossils were used to inform age constraints in a Bayesian phylogenetic analysis of a sequence alignment comprising two sequences from the chloroplast genome (matK and ndhF) and one nuclear locus (ITS), sampled from 106 taxa representing 80 genera. Three additional analyses were calibrated by placing pollen fossils using geographic and morphological information (eight calibrations), macrofossils (five calibrations), and macrofossils and pollen fossils in combination (12 calibrations). The addition of new fossil pollen calibrations led to older crown ages than have previously been found for tribes such as Eucalypteae and Myrteae. Estimates of rate variation among lineages were affected by the choice of calibrations, suggesting that the use of multiple calibrations can improve estimates of rate heterogeneity among lineages. This study illustrates the potential of including pollen-based calibrations in molecular studies of divergence times. Copyright © 2011 Elsevier Inc. All rights reserved.

  20. Recovery of microbial communities and carbon cycling processes following drought manipulation in southern California

    NASA Astrophysics Data System (ADS)

    Allison, S. D.; Martiny, J. B. H.; Martiny, A.; Berlemont, R.; Treseder, K. K.; Goulden, M.; Brodie, E.

    2016-12-01

    Predicting the functioning of microbial communities under changing environmental conditions remains a key challenge in Earth system science. Metagenomics and other high-throughput molecular approaches can help address this challenge by revealing the functional potential of microbial communities. We coupled metagenomics with models and experimental manipulations to address microbial responses to drought in a California grassland ecosystem along with the consequences for carbon cycling. We developed an approach for extracting trait information from metagenomic data and asked: 1) What is the phylogenetic structure of drought response traits? 2) What is the relationship between these traits and those involved in carbohydrate degradation? 3) How do both classes of traits vary seasonally and with precipitation manipulation? 4) How resilient are these traits in the face of perturbation? We found that drought response traits are phylogenetically conserved at an equivalent of 5-8% ribosomal RNA gene sequence dissimilarity. Experimental drought treatment selected for the genetic potential to degrade starch, xylan, and mixed polysaccharides, suggesting a link between drought response and carbon cycling traits. In addition, microbial communities exposed to experimental drought showed a reduced potential to degrade plant biomass. Particularly among bacteria, seasonal drought had a larger impact on microbial composition, abundance, and carbohydrate-degrading genes compared to experimental drought. Bacterial communities were also more resilient to drought perturbation than fungal communities, which showed legacies of drought perturbation for up to three years. Altogether, these findings imply that microbial communities exhibit trait diversity that facilitates resilience but with substantial time lags and consequences for carbon turnover. 
This information is being used to inform new trait-based models that address the challenge of predicting microbial functioning under precipitation change.

  1. Arthropod phylogenetics in light of three novel millipede (Myriapoda: Diplopoda) mitochondrial genomes with comments on the appropriateness of mitochondrial genome sequence data for inferring deep level relationships.

    PubMed

    Brewer, Michael S; Swafford, Lynn; Spruill, Chad L; Bond, Jason E

    2013-01-01

    Arthropods are the most diverse group of eukaryotic organisms, but their phylogenetic relationships are poorly understood. Herein, we describe three mitochondrial genomes representing orders of millipedes for which complete genomes had not been characterized. Newly sequenced genomes are combined with existing data to characterize the protein coding regions of myriapods and to attempt to reconstruct the evolutionary relationships within the Myriapoda and Arthropoda. The newly sequenced genomes are similar to previously characterized millipede sequences in terms of synteny and length. Unique translocations occurred within the newly sequenced taxa, including one half of the Appalachioria falcifera genome, which is inverted with respect to other millipede genomes. Across myriapods, amino acid conservation levels are highly dependent on the gene region. Additionally, individual loci varied in the level of amino acid conservation. Overall, most gene regions showed low levels of conservation at many sites. Attempts to reconstruct the evolutionary relationships suffered from questionable relationships and low support values. Analyses of phylogenetic informativeness show the lack of signal deep in the trees (i.e., genes evolve too quickly). As a result, the myriapod tree resembles previously published results but lacks convincing support, and, within the arthropod tree, well established groups were recovered as polyphyletic. The novel genome sequences described herein provide useful genomic information concerning millipede groups that had not been investigated. Taken together with existing sequences, the variety of compositions and evolution of myriapod mitochondrial genomes are shown to be more complex than previously thought. Unfortunately, the use of mitochondrial protein-coding regions in deep arthropod phylogenetics appears problematic, a result consistent with previously published studies. 
Lack of phylogenetic signal renders the resulting tree topologies suspect. As such, these data are likely inappropriate for investigating such ancient relationships.

  2. Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan

    PubMed Central

    Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

    2013-01-01

    Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5′ trnK intron, matK, partial 3′ trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense formed a clade with L. scabrum from Japan, Taiwan, and Hong Kong; L. javense, which is widely distributed in East and Southeast Asia; and L. pachyphyllum and L. seemannii, restricted to the Hawaiian Islands; as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands formed a monophyletic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. The percentage of parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species. PMID:23610621
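
    The percentages quoted above rest on counting parsimony-informative characters. As a minimal sketch, assuming the usual definition (an alignment column is parsimony-informative when at least two character states each occur in at least two sequences), the count can be computed as follows. The function names and the toy alignment are hypothetical illustrations, not data or code from the study.

```python
from collections import Counter


def is_parsimony_informative(column, ignore="-?N"):
    """True if at least two states each occur in at least two sequences."""
    counts = Counter(c for c in column.upper() if c not in ignore)
    return sum(1 for n in counts.values() if n >= 2) >= 2


def percent_informative(alignment):
    """Percentage of alignment columns that are parsimony-informative."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("sequences must be aligned to equal length")
    informative = sum(
        is_parsimony_informative("".join(seq[i] for seq in alignment))
        for i in range(length)
    )
    return 100.0 * informative / length


# Toy alignment (made-up data, not from the study).
toy_alignment = [
    "ACGTAC",
    "ACGTAT",
    "ATGAAT",
    "ATGAAC",
]

print(percent_informative(toy_alignment))
```

    Gaps and ambiguous characters are ignored here; other treatments of missing data are possible and would change the counts.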

  3. Geographical origin of Leucobryum boninense Sull. & Lesq. (Leucobryaceae, Musci) endemic to the Bonin Islands, Japan.

    PubMed

    Oguri, Emiko; Yamaguchi, Tomio; Tsubota, Hiromi; Deguchi, Hironori; Murakami, Noriaki

    2013-04-01

    Leucobryum boninense is endemic to the Bonin Islands, Japan, and its related species are widely distributed in Asia and the Pacific. We aimed to clarify the phylogenetic relationships among Leucobryum species and infer the origin of L. boninense. We also describe the utility of the chloroplast trnK intron including matK for resolving the phylogenetic relationships among Leucobryum species, as phylogenetic analyses using trnK intron and/or matK have not been performed well in bryophytes to date. Fifty samples containing 15 species of Leucobryum from Asia and the Pacific were examined for six chloroplast DNA regions including rbcL, rps4, partial 5' trnK intron, matK, partial 3' trnK intron, and trnL-F intergenic spacer plus one nuclear DNA region including ITS. A molecular phylogenetic tree showed that L. boninense formed a clade with L. scabrum from Japan, Taiwan, and Hong Kong; L. javense, which is widely distributed in East and Southeast Asia; and L. pachyphyllum and L. seemannii, restricted to the Hawaiian Islands; as well as with L. scaberulum from the Ryukyus, Japan, Taiwan, and southeastern China. Leucobryum boninense from various islands of the Bonin Islands formed a monophyletic group that was closely related to L. scabrum and L. javense from Japan. Therefore, L. boninense may have evolved from L. scabrum from Japan, Taiwan, or Hong Kong, or L. javense from Japan. We also described the utility of trnK intron including matK. The percentage of parsimony-informative characters in trnK intron sequence data (5.8%) was significantly higher than that from other chloroplast regions, rbcL (2.4%) and rps4 (3.2%) sequence data. Nucleotide sequence data of the trnK intron including matK are more informative than other chloroplast DNA regions for identifying the phylogenetic relationships among Leucobryum species.

  4. Evaluation of properties over phylogenetic trees using stochastic logics.

    PubMed

    Requeno, José Ignacio; Colom, José Manuel

    2016-06-14

    Model checking has recently been introduced as an integrated framework for extracting information from phylogenetic trees using temporal logics as a querying language, an extension of modal logics that imposes restrictions on a boolean formula along a path of events. The phylogenetic tree is considered a transition system modeling evolution as a sequence of genomic mutations (we understand mutation as the different ways that DNA can be changed), while this kind of logic is suitable for traversing it in a strict and exhaustive way. Given a biological property that we wish to inspect over the phylogeny, the verifier returns true if the specification is satisfied or a counterexample that falsifies it. However, this approach has only been considered for qualitative aspects of the phylogeny. In this paper, we address the limitations of the previous framework by including and handling quantitative information such as explicit time or probability. To this end, we apply current probabilistic continuous-time extensions of model checking to phylogenetics. We reinterpret a catalog of qualitative properties in a numerical way, and we also present new properties that could not be analyzed before. For instance, we obtain the likelihood of a tree topology according to a mutation model. As a case study, we analyze several phylogenies in order to obtain the maximum likelihood with the model checking tool PRISM. In addition, we have adapted the software to optimize the computation of maximum likelihoods. We have shown that probabilistic model checking is a competitive framework for describing and analyzing quantitative properties over phylogenetic trees. This formalism adds soundness and readability to the definition of models and specifications. Moreover, the existence of model checking tools hides the underlying technology, sparing biologists the extension, upgrade, debugging and maintenance of a software tool. 
A set of benchmarks justifies the feasibility of our approach.
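
    For readers unfamiliar with what "the likelihood of a tree topology according to a mutation model" means, the sketch below computes it in the conventional way, using Felsenstein's pruning algorithm under the Jukes-Cantor (JC69) model. This is not the PRISM-based model checking approach of the paper; the tree encoding (a leaf is a sequence string, an internal node is a list of (child, branch_length) pairs) and the toy sequences are assumptions made for illustration only.

```python
import math

BASES = "ACGT"


def jc69_prob(i, j, t):
    """JC69 probability of base i becoming base j over branch length t."""
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if i == j else 0.25 - 0.25 * e


def partial(node, site):
    """Likelihood of the data below `node`, for each possible base at `node`."""
    if isinstance(node, str):  # leaf: an aligned sequence
        return [1.0 if b == node[site] else 0.0 for b in BASES]
    result = [1.0] * 4
    for child, t in node:  # internal node: list of (child, branch_length)
        below = partial(child, site)
        for x, bx in enumerate(BASES):
            result[x] *= sum(
                jc69_prob(bx, by, t) * below[y] for y, by in enumerate(BASES)
            )
    return result


def log_likelihood(tree, n_sites):
    """Log-likelihood of the alignment, assuming independent sites."""
    total = 0.0
    for site in range(n_sites):
        # Uniform base frequencies (0.25 each) at the root under JC69.
        total += math.log(sum(0.25 * p for p in partial(tree, site)))
    return total


# Toy three-taxon tree with made-up sequences and branch lengths.
toy_tree = [("ACGT", 0.1), ([("ACGA", 0.05), ("ACGA", 0.05)], 0.1)]
print(log_likelihood(toy_tree, 4))
```

    A maximum likelihood search would repeat this evaluation while varying branch lengths and topologies; that optimisation step is omitted here.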

  5. An Expanded Combined Evidence Approach to the Gavialis Problem Using Geometric Morphometric Data from Crocodylian Braincases and Eustachian Systems

    PubMed Central

    Gold, Maria Eugenia Leone; Brochu, Christopher A.; Norell, Mark A.

    2014-01-01

    The phylogenetic position of the Indian gharial (Gavialis gangeticus) is disputed - morphological characters place Gavialis as the sister to all other extant crocodylians, whereas molecular and combined analyses find Gavialis and the false gharial (Tomistoma schlegelii) to be sister taxa. Geometric morphometric techniques have only begun to be applied to this issue, but most of these studies have focused on the exterior of the skull. The braincase has provided useful phylogenetic information for basal crurotarsans, but has not been explored for the crown group. The Eustachian system is thought to vary phylogenetically in Crocodylia, but has not been analytically tested. To determine if gross morphology of the crocodylian braincase proves informative to the relationships of Gavialis and Tomistoma, we used two- and three-dimensional geometric morphometric approaches. Internal braincase images were obtained using high-resolution computerized tomography scans. A principal components analysis identified that the first component axis was primarily associated with size and did not show groupings that divide the specimens by phylogenetic affinity. Sliding semi-landmarks and a relative warp analysis indicate that a unique Eustachian morphology separates Gavialis from other extant members of Crocodylia. Ontogenetic expansion of the braincase results in a more dorsoventrally elongate median Eustachian canal. Changes in the shape of the Eustachian system do provide phylogenetic distinctions between major crocodylian clades. Each morphometric dataset, consisting of continuous morphological characters, was added independently to a combined cladistic analysis of discrete morphological and molecular characters. The braincase data alone produced a clade that included crocodylids and Gavialis, whereas the Eustachian data resulted in Gavialis being considered a basally divergent lineage. 
When each morphometric dataset was used in a combined analysis with discrete morphological and molecular characters, it generated a tree that matched the topology of the molecular phylogeny of Crocodylia. PMID:25198124

  6. Life history and biogeographic diversification of an endemic western North American freshwater fish clade using a comparative species tree approach.

    PubMed

    Baumsteiger, Jason; Kinziger, Andrew P; Aguilar, Andres

    2012-12-01

    The west coast of North America contains a number of biogeographic freshwater provinces which reflect an ever-changing aquatic landscape. Clues to understanding this complex structure are often encapsulated genetically in the ichthyofauna, though frequently as unresolved evolutionary relationships and putative cryptic species. Advances in molecular phylogenetics through species tree analyses now allow for improved exploration of these relationships. Using a comprehensive approach, we analyzed two mitochondrial and nine nuclear loci for a group of endemic freshwater fish (sculpin-Cottus) known for a wide ranging distribution and complex species structure in this region. Species delimitation techniques identified three novel cryptic lineages, all well supported by phylogenetic analyses. Comparative phylogenetic analyses consistently found five distinct clades reflecting a number of unique biogeographic provinces. Some internal node relationships varied by species tree reconstruction method, and were associated with either Bayesian or maximum likelihood statistical approaches or between mitochondrial, nuclear, and combined datasets. Limited cases of mitochondrial capture were also evident, suggestive of putative ancestral hybridization between species. Biogeographic diversification was associated with four major regions and revealed historical faunal exchanges across regions. Mapping of an important life-history character (amphidromy) revealed two separate instances of trait evolution, a transition that has occurred repeatedly in Cottus. This study demonstrates the power of current phylogenetic methods, the need for a comprehensive phylogenetic approach, and the potential for sculpin to serve as an indicator of biogeographic history for native ichthyofauna in the region. Copyright © 2012 Elsevier Inc. All rights reserved.

  7. Whole genome sequence phylogenetic analysis of four Mexican rabies viruses isolated from cattle.

    PubMed

    Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F

    2017-08-01

    Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive for rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced on an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil and French Guiana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting a common source of infection. The phylogenetic analysis shows that the type of host is different for the different regions in the American Continent; rabies is more related to bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Patterns of Phylogenetic Diversity of Subtropical Rainforest of the Great Sandy Region, Australia Indicate Long Term Climatic Refugia.

    PubMed

    Howard, Marion G; McDonald, William J F; Forster, Paul I; Kress, W John; Erickson, David; Faith, Daniel P; Shapcott, Alison

    2016-01-01

    Australia's Great Sandy Region is of international significance, containing two World Heritage areas and patches of rainforest growing on white sand. Previous broad-scale analysis found the Great Sandy biogeographic subregion contained a significantly more phylogenetically even subset of species than expected by chance, contrasting with rainforest on white sand in Peru. This study aimed to test the patterns of rainforest diversity and relatedness at a finer scale and to investigate why we may find different patterns of phylogenetic evenness compared with rainforests on white sands in other parts of the world. This study focussed on rainforest sites within the Great Sandy and surrounding areas in South East Queensland (SEQ), Australia. We undertook field collections, expanded our three-marker DNA barcode library of SEQ rainforest plants and updated the phylogeny to 95% of the SEQ rainforest flora. We sampled species composition of rainforest in fixed area plots from 100 sites. We calculated phylogenetic diversity (PD) measures as well as species richness (SR) for each rainforest community. These, combined with site variables such as geology, were used to evaluate patterns and relatedness. We found that many rainforest communities in the Great Sandy area were significantly phylogenetically even at the individual site level, consistent with a broader subregion analysis. Sites from adjacent areas were either not significant or were significantly phylogenetically clustered. Some results in the neighbouring areas were consistent with historic range expansions. In contrast with expectations, sites located on the oldest substrates had significantly lower phylogenetic diversity (PD). Fraser Island was once connected to mainland Australia; our results are consistent with a region geologically old enough to have continuously supported rainforest in refugia. 
The interface of tropical and temperate floras in part also explains the significant phylogenetic evenness and higher than expected phylogenetic diversity.
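
    The abstract does not say which PD measures were calculated. One widely used measure is Faith's PD, the total branch length of the tree connecting a community's species to the root. The sketch below shows that calculation alongside species richness; the tree encoding (each non-root node maps to its parent and branch length) and the toy tree are hypothetical and not taken from the study.

```python
def faith_pd(parent, taxa):
    """Sum branch lengths on the union of tip-to-root paths for `taxa`.

    `parent` maps each non-root node to (parent_node, branch_length).
    """
    visited = set()
    total = 0.0
    for tip in taxa:
        node = tip
        # Climb toward the root, counting each branch only once.
        while node in parent and node not in visited:
            visited.add(node)
            up, length = parent[node]
            total += length
            node = up
    return total


def species_richness(taxa):
    """Number of distinct species in the community."""
    return len(set(taxa))


# Toy tree (made-up): ((A:1,B:1):2,(C:1.5,D:1.5):1.5);
toy_tree = {
    "A": ("n1", 1.0),
    "B": ("n1", 1.0),
    "n1": ("root", 2.0),
    "C": ("n2", 1.5),
    "D": ("n2", 1.5),
    "n2": ("root", 1.5),
}

for community in (["A", "B"], ["A", "C"]):
    print(community, species_richness(community), faith_pd(toy_tree, community))
```

    Both toy communities have the same species richness, but the one spanning both clades has the larger PD. Judging whether a site is phylogenetically even or clustered additionally requires comparison against a null model, which is not shown here.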

  9. Patterns of Phylogenetic Diversity of Subtropical Rainforest of the Great Sandy Region, Australia Indicate Long Term Climatic Refugia

    PubMed Central

    Howard, Marion G.; McDonald, William J. F.; Forster, Paul I.; Kress, W. John; Erickson, David; Faith, Daniel P.; Shapcott, Alison

    2016-01-01

    Australia’s Great Sandy Region is of international significance, containing two World Heritage areas and patches of rainforest growing on white sand. Previous broad-scale analysis found the Great Sandy biogeographic subregion contained a significantly more phylogenetically even subset of species than expected by chance, contrasting with rainforest on white sand in Peru. This study aimed to test the patterns of rainforest diversity and relatedness at a finer scale and to investigate why we may find different patterns of phylogenetic evenness compared with rainforests on white sands in other parts of the world. This study focussed on rainforest sites within the Great Sandy and surrounding areas in South East Queensland (SEQ), Australia. We undertook field collections, expanded our three-marker DNA barcode library of SEQ rainforest plants and updated the phylogeny to 95% of the SEQ rainforest flora. We sampled species composition of rainforest in fixed area plots from 100 sites. We calculated phylogenetic diversity (PD) measures as well as species richness (SR) for each rainforest community. These, combined with site variables such as geology, were used to evaluate patterns and relatedness. We found that many rainforest communities in the Great Sandy area were significantly phylogenetically even at the individual site level, consistent with a broader subregion analysis. Sites from adjacent areas were either not significant or were significantly phylogenetically clustered. Some results in the neighbouring areas were consistent with historic range expansions. In contrast with expectations, sites located on the oldest substrates had significantly lower phylogenetic diversity (PD). Fraser Island was once connected to mainland Australia; our results are consistent with a region geologically old enough to have continuously supported rainforest in refugia. 
The interface of tropical and temperate floras in part also explains the significant phylogenetic evenness and higher than expected phylogenetic diversity. PMID:27119149

  10. The utility of DNA sequences of an intron from the beta-fibrinogen gene in phylogenetic analysis of woodpeckers (Aves: Picidae).

    PubMed

    Prychitko, T M; Moore, W S

    1997-10-01

    Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships among closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence intron 7 of the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrially encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of intron 7 of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press

  11. Molecular characterization of chikungunya virus from Andhra Pradesh, India & phylogenetic relationship with Central African isolates.

    PubMed

    M Naresh Kumar, C V; Anthony Johnson, A M; R Sai Gopal, D V

    2007-12-01

    Chikungunya virus has caused numerous large outbreaks in India. Suspected blood samples from the epidemic were collected from the Rayalaseema region of Andhra Pradesh and characterized to identify the causative agent. RT-PCR was used for screening of suspected blood samples. Primers were designed to amplify the partial E1 gene, and the amplified fragment was cloned and sequenced. The sequence was analyzed and compared with other geographical isolates to find the phylogenetic relationship. The sequence was submitted to the GenBank DNA database (accession DQ888620). Comparative nucleotide homology analysis of the AP Ra-CTR isolate (CHIKAPRa-CTR) with the other isolates of Chikungunya virus revealed 94.7 ± 3.6 per cent homology at the nucleotide level and 96.8 ± 3.2 per cent homology at the amino acid level. The current epidemic was caused by the Central African genotype of CHIKV, grouped in the Central Africa cluster in phylogenetic trees generated based on nucleotide and amino acid sequences.
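
    The abstract does not describe how the homology percentages and their ± ranges were derived. A minimal sketch of one common approach, assuming pairwise percent identity over aligned sequences summarised as mean and sample standard deviation, is given below. The sequences are made-up toy data, not the E1 sequences from the study.

```python
from statistics import mean, stdev


def percent_identity(a, b, gap="-"):
    """Percent of identical positions between two aligned sequences.

    Positions where either sequence has a gap are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue
        compared += 1
        matches += x == y
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Toy aligned sequences (made-up): one query isolate and three references.
query = "ACGTACGTAC"
references = ["ACGTACGTAC", "ACGTACGTAA", "ACGTTCGTAA"]

identities = [percent_identity(query, ref) for ref in references]
print(f"{mean(identities):.1f} +/- {stdev(identities):.1f} per cent identity")
```

    The same function works for aligned amino acid sequences, since it only compares characters position by position.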

  12. Homologization of the flight musculature of Zygoptera (Insecta: Odonata) and Neoptera (Insecta).

    PubMed

    Büsse, Sebastian; Genet, Cécile; Hörnschemeyer, Thomas

    2013-01-01

    Among the winged insects (Pterygota), the dragonflies and damselflies (Odonata) are unique for several reasons. Behaviourally, they are aerial predators that hunt and catch their prey exclusively in flight. Morphologically, the flight apparatus of Odonata is significantly different from what is found in the remaining Pterygota. However, to understand the phylogenetic relationships of winged insects and the origin and evolution of insect flight in general, it is essential to know how the elements of the odonatan flight apparatus relate to those of the other Pterygota. Here we present a comprehensive, comparative morphological investigation of the thoracic flight musculature of damselflies (Zygoptera). Based on our new data we propose a homologization scheme for the thoracic musculature throughout Pterygota. The new homology hypotheses will allow for future comparative work and especially for phylogenetic analyses using characters of the thoracic musculature throughout all winged insects. This will contribute to understanding the early evolution of pterygote insects and their basal phylogenetic relationships.

  13. Homologization of the Flight Musculature of Zygoptera (Insecta: Odonata) and Neoptera (Insecta)

    PubMed Central

    Büsse, Sebastian; Genet, Cécile; Hörnschemeyer, Thomas

    2013-01-01

    Among the winged insects (Pterygota), the dragonflies and damselflies (Odonata) are unique for several reasons. Behaviourally, they are aerial predators that hunt and catch their prey exclusively in flight. Morphologically, the flight apparatus of Odonata is significantly different from what is found in the remaining Pterygota. However, to understand the phylogenetic relationships of winged insects and the origin and evolution of insect flight in general, it is essential to know how the elements of the odonatan flight apparatus relate to those of the other Pterygota. Here we present a comprehensive, comparative morphological investigation of the thoracic flight musculature of damselflies (Zygoptera). Based on our new data we propose a homologization scheme for the thoracic musculature throughout Pterygota. The new homology hypotheses will allow for future comparative work and especially for phylogenetic analyses using characters of the thoracic musculature throughout all winged insects. This will contribute to understanding the early evolution of pterygote insects and their basal phylogenetic relationships. PMID:23457479

  14. General quantitative genetic methods for comparative biology: phylogenies, taxonomies and multi-trait models for continuous and categorical characters.

    PubMed

    Hadfield, J D; Nakagawa, S

    2010-03-01

    Although many of the statistical techniques used in comparative biology were originally developed in quantitative genetics, subsequent development of comparative techniques has progressed in relative isolation. Consequently, many of the new and planned developments in comparative analysis already have well-tested solutions in quantitative genetics. In this paper, we take three recent publications that develop phylogenetic meta-analysis, either implicitly or explicitly, and show how they can be considered as quantitative genetic models. We highlight some of the difficulties with the proposed solutions, and demonstrate that standard quantitative genetic theory and software offer solutions. We also show how results from Bayesian quantitative genetics can be used to create efficient Markov chain Monte Carlo algorithms for phylogenetic mixed models, thereby extending their generality to non-Gaussian data. Of particular utility is the development of multinomial models for analysing the evolution of discrete traits, and the development of multi-trait models in which traits can follow different distributions. Meta-analyses often include a nonrandom collection of species for which the full phylogenetic tree has only been partly resolved. Using missing data theory, we show how the presented models can be used to correct for nonrandom sampling and show how taxonomies and phylogenies can be combined to give a flexible framework with which to model dependence.

  15. Is specialization an evolutionary dead end? Testing for differences in speciation, extinction and trait transition rates across diverse phylogenies of specialists and generalists.

    PubMed

    Day, E H; Hua, X; Bromham, L

    2016-06-01

    Specialization has often been claimed to be an evolutionary dead end, with specialist lineages having a reduced capacity to persist or diversify. In a phylogenetic comparative framework, an evolutionary dead end may be detectable from the phylogenetic distribution of specialists, if specialists rarely give rise to large, diverse clades. Previous phylogenetic studies of the influence of specialization on macroevolutionary processes have demonstrated a range of patterns, including examples where specialists have both higher and lower diversification rates than generalists, as well as examples where the rates of evolutionary transitions from generalists to specialists are higher, lower or equal to transitions from specialists to generalists. Here, we wish to ask whether these varied answers are due to the differences in macroevolutionary processes in different clades, or partly due to differences in methodology. We analysed ten phylogenies containing multiple independent origins of specialization and quantified the phylogenetic distribution of specialists by applying a common set of metrics to all datasets. We compared the tip branch lengths of specialists to generalists, the size of specialist clades arising from each evolutionary origin of a specialized trait and whether specialists tend to be clustered or scattered on phylogenies. For each of these measures, we compared the observed values to expectations under null models of trait evolution and expected outcomes under alternative macroevolutionary scenarios. We found that specialization is sometimes an evolutionary dead end: in two of the ten case studies (pollinator-specific plants and host-specific flies), specialization is associated with a reduced rate of diversification or trait persistence. However, in the majority of studies, we could not distinguish the observed phylogenetic distribution of specialists from null models in which specialization has no effect on diversification or trait persistence. 
© 2016 European Society For Evolutionary Biology.
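
    One of the comparisons described above, tip branch lengths of specialists versus generalists, can be sketched as a simple permutation test. This is a deliberate simplification: the study compares observed values with null models of trait evolution, whereas the sketch below only shuffles trait labels among tips. The data, the one-sided direction of the test, and all names are hypothetical.

```python
import random


def mean(values):
    return sum(values) / len(values)


def tip_length_permutation_test(tip_lengths, is_specialist, n_perm=2000, seed=0):
    """Compare mean tip branch length of specialists and generalists.

    Returns the observed difference (specialist mean minus generalist mean)
    and a one-sided p-value for specialists having shorter tips, obtained by
    shuffling the specialist/generalist labels among the tips.
    """
    rng = random.Random(seed)

    def difference(labels):
        spec = [x for x, s in zip(tip_lengths, labels) if s]
        gen = [x for x, s in zip(tip_lengths, labels) if not s]
        return mean(spec) - mean(gen)

    observed = difference(is_specialist)
    labels = list(is_specialist)
    at_least_as_extreme = 0
    for _ in range(n_perm):
        rng.shuffle(labels)
        if difference(labels) <= observed:
            at_least_as_extreme += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    p_value = (at_least_as_extreme + 1) / (n_perm + 1)
    return observed, p_value


# Hypothetical tip branch lengths: three specialists, five generalists.
LENGTHS = [0.5, 0.6, 0.4, 2.0, 2.5, 1.8, 2.2, 3.0]
SPECIALIST = [True, True, True, False, False, False, False, False]
OBSERVED, P_VALUE = tip_length_permutation_test(LENGTHS, SPECIALIST)
print(OBSERVED, P_VALUE)
```

    Label shuffling ignores phylogenetic structure, which is one reason a model-based null, as used in the study, is generally preferable.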

  16. The Skull of Phyllomedusa sauvagii (Anura, Hylidae).

    PubMed

    Ruiz-Monachesi, Mario R; Lavilla, Esteban O; Montero, Ricardo

    2016-05-01

    The hylid genus Phyllomedusa comprises charismatic frogs commonly known as monkey, leaf or green frogs, and is the most diverse genus of the subfamily Phyllomedusinae, including about 31 species. Although there is some information about the anatomy of these frogs, little is known about their osteology. Here the adult skull of Phyllomedusa sauvagii, both articulated and disarticulated, is described and the intraspecific variation is reported. Additionally, cartilages associated with the adult skull, such as the nasal capsules, auditory apparatus, and hyobranchial apparatus, are included in the analysis. Further examination of disarticulated bones reveals their remarkable complexity, specifically in the sphenethmoid and the occipital region. The description of disarticulated bones is useful for the identification of fossil remains and provides morphological characters that are phylogenetically informative. Compared with the available information on other species of the genus, the skull of Phyllomedusa sauvagii resembles that of P. vaillantii and P. venusta more than that of P. atelopoides. © 2016 Wiley Periodicals, Inc.

  17. Allopatric tuberculosis host–pathogen relationships are associated with greater pulmonary impairment

    PubMed Central

    Pasipanodya, Jotam G.; Moonan, Patrick K.; Vecino, Edgar; Miller, Thaddeus L.; Fernandez, Michel; Slocum, Philip; Drewyer, Gerry; Weis, Stephen E.

    2015-01-01

    Background: Host–pathogen relationships can be classified as allopatric, when the pathogen originated from a geographic area separate from and not overlapping with that of the host, or sympatric, when host and pathogen shared a common ancestral geographic location. It remains unclear whether host–pathogen relationships, as defined by phylogenetic lineage, influence clinical outcome. We sought to examine the association between allopatric and sympatric phylogenetic Mycobacterium tuberculosis lineages and pulmonary impairment after tuberculosis (PIAT). Methods: Pulmonary function tests were performed on patients 16 years of age and older who had received ≥20 weeks of treatment for culture-confirmed M. tuberculosis complex. Forced Expiratory Volume in 1 s (FEV1) ≥80%, Forced Vital Capacity (FVC) ≥80% and FEV1/FVC >70% of predicted were considered normal. Other results defined pulmonary impairment. Spoligotype and 12-locus mycobacterial interspersed repetitive units-variable number of tandem repeats (MIRU-VNTR) were used to assign phylogenetic lineage. PIAT severity was compared between host–pathogen relationships, which were defined by geography and ethnic population. We used multivariate logistic regression modeling to calculate adjusted odds ratios (aOR) between phylogenetic lineage and PIAT. Results: Self-reported continental ancestry was correlated with Mycobacterium tuberculosis lineage (p < 0.001). In multivariate analyses adjusting for phylogenetic lineage, age and smoking, the overall aOR for PIAT in subjects with allopatric host–pathogen relationships was 1.8 (95% confidence interval [CI]: 1.1, 2.9) compared to sympatric relationships. Smoking >30 pack-years was also associated with PIAT (aOR: 3.2; 95% CI: 1.5, 7.2) relative to smoking <1 pack-year. Conclusions: PIAT frequency and severity vary by host–pathogen relationship and heavy cigarette consumption, but not by phylogenetic lineage alone. Patients whose disease resulted from an allopatric host–pathogen relationship were more likely to have PIAT than patients whose disease resulted from a sympatric host–pathogen relationship. Further study of this association may identify ways that treatment and preventive efforts can be tailored to specific lineages and racial/ethnic populations. PMID:23501297
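
    The spirometric definition given in the Methods can be written as a short rule. The sketch below encodes only the thresholds stated in the abstract (FEV1 ≥80%, FVC ≥80% and FEV1/FVC >70% are normal, anything else is impairment). The function name and the example values are hypothetical, and the study's grading of severity is not reproduced here.

```python
def has_pulmonary_impairment(fev1_pct, fvc_pct, fev1_fvc_pct):
    """Return True when spirometry falls outside the 'normal' definition.

    All three arguments are percentages, compared with the thresholds
    quoted in the abstract.
    """
    normal = fev1_pct >= 80 and fvc_pct >= 80 and fev1_fvc_pct > 70
    return not normal


# Hypothetical patients.
print(has_pulmonary_impairment(85, 90, 75))  # all three criteria met
print(has_pulmonary_impairment(85, 90, 70))  # ratio is not above 70
print(has_pulmonary_impairment(72, 88, 78))  # FEV1 below 80
```

    The ratio criterion is strict (>70), so a value of exactly 70 counts as impairment under this reading, whereas FEV1 or FVC of exactly 80 counts as normal.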

  18. Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information

    PubMed Central

    McDonald, Daniel; Gonzalez, Antonio; Navas-Molina, Jose A.; Jiang, Lingjing; Xu, Zhenjiang Zech; Winker, Kevin; Kado, Deborah M.; Orwoll, Eric; Manary, Mark; Mirarab, Siavash

    2018-01-01

    ABSTRACT Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods. PMID:29719869
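
    To show why metrics such as Faith's PD need a tree at all, here is a minimal sketch of the calculation: the summed length of every branch on the paths connecting the observed taxa to the root. Including the path to the root is one common convention and is an assumption here, as are the toy tree and the function name. This is not the SEPP or q2-fragment-insertion code.

```python
# Toy rooted reference tree stored as child -> (parent, branch length).
# Topology and lengths are made up for illustration only.
TREE = {
    "A": ("n1", 1.0),
    "B": ("n1", 1.0),
    "n1": ("root", 2.0),
    "C": ("root", 3.0),
}


def faith_pd(tree, observed_tips):
    """Sum each branch once if it lies between an observed tip and the root."""
    counted = set()
    total = 0.0
    for tip in observed_tips:
        node = tip
        # Walk towards the root, stopping early at branches already counted.
        while node in tree and node not in counted:
            parent, length = tree[node]
            counted.add(node)
            total += length
            node = parent
    return total


print(faith_pd(TREE, ["A", "B"]))  # 1 + 1 + 2 = 4.0
print(faith_pd(TREE, ["A", "C"]))  # 1 + 2 + 3 = 6.0
```

    Samples containing A and B score lower than samples containing A and C because A and B share most of their evolutionary history. A poorly resolved tree built from short fragments would distort exactly these shared branch lengths.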

  19. RNA-Seq based phylogeny recapitulates previous phylogeny of the genus Flaveria (Asteraceae) with some modifications.

    PubMed

    Lyu, Ming-Ju Amy; Gowik, Udo; Kelly, Steve; Covshoff, Sarah; Mallmann, Julia; Westhoff, Peter; Hibberd, Julian M; Stata, Matt; Sage, Rowan F; Lu, Haorong; Wei, Xiaofeng; Wong, Gane Ka-Shu; Zhu, Xin-Guang

    2015-06-18

    The genus Flaveria has been extensively used as a model to study the evolution of C4 photosynthesis as it contains C3 and C4 species as well as a number of species that exhibit intermediate types of photosynthesis. The current phylogenetic tree of the genus Flaveria contains 21 of the 23 known Flaveria species and has been previously constructed using a combination of morphological data and three non-coding DNA sequences (nuclear encoded ETS, ITS and chloroplast encoded trnL-F). Here we developed a new strategy to update the phylogenetic tree of 16 Flaveria species based on RNA-Seq data. The updated phylogeny is largely congruent with the previously published tree but with some modifications. We propose that the data collection method provided in this study can be used as a generic method for phylogenetic tree reconstruction if the target species has no genomic information. We also showed that an "F. pringlei" genotype recently used in a number of labs may be a hybrid between F. pringlei (C3) and F. angustifolia (C3-C4). We propose that the new strategy of obtaining phylogenetic sequences outlined in this study can be used to construct robust trees in a larger number of taxa. The updated Flaveria phylogenetic tree also supports a hypothesis of stepwise and parallel evolution of C4 photosynthesis in the Flaveria clade.

  20. How Many Is Enough?—Statistical Principles for Lexicostatistics

    PubMed Central

    Zhang, Menghan; Gong, Tao

    2016-01-01

    Lexicostatistics has been applied in linguistics to inform phylogenetic relations among languages. There are two important yet not well-studied parameters in this approach: the conventional size of vocabulary list to collect potentially true cognates and the minimum matching instances required to confirm a recurrent sound correspondence. Here, we derive two statistical principles from stochastic theorems to quantify these parameters. These principles validate the practice of using the Swadesh 100- and 200-word lists to indicate degree of relatedness between languages, and enable a frequency-based, dynamic threshold to detect recurrent sound correspondences. Using statistical tests, we further evaluate the generality of the Swadesh 100-word list compared to the Swadesh 200-word list and other 100-word lists sampled randomly from the Swadesh 200-word list. All these provide mathematical support for applying lexicostatistics in historical and comparative linguistics. PMID:28018261
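
    The quantity that lexicostatistics typically starts from, the share of meanings on a word list for which two languages have cognate words, can be sketched as follows. This is the classical shared-cognate percentage, not the statistical principles derived in the paper. The cognate-class labels, the meanings, and the function name are hypothetical.

```python
def shared_cognate_fraction(lang_a, lang_b):
    """Fraction of meanings present in both lists whose words are cognate.

    Each argument maps a meaning to a cognate-class label. Two words are
    treated as cognate when their labels are equal.
    """
    common_meanings = [m for m in lang_a if m in lang_b]
    if not common_meanings:
        raise ValueError("the two word lists share no meanings")
    matches = sum(1 for m in common_meanings if lang_a[m] == lang_b[m])
    return matches / len(common_meanings)


# Hypothetical cognate-class assignments for four meanings.
LANG_A = {"water": 1, "fire": 1, "stone": 2, "hand": 1}
LANG_B = {"water": 1, "fire": 2, "stone": 2, "hand": 1}
FRACTION = shared_cognate_fraction(LANG_A, LANG_B)
print(FRACTION)  # 3 of 4 meanings share a cognate class
```

    With only four meanings the estimate is very coarse, which is the kind of list-size question the abstract addresses for the Swadesh 100- and 200-word lists.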
