Wen, Dingqiao; Yu, Yun; Hahn, Matthew W.; Nakhleh, Luay
2016-01-01
The role of hybridization and subsequent introgression has been demonstrated in an increasing number of species. Recently, Fontaine et al. (Science, 347, 2015, 1258524) conducted a phylogenomic analysis of six members of the Anopheles gambiae species complex. Their analysis revealed a reticulate evolutionary history and pointed to extensive introgression on all four autosomal arms. The study further highlighted the complex evolutionary signals that the co-occurrence of incomplete lineage sorting (ILS) and introgression can give rise to in phylogenomic analyses. While tree-based methodologies were used in the study, phylogenetic networks provide a more natural model to capture reticulate evolutionary histories. In this work, we reanalyse the Anopheles data using a recently devised framework that combines the multispecies coalescent with phylogenetic networks. This framework allows us to capture ILS and introgression simultaneously, and forms the basis for statistical methods for inferring reticulate evolutionary histories. The new analysis reveals a phylogenetic network with multiple hybridization events, some of which differ from those reported in the original study. To elucidate the extent and patterns of introgression across the genome, we devise a new method that quantifies the use of reticulation branches in the phylogenetic network by each genomic region. Applying the method to the mosquito data set reveals the evolutionary history of all the chromosomes. This study highlights the utility of ‘network thinking’ and the new insights it can uncover, in particular in phylogenomic analyses of large data sets with extensive gene tree incongruence. PMID:26808290
Lee, Dong-Hun
2017-01-01
To determine the genetic and epidemiological relationship of infectious bronchitis virus (IBV) isolates from commercial poultry to attenuated live IBV vaccines we conducted a phylogenetic network analysis on the full-length S1 sequence for Arkansas (Ark), Massachusetts (Mass) and Delmarva/1639 (DMV/1639) type viruses isolated in 2015 from clinical cases by 3 different diagnostic laboratories. Phylogenetic network analysis of Ark isolates showed two predominant groups linked by 2 mutations, consistent with subpopulations found in commercial vaccines for this IBV type. In addition, a number of satellite groups surrounding the two predominant populations were observed for the Ark type virus, which is likely due to mutations associated with the nature of this vaccine to persist in flocks. The phylogenetic network analysis of Mass-type viruses shows two groupings corresponding to different manufacturers vaccine sequences. No satellite groups were observed for Mass-type viruses, which is consistent with no persistence of this vaccine type in the field. At the time of collection, no vaccine was being used for the DMV/1639 type viruses and phylogenetic network analysis showed a dispersed network suggesting no clear change in genetic distribution. Selection pressure analysis showed that the DMV/1639 and Mass-type strains were evolving under negative selection, whereas the Ark type viruses had evolved under positive selection. This data supports the hypothesis that live attenuated vaccine usage does play a role in the genetic profile of similar IB viruses in the field and phylogenetic network analysis can be used to identify vaccine and vaccine origin isolates, which is important for our understanding of the role live vaccines play in the evolutionary trajectory of those viruses. PMID:28472110
Construction of phylogenetic trees by kernel-based comparative analysis of metabolic networks.
Oh, S June; Joung, Je-Gun; Chang, Jeong-Ho; Zhang, Byoung-Tak
2006-06-06
To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees. To compare the structures of metabolic networks in organisms, we adopted the exponential graph kernel, which is a kernel-based approach with a labeled graph that includes a label matrix and an adjacency matrix. To construct the phylogenetic trees, we used an unweighted pair-group method with arithmetic mean, i.e., a hierarchical clustering algorithm. We applied the kernel-based network profiling method in a comparative analysis of nine carbohydrate metabolic networks from 81 biological species encompassing Archaea, Eukaryota, and Eubacteria. The resulting phylogenetic hierarchies generally support the tripartite scheme of three domains rather than the two domains of prokaryotes and eukaryotes. By combining the kernel machines with metabolic information, the method infers the context of biosphere development that covers physiological events required for adaptation by genetic reconstruction. The results show that one may obtain a global view of the tree of life by comparing the metabolic pathway structures using meta-level information rather than sequence information. This method may yield further information about biological evolution, such as the history of horizontal transfer of each gene, by studying the detailed structure of the phylogenetic tree constructed by the kernel-based method.
A network perspective on the topological importance of enzymes and their phylogenetic conservation
Liu, Wei-chung; Lin, Wen-hsien; Davis, Andrew J; Jordán, Ferenc; Yang, Hsih-te; Hwang, Ming-jing
2007-01-01
Background A metabolic network is the sum of all chemical transformations or reactions in the cell, with the metabolites being interconnected by enzyme-catalyzed reactions. Many enzymes exist in numerous species while others occur only in a few. We ask if there are relationships between the phylogenetic profile of an enzyme, or the number of different bacterial species that contain it, and its topological importance in the metabolic network. Our null hypothesis is that phylogenetic profile is independent of topological importance. To test our null hypothesis we constructed an enzyme network from the KEGG (Kyoto Encyclopedia of Genes and Genomes) database. We calculated three network indices of topological importance: the degree or the number of connections of a network node; closeness centrality, which measures how close a node is to others; and betweenness centrality measuring how frequently a node appears on all shortest paths between two other nodes. Results Enzyme phylogenetic profile correlates best with betweenness centrality and also quite closely with degree, but poorly with closeness centrality. Both betweenness and closeness centralities are non-local measures of topological importance and it is intriguing that they have contrasting power of predicting phylogenetic profile in bacterial species. We speculate that redundancy in an enzyme network may be reflected by betweenness centrality but not by closeness centrality. We also discuss factors influencing the correlation between phylogenetic profile and topological importance. Conclusion Our analysis falsifies the hypothesis that phylogenetic profile of enzymes is independent of enzyme network importance. Our results show that phylogenetic profile correlates better with degree and betweenness centrality, but less so with closeness centrality. Enzymes that occur in many bacterial species tend to be those that have high network importance. We speculate that this phenomenon originates in mechanisms driving network evolution. Closeness centrality reflects phylogenetic profile poorly. This is because metabolic networks often consist of distinct functional modules and some are not in the centre of the network. Enzymes in these peripheral parts of a network might be important for cell survival and should therefore occur in many bacterial species. They are, however, distant from other enzymes in the same network. PMID:17425808
Detecting Network Communities: An Application to Phylogenetic Analysis
Andrade, Roberto F. S.; Rocha-Neto, Ivan C.; Santos, Leonardo B. L.; de Santana, Charles N.; Diniz, Marcelo V. C.; Lobão, Thierry Petit; Goés-Neto, Aristóteles; Pinho, Suani T. R.; El-Hani, Charbel N.
2011-01-01
This paper proposes a new method to identify communities in generally weighted complex networks and apply it to phylogenetic analysis. In this case, weights correspond to the similarity indexes among protein sequences, which can be used for network construction so that the network structure can be analyzed to recover phylogenetically useful information from its properties. The analyses discussed here are mainly based on the modular character of protein similarity networks, explored through the Newman-Girvan algorithm, with the help of the neighborhood matrix . The most relevant networks are found when the network topology changes abruptly revealing distinct modules related to the sets of organisms to which the proteins belong. Sound biological information can be retrieved by the computational routines used in the network approach, without using biological assumptions other than those incorporated by BLAST. Usually, all the main bacterial phyla and, in some cases, also some bacterial classes corresponded totally (100%) or to a great extent (>70%) to the modules. We checked for internal consistency in the obtained results, and we scored close to 84% of matches for community pertinence when comparisons between the results were performed. To illustrate how to use the network-based method, we employed data for enzymes involved in the chitin metabolic pathway that are present in more than 100 organisms from an original data set containing 1,695 organisms, downloaded from GenBank on May 19, 2007. A preliminary comparison between the outcomes of the network-based method and the results of methods based on Bayesian, distance, likelihood, and parsimony criteria suggests that the former is as reliable as these commonly used methods. We conclude that the network-based method can be used as a powerful tool for retrieving modularity information from weighted networks, which is useful for phylogenetic analysis. PMID:21573202
Transforming phylogenetic networks: Moving beyond tree space.
Huber, Katharina T; Moulton, Vincent; Wu, Taoyang
2016-09-07
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.
Chase, Mark W.; Kim, Joo-Hwan
2013-01-01
Phylogenetic analysis aims to produce a bifurcating tree, which disregards conflicting signals and displays only those that are present in a large proportion of the data. However, any character (or tree) conflict in a dataset allows the exploration of support for various evolutionary hypotheses. Although data-display network approaches exist, biologists cannot easily and routinely use them to compute rooted phylogenetic networks on real datasets containing hundreds of taxa. Here, we constructed an original neighbour-net for a large dataset of Asparagales to highlight the aspects of the resulting network that will be important for interpreting phylogeny. The analyses were largely conducted with new data collected for the same loci as in previous studies, but from different species accessions and greater sampling in many cases than in published analyses. The network tree summarised the majority data pattern in the characters of plastid sequences before tree building, which largely confirmed the currently recognised phylogenetic relationships. Most conflicting signals are at the base of each group along the Asparagales backbone, which helps us to establish the expectancy and advance our understanding of some difficult taxa relationships and their phylogeny. The network method should play a greater role in phylogenetic analyses than it has in the past. To advance the understanding of evolutionary history of the largest order of monocots Asparagales, absolute diversification times were estimated for family-level clades using relaxed molecular clock analyses. PMID:23544071
A program to compute the soft Robinson-Foulds distance between phylogenetic networks.
Lu, Bingxin; Zhang, Louxin; Leong, Hon Wai
2017-03-14
Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both the problems are NP-complete. A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson-Foulds distance between phylogenetic networks. Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, the distribution of the Soft Robinson-Foulds distance between phylogenetic networks is demonstrated to be unlikely normal by our simulation data.
Phylogenetic comparative methods on phylogenetic networks with reticulations.
Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile
2018-04-25
The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. All the methods developed here are available in the Julia package PhyloNetworks.
Tanglegrams for rooted phylogenetic trees and networks
Scornavacca, Celine; Zickmann, Franziska; Huson, Daniel H.
2011-01-01
Motivation: In systematic biology, one is often faced with the task of comparing different phylogenetic trees, in particular in multi-gene analysis or cospeciation studies. One approach is to use a tanglegram in which two rooted phylogenetic trees are drawn opposite each other, using auxiliary lines to connect matching taxa. There is an increasing interest in using rooted phylogenetic networks to represent evolutionary history, so as to explicitly represent reticulate events, such as horizontal gene transfer, hybridization or reassortment. Thus, the question arises how to define and compute a tanglegram for such networks. Results: In this article, we present the first formal definition of a tanglegram for rooted phylogenetic networks and present a heuristic approach for computing one, called the NN-tanglegram method. We compare the performance of our method with existing tree tanglegram algorithms and also show a typical application to real biological datasets. For maximum usability, the algorithm does not require that the trees or networks are bifurcating or bicombining, or that they are on identical taxon sets. Availability: The algorithm is implemented in our program Dendroscope 3, which is freely available from www.dendroscope.org. Contact: scornava@informatik.uni-tuebingen.de; huson@informatik.uni-tuebingen.de PMID:21685078
Nonbinary Tree-Based Phylogenetic Networks.
Jetten, Laura; van Iersel, Leo
2018-01-01
Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.
Shin, Junha; Lee, Insuk
2015-01-01
Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co-inheritance analysis within the domains of life will greatly potentiate the use of the expected onslaught of sequenced genomes in the study of molecular pathways in higher eukaryotes. PMID:26394049
BIMLR: a method for constructing rooted phylogenetic networks from rooted phylogenetic trees.
Wang, Juan; Guo, Maozu; Xing, Linlin; Che, Kai; Liu, Xiaoyan; Wang, Chunyu
2013-09-15
Rooted phylogenetic trees constructed from different datasets (e.g. from different genes) are often conflicting with one another, i.e. they cannot be integrated into a single phylogenetic tree. Phylogenetic networks have become an important tool in molecular evolution, and rooted phylogenetic networks are able to represent conflicting rooted phylogenetic trees. Hence, the development of appropriate methods to compute rooted phylogenetic networks from rooted phylogenetic trees has attracted considerable research interest of late. The CASS algorithm proposed by van Iersel et al. is able to construct much simpler networks than other available methods, but it is extremely slow, and the networks it constructs are dependent on the order of the input data. Here, we introduce an improved CASS algorithm, BIMLR. We show that BIMLR is faster than CASS and less dependent on the input data order. Moreover, BIMLR is able to construct much simpler networks than almost all other methods. BIMLR is available at http://nclab.hit.edu.cn/wangjuan/BIMLR/. © 2013 Elsevier B.V. All rights reserved.
Phylogenetic diversity and biodiversity indices on phylogenetic networks.
Wicke, Kristina; Fischer, Mareike
2018-04-01
In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.
Improved Maximum Parsimony Models for Phylogenetic Networks.
Van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2018-05-01
Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.
On Tree-Based Phylogenetic Networks.
Zhang, Louxin
2016-07-01
A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.
Brownian model of transcriptome evolution and phylogenetic network visualization between tissues.
Gu, Xun; Ruan, Hang; Su, Zhixi; Zou, Yangyun
2017-09-01
While phylogenetic analysis of transcriptomes of the same tissue is usually congruent with the species tree, the controversy emerges when multiple tissues are included, that is, whether species from the same tissue are clustered together, or different tissues from the same species are clustered together. Recent studies have suggested that phylogenetic network approach may shed some lights on our understanding of multi-tissue transcriptome evolution; yet the underlying evolutionary mechanism remains unclear. In this paper we develop a Brownian-based model of transcriptome evolution under the phylogenetic network that can statistically distinguish between the patterns of species-clustering and tissue-clustering. Our model can be used as a null hypothesis (neutral transcriptome evolution) for testing any correlation in tissue evolution, can be applied to cancer transcriptome evolution to study whether two tumors of an individual appeared independently or via metastasis, and can be useful to detect convergent evolution at the transcriptional level. Copyright © 2017. Published by Elsevier Inc.
Inferring Phylogenetic Networks Using PhyloNet.
Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay
2018-07-01
PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyzes and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.
Francis, Andrew; Moulton, Vincent
2018-06-07
Phylogenetic networks are an extension of phylogenetic trees which are used to represent evolutionary histories in which reticulation events (such as recombination and hybridization) have occurred. A central question for such networks is that of identifiability, which essentially asks under what circumstances can we reliably identify the phylogenetic network that gave rise to the observed data? Recently, identifiability results have appeared for networks relative to a model of sequence evolution that generalizes the standard Markov models used for phylogenetic trees. However, these results are quite limited in terms of the complexity of the networks that are considered. In this paper, by introducing an alternative probabilistic model for evolution along a network that is based on some ground-breaking work by Thatte for pedigrees, we are able to obtain an identifiability result for a much larger class of phylogenetic networks (essentially the class of so-called tree-child networks). To prove our main theorem, we derive some new results for identifying tree-child networks combinatorially, and then adapt some techniques developed by Thatte for pedigrees to show that our combinatorial results imply identifiability in the probabilistic setting. We hope that the introduction of our new model for networks could lead to new approaches to reliably construct phylogenetic networks. Copyright © 2018 Elsevier Ltd. All rights reserved.
Folding and unfolding phylogenetic trees and networks.
Huber, Katharina T; Moulton, Vincent; Steel, Mike; Wu, Taoyang
2016-12-01
Phylogenetic networks are rooted, labelled directed acyclic graphswhich are commonly used to represent reticulate evolution. There is a close relationship between phylogenetic networks and multi-labelled trees (MUL-trees). Indeed, any phylogenetic network N can be "unfolded" to obtain a MUL-tree U(N) and, conversely, a MUL-tree T can in certain circumstances be "folded" to obtain aphylogenetic network F(T) that exhibits T. In this paper, we study properties of the operations U and F in more detail. In particular, we introduce the class of stable networks, phylogenetic networks N for which F(U(N)) is isomorphic to N, characterise such networks, and show that they are related to the well-known class of tree-sibling networks. We also explore how the concept of displaying a tree in a network N can be related to displaying the tree in the MUL-tree U(N). To do this, we develop aphylogenetic analogue of graph fibrations. This allows us to view U(N) as the analogue of the universal cover of a digraph, and to establish a close connection between displaying trees in U(N) and reconciling phylogenetic trees with networks.
On the quirks of maximum parsimony and likelihood on phylogenetic networks.
Bryant, Christopher; Fischer, Mareike; Linz, Simone; Semple, Charles
2017-03-21
Maximum parsimony is one of the most frequently-discussed tree reconstruction methods in phylogenetic estimation. However, in recent years it has become more and more apparent that phylogenetic trees are often not sufficient to describe evolution accurately. For instance, processes like hybridization or lateral gene transfer that are commonplace in many groups of organisms and result in mosaic patterns of relationships cannot be represented by a single phylogenetic tree. This is why phylogenetic networks, which can display such events, are becoming of more and more interest in phylogenetic research. It is therefore necessary to extend concepts like maximum parsimony from phylogenetic trees to networks. Several suggestions for possible extensions can be found in recent literature, for instance the softwired and the hardwired parsimony concepts. In this paper, we analyze the so-called big parsimony problem under these two concepts, i.e. we investigate maximum parsimonious networks and analyze their properties. In particular, we show that finding a softwired maximum parsimony network is possible in polynomial time. We also show that the set of maximum parsimony networks for the hardwired definition always contains at least one phylogenetic tree. Lastly, we investigate some parallels of parsimony to different likelihood concepts on phylogenetic networks. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sato, Mitsuharu; Miyazaki, Kentaro
2017-01-01
Horizontal gene transfer (HGT) is a ubiquitous genetic event in bacterial evolution, but it seldom occurs for genes involved in highly complex supramolecules (or biosystems), which consist of many gene products. The ribosome is one such supramolecule, but several bacteria harbor dissimilar and/or chimeric 16S rRNAs in their genomes, suggesting the occurrence of HGT of this gene. However, we know little about whether the genes actually experience HGT and, if so, the frequency of such a transfer. This is primarily because the methods currently employed for phylogenetic analysis (e.g., neighbor-joining, maximum likelihood, and maximum parsimony) of 16S rRNA genes assume point mutation-driven tree-shape evolution as an evolutionary model, which is intrinsically inappropriate to decipher the evolutionary history for genes driven by recombination. To address this issue, we applied a phylogenetic network analysis, which has been used previously for detection of genetic recombination in homologous alleles, to the 16S rRNA gene. We focused on the genus Enterobacter, whose phylogenetic relationships inferred by multi-locus sequence alignment analysis and 16S rRNA sequences are incompatible. All 10 complete genomic sequences were retrieved from the NCBI database, in which 71 16S rRNA genes were included. Neighbor-joining analysis demonstrated that the genes residing in the same genomes clustered, indicating the occurrence of intragenomic recombination. However, as suggested by the low bootstrap values, evolutionary relationships between the clusters were uncertain. We then applied phylogenetic network analysis to representative sequences from each cluster. We found three ancestral 16S rRNA groups; the others were likely created through recursive recombination between the ancestors and chimeric descendants. Despite the large sequence changes caused by the recombination events, the RNA secondary structures were conserved. Successive intergenomic and intragenomic recombination thus shaped the evolution of 16S rRNA genes in the genus Enterobacter. PMID:29180992
Arnaud-Haond, Sophie; Moalic, Yann; Barnabé, Christian; Ayala, Francisco José; Tibayrenc, Michel
2014-01-01
Micropathogens (viruses, bacteria, fungi, parasitic protozoa) share a common trait, which is partial clonality, with wide variance in the respective influence of clonality and sexual recombination on the dynamics and evolution of taxa. The discrimination of distinct lineages and the reconstruction of their phylogenetic history are key information to infer their biomedical properties. However, the phylogenetic picture is often clouded by occasional events of recombination across divergent lineages, limiting the relevance of classical phylogenetic analysis and dichotomic trees. We have applied a network analysis based on graph theory to illustrate the relationships among genotypes of Trypanosoma cruzi, the parasitic protozoan responsible for Chagas disease, to identify major lineages and to unravel their past history of divergence and possible recombination events. At the scale of T. cruzi subspecific diversity, graph theory-based networks applied to 22 isoenzyme loci (262 distinct Multi-Locus-Enzyme-Electrophoresis -MLEE) and 19 microsatellite loci (66 Multi-Locus-Genotypes -MLG) fully confirms the high clustering of genotypes into major lineages or "near-clades". The release of the dichotomic constraint associated with phylogenetic reconstruction usually applied to Multilocus data allows identifying putative hybrids and their parental lineages. Reticulate topology suggests a slightly different history for some of the main "near-clades", and a possibly more complex origin for the putative hybrids than hitherto proposed. Finally the sub-network of the near-clade T. cruzi I (28 MLG) shows a clustering subdivision into three differentiated lesser near-clades ("Russian doll pattern"), which confirms the hypothesis recently proposed by other investigators. The present study broadens and clarifies the hypotheses previously obtained from classical markers on the same sets of data, which demonstrates the added value of this approach. This underlines the potential of graph theory-based network analysis for describing the nature and relationships of major pathogens, thereby opening stimulating prospects to unravel the organization, dynamics and history of major micropathogen lineages.
Wagner, Andreas
2014-07-07
Networks of evolving genotypes can be constructed from the worldwide time-resolved genotyping of pathogens like influenza viruses. Such genotype networks are graphs where neighbouring vertices (viral strains) differ in a single nucleotide or amino acid. A rich trove of network analysis methods can help understand the evolutionary dynamics reflected in the structure of these networks. Here, I analyse a genotype network comprising hundreds of influenza A (H3N2) haemagglutinin genes. The network is rife with cycles that reflect non-random parallel or convergent (homoplastic) evolution. These cycles also show patterns of sequence change characteristic for strong and local evolutionary constraints, positive selection and mutation-limited evolution. Such cycles would not be visible on a phylogenetic tree, illustrating that genotype network analysis can complement phylogenetic analyses. The network also shows a distinct modular or community structure that reflects temporal more than spatial proximity of viral strains, where lowly connected bridge strains connect different modules. These and other organizational patterns illustrate that genotype networks can help us study evolution in action at an unprecedented level of resolution. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
The Use of Weighted Graphs for Large-Scale Genome Analysis
Zhou, Fang; Toivonen, Hannu; King, Ross D.
2014-01-01
There is an acute need for better tools to extract knowledge from the growing flood of sequence data. For example, thousands of complete genomes have been sequenced, and their metabolic networks inferred. Such data should enable a better understanding of evolution. However, most existing network analysis methods are based on pair-wise comparisons, and these do not scale to thousands of genomes. Here we propose the use of weighted graphs as a data structure to enable large-scale phylogenetic analysis of networks. We have developed three types of weighted graph for enzymes: taxonomic (these summarize phylogenetic importance), isoenzymatic (these summarize enzymatic variety/redundancy), and sequence-similarity (these summarize sequence conservation); and we applied these types of weighted graph to survey prokaryotic metabolism. To demonstrate the utility of this approach we have compared and contrasted the large-scale evolution of metabolism in Archaea and Eubacteria. Our results provide evidence for limits to the contingency of evolution. PMID:24619061
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolicmore » network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.« less
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; Ng, Patrick; Khraiwesh, Basel; Jaiswal, Ashish; Jijakli, Kenan; Koussa, Joseph; Nelson, David R; Cai, Hong; Yang, Xinping; Chang, Roger L; Papin, Jason; Yu, Haiyuan; Balaji, Santhanam; Salehi-Ashtiani, Kourosh
2016-07-19
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolic network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. The defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.
Chaiboonchoe, Amphun; Ghamsari, Lila; Dohai, Bushra; ...
2016-06-14
Metabolic networks, which are mathematical representations of organismal metabolism, are reconstructed to provide computational platforms to guide metabolic engineering experiments and explore fundamental questions on metabolism. Systems level analyses, such as interrogation of phylogenetic relationships within the network, can provide further guidance on the modification of metabolic circuitries. Chlamydomonas reinhardtii, a biofuel relevant green alga that has retained key genes with plant, animal, and protist affinities, serves as an ideal model organism to investigate the interplay between gene function and phylogenetic affinities at multiple organizational levels. Here, using detailed topological and functional analyses, coupled with transcriptomics studies on a metabolicmore » network that we have reconstructed for C. reinhardtii, we show that network connectivity has a significant concordance with the co-conservation of genes; however, a distinction between topological and functional relationships is observable within the network. Dynamic and static modes of co-conservation were defined and observed in a subset of gene-pairs across the network topologically. In contrast, genes with predicted synthetic interactions, or genes involved in coupled reactions, show significant enrichment for both shorter and longer phylogenetic distances. Based on our results, we propose that the metabolic network of C. reinhardtii is assembled with an architecture to minimize phylogenetic profile distances topologically, while it includes an expansion of such distances for functionally interacting genes. This arrangement may increase the robustness of C. reinhardtii's network in dealing with varied environmental challenges that the species may face. As a result, the defined evolutionary constraints within the network, which identify important pairings of genes in metabolism, may offer guidance on synthetic biology approaches to optimize the production of desirable metabolites.« less
Phylomemetics—Evolutionary Analysis beyond the Gene
Howe, Christopher J.; Windram, Heather F.
2011-01-01
Genes are propagated by error-prone copying, and the resulting variation provides the basis for phylogenetic reconstruction of evolutionary relationships. Horizontal gene transfer may be superimposed on a tree-like evolutionary pattern, with some relationships better depicted as networks. The copying of manuscripts by scribes is very similar to the replication of genes, and phylogenetic inference programs can be used directly for reconstructing the copying history of different versions of a manuscript text. Phylogenetic methods have also been used for some time to analyse the evolution of languages and the development of physical cultural artefacts. These studies can help to answer a range of anthropological questions. We propose the adoption of the term “phylomemetics” for phylogenetic analysis of reproducing non-genetic elements. PMID:21655311
Mathur, Rinku; Adlakha, Neeru
2014-06-01
Phylogenetic trees give the information about the vertical relationships of ancestors and descendants but phylogenetic networks are used to visualize the horizontal relationships among the different organisms. In order to predict reticulate events there is a need to construct phylogenetic networks. Here, a Linear Programming (LP) model has been developed for the construction of phylogenetic network. The model is validated by using data sets of chloroplast of 16S rRNA sequences of photosynthetic organisms and Influenza A/H5N1 viruses. Results obtained are in agreement with those obtained by earlier researchers.
Tree-Based Unrooted Phylogenetic Networks.
Francis, A; Huber, K T; Moulton, V
2018-02-01
Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.
A new algorithm to construct phylogenetic networks from trees.
Wang, J
2014-03-06
Developing appropriate methods for constructing phylogenetic networks from tree sets is an important problem, and much research is currently being undertaken in this area. BIMLR is an algorithm that constructs phylogenetic networks from tree sets. The algorithm can construct a much simpler network than other available methods. Here, we introduce an improved version of the BIMLR algorithm, QuickCass. QuickCass changes the selection strategy of the labels of leaves below the reticulate nodes, i.e., the nodes with an indegree of at least 2 in BIMLR. We show that QuickCass can construct simpler phylogenetic networks than BIMLR. Furthermore, we show that QuickCass is a polynomial-time algorithm when the output network that is constructed by QuickCass is binary.
Comparing Mycobacterium tuberculosis genomes using genome topology networks.
Jiang, Jianping; Gu, Jianlei; Zhang, Liang; Zhang, Chenyi; Deng, Xiao; Dou, Tonghai; Zhao, Guoping; Zhou, Yan
2015-02-14
Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene duplication and genome rearrangement, can lead to different phenotypes among strains, and an investigation of genes affected by SVs may extend our knowledge of the relationships between SVs and phenotypes in microbes, especially in pathogenic bacteria. In this work, we introduce a 'Genome Topology Network' (GTN) method based on gene homology and gene locations to analyze genomic SVs and perform phylogenetic analysis. Furthermore, the concept of 'unfixed ortholog' has been proposed, whose members are affected by SVs in genome topology among close species. To improve the precision of 'unfixed ortholog' recognition, a strategy to detect annotation differences and complete gene annotation was applied. To assess the GTN method, a set of thirteen complete M. tuberculosis genomes was analyzed as a case study. GTNs with two different gene homology-assigning methods were built, the Clusters of Orthologous Groups (COG) method and the orthoMCL clustering method, and two phylogenetic trees were constructed accordingly, which may provide additional insights into whole genome-based phylogenetic analysis. We obtained 24 unfixable COG groups, of which most members were related to immunogenicity and drug resistance, such as PPE-repeat proteins (COG5651) and transcriptional regulator TetR gene family members (COG1309). The GTN method has been implemented in PERL and released on our website. The tool can be downloaded from http://homepage.fudan.edu.cn/zhouyan/gtn/ , and allows re-annotating the 'lost' genes among closely related genomes, analyzing genes affected by SVs, and performing phylogenetic analysis. With this tool, many immunogenic-related and drug resistance-related genes were found to be affected by SVs in M. tuberculosis genomes. We believe that the GTN method will be suitable for the exploration of genomic SVs in connection with biological features of bacterial strains, and that GTN-based phylogenetic analysis will provide additional insights into whole genome-based phylogenetic analysis.
Rearrangement moves on rooted phylogenetic networks
Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2017-01-01
Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network—that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves—named rNNI and rSPR—reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results—separating the contributions of horizontal and vertical moves—we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide a solid basis for practical phylogenetic network reconstruction. PMID:28763439
Chen, Zhaojin; Zheng, Yuan; Ding, Chuanyu; Ren, Xuemin; Yuan, Jian; Sun, Feng; Li, Yuying
2017-11-01
Two energy crops (maize and soybean) were used in the remediation of cadmium-contaminated soils. These crops were used because they are fast growing, have a large biomass and are good sources for bioenergy production. The total accumulation of cadmium in maize and soybean plants was 393.01 and 263.24μg pot -1 , respectively. The rhizosphere bacterial community composition was studied by MiSeq sequencing. Phylogenetic analysis was performed using 16S rRNA gene sequences. The rhizosphere bacteria were divided into 33 major phylogenetic groups according to phyla. The dominant phylogenetic groups included Proteobacteria, Acidobacteria, Actinobacteria, Gemmatimonadetes, and Bacteroidetes. Based on principal component analysis (PCA) and unweighted pair group with arithmetic mean (UPGMA) analysis, we found that the bacterial community was influenced by cadmium addition and bioenergy cropping. Three molecular ecological networks were constructed for the unplanted, soybean- and maize-planted bacterial communities grown in 50mgkg -1 cadmium-contaminated soils. The results indicated that bioenergy cropping increased the complexity of the bacterial community network as evidenced by a higher total number of nodes, the average geodesic distance (GD), the modularity and a shorter geodesic distance. Proteobacteria and Acidobacteria were the keystone bacteria connecting different co-expressed operational taxonomic units (OTUs). The results showed that bioenergy cropping altered the topological roles of individual OTUs and keystone populations. This is the first study to reveal the effects of bioenergy cropping on microbial interactions in the phytoremediation of cadmium-contaminated soils by network reconstruction. This method can greatly enhance our understanding of the mechanisms of plant-microbe-metal interactions in metal-polluted ecosystems. Copyright © 2017 Elsevier Inc. All rights reserved.
Vanhommerig, Joost W; Bezemer, Daniela; Molenkamp, Richard; Van Sighem, Ard I; Smit, Colette; Arends, Joop E; Lauw, Fanny N; Brinkman, Kees; Rijnders, Bart J; Newsum, Astrid M; Bruisten, Sylvia M; Prins, Maria; Van Der Meer, Jan T; Van De Laar, Thijs J; Schinkel, Janke
2017-09-24
MSM are at increased risk for infection with HIV-1 and hepatitis C virus (HCV). Is HIV/HCV coinfection confined to specific HIV transmission networks? A HIV phylogenetic tree was constructed for 5038 HIV-1 subtype B polymerase (pol) sequences obtained from MSM in the AIDS therapy evaluation in the Netherlands cohort. We investigated the existence of HIV clusters with increased HCV prevalence, the HIV phylogenetic density (i.e. the number of potential HIV transmission partners) of HIV/HCV-coinfected MSM compared with HIV-infected MSM without HCV, and the overlap in HIV and HCV phylogenies using HCV nonstructural protein 5B sequences from 183 HIV-infected MSM with acute HCV infection. Five hundred and sixty-three of 5038 (11.2%) HIV-infected MSM tested HCV positive. Phylogenetic analysis revealed 93 large HIV clusters (≥10 MSM), 370 small HIV clusters (2-9 MSM), and 867 singletons with a median HCV prevalence of 11.5, 11.6, and 9.3%, respectively. We identified six large HIV clusters with elevated HCV prevalence (range 23.5-46.2%). Median HIV phylogenetic densities for MSM with HCV (3, interquartile range 1-7) and without HCV (3, interquartile range 1-8) were similar. HCV phylogeny showed 12 MSM-specific HCV clusters (clustersize: 2-39 HCV sequences); 12.7% of HCV infections were part of the same HIV and HCV cluster. We observed few HIV clusters with elevated HCV prevalence, no increase in the HIV phylogenetic density of HIV/HCV-coinfected MSM compared to HIV-infected MSM without HCV, and limited overlap between HIV and HCV phylogenies among HIV/HCV-coinfected MSM. Our data do not support the existence of MSM-specific sexual networks that fuel both the HIV and HCV epidemic.
Network Analysis of Protein Adaptation: Modeling the Functional Impact of Multiple Mutations
Beleva Guthrie, Violeta; Masica, David L; Fraser, Andrew; Federico, Joseph; Fan, Yunfan; Camps, Manel; Karchin, Rachel
2018-01-01
Abstract The evolution of new biochemical activities frequently involves complex dependencies between mutations and rapid evolutionary radiation. Mutation co-occurrence and covariation have previously been used to identify compensating mutations that are the result of physical contacts and preserve protein function and fold. Here, we model pairwise functional dependencies and higher order interactions that enable evolution of new protein functions. We use a network model to find complex dependencies between mutations resulting from evolutionary trade-offs and pleiotropic effects. We present a method to construct these networks and to identify functionally interacting mutations in both extant and reconstructed ancestral sequences (Network Analysis of Protein Adaptation). The time ordering of mutations can be incorporated into the networks through phylogenetic reconstruction. We apply NAPA to three distantly homologous β-lactamase protein clusters (TEM, CTX-M-3, and OXA-51), each of which has experienced recent evolutionary radiation under substantially different selective pressures. By analyzing the network properties of each protein cluster, we identify key adaptive mutations, positive pairwise interactions, different adaptive solutions to the same selective pressure, and complex evolutionary trajectories likely to increase protein fitness. We also present evidence that incorporating information from phylogenetic reconstruction and ancestral sequence inference can reduce the number of spurious links in the network, whereas preserving overall network community structure. The analysis does not require structural or biochemical data. In contrast to function-preserving mutation dependencies, which are frequently from structural contacts, gain-of-function mutation dependencies are most commonly between residues distal in protein structure. PMID:29522102
USDA-ARS?s Scientific Manuscript database
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence d...
A measure of the denseness of a phylogenetic network. [by sequenced proteins from extant species
NASA Technical Reports Server (NTRS)
Holmquist, R.
1978-01-01
An objective measure of phylogenetic denseness is developed to examine various phylogenetic criteria: alpha- and beta-hemoglobin, myoglobin, cytochrome c, and the parvalbumin family. Attention is given to the number of nucleotide replacements separating homologous sequences, and to the topology of the network (in other words, to the qualitative nature of the network as defined by how closely the studied species are related). Applications include quantitative comparisons of species origin, relation, and rates of evolution.
Marcussen, Thomas; Heier, Lise; Brysting, Anne K.; Oxelman, Bengt; Jakobsen, Kjetill S.
2015-01-01
Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batch.). The genus contains ca 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the “correct” network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies (“ghost subgenome lineages”) significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage—between 67% and 88%—of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network, have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hampers network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events. PMID:25281848
Autumn Algorithm-Computation of Hybridization Networks for Realistic Phylogenetic Trees.
Huson, Daniel H; Linz, Simone
2018-01-01
A minimum hybridization network is a rooted phylogenetic network that displays two given rooted phylogenetic trees using a minimum number of reticulations. Previous mathematical work on their calculation has usually assumed the input trees to be bifurcating, correctly rooted, or that they both contain the same taxa. These assumptions do not hold in biological studies and "realistic" trees have multifurcations, are difficult to root, and rarely contain the same taxa. We present a new algorithm for computing minimum hybridization networks for a given pair of "realistic" rooted phylogenetic trees. We also describe how the algorithm might be used to improve the rooting of the input trees. We introduce the concept of "autumn trees", a nice framework for the formulation of algorithms based on the mathematics of "maximum acyclic agreement forests". While the main computational problem is hard, the run-time depends mainly on how different the given input trees are. In biological studies, where the trees are reasonably similar, our parallel implementation performs well in practice. The algorithm is available in our open source program Dendroscope 3, providing a platform for biologists to explore rooted phylogenetic networks. We demonstrate the utility of the algorithm using several previously studied data sets.
Prioritizing Populations for Conservation Using Phylogenetic Networks
Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.
2014-01-01
In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451
Bernard, E J; Azad, Y; Vandamme, A M; Weait, M; Geretti, A M
2007-09-01
Phylogenetic analysis - the study of the genetic relatedness between HIV strains - has recently been used in criminal prosecutions as evidence of responsibility for HIV transmission. In these trials, the expert opinion of virologists has been of critical importance. Phylogenetic analysis of HIV gene sequences is complex and its findings do not achieve the levels of certainty obtained with the forensic analysis of human DNA. Although two individuals may carry HIV strains that are closely related, these will not necessarily be unique to the two parties and could extend to other persons within the same transmission network. For forensic purposes, phylogenetic analysis should be conducted under strictly controlled conditions by laboratories with relevant expertise applying rigorous methods. It is vitally important to include the right controls, which should be epidemiologically and temporally relevant to the parties under investigation. Use of inappropriate controls can exaggerate any relatedness between the virus strains of the complainant and defendant as being strikingly unique. It will be often difficult to obtain the relevant controls. If convenient but less appropriate controls are used, interpretation of the findings should be tempered accordingly. Phylogenetic analysis cannot prove that HIV transmission occurred directly between two individuals. However, it can exonerate individuals by demonstrating that the defendant carries a virus strain unrelated to that of the complainant. Expert witnesses should acknowledge the limitations of the inferences that might be made and choose the correct language in both written and verbal testimony.
IcyTree: rapid browser-based visualization for phylogenetic trees and networks
2017-01-01
Abstract Summary: IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. Availability and Implementation: IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree. Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. Contact: tgvaughan@gmail.com PMID:28407035
IcyTree: rapid browser-based visualization for phylogenetic trees and networks.
Vaughan, Timothy G
2017-08-01
IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree . Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. tgvaughan@gmail.com. © The Author(s) 2017. Published by Oxford University Press.
Do Branch Lengths Help to Locate a Tree in a Phylogenetic Network?
Gambette, Philippe; van Iersel, Leo; Kelk, Steven; Pardi, Fabio; Scornavacca, Celine
2016-09-01
Phylogenetic networks are increasingly used in evolutionary biology to represent the history of species that have undergone reticulate events such as horizontal gene transfer, hybrid speciation and recombination. One of the most fundamental questions that arise in this context is whether the evolution of a gene with one copy in all species can be explained by a given network. In mathematical terms, this is often translated in the following way: is a given phylogenetic tree contained in a given phylogenetic network? Recently this tree containment problem has been widely investigated from a computational perspective, but most studies have only focused on the topology of the phylogenies, ignoring a piece of information that, in the case of phylogenetic trees, is routinely inferred by evolutionary analyses: branch lengths. These measure the amount of change (e.g., nucleotide substitutions) that has occurred along each branch of the phylogeny. Here, we study a number of versions of the tree containment problem that explicitly account for branch lengths. We show that, although length information has the potential to locate more precisely a tree within a network, the problem is computationally hard in its most general form. On a positive note, for a number of special cases of biological relevance, we provide algorithms that solve this problem efficiently. This includes the case of networks of limited complexity, for which it is possible to recover, among the trees contained by the network with the same topology as the input tree, the closest one in terms of branch lengths.
Chang, Xiao; Wang, Zhuo; Hao, Pei; Li, Yuan-Yuan; Li, Yi-Xue
2010-06-01
The endosymbiotic theory proposed that mitochondrial genomes are derived from an alpha-proteobacterium-like endosymbiont, which was concluded from sequence analysis. We rebuilt the metabolic networks of mitochondria and 22 relative species, and studied the evolution of mitochondrial metabolism at the level of enzyme content and network topology. Our phylogenetic results based on network alignment and motif identification supported the endosymbiotic theory from the point of view of systems biology for the first time. It was found that the mitochondrial metabolic network were much more compact than the relative species, probably related to the higher efficiency of oxidative phosphorylation of the specialized organelle, and the network is highly clustered around the TCA cycle. Moreover, the mitochondrial metabolic network exhibited high functional specificity to the modules. This work provided insight to the understanding of mitochondria evolution, and the organization principle of mitochondrial metabolic network at the network level. Copyright 2010 Elsevier Inc. All rights reserved.
Davies, T Jonathan; Urban, Mark C; Rayfield, Bronwyn; Cadotte, Marc W; Peres-Neto, Pedro R
2016-09-01
Recent studies have supported a link between phylogenetic diversity and various ecological properties including ecosystem function. However, such studies typically assume that phylogenetic branches of equivalent length are more or less interchangeable. Here we suggest that there is a need to consider not only branch lengths but also their placement on the phylogeny. We demonstrate how two common indices of network centrality can be used to describe the evolutionary distinctiveness of network elements (nodes and branches) on a phylogeny. If phylogenetic diversity enhances ecosystem function via complementarity and the representation of functional diversity, we would predict a correlation between evolutionary distinctiveness of network elements and their contribution to ecosystem process. In contrast, if one or a few evolutionary innovations play key roles in ecosystem function, the relationship between evolutionary distinctiveness and functional contribution may be weak or absent. We illustrate how network elements associated with high functional contribution can be identified from regressions between phylogenetic diversity and productivity using a well-known empirical data set on plant productivity from the Cedar Creek Long-Term Ecological Research. We find no association between evolutionary distinctiveness and ecosystem functioning, but we are able to identify phylogenetic elements associated with species of known high functional contribution within the Fabaceae. Our perspective provides a useful guide in the search for ecological traits linking diversity and ecosystem function, and suggests a more nuanced consideration of phylogenetic diversity is required in the conservation and biodiversity-ecosystem-function literature. © 2016 by the Ecological Society of America.
Marcussen, Thomas; Heier, Lise; Brysting, Anne K; Oxelman, Bengt; Jakobsen, Kjetill S
2015-01-01
Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batch.). The genus contains ca 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the "correct" network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies ("ghost subgenome lineages") significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage-between 67% and 88%-of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network, have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hampers network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Chakraborty, Chiranjib; Bandyopadhyay, Sanghamitra; Doss, C George Priya; Agoramoorthy, Govindasamy
2015-04-01
Maturity onset diabetes of the young (MODY) is a metabolic and genetic disorder. It is different from type 1 and type 2 diabetes with low occurrence level (1-2%) among all diabetes. This disorder is a consequence of β-cell dysfunction. Till date, 11 subtypes of MODY have been identified, and all of them can cause gene mutations. However, very little is known about the gene mapping, molecular phylogenetics, and co-expression among MODY genes and networking between cascades. This study has used latest servers and software such as VarioWatch, ClustalW, MUSCLE, G Blocks, Phylogeny.fr, iTOL, WebLogo, STRING, and KEGG PATHWAY to perform comprehensive analyses of gene mapping, multiple sequences alignment, molecular phylogenetics, protein-protein network design, co-expression analysis of MODY genes, and pathway development. The MODY genes are located in chromosomes-2, 7, 8, 9, 11, 12, 13, 17, and 20. Highly aligned block shows Pro, Gly, Leu, Arg, and Pro residues are highly aligned in the positions of 296, 386, 437, 455, 456 and 598, respectively. Alignment scores inform us that HNF1A and HNF1B proteins have shown high sequence similarity among MODY proteins. Protein-protein network design shows that HNF1A, HNF1B, HNF4A, NEUROD1, PDX1, PAX4, INS, and GCK are strongly connected, and the co-expression analyses between MODY genes also show distinct association between HNF1A and HNF4A genes. This study has used latest tools of bioinformatics to develop a rapid method to assess the evolutionary relationship, the network development, and the associations among eleven MODY genes and cascades. The prediction of sequence conservation, molecular phylogenetics, protein-protein network and the association between the MODY cascades enhances opportunities to get more insights into the less-known MODY disease.
On Determining if Tree-based Networks Contain Fixed Trees.
Anaya, Maria; Anipchenko-Ulaj, Olga; Ashfaq, Aisha; Chiu, Joyce; Kaiser, Mahedi; Ohsawa, Max Shoji; Owen, Megan; Pavlechko, Ella; St John, Katherine; Suleria, Shivam; Thompson, Keith; Yap, Corrine
2016-05-01
We address an open question of Francis and Steel about phylogenetic networks and trees. They give a polynomial time algorithm to decide if a phylogenetic network, N, is tree-based and pose the problem: given a fixed tree T and network N, is N based on T? We show that it is [Formula: see text]-hard to decide, by reduction from 3-Dimensional Matching (3DM) and further that the problem is fixed-parameter tractable.
Shiino, Teiichiro; Hattori, Junko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru
2014-01-01
Background One major circulating HIV-1 subtype in Southeast Asian countries is CRF01_AE, but little is known about its epidemiology in Japan. We conducted a molecular phylodynamic study of patients newly diagnosed with CRF01_AE from 2003 to 2010. Methods Plasma samples from patients registered in Japanese Drug Resistance HIV-1 Surveillance Network were analyzed for protease-reverse transcriptase sequences; all sequences undergo subtyping and phylogenetic analysis using distance-matrix-based, maximum likelihood and Bayesian coalescent Markov Chain Monte Carlo (MCMC) phylogenetic inferences. Transmission clusters were identified using interior branch test and depth-first searches for sub-tree partitions. Times of most recent common ancestor (tMRCAs) of significant clusters were estimated using Bayesian MCMC analysis. Results Among 3618 patient registered in our network, 243 were infected with CRF01_AE. The majority of individuals with CRF01_AE were Japanese, predominantly male, and reported heterosexual contact as their risk factor. We found 5 large clusters with ≥5 members and 25 small clusters consisting of pairs of individuals with highly related CRF01_AE strains. The earliest cluster showed a tMRCA of 1996, and consisted of individuals with their known risk as heterosexual contacts. The other four large clusters showed later tMRCAs between 2000 and 2002 with members including intravenous drug users (IVDU) and non-Japanese, but not men who have sex with men (MSM). In contrast, small clusters included a high frequency of individuals reporting MSM risk factors. Phylogenetic analysis also showed that some individuals infected with HIV strains spread in East and South-eastern Asian countries. Conclusions Introduction of CRF01_AE viruses into Japan is estimated to have occurred in the 1990s. CFR01_AE spread via heterosexual behavior, then among persons connected with non-Japanese, IVDU, and MSM. Phylogenetic analysis demonstrated that some viral variants are largely restricted to Japan, while others have a broad geographic distribution. PMID:25025900
NASA Astrophysics Data System (ADS)
Amiroch, S.; Pradana, M. S.; Irawan, M. I.; Mukhlash, I.
2017-09-01
Multiple Alignment (MA) is a particularly important tool for studying the viral genome and determine the evolutionary process of the specific virus. Application of MA in the case of the spread of the Severe acute respiratory syndrome (SARS) epidemic is an interesting thing because this virus epidemic a few years ago spread so quickly that medical attention in many countries. Although there has been a lot of software to process multiple sequences, but the use of pairwise alignment to process MA is very important to consider. In previous research, the alignment between the sequences to process MA algorithm, Super Pairwise Alignment, but in this study used a dynamic programming algorithm Needleman wunchs simulated in Matlab. From the analysis of MA obtained and stable region and unstable which indicates the position where the mutation occurs, the system network topology that produced the phylogenetic tree of the SARS epidemic distance method, and system area networks mutation.
Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A
2006-03-01
Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.
Statistical parsimony networks and species assemblages in Cephalotrichid nemerteans (nemertea).
Chen, Haixia; Strand, Malin; Norenburg, Jon L; Sun, Shichun; Kajihara, Hiroshi; Chernyshev, Alexey V; Maslakova, Svetlana A; Sundberg, Per
2010-09-21
It has been suggested that statistical parsimony network analysis could be used to get an indication of species represented in a set of nucleotide data, and the approach has been used to discuss species boundaries in some taxa. Based on 635 base pairs of the mitochondrial protein-coding gene cytochrome c oxidase I (COI), we analyzed 152 nemertean specimens using statistical parsimony network analysis with the connection probability set to 95%. The analysis revealed 15 distinct networks together with seven singletons. Statistical parsimony yielded three networks supporting the species status of Cephalothrix rufifrons, C. major and C. spiralis as they currently have been delineated by morphological characters and geographical location. Many other networks contained haplotypes from nearby geographical locations. Cladistic structure by maximum likelihood analysis overall supported the network analysis, but indicated a false positive result where subnetworks should have been connected into one network/species. This probably is caused by undersampling of the intraspecific haplotype diversity. Statistical parsimony network analysis provides a rapid and useful tool for detecting possible undescribed/cryptic species among cephalotrichid nemerteans based on COI gene. It should be combined with phylogenetic analysis to get indications of false positive results, i.e., subnetworks that would have been connected with more extensive haplotype sampling.
Miller, Joseph T; Hui, Cang; Thornhill, Andrew; Gallien, Laure; Le Roux, Johannes J; Richardson, David M
2016-12-30
For a plant species to become invasive it has to progress along the introduction-naturalization-invasion (INI) continuum which reflects the joint direction of niche breadth. Identification of traits that correlate with and drive species invasiveness along the continuum is a major focus of invasion biology. If invasiveness is underlain by heritable traits, and if such traits are phylogenetically conserved, then we would expect non-native species with different introduction status (i.e. position along the INI continuum) to show phylogenetic signal. This study uses two clades that contain a large number of invasive tree species from the genera Acacia and Eucalyptus to test whether geographic distribution and a novel phylogenetic conservation method can predict which species have been introduced, became naturalized, and invasive. Our results suggest that no underlying phylogenetic signal underlie the introduction status for both groups of trees, except for introduced acacias. The more invasive acacia clade contains invasive species that have smoother geographic distributions and are more marginal in the phylogenetic network. The less invasive eucalyptus group contains invasive species that are more clustered geographically, more centrally located in the phylogenetic network and have phylogenetic distances between invasive and non-invasive species that are trending toward the mean pairwise distance. This suggests that highly invasive groups may be identified because they have invasive species with smoother and faster expanding native distributions and are located more to the edges of phylogenetic networks than less invasive groups. Published by Oxford University Press on behalf of the Annals of Botany Company.
Thuillard, Marc; Fraix-Burnet, Didier
2015-01-01
This article presents an innovative approach to phylogenies based on the reduction of multistate characters to binary-state characters. We show that the reduction to binary characters' approach can be applied to both character- and distance-based phylogenies and provides a unifying framework to explain simply and intuitively the similarities and differences between distance- and character-based phylogenies. Building on these results, this article gives a possible explanation on why phylogenetic trees obtained from a distance matrix or a set of characters are often quite reasonable despite lateral transfers of genetic material between taxa. In the presence of lateral transfers, outer planar networks furnish a better description of evolution than phylogenetic trees. We present a polynomial-time reconstruction algorithm for perfect outer planar networks with a fixed number of states, characters, and lateral transfers.
Phylogenetically informed logic relationships improve detection of biological network organization
2011-01-01
Background A "phylogenetic profile" refers to the presence or absence of a gene across a set of organisms, and it has been proven valuable for understanding gene functional relationships and network organization. Despite this success, few studies have attempted to search beyond just pairwise relationships among genes. Here we search for logic relationships involving three genes, and explore its potential application in gene network analyses. Results Taking advantage of a phylogenetic matrix constructed from the large orthologs database Roundup, we invented a method to create balanced profiles for individual triplets of genes that guarantee equal weight on the different phylogenetic scenarios of coevolution between genes. When we applied this idea to LAPP, the method to search for logic triplets of genes, the balanced profiles resulted in significant performance improvement and the discovery of hundreds of thousands more putative triplets than unadjusted profiles. We found that logic triplets detected biological network organization and identified key proteins and their functions, ranging from neighbouring proteins in local pathways, to well separated proteins in the whole pathway, and to the interactions among different pathways at the system level. Finally, our case study suggested that the directionality in a logic relationship and the profile of a triplet could disclose the connectivity between the triplet and surrounding networks. Conclusion Balanced profiles are superior to the raw profiles employed by traditional methods of phylogenetic profiling in searching for high order gene sets. Gene triplets can provide valuable information in detection of biological network organization and identification of key genes at different levels of cellular interaction. PMID:22172058
Zheng, Xiaoyan; Cai, Danying; Potter, Daniel; Postman, Joseph; Liu, Jing; Teng, Yuanwen
2014-11-01
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence datasets. Phylogenetic trees based on both cpDNA and nuclear LFY2int2-N (LN) data resulted in poor resolution, especially, only five primary species were monophyletic in the LN tree. A phylogenetic network of LN suggested that reticulation caused by hybridization is one of the major evolutionary processes for Pyrus species. Polytomies of the gene trees and star-like structure of cpDNA networks suggested rapid radiation is another major evolutionary process, especially for the occidental species. Pyrus calleryana and P. regelii were the earliest diverged Pyrus species. Two North African species, P. cordata, P. spinosa and P. betulaefolia were descendent of primitive stock Pyrus species and still share some common molecular characters. Southwestern China, where a large number of P. pashia populations are found, is probably the most important diversification center of Pyrus. More accessions and nuclear genes are needed for further understanding the evolutionary histories of Pyrus. Copyright © 2014 Elsevier Inc. All rights reserved.
Guo, Guo-Ye; Chen, Fang; Shi, Xiao-Dong; Tian, Yin-Shuai; Yu, Mao-Qun; Han, Xue-Qin; Yuan, Li-Chun; Zhang, Ying
2016-01-01
Genetic variation and phylogenetic relationships among 102 Jatropha curcas accessions from Asia, Africa, and the Americas were assessed using the internal transcribed spacer region of nuclear ribosomal DNA (nrDNA ITS). The average G+C content (65.04%) was considerably higher than the A+T (34.96%) content. The estimated genetic diversity revealed moderate genetic variation. The pairwise genetic divergences (GD) between haplotypes were evaluated and ranged from 0.000 to 0.017, suggesting a higher level of genetic differentiation in Mexican accessions than those of other regions. Phylogenetic relationships and intraspecific divergence were inferred by Bayesian inference (BI), maximum parsimony (MP), and median joining (MJ) network analysis and were generally resolved. The J. curcas accessions were consistently divided into three lineages, groups A, B, and C, which demonstrated distant geographical isolation and genetic divergence between American accessions and those from other regions. The MJ network analysis confirmed that Central America was the possible center of origin. The putative migration route suggested that J. curcas was distributed from Mexico or Brazil, via Cape Verde and then split into two routes. One route was dispersed to Spain, then migrated to China, eventually spreading to southeastern Asia, while the other route was dispersed to Africa, via Madagascar and migrated to China, later spreading to southeastern Asia. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Computing all hybridization networks for multiple binary phylogenetic input trees.
Albrecht, Benjamin
2015-07-30
The computation of phylogenetic trees on the same set of species that are based on different orthologous genes can lead to incongruent trees. One possible explanation for this behavior are interspecific hybridization events recombining genes of different species. An important approach to analyze such events is the computation of hybridization networks. This work presents the first algorithm computing the hybridization number as well as a set of representative hybridization networks for multiple binary phylogenetic input trees on the same set of taxa. To improve its practical runtime, we show how this algorithm can be parallelized. Moreover, we demonstrate the efficiency of the software Hybroscale, containing an implementation of our algorithm, by comparing it to PIRNv2.0, which is so far the best available software computing the exact hybridization number for multiple binary phylogenetic trees on the same set of taxa. The algorithm is part of the software Hybroscale, which was developed specifically for the investigation of hybridization networks including their computation and visualization. Hybroscale is freely available(1) and runs on all three major operating systems. Our simulation study indicates that our approach is on average 100 times faster than PIRNv2.0. Moreover, we show how Hybroscale improves the interpretation of the reported hybridization networks by adding certain features to its graphical representation.
Fire modifies the phylogenetic structure of soil bacterial co-occurrence networks.
Pérez-Valera, Eduardo; Goberna, Marta; Faust, Karoline; Raes, Jeroen; García, Carlos; Verdú, Miguel
2017-01-01
Fire alters ecosystems by changing the composition and community structure of soil microbes. The phylogenetic structure of a community provides clues about its main assembling mechanisms. While environmental filtering tends to reduce the community phylogenetic diversity by selecting for functionally (and hence phylogenetically) similar species, processes like competitive exclusion by limiting similarity tend to increase it by preventing the coexistence of functionally (and phylogenetically) similar species. We used co-occurrence networks to detect co-presence (bacteria that co-occur) or exclusion (bacteria that do not co-occur) links indicative of the ecological interactions structuring the community. We propose that inspecting the phylogenetic structure of co-presence or exclusion links allows to detect the main processes simultaneously assembling the community. We monitored a soil bacterial community after an experimental fire and found that fire altered its composition, richness and phylogenetic diversity. Both co-presence and exclusion links were more phylogenetically related than expected by chance. We interpret such a phylogenetic clustering in co-presence links as a result of environmental filtering, while that in exclusion links reflects competitive exclusion by limiting similarity. This suggests that environmental filtering and limiting similarity operate simultaneously to assemble soil bacterial communities, widening the traditional view that only environmental filtering structures bacterial communities. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
The origins and evolutionary history of human non-coding RNA regulatory networks.
Sherafatian, Masih; Mowla, Seyed Javad
2017-04-01
The evolutionary history and origin of the regulatory function of animal non-coding RNAs are not well understood. Lack of conservation of long non-coding RNAs and small sizes of microRNAs has been major obstacles in their phylogenetic analysis. In this study, we tried to shed more light on the evolution of ncRNA regulatory networks by changing our phylogenetic strategy to focus on the evolutionary pattern of their protein coding targets. We used available target databases of miRNAs and lncRNAs to find their protein coding targets in human. We were able to recognize evolutionary hallmarks of ncRNA targets by phylostratigraphic analysis. We found the conventional 3'-UTR and lesser known 5'-UTR targets of miRNAs to be enriched at three consecutive phylostrata. Firstly, in eukaryata phylostratum corresponding to the emergence of miRNAs, our study revealed that miRNA targets function primarily in cell cycle processes. Moreover, the same overrepresentation of the targets observed in the next two consecutive phylostrata, opisthokonta and eumetazoa, corresponded to the expansion periods of miRNAs in animals evolution. Coding sequence targets of miRNAs showed a delayed rise at opisthokonta phylostratum, compared to the 3' and 5' UTR targets of miRNAs. LncRNA regulatory network was the latest to evolve at eumetazoa.
Holden, Brian J; Pinney, John W; Lovell, Simon C; Amoutzias, Grigoris D; Robertson, David L
2007-01-01
Background Alternative representations of biochemical networks emphasise different aspects of the data and contribute to the understanding of complex biological systems. In this study we present a variety of automated methods for visualisation of a protein-protein interaction network, using the basic helix-loop-helix (bHLH) family of transcription factors as an example. Results Network representations that arrange nodes (proteins) according to either continuous or discrete information are investigated, revealing the existence of protein sub-families and the retention of interactions following gene duplication events. Methods of network visualisation in conjunction with a phylogenetic tree are presented, highlighting the evolutionary relationships between proteins, and clarifying the context of network hubs and interaction clusters. Finally, an optimisation technique is used to create a three-dimensional layout of the phylogenetic tree upon which the protein-protein interactions may be projected. Conclusion We show that by incorporating secondary genomic, functional or phylogenetic information into network visualisation, it is possible to move beyond simple layout algorithms based on network topology towards more biologically meaningful representations. These new visualisations can give structure to complex networks and will greatly help in interpreting their evolutionary origins and functional implications. Three open source software packages (InterView, TVi and OptiMage) implementing our methods are available. PMID:17683601
Shazib, Shahed Uddin Ahmed; Vd'ačný, Peter; Kim, Ji Hye; Jang, Seok Won; Shin, Mann Kyoon
2014-09-01
The ciliate class Heterotrichea is defined by somatic dikinetids bearing postciliodesmata, by an oral apparatus consisting of a paroral membrane and an adoral zone of membranelles, as well as by features of nuclear division involving extramacronuclear microtubules. Although phylogenetic interrelationships among heterotrichs have been analyzed several times, deeper nodes of the heterotrichean tree of life remain poorly resolved. To cast more light on the evolutionary history of heterotricheans, we performed phylogenetic analyses of multiple loci (18S rRNA gene, ITS1-5.8S rRNA-ITS2 region, and 28S rRNA gene) using traditional tree-building phylogenetic methods and statistical tree topology tests as well as phylogenetic networks, split spectrum analysis and quartet likelihood mapping. This multifaceted approach has shown that (1) Peritromus is very likely an adelphotaxon of all other heterotrichs; (2) Spirostomum and Anigsteinia are sister taxa and their common monophyletic origin is strongly supported by a uniquely posteriorly-thickened paroral membrane; (3) the monotypic family Chattonidiidae should be suppressed because its type genus clusters within the family Condylostomatidae; and (4) new families are needed for Gruberia and Fabrea because their affiliation with Spirostomidae and Climacostomidae, respectively, is not supported by molecular phylogenies nor the fine structure of the paroral membrane. Copyright © 2014 Elsevier Inc. All rights reserved.
Toward a phylogenetic chronology of ancient Gaulish, Celtic, and Indo-European.
Forster, Peter; Toth, Alfred
2003-07-22
Indo-European is the largest and best-documented language family in the world, yet the reconstruction of the Indo-European tree, first proposed in 1863, has remained controversial. Complications may include ascertainment bias when choosing the linguistic data, and disregard for the wave model of 1872 when attempting to reconstruct the tree. Essentially analogous problems were solved in evolutionary genetics by DNA sequencing and phylogenetic network methods, respectively. We now adapt these tools to linguistics, and analyze Indo-European language data, focusing on Celtic and in particular on the ancient Celtic language of Gaul (modern France), by using bilingual Gaulish-Latin inscriptions. Our phylogenetic network reveals an early split of Celtic within Indo-European. Interestingly, the next branching event separates Gaulish (Continental Celtic) from the British (Insular Celtic) languages, with Insular Celtic subsequently splitting into Brythonic (Welsh, Breton) and Goidelic (Irish and Scottish Gaelic). Taken together, the network thus suggests that the Celtic language arrived in the British Isles as a single wave (and then differentiated locally), rather than in the traditional two-wave scenario ("P-Celtic" to Britain and "Q-Celtic" to Ireland). The phylogenetic network furthermore permits the estimation of time in analogy to genetics, and we obtain tentative dates for Indo-European at 8100 BC +/- 1,900 years, and for the arrival of Celtic in Britain at 3200 BC +/- 1,500 years. The phylogenetic method is easily executed by hand and promises to be an informative approach for many problems in historical linguistics.
A decomposition theory for phylogenetic networks and incompatible characters.
Gusfield, Dan; Bansal, Vikas; Bafna, Vineet; Song, Yun S
2007-12-01
Phylogenetic networks are models of evolution that go beyond trees, incorporating non-tree-like biological events such as recombination (or more generally reticulation), which occur either in a single species (meiotic recombination) or between species (reticulation due to lateral gene transfer and hybrid speciation). The central algorithmic problems are to reconstruct a plausible history of mutations and non-tree-like events, or to determine the minimum number of such events needed to derive a given set of binary sequences, allowing one mutation per site. Meiotic recombination, reticulation and recurrent mutation can cause conflict or incompatibility between pairs of sites (or characters) of the input. Previously, we used "conflict graphs" and "incompatibility graphs" to compute lower bounds on the minimum number of recombination nodes needed, and to efficiently solve constrained cases of the minimization problem. Those results exposed the structural and algorithmic importance of the non-trivial connected components of those two graphs. In this paper, we more fully develop the structural importance of non-trivial connected components of the incompatibility and conflict graphs, proving a general decomposition theorem (Gusfield and Bansal, 2005) for phylogenetic networks. The decomposition theorem depends only on the incompatibilities in the input sequences, and hence applies to many types of phylogenetic networks, and to any biological phenomena that causes pairwise incompatibilities. More generally, the proof of the decomposition theorem exposes a maximal embedded tree structure that exists in the network when the sequences cannot be derived on a perfect phylogenetic tree. This extends the theory of perfect phylogeny in a natural and important way. The proof is constructive and leads to a polynomial-time algorithm to find the unique underlying maximal tree structure. We next examine and fully solve the major open question from Gusfield and Bansal (2005): Is it true that for every input there must be a fully decomposed phylogenetic network that minimizes the number of recombination nodes used, over all phylogenetic networks for the input. We previously conjectured that the answer is yes. In this paper, we show that the answer in is no, both for the case that only single-crossover recombination is allowed, and also for the case that unbounded multiple-crossover recombination is allowed. The latter case also resolves a conjecture recently stated in (Huson and Klopper, 2007) in the context of reticulation networks. Although the conjecture from Gusfield and Bansal (2005) is disproved in general, we show that the answer to the conjecture is yes in several natural special cases, and establish necessary combinatorial structure that counterexamples to the conjecture must possess. We also show that counterexamples to the conjecture are rare (for the case of single-crossover recombination) in simulated data.
Network dynamics of eukaryotic LTR retroelements beyond phylogenetic trees
Llorens, Carlos; Muñoz-Pomer, Alfonso; Bernad, Lucia; Botella, Hector; Moya, Andrés
2009-01-01
Background Sequencing projects have allowed diverse retroviruses and LTR retrotransposons from different eukaryotic organisms to be characterized. It is known that retroviruses and other retro-transcribing viruses evolve from LTR retrotransposons and that this whole system clusters into five families: Ty3/Gypsy, Retroviridae, Ty1/Copia, Bel/Pao and Caulimoviridae. Phylogenetic analyses usually show that these split into multiple distinct lineages but what is yet to be understood is how deep evolution occurred in this system. Results We combined phylogenetic and graph analyses to investigate the history of LTR retroelements both as a tree and as a network. We used 268 non-redundant LTR retroelements, many of them introduced for the first time in this work, to elucidate all possible LTR retroelement phylogenetic patterns. These were superimposed over the tree of eukaryotes to investigate the dynamics of the system, at distinct evolutionary times. Next, we investigated phenotypic features such as duplication and variability of amino acid motifs, and several differences in genomic ORF organization. Using this information we characterized eight reticulate evolution markers to construct phenotypic network models. Conclusion The evolutionary history of LTR retroelements can be traced as a time-evolving network that depends on phylogenetic patterns, epigenetic host-factors and phenotypic plasticity. The Ty1/Copia and the Ty3/Gypsy families represent the oldest patterns in this network that we found mimics eukaryotic macroevolution. The emergence of the Bel/Pao, Retroviridae and Caulimoviridae families in this network can be related with distinct inflations of the Ty3/Gypsy family, at distinct evolutionary times. This suggests that Ty3/Gypsy ancestors diversified much more than their Ty1/Copia counterparts, at distinct geological eras. Consistent with the principle of preferential attachment, the connectivities among phenotypic markers, taken as network-represented combinations, are power-law distributed. This evidences an inflationary mode of evolution where the system diversity; 1) expands continuously alternating vertical and gradual processes of phylogenetic divergence with episodes of modular, saltatory and reticulate evolution; 2) is governed by the intrinsic capability of distinct LTR retroelement host-communities to self-organize their phenotypes according to emergent laws characteristic of complex systems. Reviewers This article was reviewed by Eugene V. Koonin, Eric Bapteste, and Enmanuelle Lerat (nominated by King Jordan) PMID:19883502
Phylogenomic Reconstruction of the Oomycete Phylogeny Derived from 37 Genomes
McCarthy, Charley G. P.
2017-01-01
ABSTRACT The oomycetes are a class of microscopic, filamentous eukaryotes within the Stramenopiles-Alveolata-Rhizaria (SAR) supergroup which includes ecologically significant animal and plant pathogens, most infamously the causative agent of potato blight Phytophthora infestans. Single-gene and concatenated phylogenetic studies both of individual oomycete genera and of members of the larger class have resulted in conflicting conclusions concerning species phylogenies within the oomycetes, particularly for the large Phytophthora genus. Genome-scale phylogenetic studies have successfully resolved many eukaryotic relationships by using supertree methods, which combine large numbers of potentially disparate trees to determine evolutionary relationships that cannot be inferred from individual phylogenies alone. With a sufficient amount of genomic data now available, we have undertaken the first whole-genome phylogenetic analysis of the oomycetes using data from 37 oomycete species and 6 SAR species. In our analysis, we used established supertree methods to generate phylogenies from 8,355 homologous oomycete and SAR gene families and have complemented those analyses with both phylogenomic network and concatenated supermatrix analyses. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and individual clades within the problematic Phytophthora genus. Support for the resolution of the inferred relationships between individual Phytophthora clades varies depending on the methodology used. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. IMPORTANCE The oomycetes are a class of eukaryotes and include ecologically significant animal and plant pathogens. Single-gene and multigene phylogenetic studies of individual oomycete genera and of members of the larger classes have resulted in conflicting conclusions concerning interspecies relationships among these species, particularly for the Phytophthora genus. The onset of next-generation sequencing techniques now means that a wealth of oomycete genomic data is available. For the first time, we have used genome-scale phylogenetic methods to resolve oomycete phylogenetic relationships. We used supertree methods to generate single-gene and multigene species phylogenies. Overall, our supertree analyses utilized phylogenetic data from 8,355 oomycete gene families. We have also complemented our analyses with superalignment phylogenies derived from 131 single-copy ubiquitous gene families. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and clades. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. PMID:28435885
Enhanced use of phylogenetic data to inform public health approaches to HIV among MSM
German, Danielle; Grabowski, Mary Kate; Beyrer, Chris
2017-01-01
The multi-dimensional nature and continued evolution of HIV epidemics among men who have sex with men (MSM) requires innovative intervention approaches. Strategies are needed that recognize the individual, social, and structural factors driving HIV transmission; that can pinpoint networks with heightened transmission risk; and that can help target intervention in real-time. HIV phylogenetics is a rapidly evolving field with strong promise for informing innovative responses to the HIV epidemic among MSM. Currently, HIV phylogenetic insights are providing new understandings of characteristics of HIV epidemics involving MSM, social networks influencing transmission, characteristics of HIV transmission clusters involving MSM, targets for antiretroviral and other prevention strategies, and dynamics of emergent epidemics. Maximizing the potential of HIV phylogenetics for HIV responses among MSM will require attention to key methodological challenges and ethical considerations, as well as resolving key implementation and scientific questions. Enhanced and integrated use of HIV surveillance, socio-behavioral, and phylogenetic data resources are becoming increasingly critical for informing public health approaches to HIV among MSM. PMID:27584826
Tree-average distances on certain phylogenetic networks have their weights uniquely determined.
Willson, Stephen J
2012-01-01
A phylogenetic network N has vertices corresponding to species and arcs corresponding to direct genetic inheritance from the species at the tail to the species at the head. Measurements of DNA are often made on species in the leaf set, and one seeks to infer properties of the network, possibly including the graph itself. In the case of phylogenetic trees, distances between extant species are frequently used to infer the phylogenetic trees by methods such as neighbor-joining. This paper proposes a tree-average distance for networks more general than trees. The notion requires a weight on each arc measuring the genetic change along the arc. For each displayed tree the distance between two leaves is the sum of the weights along the path joining them. At a hybrid vertex, each character is inherited from one of its parents. We will assume that for each hybrid there is a probability that the inheritance of a character is from a specified parent. Assume that the inheritance events at different hybrids are independent. Then for each displayed tree there will be a probability that the inheritance of a given character follows the tree; this probability may be interpreted as the probability of the tree. The tree-average distance between the leaves is defined to be the expected value of their distance in the displayed trees. For a class of rooted networks that includes rooted trees, it is shown that the weights and the probabilities at each hybrid vertex can be calculated given the network and the tree-average distances between the leaves. Hence these weights and probabilities are uniquely determined. The hypotheses on the networks include that hybrid vertices have indegree exactly 2 and that vertices that are not leaves have a tree-child.
Demographic but not geographic insularity in HIV transmission among young black MSM.
Oster, Alexandra M; Pieniazek, Danuta; Zhang, Xinjian; Switzer, William M; Ziebell, Rebecca A; Mena, Leandro A; Wei, Xierong; Johnson, Kendra L; Singh, Sonita K; Thomas, Peter E; Elmore, Kimberlee A; Heffelfinger, James D
2011-11-13
To understand patterns of HIV transmission among young black MSM and others in Mississippi. Phylogenetic analysis of HIV-1 polymerase (pol) sequences from 799 antiretroviral-naive persons newly diagnosed with HIV infection in Mississippi during 2005-2008, 130 (16%) of whom were black MSM aged 16-25 years. We identified phylogenetic clusters and used surveillance data to evaluate demographic attributes and risk factors of all persons in clusters that included black MSM aged 16-25 years. We identified 82 phylogenetic clusters, 21 (26%) of which included HIV strains from at least one young black MSM. Of the 69 persons in these clusters, 59 were black MSM and seven were black men with unknown transmission category; the remaining three were MSM of white or Hispanic race/ethnicity. Of these 21 clusters, 10 included residents of one geographic region of Mississippi, whereas 11 included residents of multiple regions or outside of the state. Phylogenetic clusters involving HIV-infected young black MSM were homogeneous with respect to demographic and risk characteristics, suggesting insularity of this population with respect to HIV transmission, but were geographically heterogeneous. Reducing HIV transmission among young black MSM in Mississippi may require prevention strategies that are tailored to young black MSM and those in their sexual networks, and prevention interventions should be delivered in a manner to reach young black MSM throughout the state. Phylogenetic analysis can be a tool for local jurisdictions to understand the transmission dynamics in their areas.
Network portal: a database for storage, analysis and visualization of biological networks
Turkarslan, Serdar; Wurtmann, Elisabeth J.; Wu, Wei-Ju; Jiang, Ning; Bare, J. Christopher; Foley, Karen; Reiss, David J.; Novichkov, Pavel; Baliga, Nitin S.
2014-01-01
The ease of generating high-throughput data has enabled investigations into organismal complexity at the systems level through the inference of networks of interactions among the various cellular components (genes, RNAs, proteins and metabolites). The wider scientific community, however, currently has limited access to tools for network inference, visualization and analysis because these tasks often require advanced computational knowledge and expensive computing resources. We have designed the network portal (http://networks.systemsbiology.net) to serve as a modular database for the integration of user uploaded and public data, with inference algorithms and tools for the storage, visualization and analysis of biological networks. The portal is fully integrated into the Gaggle framework to seamlessly exchange data with desktop and web applications and to allow the user to create, save and modify workspaces, and it includes social networking capabilities for collaborative projects. While the current release of the database contains networks for 13 prokaryotic organisms from diverse phylogenetic clades (4678 co-regulated gene modules, 3466 regulators and 9291 cis-regulatory motifs), it will be rapidly populated with prokaryotic and eukaryotic organisms as relevant data become available in public repositories and through user input. The modular architecture, simple data formats and open API support community development of the portal. PMID:24271392
Villandre, Luc; Günthard, Huldrych F.; Kouyos, Roger; Stadler, Tanja
2016-01-01
Background Transmission patterns of sexually-transmitted infections (STIs) could relate to the structure of the underlying sexual contact network, whose features are therefore of interest to clinicians. Conventionally, we represent sexual contacts in a population with a graph, that can reveal the existence of communities. Phylogenetic methods help infer the history of an epidemic and incidentally, may help detecting communities. In particular, phylogenetic analyses of HIV-1 epidemics among men who have sex with men (MSM) have revealed the existence of large transmission clusters, possibly resulting from within-community transmissions. Past studies have explored the association between contact networks and phylogenies, including transmission clusters, producing conflicting conclusions about whether network features significantly affect observed transmission history. As far as we know however, none of them thoroughly investigated the role of communities, defined with respect to the network graph, in the observation of clusters. Methods The present study investigates, through simulations, community detection from phylogenies. We simulate a large number of epidemics over both unweighted and weighted, undirected random interconnected-islands networks, with islands corresponding to communities. We use weighting to modulate distance between islands. We translate each epidemic into a phylogeny, that lets us partition our samples of infected subjects into transmission clusters, based on several common definitions from the literature. We measure similarity between subjects’ island membership indices and transmission cluster membership indices with the adjusted Rand index. Results and Conclusion Analyses reveal modest mean correspondence between communities in graphs and phylogenetic transmission clusters. We conclude that common methods often have limited success in detecting contact network communities from phylogenies. The rarely-fulfilled requirement that network communities correspond to clades in the phylogeny is their main drawback. Understanding the link between transmission clusters and communities in sexual contact networks could help inform policymaking to curb HIV incidence in MSMs. PMID:26863322
Evolution of the Max and Mlx networks in animals.
McFerrin, Lisa G; Atchley, William R
2011-01-01
Transcription factors (TFs) are essential for the regulation of gene expression and often form emergent complexes to perform vital roles in cellular processes. In this paper, we focus on the parallel Max and Mlx networks of TFs because of their critical involvement in cell cycle regulation, proliferation, growth, metabolism, and apoptosis. A basic-helix-loop-helix-zipper (bHLHZ) domain mediates the competitive protein dimerization and DNA binding among Max and Mlx network members to form a complex system of cell regulation. To understand the importance of these network interactions, we identified the bHLHZ domain of Max and Mlx network proteins across the animal kingdom and carried out several multivariate statistical analyses. The presence and conservation of Max and Mlx network proteins in animal lineages stemming from the divergence of Metazoa indicate that these networks have ancient and essential functions. Phylogenetic analysis of the bHLHZ domain identified clear relationships among protein families with distinct points of radiation and divergence. Multivariate discriminant analysis further isolated specific amino acid changes within the bHLHZ domain that classify proteins, families, and network configurations. These analyses on Max and Mlx network members provide a model for characterizing the evolution of TFs involved in essential networks.
Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod
2016-02-01
Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of 5.8S gene revealed a much conserved region in all the accessions of Naga King Chili. However, strong phylogenetic information of this species is the distinct 13 bp deletion in the 5.8S gene which discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied a neutral variation, and population seems to be evolving at drift-mutation equilibrium and free from directed selection pressure. Furthermore, mismatch analysis showed multimodal curve indicating a demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicum chinense and Capsicum frutescens). The absence of star-like network of haplotypes suggested an ancient population expansion of this chili.
Phylogenetic Network for European mtDNA
Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari
2001-01-01
The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229
Parallel implementation of D-Phylo algorithm for maximum likelihood clusters.
Malik, Shamita; Sharma, Dolly; Khatri, Sunil Kumar
2017-03-01
This study explains a newly developed parallel algorithm for phylogenetic analysis of DNA sequences. The newly designed D-Phylo is a more advanced algorithm for phylogenetic analysis using maximum likelihood approach. The D-Phylo while misusing the seeking capacity of k -means keeps away from its real constraint of getting stuck at privately conserved motifs. The authors have tested the behaviour of D-Phylo on Amazon Linux Amazon Machine Image(Hardware Virtual Machine)i2.4xlarge, six central processing unit, 122 GiB memory, 8 × 800 Solid-state drive Elastic Block Store volume, high network performance up to 15 processors for several real-life datasets. Distributing the clusters evenly on all the processors provides us the capacity to accomplish a near direct speed if there should arise an occurrence of huge number of processors.
Untangling above- and belowground mycorrhizal fungal networks in tropical orchids.
Leake, J R; Cameron, D D
2012-10-01
Orchids typically depend on fungi for establishment from seeds, forming mycorrhizal associations with basidiomycete fungal partners in the polyphyletic group rhizoctonia from early stages of germination, sometimes with very high specificity. This has raised important questions about the roles of plant and fungal phylogenetics, and their habitat preferences, in controlling which fungi associate with which plants. In this issue of Molecular Ecology, Martos et al. (2012) report the largest network analysis to date for orchids and their mycorrhizal fungi, sampling a total of over 450 plants from nearly half the 150 tropical orchid species on Reunion Island, encompassing its main terrestrial and epiphytic orchid genera. The authors found a total of 95 operational taxonomic units of mycorrhizal fungi and investigated the architecture and nestedness of their bipartite networks with 73 orchid species. The most striking finding was a major ecological barrier between above- and belowground mycorrhizal fungal networks, despite both epiphytic and terrestrial orchids often associating with closely related taxa across all three major lineages of rhizoctonia fungi. The fungal partnerships of the epiphytes and terrestrial species involved a diversity of fungal taxa in a modular network architecture, with only about one in ten mycorrhizal fungi partnering orchids in both groups. In contrast, plant and fungal phylogenetics had weak or no effects on the network. This highlights the power of recently developed ecological network analyses to give new insights into controls on plant-fungal symbioses and raises exciting new hypotheses about the differences in properties and functioning of mycorrhiza in epiphytic and terrestrial orchids. © 2012 Blackwell Publishing Ltd.
Fast Construction of Near Parsimonious Hybridization Networks for Multiple Phylogenetic Trees.
Mirzaei, Sajad; Wu, Yufeng
2016-01-01
Hybridization networks represent plausible evolutionary histories of species that are affected by reticulate evolutionary processes. An established computational problem on hybridization networks is constructing the most parsimonious hybridization network such that each of the given phylogenetic trees (called gene trees) is "displayed" in the network. There have been several previous approaches, including an exact method and several heuristics, for this NP-hard problem. However, the exact method is only applicable to a limited range of data, and heuristic methods can be less accurate and also slow sometimes. In this paper, we develop a new algorithm for constructing near parsimonious networks for multiple binary gene trees. This method is more efficient for large numbers of gene trees than previous heuristics. This new method also produces more parsimonious results on many simulated datasets as well as a real biological dataset than a previous method. We also show that our method produces topologically more accurate networks for many datasets.
Fu, L-L; Liu, J; Chen, Y; Wang, F-T; Wen, X; Liu, H-Q; Wang, M-Y; Ouyang, L; Huang, J; Bao, J-K; Wei, Y-Q
2014-08-01
The aim of this study was to explore sodium taurocholate co-transporting polypeptide (NTCP) exerting its function with hepatitis B virus (HBV) and its targeted candidate compounds, in HBV therapy. Identification of NTCP as a novel HBV target for screening candidate small molecules, was used by phylogenetic analysis, network construction, molecular modelling, molecular docking and molecular dynamics (MD) simulation. In vitro virological examination, q-PCR, western blotting and cytotoxicity studies were used for validating efficacy of the candidate compound. We used the phylogenetic analysis of NTCP and constructed its protein-protein network. Also, we screened compounds from Drugbank and ZINC, among which five were validated for their authentication in HepG 2.2.15 cells. Then, we selected compound N4 (azelastine hydrochloride) as the most potent of them. This showed good inhibitory activity against HBsAg (IC50 = 7.5 μm) and HBeAg (IC50 = 3.7 μm), as well as high SI value (SI = 4.68). Further MD simulation results supported good interaction between compound N4 and NTCP. In silico analysis and experimental validation together demonstrated that compound N4 can target NTCP in HepG2.2.15 cells, which may shed light on exploring it as a potential anti-HBV drug. © 2014 John Wiley & Sons Ltd.
Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick
2013-01-01
Background and Aims Despite differences in morphology, the genera representing ‘true citrus fruit trees’ are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial ‘species’ of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between ‘true citrus fruit trees’ were clarified. Methods Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. Key Results A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Conclusions Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis. PMID:23104641
Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick
2013-01-01
Despite differences in morphology, the genera representing 'true citrus fruit trees' are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial 'species' of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between 'true citrus fruit trees' were clarified. Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis.
Aguilar-Rodea, Pamela; Zúñiga, Gerardo; Rodríguez-Espino, Benjamín Antonio; Olivares Cervantes, Alma Lidia; Gamiño Arroyo, Ana Estela; Moreno-Espinosa, Sarbelio; de la Rosa Zamboni, Daniela; López Martínez, Briceida; Castellanos-Cruz, María del Carmen; Parra-Ortega, Israel; Jiménez Rojas, Verónica Leticia; Vigueras Galindo, Juan Carlos; Velázquez-Guadarrama, Norma
2017-01-01
Several microorganisms produce nosocomial infections (NIs), among which Pseudomonas aeruginosa stands out as an opportunist pathogen with the capacity to develop multiresistance to first-choice antibiotics. From 2007 to 2013, forty-six NIs produced by P. aeruginosa were detected at a pediatric tertiary care hospital in Mexico with a significant mortality rate (17.39%). All isolates (n = 58/46 patients) were characterized by evaluating their response to several antibiotics as panresistant (PDR), extensively resistant (XDR), multiresistant (MDR) or sensitive (S). In addition, all isolates were typified through multilocus sequencing of seven genes: acsA, aroE, guaA, mutL, nuoD, ppsA and trpE. Furthermore, to establish the genetic relationships among these isolates, we carried out a phylogenetic inference analysis using maximum likelihood to construct a phylogenetic network. To assess evolutionary parameters, recombination was evaluated using the PHI test, and the ratio of nonsynonymous to synonymous substitutions was determined. Two of the strains were PDR (ST1725); 42 were XDR; four were MDR; and ten were S. Twenty-one new sequence types were detected. Thirty-three strains exhibited novel sequence type ST1725. The ratio of nonsynonym to synonym substitutions was 1:1 considering all genes. Phylogenetic analysis showed that the genetic relationship of the PDR, XDR and MDR strains was mainly clonal; however, the PHI test and the phylogenetic network suggest that recombination events occurred to produce a non-clonal population. This study aimed not only to determine the genetic diversity of clinical P. aeruginosa but also to provide a warning regarding the identification and spreading of clone ST1725, its ability to cause outbreaks with high mortality rates, and to remain in the hospital environment for over seven years. These characteristics highlight the need to identify clonal outbreaks, especially where high resistance to most antibiotics is observed, and control measures are needed. This study also represents the first report of the PDR ST1725. PMID:28253282
The phylogenetic structure of plant-pollinator networks increases with habitat size and isolation.
Aizen, Marcelo A; Gleiser, Gabriela; Sabatino, Malena; Gilarranz, Luis J; Bascompte, Jordi; Verdú, Miguel
2016-01-01
Similarity among species in traits related to ecological interactions is frequently associated with common ancestry. Thus, closely related species usually interact with ecologically similar partners, which can be reinforced by diverse co-evolutionary processes. The effect of habitat fragmentation on the phylogenetic signal in interspecific interactions and correspondence between plant and animal phylogenies is, however, unknown. Here, we address to what extent phylogenetic signal and co-phylogenetic congruence of plant-animal interactions depend on habitat size and isolation by analysing the phylogenetic structure of 12 pollination webs from isolated Pampean hills. Phylogenetic signal in interspecific interactions differed among webs, being stronger for flower-visiting insects than plants. Phylogenetic signal and overall co-phylogenetic congruence increased independently with hill size and isolation. We propose that habitat fragmentation would erode the phylogenetic structure of interaction webs. A decrease in phylogenetic signal and co-phylogenetic correspondence in plant-pollinator interactions could be associated with less reliable mutualism and erratic co-evolutionary change. © 2015 John Wiley & Sons Ltd/CNRS.
German, Danielle; Grabowski, Mary Kate; Beyrer, Chris
2017-02-01
The multidimensional nature and continued evolution of HIV epidemics among men who have sex with men (MSM) requires innovative intervention approaches. Strategies are needed that recognise the individual, social and structural factors driving HIV transmission; that can pinpoint networks with heightened transmission risk; and that can help target intervention in real time. HIV phylogenetics is a rapidly evolving field with strong promise for informing innovative responses to the HIV epidemic among MSM. Currently, HIV phylogenetic insights are providing new understandings of characteristics of HIV epidemics involving MSM, social networks influencing transmission, characteristics of HIV transmission clusters involving MSM, targets for antiretroviral and other prevention strategies and dynamics of emergent epidemics. Maximising the potential of HIV phylogenetics for HIV responses among MSM will require attention to key methodological challenges and ethical considerations, as well as resolving key implementation and scientific questions. Enhanced and integrated use of HIV surveillance, sociobehavioural and phylogenetic data resources are becoming increasingly critical for informing public health approaches to HIV among MSM.
Zhao, Min; Qu, Hong
2011-11-30
The phylogenetic profile is widely used to characterize functional linkage and conservation between proteins without amino acid sequence similarity. To survey the conservative regulatory properties of rate-limiting enzymes (RLEs) in metabolic inhibitory network across different species, we define the enzyme inhibiting pair as: where the first enzyme in a pair is the inhibitor provider and the second is the target of the inhibitor. Phylogenetic profiles of enzymes in the inhibiting pairs are further generated to measure the functional linkage of these enzymes during evolutionary history. We find that the RLEs generate, on average, over half of all in vivo inhibitors in each surveyed model organism. And these inhibitors inhibit on average over 85% targets in metabolic inhibitory network and cover the majority of targets of cross-pathway inhibiting relations. Furthermore, we demonstrate that the phylogenetic profiles of the enzymes in inhibiting pairs in which at least one enzyme is rate-limiting often show higher similarities than those in common inhibiting enzyme pairs. In addition, RLEs, compared to common metabolic enzymes, often tend to produce ADP instead of AMP in conservative inhibitory networks. Combined with the conservative roles of RLEs in their efficiency in sensing metabolic signals and transmitting regulatory signals to the rest of the metabolic system, the RLEs may be important molecules in balancing energy homeostasis via maintaining the ratio of ATP to ADP in living cells. Furthermore, our results indicate that similarities of phylogenetic profiles of enzymes in the inhibiting enzyme pairs are not only correlated with enzyme topological importance, but also related with roles of the enzymes in metabolic inhibitory network.
2011-01-01
Background The phylogenetic profile is widely used to characterize functional linkage and conservation between proteins without amino acid sequence similarity. To survey the conservative regulatory properties of rate-limiting enzymes (RLEs) in metabolic inhibitory network across different species, we define the enzyme inhibiting pair as: where the first enzyme in a pair is the inhibitor provider and the second is the target of the inhibitor. Phylogenetic profiles of enzymes in the inhibiting pairs are further generated to measure the functional linkage of these enzymes during evolutionary history. Results We find that the RLEs generate, on average, over half of all in vivo inhibitors in each surveyed model organism. And these inhibitors inhibit on average over 85% targets in metabolic inhibitory network and cover the majority of targets of cross-pathway inhibiting relations. Furthermore, we demonstrate that the phylogenetic profiles of the enzymes in inhibiting pairs in which at least one enzyme is rate-limiting often show higher similarities than those in common inhibiting enzyme pairs. In addition, RLEs, compared to common metabolic enzymes, often tend to produce ADP instead of AMP in conservative inhibitory networks. Conclusions Combined with the conservative roles of RLEs in their efficiency in sensing metabolic signals and transmitting regulatory signals to the rest of the metabolic system, the RLEs may be important molecules in balancing energy homeostasis via maintaining the ratio of ATP to ADP in living cells. Furthermore, our results indicate that similarities of phylogenetic profiles of enzymes in the inhibiting enzyme pairs are not only correlated with enzyme topological importance, but also related with roles of the enzymes in metabolic inhibitory network. PMID:22369203
Phylogenetic congruence between subtropical trees and their associated fungi.
Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao
2016-12-01
Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi in has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK , rbcL , atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.
Chen, Liang; Zheng, Yong; Gao, Cheng; Mi, Xiang-Cheng; Ma, Ke-Ping; Wubet, Tesfaye; Guo, Liang-Dong
2017-05-01
Elucidating symbiotic relationships between arbuscular mycorrhizal fungi (AMF) and plants contributes to a better understanding of their reciprocally dependent coexistence and community assembly. However, the main drivers of plant and AMF community assembly remain unclear. In this study, we examined AMF communities from 166 root samples of 17 woody plant species from 10 quadrats in a Chinese subtropical forest using 454 pyrosequencing of 18S rRNA gene to describe symbiotic AMF-plant association. Our results show the woody plant-AMF networks to be highly interconnected and nested, but in antimodular and antispecialized manners. The nonrandom pattern in the woody plant-AMF network was explained by plant and AMF phylogenies, with a tendency for a stronger phylogenetic signal by plant than AMF phylogeny. This study suggests that the phylogenetic niche conservatism in woody plants and their AMF symbionts could contribute to interdependent AMF and plant community assembly in this subtropical forest ecosystem. © 2017 John Wiley & Sons Ltd.
Inferring explicit weighted consensus networks to represent alternative evolutionary histories
2013-01-01
Background The advent of molecular biology techniques and constant increase in availability of genetic material have triggered the development of many phylogenetic tree inference methods. However, several reticulate evolution processes, such as horizontal gene transfer and hybridization, have been shown to blur the species evolutionary history by causing discordance among phylogenies inferred from different genes. Methods To tackle this problem, we hereby describe a new method for inferring and representing alternative (reticulate) evolutionary histories of species as an explicit weighted consensus network which can be constructed from a collection of gene trees with or without prior knowledge of the species phylogeny. Results We provide a way of building a weighted phylogenetic network for each of the following reticulation mechanisms: diploid hybridization, intragenic recombination and complete or partial horizontal gene transfer. We successfully tested our method on some synthetic and real datasets to infer the above-mentioned evolutionary events which may have influenced the evolution of many species. Conclusions Our weighted consensus network inference method allows one to infer, visualize and validate statistically major conflicting signals induced by the mechanisms of reticulate evolution. The results provided by the new method can be used to represent the inferred conflicting signals by means of explicit and easy-to-interpret phylogenetic networks. PMID:24359207
Genetic diversity of Grapevine virus A in Washington and California vineyards.
Alabi, Olufemi J; Al Rwahnih, Maher; Mekuria, Tefera A; Naidu, Rayapati A
2014-05-01
Grapevine virus A (GVA; genus Vitivirus, family Betaflexiviridae) has been implicated with the Kober stem grooving disorder of the rugose wood disease complex. In this study, 26 isolates of GVA recovered from wine grape (Vitis vinifera) cultivars from California and Washington were analyzed for their genetic diversity. An analysis of a portion of the RNA-dependent RNA polymerase (RdRp) and complete coat protein (CP) sequences revealed intra- and inter-isolate sequence diversity. Our results indicated that both RdRp and CP are under strong negative selection based on the normalized values for the ratio of nonsynonymous substitutions per nonsynonymous site to synonymous substitutions per synonymous site. A global phylogenetic analysis of CP sequences revealed segregation of virus isolates into four major clades with no geographic clustering. In contrast, the RdRp-based phylogenetic tree indicated segregation of GVA isolates from California and Washington into six clades, independent of geographic origin or cultivar. Phylogenetic network coupled with recombination analyses showed putative recombination events in both RdRp and CP sequence data sets, with more of these events located in the CP sequence. The preponderance of divergent variants of GVA co-replicating within individual grapevines could increase viral genotypic complexity with implications for phylogenetic analysis and evolutionary history of the virus. The knowledge of genetic diversity of GVA generated in this study will provide a foundation for elucidating the epidemiological characteristics of virus populations at different scales and implementing appropriate management strategies for minimizing the spread of genetic variants of the virus by vectors and via planting materials supplied to nurseries and grape growers.
HIV Transmission Networks in the San Diego–Tijuana Border Region
Mehta, Sanjay R.; Wertheim, Joel O.; Brouwer, Kimberly C.; Wagner, Karla D.; Chaillon, Antoine; Strathdee, Steffanie; Patterson, Thomas L.; Rangel, Maria G.; Vargas, Mlenka; Murrell, Ben; Garfein, Richard; Little, Susan J.; Smith, Davey M.
2015-01-01
Background HIV sequence data can be used to reconstruct local transmission networks. Along international borders, like the San Diego–Tijuana region, understanding the dynamics of HIV transmission across reported risks, racial/ethnic groups, and geography can help direct effective prevention efforts on both sides of the border. Methods We gathered sociodemographic, geographic, clinical, and viral sequence data from HIV infected individuals participating in ten studies in the San Diego–Tijuana border region. Phylogenetic and network analysis was performed to infer putative relationships between HIV sequences. Correlates of identified clusters were evaluated and spatiotemporal relationships were explored using Bayesian phylogeographic analysis. Findings After quality filtering, 843 HIV sequences with associated demographic data and 263 background sequences from the region were analyzed, and 138 clusters were inferred (2–23 individuals). Overall, the rate of clustering did not differ by ethnicity, residence, or sex, but bisexuals were less likely to cluster than heterosexuals or men who have sex with men (p = 0.043), and individuals identifying as white (p ≤ 0.01) were more likely to cluster than other races. Clustering individuals were also 3.5 years younger than non-clustering individuals (p < 0.001). Although the sampled San Diego and Tijuana epidemics were phylogenetically compartmentalized, five clusters contained individuals residing on both sides of the border. Interpretation This study sampled ~ 7% of HIV infected individuals in the border region, and although the sampled networks on each side of the border were largely separate, there was evidence of persistent bidirectional cross-border transmissions that linked risk groups, thus highlighting the importance of the border region as a “melting pot” of risk groups. Funding NIH, VA, and Pendleton Foundation. PMID:26629540
HIV Transmission Networks in the San Diego-Tijuana Border Region.
Mehta, Sanjay R; Wertheim, Joel O; Brouwer, Kimberly C; Wagner, Karla D; Chaillon, Antoine; Strathdee, Steffanie; Patterson, Thomas L; Rangel, Maria G; Vargas, Mlenka; Murrell, Ben; Garfein, Richard; Little, Susan J; Smith, Davey M
2015-10-01
HIV sequence data can be used to reconstruct local transmission networks. Along international borders, like the San Diego-Tijuana region, understanding the dynamics of HIV transmission across reported risks, racial/ethnic groups, and geography can help direct effective prevention efforts on both sides of the border. We gathered sociodemographic, geographic, clinical, and viral sequence data from HIV infected individuals participating in ten studies in the San Diego-Tijuana border region. Phylogenetic and network analysis was performed to infer putative relationships between HIV sequences. Correlates of identified clusters were evaluated and spatiotemporal relationships were explored using Bayesian phylogeographic analysis. After quality filtering, 843 HIV sequences with associated demographic data and 263 background sequences from the region were analyzed, and 138 clusters were inferred (2-23 individuals). Overall, the rate of clustering did not differ by ethnicity, residence, or sex, but bisexuals were less likely to cluster than heterosexuals or men who have sex with men (p = 0.043), and individuals identifying as white (p ≤ 0.01) were more likely to cluster than other races. Clustering individuals were also 3.5 years younger than non-clustering individuals (p < 0.001). Although the sampled San Diego and Tijuana epidemics were phylogenetically compartmentalized, five clusters contained individuals residing on both sides of the border. This study sampled ~ 7% of HIV infected individuals in the border region, and although the sampled networks on each side of the border were largely separate, there was evidence of persistent bidirectional cross-border transmissions that linked risk groups, thus highlighting the importance of the border region as a "melting pot" of risk groups. NIH, VA, and Pendleton Foundation.
Things fall apart: biological species form unconnected parsimony networks.
Hart, Michael W; Sunday, Jennifer
2007-10-22
The generality of operational species definitions is limited by problematic definitions of between-species divergence. A recent phylogenetic species concept based on a simple objective measure of statistically significant genetic differentiation uses between-species application of statistical parsimony networks that are typically used for population genetic analysis within species. Here we review recent phylogeographic studies and reanalyse several mtDNA barcoding studies using this method. We found that (i) alignments of DNA sequences typically fall apart into a separate subnetwork for each Linnean species (but with a higher rate of true positives for mtDNA data) and (ii) DNA sequences from single species typically stick together in a single haplotype network. Departures from these patterns are usually consistent with hybridization or cryptic species diversity.
Are humans the initial source of canine mange?
Andriantsoanirina, Valérie; Fang, Fang; Ariey, Frédéric; Izri, Arezki; Foulet, Françoise; Botterel, Françoise; Bernigaud, Charlotte; Chosidow, Olivier; Huang, Weiyi; Guillot, Jacques; Durand, Rémy
2016-03-25
Scabies, or mange as it is called in animals, is an ectoparasitic contagious infestation caused by the mite Sarcoptes scabiei. Sarcoptic mange is an important veterinary disease leading to significant morbidity and mortality in wild and domestic animals. A widely accepted hypothesis, though never substantiated by factual data, suggests that humans were the initial source of the animal contamination. In this study we performed phylogenetic analyses of populations of S. scabiei from humans and from canids to validate or not the hypothesis of a human origin of the mites infecting domestic dogs. Mites from dogs and foxes were obtained from three French sites and from other countries. A part of cytochrome c oxidase subunit 1 (cox1) gene was amplified and directly sequenced. Other sequences corresponding to mites from humans, raccoon dogs, foxes, jackal and dogs from various geographical areas were retrieved from GenBank. Phylogenetic analyses were performed using the Otodectes cynotis cox1 sequence as outgroup. Maximum Likelihood and Bayesian Inference analysis approaches were used. To visualize the relationship between the haplotypes, a median joining haplotype network was constructed using Network v4.6 according to host. Twenty-one haplotypes were observed among mites collected from five different host species, including humans and canids from nine geographical areas. The phylogenetic trees based on Maximum Likelihood and Bayesian Inference analyses showed similar topologies with few differences in node support values. The results were not consistent with a human origin of S. scabiei mites in dogs and, on the contrary, did not exclude the opposite hypothesis of a host switch from dogs to humans. Phylogenetic relatedness may have an impact in terms of epidemiological control strategy. Our results and other recent studies suggest to re-evaluate the level of transmission between domestic dogs and humans.
Heggarty, Paul; Maguire, Warren; McMahon, April
2010-12-12
Linguists have traditionally represented patterns of divergence within a language family in terms of either a 'splits' model, corresponding to a branching family tree structure, or the wave model, resulting in a (dialect) continuum. Recent phylogenetic analyses, however, have tended to assume the former as a viable idealization also for the latter. But the contrast matters, for it typically reflects different processes in the real world: speaker populations either separated by migrations, or expanding over continuous territory. Since history often leaves a complex of both patterns within the same language family, ideally we need a single model to capture both, and tease apart the respective contributions of each. The 'network' type of phylogenetic method offers this, so we review recent applications to language data. Most have used lexical data, encoded as binary or multi-state characters. We look instead at continuous distance measures of divergence in phonetics. Our output networks combine branch- and continuum-like signals in ways that correspond well to known histories (illustrated for Germanic, and particularly English). We thus challenge the traditional insistence on shared innovations, setting out a new, principled explanation for why complex language histories can emerge correctly from distance measures, despite shared retentions and parallel innovations.
Controlled recovery of phylogenetic communities from an evolutionary model using a network approach
NASA Astrophysics Data System (ADS)
Sousa, Arthur M. Y. R.; Vieira, André P.; Prado, Carmen P. C.; Andrade, Roberto F. S.
2016-04-01
This works reports the use of a complex network approach to produce a phylogenetic classification tree of a simple evolutionary model. This approach has already been used to treat proteomic data of actual extant organisms, but an investigation of its reliability to retrieve a traceable evolutionary history is missing. The used evolutionary model includes key ingredients for the emergence of groups of related organisms by differentiation through random mutations and population growth, but purposefully omits other realistic ingredients that are not strictly necessary to originate an evolutionary history. This choice causes the model to depend only on a small set of parameters, controlling the mutation probability and the population of different species. Our results indicate that for a set of parameter values, the phylogenetic classification produced by the used framework reproduces the actual evolutionary history with a very high average degree of accuracy. This includes parameter values where the species originated by the evolutionary dynamics have modular structures. In the more general context of community identification in complex networks, our model offers a simple setting for evaluating the effects, on the efficiency of community formation and identification, of the underlying dynamics generating the network itself.
Evaluating multiple determinants of the structure of plant-animal mutualistic networks.
Vázquez, Diego P; Chacoff, Natacha P; Cagnolo, Luciano
2009-08-01
The structure of mutualistic networks is likely to result from the simultaneous influence of neutrality and the constraints imposed by complementarity in species phenotypes, phenologies, spatial distributions, phylogenetic relationships, and sampling artifacts. We develop a conceptual and methodological framework to evaluate the relative contributions of these potential determinants. Applying this approach to the analysis of a plant-pollinator network, we show that information on relative abundance and phenology suffices to predict several aggregate network properties (connectance, nestedness, interaction evenness, and interaction asymmetry). However, such information falls short of predicting the detailed network structure (the frequency of pairwise interactions), leaving a large amount of variation unexplained. Taken together, our results suggest that both relative species abundance and complementarity in spatiotemporal distribution contribute substantially to generate observed network patters, but that this information is by no means sufficient to predict the occurrence and frequency of pairwise interactions. Future studies could use our methodological framework to evaluate the generality of our findings in a representative sample of study systems with contrasting ecological conditions.
Mooers, Arne Ø.; Caccone, Adalgisa; Russello, Michael A.
2016-01-01
In the midst of the current biodiversity crisis, conservation efforts might profitably be directed towards ensuring that extinctions do not result in inordinate losses of evolutionary history. Numerous methods have been developed to evaluate the importance of species based on their contribution to total phylogenetic diversity on trees and networks, but existing methods fail to take complementarity into account, and thus cannot identify the best order or subset of taxa to protect. Here, we develop a novel iterative calculation of the heightened evolutionary distinctiveness and globally endangered metric (I-HEDGE) that produces the optimal ranked list for conservation prioritization, taking into account complementarity and based on both phylogenetic diversity and extinction probability. We applied this metric to a phylogenetic network based on mitochondrial control region data from extant and recently extinct giant Galápagos tortoises, a highly endangered group of closely related species. We found that the restoration of two extinct species (a project currently underway) will contribute the greatest gain in phylogenetic diversity, and present an ordered list of rankings that is the optimum complementarity set for conservation prioritization. PMID:27635324
Jensen, Evelyn L; Mooers, Arne Ø; Caccone, Adalgisa; Russello, Michael A
2016-01-01
In the midst of the current biodiversity crisis, conservation efforts might profitably be directed towards ensuring that extinctions do not result in inordinate losses of evolutionary history. Numerous methods have been developed to evaluate the importance of species based on their contribution to total phylogenetic diversity on trees and networks, but existing methods fail to take complementarity into account, and thus cannot identify the best order or subset of taxa to protect. Here, we develop a novel iterative calculation of the heightened evolutionary distinctiveness and globally endangered metric (I-HEDGE) that produces the optimal ranked list for conservation prioritization, taking into account complementarity and based on both phylogenetic diversity and extinction probability. We applied this metric to a phylogenetic network based on mitochondrial control region data from extant and recently extinct giant Galápagos tortoises, a highly endangered group of closely related species. We found that the restoration of two extinct species (a project currently underway) will contribute the greatest gain in phylogenetic diversity, and present an ordered list of rankings that is the optimum complementarity set for conservation prioritization.
Phylogenetic analysis of Tibetan mastiffs based on mitochondrial hypervariable region I.
Ren, Zhanjun; Chen, Huiling; Yang, Xuejiao; Zhang, Chengdong
2017-03-01
Recently, the number of Tibetan mastiffs, which is a precious germplasm resource and cultural heritage, is decreasing sharply. Therefore, the genetic diversity of Tibetan mastiffs needs to be studied to clarify its phylogenetics relationships and lay the foundation for resource protection, rational development and utilization of Tibetan mastiffs. We sequenced hypervariable region I of mitochondrial DNA (mtDNA) of 110 individuals from Tibet region and Gansu province. A total of 12 polymorphic sites were identified which defined eight haplotypes of which H4 and H8 were unique to Tibetan population with H8 being identified first. The haplotype diversity (Hd: 0.808), nucleotide diversity (Pi: 0.603%), the average number of nucleotide difference (K: 3.917) of Tibetan mastiffs from Gansu were higher than those from Tibet region (Hd: 0.794; Pi: 0.589%; K: 3.831), which revealed higher genetic diversity in Gansu. In terms of total population, the genetic variation was low. The median-joining network and phylogenetic tree based on the mtDNA hypervariable region I showed that Tibetan mastiffs originated from grey wolves, as the other domestic dogs and had different history of maternal origin. The mismatch distribution analysis and neutrality tests indicated that Tibetan mastiffs were in genetic equilibrium or in a population decline.
Inferring epidemiological parameters from phylogenetic information for the HIV-1 epidemic among MSM
NASA Astrophysics Data System (ADS)
Quax, Rick; van de Vijver, David A. M. C.; Frentz, Dineke; Sloot, Peter M. A.
2013-09-01
The HIV-1 epidemic in Europe is primarily sustained by a dynamic topology of sexual interactions among MSM who have individual immune systems and behavior. This epidemiological process shapes the phylogeny of the virus population. Both fields of epidemic modeling and phylogenetics have a long history, however it remains difficult to use phylogenetic data to infer epidemiological parameters such as the structure of the sexual network and the per-act infectiousness. This is because phylogenetic data is necessarily incomplete and ambiguous. Here we show that the cluster-size distribution indeed contains information about epidemiological parameters using detailed numberical experiments. We simulate the HIV epidemic among MSM many times using the Monte Carlo method with all parameter values and their ranges taken from literature. For each simulation and the corresponding set of parameter values we calculate the likelihood of reproducing an observed cluster-size distribution. The result is an estimated likelihood distribution of all parameters from the phylogenetic data, in particular the structure of the sexual network, the per-act infectiousness, and the risk behavior reduction upon diagnosis. These likelihood distributions encode the knowledge provided by the observed cluster-size distrbution, which we quantify using information theory. Our work suggests that the growing body of genetic data of patients can be exploited to understand the underlying epidemiological process.
Mondav, Rhiannon; McCalley, Carmody K; Hodgkins, Suzanne B; Frolking, Steve; Saleska, Scott R; Rich, Virginia I; Chanton, Jeff P; Crill, Patrick M
2017-08-01
Biogenic production and release of methane (CH 4 ) from thawing permafrost has the potential to be a strong source of radiative forcing. We investigated changes in the active layer microbial community of three sites representative of distinct permafrost thaw stages at a palsa mire in northern Sweden. The palsa site (intact permafrost and low radiative forcing signature) had a phylogenetically clustered community dominated by Acidobacteria and Proteobacteria. The bog (thawing permafrost and low radiative forcing signature) had lower alpha diversity and midrange phylogenetic clustering, characteristic of ecosystem disturbance affecting habitat filtering. Hydrogenotrophic methanogens and Acidobacteria dominated the bog shifting from palsa-like to fen-like at the waterline. The fen (no underlying permafrost, high radiative forcing signature) had the highest alpha, beta and phylogenetic diversity, was dominated by Proteobacteria and Euryarchaeota and was significantly enriched in methanogens. The Mire microbial network was modular with module cores consisting of clusters of Acidobacteria, Euryarchaeota or Xanthomonodales. Loss of underlying permafrost with associated hydrological shifts correlated to changes in microbial composition, alpha, beta and phylogenetic diversity associated with a higher radiative forcing signature. These results support the complex role of microbial interactions in mediating carbon budget changes and climate feedback in response to climate forcing. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Phylogenetic convolutional neural networks in metagenomics.
Fioravanti, Diego; Giarratano, Ylenia; Maggio, Valerio; Agostinelli, Claudio; Chierici, Marco; Jurman, Giuseppe; Furlanello, Cesare
2018-03-08
Convolutional Neural Networks can be effectively used only when data are endowed with an intrinsic concept of neighbourhood in the input space, as is the case of pixels in images. We introduce here Ph-CNN, a novel deep learning architecture for the classification of metagenomics data based on the Convolutional Neural Networks, with the patristic distance defined on the phylogenetic tree being used as the proximity measure. The patristic distance between variables is used together with a sparsified version of MultiDimensional Scaling to embed the phylogenetic tree in a Euclidean space. Ph-CNN is tested with a domain adaptation approach on synthetic data and on a metagenomics collection of gut microbiota of 38 healthy subjects and 222 Inflammatory Bowel Disease patients, divided in 6 subclasses. Classification performance is promising when compared to classical algorithms like Support Vector Machines and Random Forest and a baseline fully connected neural network, e.g. the Multi-Layer Perceptron. Ph-CNN represents a novel deep learning approach for the classification of metagenomics data. Operatively, the algorithm has been implemented as a custom Keras layer taking care of passing to the following convolutional layer not only the data but also the ranked list of neighbourhood of each sample, thus mimicking the case of image data, transparently to the user.
Yu, J; Blom, J; Glaeser, S P; Jaenicke, S; Juhre, T; Rupp, O; Schwengers, O; Spänig, S; Goesmann, A
2017-11-10
The rapid development of next generation sequencing technology has greatly increased the amount of available microbial genomes. As a result of this development, there is a rising demand for fast and automated approaches in analyzing these genomes in a comparative way. Whole genome sequencing also bears a huge potential for obtaining a higher resolution in phylogenetic and taxonomic classification. During the last decade, several software tools and platforms have been developed in the field of comparative genomics. In this manuscript, we review the most commonly used platforms and approaches for ortholog group analyses with a focus on their potential for phylogenetic and taxonomic research. Furthermore, we describe the latest improvements of the EDGAR platform for comparative genome analyses and present recent examples of its application for the phylogenomic analysis of different taxa. Finally, we illustrate the role of the EDGAR platform as part of the BiGi Center for Microbial Bioinformatics within the German network on Bioinformatics Infrastructure (de.NBI). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Optimal network alignment with graphlet degree vectors.
Milenković, Tijana; Ng, Weng Leong; Hayes, Wayne; Przulj, Natasa
2010-06-30
Important biological information is encoded in the topology of biological networks. Comparative analyses of biological networks are proving to be valuable, as they can lead to transfer of knowledge between species and give deeper insights into biological function, disease, and evolution. We introduce a new method that uses the Hungarian algorithm to produce optimal global alignment between two networks using any cost function. We design a cost function based solely on network topology and use it in our network alignment. Our method can be applied to any two networks, not just biological ones, since it is based only on network topology. We use our new method to align protein-protein interaction networks of two eukaryotic species and demonstrate that our alignment exposes large and topologically complex regions of network similarity. At the same time, our alignment is biologically valid, since many of the aligned protein pairs perform the same biological function. From the alignment, we predict function of yet unannotated proteins, many of which we validate in the literature. Also, we apply our method to find topological similarities between metabolic networks of different species and build phylogenetic trees based on our network alignment score. The phylogenetic trees obtained in this way bear a striking resemblance to the ones obtained by sequence alignments. Our method detects topologically similar regions in large networks that are statistically significant. It does this independent of protein sequence or any other information external to network topology.
Kiefer, Christiane; Koch, Marcus A.
2012-01-01
74 of the currently accepted 111 taxa of the North American genus Boechera (Brassicaceae) were subject to pyhlogenetic reconstruction and network analysis. The dataset comprised 911 accessions for which ITS sequences were analyzed. Phylogenetic analyses yielded largely unresolved trees. Together with the network analysis confirming this result this can be interpreted as an indication for multiple, independent, and rapid diversification events. Network analyses were superimposed with datasets describing i) geographical distribution, ii) taxonomy, iii) reproductive mode, and iv) distribution history based on phylogeographic evidence. Our results provide first direct evidence for enormous reticulate evolution in the entire genus and give further insights into the evolutionary history of this complex genus on a continental scale. In addition two novel single-copy gene markers, orthologues of the Arabidopsis thaliana genes At2g25920 and At3g18900, were analyzed for subsets of taxa and confirmed the findings obtained through the ITS data. PMID:22606266
Using hybridization networks to retrace the evolution of Indo-European languages.
Willems, Matthieu; Lord, Etienne; Laforest, Louise; Labelle, Gilbert; Lapointe, François-Joseph; Di Sciullo, Anna Maria; Makarenkov, Vladimir
2016-09-06
Curious parallels between the processes of species and language evolution have been observed by many researchers. Retracing the evolution of Indo-European (IE) languages remains one of the most intriguing intellectual challenges in historical linguistics. Most of the IE language studies use the traditional phylogenetic tree model to represent the evolution of natural languages, thus not taking into account reticulate evolutionary events, such as language hybridization and word borrowing which can be associated with species hybridization and horizontal gene transfer, respectively. More recently, implicit evolutionary networks, such as split graphs and minimal lateral networks, have been used to account for reticulate evolution in linguistics. Striking parallels existing between the evolution of species and natural languages allowed us to apply three computational biology methods for reconstruction of phylogenetic networks to model the evolution of IE languages. We show how the transfer of methods between the two disciplines can be achieved, making necessary methodological adaptations. Considering basic vocabulary data from the well-known Dyen's lexical database, which contains word forms in 84 IE languages for the meanings of a 200-meaning Swadesh list, we adapt a recently developed computational biology algorithm for building explicit hybridization networks to study the evolution of IE languages and compare our findings to the results provided by the split graph and galled network methods. We conclude that explicit phylogenetic networks can be successfully used to identify donors and recipients of lexical material as well as the degree of influence of each donor language on the corresponding recipient languages. We show that our algorithm is well suited to detect reticulate relationships among languages, and present some historical and linguistic justification for the results obtained. Our findings could be further refined if relevant syntactic, phonological and morphological data could be analyzed along with the available lexical data.
Phylogeny of metabolic networks: a spectral graph theoretical approach.
Deyasi, Krishanu; Banerjee, Anirban; Deb, Bony
2015-10-01
Many methods have been developed for finding the commonalities between different organisms in order to study their phylogeny. The structure of metabolic networks also reveals valuable insights into metabolic capacity of species as well as into the habitats where they have evolved. We constructed metabolic networks of 79 fully sequenced organisms and compared their architectures. We used spectral density of normalized Laplacian matrix for comparing the structure of networks. The eigenvalues of this matrix reflect not only the global architecture of a network but also the local topologies that are produced by different graph evolutionary processes like motif duplication or joining. A divergence measure on spectral densities is used to quantify the distances between various metabolic networks, and a split network is constructed to analyse the phylogeny from these distances. In our analysis, we focused on the species that belong to different classes, but appear more related to each other in the phylogeny. We tried to explore whether they have evolved under similar environmental conditions or have similar life histories. With this focus, we have obtained interesting insights into the phylogenetic commonality between different organisms.
Yu, Yun; Degnan, James H.; Nakhleh, Luay
2012-01-01
Gene tree topologies have proven a powerful data source for various tasks, including species tree inference and species delimitation. Consequently, methods for computing probabilities of gene trees within species trees have been developed and widely used in probabilistic inference frameworks. All these methods assume an underlying multispecies coalescent model. However, when reticulate evolutionary events such as hybridization occur, these methods are inadequate, as they do not account for such events. Methods that account for both hybridization and deep coalescence in computing the probability of a gene tree topology currently exist for very limited cases. However, no such methods exist for general cases, owing primarily to the fact that it is currently unknown how to compute the probability of a gene tree topology within the branches of a phylogenetic network. Here we present a novel method for computing the probability of gene tree topologies on phylogenetic networks and demonstrate its application to the inference of hybridization in the presence of incomplete lineage sorting. We reanalyze a Saccharomyces species data set for which multiple analyses had converged on a species tree candidate. Using our method, though, we show that an evolutionary hypothesis involving hybridization in this group has better support than one of strict divergence. A similar reanalysis on a group of three Drosophila species shows that the data is consistent with hybridization. Further, using extensive simulation studies, we demonstrate the power of gene tree topologies at obtaining accurate estimates of branch lengths and hybridization probabilities of a given phylogenetic network. Finally, we discuss identifiability issues with detecting hybridization, particularly in cases that involve extinction or incomplete sampling of taxa. PMID:22536161
Grid-based International Network for Flu observation (g-INFO).
Doan, Trung-Tung; Bernard, Aurélien; Da-Costa, Ana Lucia; Bloch, Vincent; Le, Thanh-Hoa; Legre, Yannick; Maigne, Lydia; Salzemann, Jean; Sarramia, David; Nguyen, Hong-Quang; Breton, Vincent
2010-01-01
The 2009 H1N1 outbreak has demonstrated that continuing vigilance, planning, and strong public health research capability are essential defenses against emerging health threats. Molecular epidemiology of influenza virus strains provides scientists with clues about the temporal and geographic evolution of the virus. In the present paper, researchers from France and Vietnam are proposing a global surveillance network based on grid technology: the goal is to federate influenza data servers and deploy automatically molecular epidemiology studies. A first prototype based on AMGA and the WISDOM Production Environment extracts daily from NCBI influenza H1N1 sequence data which are processed through a phylogenetic analysis pipeline deployed on EGEE and AuverGrid e-infrastructures. The analysis results are displayed on a web portal (http://g-info.healthgrid.org) for epidemiologists to monitor H1N1 pandemics.
Grummer, Jared A; Morando, Mariana M; Avila, Luciano J; Sites, Jack W; Leaché, Adam D
2018-08-01
Rapid evolutionary radiations are difficult to resolve because divergence events are nearly synchronous and gene flow among nascent species can be high, resulting in a phylogenetic "bush". Large datasets composed of sequence loci from across the genome can potentially help resolve some of these difficult phylogenetic problems. A suitable test case is the Liolaemus fitzingerii species group of lizards, which includes twelve species that are broadly distributed in Argentinean Patagonia. The species in the group have had a complex evolutionary history that has led to high morphological variation and unstable taxonomy. We generated a sequence capture dataset for 28 ingroup individuals of 580 nuclear loci, alongside a mitogenomic dataset, to infer phylogenetic relationships among species in this group. Relationships among species were generally weakly supported with the nuclear data, and along with an inferred age of ∼2.6 million years old, indicate either rapid evolution, hybridization, incomplete lineage sorting, non-informative data, or a combination thereof. We inferred a signal of mito-nuclear discordance, indicating potential hybridization between L. melanops and L. martorii, and phylogenetic network analyses provided support for 5 reticulation events among species. Phasing the nuclear loci did not provide additional insight into relationships or suspected patterns of hybridization. Only one clade, composed of L. camarones, L. fitzingerii, and L. xanthoviridis was recovered across all analyses. Genomic datasets provide molecular systematists with new opportunities to resolve difficult phylogenetic problems, yet the lack of phylogenetic resolution in Patagonian Liolaemus is biologically meaningful and indicative of a recent and rapid evolutionary radiation. The phylogenetic relationships of the Liolaemus fitzingerii group may be best modeled as a reticulated network instead of a bifurcating phylogeny. Copyright © 2018 Elsevier Inc. All rights reserved.
Fujimoto, Kayo; Coghill, Lyndon M; Weier, Christopher A; Hwang, Lu-Yu; Kim, Ju Yeong; Schneider, John A; Metzker, Michael L; Brown, Jeremy M
2017-09-01
We explore the phylogenetic relationships among HIV sequences sampled from young adult black men who have sex with men (YAB-MSM), who are connected through peer referral/social ties and who attend common venues. Using 196 viral sequences sampled from the peripheral blood mononuclear cells of 10 individuals, our preliminary phylogenetic results indicate that these socially connected YAB-MSM are infected with distantly related viruses and provide no evidence for viral transmission between network members. Our results suggest that HIV-prevention strategies that target young adult MSM should extend beyond their network members and local community.
Morgan, Ethan; Nyaku, Amesika N; DʼAquila, Richard T; Schneider, John A
2017-07-01
Phylogenetic analysis determines similarities among HIV genetic sequences from persons infected with HIV, identifying clusters of transmission. We determined characteristics associated with both membership in an HIV transmission cluster and the number of clustered sequences among a cohort of young black men who have sex with men (YBMSM) in Chicago. Pairwise genetic distances of HIV-1 pol sequences were collected during 2013-2016. Potential transmission ties were identified among HIV-infected persons whose sequences were ≤1.5% genetically distant. Putative transmission pairs were defined as ≥1 tie to another sequence. We then determined demographic and risk attributes associated with both membership in an HIV transmission cluster and the number of ties to the sequences from other persons in the cluster. Of 86 available sequences, 31 (36.0%) were tied to ≥1 other sequence. Through multivariable analyses, we determined that those who reported symptoms of depression and those who had a higher number of confidants in their network had significantly decreased odds of membership in transmission clusters. We found that those who had unstable housing and who reported heavy marijuana use had significantly more ties to other individuals within transmission clusters, whereas those identifying as bisexual, those participating in group sex, and those with higher numbers of sexual partners had significantly fewer ties. This study demonstrates the potential for combining phylogenetic and individual and network attributes to target HIV control efforts to persons with potentially higher transmission risk, as well as suggesting some unappreciated specific predictors of transmission risk among YBMSM in Chicago for future study.
Aligning Biomolecular Networks Using Modular Graph Kernels
NASA Astrophysics Data System (ADS)
Towfic, Fadi; Greenlee, M. Heather West; Honavar, Vasant
Comparative analysis of biomolecular networks constructed using measurements from different conditions, tissues, and organisms offer a powerful approach to understanding the structure, function, dynamics, and evolution of complex biological systems. We explore a class of algorithms for aligning large biomolecular networks by breaking down such networks into subgraphs and computing the alignment of the networks based on the alignment of their subgraphs. The resulting subnetworks are compared using graph kernels as scoring functions. We provide implementations of the resulting algorithms as part of BiNA, an open source biomolecular network alignment toolkit. Our experiments using Drosophila melanogaster, Saccharomyces cerevisiae, Mus musculus and Homo sapiens protein-protein interaction networks extracted from the DIP repository of protein-protein interaction data demonstrate that the performance of the proposed algorithms (as measured by % GO term enrichment of subnetworks identified by the alignment) is competitive with some of the state-of-the-art algorithms for pair-wise alignment of large protein-protein interaction networks. Our results also show that the inter-species similarity scores computed based on graph kernels can be used to cluster the species into a species tree that is consistent with the known phylogenetic relationships among the species.
A Single Early Introduction of HIV-1 Subtype B into Central America Accounts for Most Current Cases
Murillo, Wendy; Veras, Nazle; Prosperi, Mattia; de Rivera, Ivette Lorenzana; Paz-Bailey, Gabriela; Morales-Miranda, Sonia; Juarez, Sandra I.; Yang, Chunfu; DeVos, Joshua; Marín, José Pablo; Mild, Mattias; Albert, Jan
2013-01-01
Human immunodeficiency virus type 1 (HIV-1) variants show considerable geographical separation across the world, but there is limited information from Central America. We provide the first detailed investigation of the genetic diversity and molecular epidemiology of HIV-1 in six Central American countries. Phylogenetic analysis was performed on 625 HIV-1 pol gene sequences collected between 2002 and 2010 in Honduras, El Salvador, Nicaragua, Costa Rica, Panama, and Belize. Published sequences from neighboring countries (n = 57) and the rest of the world (n = 740) were included as controls. Maximum likelihood methods were used to explore phylogenetic relationships. Bayesian coalescence-based methods were used to time HIV-1 introductions. Nearly all (98.9%) Central American sequences were of subtype B. Phylogenetic analysis revealed that 437 (70%) sequences clustered within five significantly supported monophyletic clades formed essentially by Central American sequences. One clade contained 386 (62%) sequences from all six countries; the other four clades were smaller and more country specific, suggesting discrete subepidemics. The existence of one large well-supported Central American clade provides evidence that a single introduction of HIV-1 subtype B in Central America accounts for most current cases. An introduction during the early phase of the HIV-1 pandemic may explain its epidemiological success. Moreover, the smaller clades suggest a subsequent regional spread related to specific transmission networks within each country. PMID:23616665
2017-01-01
North America’s Great Basin has long been of interest to biologists due to its high level of organismal endemicity throughout its endorheic watersheds. One example of such a group is the subfamily Empetricthyinae. In this paper, we analyzed the relationships of the Empetrichtyinae and assessed the validity of the subspecies designations given by Williams and Wilde within the group using concatenated phylogenetic tree estimation and species tree estimation. Samples from 19 populations were included covering the entire distribution of the three extant species of Empetricthyinae–Crenichthys nevadae, Crenichthys baileyi and Empetricthys latos. Three nuclear introns (S8 intron 4, S7 intron 1, and P0 intron 1) and one mitochondrial gene (Cytb) were sequenced for phylogenetic analysis. Using these sequences, we generated two separate hypotheses of the evolutionary relationships of Empetrichtyinae- one based on the mitochondrial data and one based on the nuclear data using Bayesian phylogenetics. Haplotype networks were also generated to look at the relationships of the populations within Empetrichthyinae. After comparing the two phylogenetic hypotheses, species trees were generated using *BEAST with the nuclear data to further test the validity of the subspecies within Empetrichthyinae. The mitochondrial analyses supported four lineages within C. baileyi and 2 within C. nevadae. The concatenated nuclear tree was more conserved, supporting one clade and an unresolved polytomy in both species. The species tree analysis supported the presence of two species within both C. baileyi and C. nevadae. Based on the results of these analyses, the subspecies designations of Williams and Wilde are not valid, rather a conservative approach suggests there are two species within C. nevadae and two species within C. baileyi. No structure was found for E. latos or the populations of Empetricthyinae. This study represents one of many demonstrating the invalidity of subspecies and their detriment to species identification, conservation, and understanding. PMID:29077708
Predicting rates of interspecific interaction from phylogenetic trees.
Nuismer, Scott L; Harmon, Luke J
2015-01-01
Integrating phylogenetic information can potentially improve our ability to explain species' traits, patterns of community assembly, the network structure of communities, and ecosystem function. In this study, we use mathematical models to explore the ecological and evolutionary factors that modulate the explanatory power of phylogenetic information for communities of species that interact within a single trophic level. We find that phylogenetic relationships among species can influence trait evolution and rates of interaction among species, but only under particular models of species interaction. For example, when interactions within communities are mediated by a mechanism of phenotype matching, phylogenetic trees make specific predictions about trait evolution and rates of interaction. In contrast, if interactions within a community depend on a mechanism of phenotype differences, phylogenetic information has little, if any, predictive power for trait evolution and interaction rate. Together, these results make clear and testable predictions for when and how evolutionary history is expected to influence contemporary rates of species interaction. © 2014 John Wiley & Sons Ltd/CNRS.
HIV-1 diversity, transmission dynamics and primary drug resistance in Angola.
Bártolo, Inês; Zakovic, Suzana; Martin, Francisco; Palladino, Claudia; Carvalho, Patrícia; Camacho, Ricardo; Thamm, Sven; Clemente, Sofia; Taveira, Nuno
2014-01-01
To assess HIV-1 diversity, transmission dynamics and prevalence of transmitted drug resistance (TDR) in Angola, five years after ART scale-up. Population sequencing of the pol gene was performed on 139 plasma samples collected in 2009 from drug-naive HIV-1 infected individuals living in Luanda. HIV-1 subtypes were determined using phylogenetic analysis. Drug resistance mutations were identified using the Calibrated Population Resistance Tool (CPR). Transmission networks were determined using phylogenetic analysis of all Angolan sequences present in the databases. Evolutionary trends were determined by comparison with a similar survey performed in 2001. 47.1% of the viruses were pure subtypes (all except B), 47.1% were recombinants and 5.8% were untypable. The prevalence of subtype A decreased significantly from 2001 to 2009 (40.0% to 10.8%, P = 0.0019) while the prevalence of unique recombinant forms (URFs) increased > 2-fold (40.0% to 83.1%, P < 0.0001). The most frequent URFs comprised untypable sequences with subtypes H (U/H, n = 7, 10.8%), A (U/A, n = 6, 9.2%) and G (G/U, n = 4, 6.2%). Newly identified U/H recombinants formed a highly supported monophyletic cluster suggesting a local and common origin. TDR mutation K103N was found in one (0.7%) patient (1.6% in 2001). Out of the 364 sequences sampled for transmission network analysis, 130 (35.7%) were part of a transmission network. Forty eight transmission clusters were identified; the majority (56.3%) comprised sequences sampled in 2008-2010 in Luanda which is consistent with a locally fuelled epidemic. Very low genetic distance was found in 27 transmission pairs sampled in the same year, suggesting recent transmission events. Transmission of drug resistant strains was still negligible in Luanda in 2009, five years after the scale-up of ART. The dominance of small and recent transmission clusters and the emergence of new URFs are consistent with a rising HIV-1 epidemics mainly driven by heterosexual transmission.
Takai, Ken; Horikoshi, Koki
1999-01-01
Molecular phylogenetic analysis of a naturally occurring microbial community in a deep-subsurface geothermal environment indicated that the phylogenetic diversity of the microbial population in the environment was extremely limited and that only hyperthermophilic archaeal members closely related to Pyrobaculum were present. All archaeal ribosomal DNA sequences contained intron-like sequences, some of which had open reading frames with repeated homing-endonuclease motifs. The sequence similarity analysis and the phylogenetic analysis of these homing endonucleases suggested the possible phylogenetic relationship among archaeal rRNA-encoded homing endonucleases. PMID:10584021
Wei, Fangping; Chen, Bowen
2012-03-01
To find out the evolutionary relationships among different tRNA sequences of 21 amino acids, 22 networks are constructed. One is constructed from whole tRNAs, and the other 21 networks are constructed from the tRNAs which carry the same amino acids. A new method is proposed such that the alignment scores of any two amino acids groups are determined by the average degree and the average clustering coefficient of their networks. The anticodon feature of isolated tRNA and the phylogenetic trees of 21 group networks are discussed. We find that some isolated tRNA sequences in 21 networks still connect with other tRNAs outside their group, which reflects the fact that those tRNAs might evolve by intercrossing among these 21 groups. We also find that most anticodons among the same cluster are only one base different in the same sites when S ≥ 70, and they stay in the same rank in the ladder of evolutionary relationships. Those observations seem to agree on that some tRNAs might mutate from the same ancestor sequences based on point mutation mechanisms.
Phylogenetic classification of yeasts and related taxa within Pucciniomycotina
Wang, Q.-M.; Yurkov, A.M.; Göker, M.; Lumbsch, H.T.; Leavitt, S.D.; Groenewald, M.; Theelen, B.; Liu, X.-Z.; Boekhout, T.; Bai, F.-Y.
2016-01-01
Most small genera containing yeast species in the Pucciniomycotina (Basidiomycota, Fungi) are monophyletic, whereas larger genera including Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces are polyphyletic. With the implementation of the “One Fungus = One Name” nomenclatural principle these polyphyletic genera were revised. Nine genera, namely Bannoa, Cystobasidiopsis, Colacogloea, Kondoa, Erythrobasidium, Rhodotorula, Sporobolomyces, Sakaguchia and Sterigmatomyces, were emended to include anamorphic and teleomorphic species based on the results obtained by a multi-gene phylogenetic analysis, phylogenetic network analyses, branch length-based methods, as well as morphological, physiological and biochemical comparisons. A new class Spiculogloeomycetes is proposed to accommodate the order Spiculogloeales. The new families Buckleyzymaceae with Buckleyzyma gen. nov., Chrysozymaceae with Chrysozyma gen. nov., Microsporomycetaceae with Microsporomyces gen. nov., Ruineniaceae with Ruinenia gen. nov., Symmetrosporaceae with Symmetrospora gen. nov., Colacogloeaceae and Sakaguchiaceae are proposed. The new genera Bannozyma, Buckleyzyma, Fellozyma, Hamamotoa, Hasegawazyma, Jianyunia, Rhodosporidiobolus, Oberwinklerozyma, Phenoliferia, Pseudobensingtonia, Pseudohyphozyma, Sampaiozyma, Slooffia, Spencerozyma, Trigonosporomyces, Udeniozyma, Vonarxula, Yamadamyces and Yunzhangia are proposed to accommodate species segregated from the genera Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces. Ballistosporomyces is emended and reintroduced to include three Sporobolomyces species of the sasicola clade. A total of 111 new combinations are proposed in this study. PMID:26951631
Phylogenetic classification of yeasts and related taxa within Pucciniomycotina.
Wang, Q-M; Yurkov, A M; Göker, M; Lumbsch, H T; Leavitt, S D; Groenewald, M; Theelen, B; Liu, X-Z; Boekhout, T; Bai, F-Y
2015-06-01
Most small genera containing yeast species in the Pucciniomycotina (Basidiomycota, Fungi) are monophyletic, whereas larger genera including Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces are polyphyletic. With the implementation of the "One Fungus = One Name" nomenclatural principle these polyphyletic genera were revised. Nine genera, namely Bannoa, Cystobasidiopsis, Colacogloea, Kondoa, Erythrobasidium, Rhodotorula, Sporobolomyces, Sakaguchia and Sterigmatomyces, were emended to include anamorphic and teleomorphic species based on the results obtained by a multi-gene phylogenetic analysis, phylogenetic network analyses, branch length-based methods, as well as morphological, physiological and biochemical comparisons. A new class Spiculogloeomycetes is proposed to accommodate the order Spiculogloeales. The new families Buckleyzymaceae with Buckleyzyma gen. nov., Chrysozymaceae with Chrysozyma gen. nov., Microsporomycetaceae with Microsporomyces gen. nov., Ruineniaceae with Ruinenia gen. nov., Symmetrosporaceae with Symmetrospora gen. nov., Colacogloeaceae and Sakaguchiaceae are proposed. The new genera Bannozyma, Buckleyzyma, Fellozyma, Hamamotoa, Hasegawazyma, Jianyunia, Rhodosporidiobolus, Oberwinklerozyma, Phenoliferia, Pseudobensingtonia, Pseudohyphozyma, Sampaiozyma, Slooffia, Spencerozyma, Trigonosporomyces, Udeniozyma, Vonarxula, Yamadamyces and Yunzhangia are proposed to accommodate species segregated from the genera Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces. Ballistosporomyces is emended and reintroduced to include three Sporobolomyces species of the sasicola clade. A total of 111 new combinations are proposed in this study.
Histology and affinity of anaspids, and the early evolution of the vertebrate dermal skeleton
Keating, Joseph N.; Donoghue, Philip C. J.
2016-01-01
The assembly of the gnathostome bodyplan constitutes a formative episode in vertebrate evolutionary history, an interval in which the mineralized skeleton and its canonical suite of cell and tissue types originated. Fossil jawless fishes, assigned to the gnathostome stem-lineage, provide an unparalleled insight into the origin and evolution of the skeleton, hindered only by uncertainty over the phylogenetic position and evolutionary significance of key clades. Chief among these are the jawless anaspids, whose skeletal composition, a rich source of phylogenetic information, is poorly characterized. Here we survey the histology of representatives spanning anaspid diversity and infer their generalized skeletal architecture. The anaspid dermal skeleton is composed of odontodes comprising spheritic dentine and enameloid, overlying a basal layer of acellular parallel fibre bone containing an extensive shallow canal network. A recoded and revised phylogenetic analysis using equal and implied weights parsimony resolves anaspids as monophyletic, nested among stem-gnathostomes. Our results suggest the anaspid dermal skeleton is a degenerate derivative of a histologically more complex ancestral vertebrate skeleton, rather than reflecting primitive simplicity. Hypotheses that anaspids are ancestral skeletonizing lampreys, or a derived lineage of jawless vertebrates with paired fins, are rejected. PMID:26962140
Arruda, A G; Friendship, R; Carpenter, J; Hand, K; Ojkic, D; Poljak, Z
2017-02-01
The main goal of this study was to investigate the occurrence of porcine reproductive and respiratory syndrome virus (PRRSV)-specific genotypes in swine sites in Ontario (Canada) using molecular, spatial and network data from a porcine reproductive and respiratory syndrome (PRRS) regional control project. For each site, location, animal movement service provider (truck companies), PRRSV status and sequencing data of the open reading frame 5 (ORF5) were obtained. Three-kilometre buffers were created to evaluate neighbourhood characteristics for each site. Social network analysis was conducted on swine sites and trucking companies to assemble the network and define network components. Three different PRRSV genotypes were used as outcomes for statistical analysis based on the region's phylogenetic tree of the ORF5. Multivariable exact logistic regression was conducted to investigate the association between being positive for a specific genotype and two main exposures of interest: (i) having at least one neighbour within three km also positive for the same genotype outside the production system and (ii) having at least one positive site for the same genotype in the same truck network component outside the production system. Results showed that the importance of area spread and truck network on PRRSV occurrence differed according to genotype. Additionally, the Ontario PRRS database appears suitable for conducting regional disease investigations. Finally, the use of relatively new tools available for network, spatial and molecular analysis could be useful in investigation, control and prevention of endemic infectious diseases in animal populations. © 2015 Blackwell Verlag GmbH.
Prokaryote genome fluidity: toward a system approach of the mobilome.
Toussaint, Ariane; Chandler, Mick
2012-01-01
The importance of horizontal/lateral gene transfer (LGT) in shaping the genomes of prokaryotic organisms has been recognized in recent years as a result of analysis of the increasing number of available genome sequences. LGT is largely due to the transfer and recombination activities of mobile genetic elements (MGEs). Bacterial and archaeal genomes are mosaics of vertically and horizontally transmitted DNA segments. This generates reticulate relationships between members of the prokaryotic world that are better represented by networks than by "classical" phylogenetic trees. In this review we summarize the nature and activities of MGEs, and the problems that presently limit their analysis on a large scale. We propose routes to improve their annotation in the flow of genomic and metagenomic sequences that currently exist and those that become available. We describe network analysis of evolutionary relationships among some MGE categories and sketch out possible developments of this type of approach to get more insight into the role of the mobilome in bacterial adaptation and evolution.
A stochastic simulator of birth-death master equations with application to phylodynamics.
Vaughan, Timothy G; Drummond, Alexei J
2013-06-01
In this article, we present a versatile new software tool for the simulation and analysis of stochastic models of population phylodynamics and chemical kinetics. Models are specified via an expressive and human-readable XML format and can be used as the basis for generating either single population histories or large ensembles of such histories. Importantly, phylogenetic trees or networks can be generated alongside the histories they correspond to, enabling investigations into the interplay between genealogies and population dynamics. Summary statistics such as means and variances can be recorded in place of the full ensemble, allowing for a reduction in the amount of memory used--an important consideration for models including large numbers of individual subpopulations or demes. In the case of population size histories, the resulting simulation output is written to disk in the flexible JSON format, which is easily read into numerical analysis environments such as R for visualization or further processing. Simulated phylogenetic trees can be recorded using the standard Newick or NEXUS formats, with extensions to these formats used for non-tree-like inheritance relationships.
A Stochastic Simulator of Birth–Death Master Equations with Application to Phylodynamics
Vaughan, Timothy G.; Drummond, Alexei J.
2013-01-01
In this article, we present a versatile new software tool for the simulation and analysis of stochastic models of population phylodynamics and chemical kinetics. Models are specified via an expressive and human-readable XML format and can be used as the basis for generating either single population histories or large ensembles of such histories. Importantly, phylogenetic trees or networks can be generated alongside the histories they correspond to, enabling investigations into the interplay between genealogies and population dynamics. Summary statistics such as means and variances can be recorded in place of the full ensemble, allowing for a reduction in the amount of memory used—an important consideration for models including large numbers of individual subpopulations or demes. In the case of population size histories, the resulting simulation output is written to disk in the flexible JSON format, which is easily read into numerical analysis environments such as R for visualization or further processing. Simulated phylogenetic trees can be recorded using the standard Newick or NEXUS formats, with extensions to these formats used for non-tree-like inheritance relationships. PMID:23505043
A family of interaction-adjusted indices of community similarity.
Schmidt, Thomas Sebastian Benedikt; Matias Rodrigues, João Frederico; von Mering, Christian
2017-03-01
Interactions between taxa are essential drivers of ecological community structure and dynamics, but they are not taken into account by traditional indices of β diversity. In this study, we propose a novel family of indices that quantify community similarity in the context of taxa interaction networks. Using publicly available datasets, we assessed the performance of two specific indices that are Taxa INteraction-Adjusted (TINA, based on taxa co-occurrence networks), and Phylogenetic INteraction-Adjusted (PINA, based on phylogenetic similarities). TINA and PINA outperformed traditional indices when partitioning human-associated microbial communities according to habitat, even for extremely downsampled datasets, and when organising ocean micro-eukaryotic plankton diversity according to geographical and physicochemical gradients. We argue that interaction-adjusted indices capture novel aspects of diversity outside the scope of traditional approaches, highlighting the biological significance of ecological association networks in the interpretation of community similarity.
A family of interaction-adjusted indices of community similarity
Schmidt, Thomas Sebastian Benedikt; Matias Rodrigues, João Frederico; von Mering, Christian
2017-01-01
Interactions between taxa are essential drivers of ecological community structure and dynamics, but they are not taken into account by traditional indices of β diversity. In this study, we propose a novel family of indices that quantify community similarity in the context of taxa interaction networks. Using publicly available datasets, we assessed the performance of two specific indices that are Taxa INteraction-Adjusted (TINA, based on taxa co-occurrence networks), and Phylogenetic INteraction-Adjusted (PINA, based on phylogenetic similarities). TINA and PINA outperformed traditional indices when partitioning human-associated microbial communities according to habitat, even for extremely downsampled datasets, and when organising ocean micro-eukaryotic plankton diversity according to geographical and physicochemical gradients. We argue that interaction-adjusted indices capture novel aspects of diversity outside the scope of traditional approaches, highlighting the biological significance of ecological association networks in the interpretation of community similarity. PMID:27935587
Open Reading Frame Phylogenetic Analysis on the Cloud
2013-01-01
Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843
Balasubramaniam, Krishna N; Beisner, Brianne A; Berman, Carol M; De Marco, Arianna; Duboscq, Julie; Koirala, Sabina; Majolo, Bonaventura; MacIntosh, Andrew J; McFarland, Richard; Molesti, Sandra; Ogawa, Hideshi; Petit, Odile; Schino, Gabriele; Sosa, Sebastian; Sueur, Cédric; Thierry, Bernard; de Waal, Frans B M; McCowan, Brenda
2018-01-01
Among nonhuman primates, the evolutionary underpinnings of variation in social structure remain debated, with both ancestral relationships and adaptation to current conditions hypothesized to play determining roles. Here we assess whether interspecific variation in higher-order aspects of female macaque (genus: Macaca) dominance and grooming social structure show phylogenetic signals, that is, greater similarity among more closely-related species. We use a social network approach to describe higher-order characteristics of social structure, based on both direct interactions and secondary pathways that connect group members. We also ask whether network traits covary with each other, with species-typical social style grades, and/or with sociodemographic characteristics, specifically group size, sex-ratio, and current living condition (captive vs. free-living). We assembled 34-38 datasets of female-female dyadic aggression and allogrooming among captive and free-living macaques representing 10 species. We calculated dominance (transitivity, certainty), and grooming (centrality coefficient, Newman's modularity, clustering coefficient) network traits as aspects of social structure. Computations of K statistics and randomization tests on multiple phylogenies revealed moderate-strong phylogenetic signals in dominance traits, but moderate-weak signals in grooming traits. GLMMs showed that grooming traits did not covary with dominance traits and/or social style grade. Rather, modularity and clustering coefficient, but not centrality coefficient, were strongly predicted by group size and current living condition. Specifically, larger groups showed more modular networks with sparsely-connected clusters than smaller groups. Further, this effect was independent of variation in living condition, and/or sampling effort. In summary, our results reveal that female dominance networks were more phylogenetically conserved across macaque species than grooming networks, which were more labile to sociodemographic factors. Such findings narrow down the processes that influence interspecific variation in two core aspects of macaque social structure. Future directions should include using phylogeographic approaches, and addressing challenges in examining the effects of socioecological factors on primate social structure. © 2017 Wiley Periodicals, Inc.
Fountain-Jones, Nicholas M; Packer, Craig; Troyer, Jennifer L; VanderWaal, Kimberly; Robinson, Stacie; Jacquot, Maude; Craft, Meggan E
2017-10-01
Heterogeneity within pathogen species can have important consequences for how pathogens transmit across landscapes; however, discerning different transmission routes is challenging. Here, we apply both phylodynamic and phylogenetic community ecology techniques to examine the consequences of pathogen heterogeneity on transmission by assessing subtype-specific transmission pathways in a social carnivore. We use comprehensive social and spatial network data to examine transmission pathways for three subtypes of feline immunodeficiency virus (FIV Ple ) in African lions (Panthera leo) at multiple scales in the Serengeti National Park, Tanzania. We used FIV Ple molecular data to examine the role of social organization and lion density in shaping transmission pathways and tested to what extent vertical (i.e., father- and/or mother-offspring relationships) or horizontal (between unrelated individuals) transmission underpinned these patterns for each subtype. Using the same data, we constructed subtype-specific FIV Ple co-occurrence networks and assessed what combination of social networks, spatial networks or co-infection best structured the FIV Ple network. While social organization (i.e., pride) was an important component of FIV Ple transmission pathways at all scales, we find that FIV Ple subtypes exhibited different transmission pathways at within- and between-pride scales. A combination of social and spatial networks, coupled with consideration of subtype co-infection, was likely to be important for FIV Ple transmission for the two major subtypes, but the relative contribution of each factor was strongly subtype-specific. Our study provides evidence that pathogen heterogeneity is important in understanding pathogen transmission, which could have consequences for how endemic pathogens are managed. Furthermore, we demonstrate that community phylogenetic ecology coupled with phylodynamic techniques can reveal insights into the differential evolutionary pressures acting on virus subtypes, which can manifest into landscape-level effects. © 2017 The Authors. Journal of Animal Ecology © 2017 British Ecological Society.
Ismail, Nurul-Ain; Adilah-Amrannudin, Nurul; Hamsidi, Mayamin; Ismail, Rodziah; Dom, Nazri Che; Ahmad, Abu Hassan; Mastuki, Mohd Fahmi; Camalxaman, Siti Nazrina
2017-11-07
The global expansion of Ae. albopictus from its native range in Southeast Asia has been implicated in the recent emergence of dengue endemicity in Malaysia. Genetic variability studies of Ae. albopictus are currently lacking in the Malaysian setting, yet are crucial to enhancing the existing vector control strategies. The study was conducted to establish the genetic variability of maternally inherited mitochondrial DNA encoding for cytochrome oxidase subunit 1 (CO1) gene in Ae. albopictus. Twelve localities were selected in the Subang Jaya district based on temporal indices utilizing 120 mosquito samples. Genetic polymorphism and phylogenetic analysis were conducted to unveil the genetic variability and geographic origins of Ae. albopictus. The haplotype network was mapped to determine the genealogical relationship of sequences among groups of population in the Asian region. Comparison of Malaysian CO1 sequences with sequences derived from five Asian countries revealed genetically distinct Ae. albopictus populations. Phylogenetic analysis revealed that all sequences from other Asian countries descended from the same genetic lineage as the Malaysian sequences. Noteworthy, our study highlights the discovery of 20 novel haplotypes within the Malaysian population which to date had not been reported. These findings could help determine the genetic variation of this invasive species, which in turn could possibly improve the current dengue vector surveillance strategies, locally and regionally. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Xing, Rui; Gao, Qing-Bo; Zhang, Fa-Qi; Fu, Peng-Cheng; Wang, Jiu-Li; Yan, Hui-Ying; Chen, Shi-Long
2017-08-01
Floccularia luteovirens, as an ectomycorrhizal fungus, is widely distributed in the Qinghai-Tibet Plateau. As an edible fungus, it is famous for its unique flavor. Former studies mainly focus on the chemical composition and genetic structure of this species. However, the phylogenetic relationship between genotypes remains unknown. In this study, the genetic variation and phylogenetic relationship between the genotypes of F. luteovirens in Qinghai-Tibet Plateau was estimated through the analysis on two protein-coding genes (rpb1 and ef-1α) from 398 individuals collected from 24 wild populations. The sample covered the entire range of this species during all the growth seasons from 2011 to 2015. 13 genotypes were detected and moderate genetic diversity was revealed. Based on the results of network analysis, the maximum likelihood (ML), maximum parsimony (MP), and Bayesian inference (BI) analyses, the genotypes H-1, H-4, H-6, H-8, H-10, and H-11 were grouped into one clade. Additionally, a relatively higher genotype diversity (average h value is 0.722) and unique genotypes in the northeast edge of Qinghai- Tibet plateau have been found, combined with the results of mismatch analysis and neutrality tests indicated that Southeast Qinghai-Tibet plateau was a refuge for F. luteovirens during the historical geological or climatic events (uplifting of the Qinghai-Tibet Plateau or Last Glacial Maximum). Furthermore, the present distribution of the species on the Qinghai-Tibet plateau has resulted from the recent population expansion. Our findings provide a foundation for the future study of the evolutionary history and the speciation of this species.
NASA Astrophysics Data System (ADS)
Xiang, Xing; Wang, Ruicheng; Wang, Hongmei; Gong, Linfeng; Man, Baiying; Xu, Ying
2017-03-01
High abundance and widespread distribution of the archaeal phylum Bathyarchaeota in marine environment have been recognized recently, but knowledge about Bathyarchaeota in terrestrial settings and their correlation with environmental parameters is fairly limited. Here we reported the abundance of Bathyarchaeota members across different ecosystems and their correlation with environmental factors by constructing 16S rRNA clone libraries of peat from the Dajiuhu Peatland, coupling with bioinformatics analysis of 16S rRNA data available to date in NCBI database. In total, 1456 Bathyarchaeota sequences from 28 sites were subjected to UniFrac analysis based on phylogenetic distance and multivariate regression tree analysis of taxonomy. Both phylogenetic and taxon-based approaches showed that salinity, total organic carbon and temperature significantly influenced the distribution of Bathyarchaeota across different terrestrial habitats. By applying the ecological concept of ‘indicator species’, we identify 9 indicator groups among the 6 habitats with the most in the estuary sediments. Network analysis showed that members of Bathyarchaeota formed the “backbone” of archaeal community and often co-occurred with Methanomicrobia. These results suggest that Bathyarchaeota may play an important ecological role within archaeal communities via a potential symbiotic association with Methanomicrobia. Our results shed light on understanding of the biogeography, potential functions of Bathyarchaeota and environment conditions that influence Bathyarchaea distribution in terrestrial settings.
Heggarty, Paul; Maguire, Warren; McMahon, April
2010-01-01
Linguists have traditionally represented patterns of divergence within a language family in terms of either a ‘splits’ model, corresponding to a branching family tree structure, or the wave model, resulting in a (dialect) continuum. Recent phylogenetic analyses, however, have tended to assume the former as a viable idealization also for the latter. But the contrast matters, for it typically reflects different processes in the real world: speaker populations either separated by migrations, or expanding over continuous territory. Since history often leaves a complex of both patterns within the same language family, ideally we need a single model to capture both, and tease apart the respective contributions of each. The ‘network’ type of phylogenetic method offers this, so we review recent applications to language data. Most have used lexical data, encoded as binary or multi-state characters. We look instead at continuous distance measures of divergence in phonetics. Our output networks combine branch- and continuum-like signals in ways that correspond well to known histories (illustrated for Germanic, and particularly English). We thus challenge the traditional insistence on shared innovations, setting out a new, principled explanation for why complex language histories can emerge correctly from distance measures, despite shared retentions and parallel innovations. PMID:21041208
Wei, Wei; Chai, Zhuangzhuang; Xie, Yinge; Gao, Kuan; Cui, Mengyuan; Jiang, Ying
2017-01-01
Mitogen-activated protein kinases (MAPKs) play essential roles in mediating biotic and abiotic stress responses in plants. However, the MAPK gene family in strawberry has not been systematically characterized. Here, we performed a genome-wide survey and identified 12 MAPK genes in the Fragaria vesca genome. Protein domain analysis indicated that all FvMAPKs have typical protein kinase domains. Sequence alignments and phylogenetic analysis classified the FvMAPK genes into four different groups. Conserved motif and exon-intron organization supported the evolutionary relationships inferred from the phylogenetic analysis. Analysis of the stress-related cis-regulatory element in the promoters and subcellular localization predictions of FvMAPKs were also performed. Gene transcript profile analysis showed that the majority of the FvMAPK genes were ubiquitously transcribed in strawberry leaves after Podosphaera aphanis inoculation and after treatment with cold, heat, drought, salt and the exogenous hormones abscisic acid, ethephon, methyl jasmonate, and salicylic acid. RT-qPCR showed that six selected FvMAPK genes comprehensively responded to various stimuli. Additionally, interaction networks revealed that the crucial signaling transduction controlled by FvMAPKs may be involved in the biotic and abiotic stress responses. Our results may provide useful information for future research on the function of the MAPK gene family and the genetic improvement of strawberry resistance to environmental stresses. PMID:28562633
Structure versus time in the evolutionary diversification of avian carotenoid metabolic networks.
Morrison, Erin S; Badyaev, Alexander V
2018-05-01
Historical associations of genes and proteins are thought to delineate pathways available to subsequent evolution; however, the effects of past functional involvements on contemporary evolution are rarely quantified. Here, we examined the extent to which the structure of a carotenoid enzymatic network persists in avian evolution. Specifically, we tested whether the evolution of carotenoid networks was most concordant with phylogenetically structured expansion from core reactions of common ancestors or with subsampling of biochemical pathway modules from an ancestral network. We compared structural and historical associations in 467 carotenoid networks of extant and ancestral species and uncovered the overwhelming effect of pre-existing metabolic network structure on carotenoid diversification over the last 50 million years of avian evolution. Over evolutionary time, birds repeatedly subsampled and recombined conserved biochemical modules, which likely maintained the overall structure of the carotenoid metabolic network during avian evolution. These findings explain the recurrent convergence of evolutionary distant species in carotenoid metabolism and weak phylogenetic signal in avian carotenoid evolution. Remarkable retention of an ancient metabolic structure throughout extensive and prolonged ecological diversification in avian carotenoid metabolism illustrates a fundamental requirement of organismal evolution - historical continuity of a deterministic network that links past and present functional associations of its components. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.
Matsudaira, Kazunari; Hamada, Yuzuru; Bunlungsup, Srichan; Ishida, Takafumi; San, Aye Mi; Malaivijitnond, Suchinda
2018-05-11
Macaca fascicularis aurea (Burmese long-tailed macaque) is 1 of the 10 subspecies of Macaca fascicularis. Despite having few morphological differences from other subspecies, a recent phylogeographic study showed that M. f. aurea is clearly distinct genetically from Macaca fascicularis fascicularis (common long-tailed macaque) and suggests that M. f. aurea experienced a disparate evolutionary pathway versus other subspecies. To construct a detailed evolutionary history of M. f. aurea and its relationships with other macaque species, we performed phylogenetic analyses and divergence time estimation of whole mitochondrial genomes (2 M. f. aurea, 8 M. f. fascicularis, and 16 animals of 12 macaque species) and 2871 bp of the Y chromosome (1 M. f. aurea, 2 M. f. fascicularis, and 5 animals of 5 macaque species) and haplotype network analysis of 758 bp of the Y chromosome (1 M. f. aurea, 2 M. f. fascicularis, and 21 animals of 19 macaque species). Whereas the Y chromosome of M. f. aurea clustered with those of the fascicularis species group in the phylogenetic and haplotype network analyses, its mtDNA clustered within the clade of the sinica species group. Based on this phylogenetic incongruence and the estimated divergence times, we propose that proto-M. f. aurea underwent hybridization with a population of the sinica species group between 2.5 and 0.95 MYA after divergence from the common ancestor of M. fascicularis. Hybridization and introgression might have been central in the evolution of M. f. aurea, similar to what occurred in the evolution of other macaque species and subspecies.
Lopez, Philippe; Halary, Sébastien; Bapteste, Eric
2015-10-26
Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. Many genes families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered.
Molecular Phylogenetics: Concepts for a Newcomer.
Ajawatanawong, Pravech
Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.
Inns, Thomas; Jombart, Thibaut; Ashton, Philip; Loman, Nicolas; Chatt, Carol; Messelhaeusser, Ute; Rabsch, Wolfgang; Simon, Sandra; Nikisins, Sergejs; Bernard, Helen; le Hello, Simon; Jourdan da-Silva, Nathalie; Kornschober, Christian; Mossong, Joel; Hawkey, Peter; de Pinna, Elizabeth; Grant, Kathie; Cleary, Paul
2016-01-01
Outbreaks of Salmonella Enteritidis have long been associated with contaminated poultry and eggs. In the summer of 2014 a large multi-national outbreak of Salmonella Enteritidis phage type 14b occurred with over 350 cases reported in the United Kingdom, Germany, Austria, France and Luxembourg. Egg supply network investigation and microbiological sampling identified the source to be a Bavarian egg producer. As part of the international investigation into the outbreak, over 400 isolates were sequenced including isolates from cases, implicated UK premises and eggs from the suspected source producer. We were able to show a clear statistical correlation between the topology of the UK egg distribution network and the phylogenetic network of outbreak isolates. This correlation can most plausibly be explained by different parts of the egg distribution network being supplied by eggs solely from independent premises of the Bavarian egg producer (Company X). Microbiological sampling from the source premises, traceback information and information on the interventions carried out at the egg production premises all supported this conclusion. The level of insight into the outbreak epidemiology provided by whole-genome sequencing (WGS) would not have been possible using traditional microbial typing methods. PMID:28348865
Dallman, Tim; Inns, Thomas; Jombart, Thibaut; Ashton, Philip; Loman, Nicolas; Chatt, Carol; Messelhaeusser, Ute; Rabsch, Wolfgang; Simon, Sandra; Nikisins, Sergejs; Bernard, Helen; le Hello, Simon; Jourdan da-Silva, Nathalie; Kornschober, Christian; Mossong, Joel; Hawkey, Peter; de Pinna, Elizabeth; Grant, Kathie; Cleary, Paul
2016-08-01
Outbreaks of Salmonella Enteritidis have long been associated with contaminated poultry and eggs. In the summer of 2014 a large multi-national outbreak of Salmonella Enteritidis phage type 14b occurred with over 350 cases reported in the United Kingdom, Germany, Austria, France and Luxembourg. Egg supply network investigation and microbiological sampling identified the source to be a Bavarian egg producer. As part of the international investigation into the outbreak, over 400 isolates were sequenced including isolates from cases, implicated UK premises and eggs from the suspected source producer. We were able to show a clear statistical correlation between the topology of the UK egg distribution network and the phylogenetic network of outbreak isolates. This correlation can most plausibly be explained by different parts of the egg distribution network being supplied by eggs solely from independent premises of the Bavarian egg producer (Company X). Microbiological sampling from the source premises, traceback information and information on the interventions carried out at the egg production premises all supported this conclusion. The level of insight into the outbreak epidemiology provided by whole-genome sequencing (WGS) would not have been possible using traditional microbial typing methods.
Lai, Alessia; Simonetti, Francesco R; Zehender, Gianguglielmo; De Luca, Andrea; Micheli, Valeria; Meraviglia, Paola; Corsi, Paola; Bagnarelli, Patrizia; Almi, Paolo; Zoncada, Alessia; Paolucci, Stefania; Gonnelli, Angela; Colao, Grazia; Tacconi, Danilo; Franzetti, Marco; Ciccozzi, Massimo; Zazzi, Maurizio; Balotta, Claudia
2012-01-01
About 40% of the Italian HIV-1 epidemic due to non-B variants is sustained by F1 clade, which circulates at high prevalence in South America and Eastern Europe. Aim of this study was to define clade F1 origin, population dynamics and epidemiological networks through phylogenetic approaches. We analyzed pol sequences of 343 patients carrying F1 subtype stored in the ARCA database from 1998 to 2009. Citizenship of patients was as follows: 72.6% Italians, 9.3% South Americans and 7.3% Rumanians. Heterosexuals, Homo-bisexuals, Intravenous Drug Users accounted for 58.1%, 24.0% and 8.8% of patients, respectively. Phylogenetic analysis indicated that 70% of sequences clustered in 27 transmission networks. Two distinct groups were identified; the first clade, encompassing 56 sequences, included all Rumanian patients. The second group involved the remaining clusters and included 10 South American Homo-bisexuals in 9 distinct clusters. Heterosexual modality of infection was significantly associated with the probability to be detected in transmission networks. Heterosexuals were prevalent either among Italians (67.2%) or Rumanians (50%); by contrast, Homo-bisexuals accounted for 71.4% of South Americans. Among patients with resistant strains the proportion of clustering sequences was 57.1%, involving 14 clusters (51.8%). Resistance in clusters tended to be higher in South Americans (28.6%) compared to Italian (17.7%) and Rumanian patients (14.3%). A striking proportion of epidemiological networks could be identified in heterosexuals carrying F1 subtype residing in Italy. Italian Heterosexual males predominated within epidemiological clusters while foreign patients were mainly Heterosexual Rumanians, both males and females, and South American Homo-bisexuals. Tree topology suggested that F1 variant from South America gave rise to the Italian F1 epidemic through multiple introduction events. The contact tracing also revealed an unexpected burden of resistance in epidemiological clusters underlying the need of public interventions to limit the spread of non-B subtypes and transmitted drug resistance.
LEEBENS-MACK, JIM; VISION, TODD; BRENNER, ERIC; BOWERS, JOHN E.; CANNON, STEVEN; CLEMENT, MARK J.; CUNNINGHAM, CLIFFORD W.; dePAMPHILIS, CLAUDE; deSALLE, ROB; DOYLE, JEFF J.; EISEN, JONATHAN A.; GU, XUN; HARSHMAN, JOHN; JANSEN, ROBERT K.; KELLOGG, ELIZABETH A.; KOONIN, EUGENE V.; MISHLER, BRENT D.; PHILIPPE, HERVÉ; PIRES, J. CHRIS; QIU, YIN-LONG; RHEE, SEUNG Y.; SJÖLANDER, KIMMEN; SOLTIS, DOUGLAS E.; SOLTIS, PAMELA S.; STEVENSON, DENNIS W.; WALL, KERR; WARNOW, TANDY; ZMASEK, CHRISTIAN
2011-01-01
In the eight years since phylogenomics was introduced as the intersection of genomics and phylogenetics, the field has provided fundamental insights into gene function, genome history and organismal relationships. The utility of phylogenomics is growing with the increase in the number and diversity of taxa for which whole genome and large transcriptome sequence sets are being generated. We assert that the synergy between genomic and phylogenetic perspectives in comparative biology would be enhanced by the development and refinement of minimal reporting standards for phylogenetic analyses. Encouraged by the development of the Minimum Information About a Microarray Experiment (MIAME) standard, we propose a similar roadmap for the development of a Minimal Information About a Phylogenetic Analysis (MIAPA) standard. Key in the successful development and implementation of such a standard will be broad participation by developers of phylogenetic analysis software, phylogenetic database developers, practitioners of phylogenomics, and journal editors. PMID:16901231
Estrada-Peña, Agustín; Sprong, Hein; Cabezas-Cruz, Alejandro; de la Fuente, José; Ramo, Ana; Coipan, Elena Claudia
2016-09-23
The bacteria of the Borrelia burgdorferi (s.l.) (BBG) complex constitute a group of tick-transmitted pathogens that are linked to many vertebrate and tick species. The ecological relationships between the pathogens, the ticks and the vertebrate carriers have not been analysed. The aim of this study was to quantitatively analyse these interactions by creating a network based on a large dataset of associations. Specifically, we examined the relative positions of partners in the network, the phylogenetic diversity of the tick's hosts and its impact on BBG circulation. The secondary aim was to evaluate the segregation of BBG strains in different vectors and reservoirs. BBG circulates through a nested recursive network of ticks and vertebrates that delineate closed clusters. Each cluster contains generalist ticks with high values of centrality as well as specialist ticks that originate nested sub-networks and that link secondary vertebrates to the cluster. These results highlighted the importance of host phylogenetic diversity for ticks in the circulation of BBG, as this diversity was correlated with high centrality values for the ticks. The ticks and BBG species in each cluster were not significantly associated with specific branches of the phylogeny of host genera (R 2 = 0.156, P = 0.784 for BBG; R 2 = 0.299, P = 0.699 for ticks). A few host genera had higher centrality values and thus higher importance for BBG circulation. However, the combined contribution of hosts with low centrality values could maintain active BBG foci. The results suggested that ticks do not share strains of BBG, which were highly segregated among sympatric species of ticks. We conclude that BBG circulation is supported by a highly redundant network. This network includes ticks with high centrality values and high host phylogenetic diversity as well as ticks with low centrality values. This promotes ecological sub-networks and reflects the high resilience of BBG circulation. The functional redundancy in BBG circulation reduces disturbances due to the removal of vertebrates as it allows ticks to fill other biotic niches.
Plant-Pollinator Coextinctions and the Loss of Plant Functional and Phylogenetic Diversity
Vieira, Marcos Costa; Cianciaruso, Marcus Vinicius; Almeida-Neto, Mário
2013-01-01
Plant-pollinator coextinctions are likely to become more frequent as habitat alteration and climate change continue to threaten pollinators. The consequences of the resulting collapse of plant communities will depend partly on how quickly plant functional and phylogenetic diversity decline following pollinator extinctions. We investigated the functional and phylogenetic consequences of pollinator extinctions by simulating coextinctions in seven plant-pollinator networks coupled with independent data on plant phylogeny and functional traits. Declines in plant functional diversity were slower than expected under a scenario of random extinctions, while phylogenetic diversity often decreased faster than expected by chance. Our results show that plant functional diversity was relatively robust to plant-pollinator coextinctions, despite the underlying rapid loss of evolutionary history. Thus, our study suggests the possibility of uncoupled responses of functional and phylogenetic diversity to species coextinctions, highlighting the importance of considering both dimensions of biodiversity explicitly in ecological studies and when planning for the conservation of species and interactions. PMID:24312281
Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.
Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser
2010-05-01
Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.
Nishtala, Sneha; Neelamraju, Yaseswini; Janga, Sarath Chandra
2016-05-10
RNA-binding proteins (RBPs) are pivotal in orchestrating several steps in the metabolism of RNA in eukaryotes thereby controlling an extensive network of RBP-RNA interactions. Here, we employed CLIP (cross-linking immunoprecipitation)-seq datasets for 60 human RBPs and RIP-ChIP (RNP immunoprecipitation-microarray) data for 69 yeast RBPs to construct a network of genome-wide RBP- target RNA interactions for each RBP. We show in humans that majority (~78%) of the RBPs are strongly associated with their target transcripts at transcript level while ~95% of the studied RBPs were also found to be strongly associated with expression levels of target transcripts when protein expression levels of RBPs were employed. At transcript level, RBP - RNA interaction data for the yeast genome, exhibited a strong association for 63% of the RBPs, confirming the association to be conserved across large phylogenetic distances. Analysis to uncover the features contributing to these associations revealed the number of target transcripts and length of the selected protein-coding transcript of an RBP at the transcript level while intensity of the CLIP signal, number of RNA-Binding domains, location of the binding site on the transcript, to be significant at the protein level. Our analysis will contribute to improved modelling and prediction of post-transcriptional networks.
NASA Astrophysics Data System (ADS)
Nishtala, Sneha; Neelamraju, Yaseswini; Janga, Sarath Chandra
2016-05-01
RNA-binding proteins (RBPs) are pivotal in orchestrating several steps in the metabolism of RNA in eukaryotes thereby controlling an extensive network of RBP-RNA interactions. Here, we employed CLIP (cross-linking immunoprecipitation)-seq datasets for 60 human RBPs and RIP-ChIP (RNP immunoprecipitation-microarray) data for 69 yeast RBPs to construct a network of genome-wide RBP- target RNA interactions for each RBP. We show in humans that majority (~78%) of the RBPs are strongly associated with their target transcripts at transcript level while ~95% of the studied RBPs were also found to be strongly associated with expression levels of target transcripts when protein expression levels of RBPs were employed. At transcript level, RBP - RNA interaction data for the yeast genome, exhibited a strong association for 63% of the RBPs, confirming the association to be conserved across large phylogenetic distances. Analysis to uncover the features contributing to these associations revealed the number of target transcripts and length of the selected protein-coding transcript of an RBP at the transcript level while intensity of the CLIP signal, number of RNA-Binding domains, location of the binding site on the transcript, to be significant at the protein level. Our analysis will contribute to improved modelling and prediction of post-transcriptional networks.
Alignment-free protein interaction network comparison
Ali, Waqar; Rito, Tiago; Reinert, Gesine; Sun, Fengzhu; Deane, Charlotte M.
2014-01-01
Motivation: Biological network comparison software largely relies on the concept of alignment where close matches between the nodes of two or more networks are sought. These node matches are based on sequence similarity and/or interaction patterns. However, because of the incomplete and error-prone datasets currently available, such methods have had limited success. Moreover, the results of network alignment are in general not amenable for distance-based evolutionary analysis of sets of networks. In this article, we describe Netdis, a topology-based distance measure between networks, which offers the possibility of network phylogeny reconstruction. Results: We first demonstrate that Netdis is able to correctly separate different random graph model types independent of network size and density. The biological applicability of the method is then shown by its ability to build the correct phylogenetic tree of species based solely on the topology of current protein interaction networks. Our results provide new evidence that the topology of protein interaction networks contains information about evolutionary processes, despite the lack of conservation of individual interactions. As Netdis is applicable to all networks because of its speed and simplicity, we apply it to a large collection of biological and non-biological networks where it clusters diverse networks by type. Availability and implementation: The source code of the program is freely available at http://www.stats.ox.ac.uk/research/proteins/resources. Contact: w.ali@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25161230
Marr, Melissa M; Brace, Selina; Schreve, Danielle C; Barnes, Ian
2018-02-09
Establishing true phylogenetic relationships between populations is a critical consideration when sourcing individuals for translocation. This presents huge difficulties with threatened and endangered species that have become extirpated from large areas of their former range. We utilise ancient DNA (aDNA) to reconstruct the phylogenetic relationships of a keystone species which has become extinct in Britain, the Eurasian beaver Castor fiber. We sequenced seventeen 492 bp partial tRNAPro and control region sequences from Late Pleistocene and Holocene age beavers and included these in network, demographic and genealogy analyses. The mode of postglacial population expansion from refugia was investigated by employing tests of neutrality and a pairwise mismatch distribution analysis. We found evidence of a pre-Late Glacial Maximum ancestor for the Western C. fiber clade which experienced a rapid demographic expansion during the terminal Pleistocene to early Holocene period. Ancient British beavers were found to originate from the Western phylogroup but showed no phylogenetic affinity to any one modern relict population over another. Instead, we find that they formed part of a large, continuous, pan-Western European clade that harbored little internal substructure. Our study highlights the utility of aDNA in reconstructing population histories of extirpated species which has real-world implications for conservation planning.
NASA Astrophysics Data System (ADS)
Mittal, Shikha; Banduni, Pooja; Mallikarjuna, Mallana G.; Rao, Atmakuri R.; Jain, Prashant A.; Dash, Prasanta K.; Thirunavukkarasu, Nepolean
2018-05-01
Drought is one of the major threats to maize production. In order to improve the production and to breed tolerant hybrids, understanding the genes and regulatory mechanisms during drought stress is important. Transcription factors (TFs) play a major role in gene regulation and many TFs have been identified in response to drought stress. In our experiment, a set of 15 major TF families comprising 1436 genes was structurally and functionally characterized using in-silico tools and a gene expression assay. All 1436 genes were mapped on 10 chromosome of maize. The functional annotation indicated the involvement of these genes in ABA signaling, ROS scavenging, photosynthesis, stomatal regulation, and sucrose metabolism. Duplication was identified as the primary force in divergence and expansion of TF families. Phylogenetic relationship was developed individually for each TF family as well as combined TF families. Phylogenetic analysis grouped the TF family of genes into TF-specific and mixed groups. Phylogenetic analysis of genes belonging to various TF families suggested that the origin of TFs occurred in the lineage of maize evolution. Gene structure analysis revealed that more number of genes were intron-rich as compared to intronless genes. Drought-responsive CRE’s such as ABREA, ABREB, DRE1 and DRECRTCOREAT have been identified. Expression and interaction analyses identified leaf-specific bZIP TF, GRMZM2G140355, as a potential contributor toward drought tolerance in maize. We also analyzed protein-protein interaction network of 269 drought-responsive genes belonging to different drought-related TFs. The information generated on structural and functional characteristics, expression and interaction of the drought-related TF families will be useful to decipher the drought tolerance mechanisms and to derive drought-tolerant genotypes in maize.
Rajakumaran, P; Vaseeharan, B; Jayakumar, R; Chidambara, R
2014-01-01
Understanding of accurate phylogenetic relationship among Penaeidae shrimp is important for academic and fisheries industry. The Morphometric and Randomly amplified polymorphic DNA (RAPD) analysis was used to make the phylogenetic relationsip among 13 Penaeidae shrimp. For morphometric analysis forty variables and total lengths of shrimp were measured for each species, and removed the effect of size variation. The size normalized values obtained was subjected to UPGMA (Unweighted Pair-Group Method with Arithmetic Mean) cluster analysis. For RAPD analysis, the four primers showed reliable differentiation between species, and used correlation coefficient between the DNA banding patterns of 13 Penaeidae species to construct UPGMA dendrogram. Phylogenetic relationship from morphometric and molecular analysis for Penaeidae species found to be congruent. We concluded that as the results from morphometry investigations concur with molecular one, phylogenetic relationship obtained for the studied Penaeidae are considered to be reliable.
Scholz, Alexander; Rabaey, David; Stein, Anke; Cochard, Hervé; Smets, Erik; Jansen, Steven
2013-07-01
Various structure-function relationships regarding drought-induced cavitation resistance of secondary xylem have been postulated. These hypotheses were tested on wood of 10 Prunus species showing a range in P50 (i.e., the pressure corresponding to 50% loss of hydraulic conductivity) from -3.54 to -6.27 MPa. Hydraulically relevant wood characters were quantified using light and electron microscopy. A phylogenetic tree was constructed to investigate evolutionary correlations using a phylogenetically independent contrast (PIC) analysis. Vessel-grouping characters were found to be most informative in explaining interspecific variation in P50, with cavitation-resistant species showing more solitary vessels than less resistant species. Co-evolution between vessel-grouping indices and P50 was reported. P50 was weakly correlated with the shape of the intervessel pit aperture, but not with the total intervessel pit membrane area per vessel. A negative correlation was found between P50 and intervessel pit membrane thickness, but this relationship was not supported by the PIC analysis. Cavitation resistance has co-evolved with vessel grouping within Prunus and was mainly influenced by the spatial distribution of the vessel network.
Lo Giudice, Angelina; Brilli, Matteo; Bruni, Vivia; De Domenico, Maria; Fani, Renato; Michaud, Luigi
2007-06-01
One hundred and forty bacteria isolated from Antarctic seawater samples were examined for their ability to inhibit the growth of indigenous isolates and their sensitivity to antibacterial activity expressed by one another. On the basis of 16S rRNA gene sequencing and analysis, bacterial isolates were assigned to five phylogenetically different taxa, Actinobacteria, alpha and gamma subclasses of Proteobacteria, Bacillaceae, and Bacteroidetes. Twenty-one isolates (15%), predominantly Actinobacteria, exhibited antagonistic properties against marine bacteria of Antarctic origin. Members of Bacteroidetes and Firmicutes did not show any inhibitory activity. Differences were observed among inhibition patterns of single isolates, suggesting that their activity was more likely strain-specific rather than dependent on phylogenetic affiliation. A novel analysis based on network theory confirmed these results, showing that the structure of this population is probably robust to perturbations, but also that it depends strongly on the most active strains. The determination of plasmid incidence in the bacterial strains investigated revealed that there was no correlation between their presence and the antagonistic activity. The data presented here provide evidence for the antagonistic interactions within bacterial strains inhabiting Antarctic seawater and suggest the potential exploitation of Antarctic bacteria as a novel source of antibiotics.
Ota, Yuko; Yamanaka, Takashi; Murata, Hitoshi; Neda, Hitoshi; Ohta, Akira; Kawai, Masataka; Yamada, Akiyoshi; Konno, Miki; Tanaka, Chihiro
2012-01-01
Tricholoma matsutake (S. Ito & S. Imai) Singer and its allied species are referred to as matsutake worldwide and are the most economically important edible mushrooms in Japan. They are widely distributed in the northern hemisphere and established an ectomycorrhizal relationship with conifer and broadleaf trees. To clarify relationships among T. matsutake and its allies, and to delimit phylogenetic species, we analyzed multilocus datasets (ITS, megB1, tef, gpd) with samples that were correctly identified based on morphological characteristics. Phylogenetic analyses clearly identified four major groups: matsutake, T. bakamatsutake, T. fulvocastaneum and T. caligatum; the latter three species were outside the matsutake group. The haplotype analyses and median-joining haplotype network analyses showed that the matsutake group included four closely related but clearly distinct taxa (T. matsutake, T. anatolicum, Tricholoma sp. from Mexico and T. magnivelare) from different geographical regions; these were considered to be distinct phylogenetic species.
Oliveira, Alberto; Bleicher, Lucas; Schrago, Carlos G; Silva Junior, Floriano Paes
2018-05-01
Phospholipases A2 (PLA 2 s) comprise a superfamily of glycerophospholipids hydrolyzing enzymes present in many organisms in nature, whose catalytic activity was majorly unveiled by analysis of snake venoms. The latter have pharmaceutical and biotechnological interests and can be divided into different functional sub-classes. Our goal was to identify important residues and their relation to the functional and class-specific characteristics in the PLA 2 s family with special emphasis on snake venom PLA 2 s (svPLA 2 s). We identified such residues by conservation analysis and decomposition of residue coevolution networks (DRCN), annotated the results based on the available literature on PLA 2 s, structural analysis and molecular dynamics simulations, and related the results to the phylogenetic distribution of these proteins. A filtered alignment of PLA 2 s revealed 14 highly conserved positions and 3 sets of coevolved residues, which were annotated according to their structural or functional role. These residues are mostly involved in ligand binding and catalysis, calcium-binding, the formation of disulfide bridges and a hydrophobic cluster close to the binding site. An independent validation of the inference of structure-function relationships from our co-evolution analysis on the svPLA2s family was obtained by the analysis of the pattern of selection acting on the Viperidae and Elapidae lineages. Additionally, a molecular dynamics simulation on the Lys49 PLA 2 from Agkistrodon contortrix laticinctus was carried out to further investigate the correlation of the Lys49-Glu69 pair. Our results suggest this configuration can result in a novel conformation where the binding cavity collapses due to the approximation of two loops caused by a strong salt bridge between Glu69 and Arg34. Finally, phylogenetic analysis indicated a correlation between the presence of residues in the coevolved sets found in this analysis and the clade localization. The results provide a guide for important positions in the family of PLA 2 s, and potential new objects of investigation. Copyright © 2018 Elsevier Ltd. All rights reserved.
Neuron-Like Networks Between Ribosomal Proteins Within the Ribosome
NASA Astrophysics Data System (ADS)
Poirot, Olivier; Timsit, Youri
2016-05-01
From brain to the World Wide Web, information-processing networks share common scale invariant properties. Here, we reveal the existence of neural-like networks at a molecular scale within the ribosome. We show that with their extensions, ribosomal proteins form complex assortative interaction networks through which they communicate through tiny interfaces. The analysis of the crystal structures of 50S eubacterial particles reveals that most of these interfaces involve key phylogenetically conserved residues. The systematic observation of interactions between basic and aromatic amino acids at the interfaces and along the extension provides new structural insights that may contribute to decipher the molecular mechanisms of signal transmission within or between the ribosomal proteins. Similar to neurons interacting through “molecular synapses”, ribosomal proteins form a network that suggest an analogy with a simple molecular brain in which the “sensory-proteins” innervate the functional ribosomal sites, while the “inter-proteins” interconnect them into circuits suitable to process the information flow that circulates during protein synthesis. It is likely that these circuits have evolved to coordinate both the complex macromolecular motions and the binding of the multiple factors during translation. This opens new perspectives on nanoscale information transfer and processing.
The Double-Stranded DNA Virosphere as a Modular Hierarchical Network of Gene Sharing
Iranzo, Jaime
2016-01-01
ABSTRACT Virus genomes are prone to extensive gene loss, gain, and exchange and share no universal genes. Therefore, in a broad-scale study of virus evolution, gene and genome network analyses can complement traditional phylogenetics. We performed an exhaustive comparative analysis of the genomes of double-stranded DNA (dsDNA) viruses by using the bipartite network approach and found a robust hierarchical modularity in the dsDNA virosphere. Bipartite networks consist of two classes of nodes, with nodes in one class, in this case genomes, being connected via nodes of the second class, in this case genes. Such a network can be partitioned into modules that combine nodes from both classes. The bipartite network of dsDNA viruses includes 19 modules that form 5 major and 3 minor supermodules. Of these modules, 11 include tailed bacteriophages, reflecting the diversity of this largest group of viruses. The module analysis quantitatively validates and refines previously proposed nontrivial evolutionary relationships. An expansive supermodule combines the large and giant viruses of the putative order “Megavirales” with diverse moderate-sized viruses and related mobile elements. All viruses in this supermodule share a distinct morphogenetic tool kit with a double jelly roll major capsid protein. Herpesviruses and tailed bacteriophages comprise another supermodule, held together by a distinct set of morphogenetic proteins centered on the HK97-like major capsid protein. Together, these two supermodules cover the great majority of currently known dsDNA viruses. We formally identify a set of 14 viral hallmark genes that comprise the hubs of the network and account for most of the intermodule connections. PMID:27486193
Fournier, Bertrand; Mouly, Arnaud; Gillet, François
2016-01-01
Understanding the factors underlying the co-occurrence of multiple species remains a challenge in ecology. Biotic interactions, environmental filtering and neutral processes are among the main mechanisms evoked to explain species co-occurrence. However, they are most often studied separately or even considered as mutually exclusive. This likely hampers a more global understanding of species assembly. Here, we investigate the general hypothesis that the structure of co-occurrence networks results from multiple assembly rules and its potential implications for grassland ecosystems. We surveyed orthopteran and plant communities in 48 permanent grasslands of the French Jura Mountains and gathered functional and phylogenetic data for all species. We constructed a network of plant and orthopteran species co-occurrences and verified whether its structure was modular or nested. We investigated the role of all species in the structure of the network (modularity and nestedness). We also investigated the assembly rules driving the structure of the plant-orthopteran co-occurrence network by using null models on species functional traits, phylogenetic relatedness and environmental conditions. We finally compared our results to abundance-based approaches. We found that the plant-orthopteran co-occurrence network had a modular organization. Community assembly rules differed among modules for plants while interactions with plants best explained the distribution of orthopterans into modules. Few species had a disproportionately high positive contribution to this modular organization and are likely to have a key importance to modulate future changes. The impact of agricultural practices was restricted to some modules (3 out of 5) suggesting that shifts in agricultural practices might not impact the entire plant-orthopteran co-occurrence network. These findings support our hypothesis that multiple assembly rules drive the modular structure of the plant-orthopteran network. This modular structure is likely to play a key role in the response of grassland ecosystems to future changes by limiting the impact of changes in agricultural practices such as intensification to some modules leaving species from other modules poorly impacted. The next step is to understand the importance of this modular structure for the long-term maintenance of grassland ecosystem structure and functions as well as to develop tools to integrate network structure into models to improve their capacity to predict future changes. PMID:27582754
Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.
Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich
2004-03-01
By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.
Multiple introductions and onward transmission of HIV-1 subtype B strains in Shanghai, China.
Li, Xiaoshan; Zhu, Kexin; Xue, Yile; Wei, Feiran; Gao, Rong; Duerr, Ralf; Fang, Kun; Li, Wei; Song, Yue; Du, Guoping; Yan, Wenjuan; Musa, Taha Hussein; Ge, You; Ji, Yu; Zhong, Ping; Wei, Pingmin
2017-08-01
To investigate the viral genetic evolution, spatial origins and patterns of transmission of HIV-1 subtype B in Shanghai, China. A total of 242 Shanghai subtype B and 1519 reference pol sequences were subjected to phylogenetic inference and genetic transmission network analyses. Phylogenetic analysis revealed that subtype B strains circulating in Shanghai were genetically diverse and closely associated with viral sequence lineages in Beijing (76 of 242 [31.4%]), Central China (Henan/Hebei/Hunan/Hubei) (43 of 242 [17.8%]), Chinese Taiwan (20 of 242 [8.3%]), Japan (6 of 242 [2.5%]), and Korea (7 of 242 [2.9%]), suggesting multiple introductions into Shanghai from mainland China and Taiwan, Japan, and Korea. Interestingly, a monophyletic Shanghai lineage (SH-L) (36 of 242 [14.9%]) of HIV-1 subtype B most likely originated from an Argentine strain, transferred through Liaoning infected individuals. In-depth analyses of 195 Shanghai subtype B sequences revealed that a total of 37.9% (n = 74) sequences contributed to 35 transmission networks, whereof 33.8% (n = 25) of the sequences associated with infected individuals from other provinces. Our new findings reflect the evolution complexity and transmission dynamics of HIV-1 subtype B in Shanghai, which would provide critical information for the design of effective prevention measures against HIV transmission. Copyright © 2017 The British Infection Association. Published by Elsevier Ltd. All rights reserved.
treespace: Statistical exploration of landscapes of phylogenetic trees.
Jombart, Thibaut; Kendall, Michelle; Almagro-Garcia, Jacob; Colijn, Caroline
2017-11-01
The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
De Barro, Paul; Ahmed, Muhammad Z
2011-01-01
A challenge within the context of cryptic species is the delimitation of individual species within the complex. Statistical parsimony network analytics offers the opportunity to explore limits in situations where there are insufficient species-specific morphological characters to separate taxa. The results also enable us to explore the spread in taxa that have invaded globally. Using a 657 bp portion of mitochondrial cytochrome oxidase 1 from 352 unique haplotypes belonging to the Bemisia tabaci cryptic species complex, the analysis revealed 28 networks plus 7 unconnected individual haplotypes. Of the networks, 24 corresponded to the putative species identified using the rule set devised by Dinsdale et al. (2010). Only two species proposed in Dinsdale et al. (2010) departed substantially from the structure suggested by the analysis. The analysis of the two invasive members of the complex, Mediterranean (MED) and Middle East - Asia Minor 1 (MEAM1), showed that in both cases only a small number of haplotypes represent the majority that have spread beyond the home range; one MEAM1 and three MED haplotypes account for >80% of the GenBank records. Israel is a possible source of the globally invasive MEAM1 whereas MED has two possible sources. The first is the eastern Mediterranean which has invaded only the USA, primarily Florida and to a lesser extent California. The second are western Mediterranean haplotypes that have spread to the USA, Asia and South America. The structure for MED supports two home range distributions, a Sub-Saharan range and a Mediterranean range. The MEAM1 network supports the Middle East - Asia Minor region. The network analyses show a high level of congruence with the species identified in a previous phylogenetic analysis. The analysis of the two globally invasive members of the complex support the view that global invasion often involve very small portions of the available genetic diversity.
Intercontinental spread of asian-origin H5N8 to North America through Beringia by migratory birds
Lee, Dong-Hun; Kim Torchetti, Mia; Winker, Kevin; Ip, Hon S.; Swayne, David E.; Song, Chang-Seon
2015-01-01
Phylogenetic network analysis and understanding of waterfowl migration patterns suggest the Eurasian H5N8 clade 2.3.4.4 avian influenza virus emerged in late 2013 in China, spread in early 2014 to South Korea and Japan, and reached Siberia and Beringia by summer 2014 via migratory birds. Three genetically distinct subgroups emerged and subsequently spread along different flyways during fall 2014 into Europe, North America, and East Asia, respectively. All three subgroups reappeared in Japan, a wintering site for waterfowl from Eurasia and parts of North America.
Gu, Hao; Goodale, Eben; Chen, Jin
2015-03-18
The study of mutualistic plant and animal networks is an emerging field of ecological research. We reviewed progress in this field over the past 30 years. While earlier studies mostly focused on network structure, stability, and biodiversity maintenance, recent studies have investigated the conservation implications of mutualistic networks, specifically the influence of invasive species and how networks respond to habitat loss. Current research has also focused on evolutionary questions including phylogenetic signal in networks, impact of networks on the coevolution of interacting partners, and network influences on the evolution of interacting species. We outline some directions for future research, particularly the evolution of specialization in mutualistic networks, and provide concrete recommendations for environmental managers.
USDA-ARS?s Scientific Manuscript database
An extensive phylogenetic analysis and genus-level taxonomic revision of Paranoplocephala Lühe, 1910 -like cestodes (Cyclophyllidea, Anoplocephalidae) are presented. The phylogenetic analysis is based on DNA sequences of two partial mitochondrial genes, i.e. cytochrome c oxidase subunit 1 (cox1) and...
Lemoh, Chris; Ryan, Claire E.; Sekawi, Zamberi; Hearps, Anna C.; Aleksic, Eman; Chibo, Doris; Grierson, Jeffrey; Baho, Samia; Street, Alan; Hellard, Margaret; Biggs, Beverley-Ann; Crowe, Suzanne M.
2013-01-01
African-born Australians are a recognised “priority population” in Australia's Sixth National HIV/AIDS Strategy. We compared exposure location and route for African-born people living with HIV (PLHIV) in Victoria, Australia, with HIV-1 pol subtype from drug resistance assays and geographical origin suggested by phylogenetic analysis of env gene. Twenty adult HIV positive African-born Victorian residents were recruited via treating doctors. HIV exposure details were obtained from interviews and case notes. Viral RNA was extracted from participant stored plasma or whole blood. The env V3 region was sequenced and compared to globally representative reference HIV-1 sequences in the Los Alamos National Library HIV Database. Twelve participants reported exposure via heterosexual sex and two via iatrogenic blood exposures; four were men having sex with men (MSM); two were exposed via unknown routes. Eight participants reported exposure in their countries of birth, seven in Australia, three in other countries and two in unknown locations. Genotype results (pol) were available for ten participants. HIV env amplification was successful in eighteen cases. HIV-1 subtype was identified in all participants: eight both pol and env; ten env alone and two pol alone. Twelve were subtype C, four subtype B, three subtype A and one subtype CRF02_AG. Reported exposure location was consistent with the phylogenetic clustering of env sequences. African Australians are members of multiple transnational social and sexual networks influencing their exposure to HIV. Phylogenetic analysis may complement traditional surveillance to discern patterns of HIV exposure, providing focus for HIV prevention programs in mobile populations. PMID:24391866
SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.
Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H
2014-02-26
Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.
May, Shoshanna; Ngui, Siew Lin; Collins, Sarah; Lattimore, Sam; Ramsay, Mary; Tedder, Richard S; Ijaz, Samreen
2015-03-01
Analysis of laboratory testing data collected through the Sentinel Surveillance programme has provided a method for identifying individuals who have recently acquired their hepatitis C virus (HCV) infection. Access to samples from these individuals provided a rare opportunity to undertake molecular characterization studies. To describe the epidemiology and genetic diversity of hepatitis C in recent seroconverter infections and to predict how this will impact on HCV treatment and control. One hundred and forty seven samples were available from individuals, identified to have recently acquired their HCV infection. Genotype determination with additional phylogenetic analysis was carried out on NS5B sequences. Analysis across the NS3 region investigated the presence of antiviral resistance mutations. Where possible, molecular data was linked to demographic and risk/behavioural factor information. The majority of new infections occurred in males with a mean age of 37 years. The most commonly observed genotypes were 1a (49%) and 3a (42%) and injecting drug use (58%) was the most common risk factor. Genotype distribution differed between persons who inject drugs and those with other risk factors suggesting two possible epidemics. Phylogenetic analysis indicated possible transmission networks within specific risk groups. Amino acid changes associated with antiviral resistance were noted in the NS3 region in some samples. Continued surveillance of linked molecular, virological, demographic and epidemiological information on recently acquired infections will contribute to understanding the on-going HCV epidemic in England. Copyright © 2015 Elsevier B.V. All rights reserved.
The algebra of the general Markov model on phylogenetic trees and networks.
Sumner, J G; Holland, B R; Jarvis, P D
2012-04-01
It is known that the Kimura 3ST model of sequence evolution on phylogenetic trees can be extended quite naturally to arbitrary split systems. However, this extension relies heavily on mathematical peculiarities of the associated Hadamard transformation, and providing an analogous augmentation of the general Markov model has thus far been elusive. In this paper, we rectify this shortcoming by showing how to extend the general Markov model on trees to include incompatible edges; and even further to more general network models. This is achieved by exploring the algebra of the generators of the continuous-time Markov chain together with the “splitting” operator that generates the branching process on phylogenetic trees. For simplicity, we proceed by discussing the two state case and then show that our results are easily extended to more states with little complication. Intriguingly, upon restriction of the two state general Markov model to the parameter space of the binary symmetric model, our extension is indistinguishable from the Hadamard approach only on trees; as soon as any incompatible splits are introduced the two approaches give rise to differing probability distributions with disparate structure. Through exploration of a simple example, we give an argument that our extension to more general networks has desirable properties that the previous approaches do not share. In particular, our construction allows for convergent evolution of previously divergent lineages; a property that is of significant interest for biological applications.
The WRKY transcription factor family and senescence in switchgrass.
Rinerson, Charles I; Scully, Erin D; Palmer, Nathan A; Donze-Reiner, Teresa; Rabara, Roel C; Tripathi, Prateek; Shen, Qingxi J; Sattler, Scott E; Rohila, Jai S; Sarath, Gautam; Rushton, Paul J
2015-11-09
Early aerial senescence in switchgrass (Panicum virgatum) can significantly limit biomass yields. WRKY transcription factors that can regulate senescence could be used to reprogram senescence and enhance biomass yields. All potential WRKY genes present in the version 1.0 of the switchgrass genome were identified and curated using manual and bioinformatic methods. Expression profiles of WRKY genes in switchgrass flag leaf RNA-Seq datasets were analyzed using clustering and network analyses tools to identify both WRKY and WRKY-associated gene co-expression networks during leaf development and senescence onset. We identified 240 switchgrass WRKY genes including members of the RW5 and RW6 families of resistance proteins. Weighted gene co-expression network analysis of the flag leaf transcriptomes across development readily separated clusters of co-expressed genes into thirteen modules. A visualization highlighted separation of modules associated with the early and senescence-onset phases of flag leaf growth. The senescence-associated module contained 3000 genes including 23 WRKYs. Putative promoter regions of senescence-associated WRKY genes contained several cis-element-like sequences suggestive of responsiveness to both senescence and stress signaling pathways. A phylogenetic comparison of senescence-associated WRKY genes from switchgrass flag leaf with senescence-associated WRKY genes from other plants revealed notable hotspots in Group I, IIb, and IIe of the phylogenetic tree. We have identified and named 240 WRKY genes in the switchgrass genome. Twenty three of these genes show elevated mRNA levels during the onset of flag leaf senescence. Eleven of the WRKY genes were found in hotspots of related senescence-associated genes from multiple species and thus represent promising targets for future switchgrass genetic improvement. Overall, individual WRKY gene expression profiles could be readily linked to developmental stages of flag leaves.
Jones, Christopher M; Stres, Blaz; Rosenquist, Magnus; Hallin, Sara
2008-09-01
Denitrification is a facultative respiratory pathway in which nitrite (NO2(-)), nitric oxide (NO), and nitrous oxide (N2O) are successively reduced to nitrogen gas (N(2)), effectively closing the nitrogen cycle. The ability to denitrify is widely dispersed among prokaryotes, and this polyphyletic distribution has raised the possibility of horizontal gene transfer (HGT) having a substantial role in the evolution of denitrification. Comparisons of 16S rRNA and denitrification gene phylogenies in recent studies support this possibility; however, these results remain speculative as they are based on visual comparisons of phylogenies from partial sequences. We reanalyzed publicly available nirS, nirK, norB, and nosZ partial sequences using Bayesian and maximum likelihood phylogenetic inference. Concomitant analysis of denitrification genes with 16S rRNA sequences from the same organisms showed substantial differences between the trees, which were supported by examining the posterior probability of monophyletic constraints at different taxonomic levels. Although these differences suggest HGT of denitrification genes, the presence of structural variants for nirK, norB, and nosZ makes it difficult to determine HGT from other evolutionary events. Additional analysis using phylogenetic networks and likelihood ratio tests of phylogenies based on full-length sequences retrieved from genomes also revealed significant differences in tree topologies among denitrification and 16S rRNA gene phylogenies, with the exception of the nosZ gene phylogeny within the data set of the nirK-harboring genomes. However, inspection of codon usage and G + C content plots from complete genomes gave no evidence for recent HGT. Instead, the close proximity of denitrification gene copies in the genomes of several denitrifying bacteria suggests duplication. Although HGT cannot be ruled out as a factor in the evolution of denitrification genes, our analysis suggests that other phenomena, such gene duplication/divergence and lineage sorting, may have differently influenced the evolution of each denitrification gene.
Shavit Grievink, Liat; Penny, David; Holland, Barbara R.
2013-01-01
Phylogenetic studies based on molecular sequence alignments are expected to become more accurate as the number of sites in the alignments increases. With the advent of genomic-scale data, where alignments have very large numbers of sites, bootstrap values close to 100% and posterior probabilities close to 1 are the norm, suggesting that the number of sites is now seldom a limiting factor on phylogenetic accuracy. This provokes the question, should we be fussy about the sites we choose to include in a genomic-scale phylogenetic analysis? If some sites contain missing data, ambiguous character states, or gaps, then why not just throw them away before conducting the phylogenetic analysis? Indeed, this is exactly the approach taken in many phylogenetic studies. Here, we present an example where the decision on how to treat sites with missing data is of equal importance to decisions on taxon sampling and model choice, and we introduce a graphical method for illustrating this. PMID:23471508
Analysis of the SOS response of Vibrio and other bacteria with multiple chromosomes.
Sanchez-Alberola, Neus; Campoy, Susana; Barbé, Jordi; Erill, Ivan
2012-02-03
The SOS response is a well-known regulatory network present in most bacteria and aimed at addressing DNA damage. It has also been linked extensively to stress-induced mutagenesis, virulence and the emergence and dissemination of antibiotic resistance determinants. Recently, the SOS response has been shown to regulate the activity of integrases in the chromosomal superintegrons of the Vibrionaceae, which encompasses a wide range of pathogenic species harboring multiple chromosomes. Here we combine in silico and in vitro techniques to perform a comparative genomics analysis of the SOS regulon in the Vibrionaceae, and we extend the methodology to map this transcriptional network in other bacterial species harboring multiple chromosomes. Our analysis provides the first comprehensive description of the SOS response in a family (Vibrionaceae) that includes major human pathogens. It also identifies several previously unreported members of the SOS transcriptional network, including two proteins of unknown function. The analysis of the SOS response in other bacterial species with multiple chromosomes uncovers additional regulon members and reveals that there is a conserved core of SOS genes, and that specialized additions to this basic network take place in different phylogenetic groups. Our results also indicate that across all groups the main elements of the SOS response are always found in the large chromosome, whereas specialized additions are found in the smaller chromosomes and plasmids. Our findings confirm that the SOS response of the Vibrionaceae is strongly linked with pathogenicity and dissemination of antibiotic resistance, and suggest that the characterization of the newly identified members of this regulon could provide key insights into the pathogenesis of Vibrio. The persistent location of key SOS genes in the large chromosome across several bacterial groups confirms that the SOS response plays an essential role in these organisms and sheds light into the mechanisms of evolution of global transcriptional networks involved in adaptability and rapid response to environmental changes, suggesting that small chromosomes may act as evolutionary test beds for the rewiring of transcriptional networks.
Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice.
Smita, Shuchi; Katiyar, Amit; Chinnusamy, Viswanathan; Pandey, Dev M; Bansal, Kailash C
2015-01-01
MYB transcription factor (TF) is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by "top-down" and "guide-gene" approaches. More than 50% of OsMYBs were strongly correlated under 50 experimental conditions with 51 hub genes via "top-down" approach. Further, clusters were identified using Markov Clustering (MCL). To maximize the clustering performance, parameter evaluation of the MCL inflation score (I) was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by "guide-gene" approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought response in rice. Thus, the co-regulatory network analysis facilitated the identification of complex OsMYB regulatory networks, and candidate target regulon genes of selected guide MYB genes. The results contribute to the candidate gene screening, and experimentally testable hypotheses for potential regulatory MYB TFs, and their targets under stress conditions.
Taxonomic review of Argentine mackerel Scomber japonicus (Houttuyn, 1782) by phylogenetic analysis
Trucco, María Inés; Buratti, Claudio César
2017-01-01
Taxonomically, Argentine mackerels were first considered as Scomber japonicus marplatensis and later as Scomber japonicus Houttuyn 1782, although, in the last years, different studies have suggested that South Atlantic mackerel species belongs to Scomber colias Gmelin 1789. These latter results, incorporated in the main fish databases (FishBase and Catalog of Fishes), promoted a phylogenetic study using cytochrome c oxidase I (COI) gene sequences taken from the Barcode of Life (FISH-BOL) database. Thus, 76 sequences of S. japonicus, S. colias, S. australasicus and S. scombrus from different regions were used; including 3 from Sarda sarda as outgroup. Among S. japonicus selected sequences are those corresponding to the Argentine mackerels collected in 2007. Phylogenetic trees were obtained by neighbor joining and maximum likelihood methods and a network of haplotypes was reconstructed to analyze the relationship between species. The results showed the clear differentiation of S. australasicus, S. scombrus and S. japonicus from the Pacific while S. japonicus from Argentina was included in the S. colias group, with genetic differences corresponding to conspecific populations (0.1%). Four of the five Argentine specimens shared the same haplotype with S. colias, and none were shared with S. japonicus from the Pacific. These results suggest that the current specific name of Argentine mackerel S. japonicus should be changed to S. colias, in agreement with several genetic studies carried out with species of the genus Scomber. PMID:29071283
Prangishvili, David
2016-01-01
ABSTRACT Archaea and particularly hyperthermophilic crenarchaea are hosts to many unusual viruses with diverse virion shapes and distinct gene compositions. As is typical of viruses in general, there are no universal genes in the archaeal virosphere. Therefore, to obtain a comprehensive picture of the evolutionary relationships between viruses, network analysis methods are more productive than traditional phylogenetic approaches. Here we present a comprehensive comparative analysis of genomes and proteomes from all currently known taxonomically classified and unclassified, cultivated and uncultivated archaeal viruses. We constructed a bipartite network of archaeal viruses that includes two classes of nodes, the genomes and gene families that connect them. Dissection of this network using formal community detection methods reveals strong modularity, with 10 distinct modules and 3 putative supermodules. However, compared to similar previously analyzed networks of eukaryotic and bacterial viruses, the archaeal virus network is sparsely connected. With the exception of the tailed viruses related to bacteriophages of the order Caudovirales and the families Turriviridae and Sphaerolipoviridae that are linked to a distinct supermodule of eukaryotic and bacterial viruses, there are few connector genes shared by different archaeal virus modules. In contrast, most of these modules include, in addition to viruses, capsidless mobile elements, emphasizing tight evolutionary connections between the two types of entities in archaea. The relative contributions of distinct evolutionary origins, in particular from nonviral elements, and insufficient sampling to the sparsity of the archaeal virus network remain to be determined by further exploration of the archaeal virosphere. IMPORTANCE Viruses infecting archaea are among the most mysterious denizens of the virosphere. Many of these viruses display no genetic or even morphological relationship to viruses of bacteria and eukaryotes, raising questions regarding their origins and position in the global virosphere. Analysis of 5,740 protein sequences from 116 genomes allowed dissection of the archaeal virus network and showed that most groups of archaeal viruses are evolutionarily connected to capsidless mobile genetic elements, including various plasmids and transposons. This finding could reflect actual independent origins of the distinct groups of archaeal viruses from different nonviral elements, providing important insights into the emergence and evolution of the archaeal virome. PMID:27681128
Beckett, Stephen J.; Williams, Hywel T. P.
2013-01-01
Phage and their bacterial hosts are the most diverse and abundant biological entities in the oceans, where their interactions have a major impact on marine ecology and ecosystem function. The structure of interaction networks for natural phage–bacteria communities offers insight into their coevolutionary origin. At small phylogenetic scales, observed communities typically show a nested structure, in which both hosts and phages can be ranked by their range of resistance and infectivity, respectively. A qualitatively different multi-scale structure is seen at larger phylogenetic scales; a natural assemblage sampled from the Atlantic Ocean displays large-scale modularity and local nestedness within each module. Here, we show that such ‘nested-modular’ interaction networks can be produced by a simple model of host–phage coevolution in which infection depends on genetic matching. Negative frequency-dependent selection causes diversification of hosts (to escape phages) and phages (to track their evolving hosts). This creates a diverse community of bacteria and phage, maintained by kill-the-winner ecological dynamics. When the resulting communities are visualized as bipartite networks of who infects whom, they show the nested-modular structure characteristic of the Atlantic sample. The statistical significance and strength of this observation varies depending on whether the interaction networks take into account the density of the interacting strains, with implications for interpretation of interaction networks constructed by different methods. Our results suggest that the apparently complex community structures associated with marine bacteria and phage may arise from relatively simple coevolutionary origins. PMID:24516719
Brenner, Bluma G.; Ibanescu, Ruxandra-Ilinca; Hardy, Isabelle; Roger, Michel
2017-01-01
HIV continues to spread among vulnerable heterosexual (HET), Men-having-Sex with Men (MSM) and intravenous drug user (IDU) populations, influenced by a complex array of biological, behavioral and societal factors. Phylogenetics analyses of large sequence datasets from national drug resistance testing programs reveal the evolutionary interrelationships of viral strains implicated in the dynamic spread of HIV in different regional settings. Viral phylogenetics can be combined with demographic and behavioral information to gain insights on epidemiological processes shaping transmission networks at the population-level. Drug resistance testing programs also reveal emergent mutational pathways leading to resistance to the 23 antiretroviral drugs used in HIV-1 management in low-, middle- and high-income settings. This article describes how genotypic and phylogenetic information from Quebec and elsewhere provide critical information on HIV transmission and resistance, Cumulative findings can be used to optimize public health strategies to tackle the challenges of HIV in “real-world” settings. PMID:29283390
REFGEN and TREENAMER: Automated Sequence Data Handling for Phylogenetic Analysis in the Genomic Era
Leonard, Guy; Stevens, Jamie R.; Richards, Thomas A.
2009-01-01
The phylogenetic analysis of nucleotide sequences and increasingly that of amino acid sequences is used to address a number of biological questions. Access to extensive datasets, including numerous genome projects, means that standard phylogenetic analyses can include many hundreds of sequences. Unfortunately, most phylogenetic analysis programs do not tolerate the sequence naming conventions of genome databases. Managing large numbers of sequences and standardizing sequence labels for use in phylogenetic analysis programs can be a time consuming and laborious task. Here we report the availability of an online resource for the management of gene sequences recovered from public access genome databases such as GenBank. These web utilities include the facility for renaming every sequence in a FASTA alignment file, with each sequence label derived from a user-defined combination of the species name and/or database accession number. This facility enables the user to keep track of the branching order of the sequences/taxa during multiple tree calculations and re-optimisations. Post phylogenetic analysis, these webpages can then be used to rename every label in the subsequent tree files (with a user-defined combination of species name and/or database accession number). Together these programs drastically reduce the time required for managing sequence alignments and labelling phylogenetic figures. Additional features of our platform include the automatic removal of identical accession numbers (recorded in the report file) and generation of species and accession number lists for use in supplementary materials or figure legends. PMID:19812722
Yan, Yan; Wang, Lianzhe; Ding, Zehong; Tie, Weiwei; Ding, Xupo; Zeng, Changying; Wei, Yunxie; Zhao, Hongliang; Peng, Ming; Hu, Wei
2016-01-01
Mitogen-activated protein kinases (MAPKs) play central roles in plant developmental processes, hormone signaling transduction, and responses to abiotic stress. However, no data are currently available about the MAPK family in cassava, an important tropical crop. Herein, 21 MeMAPK genes were identified from cassava. Phylogenetic analysis indicated that MeMAPKs could be classified into four subfamilies. Gene structure analysis demonstrated that the number of introns in MeMAPK genes ranged from 1 to 10, suggesting large variation among cassava MAPK genes. Conserved motif analysis indicated that all MeMAPKs had typical protein kinase domains. Transcriptomic analysis suggested that MeMAPK genes showed differential expression patterns in distinct tissues and in response to drought stress between wild subspecies and cultivated varieties. Interaction networks and co-expression analyses revealed that crucial pathways controlled by MeMAPK networks may be involved in the differential response to drought stress in different accessions of cassava. Expression of nine selected MAPK genes showed that these genes could comprehensively respond to osmotic, salt, cold, oxidative stressors, and abscisic acid (ABA) signaling. These findings yield new insights into the transcriptional control of MAPK gene expression, provide an improved understanding of abiotic stress responses and signaling transduction in cassava, and lead to potential applications in the genetic improvement of cassava cultivars. PMID:27625666
Phylogenetic Tools for Generalized HIV-1 Epidemics: Findings from the PANGEA-HIV Methods Comparison
Ratmann, Oliver; Hodcroft, Emma B.; Pickles, Michael; Cori, Anne; Hall, Matthew; Lycett, Samantha; Colijn, Caroline; Dearlove, Bethany; Didelot, Xavier; Frost, Simon; Hossain, A.S. Md Mukarram; Joy, Jeffrey B.; Kendall, Michelle; Kühnert, Denise; Leventhal, Gabriel E.; Liang, Richard; Plazzotta, Giacomo; Poon, Art F.Y.; Rasmussen, David A.; Stadler, Tanja; Volz, Erik; Weis, Caroline; Leigh Brown, Andrew J.; Fraser, Christophe
2017-01-01
Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods’ development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention. PMID:28053012
Requeno, José Ignacio; Colom, José Manuel
2014-12-01
Model checking is a generic verification technique that allows the phylogeneticist to focus on models and specifications instead of on implementation issues. Phylogenetic trees are considered as transition systems over which we interrogate phylogenetic questions written as formulas of temporal logic. Nonetheless, standard logics become insufficient for certain practices of phylogenetic analysis since they do not allow the inclusion of explicit time and probabilities. The aim of this paper is to extend the application of model checking techniques beyond qualitative phylogenetic properties and adapt the existing logical extensions and tools to the field of phylogeny. The introduction of time and probabilities in phylogenetic specifications is motivated by the study of a real example: the analysis of the ratio of lactose intolerance in some populations and the date of appearance of this phenotype.
Requeno, José Ignacio; Colom, José Manuel
2014-10-23
Model checking is a generic verification technique that allows the phylogeneticist to focus on models and specifications instead of on implementation issues. Phylogenetic trees are considered as transition systems over which we interrogate phylogenetic questions written as formulas of temporal logic. Nonetheless, standard logics become insufficient for certain practices of phylogenetic analysis since they do not allow the inclusion of explicit time and probabilities. The aim of this paper is to extend the application of model checking techniques beyond qualitative phylogenetic properties and adapt the existing logical extensions and tools to the field of phylogeny. The introduction of time and probabilities in phylogenetic specifications is motivated by the study of a real example: the analysis of the ratio of lactose intolerance in some populations and the date of appearance of this phenotype.
Grammatical Analysis as a Distributed Neurobiological Function
Bozic, Mirjana; Fonteneau, Elisabeth; Su, Li; Marslen-Wilson, William D
2015-01-01
Language processing engages large-scale functional networks in both hemispheres. Although it is widely accepted that left perisylvian regions have a key role in supporting complex grammatical computations, patient data suggest that some aspects of grammatical processing could be supported bilaterally. We investigated the distribution and the nature of grammatical computations across language processing networks by comparing two types of combinatorial grammatical sequences—inflectionally complex words and minimal phrases—and contrasting them with grammatically simple words. Novel multivariate analyses revealed that they engage a coalition of separable subsystems: inflected forms triggered left-lateralized activation, dissociable into dorsal processes supporting morphophonological parsing and ventral, lexically driven morphosyntactic processes. In contrast, simple phrases activated a consistently bilateral pattern of temporal regions, overlapping with inflectional activations in L middle temporal gyrus. These data confirm the role of the left-lateralized frontotemporal network in supporting complex grammatical computations. Critically, they also point to the capacity of bilateral temporal regions to support simple, linear grammatical computations. This is consistent with a dual neurobiological framework where phylogenetically older bihemispheric systems form part of the network that supports language function in the modern human, and where significant capacities for language comprehension remain intact even following severe left hemisphere damage. PMID:25421880
Motani, Ryosuke; Schmitz, Lars
2011-08-01
Phylogeny is deeply pertinent to evolutionary studies. Traits that perform a body function are expected to be strongly influenced by physical "requirements" of the function. We investigated if such traits exhibit phylogenetic signals, and, if so, how phylogenetic noises bias quantification of form-function relationships. A form-function system that is strongly influenced by physics, namely the relationship between eye morphology and visual optics in amniotes, was used. We quantified the correlation between form (i.e., eye morphology) and function (i.e., ocular optics) while varying the level of phylogenetic bias removal through adjusting Pagel's λ. Ocular soft-tissue dimensions exhibited the highest correlation with ocular optics when 1% of phylogenetic bias expected from Brownian motion was removed (i.e., λ= 0.01); the value for hard-tissue data were 8%. A small degree of phylogenetic bias therefore exists in morphology despite of the stringent functional constraints. We also devised a phylogenetically informed discriminant analysis and recorded the effects of phylogenetic bias on this method using the same data. Use of proper λ values during phylogenetic bias removal improved misidentification rates in resulting classifications when prior probabilities were assumed to be equal. Even a small degree of phylogenetic bias affected the classification resulting from phylogenetically informed discriminant analysis. © 2011 The Author(s). Evolution© 2011 The Society for the Study of Evolution.
Verma, Amit K; Diwan, Danish; Raut, Sandeep; Dobriyal, Neha; Brown, Rebecca E; Gowda, Vinita; Hines, Justin K; Sahi, Chandan
2017-06-07
Heat shock proteins of 70 kDa (Hsp70s) partner with structurally diverse Hsp40s (J proteins), generating distinct chaperone networks in various cellular compartments that perform myriad housekeeping and stress-associated functions in all organisms. Plants, being sessile, need to constantly maintain their cellular proteostasis in response to external environmental cues. In these situations, the Hsp70:J protein machines may play an important role in fine-tuning cellular protein quality control. Although ubiquitous, the functional specificity and complexity of the plant Hsp70:J protein network has not been studied. Here, we analyzed the J protein network in the cytosol of Arabidopsis thaliana and, using yeast genetics, show that the functional specificities of most plant J proteins in fundamental chaperone functions are conserved across long evolutionary timescales. Detailed phylogenetic and functional analysis revealed that increased number, regulatory differences, and neofunctionalization in J proteins together contribute to the emerging functional diversity and complexity in the Hsp70:J protein network in higher plants. Based on the data presented, we propose that higher plants have orchestrated their "chaperome," especially their J protein complement, according to their specialized cellular and physiological stipulations. Copyright © 2017 Verma et al.
Visualizing Phylogenetic Treespace Using Cartographic Projections
NASA Astrophysics Data System (ADS)
Sundberg, Kenneth; Clement, Mark; Snell, Quinn
Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-Hard problem. The size of the data sets which can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could lead to the feasible analysis of more data sets. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger datasets.
Li, Tao; Zhang, Min; Qu, Yanhua; Ren, Zhumei; Zhang, Jianzhen; Guo, Yaping; Heong, K L; Villareal, Bong; Zhong, Yang; Ma, Enbo
2011-04-01
The rice grasshopper, Oxya hyla intricata, is a rice pest in Southeast Asia. In this study, population genetic diversity and structure of this Oxya species was examined using both DNA sequences and AFLP technology. The samples of 12 populations were collected from four Southeast Asian countries, among which 175 individuals were analysed using mitochondrial DNA cytochrome c oxidase subunit I (COI) sequences, and 232 individuals were examined using amplified fragment length polymorphisms (AFLP) to test whether the phylogeographical pattern and population genetics of this species are related to past geological events and/or climatic oscillations. No obvious trend of genetic diversity was found along a latitude/longitude gradient among different geographical groups. Phylogenetic analysis indicated three deep monophyletic clades that approximately correspond to three geographical regions separated by high mountains and a deep strait, and TCS analysis also revealed three disconnected networks, suggesting that spatial and temporal separations by vicariance, which were also supported by AMOVA as a source of the molecular variance presented among groups. Gene flow analysis showed that there had been frequent historical gene flow among local populations in different regions, but the networks exhibited no shared haplotype among populations. In conclusion, the past geological events and climatic fluctuations are the most important factor on the phylogeographical structure and genetic patterns of O. hyla intricata in Southeast Asia. Habitat, vegetation, and anthropogenic effect may also contribute to gene flow and introgression of this species. Moreover, temperature, abundant rainfall and a diversity of graminaceous species are beneficial for the migration of O. hyla intricata. High haplotype diversity, deep phylogenetic division, negative Fu's F (s) values and unimodal and multimodal distribution shapes all suggest a complicated demographic expansion pattern of these O. hyla intricata populations, which might have been caused by climatic oscillations during glacial periods in the Quaternary.
Intercontinental Spread of Asian-Origin H5N8 to North America through Beringia by Migratory Birds.
Lee, Dong-Hun; Torchetti, Mia Kim; Winker, Kevin; Ip, Hon S; Song, Chang-Seon; Swayne, David E
2015-06-01
Phylogenetic network analysis and understanding of waterfowl migration patterns suggest that the Eurasian H5N8 clade 2.3.4.4 avian influenza virus emerged in late 2013 in China, spread in early 2014 to South Korea and Japan, and reached Siberia and Beringia by summer 2014 via migratory birds. Three genetically distinct subgroups emerged and subsequently spread along different flyways during fall 2014 into Europe, North America, and East Asia, respectively. All three subgroups reappeared in Japan, a wintering site for waterfowl from Eurasia and parts of North America. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Seed size, fecundity and postfire regeneration strategy are interdependent in Hakea.
El-ahmir, Sh-hoob Mohamed; Lim, Sim Lin; Lamont, Byron B; He, Tianhua
2015-01-01
Seed size is a key functional trait that affects plant fitness at the seedling stage and may vary greatly with species fruit size, growth form and fecundity. Using structural equation modelling (SEM) and correlated trait evolution analysis, we investigated the interaction network between seed size and fecundity, postfire regeneration strategy, fruit size, plant height and serotiny (on-plant seed storage) among 82 species of the woody shrub genus, Hakea, with a wide spectrum of seed sizes (2-500 mg). Seed size is negatively correlated with fecundity, while fire-killed species (nonsprouters) produce more seeds than resprouters though they are of similar size. Seed size is unrelated to plant height and level of serotiny while it scales allometrically with fruit size. A strong phylogenetic signal in seed size revealed phylogenetic constraints on seed size variation in Hakea. Our analyses suggest a causal relationship between seed size, fecundity and postfire regeneration strategy in Hakea. These results demonstrate that fruit size, fecundity and evolutionary history have had most control over seed size variation among Hakea species.
Nakano, Takafumi; Ihara, Yoh; Kumasaki, Yusuke; Baba, Yuki G; Tomikawa, Ko
2017-08-01
The systematic status of geographical variants of Arcuphantes hibanus Saito, 1992 belonging to the A. longiscapus species group, indigenous to western Honshu and Shikoku, Japan, was evaluated using morphological and molecular data. Two species, A. enmusubi Ihara, Nakano and Tomikawa, sp. nov. and A. occidentalis Ihara, Nakano and Tomikawa, sp. nov., are described, and A. hibanus is redescribed with redefinition of its taxonomic status. These three species are diagnosed by the characteristics of paracymbium, pseudolamella, and epigynal basal part. Phylogenetic trees obtained with mitochondrial cytochrome c oxidase subunit I and 16S rRNA markers showed that the variants are mutually genetically highly diverged. However, the mtDNA phylogenies failed to recover the monophyly of A. hibanus redefined herein. Contrary to the mtDNA phylogenetic analyses, a neighbor-network analysis of nuclear internal transcribed spacer 1 sequences of A. hibanus, A. enmusubi and A. occidentalis spiders showed that each of them forms a cluster. The results of mitochondrial and nuclear DNA analyses in each of the three species are briefly discussed, along with their taxonomic identities.
Seed Size, Fecundity and Postfire Regeneration Strategy Are Interdependent in Hakea
El-ahmir, Sh-hoob Mohamed; Lim, Sim Lin; Lamont, Byron B.; He, Tianhua
2015-01-01
Seed size is a key functional trait that affects plant fitness at the seedling stage and may vary greatly with species fruit size, growth form and fecundity. Using structural equation modelling (SEM) and correlated trait evolution analysis, we investigated the interaction network between seed size and fecundity, postfire regeneration strategy, fruit size, plant height and serotiny (on-plant seed storage) among 82 species of the woody shrub genus, Hakea, with a wide spectrum of seed sizes (2–500 mg). Seed size is negatively correlated with fecundity, while fire-killed species (nonsprouters) produce more seeds than resprouters though they are of similar size. Seed size is unrelated to plant height and level of serotiny while it scales allometrically with fruit size. A strong phylogenetic signal in seed size revealed phylogenetic constraints on seed size variation in Hakea. Our analyses suggest a causal relationship between seed size, fecundity and postfire regeneration strategy in Hakea. These results demonstrate that fruit size, fecundity and evolutionary history have had most control over seed size variation among Hakea species. PMID:26035821
Harlin-Cognato, April D; Honeycutt, Rodney L
2006-01-01
Background Dolphins of the genus Lagenorhynchus are anti-tropically distributed in temperate to cool waters. Phylogenetic analyses of cytochrome b sequences have suggested that the genus is polyphyletic; however, many relationships were poorly resolved. In this study, we present a combined-analysis phylogenetic hypothesis for Lagenorhynchus and members of the subfamily Lissodelphininae, which is derived from two nuclear and two mitochondrial data sets and the addition of 34 individuals representing 9 species. In addition, we characterize with parsimony and Bayesian analyses the phylogenetic utility and interaction of characters with statistical measures, including the utility of highly consistent (non-homoplasious) characters as a conservative measure of phylogenetic robustness. We also explore the effects of removing sources of character conflict on phylogenetic resolution. Results Overall, our study provides strong support for the monophyly of the subfamily Lissodelphininae and the polyphyly of the genus Lagenorhynchus. In addition, the simultaneous parsimony analysis resolved and/or improved resolution for 12 nodes including: (1) L. albirostris, L. acutus; (2) L. obscurus and L. obliquidens; and (3) L. cruciger and L. australis. In addition, the Bayesian analysis supported the monophyly of the Cephalorhynchus, and resolved ambiguities regarding the relationship of L. australis/L. cruciger to other members of the genus Lagenorhynchus. The frequency of highly consistent characters varied among data partitions, but the rate of evolution was consistent within data partitions. Although the control region was the greatest source of character conflict, removal of this data partition impeded phylogenetic resolution. Conclusion The simultaneous analysis approach produced a more robust phylogenetic hypothesis for Lagenorhynchus than previous studies, thus supporting a phylogenetic approach employing multiple data partitions that vary in overall rate of evolution. Even in cases where there was apparent conflict among characters, our data suggest a synergistic interaction in the simultaneous analysis, and speak against a priori exclusion of data because of potential conflicts, primarily because phylogenetic results can be less robust. For example, the removal of the control region, the putative source of character conflict, produced spurious results with inconsistencies among and within topologies from parsimony and Bayesian analyses. PMID:17078887
Maximizing the phylogenetic diversity of seed banks.
Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard
2015-04-01
Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. © 2014 Society for Conservation Biology.
A methodological investigation of hominoid craniodental morphology and phylogenetics.
Bjarnason, Alexander; Chamberlain, Andrew T; Lockwood, Charles A
2011-01-01
The evolutionary relationships of extant great apes and humans have been largely resolved by molecular studies, yet morphology-based phylogenetic analyses continue to provide conflicting results. In order to further investigate this discrepancy we present bootstrap clade support of morphological data based on two quantitative datasets, one dataset consisting of linear measurements of the whole skull from 5 hominoid genera and the second dataset consisting of 3D landmark data from the temporal bone of 5 hominoid genera, including 11 sub-species. Using similar protocols for both datasets, we were able to 1) compare distance-based phylogenetic methods to cladistic parsimony of quantitative data converted into discrete character states, 2) vary outgroup choice to observe its effect on phylogenetic inference, and 3) analyse male and female data separately to observe the effect of sexual dimorphism on phylogenies. Phylogenetic analysis was sensitive to methodological decisions, particularly outgroup selection, where designation of Pongo as an outgroup and removal of Hylobates resulted in greater congruence with the proposed molecular phylogeny. The performance of distance-based methods also justifies their use in phylogenetic analysis of morphological data. It is clear from our analyses that hominoid phylogenetics ought not to be used as an example of conflict between the morphological and molecular, but as an example of how outgroup and methodological choices can affect the outcome of phylogenetic analysis. Copyright © 2010 Elsevier Ltd. All rights reserved.
Li, Ying Hong; Wang, Pan Pan; Li, Xiao Xu; Yu, Chun Yan; Yang, Hong; Zhou, Jin; Xue, Wei Wei; Tan, Jun; Zhu, Feng
2016-01-01
The human kinome is one of the most productive classes of drug target, and there is emerging necessity for treating complex diseases by means of polypharmacology (multi-target drugs and combination products). However, the advantages of the multi-target drugs and the combination products are still under debate. A comparative analysis between FDA approved multi-target drugs and combination products, targeting the human kinome, was conducted by mapping targets onto the phylogenetic tree of the human kinome. The approach of network medicine illustrating the drug-target interactions was applied to identify popular targets of multi-target drugs and combination products. As identified, the multi-target drugs tended to inhibit target pairs in the human kinome, especially the receptor tyrosine kinase family, while the combination products were able to against targets of distant homology relationship. This finding asked for choosing the combination products as a better solution for designing drugs aiming at targets of distant homology relationship. Moreover, sub-networks of drug-target interactions in specific disease were generated, and mechanisms shared by multi-target drugs and combination products were identified. In conclusion, this study performed an analysis between approved multi-target drugs and combination products against the human kinome, which could assist the discovery of next generation polypharmacology.
The pangenome of the genus Clostridium.
Udaondo, Zulema; Duque, Estrella; Ramos, Juan-Luis
2017-07-01
The pangenome for the genus Clostridium sensu stricto, which was obtained using highly curated and annotated genomes from 16 species is presented; some of these cause disease, while others are used for the production of added-value chemicals. Multilocus sequencing analysis revealed that species of this genus group into at least two clades that include non-pathogenic and pathogenic strains, suggesting that pathogenicity is dispersed across the phylogenetic tree. The core genome of the genus includes 546 protein families, which mainly comprise those involved in protein translation and DNA repair. The GS-GOGAT may represent the central pathway for generating organic nitrogen from inorganic nitrogen sources. Glycerol and glucose metabolism genes are well represented in the core genome together with a set of energy conservation systems. A metabolic network comprising proteins/enzymes, RNAs and metabolites, whose topological structure is a non-random and scale-free network with hierarchically structured modules was built. These modules shed light on the interactions between RNAs, proteins and metabolites, revealing biological features of transcription and translation, cell wall biosynthesis, C1 metabolism and N metabolism. Network analysis identified four nodes that function as hubs and bottlenecks, namely, coenzyme A, HPr kinases, S-adenosylmethionine and the ribonuclease P-protein, suggesting pivotal roles for them in Clostridium. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
The Evolutionary Ecology of Plant Disease: A Phylogenetic Perspective.
Gilbert, Gregory S; Parker, Ingrid M
2016-08-04
An explicit phylogenetic perspective provides useful tools for phytopathology and plant disease ecology because the traits of both plants and microbes are shaped by their evolutionary histories. We present brief primers on phylogenetic signal and the analytical tools of phylogenetic ecology. We review the literature and find abundant evidence of phylogenetic signal in pathogens and plants for most traits involved in disease interactions. Plant nonhost resistance mechanisms and pathogen housekeeping functions are conserved at deeper phylogenetic levels, whereas molecular traits associated with rapid coevolutionary dynamics are more labile at branch tips. Horizontal gene transfer disrupts the phylogenetic signal for some microbial traits. Emergent traits, such as host range and disease severity, show clear phylogenetic signals. Therefore pathogen spread and disease impact are influenced by the phylogenetic structure of host assemblages. Phylogenetically rare species escape disease pressure. Phylogenetic tools could be used to develop predictive tools for phytosanitary risk analysis and reduce disease pressure in multispecies cropping systems.
Kumar, Nitin; Miyajima, Fabio; He, Miao; Roberts, Paul; Swale, Andrew; Ellison, Louise; Pickard, Derek; Smith, Godfrey; Molyneux, Rebecca; Dougan, Gordon; Parkhill, Julian; Wren, Brendan W; Parry, Christopher M; Pirmohamed, Munir; Lawley, Trevor D
2016-03-15
Accurate tracking of Clostridium difficile transmission within healthcare settings is key to its containment but is hindered by the lack of discriminatory power of standard genotyping methods. We describe a whole-genome phylogenetic-based method to track the transmission of individual clones in infected hospital patients from the epidemic C. difficile 027/ST1 lineage, and to distinguish between the 2 causes of recurrent disease, relapse (same strain), or reinfection (different strain). We monitored patients with C. difficile infection in a UK hospital over a 2-year period. We performed whole-genome sequencing and phylogenetic analysis of 108 strains isolated from symptomatic patients. High-resolution phylogeny was integrated with in-hospital transfers and contact data to create an infection network linking individual patients and specific hospital wards. Epidemic C. difficile 027/ST1 caused the majority of infections during our sampling period. Integration of whole-genome single nucleotide polymorphism (SNP) phylogenetic analysis, which accurately discriminated between 27 distinct SNP genotypes, with patient movement and contact data identified 32 plausible transmission events, including ward-based contamination (66%) or direct donor-recipient contact (34%). Highly contagious donors were identified who contributed to the persistence of clones within distinct hospital wards and the spread of clones between wards, especially in areas of intense turnover. Recurrent cases were identified between 4 and 26 weeks, highlighting the limitation of the standard <8-week cutoff used for patient diagnosis and management. Genome-based infection tracking to monitor the persistence and spread of C. difficile within healthcare facilities could inform infection control and patient management. © The Author 2015. Published by Oxford University Press for the Infectious Diseases Society of America.
Spatial Structure of Evolutionary Models of Dialects in Contact
Murawaki, Yugo
2015-01-01
Phylogenetic models, originally developed to demonstrate evolutionary biology, have been applied to a wide range of cultural data including natural language lexicons, manuscripts, folktales, material cultures, and religions. A fundamental question regarding the application of phylogenetic inference is whether trees are an appropriate approximation of cultural evolutionary history. Their validity in cultural applications has been scrutinized, particularly with respect to the lexicons of dialects in contact. Phylogenetic models organize evolutionary data into a series of branching events through time. However, branching events are typically not included in dialectological studies to interpret the distributions of lexical terms. Instead, dialectologists have offered spatial interpretations to represent lexical data. For example, new lexical items that emerge in a politico-cultural center are likely to spread to peripheries, but not vice versa. To explore the question of the tree model’s validity, we present a simple simulation model in which dialects form a spatial network and share lexical items through contact rather than through common ancestors. We input several network topologies to the model to generate synthetic data. We then analyze the synthesized data using conventional phylogenetic techniques. We found that a group of dialects can be considered tree-like even if it has not evolved in a temporally tree-like manner but has a temporally invariant, spatially tree-like structure. In addition, the simulation experiments appear to reproduce unnatural results observed in reconstructed trees for real data. These results motivate further investigation into the spatial structure of the evolutionary history of dialect lexicons as well as other cultural characteristics. PMID:26221958
Relationships among genera of the Saccharomycotina from multigene sequence analysis
USDA-ARS?s Scientific Manuscript database
Most known species of the subphylum Saccharomycotina (budding ascomycetous yeasts) have now been placed in phylogenetically defined clades following multigene sequence analysis. Terminal clades, which are usually well supported from bootstrap analysis, are viewed as phylogenetically circumscribed ge...
Phylogenetic inference under varying proportions of indel-induced alignment gaps
Dwivedi, Bhakti; Gadagkar, Sudhindra R
2009-01-01
Background The effect of alignment gaps on phylogenetic accuracy has been the subject of numerous studies. In this study, we investigated the relationship between the total number of gapped sites and phylogenetic accuracy, when the gaps were introduced (by means of computer simulation) to reflect indel (insertion/deletion) events during the evolution of DNA sequences. The resulting (true) alignments were subjected to commonly used gap treatment and phylogenetic inference methods. Results (1) In general, there was a strong – almost deterministic – relationship between the amount of gap in the data and the level of phylogenetic accuracy when the alignments were very "gappy", (2) gaps resulting from deletions (as opposed to insertions) contributed more to the inaccuracy of phylogenetic inference, (3) the probabilistic methods (Bayesian, PhyML & "MLε, " a method implemented in DNAML in PHYLIP) performed better at most levels of gap percentage when compared to parsimony (MP) and distance (NJ) methods, with Bayesian analysis being clearly the best, (4) methods that treat gapped sites as missing data yielded less accurate trees when compared to those that attribute phylogenetic signal to the gapped sites (by coding them as binary character data – presence/absence, or as in the MLε method), and (5) in general, the accuracy of phylogenetic inference depended upon the amount of available data when the gaps resulted from mainly deletion events, and the amount of missing data when insertion events were equally likely to have caused the alignment gaps. Conclusion When gaps in an alignment are a consequence of indel events in the evolution of the sequences, the accuracy of phylogenetic analysis is likely to improve if: (1) alignment gaps are categorized as arising from insertion events or deletion events and then treated separately in the analysis, (2) the evolutionary signal provided by indels is harnessed in the phylogenetic analysis, and (3) methods that utilize the phylogenetic signal in indels are developed for distance methods too. When the true homology is known and the amount of gaps is 20 percent of the alignment length or less, the methods used in this study are likely to yield trees with 90–100 percent accuracy. PMID:19698168
Inferring influenza global transmission networks without complete phylogenetic information
Aris-Brosou, Stéphane
2014-01-01
Influenza is one of the most severe respiratory infections affecting humans throughout the world, yet the dynamics of its global transmission network are still contentious. Here, I describe a novel combination of phylogenetics, time series, and graph theory to analyze 14.25 years of data stratified in space and in time, focusing on the main target of the human immune response, the hemagglutinin gene. While bypassing the complete phylogenetic inference of huge data sets, the method still extracts information suggesting that waves of genetic or of nucleotide diversity circulate continuously around the globe for subtypes that undergo sustained transmission over several seasons, such as H3N2 and pandemic H1N1/09, while diversity of prepandemic H1N1 viruses had until 2009 a noncontinuous transmission pattern consistent with a source/sink model. Irrespective of the shift in the structure of H1N1 diversity circulation with the emergence of the pandemic H1N1/09 strain, US prevalence peaks during the winter months when genetic diversity is at its lowest. This suggests that a dominant strain is generally responsible for epidemics and that monitoring genetic and/or nucleotide diversity in real time could provide public health agencies with an indirect estimate of prevalence. PMID:24665342
Khan, Faheem Ahmed; Liu, Hui; Zhou, Hao; Wang, Kai; Qamar, Muhammad Tahir Ul; Pandupuspitasari, Nuruliarizki Shinta; Shujun, Zhang
2017-01-01
The biology of sperm, its capability of fertilizing an egg and its role in sex ratio are the major biological questions in reproductive biology. To answer these question we integrated X and Y chromosome transcriptome across different species: Bos taurus and Sus scrofa and identified reproductive driver genes based on Weighted Gene Co-Expression Network Analysis (WGCNA) algorithm. Our strategy resulted in 11007 and 10445 unique genes consisting of 9 and 11 reproductive modules in Bos taurus and Sus scrofa, respectively. The consensus module calculation yields an overall 167 overlapped genes which were mapped to 846 DEGs in Bos taurus to finally get a list of 67 dual feature genes. We develop gene co-expression network of selected 67 genes that consists of 58 nodes (27 down-regulated and 31 up-regulated genes) enriched to 66 GO biological process (BP) including 6 GO annotations related to reproduction and two KEGG pathways. Moreover, we searched significantly related TF (ISRE, AP1FJ, RP58, CREL) and miRNAs (bta-miR-181a, bta-miR-17-5p, bta-miR-146b, bta-miR-146a) which targeted the genes in co-expression network. In addition we performed genetic analysis including phylogenetic, functional domain identification, epigenetic modifications, mutation analysis of the most important reproductive driver genes PRM1, PPP2R2B and PAFAH1B1 and finally performed a protein docking analysis to visualize their therapeutic and gene expression regulation ability. PMID:28903352
Investigation on maternal lineage of a Neolithic group from northern Shaanxi based on ancient DNA.
Zhao, Jing; Liu, Fang-E; Lin, Song; Liu, Zhi-Zhen; Sun, Zhou-Yong; Wu, Xiao-Ming; Zhang, Hu-Qin
2017-09-01
A magnetic bead purification method was successfully used to extract ancient DNA from the skeletal remains of 10 specimens excavated from Wuzhuangguoliang (Wzhgl) site, which was located in northern Shaanxi. The multidimensional scaling (MDS) and analysis of molecular variance approach (AMOVA) revealed that ancient Wzhgl people bored a very high similarity to southern Han Chinese. By constructing the MJ-network of various modern people including Han Chinese and Japanese, the phylogenetic analysis indicated that the Wzhgl population had close maternal distance with ancient Shandong and Xinjiang people. These findings indicated that Wzhgl contributed to the gene pool of Han Chinese and modern Japanese. In addition, population migration and interflow between Wzhgl people and ancient Shandong or Xinjiang probably occurred in Neolithic period.
Vaux, Felix; Trewick, Steven A; Crampton, James S; Marshall, Bruce A; Beu, Alan G; Hills, Simon F K; Morgan-Richards, Mary
2018-06-15
The relationship between morphology and inheritance is of perennial interest in evolutionary biology and palaeontology. Using three marine snail genera Penion, Antarctoneptunea and Kelletia, we investigate whether systematics based on shell morphology accurately reflect evolutionary lineages indicated by molecular phylogenetics. Members of these gastropod genera have been a taxonomic challenge due to substantial variation in shell morphology, conservative radular and soft tissue morphology, few known ecological differences, and geographical overlap between numerous species. Sampling all sixteen putative taxa identified across the three genera, we infer mitochondrial and nuclear ribosomal DNA phylogenetic relationships within the group, and compare this to variation in adult shell shape and size. Results of phylogenetic analysis indicate that each genus is monophyletic, although the status of some phylogenetically derived and likely more recently evolved taxa within Penion is uncertain. The recently described species P. lineatus is supported by genetic evidence. Morphology, captured using geometric morphometric analysis, distinguishes the genera and matches the molecular phylogeny, although using the same dataset, species and phylogenetic subclades are not identified with high accuracy. Overall, despite abundant variation, we find that shell morphology accurately reflects genus-level classification and the corresponding deep phylogenetic splits identified in this group of marine snails. Copyright © 2018 Elsevier Inc. All rights reserved.
On the use of cartographic projections in visualizing phylo-genetic tree space
2010-01-01
Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-Hard problem. The size of the data sets which can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could lead to the feasible analysis of more data sets. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger data sets. PMID:20529355
Phylogenetic Tools for Generalized HIV-1 Epidemics: Findings from the PANGEA-HIV Methods Comparison.
Ratmann, Oliver; Hodcroft, Emma B; Pickles, Michael; Cori, Anne; Hall, Matthew; Lycett, Samantha; Colijn, Caroline; Dearlove, Bethany; Didelot, Xavier; Frost, Simon; Hossain, A S Md Mukarram; Joy, Jeffrey B; Kendall, Michelle; Kühnert, Denise; Leventhal, Gabriel E; Liang, Richard; Plazzotta, Giacomo; Poon, Art F Y; Rasmussen, David A; Stadler, Tanja; Volz, Erik; Weis, Caroline; Leigh Brown, Andrew J; Fraser, Christophe
2017-01-01
Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods' development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Comparative Analysis of Begonia Plastid Genomes and Their Utility for Species-Level Phylogenetics
Harrison, Nicola; Harrison, Richard J.
2016-01-01
Recent, rapid radiations make species-level phylogenetics difficult to resolve. We used a multiplexed, high-throughput sequencing approach to identify informative genomic regions to resolve phylogenetic relationships at low taxonomic levels in Begonia from a survey of sixteen species. A long-range PCR method was used to generate draft plastid genomes to provide a strong phylogenetic backbone, identify fast evolving regions and provide informative molecular markers for species-level phylogenetic studies in Begonia. PMID:27058864
Inference of Transmission Network Structure from HIV Phylogenetic Trees
Giardina, Federica; Romero-Severson, Ethan Obie; Albert, Jan; ...
2017-01-13
Phylogenetic inference is an attractive means to reconstruct transmission histories and epidemics. However, there is not a perfect correspondence between transmission history and virus phylogeny. Both node height and topological differences may occur, depending on the interaction between within-host evolutionary dynamics and between-host transmission patterns. To investigate these interactions, we added a within-host evolutionary model in epidemiological simulations and examined if the resulting phylogeny could recover different types of contact networks. To further improve realism, we also introduced patient-specific differences in infectivity across disease stages, and on the epidemic level we considered incomplete sampling and the age of the epidemic.more » Second, we implemented an inference method based on approximate Bayesian computation (ABC) to discriminate among three well-studied network models and jointly estimate both network parameters and key epidemiological quantities such as the infection rate. Our ABC framework used both topological and distance-based tree statistics for comparison between simulated and observed trees. Overall, our simulations showed that a virus time-scaled phylogeny (genealogy) may be substantially different from the between-host transmission tree. This has important implications for the interpretation of what a phylogeny reveals about the underlying epidemic contact network. In particular, we found that while the within-host evolutionary process obscures the transmission tree, the diversification process and infectivity dynamics also add discriminatory power to differentiate between different types of contact networks. We also found that the possibility to differentiate contact networks depends on how far an epidemic has progressed, where distance-based tree statistics have more power early in an epidemic. Finally, we applied our ABC inference on two different outbreaks from the Swedish HIV-1 epidemic.« less
Inference of Transmission Network Structure from HIV Phylogenetic Trees
DOE Office of Scientific and Technical Information (OSTI.GOV)
Giardina, Federica; Romero-Severson, Ethan Obie; Albert, Jan
Phylogenetic inference is an attractive means to reconstruct transmission histories and epidemics. However, there is not a perfect correspondence between transmission history and virus phylogeny. Both node height and topological differences may occur, depending on the interaction between within-host evolutionary dynamics and between-host transmission patterns. To investigate these interactions, we added a within-host evolutionary model in epidemiological simulations and examined if the resulting phylogeny could recover different types of contact networks. To further improve realism, we also introduced patient-specific differences in infectivity across disease stages, and on the epidemic level we considered incomplete sampling and the age of the epidemic.more » Second, we implemented an inference method based on approximate Bayesian computation (ABC) to discriminate among three well-studied network models and jointly estimate both network parameters and key epidemiological quantities such as the infection rate. Our ABC framework used both topological and distance-based tree statistics for comparison between simulated and observed trees. Overall, our simulations showed that a virus time-scaled phylogeny (genealogy) may be substantially different from the between-host transmission tree. This has important implications for the interpretation of what a phylogeny reveals about the underlying epidemic contact network. In particular, we found that while the within-host evolutionary process obscures the transmission tree, the diversification process and infectivity dynamics also add discriminatory power to differentiate between different types of contact networks. We also found that the possibility to differentiate contact networks depends on how far an epidemic has progressed, where distance-based tree statistics have more power early in an epidemic. Finally, we applied our ABC inference on two different outbreaks from the Swedish HIV-1 epidemic.« less
NASA Astrophysics Data System (ADS)
Eder, Wolfgang; Ives Torres-Silva, Ana; Hohenegger, Johann
2017-04-01
Phylogenetic analysis and trees based on molecular data are broadly applied and used to infer genetical and biogeographic relationship in recent larger foraminifera. Molecular phylogenetic is intensively used within recent nummulitids, however for fossil representatives these trees are only of minor informational value. Hence, within paleontological studies a phylogenetic approach through morphometric analysis is of much higher value. To tackle phylogenetic relationships within the nummulitid family, a much higher number of morphological character must be measured than are commonly used in biometric studies, where mostly parameters describing embryonic size (e.g., proloculus diameter, deuteroloculus diameter) and/or the marginal spiral (e.g., spiral diagrams, spiral indices) are studied. For this purpose 11 growth-independent and/or growth-invariant characters have been used to describe the morphological variability of equatorial thin sections of seven Carribbean nummulitid taxa (Nummulites striatoreticulatus, N. macgillavry, Palaeonummulites willcoxi, P.floridensis, P. soldadensis, P.trinitatensis and P.ocalanus) and one outgroup taxon (Ranikothalia bermudezi). Using these characters, phylogenetic trees were calculated using a restricted maximum likelihood algorithm (REML), and results are cross-checked by ordination and cluster analysis. Square-change parsimony method has been run to reconstruct ancestral states, as well as to simulate the evolution of the chosen characters along the calculated phylogenetic tree and, independent - contrast analysis was used to estimate confidence intervals. Based on these simulations, phylogenetic tendencies of certain characters proposed for nummulitids (e.g., Cope's rule or nepionic acceleration) can be tested, whether these tendencies are valid for the whole family or only for certain clades. At least, within the Carribean nummulitids, phylogenetic trends along some growth-independent characters of the embryo (e.g., first chamber length and P/D ratio) and some growth-invariant characters of the chamber sequence (e.g., backbend angle, initial chamber base length and chamber length increase) are evident.
A phylogenetic analysis of the megadiverse Chalcidoidea (Hymenoptera)
USDA-ARS?s Scientific Manuscript database
Chalcidoidea (Hymenoptera) are extremely diverse with an estimated 500,000 species. We present the first phylogenetic analysis of the superfamily based on a cladistic analysis of both morphological and molecular data. A total of 233 morphological characters were scored for 300 taxa and 265 genera, a...
Phylogenetic Analysis of Ruminant Theileria spp. from China Based on 28S Ribosomal RNA Gene
Gou, Huitian; Guan, Guiquan; Ma, Miling; Liu, Aihong; Liu, Zhijie; Xu, Zongke; Ren, Qiaoyun; Li, Youquan; Yang, Jifei; Chen, Ze
2013-01-01
Species identification using DNA sequences is the basis for DNA taxonomy. In this study, we sequenced the ribosomal large-subunit RNA gene sequences (3,037-3,061 bp) in length of 13 Chinese Theileria stocks that were infective to cattle and sheep. The complete 28S rRNA gene is relatively difficult to amplify and its conserved region is not important for phylogenetic study. Therefore, we selected the D2-D3 region from the complete 28S rRNA sequences for phylogenetic analysis. Our analyses of 28S rRNA gene sequences showed that the 28S rRNA was useful as a phylogenetic marker for analyzing the relationships among Theileria spp. in ruminants. In addition, the D2-D3 region was a short segment that could be used instead of the whole 28S rRNA sequence during the phylogenetic analysis of Theileria, and it may be an ideal DNA barcode. PMID:24327775
Phylogenetic analysis of ruminant Theileria spp. from China based on 28S ribosomal RNA gene.
Gou, Huitian; Guan, Guiquan; Ma, Miling; Liu, Aihong; Liu, Zhijie; Xu, Zongke; Ren, Qiaoyun; Li, Youquan; Yang, Jifei; Chen, Ze; Yin, Hong; Luo, Jianxun
2013-10-01
Species identification using DNA sequences is the basis for DNA taxonomy. In this study, we sequenced the ribosomal large-subunit RNA gene sequences (3,037-3,061 bp) in length of 13 Chinese Theileria stocks that were infective to cattle and sheep. The complete 28S rRNA gene is relatively difficult to amplify and its conserved region is not important for phylogenetic study. Therefore, we selected the D2-D3 region from the complete 28S rRNA sequences for phylogenetic analysis. Our analyses of 28S rRNA gene sequences showed that the 28S rRNA was useful as a phylogenetic marker for analyzing the relationships among Theileria spp. in ruminants. In addition, the D2-D3 region was a short segment that could be used instead of the whole 28S rRNA sequence during the phylogenetic analysis of Theileria, and it may be an ideal DNA barcode.
Phylogeny, host-parasite relationship and zoogeography
1999-01-01
Phylogeny is the evolutionary history of a group or the lineage of organisms and is reconstructed based on morphological, molecular and other characteristics. The genealogical relationship of a group of taxa is often expressed as a phylogenetic tree. The difficulty in categorizing the phylogeny is mainly due to the existence of frequent homoplasies that deceive observers. At the present time, cladistic analysis is believed to be one of the most effective methods of reconstructing a phylogenetic tree. Excellent computer program software for phylogenetic analysis is available. As an example, cladistic analysis was applied for nematode genera of the family Acuariidae, and the phylogenetic tree formed was compared with the system used currently. Nematodes in the genera Nippostrongylus and Heligmonoides were also analyzed, and the validity of the reconstructed phylogenetic trees was observed from a zoogeographical point of view. Some of the theories of parasite evolution were briefly reviewed as well. Coevolution of parasites and humans was discussed with special reference to the evolutionary relationship between Enterobius and primates. PMID:10634036
Burbrink, Frank T.; Lorch, Jeffrey M.; Lips, Karen R.
2017-01-01
Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occuring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptiblity is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen. PMID:29291245
Burbrink, Frank T; Lorch, Jeffrey M; Lips, Karen R
2017-12-01
Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occuring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptiblity is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen.
Burbrink, Frank T.; Lorch, Jeffrey M.; Lips, Karen R.
2017-01-01
Emerging infectious diseases (EIDs) reduce host population sizes, cause extinction, disassemble communities, and have indirect negative effects on human well-being. Fungal EIDs have reduced population abundances in amphibians and bats across many species over large areas. The recent emergence of snake fungal disease (SFD) may have caused declines in some snake populations in the Eastern United States (EUS), which is home to a phylogenetically and ecologically diverse assembly of 98 taxa. SFD has been documented in only 23 naturally occuring species, although this is likely an underestimate of the number of susceptible taxa. Using several novel methods, including artificial neural networks, we combine phylogenetic and trait-based community estimates from all taxa in this region to show that SFD hosts are both phylogenetically and ecologically randomly dispersed. This might indicate that other species of snakes in the EUS could be currently infected or susceptible to SFD. Our models also indicate that information about key traits that enhance susceptiblity is lacking. Surveillance should consider that all snake species and habitats likely harbor this pathogen.
SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
2013-01-01
Background Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408
SUNPLIN: simulation with uncertainty for phylogenetic investigations.
Martins, Wellington S; Carmo, Welton C; Longo, Humberto J; Rosa, Thierson C; Rangel, Thiago F
2013-11-15
Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.
Chang, Yan-Li; Li, Wen-Yan; Miao, Hai; Yang, Shuai-Qi; Li, Ri; Wang, Xiang; Li, Wen-Qiang; Chen, Kun-Ming
2016-02-23
Plasma membrane NADPH oxidases (NOXs) are key producers of reactive oxygen species under both normal and stress conditions in plants and they form functional subfamilies. Studies of these subfamilies indicated that they show considerable evolutionary selection. We performed a comparative genomic analysis that identified 50 ferric reduction oxidases (FRO) and 77 NOX gene homologs from 20 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots, and eudicots. Phylogenetic and structural analysis classified these FRO and NOX genes into four well-conserved groups represented as NOX, FRO I, FRO II, and FRO III. Further analysis of NOXs of phylogenetic and exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures during the evolution of NOXs family genes and which were classified into four conserved subfamilies which are represented as Sub.I, Sub.II, Sub.III, and Sub.IV. Additionally, both available global microarray data analysis and quantitative real-time PCR experiments revealed that the NOX genes in Arabidopsis and rice (Oryza sativa) have different expression patterns in different developmental stages, various abiotic stresses and hormone treatments. Finally, coexpression network analysis of NOX genes in Arabidopsis and rice revealed that NOXs have significantly correlated expression profiles with genes which are involved in plants metabolic and resistance progresses. All these results suggest that NOX family underscores the functional diversity and divergence in plants. This finding will facilitate further studies of the NOX family and provide valuable information for functional validation of this family in plants. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
ERIC Educational Resources Information Center
Franklin, Wilfred A.
2010-01-01
In a flexible multisession laboratory, students investigate concepts of phylogenetic analysis at both the molecular and the morphological level. Students finish by conducting their own analysis on a collection of skeletons representing the major phyla of vertebrates, a collection of primate skulls, or a collection of hominid skulls.
Cheng, Kun; Rong, Xiaoying; Pinto-Tomás, Adrián A.; Fernández-Villalobos, Marcela; Murillo-Cruz, Catalina
2014-01-01
Examining the population structure and the influence of recombination and ecology on microbial populations makes great sense for understanding microbial evolution and speciation. Streptomycetes are a diverse group of bacteria that are widely distributed in nature and a rich source of useful bioactive compounds; however, they are rarely subjected to population genetic investigations. In this study, we applied a five-gene-based multilocus sequence analysis (MLSA) scheme to 41 strains of Streptomyces albidoflavus derived from diverse sources, mainly insects, sea, and soil. Frequent recombination was detected in S. albidoflavus, supported by multiple lines of evidence from the pairwise homoplasy index (Φw) test, phylogenetic discordance, the Shimodaira-Hasegawa (SH) test, and network analysis, underpinning the predominance of homologous recombination within Streptomyces species. A strong habitat signal was also observed in both phylogenetic and Structure 2.3.3 analyses, indicating the importance of ecological difference in shaping the population structure. Moreover, all three habitat-associated groups, particularly the entomic group, demonstrated significantly reduced levels of gene flow with one another, generally revealing habitat barriers to recombination. Therefore, a combined effect of homologous recombination and ecology is inferred for S. albidoflavus, where dynamic evolution is at least partly balanced by the extent that differential distributions of strains among habitats limit genetic exchange. Our study stresses the significance of ecology in microbial speciation and reveals the coexistence of homologous recombination and ecological divergence in the evolution of streptomycetes. PMID:25416769
Kawaida, Hitomi; Ohba, Kohki; Koutake, Yuhki; Shimizu, Hiroshi; Tachida, Hidenori; Kobayakawa, Yoshitaka
2013-03-01
Although many physiological studies have been reported on the symbiosis between hydra and green algae, very little information from a molecular phylogenetic aspect of symbiosis is available. In order to understand the origin and evolution of symbiosis between the two organisms, we compared the phylogenetic relationships among symbiotic green algae with the phylogenetic relationships among host hydra strains. To do so, we reconstructed molecular phylogenetic trees of several strains of symbiotic chlorella harbored in the endodermal epithelial cells of viridissima group hydra strains and investigated their congruence with the molecular phylogenetic trees of the host hydra strains. To examine the species specificity between the host and the symbiont with respect to the genetic distance, we also tried to introduce chlorella strains into two aposymbiotic strains of viridissima group hydra in which symbiotic chlorella had been eliminated in advance. We discussed the origin and history of symbiosis between hydra and green algae based on the analysis. Copyright © 2012 Elsevier Inc. All rights reserved.
Chan, Philip A.; Hogan, Joseph W.; Huang, Austin; DeLong, Allison; Salemi, Marco; Mayer, Kenneth H.; Kantor, Rami
2015-01-01
Background Molecular epidemiologic evaluation of HIV-1 transmission networks can elucidate behavioral components of transmission that can be targets for intervention. Methods We combined phylogenetic and statistical approaches using pol sequences from patients diagnosed 2004-2011 at a large HIV center in Rhode Island, following 75% of the state’s HIV population. Phylogenetic trees were constructed using maximum likelihood and putative transmission clusters were evaluated using latent class analyses (LCA) to determine association of cluster size with underlying demographic/behavioral characteristics. A logistic growth model was used to assess intra-cluster dynamics over time and predict “active” clusters that were more likely to harbor undiagnosed infections. Results Of 1,166 HIV-1 subtype B sequences, 31% were distributed among 114 statistically-supported, monophyletic clusters (range: 2-15 sequences/cluster). Sequences from men who have sex with men (MSM) formed 52% of clusters. LCA demonstrated that sequences from recently diagnosed (2008-2011) MSM with primary HIV infection (PHI) and other sexually transmitted infections (STIs) were more likely to form larger clusters (Odds Ratio 1.62-11.25, p<0.01). MSM in clusters were more likely to have anonymous partners and meet partners at sex clubs and pornographic stores. Four large clusters with 38 sequences (100% male, 89% MSM) had a high-probability of harboring undiagnosed infections and included younger MSM with PHI and STIs. Conclusions In this first large-scale molecular epidemiologic investigation of HIV-1 transmission in New England, sexual networks among recently diagnosed MSM with PHI and concomitant STIs contributed to ongoing transmission. Characterization of transmission dynamics revealed actively growing clusters which may be targets for intervention. PMID:26258569
Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis.
Gibbs, S; Collard, M; Wood, B
2002-01-01
This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence.
Molecular phylogenetic trees - On the validity of the Goodman-Moore augmentation algorithm
NASA Technical Reports Server (NTRS)
Holmquist, R.
1979-01-01
A response is made to the reply of Nei and Tateno (1979) to the letter of Holmquist (1978) supporting the validity of the augmentation algorithm of Moore (1977) in reconstructions of nucleotide substitutions by means of the maximum parsimony principle. It is argued that the overestimation of the augmented numbers of nucleotide substitutions (augmented distances) found by Tateno and Nei (1978) is due to an unrepresentative data sample and that it is only necessary that evolution be stochastically uniform in different regions of the phylogenetic network for the augmentation method to be useful. The importance of the average value of the true distance over all links is explained, and the relative variances of the true and augmented distances are calculated to be almost identical. The effects of topological changes in the phylogenetic tree on the augmented distance and the question of the correctness of ancestral sequences inferred by the method of parsimony are also clarified.
A taxonomic and phylogenetic re-appraisal of the genus Curvularia
USDA-ARS?s Scientific Manuscript database
Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...
Grammatical analysis as a distributed neurobiological function.
Bozic, Mirjana; Fonteneau, Elisabeth; Su, Li; Marslen-Wilson, William D
2015-03-01
Language processing engages large-scale functional networks in both hemispheres. Although it is widely accepted that left perisylvian regions have a key role in supporting complex grammatical computations, patient data suggest that some aspects of grammatical processing could be supported bilaterally. We investigated the distribution and the nature of grammatical computations across language processing networks by comparing two types of combinatorial grammatical sequences--inflectionally complex words and minimal phrases--and contrasting them with grammatically simple words. Novel multivariate analyses revealed that they engage a coalition of separable subsystems: inflected forms triggered left-lateralized activation, dissociable into dorsal processes supporting morphophonological parsing and ventral, lexically driven morphosyntactic processes. In contrast, simple phrases activated a consistently bilateral pattern of temporal regions, overlapping with inflectional activations in L middle temporal gyrus. These data confirm the role of the left-lateralized frontotemporal network in supporting complex grammatical computations. Critically, they also point to the capacity of bilateral temporal regions to support simple, linear grammatical computations. This is consistent with a dual neurobiological framework where phylogenetically older bihemispheric systems form part of the network that supports language function in the modern human, and where significant capacities for language comprehension remain intact even following severe left hemisphere damage. Copyright © 2014 The Authors Human Brain Mapping Published by Wiley Periodicals, Inc.
Yap, Fook Choy; Yan, Yap Jin; Loon, Kiung Teh; Zhen, Justina Lee Ning; Kamau, Nelly Warau; Kumaran, Jayaraj Vijaya
2010-10-01
The present investigation was carried out in an attempt to study the phylogenetic analysis of different breeds of domestic chickens in Peninsular Malaysia inferred from partial cytochrome b gene information and random amplified polymorphic DNA (RAPD) markers. Phylogenetic analysis using both neighbor-joining (NJ) and maximum parsimony (MP) methods produced three clusters that encompassed Type-I village chickens, the red jungle fowl subspecies and the Japanese Chunky broilers. The phylogenetic analysis also revealed that majority of the Malaysian commercial chickens were randomly assembled with the Type-II village chickens. In RAPD assay, phylogenetic analysis using neighbor-joining produced six clusters that were completely distinguished based on the locality of chickens. High levels of genetic variations were observed among the village chickens, the commercial broilers, and between the commercial broilers and layer chickens. In this study, it was found that Type-I village chickens could be distinguished from the commercial chickens and Type-II village chickens at the position of the 27th nucleotide of the 351 bp cytochrome b gene. This study also revealed that RAPD markers were unable to differentiate the type of chickens, but it showed the effectiveness of RAPD in evaluating the genetic variation and the genetic relationships between chicken lines and populations.
Iranzo, Jaime; Koonin, Eugene V; Prangishvili, David; Krupovic, Mart
2016-12-15
Archaea and particularly hyperthermophilic crenarchaea are hosts to many unusual viruses with diverse virion shapes and distinct gene compositions. As is typical of viruses in general, there are no universal genes in the archaeal virosphere. Therefore, to obtain a comprehensive picture of the evolutionary relationships between viruses, network analysis methods are more productive than traditional phylogenetic approaches. Here we present a comprehensive comparative analysis of genomes and proteomes from all currently known taxonomically classified and unclassified, cultivated and uncultivated archaeal viruses. We constructed a bipartite network of archaeal viruses that includes two classes of nodes, the genomes and gene families that connect them. Dissection of this network using formal community detection methods reveals strong modularity, with 10 distinct modules and 3 putative supermodules. However, compared to similar previously analyzed networks of eukaryotic and bacterial viruses, the archaeal virus network is sparsely connected. With the exception of the tailed viruses related to bacteriophages of the order Caudovirales and the families Turriviridae and Sphaerolipoviridae that are linked to a distinct supermodule of eukaryotic and bacterial viruses, there are few connector genes shared by different archaeal virus modules. In contrast, most of these modules include, in addition to viruses, capsidless mobile elements, emphasizing tight evolutionary connections between the two types of entities in archaea. The relative contributions of distinct evolutionary origins, in particular from nonviral elements, and insufficient sampling to the sparsity of the archaeal virus network remain to be determined by further exploration of the archaeal virosphere. Viruses infecting archaea are among the most mysterious denizens of the virosphere. Many of these viruses display no genetic or even morphological relationship to viruses of bacteria and eukaryotes, raising questions regarding their origins and position in the global virosphere. Analysis of 5,740 protein sequences from 116 genomes allowed dissection of the archaeal virus network and showed that most groups of archaeal viruses are evolutionarily connected to capsidless mobile genetic elements, including various plasmids and transposons. This finding could reflect actual independent origins of the distinct groups of archaeal viruses from different nonviral elements, providing important insights into the emergence and evolution of the archaeal virome. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Wang, Xianfeng; Liu, Xiaosong; Li, Feng; Zhou, Hong; Li, Jiefang; Wang, Yingying; Liu, Lihua; Liu, Shujun; Feng, Yi; Wang, Ning
2018-01-01
The widespread use of antiretroviral therapy (ART) has led to considerable concerns about the prevalence of transmitted drug resistance (TDR). Sexual contact, particularly men who have sex with men (MSM) was the most prevalent form of HIV transmission in Shijiazhuang. Hence, we conducted an epidemiological surveillance study on TDR among newly diagnosed individuals who infected-HIV through sexual contact in from 2014-2015. Genotypic resistance mutations were defined using the WHO-2009 surveillance list. Potential impact on antiretroviral drug was predicted according to the Stanford HIV db program version 7.0. The role of transmission clusters in drug resistant strains was evaluated by phylogenetic and network analyses. In this study, 589 individuals were recruited and 542 samples were amplified and sequenced successfully. The over prevalence of TDR was 6.1%: 1.8% to nucleoside reverse transcriptase inhibitors (NRTIs), 2.0% to non- NRTIs (NNRTIs) and 2.4% to protease inhibitors (PIs), respectively. We did not find significant differences in the TDR prevalence by demographic and clinical characteristics (p > 0.05). Using network and phylogenetic analysis, almost 60.0% sequences were clustered together. Of these clusters, 2 included at least two individuals carrying the same resistance mutation, accounting for 21.2% (7/33) individuals with TDR. No significant difference was observed in the clustering rate between the individuals with and without TDR. We obtained a moderate level TDR rate in studied region. These findings enhance our understanding of HIV-1 drug resistance prevalence in Shijiazhuang, and may be helpful for the comprehensive prevention and control of HIV-1.
Phylogenetic analysis of human immunodeficiency virus type 2 isolated from Cuban individuals.
Machado, Liuber Y; Díaz, Héctor M; Noa, Enrique; Martín, Dayamí; Blanco, Madeline; Díaz, Dervel F; Sánchez, Yordank R; Nibot, Carmen; Sánchez, Lourdes; Dubed, Marta
2014-08-01
The presence of infection by human immunodeficiency virus type 2 (HIV-2) in Cuba has been previously documented. However, genetic information on the strains that circulate in the Cuban people is still unknown. The present work constitutes the first study concerning the phylogenetic relationship of HIV-2 Cuban isolates conducted on 13 Cuban patients who were diagnosed with HIV-2. The env sequences were analyzed for the construction of a phylogenetic tree with reference sequences of HIV-2. Phylogenetic analysis of the env gene showed that all the Cuban sequences clustered in group A of HIV-2. The analysis indicated several independent introductions of HIV-2 into Cuba. The results of the study will reinforce the program on the epidemiological surveillance of the infection in Cuba and make possible further molecular evolutionary studies.
Analysis of the SOS response of Vibrio and other bacteria with multiple chromosomes
2012-01-01
Background The SOS response is a well-known regulatory network present in most bacteria and aimed at addressing DNA damage. It has also been linked extensively to stress-induced mutagenesis, virulence and the emergence and dissemination of antibiotic resistance determinants. Recently, the SOS response has been shown to regulate the activity of integrases in the chromosomal superintegrons of the Vibrionaceae, which encompasses a wide range of pathogenic species harboring multiple chromosomes. Here we combine in silico and in vitro techniques to perform a comparative genomics analysis of the SOS regulon in the Vibrionaceae, and we extend the methodology to map this transcriptional network in other bacterial species harboring multiple chromosomes. Results Our analysis provides the first comprehensive description of the SOS response in a family (Vibrionaceae) that includes major human pathogens. It also identifies several previously unreported members of the SOS transcriptional network, including two proteins of unknown function. The analysis of the SOS response in other bacterial species with multiple chromosomes uncovers additional regulon members and reveals that there is a conserved core of SOS genes, and that specialized additions to this basic network take place in different phylogenetic groups. Our results also indicate that across all groups the main elements of the SOS response are always found in the large chromosome, whereas specialized additions are found in the smaller chromosomes and plasmids. Conclusions Our findings confirm that the SOS response of the Vibrionaceae is strongly linked with pathogenicity and dissemination of antibiotic resistance, and suggest that the characterization of the newly identified members of this regulon could provide key insights into the pathogenesis of Vibrio. The persistent location of key SOS genes in the large chromosome across several bacterial groups confirms that the SOS response plays an essential role in these organisms and sheds light into the mechanisms of evolution of global transcriptional networks involved in adaptability and rapid response to environmental changes, suggesting that small chromosomes may act as evolutionary test beds for the rewiring of transcriptional networks. PMID:22305460
Gamboa-Tuz, Samuel D; Pereira-Santana, Alejandro; Zhao, Tao; Schranz, M Eric; Castano, Enrique; Rodriguez-Zapata, Luis C
2018-04-25
The Transmembrane BAX Inhibitor Motif containing (TMBIM) superfamily, divided into BAX Inhibitor (BI) and Lifeguard (LFG) families, comprises a group of cytoprotective cell death regulators conserved in prokaryotes and eukaryotes. However, no research has focused on the evolution of this superfamily in plants. We identified 685 TMBIM proteins in 171 organisms from Archaea, Bacteria, and Eukarya, and provided a phylogenetic overview of the whole TMBIM superfamily. Then, we used orthology and synteny network analyses to further investigate the evolution and expansion of the BI and LFG families in 48 plants from diverse taxa. Plant BI family forms a single monophyletic group; however, monocot BI sequences transposed to another genomic context during evolution. Plant LFG family, which expanded trough whole genome and tandem duplications, is subdivided in LFG I, LFG IIA, and LFG IIB major phylogenetic groups, and retains synteny in angiosperms. Moreover, two orthologous groups (OGs) are shared between bryophytes and seed plants. Other several lineage-specific OGs are present in plants. This work clarifies the phylogenetic classification of the TMBIM superfamily across the three domains of life. Furthermore, it sheds new light on the evolution of the BI and LFG families in plants providing a benchmark for future research. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Sleator, Roy D
2011-04-01
The recent rapid expansion in the DNA and protein databases, arising from large-scale genomic and metagenomic sequence projects, has forced significant development in the field of phylogenetics: the study of the evolutionary relatedness of the planet's inhabitants. Advances in phylogenetic analysis have greatly transformed our view of the landscape of evolutionary biology, transcending the view of the tree of life that has shaped evolutionary theory since Darwinian times. Indeed, modern phylogenetic analysis no longer focuses on the restricted Darwinian-Mendelian model of vertical gene transfer, but must also consider the significant degree of lateral gene transfer, which connects and shapes almost all living things. Herein, I review the major tree-building methods, their strengths, weaknesses and future prospects.
Sharma, Ashish Ranjan; Chakraborty, Chiranjib; Lee, Sang-Soo; Sharma, Garima; Yoon, Jeong Kyo; George Priya Doss, C; Song, Dong-Keun; Nam, Ju-Suk
2014-01-01
In human, Wnt/β-catenin signaling pathway plays a significant role in cell growth, cell development, and disease pathogenesis. Four human (Rspo)s are known to activate canonical Wnt/β-catenin signaling pathway. Presently, (Rspo)s serve as therapeutic target for several human diseases. Henceforth, basic understanding about the molecular properties of (Rspo)s is essential. We approached this issue by interpreting the biochemical and biophysical properties along with molecular evolution of (Rspo)s thorough computational algorithm methods. Our analysis shows that signal peptide length is roughly similar in (Rspo)s family along with similarity in aa distribution pattern. In Rspo3, four N-glycosylation sites were noted. All members are hydrophilic in nature and showed alike GRAVY values, approximately. Conversely, Rspo3 contains the maximum positively charged residues while Rspo4 includes the lowest. Four highly aligned blocks were recorded through Gblocks. Phylogenetic analysis shows Rspo4 is being rooted with Rspo2 and similarly Rspo3 and Rspo1 have the common point of origin. Through phylogenomics study, we developed a phylogenetic tree of sixty proteins (n = 60) with the orthologs and paralogs seed sequences. Protein-protein network was also illustrated. Results demonstrated in our study may help the future researchers to unfold significant physiological and therapeutic properties of (Rspo)s in various disease models.
Sharma, Ashish Ranjan; Lee, Sang-Soo; Yoon, Jeong Kyo; George Priya Doss, C.; Song, Dong-Keun
2014-01-01
In human, Wnt/β-catenin signaling pathway plays a significant role in cell growth, cell development, and disease pathogenesis. Four human (Rspo)s are known to activate canonical Wnt/β-catenin signaling pathway. Presently, (Rspo)s serve as therapeutic target for several human diseases. Henceforth, basic understanding about the molecular properties of (Rspo)s is essential. We approached this issue by interpreting the biochemical and biophysical properties along with molecular evolution of (Rspo)s thorough computational algorithm methods. Our analysis shows that signal peptide length is roughly similar in (Rspo)s family along with similarity in aa distribution pattern. In Rspo3, four N-glycosylation sites were noted. All members are hydrophilic in nature and showed alike GRAVY values, approximately. Conversely, Rspo3 contains the maximum positively charged residues while Rspo4 includes the lowest. Four highly aligned blocks were recorded through Gblocks. Phylogenetic analysis shows Rspo4 is being rooted with Rspo2 and similarly Rspo3 and Rspo1 have the common point of origin. Through phylogenomics study, we developed a phylogenetic tree of sixty proteins (n = 60) with the orthologs and paralogs seed sequences. Protein-protein network was also illustrated. Results demonstrated in our study may help the future researchers to unfold significant physiological and therapeutic properties of (Rspo)s in various disease models. PMID:25276837
Biocomputional construction of a gene network under acid stress in Synechocystis sp. PCC 6803.
Li, Yi; Rao, Nini; Yang, Feng; Zhang, Ying; Yang, Yang; Liu, Han-ming; Guo, Fengbiao; Huang, Jian
2014-01-01
Acid stress is one of the most serious threats that cyanobacteria have to face, and it has an impact at all levels from genome to phenotype. However, very little is known about the detailed response mechanism to acid stress in this species. We present here a general analysis of the gene regulatory network of Synechocystis sp. PCC 6803 in response to acid stress using comparative genome analysis and biocomputational prediction. In this study, we collected 85 genes and used them as an initial template to predict new genes through co-regulation, protein-protein interactions and the phylogenetic profile, and 179 new genes were obtained to form a complete template. In addition, we found that 11 enriched pathways such as glycolysis are closely related to the acid stress response. Finally, we constructed a regulatory network for the intricate relationship of these genes and summarize the key steps in response to acid stress. This is the first time a bioinformatic approach has been taken systematically to gene interactions in cyanobacteria and the elaboration of their cell metabolism and regulatory pathways under acid stress, which is more efficient than a traditional experimental study. The results also provide theoretical support for similar research into environmental stresses in cyanobacteria and possible industrial applications. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
A Gateway for Phylogenetic Analysis Powered by Grid Computing Featuring GARLI 2.0
Bazinet, Adam L.; Zwickl, Derrick J.; Cummings, Michael P.
2014-01-01
We introduce molecularevolution.org, a publicly available gateway for high-throughput, maximum-likelihood phylogenetic analysis powered by grid computing. The gateway features a garli 2.0 web service that enables a user to quickly and easily submit thousands of maximum likelihood tree searches or bootstrap searches that are executed in parallel on distributed computing resources. The garli web service allows one to easily specify partitioned substitution models using a graphical interface, and it performs sophisticated post-processing of phylogenetic results. Although the garli web service has been used by the research community for over three years, here we formally announce the availability of the service, describe its capabilities, highlight new features and recent improvements, and provide details about how the grid system efficiently delivers high-quality phylogenetic results. [garli, gateway, grid computing, maximum likelihood, molecular evolution portal, phylogenetics, web service.] PMID:24789072
Phylogenetic inertia and Darwin's higher law.
Shanahan, Timothy
2011-03-01
The concept of 'phylogenetic inertia' is routinely deployed in evolutionary biology as an alternative to natural selection for explaining the persistence of characteristics that appear sub-optimal from an adaptationist perspective. However, in many of these contexts the precise meaning of 'phylogenetic inertia' and its relationship to selection are far from clear. After tracing the history of the concept of 'inertia' in evolutionary biology, I argue that treating phylogenetic inertia and natural selection as alternative explanations is mistaken because phylogenetic inertia is, from a Darwinian point of view, simply an expected effect of selection. Although Darwin did not discuss 'phylogenetic inertia,' he did assert the explanatory priority of selection over descent. An analysis of 'phylogenetic inertia' provides a perspective from which to assess Darwin's view. Copyright © 2010 Elsevier Ltd. All rights reserved.
Tse, Herman; Chen, Jonathan H.K.; Tang, Ying; Lau, Susanna K.P.; Woo, Patrick C.Y.
2014-01-01
Streptococcus sinensis is a recently discovered human pathogen isolated from blood cultures of patients with infective endocarditis. Its phylogenetic position, as well as those of its closely related species, remains inconclusive when single genes were used for phylogenetic analysis. For example, S. sinensis branched out from members of the anginosus, mitis, and sanguinis groups in the 16S ribosomal RNA gene phylogenetic tree, but it was clustered with members of the anginosus and sanguinis groups when groEL gene sequences used for analysis. In this study, we sequenced the draft genome of S. sinensis and used a polyphasic approach, including concatenated genes, whole genomes, and matrix-assisted laser desorption ionization-time of flight mass spectrometry to analyze the phylogeny of S. sinensis. The size of the S. sinensis draft genome is 2.06 Mb, with GC content of 42.2%. Phylogenetic analysis using 50 concatenated genes or whole genomes revealed that S. sinensis formed a distinct cluster with Streptococcus oligofermentans and Streptococcus cristatus, and these three streptococci were clustered with the “sanguinis group.” As for phylogenetic analysis using hierarchical cluster analysis of the mass spectra of streptococci, S. sinensis also formed a distinct cluster with S. oligofermentans and S. cristatus, but these three streptococci were clustered with the “mitis group.” On the basis of the findings, we propose a novel group, named “sinensis group,” to include S. sinensis, S. oligofermentans, and S. cristatus, in the Streptococcus genus. Our study also illustrates the power of phylogenomic analyses for resolving ambiguities in bacterial taxonomy. PMID:25331233
Teng, Jade L L; Huang, Yi; Tse, Herman; Chen, Jonathan H K; Tang, Ying; Lau, Susanna K P; Woo, Patrick C Y
2014-10-20
Streptococcus sinensis is a recently discovered human pathogen isolated from blood cultures of patients with infective endocarditis. Its phylogenetic position, as well as those of its closely related species, remains inconclusive when single genes were used for phylogenetic analysis. For example, S. sinensis branched out from members of the anginosus, mitis, and sanguinis groups in the 16S ribosomal RNA gene phylogenetic tree, but it was clustered with members of the anginosus and sanguinis groups when groEL gene sequences used for analysis. In this study, we sequenced the draft genome of S. sinensis and used a polyphasic approach, including concatenated genes, whole genomes, and matrix-assisted laser desorption ionization-time of flight mass spectrometry to analyze the phylogeny of S. sinensis. The size of the S. sinensis draft genome is 2.06 Mb, with GC content of 42.2%. Phylogenetic analysis using 50 concatenated genes or whole genomes revealed that S. sinensis formed a distinct cluster with Streptococcus oligofermentans and Streptococcus cristatus, and these three streptococci were clustered with the "sanguinis group." As for phylogenetic analysis using hierarchical cluster analysis of the mass spectra of streptococci, S. sinensis also formed a distinct cluster with S. oligofermentans and S. cristatus, but these three streptococci were clustered with the "mitis group." On the basis of the findings, we propose a novel group, named "sinensis group," to include S. sinensis, S. oligofermentans, and S. cristatus, in the Streptococcus genus. Our study also illustrates the power of phylogenomic analyses for resolving ambiguities in bacterial taxonomy. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Insights into the fold organization of TIM barrel from interaction energy based structure networks.
Vijayabaskar, M S; Vishveshwara, Saraswathi
2012-01-01
There are many well-known examples of proteins with low sequence similarity, adopting the same structural fold. This aspect of sequence-structure relationship has been extensively studied both experimentally and theoretically, however with limited success. Most of the studies consider remote homology or "sequence conservation" as the basis for their understanding. Recently "interaction energy" based network formalism (Protein Energy Networks (PENs)) was developed to understand the determinants of protein structures. In this paper we have used these PENs to investigate the common non-covalent interactions and their collective features which stabilize the TIM barrel fold. We have also developed a method of aligning PENs in order to understand the spatial conservation of interactions in the fold. We have identified key common interactions responsible for the conservation of the TIM fold, despite high sequence dissimilarity. For instance, the central beta barrel of the TIM fold is stabilized by long-range high energy electrostatic interactions and low-energy contiguous vdW interactions in certain families. The other interfaces like the helix-sheet or the helix-helix seem to be devoid of any high energy conserved interactions. Conserved interactions in the loop regions around the catalytic site of the TIM fold have also been identified, pointing out their significance in both structural and functional evolution. Based on these investigations, we have developed a novel network based phylogenetic analysis for remote homologues, which can perform better than sequence based phylogeny. Such an analysis is more meaningful from both structural and functional evolutionary perspective. We believe that the information obtained through the "interaction conservation" viewpoint and the subsequently developed method of structure network alignment, can shed new light in the fields of fold organization and de novo computational protein design.
NASA Astrophysics Data System (ADS)
Moore, E. K.; Jelen, B. I.; Giovannelli, D.; Prabhu, A.; Raanan, H.; Falkowski, P. G.
2017-12-01
Deep time changes in Earth surface redox conditions, particularly due to global oxygenation, has impacted the availability of different metals and substrates that are central in biology. Oxidoreductase proteins are molecular nanomachines responsible for all biological electron transfer processes across the tree of life. These enzymes largely contain transition metals in their active sites. Microbial metabolic pathways form a global network of electron transfer, which expanded throughout the Archean eon. Older metabolisms (sulfur reduction, methanogenesis, anoxygenic photosynthesis) accessed negative redox potentials, while later evolving metabolisms (oxygenic photosynthesis, nitrification/denitrification, aerobic respiration) accessed positive redox potentials. The incorporation of different transition metals facilitated biological innovation and the expansion of the network of microbial metabolism. Network analysis was used to examine the connections between microbial taxa, metabolic pathways, crucial metallocofactors, and substrates in deep time by incorporating biosignatures preserved in the geologic record. Nitrogen fixation and aerobic respiration have the highest level of betweenness among metabolisms in the network, indicating that the oldest metabolisms are not the most central. Fe has by far the highest betweenness among metals. Clustering analysis largely separates High Metal Bacteria (HMB), Low Metal Bacteria (LMB), and Archaea showing that simple un-weighted links between taxa, metabolism, and metals have phylogenetic relevance. On average HMB have the highest betweenness among taxa, followed by Archaea and LMB. There is a correlation between the number of metallocofactors and metabolic pathways in representative bacterial taxa, but Archaea do not follow this trend. In many cases older and more recently evolved metabolisms were clustered together supporting previous findings that proliferation of metabolic pathways is not necessarily chronological.
Sultana, H.; Seo, D. W.; Bhuiyan, M. S. A.; Choi, N. R.; Hoque, M. R.; Heo, K. N.; Lee, J. H.
2016-01-01
The maternally inherited mitochondrial DNA (mtDNA) D–loop region is widely used for exploring genetic relationships and for investigating the origin of various animal species. Currently, domestic ducks play an important role in animal protein supply. In this study, partial mtDNA D–loop sequences were obtained from 145 samples belonging to six South-East Asian duck populations and commercial duck population. All these populations were closely related to the mallard duck (Anas platyrhynchos), as indicated by their mean overall genetic distance. Sixteen nucleotide substitutions were identified in sequence analyses allowing the distinction of 28 haplotypes. Around 42.76% of the duck sequences were classified as Hap_02, which completely matched with Anas platyrhynchos duck species. The neighbor-joining phylogenetic tree also revealed that South-East Asian duck populations were closely related to Anas platyrhynchos. Network profiles were also traced using the 28 haplotypes. Overall, results showed that those duck populations D-loop haplotypes were shared between several duck breeds from Korea and Bangladesh sub continental regions. Therefore, these results confirmed that South-East Asian domestic duck populations have been domesticated from Anas platyrhynchos duck as the maternal origins. PMID:27004808
Bandelt, Hans-Jürgen; Yao, Yong-Gang; Bravi, Claudio M; Salas, Antonio; Kivisild, Toomas
2009-03-01
Sequence analysis of the mitochondrial genome has become a routine method in the study of mitochondrial diseases. Quite often, the sequencing efforts in the search of pathogenic or disease-associated mutations are affected by technical and interpretive problems, caused by sample mix-up, contamination, biochemical problems, incomplete sequencing, misdocumentation and insufficient reference to previously published data. To assess data quality in case studies of mitochondrial diseases, it is recommended to compare any mtDNA sequence under consideration to their phylogenetically closest lineages available in the Web. The median network method has proven useful for visualizing potential problems with the data. We contrast some early reports of complete mtDNA sequences to more recent total mtDNA sequencing efforts in studies of various mitochondrial diseases. We conclude that the quality of complete mtDNA sequences generated in the medical field in the past few years is somewhat unsatisfactory and may even fall behind that of pioneer manual sequencing in the early nineties. Our study provides a paradigm for an a posteriori evaluation of sequence quality and for detection of potential problems with inferring a pathogenic status of a particular mutation.
Hu, Wei; Wang, Lianzhe; Tie, Weiwei; Yan, Yan; Ding, Zehong; Liu, Juhua; Li, Meiying; Peng, Ming; Xu, Biyu; Jin, Zhiqiang
2016-01-01
The leucine zipper (bZIP) transcription factors play important roles in multiple biological processes. However, less information is available regarding the bZIP family in the important fruit crop banana. In this study, 121 bZIP transcription factor genes were identified in the banana genome. Phylogenetic analysis showed that MabZIPs were classified into 11 subfamilies. The majority of MabZIP genes in the same subfamily shared similar gene structures and conserved motifs. The comprehensive transcriptome analysis of two banana genotypes revealed the differential expression patterns of MabZIP genes in different organs, in various stages of fruit development and ripening, and in responses to abiotic stresses, including drought, cold, and salt. Interaction networks and co-expression assays showed that group A MabZIP-mediated networks participated in various stress signaling, which was strongly activated in Musa ABB Pisang Awak. This study provided new insights into the complicated transcriptional control of MabZIP genes and provided robust tissue-specific, development-dependent, and abiotic stress-responsive candidate MabZIP genes for potential applications in the genetic improvement of banana cultivars. PMID:27445085
Paraschiv, Simona; Otelea, Dan; Batan, Ionelia; Baicus, Cristian; Magiorkinis, Gkikas; Paraskevis, Dimitrios
2012-07-01
HIV-1 subtype B is predominant in Europe except in some countries from Eastern Europe which are characterized by a high prevalence of non-B subtypes and circulating recombinant forms (CRFs). Romania is a particular case: the HIV-1 epidemic started with subtype F1 which is still the most prevalent. Previous studies have shown an increasing prevalence of subtype B which is the second most frequent one among the newly diagnosed individuals, followed by subtype C and several CRFs as well as unique recombinant forms (URFs). Our objective was to analyze in detail the characteristics (way of dispersal, association with transmission risk groups) of the subtype B infections in Romania by means of phylogenetic analysis. Among all the individuals sampled during 2003-2010, 71 out of 1127 patients (6.3%) have been identified to be infected with subtype B strains. The most frequent route of infection identified in HIV-1 subtype B patients in Romania was MSM transmission (39.6%), followed by the heterosexual route (35.2%). Many of the patients acquired the infection abroad, mainly in Western European countries. Phylogenetic analysis indicated the existence of a local transmission network (monophyletic clade) including 14 patients, mainly MSM living in the Bucharest area. We estimate the origin of the local transmission network that dates at the beginning of the 90s; the introduction of the F1 and C subtypes occurred earlier. The rest of the sequences were intermixed with reference strains sampled across Europe suggesting that single infection were not followed by subsequent dispersal within the local population. Although HIV-1 subtype B epidemic in Romania is recent, there is evidence for local spread among the MSMs, in addition to multiple introductions. Copyright © 2012 Elsevier B.V. All rights reserved.
Punctuated equilibrium in the large-scale evolution of programming languages†
Valverde, Sergi; Solé, Ricard V.
2015-01-01
The analogies and differences between biological and cultural evolution have been explored by evolutionary biologists, historians, engineers and linguists alike. Two well-known domains of cultural change are language and technology. Both share some traits relating the evolution of species, but technological change is very difficult to study. A major challenge in our way towards a scientific theory of technological evolution is how to properly define evolutionary trees or clades and how to weight the role played by horizontal transfer of information. Here, we study the large-scale historical development of programming languages, which have deeply marked social and technological advances in the last half century. We analyse their historical connections using network theory and reconstructed phylogenetic networks. Using both data analysis and network modelling, it is shown that their evolution is highly uneven, marked by innovation events where new languages are created out of improved combinations of different structural components belonging to previous languages. These radiation events occur in a bursty pattern and are tied to novel technological and social niches. The method can be extrapolated to other systems and consistently captures the major classes of languages and the widespread horizontal design exchanges, revealing a punctuated evolutionary path. PMID:25994298
The phylogenetic relationship of Alexandrium monilatum to other Alexandrium spp. was explored using 18S rDNA sequences. Maximum likelilhood phylogenetic analysis of the combined rDNA sequences established that A. monilatum paired with Alexandrium taylori and that the pair was the...
The phylogenetic relationship of Alexandrium monilatum to other Alexandrium spp. was explored using 18S rDNA sequences. Maximum likelihood phylogenetic analysis of the combined rDNA sequences established that A. monilatum paired with Alexandrium taylori and that the pair was the ...
Pan-genome and phylogeny of Bacillus cereus sensu lato.
Bazinet, Adam L
2017-08-02
Bacillus cereus sensu lato (s. l.) is an ecologically diverse bacterial group of medical and agricultural significance. In this study, I use publicly available genomes and novel bioinformatic workflows to characterize the B. cereus s. l. pan-genome and perform the largest phylogenetic and population genetic analyses of this group to date in terms of the number of genes and taxa included. With these fundamental data in hand, I identify genes associated with particular phenotypic traits (i.e., "pan-GWAS" analysis), and quantify the degree to which taxa sharing common attributes are phylogenetically clustered. A rapid k-mer based approach (Mash) was used to create reduced representations of selected Bacillus genomes, and a fast distance-based phylogenetic analysis of this data (FastME) was performed to determine which species should be included in B. cereus s. l. The complete genomes of eight B. cereus s. l. species were annotated de novo with Prokka, and these annotations were used by Roary to produce the B. cereus s. l. pan-genome. Scoary was used to associate gene presence and absence patterns with various phenotypes. The orthologous protein sequence clusters produced by Roary were filtered and used to build HaMStR databases of gene models that were used in turn to construct phylogenetic data matrices. Phylogenetic analyses used RAxML, DendroPy, ClonalFrameML, PAUP*, and SplitsTree. Bayesian model-based population genetic analysis assigned taxa to clusters using hierBAPS. The genealogical sorting index was used to quantify the phylogenetic clustering of taxa sharing common attributes. The B. cereus s. l. pan-genome currently consists of ≈60,000 genes, ≈600 of which are "core" (common to at least 99% of taxa sampled). Pan-GWAS analysis revealed genes associated with phenotypes such as isolation source, oxygen requirement, and ability to cause diseases such as anthrax or food poisoning. Extensive phylogenetic analyses using an unprecedented amount of data produced phylogenies that were largely concordant with each other and with previous studies. Phylogenetic support as measured by bootstrap probabilities increased markedly when all suitable pan-genome data was included in phylogenetic analyses, as opposed to when only core genes were used. Bayesian population genetic analysis recommended subdividing the three major clades of B. cereus s. l. into nine clusters. Taxa sharing common traits and species designations exhibited varying degrees of phylogenetic clustering. All phylogenetic analyses recapitulated two previously used classification systems, and taxa were consistently assigned to the same major clade and group. By including accessory genes from the pan-genome in the phylogenetic analyses, I produced an exceptionally well-supported phylogeny of 114 complete B. cereus s. l. genomes. The best-performing methods were used to produce a phylogeny of all 498 publicly available B. cereus s. l. genomes, which was in turn used to compare three different classification systems and to test the monophyly status of various B. cereus s. l. species. The majority of the methodology used in this study is generic and could be leveraged to produce pan-genome estimates and similarly robust phylogenetic hypotheses for other bacterial groups.
PAMLX: a graphical user interface for PAML.
Xu, Bo; Yang, Ziheng
2013-12-01
This note announces pamlX, a graphical user interface/front end for the paml (for Phylogenetic Analysis by Maximum Likelihood) program package (Yang Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 13:555-556; Yang Z. 2007. PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol. 24:1586-1591). pamlX is written in C++ using the Qt library and communicates with paml programs through files. It can be used to create, edit, and print control files for paml programs and to launch paml runs. The interface is available for free download at http://abacus.gene.ucl.ac.uk/software/paml.html.
Leaché, Adam D.; Banbury, Barbara L.; Felsenstein, Joseph; de Oca, Adrián nieto-Montes; Stamatakis, Alexandros
2015-01-01
Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the presence of missing data. Phylogenetic analysis of RAD loci requires careful attention to model assumptions, especially if downstream analyses depend on branch lengths. PMID:26227865
A Deliberate Practice Approach to Teaching Phylogenetic Analysis
ERIC Educational Resources Information Center
Hobbs, F. Collin; Johnson, Daniel J.; Kearns, Katherine D.
2013-01-01
One goal of postsecondary education is to assist students in developing expert-level understanding. Previous attempts to encourage expert-level understanding of phylogenetic analysis in college science classrooms have largely focused on isolated, or "one-shot," in-class activities. Using a deliberate practice instructional approach, we…
Ragonnet-Cronin, Manon; Hué, Stéphane; Hodcroft, Emma B; Tostevin, Anna; Dunn, David; Fawcett, Tracy; Pozniak, Anton; Brown, Alison E; Delpech, Valerie; Brown, Andrew J Leigh
2018-06-01
Patients who do not disclose their sexuality, including men who do not disclose same-sex behaviour, are difficult to characterise through traditional epidemiological approaches such as interviews. Using a recently developed method to detect large networks of viral sequences from time-resolved trees, we localised non-disclosed men who have sex with men (MSM) in UK transmission networks, gaining crucial insight into the behaviour of this group. For this phylogenetic analysis, we obtained HIV pol sequences from the UK HIV Drug Resistance Database (UKRDB), a central repository for resistance tests done as part of routine clinical care throughout the UK. Sequence data are linked to demographic and clinical data held by the UK Collaborative HIV Cohort study and the national HIV/AIDS reporting system database. Initially, we reconstructed maximum likelihood phylogenies from these sequences, then sequences were selected for time-resolved analysis in BEAST if they were clustered with at least one other sequence at a genetic distance of 4·5% or less with support of at least 90%. We used time-resolved phylogenies to create networks by linking together nodes if sequences shared a common ancestor within the previous 5 years. We identified potential non-disclosed MSM (pnMSM), defined as self-reported heterosexual men who clustered only with men. We measured the network position of pnMSM, including betweenness (a measure of connectedness and importance) and assortativity (the propensity for nodes sharing attributes to link). 14 405 individuals were in the network, including 8452 MSM, 1743 heterosexual women and 1341 heterosexual men. 249 pnMSM were identified (18·6% of all clustered heterosexual men) in the network. pnMSM were more likely to be black African (p<0·0001), less likely to be infected with subtype B (p=0·006), and were slightly older (p=0·002) than the MSM they clustered with. Mean betweenness centrality was lower for pnMSM than for MSM (1·31, 95% CI 0·48-2·15 in pnMSM vs 2·24, 0·98-3·51 in MSM; p=0·002), indicating that pnMSM were in peripheral positions in MSM clusters. Assortativity by risk group was higher than expected (0·037 vs -0·037, p=0·01) signifying that pnMSM were linked to each other. We found that self-reported heterosexual men were more likely to link MSM and heterosexual women than heterosexual women were to link MSM and heterosexual men (Fisher's exact test p=0·0004; OR 2·24) but the number of such transmission chains was small (only 54 in total vs 32 in women). pnMSM are a subgroup distinct from both MSM and from heterosexual men. They are more likely to choose sexual partners who are also pnMSM and might exhibit lower-risk sexual behaviour than MSM (eg, choosing low-risk partners or consistently using condoms). Heterosexual men are the group most likely to be diagnosed with late-stage disease (ie, low CD4 counts) and non-disclosed MSM might put female partners at higher risk than heterosexual men because non-disclosed MSM have male partners. Hence, pnMSM require specific consideration to ensure they are included in public health interventions. National Institutes of Health. Copyright © 2018 Elsevier Ltd. All rights reserved.
Berney, Cédric; Geisen, Stefan; Van Wichelen, Jeroen; Nitsche, Frank; Vanormelingen, Pieter; Bonkowski, Michael; Bass, David
2015-05-01
Amoebae able to form cytoplasmic networks or displaying a multiply branching morphology remain very poorly studied. We sequenced the small-subunit ribosomal RNA gene of 15 new amoeboid isolates, 14 of which are branching or network-forming amoebae (BNFA). Phylogenetic analyses showed that these isolates all group within the poorly-known and weakly-defined class Variosea (Amoebozoa). They are resolved into six lineages corresponding to distinct new morphotypes; we describe them as new genera Angulamoeba (type species Angulamoeba microcystivorans n. gen., n. sp.; and A. fungorum n. sp.), Arboramoeba (type species Arboramoeba reticulata n. gen., n. sp.), Darbyshirella (type species Darbyshirella terrestris n. gen., n. sp.), Dictyamoeba (type species Dictyamoeba vorax n. gen., n. sp.), Heliamoeba (type species Heliamoeba mirabilis n. gen., n. sp.), and Ischnamoeba (type species Ischnamoeba montana n. gen., n. sp.). We also isolated and sequenced four additional variosean strains, one belonging to Flamella, one related to Telaepolella tubasferens, and two members of the cavosteliid protosteloid lineage. We identified a further 104 putative variosean environmental clone sequences in GenBank, comprising up to 14 lineages that may prove to represent additional novel morphotypes. We show that BNFA are phylogenetically widespread in Variosea and morphologically very variable, both within and between lineages. Copyright © 2015 Elsevier GmbH. All rights reserved.
Jeon, Sun Jeong; Nguyen, Thi Thuong Thuong; Lee, Hyang Burm
2015-09-01
A seed-borne fungus, Curvularia sp. EML-KWD01, was isolated from an indigenous wheat seed by standard blotter method. This fungus was characterized based on the morphological characteristics and molecular phylogenetic analysis. Phylogenetic status of the fungus was determined using sequences of three loci: rDNA internal transcribed spacer, large ribosomal subunit, and glyceraldehyde 3-phosphate dehydrogenase gene. Multi loci sequencing analysis revealed that this fungus was Curvularia spicifera within Curvularia group 2 of family Pleosporaceae.
Phylogenetic relationship of Ornithobacterium rhinotracheale strains.
DE Oca-Jimenez, Roberto Montes; Vega-Sanchez, Vicente; Morales-Erasto, Vladimir; Salgado-Miranda, Celene; Blackall, Patrick J; Soriano-Vargas, Edgardo
2018-04-10
The bacterium Ornithobacterium rhinotracheale is associated with respiratory disease in wild birds and poultry. In this study, the phylogenetic analysis of nine reference strains of O. rhinotracheale belonging to serovars A to I, and eight Mexican isolates belonging to serovar A, was performed. The analysis was extended to include available sequences from another 23 strains available in the public domain. The analysis showed that the 40 sequences formed six clusters, I to VI. All eight Mexican field isolates were placed in cluster I. One of the reference strains appears to present genetic diversity not previously recognized and was placed in a new genetic cluster. In conclusion, the phylogenetic analysis of O. rhinotracheale strains, based on the 16S rRNA gene, is a suitable tool for epidemiologic studies.
The impact of transmission clusters on primary drug resistance in newly diagnosed HIV-1 infection.
Yerly, Sabine; Junier, Thomas; Gayet-Ageron, Angèle; Amari, Emmanuelle Boffi El; von Wyl, Viktor; Günthard, Huldrych F; Hirschel, Bernard; Zdobnov, Evgeny; Kaiser, Laurent
2009-07-17
To monitor HIV-1 transmitted drug resistance (TDR) in a well defined urban area with large access to antiretroviral therapy and to assess the potential source of infection of newly diagnosed HIV individuals. All individuals resident in Geneva, Switzerland, with a newly diagnosed HIV infection between 2000 and 2008 were screened for HIV resistance. An infection was considered as recent when the positive test followed a negative screening test within less than 1 year. Phylogenetic analyses were performed by using the maximum likelihood method on pol sequences including 1058 individuals with chronic infection living in Geneva. Of 637 individuals with newly diagnosed HIV infection, 20% had a recent infection. Mutations associated with resistance to at least one drug class were detected in 8.5% [nucleoside reverse transcriptase inhibitors (NRTIs), 6.3%; non-nucleoside reverse transcriptase inhibitors (NNRTIs), 3.5%; protease inhibitors, 1.9%]. TDR (P-trend = 0.015) and, in particular, NNRTI resistance (P = 0.002) increased from 2000 to 2008. Phylogenetic analyses revealed that 34.9% of newly diagnosed individuals, and 52.7% of those with recent infection were linked to transmission clusters. Clusters were more frequent in individuals with TDR than in those with sensitive strains (59.3 vs. 32.6%, respectively; P < 0.0001). Moreover, 84% of newly diagnosed individuals with TDR were part of clusters composed of only newly diagnosed individuals. Reconstruction of the HIV transmission networks using phylogenetic analysis shows that newly diagnosed HIV infections are a significant source of onward transmission, particularly of resistant strains, thus suggesting an important self-fueling mechanism for TDR.
Dashper, Stuart G; Mitchell, Helen L; Seers, Christine A; Gladman, Simon L; Seemann, Torsten; Bulach, Dieter M; Chandry, P Scott; Cross, Keith J; Cleal, Steven M; Reynolds, Eric C
2017-01-01
Porphyromonas gingivalis is a keystone pathogen of chronic periodontitis. The virulence of P. gingivalis is reported to be strain related and there are currently a number of strain typing schemes based on variation in capsular polysaccharide, the major and minor fimbriae and adhesin domains of Lys-gingipain (Kgp), amongst other surface proteins. P. gingivalis can exchange chromosomal DNA between strains by natural competence and conjugation. The aim of this study was to determine the genetic variability of P. gingivalis strains sourced from international locations over a 25-year period and to determine if variability in surface virulence factors has a phylogenetic basis. Whole genome sequencing was performed on 13 strains and comparison made to 10 previously sequenced strains. A single nucleotide polymorphism-based phylogenetic analysis demonstrated a shallow tri-lobed phylogeny. There was a high level of reticulation in the phylogenetic network, demonstrating extensive horizontal gene transfer between the strains. Two highly conserved variants of the catalytic domain of the major virulence factor the Kgp proteinase (Kgp cat I and Kgp cat II) were found. There were three variants of the fourth Kgp C-terminal cleaved adhesin domain. Specific variants of the cell surface proteins FimA, FimCDE, MfaI, RagAB, Tpr, and PrtT were also identified. The occurrence of all these variants in the P. gingivalis strains formed a mosaic that was not related to the SNP-based phylogeny. In conclusion P. gingivalis uses domain rearrangements and genetic exchange to generate diversity in specific surface virulence factors.
Panichsillapakit, Theppharit; Smith, Davey M; Wertheim, Joel O; Richman, Douglas D; Little, Susan J; Mehta, Sanjay R
2016-02-01
Transmitted drug resistance (TDR) remains an important concern when initiating antiretroviral therapy (ART). Here, we describe the prevalence and phylogenetic relationships of TDR among ART-naive, HIV-infected individuals in San Diego from 1996 to 2013. Data were analyzed from 496 participants of the San Diego Primary Infection Cohort who underwent genotypic resistance testing before initiating therapy. Mutations associated with drug resistance were identified according to the WHO-2009 surveillance list. Network and phylogenetic analyses of the HIV-1 pol sequences were used to evaluate the relationships of TDR within the context of the entire cohort. The overall prevalence of TDR was 13.5% (67/496), with an increasing trend over the study period (P = 0.005). TDR was predominantly toward nonnucleoside reverse transcriptase inhibitors (NNRTIs) [8.5% (42/496)], also increasing over the study period (P = 0.005). By contrast, TDR to protease inhibitors and nucleos(t)ide reverse transcriptase inhibitors were 4.4% (22/496) and 3.8% (19/496), respectively, and did not vary with time. TDR prevalence did not differ by age, gender, race/ethnicity, or risk factors. Using phylogenetic analysis, we identified 52 transmission clusters, including 8 with at least 2 individuals sharing the same mutation, accounting for 23.8% (16/67) of the individuals with TDR. Between 1996 and 2013, the prevalence of TDR significantly increased among recently infected ART-naive individuals in San Diego. Around one-fourth of TDR occurred within clusters of recently infected individuals. These findings highlight the importance of baseline resistance testing to guide selection of ART and for public health monitoring.
Detection and Phylogenetic Analysis of Group 1 Coronaviruses in South American Bats
Foster, Jerome E.; Zhu, Hua Chen; Zhang, Jin Xia; Smith, Gavin J.D.; Thompson, Nadin; Auguste, Albert J.; Ramkissoon, Vernie; Adesiyun, Abiodun A.; Guan, Yi
2008-01-01
Bat coronaviruses (Bt-CoVs) are thought to be the precursors of severe acute respiratory syndrome coronavirus. We detected Bt-CoVs in 2 bat species from Trinidad. Phylogenetic analysis of the RNA-dependent RNA polymerase gene and helicase confirmed them as group 1 coronaviruses. PMID:19046513
USDA-ARS?s Scientific Manuscript database
Sarcocystis nesbitti was first described by Mandour in 1969 from rhesus monkey muscle. Its definitive host remains unknown. 18SrRNA gene of Sarcocystis nesbitti was amplified, sequenced, and subjected to phylogenetic analysis. Among those congeners available for comparison, it shares closest affinit...
ERIC Educational Resources Information Center
Cline, Erica; Gogarten, Jennifer
2012-01-01
We describe a laboratory exercise developed for the cell and molecular biology quarter of a year-long majors' undergraduate introductory biology sequence. In an analysis of salmon samples collected by students in their local stores and restaurants, DNA sequencing and phylogenetic analysis were used to detect market substitution of Atlantic salmon…
Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.
Barr, W Andrew; Scott, Robert S
2014-04-01
In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, <5.6% of DFAs and <8.26% of PGLS analyses were significant. When characters contained phylogenetic signal and actual habitats were used, 33.27 to 45.07% of DFAs and <13.09% of PGLS analyses were significant. False Discovery Rate (FDR) corrections for multiple PGLS analyses reduced the rate of significance to <4.64%. In all cases using actual habitats and characters with phylogenetic signal, correct classification rates of DFAs exceeded random chance. In simulations involving phylogenetic signal in both predictor variables and predicted categories, PGLS with FDR was rarely significant, while DFA often was. In short, DFA offered no indication that differences between categories might be explained by phylogenetic signal, while PGLS did. As such, PGLS provides a valuable tool for testing the functional hypotheses at the heart of ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.
Kevin M. Potter; Frank H. Koch
2014-01-01
The analysis of phylogenetic relationships among co-occurring tree species offers insights into the ecological organization of forest communities from an evolutionary perspective and, when employed regionally across thousands of plots, can assist in forest health assessment. Phylogenetic clustering of species, when species are more closely related than expected by...
Cau, Andrea
2017-01-01
Bayesian phylogenetic methods integrating simultaneously morphological and stratigraphic information have been applied increasingly among paleontologists. Most of these studies have used Bayesian methods as an alternative to the widely-used parsimony analysis, to infer macroevolutionary patterns and relationships among species-level or higher taxa. Among recently introduced Bayesian methodologies, the Fossilized Birth-Death (FBD) model allows incorporation of hypotheses on ancestor-descendant relationships in phylogenetic analyses including fossil taxa. Here, the FBD model is used to infer the relationships among an ingroup formed exclusively by fossil individuals, i.e., dipnoan tooth plates from four localities in the Ain el Guettar Formation of Tunisia. Previous analyses of this sample compared the results of phylogenetic analysis using parsimony with stratigraphic methods, inferred a high diversity (five or more genera) in the Ain el Guettar Formation, and interpreted it as an artifact inflated by depositional factors. In the analysis performed here, the uncertainty on the chronostratigraphic relationships among the specimens was included among the prior settings. The results of the analysis confirm the referral of most of the specimens to the taxa Asiatoceratodus , Equinoxiodus, Lavocatodus and Neoceratodus , but reject those to Ceratodus and Ferganoceratodus . The resulting phylogeny constrained the evolution of the Tunisian sample exclusively in the Early Cretaceous, contrasting with the previous scenario inferred by the stratigraphically-calibrated topology resulting from parsimony analysis. The phylogenetic framework also suggests that (1) the sampled localities are laterally equivalent, (2) but three localities are restricted to the youngest part of the section; both results are in agreement with previous stratigraphic analyses of these localities. The FBD model of specimen-level units provides a novel tool for phylogenetic inference among fossils but also for independent tests of stratigraphic scenarios.
Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis
Gibbs, S; Collard, M; Wood, B
2002-01-01
This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence. PMID:11833653
A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.
Takahashi, Tetsumi; Moreno, Edmundo
2015-12-01
The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group were diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.
Yutin, Natalya; Raoult, Didier; Koonin, Eugene V
2013-05-23
Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor. We performed a comprehensive phylogenomic analysis of the available genomes of virophages and traced the evolutionary connections between the virophages and other selfish genetic elements. The comparison of the gene composition and genome organization of the virophages reveals 6 conserved, core genes that are organized in partially conserved arrays. Phylogenetic analysis of those core virophage genes, for which a sufficient diversity of homologs outside the virophages was detected, including the maturation protease and the packaging ATPase, supports the monophyly of the virophages. The results of this analysis appear incompatible with the origin of polintons from a Mavirus-like agent but rather suggest that Mavirus evolved through recombination between a polinton and an unknown virus. Altogether, virophages, polintons, a distinct Tetrahymena transposable element Tlr1, transpovirons, adenoviruses, and some bacteriophages form a network of evolutionary relationships that is held together by overlapping sets of shared genes and appears to represent a distinct module in the vast total network of viruses and mobile elements. The results of the phylogenomic analysis of the virophages and related genetic elements are compatible with the concept of network-like evolution of the virus world and emphasize multiple evolutionary connections between bona fide viruses and other classes of capsid-less mobile elements.
2013-01-01
Background Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor. Results We performed a comprehensive phylogenomic analysis of the available genomes of virophages and traced the evolutionary connections between the virophages and other selfish genetic elements. The comparison of the gene composition and genome organization of the virophages reveals 6 conserved, core genes that are organized in partially conserved arrays. Phylogenetic analysis of those core virophage genes, for which a sufficient diversity of homologs outside the virophages was detected, including the maturation protease and the packaging ATPase, supports the monophyly of the virophages. The results of this analysis appear incompatible with the origin of polintons from a Mavirus-like agent but rather suggest that Mavirus evolved through recombination between a polinton and an unknownvirus. Altogether, virophages, polintons, a distinct Tetrahymena transposable element Tlr1, transpovirons, adenoviruses, and some bacteriophages form a network of evolutionary relationships that is held together by overlapping sets of shared genes and appears to represent a distinct module in the vast total network of viruses and mobile elements. Conclusions The results of the phylogenomic analysis of the virophages and related genetic elements are compatible with the concept of network-like evolution of the virus world and emphasize multiple evolutionary connections between bona fide viruses and other classes of capsid-less mobile elements. PMID:23701946
Hu, Wei; Xia, Zhiqiang; Yan, Yan; Ding, Zehong; Tie, Weiwei; Wang, Lianzhe; Zou, Meiling; Wei, Yunxie; Lu, Cheng; Hou, Xiaowan; Wang, Wenquan; Peng, Ming
2015-01-01
Cassava is an important food and potential biofuel crop that is tolerant to multiple abiotic stressors. The mechanisms underlying these tolerances are currently less known. CBL-interacting protein kinases (CIPKs) have been shown to play crucial roles in plant developmental processes, hormone signaling transduction, and in the response to abiotic stress. However, no data is currently available about the CPK family in cassava. In this study, a total of 25 CIPK genes were identified from cassava genome based on our previous genome sequencing data. Phylogenetic analysis suggested that 25 MeCIPKs could be classified into four subfamilies, which was supported by exon-intron organizations and the architectures of conserved protein motifs. Transcriptomic analysis of a wild subspecies and two cultivated varieties showed that most MeCIPKs had different expression patterns between wild subspecies and cultivatars in different tissues or in response to drought stress. Some orthologous genes involved in CIPK interaction networks were identified between Arabidopsis and cassava. The interaction networks and co-expression patterns of these orthologous genes revealed that the crucial pathways controlled by CIPK networks may be involved in the differential response to drought stress in different accessions of cassava. Nine MeCIPK genes were selected to investigate their transcriptional response to various stimuli and the results showed the comprehensive response of the tested MeCIPK genes to osmotic, salt, cold, oxidative stressors, and ABA signaling. The identification and expression analysis of CIPK family suggested that CIPK genes are important components of development and multiple signal transduction pathways in cassava. The findings of this study will help lay a foundation for the functional characterization of the CIPK gene family and provide an improved understanding of abiotic stress responses and signaling transduction in cassava. PMID:26579161
Guzman-Valencia, S; Santillán-Galicia, M T; Guzmán-Franco, A W; González-Hernández, H; Carrillo-Benítez, M G; Suárez-Espinoza, J
2014-10-01
Oligonychus punicae and Oligonychus perseae (Acari: Tetranychidae) are the most important mite species affecting avocado orchards in Mexico. Here we used nucleotide sequence data from segments of the nuclear ribosomal internal transcribed spacers (ITS1 and ITS2) and mitochondrial cytochrome oxidase subunit I (COI) genes to assess the phylogenetic relationships between both sympatric mite species and, using only ITS sequence data, examine genetic variation and population structure in both species, to test the hypothesis that, although both species co-occur, their genetic population structures are different in both Michoacan state (main producer) and Mexico state. Phylogenetic analysis showed a clear separation between both species using ITS and COI sequence information. Haplotype network analysis done on 24 samples of O. punicae revealed low genetic diversity with only three haplotypes found but a significant geographical population structure confirmed by analysis of molecular variance (AMOVA) and Kimura-2-parameter (K2P) analyses. In addition, a Mantel test revealed that geographical isolation was a factor responsible for the genetic differentiation. In contrast, analyses of 22 samples of O. perseae revealed high genetic diversity with 15 haplotypes found but no geographical structure confirmed by the AMOVA, K2P and Mantel test analyses. We have suggested that geographical separation is one of the most important factors driving genetic variation, but that it affected each species differently. The role of the ecology of these species on our results, and the importance of our findings in the development of monitoring and control strategies are discussed.
Full-genome sequence and analysis of a novel human rhinovirus strain within a divergent HRV-A clade.
Rathe, Jennifer A; Liu, Xinyue; Tallon, Luke J; Gern, James E; Liggett, Stephen B
2010-01-01
Genome sequences of human rhinoviruses (HRV) have primarily been from stocks collected in the 1960s, with genomes and phylogeny of modern HRVs remaining undefined. Here, two modern isolates (hrv-A101 and hrv-A101-v1) collected approximately 8 years apart were sequenced in their entirety. Incorporation into our full-genome HRV alignment with subsequent phylogenetic network inference indicated that these represent a unique HRV-A, localized within a distinct divergent clade. They appear to have resulted from recombination of the hrv-65 and hrv-78 lineages. These results support our contention that there are unrecognized distinct HRV-A strains, and that recombination is evident in currently circulating strains.
Bazsalovicsová, Eva; Králová-Hromadová, Ivica; Xi, Bing-Wen; Štefka, Jan
2018-02-24
The monozoic tapeworm Atractolytocestus huronensis Anthony, 1958 (Cestoda: Caryophyllidea), an intestinal parasite of the common carp, is characterized by its invasive character and potential to colonize new territories. It was initially described from North America and has also been found in several European countries. The most recent findings of A. huronensis originated from China and South Africa; however, no data on genetic relationships of these populations were available. The current study provides the first molecular characterisation of A. huronensis from South Africa and China using a partial sequence of mitochondrial cytochrome c oxidase subunit 1 (cox1) and a complete ribosomal ITS2 spacer. Ribosomal and mitochondrial data were applied for phylogenetic analyses in order to assess the genetic interrelationships among global A. huronensis populations. Divergent intragenomic copies of ribosomal ITS2 were detected in all analysed specimens; the structure and frequency of the ITS2 variants of tapeworms from China and South Africa corresponded with the data on ITS2 paralogues observed previously in A. huronensis from Slovakia, the United States and the United Kingdom. The phylogenetic analysis of cox1 indicated that A. huronensis exist in two slightly differentiated clusters; one cluster was supported by all phylogenetic approaches (NJ, ML, BI) and was represented by samples from China, the USA and the UK. A second cluster was represented by tapeworms from continental Europe (Slovakia, Hungary, Romania, Croatia) and South Africa. Haplotype network analysis revealed that the highest population diversity occurs in China. The results provide useful pilot information about the interrelationships of A. huronensis on four continents and indicate that China, or the eastern Palaearctic, served as the original source population for the global expansion of this invasive tapeworm. Data on the origin and distribution of the common carp, the only specific host of A. huronensis, are also discussed. Copyright © 2018 Elsevier B.V. All rights reserved.
Estimating phylogenetic trees from genome-scale data.
Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V
2015-12-01
The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.
The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture.
Caetano-Anollés, Gustavo; Kim, Hee Shin; Mittenthal, Jay E
2007-05-29
Metabolism represents a complex collection of enzymatic reactions and transport processes that convert metabolites into molecules capable of supporting cellular life. Here we explore the origins and evolution of modern metabolism. Using phylogenomic information linked to the structure of metabolic enzymes, we sort out recruitment processes and discover that most enzymatic activities were associated with the nine most ancient and widely distributed protein fold architectures. An analysis of newly discovered functions showed enzymatic diversification occurred early, during the onset of the modern protein world. Most importantly, phylogenetic reconstruction exercises and other evidence suggest strongly that metabolism originated in enzymes with the P-loop hydrolase fold in nucleotide metabolism, probably in pathways linked to the purine metabolic subnetwork. Consequently, the first enzymatic takeover of an ancient biochemistry or prebiotic chemistry was related to the synthesis of nucleotides for the RNA world.
Molecular analysis and genetic diversity of Aedes albopictus (Diptera, Culicidae) from China.
Ruiling, Zhang; Peien, Leng; Xuejun, Wang; Zhong, Zhang
2018-05-01
Aedes albopictus is one of the most invasive species, which can carry Dengue virus, Yellow fever virus and more than twenty arboviruses. Based on mitochondrial gene cytochrome c oxidase I (COI) and samples collected from 17 populations, we investigated the molecular character and genetic diversity of Ae. albopictus from China. Altogether, 25 haplotypes were detected, including 10 shared haplotypes and 15 private haplotypes. H1 was the dominant haplotype, which is widely distributed in 13 populations. Tajima'D value of most populations was significantly negative, demonstrating that populations experienced rapid range expansion recently. Most haplotypes clustered together both in phylogenetic and median-joining network analysis without clear phylogeographic patterns. However, neutrality tests revealed shallow divergences among Hainan and Guangxi with other populations (0.15599 ≤ F ST ≤ 0.75858), which probably due to interrupted gene flow, caused by geographical isolations. In conclusion, Ae. albopictus populations showed low genetic diversity in China.
CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis
2011-01-01
Background The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Results Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. Conclusions CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore. PMID:21496247
CDAO-store: ontology-driven data integration for phylogenetic analysis.
Chisham, Brandon; Wright, Ben; Le, Trung; Son, Tran Cao; Pontelli, Enrico
2011-04-15
The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore.
Characterization of a novel orthoreovirus isolated from fruit bat, China.
Hu, Tingsong; Qiu, Wei; He, Biao; Zhang, Yan; Yu, Jing; Liang, Xiu; Zhang, Wendong; Chen, Gang; Zhang, Yingguo; Wang, Yiyin; Zheng, Ying; Feng, Ziliang; Hu, Yonghe; Zhou, Weiguo; Tu, Changchun; Fan, Quanshui; Zhang, Fuqiang
2014-11-30
In recent years novel human respiratory disease agents have been described for Southeast Asia and Australia. The causative pathogens were classified as pteropine orthoreoviruses with a strong phylogenetic relationship to orthoreoviruses of bat origin. In this report, we isolated a novel Melaka-like reovirus (named "Cangyuan virus") from intestinal content samples of one fruit bat residing in China's Yunnan province. Phylogenetic analysis of the whole Cangyuan virus genome sequences of segments L, M and S demonstrated the genetic diversity of the Cangyuan virus. In contrast to the L and M segments, the phylogenetic trees for the S segments of Cangyuan virus demonstrated a greater degree of heterogeneity. Phylogenetic analysis indicated that the Cangyuan virus was a novel orthoreovirus and substantially different from currently known members of Pteropine orthoreovirus (PRV) species group.
The DIMA web resource--exploring the protein domain network.
Pagel, Philipp; Oesterheld, Matthias; Stümpflen, Volker; Frishman, Dmitrij
2006-04-15
Conserved domains represent essential building blocks of most known proteins. Owing to their role as modular components carrying out specific functions they form a network based both on functional relations and direct physical interactions. We have previously shown that domain interaction networks provide substantially novel information with respect to networks built on full-length protein chains. In this work we present a comprehensive web resource for exploring the Domain Interaction MAp (DIMA), interactively. The tool aims at integration of multiple data sources and prediction techniques, two of which have been implemented so far: domain phylogenetic profiling and experimentally demonstrated domain contacts from known three-dimensional structures. A powerful yet simple user interface enables the user to compute, visualize, navigate and download domain networks based on specific search criteria. http://mips.gsf.de/genre/proj/dima
Speciation network in Laurasiatheria: retrophylogenomic signals.
Doronina, Liliya; Churakov, Gennady; Kuritzin, Andrej; Shi, Jingjing; Baertsch, Robert; Clawson, Hiram; Schmitz, Jürgen
2017-06-01
Rapid species radiation due to adaptive changes or occupation of new ecospaces challenges our understanding of ancestral speciation and the relationships of modern species. At the molecular level, rapid radiation with successive speciations over short time periods-too short to fix polymorphic alleles-is described as incomplete lineage sorting. Incomplete lineage sorting leads to random fixation of genetic markers and hence, random signals of relationships in phylogenetic reconstructions. The situation is further complicated when you consider that the genome is a mosaic of ancestral and modern incompletely sorted sequence blocks that leads to reconstructed affiliations to one or the other relative, depending on the fixation of their shared ancestral polymorphic alleles. The laurasiatherian relationships among Chiroptera, Perissodactyla, Cetartiodactyla, and Carnivora present a prime example for such enigmatic affiliations. We performed whole-genome screenings for phylogenetically diagnostic retrotransposon insertions involving the representatives bat (Chiroptera), horse (Perissodactyla), cow (Cetartiodactyla), and dog (Carnivora), and extracted among 162,000 preselected cases 102 virtually homoplasy-free, phylogenetically informative retroelements to draw a complete picture of the highly complex evolutionary relations within Laurasiatheria. All possible evolutionary scenarios received considerable retrotransposon support, leaving us with a network of affiliations. However, the Cetartiodactyla-Carnivora relationship as well as the basal position of Chiroptera and an ancestral laurasiatherian hybridization process did exhibit some very clear, distinct signals. The significant accordance of retrotransposon presence/absence patterns and flanking nucleotide changes suggest an important influence of mosaic genome structures in the reconstruction of species histories. © 2017 Doronina et al.; Published by Cold Spring Harbor Laboratory Press.
Speciation network in Laurasiatheria: retrophylogenomic signals
Doronina, Liliya; Churakov, Gennady; Kuritzin, Andrej; Shi, Jingjing; Baertsch, Robert; Clawson, Hiram; Schmitz, Jürgen
2017-01-01
Rapid species radiation due to adaptive changes or occupation of new ecospaces challenges our understanding of ancestral speciation and the relationships of modern species. At the molecular level, rapid radiation with successive speciations over short time periods—too short to fix polymorphic alleles—is described as incomplete lineage sorting. Incomplete lineage sorting leads to random fixation of genetic markers and hence, random signals of relationships in phylogenetic reconstructions. The situation is further complicated when you consider that the genome is a mosaic of ancestral and modern incompletely sorted sequence blocks that leads to reconstructed affiliations to one or the other relative, depending on the fixation of their shared ancestral polymorphic alleles. The laurasiatherian relationships among Chiroptera, Perissodactyla, Cetartiodactyla, and Carnivora present a prime example for such enigmatic affiliations. We performed whole-genome screenings for phylogenetically diagnostic retrotransposon insertions involving the representatives bat (Chiroptera), horse (Perissodactyla), cow (Cetartiodactyla), and dog (Carnivora), and extracted among 162,000 preselected cases 102 virtually homoplasy-free, phylogenetically informative retroelements to draw a complete picture of the highly complex evolutionary relations within Laurasiatheria. All possible evolutionary scenarios received considerable retrotransposon support, leaving us with a network of affiliations. However, the Cetartiodactyla–Carnivora relationship as well as the basal position of Chiroptera and an ancestral laurasiatherian hybridization process did exhibit some very clear, distinct signals. The significant accordance of retrotransposon presence/absence patterns and flanking nucleotide changes suggest an important influence of mosaic genome structures in the reconstruction of species histories. PMID:28298429
Liyanage, Kapila K; Khan, Sehroon; Brooks, Siraprapa; Mortimer, Peter E; Karunarathna, Samantha C; Xu, Jianchu; Hyde, Kevin D
2018-01-01
Powdery mildew disease of rubber affects immature green leaves, buds, inflorescences, and other immature tissues of rubber trees, resulting in up to 45% losses in rubber latex yield worldwide. The disease is often controlled by dusting the diseased plants with powdered sulfur, which can have long-term negative effects on the environment. Therefore, it is necessary to search for alternative and environmentally friendly control methods for this disease. This study aimed to identify mycoparasites associated with rubber powdery mildew species, and characterize them on the basis of morpho-molecular characteristics and phylogenetic analyses of ITS rDNA regions. We observed that the Ampelomyces fungus parasitizes rubber powdery mildew, and eventually destroys it. Furthermore, on the basis of phylogenetic analyses and morphological characteristics we confirmed that the Ampelomyces mycoparasite isolated from rubber powdery mildew is closely related to other mycohost taxa in the Erysiphe genus. A total of 73 (71 retrieved from GenBank and two obtained from fresh collections of rubber powdery mildew fungi) Ampelomyces spp. were analyzed using ITS rDNA sequences and 153 polymorphic sites were identified through haplotypic analyses. A total of 28 haplotypes (H1-H28) were identified to have a complex network of mutation events. The results from phylogenetic tree constructed on the basis of maximum likelihood analyses, and the haplotype network tree revealed similar relationships of clustering pattern. This work presents the first report on morpho-molecular characterization of Ampelomyces species that are mycoparasites of powdery mildew of Hevea brasiliensis .
A Format for Phylogenetic Placements
Matsen, Frederick A.; Hoffman, Noah G.; Gallagher, Aaron; Stamatakis, Alexandros
2012-01-01
We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988
A format for phylogenetic placements.
Matsen, Frederick A; Hoffman, Noah G; Gallagher, Aaron; Stamatakis, Alexandros
2012-01-01
We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.
A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.
Rajan, Vaibhav
2013-03-01
Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, automated methods are necessary to handle the much larger data sets being prepared today. In this study, we introduce the concept of subsplits and demonstrate their use in extracting phylogenetic signal from alignments. We design a clustering approach for alignment masking where each cluster contains similar columns-similarity being defined on the basis of compatible subsplits; our approach then identifies noisy clusters and eliminates them. Trees inferred from the columns in the retained clusters are found to be topologically closer to the reference trees. We test our method on numerous standard benchmarks (both synthetic and biological data sets) and compare its performance with other methods of alignment masking. We find that our method can eliminate sites more accurately than other methods, particularly on divergent data, and can improve the topologies of the inferred trees in likelihood-based analyses. Software available upon request from the author.
Liang, Li-Jung; Weiss, Robert E; Redelings, Benjamin; Suchard, Marc A
2009-10-01
Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest. We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.
Mark T. Banik; Daniel L. Lindner; Yuko Ota; Tsutomu Hattori
2010-01-01
Relationships were investigated among North American and Japanese isolates of Laetiporus using phylogenetic analysis of ITS sequences and single-spore isolate incompatibility. Single-spore isolate pairings revealed no significant compatibility between North American and Japanese isolates. ITS analysis revealed 12 clades within the core ...
USDA-ARS?s Scientific Manuscript database
The southwestern United States has been incidentally affected by vesicular stomatitis virus (VSV) epidemics during the last 100 years. By the time this manuscript was written, the last episodes were reported in 2004-2006. Results of space clustering and phylogenetic analysis techniques used here sug...
Phylogenetic analysis of West Nile virus, Nuevo Leon State, Mexico.
Blitvich, Bradley J; Fernández-Salas, Ildefonso; Contreras-Cordero, Juan F; Loroño-Pino, María A; Marlenee, Nicole L; Díaz, Francisco J; González-Rojas, José I; Obregón-Martínez, Nelson; Chiu-García, Jorge A; Black, William C; Beaty, Barry J
2004-07-01
West Nile virus RNA was detected in brain tissue from a horse that died in June 2003 in Nuevo Leon State, Mexico. Nucleotide sequencing and phylogenetic analysis of the premembrane and envelope genes showed that the virus was most closely related to West Nile virus isolates collected in Texas in 2002.
Phylogenetic Analysis of West Nile Virus, Nuevo Leon State, Mexico
Blitvich, Bradley J.; Fernández-Salas, Ildefonso; Contreras-Cordero, Juan F.; Loroño-Pino, María A.; Marlenee, Nicole L.; Díaz, Francisco J.; González-Rojas, José I.; Obregón-Martínez, Nelson; Chiu-García, Jorge A.; Black, William C.
2004-01-01
West Nile virus RNA was detected in brain tissue from a horse that died in June 2003 in Nuevo Leon State, Mexico. Nucleotide sequencing and phylogenetic analysis of the premembrane and envelope genes showed that the virus was most closely related to West Nile virus isolates collected in Texas in 2002. PMID:15324558
Previous studies have shown that culture-based methods tend to underestimate the densities and diversity of bacterial populations inhabiting water distribution systems (WDS). In this study, the phylogenetic diversity of drinking water bacteria was assessed using sequence analysis...
Aisen, Santiago; Ramírez, Martín J
2015-08-06
We review the spider genus Oxysoma Nicolet, with most of its species endemic from the southern temperate forests in Chile and Argentina, and present a phylogenetic analysis including seven species, of which three are newly described in this study (O. macrocuspis new species, O. kuni new species, and O. losruiles new species, all from Chile), together with other 107 representatives of Anyphaenidae. New geographical records and distribution maps are provided for all species, with illustrations and reviewed diagnoses for the genus and the four previously known species (O. punctatum Nicolet, O. saccatum (Tullgren), O. longiventre (Nicolet) and O. itambezinho Ramírez). The phylogenetic analysis using cladistic methods is based on 264 previously defined characters plus one character that arises from this study. The three new species are closely related with Oxysoma longiventre, and this four species compose what we define as the Oxysoma longiventre species group. The phylogenetic analysis did not retrieve the monophyly of Oxysoma, which should be reevaluated in the future, together with the genus Tasata.
Loh, Jin Phang; Gao, Qiu Han Christine; Lee, Vernon J; Tetteh, Kevin; Drakeley, Chris
2016-01-01
INTRODUCTION Although there have been several phylogenetic studies on Plasmodium knowlesi (P. knowlesi), only cytochrome c oxidase subunit 1 (COX1) gene analysis has shown some geographical differentiation between the isolates of different countries. METHODS Phylogenetic analysis of locally acquired P. knowlesi infections, based on circumsporozoite, small subunit ribosomal ribonucleic acid (SSU rRNA), merozoite surface protein 1 and COX1 gene targets, was performed. The results were compared with the published sequences of regional isolates from Malaysia and Thailand. RESULTS Phylogenetic analysis of the circumsporozoite, SSU rRNA and merozoite surface protein 1 gene sequences for regional P. knowlesi isolates showed no obvious differentiation that could be attributed to their geographical origin. However, COX1 gene analysis showed that it was possible to differentiate between Singapore-acquired P. knowlesi infections and P. knowlesi infections from Peninsular Malaysia and Sarawak, Borneo, Malaysia. CONCLUSION The ability to differentiate between locally acquired P. knowlesi infections and imported P. knowlesi infections has important utility for the monitoring of P. knowlesi malaria control programmes in Singapore. PMID:26805667
Verdú, Miguel; Traveset, Anna
2004-02-01
Most studies using meta-analysis try to establish relationships between traits across taxa from interspecific databases and, thus, the phylogenetic relatedness among these taxa should be taken into account to avoid pseudoreplication derived from common ancestry. This paper illustrates, with a representative example of the relationship between seed size and the effect of frugivore's gut on seed germination, that meta-analytic procedures can also be phylogenetically corrected by means of the comparative method. The conclusions obtained in the meta-analytical and phylogenetical approaches are very different. The meta-analysis revealed that the positive effects that gut passage had on seed germination increased with seed size in the case of gut passage through birds whereas decreased in the case of gut passage through non-flying mammals. However, once the phylogenetic relatedness among plant species was taken into account, the effects of gut passage on seed germination did not depend on seed size and were similar between birds and non-flying mammals. Some methodological considerations are given to improve the bridge between the meta-analysis and the comparative method.
Loh, Jin Phang; Gao, Qiu Han Christine; Lee, Vernon J; Tetteh, Kevin; Drakeley, Chris
2016-12-01
Although there have been several phylogenetic studies on Plasmodium knowlesi (P. knowlesi), only cytochrome c oxidase subunit 1 (COX1) gene analysis has shown some geographical differentiation between the isolates of different countries. Phylogenetic analysis of locally acquired P. knowlesi infections, based on circumsporozoite, small subunit ribosomal ribonucleic acid (SSU rRNA), merozoite surface protein 1 and COX1 gene targets, was performed. The results were compared with the published sequences of regional isolates from Malaysia and Thailand. Phylogenetic analysis of the circumsporozoite, SSU rRNA and merozoite surface protein 1 gene sequences for regional P. knowlesi isolates showed no obvious differentiation that could be attributed to their geographical origin. However, COX1 gene analysis showed that it was possible to differentiate between Singapore-acquired P. knowlesi infections and P. knowlesi infections from Peninsular Malaysia and Sarawak, Borneo, Malaysia. The ability to differentiate between locally acquired P. knowlesi infections and imported P. knowlesi infections has important utility for the monitoring of P. knowlesi malaria control programmes in Singapore. Copyright: © Singapore Medical Association
Ali, Khalil H Al; El-Badry, Ayman A; Ali, Mouhanad Al; El-Sayed, Wael S M; El-Beshbishy, Hesham A
2016-06-01
Aedes aegypti is the main vector of the yellow fever and dengue virus. This mosquito has become the major indirect cause of morbidity and mortality of the human worldwide. Dengue virus activity has been reported recently in the western areas of Saudi Arabia. There is no vaccine for dengue virus until now, and the control of the disease depends on the control of the vector. The present study has aimed to perform phylogenetic analysis of Aedes aegypti based on mitochondrial NADH dehydrogenase subunit 4 ( ND4 ) gene at Almadinah, Saudi Arabia in order to get further insight into the epidemiology and transmission of this vector. Mitochondrial ND4 gene was sequenced in the eight isolated Aedes aegypti mosquitoes from Almadinah, Saudi Arabia, sequences were aligned, and phylogenetic analysis were performed and compared with 54 sequences of Aedes reported in the previous studies from Mexico, Thailand, Brazil, and Africa. Our results suggest that increased gene flow among Aedes aegypti populations occurs between Africa and Saudi Arabia. Phylogenetic relationship analysis showed two genetically distinct Aedes aegypti in Saudi Arabia derived from dual African ancestor.
Eukaryotic Protein Kinases (ePKs) of the Helminth Parasite Schistosoma mansoni
2011-01-01
Background Schistosomiasis remains an important parasitic disease and a major economic problem in many countries. The Schistosoma mansoni genome and predicted proteome sequences were recently published providing the opportunity to identify new drug candidates. Eukaryotic protein kinases (ePKs) play a central role in mediating signal transduction through complex networks and are considered druggable targets from the medical and chemical viewpoints. Our work aimed at analyzing the S. mansoni predicted proteome in order to identify and classify all ePKs of this parasite through combined computational approaches. Functional annotation was performed mainly to yield insights into the parasite signaling processes relevant to its complex lifestyle and to select some ePKs as potential drug targets. Results We have identified 252 ePKs, which corresponds to 1.9% of the S. mansoni predicted proteome, through sequence similarity searches using HMMs (Hidden Markov Models). Amino acid sequences corresponding to the conserved catalytic domain of ePKs were aligned by MAFFT and further used in distance-based phylogenetic analysis as implemented in PHYLIP. Our analysis also included the ePK homologs from six other eukaryotes. The results show that S. mansoni has proteins in all ePK groups. Most of them are clearly clustered with known ePKs in other eukaryotes according to the phylogenetic analysis. None of the ePKs are exclusively found in S. mansoni or belong to an expanded family in this parasite. Only 16 S. mansoni ePKs were experimentally studied, 12 proteins are predicted to be catalytically inactive and approximately 2% of the parasite ePKs remain unclassified. Some proteins were mentioned as good target for drug development since they have a predicted essential function for the parasite. Conclusions Our approach has improved the functional annotation of 40% of S. mansoni ePKs through combined similarity and phylogenetic-based approaches. As we continue this work, we will highlight the biochemical and physiological adaptations of S. mansoni in response to diverse environments during the parasite development, vector interaction, and host infection. PMID:21548963
Ecophenotypic plasticity leads to extraordinary gastropod shells found on the “Roof of the World”
Clewing, Catharina; Riedel, Frank; Wilke, Thomas; Albrecht, Christian
2015-01-01
The often extraordinary shell forms and shapes of gastropods found in palaeolakes, such as the highly diverse Gyraulus fauna of the famous Steinheim Basin, have been puzzling evolutionary biologists for centuries, and there is an ongoing debate whether these aberrant shell forms are indicative of true species (or subspecies) or ecophenotypic morphs. Interestingly, one of the Steinheim Gyraulus morphs – a corkscrew-like open-coiled shell – has a recent analogue in the Lake Bangong drainage system on the western Tibetan Plateau. Therefore, a combination of morphological, molecular, palaeolimnological, and ecological analyses was used in this study to assess whether the extraordinary shell shape in Gyraulus sp. from this drainage system represents a (young) ecophenotypic phenomenon or if it has been genetically fixed over an extended period of time. Our morphological, ecological, and palaeolimnological data suggest that the corkscrew-like specimens remain restricted to a small pond near Lake Bangong with an elevated pH value and that the colonization may have occurred recently. The phylogenetic reconstruction based on two gene fragments shows that these nonplanispiral specimens cluster within the previous described Tibetan Plateau Gyraulus clade N2. A network analysis indicates that some haplotypes are even shared by planispiral and nonplanispiral specimens. Given the ephemerality of the phenomenon, the compact network patterns inferred, the likely young phylogenetic age of the aberrant Gyraulus shells studied, and the ecological peculiarities of the study site, we suggest that the evolution of the aberrant shell forms on the Tibetan Plateau could likely be considered as a rapid ecophenotypic response, possibly induced by ecological stress. This finding may thus have implications for the ongoing debate about the processes that have caused the extraordinary shell diversity in palaeolakes such as the Steinheim Basin. PMID:26306180
Babes in the wood – a unique window into sea scorpion ontogeny
2013-01-01
Background Few studies on eurypterids have taken into account morphological changes that occur throughout postembryonic development. Here two species of eurypterid are described from the Pragian Beartooth Butte Formation of Cottonwood Canyon in Wyoming and included in a phylogenetic analysis. Both species comprise individuals from a number of instars, and this allows for changes that occur throughout their ontogeny to be documented, and how ontogenetically variable characters can influence phylogenetic analysis to be tested. Results The two species of eurypterid are described as Jaekelopterus howelli (Kjellesvig-Waering and Størmer, 1952) and Strobilopterus proteus sp. nov. Phylogenetic analysis places them within the Pterygotidae and Strobilopteridae respectively, both families within the Eurypterina. Jaekelopterus howelli shows positive allometry of the cheliceral denticles throughout ontogeny, while a number of characteristics including prosomal appendage length, carapace shape, lateral eye position, and relative breadth all vary during the growth of Strobilopterus proteus. Conclusions The ontogeny of Strobilopterus proteus shares much in common with that of modern xiphosurans, however certain characteristics including apparent true direct development suggest a closer affinity to arachnids. The ontogenetic development of the genital appendage also supports the hypothesis that the structure is homologous to the endopods of the trunk limbs of other arthropods. Including earlier instars in the phylogenetic analysis is shown to destabilise the retrieved topology. Therefore, coding juveniles as individual taxa in an analysis is shown to be actively detrimental and alternative ways of coding ontogenetic data into phylogenetic analyses should be explored. PMID:23663507
Hué, Stéphane; Buckton, Andrew J.; Myers, Richard E.; Duiculescu, Dan; Ene, Luminita; Oprea, Cristiana; Tardei, Gratiela; Rugina, Sorin; Mardarescu, Mariana; Floch, Corinne; Notheis, Gundula; Zöhrer, Bettina; Cane, Patricia A.; Pillay, Deenan
2012-01-01
Abstract In the late 1980s an HIV-1 epidemic emerged in Romania that was dominated by subtype F1. The main route of infection is believed to be parenteral transmission in children. We sequenced partial pol coding regions of 70 subtype F1 samples from children and adolescents from the PENTA-EPPICC network of which 67 were from Romania. Phylogenetic reconstruction using the sequences and other publically available global subtype F sequences showed that 79% of Romanian F1 sequences formed a statistically robust monophyletic cluster. The monophyletic cluster was epidemiologically linked to parenteral transmission in children. Coalescent-based analysis dated the origins of the parenteral epidemic to 1983 [1981–1987; 95% HPD]. The analysis also shows that the epidemic's effective population size has remained fairly constant since the early 1990s suggesting limited onward spread of the virus within the population. Furthermore, phylogeographic analysis suggests that the root location of the parenteral epidemic was Bucharest. PMID:22251065
Molecular Epidemiology of Autochthonous Dengue Virus Strains Circulating in Mexico ▿
Rivera-Osorio, Pilar; Vaughan, Gilberto; Ramírez-González, Jose Ernesto; Fonseca-Coronado, Salvador; Ruíz-Tovar, Karina; Cruz-Rivera, Mayra Yolanda; Ruíz-Pacheco, Juan Alberto; Vázquez-Pichardo, Mauricio; Carpio-Pedroza, Juan Carlos; Cázares, Fernando; Escobar-Gutiérrez, Alejandro
2011-01-01
Dengue virus (DENV) is the most important arthropod-borne viral infection in humans. Here, the genetic relatedness among autochthonous DENV Mexican isolates was assessed. Phylogenetic and median-joining network analyses showed that viral strains recovered from different geographic locations are genetically related and relatively homogeneous, exhibiting limited nucleotide diversity. PMID:21775538
Smith, Natalie; Power, Ultan F; McKillen, John
2018-05-29
To investigate the genetic diversity of porcine reproductive and respiratory syndrome virus (PRRSV) in Northern Ireland, the ORF5 gene from nine field isolates was sequenced and phylogenetically analysed. The results revealed relatively high diversity amongst isolates, with 87.6-92.2% identity between farms at the nucleotide level and 84.1-93.5% identity at the protein level. Phylogenetic analysis confirmed that all nine isolates belonged to the European (type 1) genotype and formed a cluster within the subtype 1 subgroup. This study provides the first report on PRRSV isolate diversity in Northern Ireland.
NASA Astrophysics Data System (ADS)
Gao, Fengtao; Wei, Min; Zhu, Ying; Guo, Hua; Chen, Songlin; Yang, Guanpin
2017-06-01
This study presents the complete mitochondrial genome of the hybrid Epinephelus moara♀× Epinephelus lanceolatus♂. The genome is 16886 bp in length, and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, a light-strand replication origin and a control region. Additionally, phylogenetic analysis based on the nucleotide sequences of 13 conserved protein-coding genes using the maximum likelihood method indicated that the mitochondrial genome is maternally inherited. This study presents genomic data for studying phylogenetic relationships and breeding of hybrid Epinephelinae.
NASA Astrophysics Data System (ADS)
Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.
2007-12-01
Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.
Basic Helix-Loop-Helix Transcription Factor Gene Family Phylogenetics and Nomenclature
Skinner, Michael K.; Rawls, Alan; Wilson-Rawls, Jeanne; Roalson, Eric H.
2010-01-01
A phylogenetic analysis of the basic helix-loop-helix (bHLH) gene superfamily was performed using seven different species (human, mouse, rat, worm, fly, yeast, and plant Arabidopsis) and involving over 600 bHLH genes [1]. All bHLH genes were identified in the genomes of the various species, including expressed sequence tags, and the entire coding sequence was used in the analysis. Nearly 15% of the gene family has been updated or added since the original publication. A super-tree involving six clades and all structural relationships was established and is now presented for four of the species. The wealth of functional data available for members of the bHLH gene superfamily provides us with the opportunity to use this exhaustive phylogenetic tree to predict potential functions of uncharacterized members of the family. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique elements of the evolution and functional relationships of the different genes in the bHLH gene family. PMID:20219281
galaxie--CGI scripts for sequence identification through automated phylogenetic analysis.
Nilsson, R Henrik; Larsson, Karl-Henrik; Ursing, Björn M
2004-06-12
The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se
Bellanger, J-M; Moreau, P-A; Corriol, G; Bidaud, A; Chalange, R; Dudova, Z; Richard, F
2015-04-01
During the last two decades, the unprecedented development of molecular phylogenetic tools has propelled an opportunity to revisit the fungal kingdom under an evolutionary perspective. Mycology has been profoundly changed but a sustained effort to elucidate large sections of the astonishing fungal diversity is still needed. Here we fill this gap in the case of Lyophyllaceae, a species-rich and ecologically diversified family of mushrooms. Assembly and genealogical concordance multigene phylogenetic analysis of a large dataset that includes original, vouchered material from expert field mycologists reveal the phylogenetic topology of the family, from higher (generic) to lower (species) levels. A comparative analysis of the most widely used phylogenetic markers in Fungi indicates that the nuc rDNA region encompassing the internal transcribed spacers 1 and 2, along with the 5.8S rDNA (ITS) and portions of the genes for RNA polymerase II second largest subunit (RPB2) is the most performing combination to resolve the broadest range of taxa within Lyophyllaceae. Eleven distinct evolutionary lineages are identified, that display partial overlap with traditional genera as well as with the phylogenetic framework previously proposed for the family. Eighty phylogenetic species are delineated, which shed light on a large number of morphological concepts, including rare and poorly documented ones. Probing these novel phylogenetic species to the barcoding method of species limit delineation, indicates that the latter method fully resolves Lyophyllaceae species, except in one clade. This case study provides the first comprehensive phylogenetic overview of Lyophyllaceae, a necessary step towards a taxonomical, ecological and nomenclatural revision of this family of mushrooms. It also proposes a set of methodological guidelines that may be of relevance for future taxonomic works in other groups of Fungi.
Phylogenetic analysis of the envelope protein (domain lll) of dengue 4 viruses
Mota, Javier; Ramos-Castañeda, José; Rico-Hesse, Rebeca; Ramos, Celso
2011-01-01
Objective To evaluate the genetic variability of domain III of envelope (E) protein and to estimate phylogenetic relationships of dengue 4 (Den-4) viruses isolated in Mexico and from other endemic areas of the world. Material and Methods A phylogenetic study of domain III of envelope (E) protein of Den-4 viruses was conducted in 1998 using virus strains from Mexico and other parts of the world, isolated in different years. Specific primers were used to amplify by RT-PCR the domain III and to obtain nucleotide sequence. Based on nucleotide and deduced aminoacid sequence, genetic variability was estimated and a phylogenetic tree was generated. To make an easy genetic analysis of domain III region, a Restriction Fragment Length Polymorphism (RFLP) assay was performed, using six restriction enzymes. Results Study results demonstrate that nucleotide and aminoacid sequence analysis of domain III are similar to those reported from the complete E protein gene. Based on the RFLP analysis of domain III using the restriction enzymes Nla III, Dde I and Cfo I, Den-4 viruses included in this study were clustered into genotypes 1 and 2 previously reported. Conclusions Study results suggest that domain III may be used as a genetic marker for phylogenetic and molecular epidemiology studies of dengue viruses. The English version of this paper is available too at: http://www.insp.mx/salud/index.html PMID:12132320
GENOME-WIDE COMPARATIVE ANALYSIS OF PHYLOGENETIC TREES: THE PROKARYOTIC FOREST OF LIFE
Puigbò, Pere; Wolf, Yuri I.; Koonin, Eugene V.
2013-01-01
Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance (SD) method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the applications methods used to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a ‘species tree’. PMID:22399455
Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life.
Puigbò, Pere; Wolf, Yuri I; Koonin, Eugene V
2012-01-01
Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."
Goggin, C L; Barker, S C
1993-07-01
Parasites of the genus Perkinsus destroy marine molluscs worldwide. Their phylogenetic position within the kingdom Protista is controversial. Nucleotide sequence data (1792 bp) from the small subunit rRNA gene of Perkinsus sp. from Anadara trapezia (Mollusca: Bivalvia) from Moreton Bay, Queensland, was used to examine the phylogenetic affinities of this enigmatic genus. These data were aligned with nucleotide sequences from 6 apicomplexans, 3 ciliates, 3 flagellates, a dinoflagellate, 3 fungi, maize and human. Phylogenetic trees were constructed after analysis with maximum parsimony and distance matrix methods. Our analyses indicate that Perkinsus is phylogenetically closer to dinoflagellates and to coccidean and piroplasm apicomplexans than to fungi or flagellates.
Benedek, Tibor; Táncsics, András; Szabó, István; Farkas, Milán; Szoboszlay, Sándor; Fábián, Krisztina; Maróti, Gergely; Kriszt, Balázs
2016-05-01
Pump and treat systems are widely used for hydrocarbon-contaminated groundwater remediation. Although biofouling (formation of clogging biofilms on pump surfaces) is a common problem in these systems, scarce information is available regarding the phylogenetic and functional complexity of such biofilms. Extensive information about the taxa and species as well as metabolic potential of a bacterial biofilm developed on the stainless steel surface of a pump submerged in a gasoline-contaminated hypoxic groundwater is presented. Results shed light on a complex network of interconnected hydrocarbon-degrading chemoorganotrophic and chemolitotrophic bacteria. It was found that besides the well-known hydrocarbon-degrading aerobic/facultative anaerobic biofilm-forming organisms (e.g., Azoarcus, Leptothrix, Acidovorax, Thauera, Pseudomonas, etc.), representatives of Fe(2+)-and Mn(2+)-oxidizing (Thiobacillus, Sideroxydans, Gallionella, Rhodopseudomonas, etc.) as well as of Fe(3+)- and Mn(4+)-respiring (Rhodoferax, Geobacter, Magnetospirillum, Sulfurimonas, etc.) bacteria were present in the biofilm. The predominance of β-Proteobacteria within the biofilm bacterial community in phylogenetic and functional point of view was revealed. Investigation of meta-cleavage dioxygenase and benzylsuccinate synthase (bssA) genes indicated that within the biofilm, Azoarcus, Leptothrix, Zoogloea, and Thauera species are most probably involved in intrinsic biodegradation of aromatic hydrocarbons. Polyphasic analysis of the biofilm shed light on the fact that subsurface microbial accretions might be reservoirs of novel putatively hydrocarbon-degrading bacterial species. Moreover, clogging biofilms besides their detrimental effects might supplement the efficiency of pump and treat systems.
Effect of Wnt3a on Keratinocytes Utilizing in Vitro and Bioinformatics Analysis
Nam, Ju-Suk; Chakraborty, Chiranjib; Sharma, Ashish Ranjan; Her, Young; Bae, Kee-Jeong; Sharma, Garima; Doss, George Priya; Lee, Sang-Soo; Hong, Myung-Sun; Song, Dong-Keun
2014-01-01
Wingless-type (Wnt) signaling proteins participate in various cell developmental processes. A suppressive role of Wnt5a on keratinocyte growth has already been observed. However, the role of other Wnt proteins in proliferation and differentiation of keratinocytes remains unknown. Here, we investigated the effects of the Wnt ligand, Wnt3a, on proliferation and differentiation of keratinocytes. Keratinocytes from normal human skin were cultured and treated with recombinant Wnt3a alone or in combination with the inflammatory cytokine, tumor necrosis factor α (TNFα). Furthermore, using bioinformatics, we analyzed the biochemical parameters, molecular evolution, and protein–protein interaction network for the Wnt family. Application of recombinant Wnt3a showed an anti-proliferative effect on keratinocytes in a dose-dependent manner. After treatment with TNFα, Wnt3a still demonstrated an anti-proliferative effect on human keratinocytes. Exogenous treatment of Wnt3a was unable to alter mRNA expression of differentiation markers of keratinocytes, whereas an altered expression was observed in TNFα-stimulated keratinocytes. In silico phylogenetic, biochemical, and protein–protein interaction analysis showed several close relationships among the family members of the Wnt family. Moreover, a close phylogenetic and biochemical similarity was observed between Wnt3a and Wnt5a. Finally, we proposed a hypothetical mechanism to illustrate how the Wnt3a protein may inhibit the process of proliferation in keratinocytes, which would be useful for future researchers. PMID:24686518
Hesamizadeh, Khashayar; Alavian, Seyed Moayed; Najafi Tireh Shabankareh, Azar; Sharafi, Heidar
2016-12-01
Hepatitis C virus (HCV) is characterized by a high degree of genetic heterogeneity and classified into 7 genotypes and different subtypes. It heterogeneously distributed through various risk groups and geographical regions. A well-established phylogenetic relationship can simplify the tracing of HCV hierarchical strata into geographical regions. The current study aimed to find genetic phylogeny of subtypes 1a and 1b of HCV isolates based on NS5B nucleotide sequences in Iran and other members of Eastern Mediterranean regional office of world health organization, as well as other Middle Eastern countries, with a systematic review of available published and unpublished studies. The phylogenetic analyses were performed based on the nucleotide sequences of NS5B gene of HCV genotype 1 (HCV-1), which were registered in the GenBank database. The literature review was performed in two steps: 1) searching studies evaluating the NS5B sequences of HCV-1, on PubMed, Scopus, and Web of Science, and 2) Searching sequences of unpublished studies registered in the GenBank database. In this study, 442 sequences from HCV-1a and 232 from HCV-1b underwent phylogenetic analysis. Phylogenetic analysis of all sequences revealed different clusters in the phylogenetic trees. The results showed that the proportion of HCV-1a and -1b isolates from Iranian patients probably originated from domestic sources. Moreover, the HCV-1b isolates from Iranian patients may have similarities with the European ones. In this study, phylogenetic reconstruction of HCV-1 sequences clearly indicated for molecular tracing and ancestral relationships of the HCV genotypes in Iran, and showed the likelihood of domestic origin for HCV-1a and various origin for HCV-1b.
Ast, Jennifer C; Dunlap, Paul V
2005-10-01
Substantial ambiguity exists regarding the phylogenetic status of facultatively psychrophilic luminous bacteria identified as Photobacterium phosphoreum, a species thought to be widely distributed in the world's oceans and believed to be the specific bioluminescent light-organ symbiont of several deep-sea fishes. Members of the P. phosphoreum species group include luminous and non-luminous strains identified phenotypically from a variety of different habitats as well as phylogenetically defined lineages that appear to be evolutionarily distinct. To resolve this ambiguity and to begin developing a meaningful knowledge of the geographic distributions, habitats and symbiotic relationships of bacteria in the P. phosphoreum species group, we carried out a multilocus, fine-scale phylogenetic analysis based on sequences of the 16S rRNA, gyrB and luxABFE genes of many newly isolated luminous strains from symbiotic and saprophytic habitats, together with previously isolated luminous and non-luminous strains identified as P. phosphoreum from these and other habitats. Parsimony analysis unambiguously resolved three evolutionarily distinct clades, phosphoreum, iliopiscarium and kishitanii. The tight phylogenetic clustering within these clades and the distinct separation between them indicates they are different species, P. phosphoreum, Photobacterium iliopiscarium and the newly recognized 'Photobacterium kishitanii'. Previously reported non-luminous strains, which had been identified phenotypically as P. phosphoreum, resolved unambiguously as P. iliopiscarium, and all examined deep-sea fishes (specimens of families Chlorophthalmidae, Macrouridae, Moridae, Trachichthyidae and Acropomatidae) were found to harbour 'P. kishitanii', not P. phosphoreum, in their light organs. This resolution revealed also that 'P. kishitanii' is cosmopolitan in its geographic distribution. Furthermore, the lack of phylogenetic variation within 'P. kishitanii' indicates that this facultatively symbiotic bacterium is not cospeciating with its phylogenetically divergent host fishes. The results of this fine-scale phylogenetic analysis support the emerging view that bacterial species names should designate singular historical entities, i.e. discrete lineages diagnosed by a significant divergence of shared derived nucleotide characters.
Wolf, Y I; Aravind, L; Grishin, N V; Koonin, E V
1999-08-01
Phylogenetic analysis of aminoacyl-tRNA synthetases (aaRSs) of all 20 specificities from completely sequenced bacterial, archaeal, and eukaryotic genomes reveals a complex evolutionary picture. Detailed examination of the domain architecture of aaRSs using sequence profile searches delineated a network of partially conserved domains that is even more elaborate than previously suspected. Several unexpected evolutionary connections were identified, including the apparent origin of the beta-subunit of bacterial GlyRS from the HD superfamily of hydrolases, a domain shared by bacterial AspRS and the B subunit of archaeal glutamyl-tRNA amidotransferases, and another previously undetected domain that is conserved in a subset of ThrRS, guanosine polyphosphate hydrolases and synthetases, and a family of GTPases. Comparison of domain architectures and multiple alignments resulted in the delineation of synapomorphies-shared derived characters, such as extra domains or inserts-for most of the aaRSs specificities. These synapomorphies partition sets of aaRSs with the same specificity into two or more distinct and apparently monophyletic groups. In conjunction with cluster analysis and a modification of the midpoint-rooting procedure, this partitioning was used to infer the likely root position in phylogenetic trees. The topologies of the resulting rooted trees for most of the aaRSs specificities are compatible with the evolutionary "standard model" whereby the earliest radiation event separated bacteria from the common ancestor of archaea and eukaryotes as opposed to the two other possible evolutionary scenarios for the three major divisions of life. For almost all aaRSs specificities, however, this simple scheme is confounded by displacement of some of the bacterial aaRSs by their eukaryotic or, less frequently, archaeal counterparts. Displacement of ancestral eukaryotic aaRS genes by bacterial ones, presumably of mitochondrial origin, was observed for three aaRSs. In contrast, there was no convincing evidence of displacement of archaeal aaRSs by bacterial ones. Displacement of aaRS genes by eukaryotic counterparts is most common among parasitic and symbiotic bacteria, particularly the spirochaetes, in which 10 of the 19 aaRSs seem to have been displaced by the respective eukaryotic genes and two by the archaeal counterpart. Unlike the primary radiation events between the three main divisions of life, that were readily traceable through the phylogenetic analysis of aaRSs, no consistent large-scale bacterial phylogeny could be established. In part, this may be due to additional gene displacement events among bacterial lineages. Argument is presented that, although lineage-specific gene loss might have contributed to the evolution of some of the aaRSs, this is not a viable alternative to horizontal gene transfer as the principal evolutionary phenomenon in this gene class.
Cai, J; Collins, M D
1994-04-01
The 16S rRNA gene sequence of Melissococcus pluton, the causative agent of European foulbrood disease, was determined in order to investigate the phylogenetic relationships between this organism and other low-G + C-content gram-positive bacteria. A comparative sequence analysis revealed that M. pluton is a close phylogenetic relative of the genus Enterococcus.
Cunningham, Evan; Jacka, Brendan; DeBeck, Kora; Applegate, Tanya A; Harrigan, P. Richard; Krajden, Mel; Marshall, Brandon DL; Montaner, Julio; Lima, Viviane Dias; Olmstead, Andrea; Milloy, M-J; Wood, Evan; Grebely, Jason
2015-01-01
Background Among prospective cohorts of people who inject drugs (PWID), phylogenetic clustering of HCV infection has been observed. However, the majority of studies have included older PWID, representing distant transmission events. The aim of this study was to investigate phylogenetic clustering of HCV infection among a cohort of street-involved youth. Methods Data were derived from a prospective cohort of street-involved youth aged 14–26 recruited between 2005 and 2012 in Vancouver, Canada (At Risk Youth Study, ARYS). HCV RNA testing and sequencing (Core-E2) were performed on HCV positive participants. Phylogenetic trees were inferred using maximum likelihood methods and clusters were identified using ClusterPicker (Core-E2 without HVR1, 90% bootstrap threshold, 0.05 genetic distance threshold). Results Among 945 individuals enrolled in ARYS, 16% (n=149, 100% recent injectors) were HCV antibody positive at baseline interview (n=86) or seroconverted during follow-up (n=63). Among HCV antibody positive participants with available samples (n=131), 75% (n=98) had detectable HCV RNA and 66% (n=65, mean age 23, 58% with recent methamphetamine injection, 31% female, 3% HIV+) had available Core-E2 sequences. Of those with Core-E2 sequence, 14% (n=9) were in a cluster (one cluster of three) or pair (two pairs), with all reporting recent methamphetamine injection. Recent methamphetamine injection was associated with membership in a cluster or pair (P=0.009). Conclusion In this study of street-involved youth with HCV infection and recent injecting, 14% demonstrated phylogenetic clustering. Phylogenetic clustering was associated with recent methamphetamine injection, suggesting that methamphetamine drug injection may play an important role in networks of HCV transmission. PMID:25977204
Caruso, Claudio; Dondo, Alessandro; Cerutti, Francesco; Masoero, Loretta; Rosamilia, Alfonso; Zoppi, Simona; D'Errico, Valeria; Grattarola, Carla; Acutis, Pier Luigi; Peletto, Simone
2014-07-01
We describe Aujeszky's disease in a female of red fox (Vulpes vulpes). Although wild boar (Sus scrofa) would be the expected source of infection, phylogenetic analysis suggested a domestic rather than a wild source of virus, underscoring the importance of biosecurity measures in pig farms to prevent contact with wild animals.
Isolation and Phylogenetic Analysis of Sindbis Viruses from Mosquitoes in Germany ▿
Jöst, Hanna; Bialonski, Alexandra; Storch, Volker; Günther, Stephan; Becker, Norbert; Schmidt-Chanasit, Jonas
2010-01-01
A molecular survey of 16,057 mosquitoes captured in Southwest Germany during the summer of 2009 demonstrated the presence of Sindbis virus (SINV) in Culex spp. and Anopheles maculipennis sensu lato. Phylogenetic analysis of the German SINV strains linked them with Swedish SINV strains, the causative agent of Ockelbo disease in humans. PMID:20335414
Easy-to-use phylogenetic analysis system for hepatitis B virus infection.
Sugiyama, Masaya; Inui, Ayano; Shin-I, Tadasu; Komatsu, Haruki; Mukaide, Motokazu; Masaki, Naohiko; Murata, Kazumoto; Ito, Kiyoaki; Nakanishi, Makoto; Fujisawa, Tomoo; Mizokami, Masashi
2011-10-01
The molecular phylogenetic analysis has been broadly applied to clinical and virological study. However, the appropriate settings and application of calculation parameters are difficult for non-specialists of molecular genetics. In the present study, the phylogenetic analysis tool was developed for the easy determination of genotypes and transmission route. A total of 23 patients of 10 families infected with hepatitis B virus (HBV) were enrolled and expected to undergo intrafamilial transmission. The extracted HBV DNA were amplified and sequenced in a region of the S gene. The software to automatically classify query sequence was constructed and installed on the Hepatitis Virus Database (HVDB). Reference sequences were retrieved from HVDB, which contained major genotypes from A to H. Multiple-alignments using CLUSTAL W were performed before the genetic distance matrix was calculated with the six-parameter method. The phylogenetic tree was output by the neighbor-joining method. User interface using WWW-browser was also developed for intuitive control. This system was named as the easy-to-use phylogenetic analysis system (E-PAS). Twenty-three sera of 10 families were analyzed to evaluate E-PAS. The queries obtained from nine families were genotype C and were located in one cluster per family. However, one patient of a family was classified into the cluster different from her family, suggesting that E-PAS detected the sample distinct from that of her family on the transmission route. The E-PAS to output phylogenetic tree was developed since requisite material was sequence data only. E-PAS could expand to determine HBV genotypes as well as transmission routes. © 2011 The Japan Society of Hepatology.
NASA Astrophysics Data System (ADS)
Kuo, Chun Wei; Hao Huang, Kuan; Hsu, Bing Mu; Tsai, Hsien Lung; Tseng, Shao Feng; Kao, Po Min; Shen, Shu Min; Chou Chiu, Yi; Chen, Jung Sheng
2013-04-01
Salmonella is one of the most important pathogens of waterborne diseases with outbreaks from contaminated water reported worldwide. In addition, Salmonella spp. can survive for long periods in aquatic environments. To realize genotypes and serovars of Salmonella in aquatic environments, we isolated the Salmonella strains by selective culture plates to identify the serovars of Salmonella by serological assay, and identify the genotypes by Multilocus sequence typing (MLST) based on the sequence data from University College Cork (UCC), respectively. The results show that 36 stream water samples (30.1%) and 18 drinking water samples (23.3%) were confirmed the existence of Salmonella using culture method combined PCR specific invA gene amplification. In this study, 24 cultured isolates of Salmonella from water samples were classified to fifteen Salmonella enterica serovars. In addition, we construct phylogenetic analysis using phylogenetic tree and Minimum spanning tree (MST) method to analyze the relationship of clinical, environmental, and geographical data. Phylogenetic tree showed that four main clusters and our strains can be distributed in all. The genotypes of isolates from stream water are more biodiversity while comparing the Salmonella strains genotypes from drinking water sources. According to MST data, we can found the positive correlation between serovars and genotypes of Salmonella. Previous studies revealed that the result of Pulsed field gel electrophoresis (PFGE) method can predict the serovars of Salmonella strain. Hence, we used the MLST data combined phylogenetic analysis to identify the serovars of Salmonella strain and achieved effectiveness. While using the geographical data combined phylogenetic analysis, the result showed that the dominant strains were existed in whole stream area in rainy season. Keywords: Salmonella spp., MLST, phylogenetic analysis, PFGE
Nalbantoglu, Sinem; Abu-Asab, Mones; Tan, Ming; Zhang, Xuemin; Cai, Ling; Amri, Hakima
2016-07-01
Pancreatic ductal adenocarcinoma (PDAC) is one of the rapidly growing forms of pancreatic cancer with a poor prognosis and less than 5% 5-year survival rate. In this study, we characterized the genetic signatures and signaling pathways related to survival from PDAC, using a parsimony phylogenetic algorithm. We applied the parsimony phylogenetic algorithm to analyze the publicly available whole-genome in silico array analysis of a gene expression data set in 25 early-stage human PDAC specimens. We explain here that the parsimony phylogenetics is an evolutionary analytical method that offers important promise to uncover clonal (driver) and nonclonal (passenger) aberrations in complex diseases. In our analysis, parsimony and statistical analyses did not identify significant correlations between survival times and gene expression values. Thus, the survival rankings did not appear to be significantly different between patients for any specific gene (p > 0.05). Also, we did not find correlation between gene expression data and tumor stage in the present data set. While the present analysis was unable to identify in this relatively small sample of patients a molecular signature associated with pancreatic cancer prognosis, we suggest that future research and analyses with the parsimony phylogenetic algorithm in larger patient samples are worthwhile, given the devastating nature of pancreatic cancer and its early diagnosis, and the need for novel data analytic approaches. The future research practices might want to place greater emphasis on phylogenetics as one of the analytical paradigms, as our findings presented here are on the cusp of this shift, especially in the current era of Big Data and innovation policies advocating for greater data sharing and reanalysis.
A Deliberate Practice Approach to Teaching Phylogenetic Analysis
Hobbs, F. Collin; Johnson, Daniel J.; Kearns, Katherine D.
2013-01-01
One goal of postsecondary education is to assist students in developing expert-level understanding. Previous attempts to encourage expert-level understanding of phylogenetic analysis in college science classrooms have largely focused on isolated, or “one-shot,” in-class activities. Using a deliberate practice instructional approach, we designed a set of five assignments for a 300-level plant systematics course that incrementally introduces the concepts and skills used in phylogenetic analysis. In our assignments, students learned the process of constructing phylogenetic trees through a series of increasingly difficult tasks; thus, skill development served as a framework for building content knowledge. We present results from 5 yr of final exam scores, pre- and postconcept assessments, and student surveys to assess the impact of our new pedagogical materials on student performance related to constructing and interpreting phylogenetic trees. Students improved in their ability to interpret relationships within trees and improved in several aspects related to between-tree comparisons and tree construction skills. Student feedback indicated that most students believed our approach prepared them to engage in tree construction and gave them confidence in their abilities. Overall, our data confirm that instructional approaches implementing deliberate practice address student misconceptions, improve student experiences, and foster deeper understanding of difficult scientific concepts. PMID:24297294
DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.
Kelly, Steven; Maini, Philip K
2013-01-01
The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.
Phylogenetic Information Content of Copepoda Ribosomal DNA Repeat Units: ITS1 and ITS2 Impact
Zagoskin, Maxim V.; Lazareva, Valentina I.; Grishanin, Andrey K.; Mukha, Dmitry V.
2014-01-01
The utility of various regions of the ribosomal repeat unit for phylogenetic analysis was examined in 16 species representing four families, nine genera, and two orders of the subclass Copepoda (Crustacea). Fragments approximately 2000 bp in length containing the ribosomal DNA (rDNA) 18S and 28S gene fragments, the 5.8S gene, and the internal transcribed spacer regions I and II (ITS1 and ITS2) were amplified and analyzed. The DAMBE (Data Analysis in Molecular Biology and Evolution) software was used to analyze the saturation of nucleotide substitutions; this test revealed the suitability of both the 28S gene fragment and the ITS1/ITS2 rDNA regions for the reconstruction of phylogenetic trees. Distance (minimum evolution) and probabilistic (maximum likelihood, Bayesian) analyses of the data revealed that the 28S rDNA and the ITS1 and ITS2 regions are informative markers for inferring phylogenetic relationships among families of copepods and within the Cyclopidae family and associated genera. Split-graph analysis of concatenated ITS1/ITS2 rDNA regions of cyclopoid copepods suggested that the Mesocyclops, Thermocyclops, and Macrocyclops genera share complex evolutionary relationships. This study revealed that the ITS1 and ITS2 regions potentially represent different phylogenetic signals. PMID:25215300
Incorporating evolutionary history into conservation planning in biodiversity hotspots.
Buerki, Sven; Callmander, Martin W; Bachman, Steven; Moat, Justin; Labat, Jean-Noël; Forest, Félix
2015-02-19
There is increased evidence that incorporating evolutionary history directly in conservation actions is beneficial, particularly given the likelihood that extinction is not random and that phylogenetic diversity (PD) is lost at higher rates than species diversity. This evidence is even more compelling in biodiversity hotspots, such as Madagascar, where less than 10% of the original vegetation remains. Here, we use the Leguminosae, an ecologically and economically important plant family, and a combination of phylogenetics and species distribution modelling, to assess biodiversity patterns and identify regions, coevolutionary processes and ecological factors that are important in shaping this diversity, especially during the Quaternary. We show evidence that species distribution and community PD are predicted by watershed boundaries, which enable the identification of a network of refugia and dispersal corridors that were perhaps important for maintaining community integrity during past climate change. Phylogenetically clustered communities are found in the southwest of the island at low elevation and share a suite of morphological characters (especially fruit morphology) indicative of coevolution with their main dispersers, the extinct and extant lemurs. Phylogenetically over-dispersed communities are found along the eastern coast at sea level and may have resulted from many independent dispersal events from the drier and more seasonal regions of Madagascar. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Metatranscriptome analysis of the microbial fermentation of dietary milk proteins in the murine gut.
Hugenholtz, Floor; Davids, Mark; Schwarz, Jessica; Müller, Michael; Tomé, Daniel; Schaap, Peter; Hooiveld, Guido J E J; Smidt, Hauke; Kleerebezem, Michiel
2018-01-01
Undigestible food ingredients are converted by the microbiota into a large range of metabolites, predominated by short chain fatty acids (SCFA). These microbial metabolites are subsequently available for absorption by the host mucosa and can serve as an energy source. Amino acids fermentation by the microbiota expands the spectrum of fermentation end-products beyond acetate, propionate and butyrate, to include in particular branched-SCFA. Here the long-term effects of high protein-diets on microbial community composition and functionality in mice were analyzed. Determinations of the microbiota composition using phylogenetic microarray (MITChip) technology were complemented with metatranscriptome and SCFA analyses to obtain insight in in situ expression of protein fermentation pathways and the phylogenetic groups involved. High protein diets led to increased luminal concentrations of branched-SCFA, in accordance with protein fermentation in the gut. Bacteria dominantly participating in protein catabolism belonged to the Lachnospiraceae, Erysipelotrichaceae and Clostridiaceae families in both normal- and high- protein diet regimes. This study identifies the microbial groups involved in protein catabolism in the intestine and underpins the value of in situ metatranscriptome analyses as an approach to decipher locally active metabolic networks and pathways as a function of the dietary regime, as well as the phylogeny of the microorganisms executing them.
Novosel, D; Tuboly, T; Csagola, A; Lorincz, M; Cubric-Curik, V; Jungic, A; Curik, I; Segalés, J; Cortey, M; Lipej, Z
2014-04-26
Porcine circovirus type 2 (PCV2) causes some of the most significant economic losses in pig production. Several multisystemic syndromes have been attributed to PCV2 infection, which are known as PCV2-associated diseases (PCVDs). This study investigated the origin and evolution of PCV2 sequences in domestic pigs and wild boars affected by PCVDs in Croatia. Viral sequences were recovered from three wild boars diagnosed with PCV2-systemic disease (PCV2-SD), 63 fetuses positive for PCV2 DNA as determined by PCR, 14 domestic pigs affected with PCV2-SD (displaying severe interstitial nephritis) and five domestic pigs with proliferative and necrotising pneumonia. Seventeen complete PCV2 genomes were recovered. Phylogenetic and evolutionary analyses based on median-joining phylogenetic networks, amino acid alignments and principal coordinate analysis were performed using complete genomes, as well as complete and partial ORF sequences for ORF1 and ORF2. Two of the 17 PCV2 sequences belonged to PCV2a, 14 to PCV2b and one was unclustered. PCV2b was the predominant genotype in Croatia and has been linked to international trade as a route of introduction. Correlation between particular viral strains with PCVDs is lacking.
Phylogeny-dominant classification of J-proteins in Arabidopsis thaliana and Brassica oleracea.
Zhang, Bin; Qiu, Han-Lin; Qu, Dong-Hai; Ruan, Ying; Chen, Dong-Hong
2018-04-05
Hsp40s or DnaJ/J-proteins are evolutionarily conserved in all organisms as co-chaperones of molecular chaperone HSP70s that mainly participate in maintaining cellular protein homeostasis, such as protein folding, assembly, stabilization, and translocation under normal conditions as well as refolding and degradation under environmental stresses. It has been reported that Arabidopsis J-proteins are classified into four classes (types A-D) according to domain organization, but their phylogenetic relationships are unknown. Here, we identified 129 J-proteins in the world-wide popular vegetable Brassica oleracea, a close relative of the model plant Arabidopsis, and also revised the information of Arabidopsis J-proteins based on the latest online bioresources. According to phylogenetic analysis with domain organization and gene structure as references, the J-proteins from Arabidopsis and B. oleracea were classified into 15 main clades (I-XV) separated by a number of undefined small branches with remote relationship. Based on the number of members, they respectively belong to multigene clades, oligo-gene clades, and mono-gene clades. The J-protein genes from different clades may function together or separately to constitute a complicated regulatory network. This study provides a constructive viewpoint for J-protein classification and an informative platform for further functional dissection and resistant genes discovery related to genetic improvement of crop plants.
Phenotypic integration emerges from aposematism and scale in poison frogs
Santos, Juan C.; Cannatella, David C.
2011-01-01
Complex phenotypes can be modeled as networks of component traits connected by genetic, developmental, or functional interactions. Aposematism, which has evolved multiple times in poison frogs (Dendrobatidae), links a warning signal to a chemical defense against predators. Other traits are involved in this complex phenotype. Most aposematic poison frogs are ant specialists, from which they sequester defensive alkaloids. We found that aposematic species have greater aerobic capacity, also related to diet specialization. To characterize the aposematic trait network more fully, we analyzed phylogenetic correlations among its hypothesized components: conspicuousness, chemical defense, diet specialization, body mass, active and resting metabolic rates, and aerobic scope. Conspicuous coloration was correlated with all components except resting metabolism. Structural equation modeling on the basis of trait correlations recovered “aposematism” as one of two latent variables in an integrated phenotypic network, the other being scaling with body mass and physiology (“scale”). Chemical defense and diet specialization were uniquely tied to aposematism whereas conspicuousness was related to scale. The phylogenetic distribution of the aposematic syndrome suggests two scenarios for its evolution: (i) chemical defense and conspicuousness preceded greater aerobic capacity, which supports the increased resource-gathering abilities required of ant–mite diet specialization; and (ii) assuming that prey are patchy, diet specialization and greater aerobic capacity evolved in tandem, and both traits subsequently facilitated the evolution of aposematism. PMID:21444790
Chloroplast heterogeneity and historical admixture within the genus Malus.
Volk, Gayle M; Henk, Adam D; Baldo, Angela; Fazio, Gennaro; Chao, C Thomas; Richards, Christopher M
2015-07-01
• The genus Malus represents a unique and complex evolutionary context in which to study domestication. Several Malus species have provided novel alleles and traits to the cultivars. The extent of admixture among wild Malus species has not been well described, due in part to limited sampling of individuals within a taxon.• Four chloroplast regions (1681 bp total) were sequenced and aligned for 412 Malus individuals from 30 species. Phylogenetic relationships were reconstructed using maximum parsimony. The distribution of chloroplast haplotypes among species was examined using statistical parsimony, phylogenetic trees, and a median-joining network.• Chloroplast haplotypes are shared among species within Malus. Three major haplotype-sharing networks were identified. One includes species native to China, Western North America, as well as Malus domestica Borkh, and its four primary progenitor species: M. sieversii (Ledeb.) M. Roem., M. orientalis Uglitzk., M. sylvestris (L.) Mill., and M. prunifolia (Willd.) Borkh; another includes five Chinese Malus species, and a third includes the three Malus species native to Eastern North America.• Chloroplast haplotypes found in M. domestica belong to a single, highly admixed network. Haplotypes shared between the domesticated apple and its progenitors may reflect historical introgression or the retention of ancestral polymorphisms. Multiple individuals should be sampled within Malus species to reveal haplotype heterogeneity, if complex maternal contributions to named species are to be recognized. © 2015 Botanical Society of America, Inc.
Phylogenetic trends in respiratory rhythmogenesis: insights from ectothermic vertebrates.
Kinkead, Richard
2009-08-31
Understanding the neural substrate driving breathing has puzzled physiologists for more than a century. The discovery of the pre-Bötzinger complex (preBötC) in newborn rodents as a structure with a unique physiological function in respiratory rhythm generation was an important progress in respiratory neurobiology that stimulated much research. Owing to the extensive literature describing the location, organisation, and function of the preBötC mainly in newborn rodents, this structure has become the point of reference in studies addressing respiratory rhythm generation in other mammals and various classes of vertebrates. This paper reviews recent progress made in non-mammalian vertebrates in our understanding of the location and function of the neural networks driving respiratory activity. As in newborn rodents, data from lampreys, air breathing fish, and amphibians show that the production of eupnea is the result of interactions between multiple (at least two) rhythmogenic networks. These networks are located in anatomically distinct areas and show different functional properties in terms of their ability to produce (or not) bursting activity in the absence of synaptic inputs (e.g. pacemaker neurons) and their sensitivity to specific neuromodulators such as substance P, somatostatin, and opioids. Current data indicate that respiratory rhythmogenesis is a phylogenetically ancient function that was highly conserved throughout evolution and that a comparative approach remains important to derive broader biological principles and a more comprehensive view.
YBYRÁ facilitates comparison of large phylogenetic trees.
Machado, Denis Jacob
2015-07-01
The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, specially for large data sets. The programs that comprises this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .
SICLE: a high-throughput tool for extracting evolutionary relationships from phylogenetic trees.
DeBlasio, Dan F; Wisecaver, Jennifer H
2016-01-01
We present the phylogeny analysis software SICLE (Sister Clade Extractor), an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteriodetes, identifying 13 unique sister relationships to S. ruber across the 4,589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteriodetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.
A review of criticisms of phylogenetic nomenclature: is taxonomic freedom the fundamental issue?
Bryant, Harold N; Cantino, Philip D
2002-02-01
The proposal to implement a phylogenetic nomenclatural system governed by the PhyloCode), in which taxon names are defined by explicit reference to common descent, has met with strong criticism from some proponents of phylogenetic taxonomy (taxonomy based on the principle of common descent in which only clades and species are recognized). We examine these criticisms and find that some of the perceived problems with phylogenetic nomenclature are based on misconceptions, some are equally true of the current rank-based nomenclatural system, and some will be eliminated by implementation of the PhyloCode. Most of the criticisms are related to an overriding concern that, because the meanings of names are associated with phylogenetic pattern which is subject to change, the adoption of phylogenetic nomenclature will lead to increased instability in the content of taxa. This concern is associated with the fact that, despite the widespread adoption of the view that taxa are historical entities that are conceptualized based on ancestry, many taxonomists also conceptualize taxa based on their content. As a result, critics of phylogenetic nomenclature have argued that taxonomists should be free to emend the content of taxa without constraints imposed by nomenclatural decisions. However, in phylogenetic nomenclature the contents of taxa are determined, not by the taxonomist, but by the combination of the phylogenetic definition of the name and a phylogenetic hypothesis. Because the contents of taxa, once their names are defined, can no longer be freely modified by taxonomists, phylogenetic nomenclature is perceived as limiting taxonomic freedom. We argue that the form of taxonomic freedom inherent to phylogenetic nomenclature is appropriate to phylogenetic taxonomy in which taxa are considered historical entities that are discovered through phylogenetic analysis and are not human constructs.
Punctuated equilibrium in the large-scale evolution of programming languages.
Valverde, Sergi; Solé, Ricard V
2015-06-06
The analogies and differences between biological and cultural evolution have been explored by evolutionary biologists, historians, engineers and linguists alike. Two well-known domains of cultural change are language and technology. Both share some traits relating the evolution of species, but technological change is very difficult to study. A major challenge in our way towards a scientific theory of technological evolution is how to properly define evolutionary trees or clades and how to weight the role played by horizontal transfer of information. Here, we study the large-scale historical development of programming languages, which have deeply marked social and technological advances in the last half century. We analyse their historical connections using network theory and reconstructed phylogenetic networks. Using both data analysis and network modelling, it is shown that their evolution is highly uneven, marked by innovation events where new languages are created out of improved combinations of different structural components belonging to previous languages. These radiation events occur in a bursty pattern and are tied to novel technological and social niches. The method can be extrapolated to other systems and consistently captures the major classes of languages and the widespread horizontal design exchanges, revealing a punctuated evolutionary path. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Ji, Boyang; Zhang, Sheng-Da; Zhang, Wei-Jia; Rouy, Zoe; Alberto, François; Santini, Claire-Lise; Mangenot, Sophie; Gagnot, Séverine; Philippe, Nadège; Pradel, Nathalie; Zhang, Lichen; Tempel, Sébastien; Li, Ying; Médigue, Claudine; Henrissat, Bernard; Coutinho, Pedro M; Barbe, Valérie; Talla, Emmanuel; Wu, Long-Fei
2017-03-01
Magnetotactic bacteria (MTB) are a group of phylogenetically and physiologically diverse Gram-negative bacteria that synthesize intracellular magnetic crystals named magnetosomes. MTB are affiliated with three classes of Proteobacteria phylum, Nitrospirae phylum, Omnitrophica phylum and probably with the candidate phylum Latescibacteria. The evolutionary origin and physiological diversity of MTB compared with other bacterial taxonomic groups remain to be illustrated. Here, we analysed the genome of the marine magneto-ovoid strain MO-1 and found that it is closely related to Magnetococcus marinus MC-1. Detailed analyses of the ribosomal proteins and whole proteomes of 390 genomes reveal that, among the Proteobacteria analysed, only MO-1 and MC-1 have coding sequences (CDSs) with a similarly high proportion of origins from Alphaproteobacteria, Betaproteobacteria, Deltaproteobacteria and Gammaproteobacteria. Interestingly, a comparative metabolic network analysis with anoxic network enzymes from sequenced MTB and non-MTB successfully allows the eventual prediction of an organism with a metabolic profile compatible for magnetosome production. Altogether, our genomic analysis reveals multiple origins of MO-1 and M. marinus MC-1 genomes and suggests a metabolism-restriction model for explaining whether a bacterium could become an MTB upon acquisition of magnetosome encoding genes. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
PhyLIS: a simple GNU/Linux distribution for phylogenetics and phyloinformatics.
Thomson, Robert C
2009-07-30
PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.
PhyLIS: A Simple GNU/Linux Distribution for Phylogenetics and Phyloinformatics
Thomson, Robert C.
2009-01-01
PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/. PMID:19812729
Sun, Cheng; Yu, Guoliang; Bao, Manzhu; Zheng, Bo; Ning, Guogui
2014-06-27
Odd traits in few of plant species usually implicate potential biology significances in plant evolutions. The genus Helwingia Willd, a dioecious medical shrub in Aquifoliales order, has an odd floral architecture-epiphyllous inflorescence. The potential significances and possible evolutionary origin of this specie are not well understood due to poorly available data of biological and genetic studies. In addition, the advent of genomics-based technologies has widely revolutionized plant species with unknown genomic information. Morphological and biological pattern were detailed via anatomical and pollination analyses. An RNA sequencing based transcriptomic analysis were undertaken and a high-resolution phylogenetic analysis was conducted based on single-copy genes in more than 80 species of seed plants, including H. japonica. It is verified that a potential fusion of rachis to the leaf midvein facilitates insect pollination. RNA sequencing yielded a total of 111450 unigenes; half of them had significant similarity with proteins in the public database, and 20281 unigenes were mapped to 119 pathways. Deduced from the phylogenetic analysis based on single-copy genes, the group of Helwingia is closer with Euasterids II and rather than Euasterids, congruent with previous reports using plastid sequences. The odd flower architecture make H. Willd adapt to insect pollination by hosting those insects larger than the flower in size via leave, which has little common character that other insect pollination plants hold. Further the present transcriptome greatly riches genomics information of Helwingia species and nucleus genes based phylogenetic analysis also greatly improve the resolution and robustness of phylogenetic reconstruction in H. japonica.
A study on the characterization of Propionibacterium acnes isolated from ocular clinical specimens.
Sowmiya, Murali; Malathi, Jambulingam; Swarnali, Sen; Priya, Jeyavel Padma; Therese, Kulandai Lily; Madhavan, Hajib N
2015-10-01
There are only a few reports available on characterization of Propionibacterium acnes isolated from various ocular clinical specimens. We undertook this study to evaluate the role of P. acnes in ocular infections and biofilm production, and also do the phylogenetic analysis of the bacilli. One hundred isolates of P. acnes collected prospectively from ocular clinical specimens at a tertiary care eye hospital between January 2010 and December 2011, were studied for their association with various ocular disease conditions. The isolates were also subjected to genotyping and phylogenetic analysis, and were also tested for their ability to produce biofilms. Among preoperative conjunctival swabs, P. acnes was a probably significant pathogen in one case; a possibly significant pathogen in two cases. In other clinical conditions, 13 per cent isolates were probably significant pathogens and 38 per cent as possibly significant pathogens. The analysis of 16S rRNA gene revealed four different phylogenies whereas analysis of recA gene showed two phylogenies confirming that recA gene was more reliable than 16S rRNA with less sequence variation. Results of polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) had 100 per cent concordance with phylogenetic results. No association was seen between P. acnes subtypes and biofilm production. RecA gene phylogenetic studies revealed two different phylogenies. RFLP technique was found to be cost-effective with high sensitivity and specificity in phylogenetic analysis. No association between P. acnes subtypes and pathogenetic ability was observed. Biofilm producing isolates showed increased antibiotic resistance compared with non-biofilm producing isolates.
Microbial ecological associations in the surface sediments of Bohai strait
NASA Astrophysics Data System (ADS)
Wang, Bin; Liu, Hongmei; Tang, Haitian; Hu, Xiaoke
2017-09-01
Microbial communities play key roles in the marine ecosystem. Despite a few studies on marine microbial communities in deep straits, ecological associations among microbial communities in the sediments of shallow straits have not been fully investigated. The Bohai Strait in northern China (average depth less than 20 m) separates the Bohai Sea from the Yellow Sea and has organic-rich sediments. In this study, in the summer of 2014, six stations across the strait were selected to explore the taxonomic composition of microbial communities and their ecological associations. The four most abundant classes were Gammaproteobacteria, Deltaproteobacteria, Bacilli and Flavobacteriia. Temperature, total carbon, depth, nitrate, fishery breeding and cold water masses influenced the microbial communities, as suggested by representational difference and composition analyses. Network analysis of microbial associations revealed that key families included Flavobacteriaceae, Pirellulaceae and Piscirickettsiaceae. Our findings suggest that the families with high phylogenetic diversity are key populations in the microbial association network that ensure the stability of microbial ecosystems. Our study contributes to a better understanding of microbial ecology in complex hydrological environments.
Phylogenetic species delimitation for crayfishes of the genus Pacifastacus.
Larson, Eric R; Castelin, Magalie; Williams, Bronwyn W; Olden, Julian D; Abbott, Cathryn L
2016-01-01
Molecular genetic approaches are playing an increasing role in conservation science by identifying biodiversity that may not be evident by morphology-based taxonomy and systematics. So-called cryptic species are particularly prevalent in freshwater environments, where isolation of dispersal-limited species, such as crayfishes, within dendritic river networks often gives rise to high intra- and inter-specific genetic divergence. We apply here a multi-gene molecular approach to investigate relationships among extant species of the crayfish genus Pacifastacus, representing the first comprehensive phylogenetic study of this taxonomic group. Importantly, Pacifastacus includes both the widely invasive signal crayfish Pacifastacus leniusculus, as well as several species of conservation concern like the Shasta crayfish Pacifastacus fortis. Our analysis used 83 individuals sampled across the four extant Pacifastacus species (omitting the extinct Pacifastacus nigrescens), representing the known taxonomic diversity and geographic distributions within this genus as comprehensively as possible. We reconstructed phylogenetic trees from mitochondrial (16S, COI) and nuclear genes (GAPDH), both separately and using a combined or concatenated dataset, and performed several species delimitation analyses (PTP, ABGD, GMYC) on the COI phylogeny to propose Primary Species Hypotheses (PSHs) within the genus. All phylogenies recovered the genus Pacifastacus as monophyletic, within which we identified a range of six to 21 PSHs; more abundant PSHs delimitations from GMYC and ABGD were always nested within PSHs delimited by the more conservative PTP method. Pacifastacus leniusculus included the majority of PSHs and was not monophyletic relative to the other Pacifastacus species considered. Several of these highly distinct P. leniusculus PSHs likely require urgent conservation attention. Our results identify research needs and conservation priorities for Pacifastacus crayfishes in western North America, and may inform better understanding and management of P. leniusculus in regions where it is invasive, such as Europe and Japan.
Phylogeography of the Macaronesian Lettuce Species Lactuca watsoniana and L. palmensis (Asteraceae).
Dias, Elisabete F; Kilian, Norbert; Silva, Luís; Schaefer, Hanno; Carine, Mark; Rudall, Paula J; Santos-Guerra, Arnoldo; Moura, Mónica
2018-02-24
The phylogenetic relationships and phylogeography of two relatively rare Macaronesian Lactuca species, Lactuca watsoniana (Azores) and L. palmensis (Canary Islands), were, until this date, unclear. Karyological information of the Azorean species was also unknown. For this study, a chromosome count was performed and L. watsoniana showed 2n = 34. A phylogenetic approach was used to clarify the relationships of the Azorean endemic L. watsoniana and the La Palma endemic L. palmensis within the subtribe Lactucinae. Maximum parsimony, Maximum likelihood and Bayesian analysis of a combined molecular dataset (ITS and four chloroplast DNA regions) and molecular clock analyses were performed with the Macaronesian Lactuca species, as well as a TCS haplotype network. The analyses revealed that L. watsoniana and L. palmensis belong to different subclades of the Lactuca clade. Lactuca watsoniana showed a strongly supported phylogenetic relationship with North American species, while L. palmensis was closely related to L. tenerrima and L. inermis, from Europe and Africa. Lactuca watsoniana showed four single-island haplotypes. A divergence time estimation of the Macaronesian lineages was used to examine island colonization pathways. Results obtained with BEAST suggest a divergence of L. palmensis and L. watsoniana clades c. 11 million years ago, L. watsoniana diverged from its North American sister species c. 3.8 million years ago and L. palmensis diverged from its sister L. tenerrima, c. 1.3 million years ago, probably originating from an African ancestral lineage which colonized the Canary Islands. Divergence analyses with *BEAST indicate a more recent divergence of the L. watsoniana crown, c. 0.9 million years ago. In the Azores colonization, in a stepping stone, east-to-west dispersal pattern, associated with geological events might explain the current distribution range of L. watsoniana.
Cancilleri, Francesco; Ciccozzi, Massimo; Fogolari, Marta; Cella, Eleonora; De Florio, Lucia; Berton, Alessandra; Salvatore, Giuseppe; Dicuonzo, Giordano; Spoto, Silvia; Denaro, Vincenzo; Angeletti, Silvia
2018-05-01
Methicillin-resistant Staphylococcus aureus (MRSA) infection is rapidly increasing in both hospital and community settings. A 71-year-old man admitted at the Department of Orthopaedics and Trauma Surgery, University Campus Bio-Medico of Rome, with MRSA wound infection consequent to orthopedic surgery was studied and the MRSA transmission evaluated by phylogenetic analysis.
Li, Wei; Zhang, Xin-Cheng; Zhao, Jian; Shi, Yan; Zhu, Xin-Ping
2015-01-25
Cuora trifasciata has become one of the most critically endangered species in the world. The complete mitochondrial genome of C. trifasciata (Chinese three-striped box turtle) was determined in this study. Its mitochondrial genome is a 16,575-bp-long circular molecule that consists of 37 genes that are typically found in other vertebrates. And the basic characteristics of the C. trifasciata mitochondrial genome were also determined. Moreover, a comparison of C. trifasciata with Cuora cyclornata, Cuora pani and Cuora aurocapitata indicated that the four mitogenomics differed in length, codons, overlaps, 13 protein-coding genes (PCGs), ND3, rRNA genes, control region, and other aspects. Phylogenetic analysis with Bayesian inference and maximum likelihood based on 12 protein-coding genes of the genus Cuora indicated the phylogenetic position of C. trifasciata within Cuora. The phylogenetic analysis also showed that C. trifasciata from Vietnam and China formed separate monophyletic clades with different Cuora species. The results of nucleotide base compositions, protein-coding genes and phylogenetic analysis showed that C. trifasciata from these two countries may represent different Cuora species. Copyright © 2014 Elsevier B.V. All rights reserved.
Makendi, Carine; Page, Andrew J.; Wren, Brendan W.; Le Thi Phuong, Tu; Clare, Simon; Hale, Christine; Goulding, David; Klemm, Elizabeth J.; Pickard, Derek; Okoro, Chinyere; Hunt, Martin; Thompson, Corinne N.; Phu Huong Lan, Nguyen; Tran Do Hoang, Nhu; Thwaites, Guy E.; Le Hello, Simon; Brisabois, Anne; Weill, François-Xavier; Baker, Stephen; Dougan, Gordon
2016-01-01
Salmonella enterica serovar Weltevreden (S. Weltevreden) is an emerging cause of diarrheal and invasive disease in humans residing in tropical regions. Despite the regional and international emergence of this Salmonella serovar, relatively little is known about its genetic diversity, genomics or virulence potential in model systems. Here we used whole genome sequencing and bioinformatics analyses to define the phylogenetic structure of a diverse global selection of S. Weltevreden. Phylogenetic analysis of more than 100 isolates demonstrated that the population of S. Weltevreden can be segregated into two main phylogenetic clusters, one associated predominantly with continental Southeast Asia and the other more internationally dispersed. Subcluster analysis suggested the local evolution of S. Weltevreden within specific geographical regions. Four of the isolates were sequenced using long read sequencing to produce high quality reference genomes. Phenotypic analysis in Hep-2 cells and in a murine infection model indicated that S. Weltevreden were significantly attenuated in these models compared to the classical S. Typhimurium reference strain SL1344. Our work outlines novel insights into this important emerging pathogen and provides a baseline understanding for future research studies. PMID:26867150
A gateway for phylogenetic analysis powered by grid computing featuring GARLI 2.0.
Bazinet, Adam L; Zwickl, Derrick J; Cummings, Michael P
2014-09-01
We introduce molecularevolution.org, a publicly available gateway for high-throughput, maximum-likelihood phylogenetic analysis powered by grid computing. The gateway features a garli 2.0 web service that enables a user to quickly and easily submit thousands of maximum likelihood tree searches or bootstrap searches that are executed in parallel on distributed computing resources. The garli web service allows one to easily specify partitioned substitution models using a graphical interface, and it performs sophisticated post-processing of phylogenetic results. Although the garli web service has been used by the research community for over three years, here we formally announce the availability of the service, describe its capabilities, highlight new features and recent improvements, and provide details about how the grid system efficiently delivers high-quality phylogenetic results. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Kumar, Anurag; Saha, Bhaskar; Singh, Shailza
2017-12-01
Leishmaniasis is the second largest parasitic killer disease caused by the protozoan parasite Leishmania , transmitted by the bite of sand flies. It's endemic in the eastern India with 165.4 million populations at risk with the current drug regimen. Three forms of leishmaniasis exist in which cutaneous is the most common form caused by Leishmania major . Trypanothione Reductase (TryR), a flavoprotein oxidoreductase, unique to thiol redox system, is considered as a potential target for chemotherapy for trypanosomatids infection. It is involved in the NADPH dependent reduction of Trypanothione disulphide to Trypanothione. Similarly, is Tryparedoxin Peroxidase (Txnpx), for detoxification of peroxides, an event pivotal for survival of Leishmania in two disparate biological environment. Fe-S plays a major role in regulating redox balance. To check for the closeness between human homologs of these proteins, we have carried the molecular clock analysis followed by molecular modeling of 3D structure of this protein, enabling us to design and test the novel drug like molecules. Molecular clock analysis suggests that human homologs of TryR i.e. Glutathione Reductase and Txnpx respectively are highly diverged in phylogenetic tree, thus, they serve as good candidates for chemotherapy of leishmaniasis. Furthermore, we have done the homology modeling of TryR using template of same protein from Leishmania infantum (PDB ID: 2JK6). This was done using Modeller 9.18 and the resultant models were validated. To inhibit this target, molecular docking was done with various screened inhibitors in which we found Taxifolin acts as common inhibitors for both TryR and Txnpx. We constructed the protein-protein interaction network for the proteins that are involved in the redox metabolism from various Interaction databases and the network was statistically analysed.
Liyanage, Kapila K.; Khan, Sehroon; Brooks, Siraprapa; Mortimer, Peter E.; Karunarathna, Samantha C.; Xu, Jianchu; Hyde, Kevin D.
2018-01-01
Powdery mildew disease of rubber affects immature green leaves, buds, inflorescences, and other immature tissues of rubber trees, resulting in up to 45% losses in rubber latex yield worldwide. The disease is often controlled by dusting the diseased plants with powdered sulfur, which can have long-term negative effects on the environment. Therefore, it is necessary to search for alternative and environmentally friendly control methods for this disease. This study aimed to identify mycoparasites associated with rubber powdery mildew species, and characterize them on the basis of morpho-molecular characteristics and phylogenetic analyses of ITS rDNA regions. We observed that the Ampelomyces fungus parasitizes rubber powdery mildew, and eventually destroys it. Furthermore, on the basis of phylogenetic analyses and morphological characteristics we confirmed that the Ampelomyces mycoparasite isolated from rubber powdery mildew is closely related to other mycohost taxa in the Erysiphe genus. A total of 73 (71 retrieved from GenBank and two obtained from fresh collections of rubber powdery mildew fungi) Ampelomyces spp. were analyzed using ITS rDNA sequences and 153 polymorphic sites were identified through haplotypic analyses. A total of 28 haplotypes (H1–H28) were identified to have a complex network of mutation events. The results from phylogenetic tree constructed on the basis of maximum likelihood analyses, and the haplotype network tree revealed similar relationships of clustering pattern. This work presents the first report on morpho-molecular characterization of Ampelomyces species that are mycoparasites of powdery mildew of Hevea brasiliensis. PMID:29403464
Stevens, John R; Jones, Todd R; Lefevre, Michael; Ganesan, Balasubramanian; Weimer, Bart C
2017-01-01
Microbial community analysis experiments to assess the effect of a treatment intervention (or environmental change) on the relative abundance levels of multiple related microbial species (or operational taxonomic units) simultaneously using high throughput genomics are becoming increasingly common. Within the framework of the evolutionary phylogeny of all species considered in the experiment, this translates to a statistical need to identify the phylogenetic branches that exhibit a significant consensus response (in terms of operational taxonomic unit abundance) to the intervention. We present the R software package SigTree , a collection of flexible tools that make use of meta-analysis methods and regular expressions to identify and visualize significantly responsive branches in a phylogenetic tree, while appropriately adjusting for multiple comparisons.
Prychitko, T M; Moore, W S
1997-10-01
Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press
Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John
2014-01-01
Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723
Neutral Community Dynamics and the Evolution of Species Interactions.
Coelho, Marco Túlio P; Rangel, Thiago F
2018-04-01
A contemporary goal in ecology is to determine the ecological and evolutionary processes that generate recurring structural patterns in mutualistic networks. One of the great challenges is testing the capacity of neutral processes to replicate observed patterns in ecological networks, since the original formulation of the neutral theory lacks trophic interactions. Here, we develop a stochastic-simulation neutral model adding trophic interactions to the neutral theory of biodiversity. Without invoking ecological differences among individuals of different species, and assuming that ecological interactions emerge randomly, we demonstrate that a spatially explicit multitrophic neutral model is able to capture the recurrent structural patterns of mutualistic networks (i.e., degree distribution, connectance, nestedness, and phylogenetic signal of species interactions). Nonrandom species distribution, caused by probabilistic events of migration and speciation, create nonrandom network patterns. These findings have broad implications for the interpretation of niche-based processes as drivers of ecological networks, as well as for the integration of network structures with demographic stochasticity.
Dual Neural Network Model for the Evolution of Speech and Language.
Hage, Steffen R; Nieder, Andreas
2016-12-01
Explaining the evolution of speech and language poses one of the biggest challenges in biology. We propose a dual network model that posits a volitional articulatory motor network (VAMN) originating in the prefrontal cortex (PFC; including Broca's area) that cognitively controls vocal output of a phylogenetically conserved primary vocal motor network (PVMN) situated in subcortical structures. By comparing the connections between these two systems in human and nonhuman primate brains, we identify crucial biological preadaptations in monkeys for the emergence of a language system in humans. This model of language evolution explains the exclusiveness of non-verbal communication sounds (e.g., cries) in infants with an immature PFC, as well as the observed emergence of non-linguistic vocalizations in adults after frontal lobe pathologies. Copyright © 2016 Elsevier Ltd. All rights reserved.
Oh, Seungdae; Hammes, Frederik; Liu, Wen-Tso
2018-01-01
Microorganisms inhabiting filtration media of a drinking water treatment plant can be beneficial, because they metabolize biodegradable organic matter from source waters and those formed during disinfection processes, leading to the production of biologically stable drinking water. However, which microbial consortia colonize filters and what metabolic capacity they possess remain to be investigated. To gain insights into these issues, we performed metagenome sequencing and analysis of microbial communities in three different filters of a full-scale drinking water treatment plant (DWTP). Filter communities were sampled from a rapid sand filter (RSF), granular activated carbon filter (GAC), and slow sand filter (SSF), and from the Schmutzdecke (SCM, a biologically active scum layer accumulated on top of SSF), respectively. Analysis of community phylogenetic structure revealed that the filter bacterial communities significantly differed from those in the source water and final effluent communities, respectively. Network analysis identified a filter-specific colonization pattern of bacterial groups. Bradyrhizobiaceae were abundant in GAC, whereas Nitrospira were enriched in the sand-associated filters (RSF, SCM, and SSF). The GAC community was enriched with functions associated with aromatics degradation, many of which were encoded by Rhizobiales (∼30% of the total GAC community). Predicting minimum generation time (MGT) of prokaryotic communities suggested that the GAC community potentially select fast-growers (<15 h of MGT) among the four filter communities, consistent with the highest dissolved organic matter removal rate by GAC. Our findings provide new insights into the community phylogenetic structure, colonization pattern, and metabolic capacity that potentially contributes to organic matter removal achieved in the biofiltration stages of the full-scale DWTP. Copyright © 2017 Elsevier Ltd. All rights reserved.
Marangi, M; Cantacessi, C; Sparagano, O A E; Camarda, A; Giangaspero, A
2014-12-01
In order to investigate the genetic relationships between Dermanyssus gallinae (Metastigmata: Dermanyssidae) (de Geer) isolates from poultry farms in Italy and other European countries, phylogenetic analysis was performed using a portion of the cytochrome c oxidase subunit 1 (cox1) gene of the mitochondrial DNA and the internal transcribed spacers (ITS1+5.8S+ITS2) of the ribosomal DNA. A total of 360 cox1 sequences and 360 ITS+ sequences were obtained from mites collected on 24 different poultry farms in 10 different regions of Northern and Southern Italy. Phylogenetic analysis of the cox1 sequences resulted in the clustering of two groups (A and B), whereas phylogenetic analysis of the ITS+ resulted in largely unresolved clusters. Knowledge of the genetic make-up of mite populations within countries, together with comparative analyses of D. gallinae isolates from different countries, will provide better understanding of the population dynamics of D. gallinae. This will also allow the identification of genetic markers of emerging acaricide resistance and the development of alternative strategies for the prevention and treatment of infestations. © 2014 The Royal Entomological Society.
Jiao, Yu-Liang; Wang, Shu-Jun; Lv, Ming-Sheng; Fang, Yao-Wei; Liu, Shu
2013-03-01
Thermostable amylopullulanase (TAPU) is valuable in starch saccharification industry for its capability to catalyze both α-1,4 and α-1,6 glucosidic bonds under the industrial starch liquefication condition. The majority of TAPUs belong to glycoside hydrolase family 57 (GH57). In this study, we performed a phylogenetic analysis of GH57 amylopullulanase (APU) based on the highly conserved DOMON_glucodextranase_like (DDL) domain and classified APUs according to their multidomain architectures, phylogenetic analysis and enzymatic characters. This study revealed that amylopullulanase, pullulanase, andα-amylase had passed through a long joint evolution process, in which DDL played an important role. The phylogenetic analysis of DDL domain showed that the GH57 APU is directly sharing a common ancestor with pullulanase, and the DDL domains in some species undergo evolution scenarios such as domain duplication and recombination. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Satheesh, Viswanathan; Jagannadham, P Tej Kumar; Chidambaranathan, Parameswaran; Jain, P K; Srinivasan, R
2014-12-01
The NAC (NAM, ATAF and CUC) proteins are plant-specific transcription factors implicated in development and stress responses. In the present study 88 pigeonpea NAC genes were identified from the recently published draft genome of pigeonpea by using homology based and de novo prediction programmes. These sequences were further subjected to phylogenetic, motif and promoter analyses. In motif analysis, highly conserved motifs were identified in the NAC domain and also in the C-terminal region of the NAC proteins. A phylogenetic reconstruction using pigeonpea, Arabidopsis and soybean NAC genes revealed 33 putative stress-responsive pigeonpea NAC genes. Several stress-responsive cis-elements were identified through in silico analysis of the promoters of these putative stress-responsive genes. This analysis is the first report of NAC gene family in pigeonpea and will be useful for the identification and selection of candidate genes associated with stress tolerance.
Trichoderma stromaticum and its overseas relatives
USDA-ARS?s Scientific Manuscript database
Trichoderma stromaticum, T. rossicum and newly discovered species form a new lineage in Trichoderma. Phylogenetic and phenotypic diversity in Trichoderma stromaticum are examined in light of reported differences in ecological parameters and AFLP patterns. Multilocus phylogenetic analysis using 4 gen...
2013-01-01
Background Dendropsophus is a monophyletic anuran genus with a diploid number of 30 chromosomes as an important synapomorphy. However, the internal phylogenetic relationships of this genus are poorly understood. Interestingly, an intriguing interspecific variation in the telocentric chromosome number has been useful in species identification. To address certain uncertainties related to one of the species groups of Dendropsophus, the D. microcephalus group, we carried out a cytogenetic analysis combined with phylogenetic inferences based on mitochondrial sequences, which aimed to aid in the analysis of chromosomal characters. Populations of Dendropsophus nanus, Dendropsophus walfordi, Dendropsophus sanborni, Dendropsophus jimi and Dendropsophus elianeae, ranging from the extreme south to the north of Brazil, were cytogenetically compared. A mitochondrial region of the ribosomal 12S gene from these populations, as well as from 30 other species of Dendropsophus, was used for the phylogenetic inferences. Phylogenetic relationships were inferred using maximum parsimony and Bayesian analyses. Results The species D. nanus and D. walfordi exhibited identical karyotypes (2n = 30; FN = 52), with four pairs of telocentric chromosomes and a NOR located on metacentric chromosome pair 13. In all of the phylogenetic hypotheses, the paraphyly of D. nanus and D. walfordi was inferred. D. sanborni from Botucatu-SP and Torres-RS showed the same karyotype as D. jimi, with 5 pairs of telocentric chromosomes (2n = 30; FN = 50) and a terminal NOR in the long arm of the telocentric chromosome pair 12. Despite their karyotypic similarity, these species were not found to compose a monophyletic group. Finally, the phylogenetic and cytogenetic analyses did not cluster the specimens of D. elianeae according to their geographical occurrence or recognized morphotypes. Conclusions We suggest that a taxonomic revision of the taxa D. nanus and D. walfordi is quite necessary. We also observe that the number of telocentric chromosomes is useful to distinguish among valid species in some cases, although it is unchanged in species that are not necessarily closely related phylogenetically. Therefore, inferences based on this chromosomal character must be made with caution; a proper evolutionary analysis of the karyotypic variation in Dendropsophus depends on further characterization of the telocentric chromosomes found in this group. PMID:23822759
Modularity and evolutionary constraints in a baculovirus gene regulatory network
2013-01-01
Background The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite network take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode for 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. Results We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related on different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed in key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization. Conclusions Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks. PMID:24006890
Ocean plankton. Determinants of community structure in the global plankton interactome.
Lima-Mendez, Gipsi; Faust, Karoline; Henry, Nicolas; Decelle, Johan; Colin, Sébastien; Carcillo, Fabrizio; Chaffron, Samuel; Ignacio-Espinosa, J Cesar; Roux, Simon; Vincent, Flora; Bittner, Lucie; Darzi, Youssef; Wang, Jun; Audic, Stéphane; Berline, Léo; Bontempi, Gianluca; Cabello, Ana M; Coppola, Laurent; Cornejo-Castillo, Francisco M; d'Ovidio, Francesco; De Meester, Luc; Ferrera, Isabel; Garet-Delmas, Marie-José; Guidi, Lionel; Lara, Elena; Pesant, Stéphane; Royo-Llonch, Marta; Salazar, Guillem; Sánchez, Pablo; Sebastian, Marta; Souffreau, Caroline; Dimier, Céline; Picheral, Marc; Searson, Sarah; Kandels-Lewis, Stefanie; Gorsky, Gabriel; Not, Fabrice; Ogata, Hiroyuki; Speich, Sabrina; Stemmann, Lars; Weissenbach, Jean; Wincker, Patrick; Acinas, Silvia G; Sunagawa, Shinichi; Bork, Peer; Sullivan, Matthew B; Karsenti, Eric; Bowler, Chris; de Vargas, Colomban; Raes, Jeroen
2015-05-22
Species interaction networks are shaped by abiotic and biotic factors. Here, as part of the Tara Oceans project, we studied the photic zone interactome using environmental factors and organismal abundance profiles and found that environmental factors are incomplete predictors of community structure. We found associations across plankton functional types and phylogenetic groups to be nonrandomly distributed on the network and driven by both local and global patterns. We identified interactions among grazers, primary producers, viruses, and (mainly parasitic) symbionts and validated network-generated hypotheses using microscopy to confirm symbiotic relationships. We have thus provided a resource to support further research on ocean food webs and integrating biological components into ocean models. Copyright © 2015, American Association for the Advancement of Science.
The problem and promise of scale dependency in community phylogenetics.
Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K
2006-10-01
The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.
Howard, Marion G; McDonald, William J F; Forster, Paul I; Kress, W John; Erickson, David; Faith, Daniel P; Shapcott, Alison
2016-01-01
Australia's Great Sandy Region is of international significance containing two World Heritage areas and patches of rainforest growing on white sand. Previous broad-scale analysis found the Great Sandy biogeographic subregion contained a significantly more phylogenetically even subset of species than expected by chance contrasting with rainforest on white sand in Peru. This study aimed to test the patterns of rainforest diversity and relatedness at a finer scale and to investigate why we may find different patterns of phylogenetic evenness compared with rainforests on white sands in other parts of the world. This study focussed on rainforest sites within the Great Sandy and surrounding areas in South East Queensland (SEQ), Australia. We undertook field collections, expanded our three-marker DNA barcode library of SEQ rainforest plants and updated the phylogeny to 95% of the SEQ rainforest flora. We sampled species composition of rainforest in fixed area plots from 100 sites. We calculated phylogenetic diversity (PD) measures as well as species richness (SR) for each rainforest community. These combined with site variables such as geology, were used to evaluate patterns and relatedness. We found that many rainforest communities in the Great Sandy area were significantly phylogenetically even at the individual site level consistent with a broader subregion analysis. Sites from adjacent areas were either not significant or were significantly phylogenetically clustered. Some results in the neighbouring areas were consistent with historic range expansions. In contrast with expectations, sites located on the oldest substrates had significantly lower phylogenetic diversity (PD). Fraser Island was once connected to mainland Australia, our results are consistent with a region geologically old enough to have continuously supported rainforest in refugia. The interface of tropical and temperate floras in part also explains the significant phylogenetic evenness and higher than expected phylogenetic diversity.
Howard, Marion G.; McDonald, William J. F.; Forster, Paul I.; Kress, W. John; Erickson, David; Faith, Daniel P.; Shapcott, Alison
2016-01-01
Australia’s Great Sandy Region is of international significance containing two World Heritage areas and patches of rainforest growing on white sand. Previous broad-scale analysis found the Great Sandy biogeographic subregion contained a significantly more phylogenetically even subset of species than expected by chance contrasting with rainforest on white sand in Peru. This study aimed to test the patterns of rainforest diversity and relatedness at a finer scale and to investigate why we may find different patterns of phylogenetic evenness compared with rainforests on white sands in other parts of the world. This study focussed on rainforest sites within the Great Sandy and surrounding areas in South East Queensland (SEQ), Australia. We undertook field collections, expanded our three-marker DNA barcode library of SEQ rainforest plants and updated the phylogeny to 95% of the SEQ rainforest flora. We sampled species composition of rainforest in fixed area plots from 100 sites. We calculated phylogenetic diversity (PD) measures as well as species richness (SR) for each rainforest community. These combined with site variables such as geology, were used to evaluate patterns and relatedness. We found that many rainforest communities in the Great Sandy area were significantly phylogenetically even at the individual site level consistent with a broader subregion analysis. Sites from adjacent areas were either not significant or were significantly phylogenetically clustered. Some results in the neighbouring areas were consistent with historic range expansions. In contrast with expectations, sites located on the oldest substrates had significantly lower phylogenetic diversity (PD). Fraser Island was once connected to mainland Australia, our results are consistent with a region geologically old enough to have continuously supported rainforest in refugia. The interface of tropical and temperate floras in part also explains the significant phylogenetic evenness and higher than expected phylogenetic diversity. PMID:27119149
Valtueña, Francisco J; Rodríguez-Riaño, Tomás; López, Josefa; Mayo, Carlos; Ortega-Olivencia, Ana
2017-01-01
The Macaronesian Scrophularia lowei is hypothesized to have arisen from the widespread S. arguta on the basis of several phylogenetic studies of the genus, but sampling has been limited. Although these two annual species are morphologically distinct, the origin of S. lowei is unclear because genetic studies focused on this Macaronesian species are lacking. We studied 5 S. lowei and 25 S. arguta populations to determine the relationship of both species and to infer the geographical origin of S. lowei. The timing of S. lowei divergence and differentiation was inferred by dating analysis of the ITS region. A phylogenetic analysis of two nuclear (ITS and ETS) and two chloroplast (psbJ-petA and psbA-trnH) DNA regions was performed to study the relationship between the two species, and genetic differentiation was analysed by AMOVA. Haplotype network construction and Bayesian phylogeographic analysis were conducted using chloroplast DNA regions and a spatial clustering analysis was carried out on a combined dataset of all studied regions. Our results indicate that both species constitute a well-supported clade that diverged in the Miocene and differentiated in the Late Miocene-Pleistocene. Although S. lowei constitutes a well-supported clade according to nDNA, cpDNA revealed a close relationship between S. lowei and western Canarian S. arguta, a finding supported by the spatial clustering analysis. Both species have strong population structure, with most genetic variability explained by inter-population differences. Our study therefore supports a recent peripatric speciation of S. lowei-a taxon that differs morphologically and genetically at the nDNA level from its closest relative, S. arguta, but not according to cpDNA, from the closest Macaronesian populations of that species. In addition, a recent dispersal of S. arguta to Madeira from Canary Islands or Selvagens Islands and a rapid morphological differentiation after the colonization to generate S. lowei is the most likely hypothesis to explain the origin of the last taxon.
Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis
Sun, Jiufeng; Huang, Yan; Huang, Huaiqiu; Liang, Pei; Wang, Xiaoyun; Mao, Qiang; Men, Jingtao; Chen, Wenjun; Deng, Chuanhuan; Zhou, Chenhui; Lv, Xiaoli; Zhou, Juanjuan; Zhang, Fan; Li, Ran; Tian, Yanli; Lei, Huali; Liang, Chi; Hu, Xuchu; Xu, Jin; Li, Xuerong; XinbingYu
2013-01-01
Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. Thus, the present study improves our understanding of the global epidemiology and evolution of C. sinensis. PMID:23825605
Rodríguez-Riaño, Tomás; López, Josefa; Mayo, Carlos; Ortega-Olivencia, Ana
2017-01-01
The Macaronesian Scrophularia lowei is hypothesized to have arisen from the widespread S. arguta on the basis of several phylogenetic studies of the genus, but sampling has been limited. Although these two annual species are morphologically distinct, the origin of S. lowei is unclear because genetic studies focused on this Macaronesian species are lacking. We studied 5 S. lowei and 25 S. arguta populations to determine the relationship of both species and to infer the geographical origin of S. lowei. The timing of S. lowei divergence and differentiation was inferred by dating analysis of the ITS region. A phylogenetic analysis of two nuclear (ITS and ETS) and two chloroplast (psbJ–petA and psbA–trnH) DNA regions was performed to study the relationship between the two species, and genetic differentiation was analysed by AMOVA. Haplotype network construction and Bayesian phylogeographic analysis were conducted using chloroplast DNA regions and a spatial clustering analysis was carried out on a combined dataset of all studied regions. Our results indicate that both species constitute a well-supported clade that diverged in the Miocene and differentiated in the Late Miocene-Pleistocene. Although S. lowei constitutes a well-supported clade according to nDNA, cpDNA revealed a close relationship between S. lowei and western Canarian S. arguta, a finding supported by the spatial clustering analysis. Both species have strong population structure, with most genetic variability explained by inter-population differences. Our study therefore supports a recent peripatric speciation of S. lowei—a taxon that differs morphologically and genetically at the nDNA level from its closest relative, S. arguta, but not according to cpDNA, from the closest Macaronesian populations of that species. In addition, a recent dispersal of S. arguta to Madeira from Canary Islands or Selvagens Islands and a rapid morphological differentiation after the colonization to generate S. lowei is the most likely hypothesis to explain the origin of the last taxon. PMID:28575081
Morphometrics of Daucus (Apiaceae): A counterpart to a phylogenomic study
USDA-ARS?s Scientific Manuscript database
Molecular phylogenetics of genome-scale data sets (phylogenomics) often produces phylogenetic trees with unprecedented resolution. A companion phylogenomics analysis of Daucus (carrots) using 94 conserved nuclear orthologs supported many of the traditional species but showed unexpected results that ...
Abdel-Shafi, Iman R; Shoieb, Eman Y; Attia, Samar S; Rubio, José M; Ta-Tang, Thuy-Huong; El-Badry, Ayman A
2017-03-01
Lymphatic filariasis (LF) is a serious vector-borne health problem, and Wuchereria bancrofti (W.b) is the major cause of LF worldwide and is focally endemic in Egypt. Identification of filarial infection using traditional morphologic and immunological criteria can be difficult and lead to misdiagnosis. The aim of the present study was molecular detection of W.b in residents in endemic areas in Egypt, sequence variance analysis, and phylogenetic analysis of W.b DNA. Collected blood samples from residents in filariasis endemic areas in five governorates were subjected to semi-nested PCR targeting repeated DNA sequence, for detection of W.b DNA. PCR products were sequenced; subsequently, a phylogenetic analysis of the obtained sequences was performed. Out of 300 blood samples, W.b DNA was identified in 48 (16%). Sequencing analysis confirmed PCR results identifying only W.b species. Sequence alignment and phylogenetic analysis indicated genetically distinct clusters of W.b among the study population. Study results demonstrated that the semi-nested PCR proved to be an effective diagnostic tool for accurate and rapid detection of W.b infections in nano-epidemics and is applicable for samples collected in the daytime as well as the night time. PCR products sequencing and phylogenitic analysis revealed three different nucleotide sequences variants. Further genetic studies of W.b in Egypt and other endemic areas are needed to distinguish related strains and the various ecological as well as drug effects exerted on them to support W.b elimination.
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
2011-02-16
were checked for the presence of heterotrophic bacteria by streak- ing a sample on ASW-R2A agar plates. DNA extraction and analysis of phylogenetic ...Bellerophon v. 3 (greengenes.lbl.gov) and Pintail (www.bioinformatics -toolkit.org/Web-Pintail/). Phylogenetic trees were constructed for SSU rRNA gene...CLUSTALW (44), and phylogenetic analyses were conducted in MEGA4 (42). The evolutionary history was inferred using the neighbor-joining method (39), and
Chauveau, Olivier; Eggers, Lilian; Raquin, Christian; Silvério, Adriano; Brown, Spencer; Couloux, Arnaud; Cruaud, Corine; Kaltchuk-Santos, Eliane; Yockteng, Roxana; Souza-Chies, Tatiana T.; Nadot, Sophie
2011-01-01
Background and Aims Sisyrinchium (Iridaceae: Iridoideae: Sisyrinchieae) is one of the largest, most widespread and most taxonomically complex genera in Iridaceae, with all species except one native to the American continent. Phylogenetic relationships within the genus were investigated and the evolution of oil-producing structures related to specialized oil-bee pollination examined. Methods Phylogenetic analyses based on eight molecular markers obtained from 101 Sisyrinchium accessions representing 85 species were conducted in the first extensive phylogenetic analysis of the genus. Total evidence analyses confirmed the monophyly of the genus and retrieved nine major clades weakly connected to the subdivisions previously recognized. The resulting phylogenetic hypothesis was used to reconstruct biogeographical patterns, and to trace the evolutionary origin of glandular trichomes present in the flowers of several species. Key Results and Conclusions Glandular trichomes evolved three times independently in the genus. In two cases, these glandular trichomes are oil-secreting, suggesting that the corresponding flowers might be pollinated by oil-bees. Biogeographical patterns indicate expansions from Central America and the northern Andes to the subandean ranges between Chile and Argentina and to the extended area of the Paraná river basin. The distribution of oil-flower species across the phylogenetic trees suggests that oil-producing trichomes may have played a key role in the diversification of the genus, a hypothesis that requires future testing. PMID:21527419
Phylogenetic Analysis of Nuclear-Encoded RNA Maturases
Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul
2017-01-01
Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538
Stratification of co-evolving genomic groups using ranked phylogenetic profiles
Freilich, Shiri; Goldovsky, Leon; Gottlieb, Assaf; Blanc, Eric; Tsoka, Sophia; Ouzounis, Christos A
2009-01-01
Background Previous methods of detecting the taxonomic origins of arbitrary sequence collections, with a significant impact to genome analysis and in particular metagenomics, have primarily focused on compositional features of genomes. The evolutionary patterns of phylogenetic distribution of genes or proteins, represented by phylogenetic profiles, provide an alternative approach for the detection of taxonomic origins, but typically suffer from low accuracy. Herein, we present rank-BLAST, a novel approach for the assignment of protein sequences into genomic groups of the same taxonomic origin, based on the ranking order of phylogenetic profiles of target genes or proteins across the reference database. Results The rank-BLAST approach is validated by computing the phylogenetic profiles of all sequences for five distinct microbial species of varying degrees of phylogenetic proximity, against a reference database of 243 fully sequenced genomes. The approach - a combination of sequence searches, statistical estimation and clustering - analyses the degree of sequence divergence between sets of protein sequences and allows the classification of protein sequences according to the species of origin with high accuracy, allowing taxonomic classification of 64% of the proteins studied. In most cases, a main cluster is detected, representing the corresponding species. Secondary, functionally distinct and species-specific clusters exhibit different patterns of phylogenetic distribution, thus flagging gene groups of interest. Detailed analyses of such cases are provided as examples. Conclusion Our results indicate that the rank-BLAST approach can capture the taxonomic origins of sequence collections in an accurate and efficient manner. The approach can be useful both for the analysis of genome evolution and the detection of species groups in metagenomics samples. PMID:19860884
Parks, Donovan H; Beiko, Robert G
2013-01-01
High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.
NASA Technical Reports Server (NTRS)
Buchanan, B. B.
1991-01-01
Comparisons of primary structure have revealed significant homology between the m type thioredoxins of chloroplasts and the thioredoxins from a variety of bacteria. Chloroplast thioredoxin f, by comparison, remains an enigma: certain residues are invariant with those of the other thioredoxins, but a phylogenetic relationship to bacterial or m thioredoxins seems distant. Knowledge of the evolutionary history of thioredoxin f is, nevertheless, of interest because of its role in photosynthesis. Therefore, we have attempted to gain information on the evolutionary history of chloroplast thioredoxin f, as well as m. Our goal was first to establish the utility of thioredoxin as a phylogenetic marker, and, if found suitable, to deduce the evolutionary histories of the chloroplast thioredoxins. To this end, we have constructed phylogenetic (minimal replacement) trees using computer analysis. The results show that the thioredoxins of bacteria and animals fall into distinct phylogenetic groups - the bacterial group resembling that derived from earlier 16s RNA analysis and the animal group showing a cluster consistent with known relationships. The chloroplast thioredoxins show a novel type of phylogenetic arrangement: one m type aligns with its counterpart of eukaryotic algae, cyanobacteria and other bacteria, whereas the second type (f type) tracks with animal thioredoxin. The results give new insight into the evolution of photosynthesis.
NASA Astrophysics Data System (ADS)
LI, J.; Zhang, H.; Liu, P.; Menguy, N.; Pan, Y.
2017-12-01
Magnetotactic bacteria (MTB) are phylogenetically diverse and can biomineralize magnetic nanocrystals of magnetite or greigite in intracellular structures termed magnetosomes. Their remains within sediments or sedimentary rocks, i.e. magnetofossils, have been used to retrieve paleomagnetic and paleoenvironmental information of deposition time, as well as to trace the origin and evolution of life on Earth and even perhaps Mars. A precise identification of magnetofossils heavily depends on our knowledge of phylogenetic diversity and magnetosomal biomineralization within natural MTB. In this paper, we will present a novel method which can rapidly characterize both the phylogenetic and biomineralogical properties of uncultured MTB at the single-cell level by coupling fluorescence and electron microscopy. Using this method, we have successfully identified several uncultured MTB strains from natural environments in China. These MTB are phylogenetically affiliated with the Alphaproteobacteria, Deltaproteobacteria, Gammaproteobacteria and Nitrospirae phylum, and form octahedral, cuboctahedral, prismatic, tooth-like and bullet-shaped magnetite magnetosomes. A corresponding analysis of magnetosome morphology and bacterial phylogenetics on each MTB strain has shown a species/strain-specific magnetosome biomineralization. The new method is not only promising for better understanding the correlation between magnetosome mineral habits and MTB phylogenies, but also crucial for unambiguously identifying magnetofossils.
NASA Astrophysics Data System (ADS)
Nashrulloh, Maulana Malik; Kurniawan, Nia; Rahardi, Brian
2017-11-01
The increasing availability of genetic sequence data associated with explicit geographic and environment (including biotic and abiotic components) information offers new opportunities to study the processes that shape biodiversity and its patterns. Developing phylogeography reconstruction, by integrating phylogenetic and biogeographic knowledge, provides richer and deeper visualization and information on diversification events than ever before. Geographical information systems such as QGIS provide an environment for spatial modeling, analysis, and dissemination by which phylogenetic models can be explicitly linked with their associated spatial data, and subsequently, they will be integrated with other related georeferenced datasets describing the biotic and abiotic environment. We are introducing PHYLOGEOrec, a QGIS plugin for building spatial phylogeographic reconstructions constructed from phylogenetic tree and geographical information data based on QGIS2threejs. By using PHYLOGEOrec, researchers can integrate existing phylogeny and geographical information data, resulting in three-dimensional geographic visualizations of phylogenetic trees in the Keyhole Markup Language (KML) format. Such formats can be overlaid on a map using QGIS and finally, spatially viewed in QGIS by means of a QGIS2threejs engine for further analysis. KML can also be viewed in reputable geobrowsers with KML-support (i.e., Google Earth).
USDA-ARS?s Scientific Manuscript database
Arylamine N-acetyltransferases (NATs) are xenobiotic metabolizing enzymes characterized in several bacteria and eukaryotic organisms. We report a comprehensive phylogenetic analysis employing an exhaustive dataset of NAT-homologous sequences recovered through inspection of 2445 genomes. We describe ...
Whole genome sequence phylogenetic analysis of four Mexican rabies viruses isolated from cattle.
Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F
2017-08-01
Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive to rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced in an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil, and the French Guyana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting common source of infection. The phylogenetic analysis shows that the type of host is different for the different regions in the American Continent; rabies is more related to bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.
Hembry, David H; Raimundo, Rafael L G; Newman, Erica A; Atkinson, Lesje; Guo, Chang; Guimarães, Paulo R; Gillespie, Rosemary G
2018-04-25
Biological intimacy-the degree of physical proximity or integration of partner taxa during their life cycles-is thought to promote the evolution of reciprocal specialization and modularity in the networks formed by co-occurring mutualistic species, but this hypothesis has rarely been tested. Here, we test this "biological intimacy hypothesis" by comparing the network architecture of brood pollination mutualisms, in which specialized insects are simultaneously parasites (as larvae) and pollinators (as adults) of their host plants to that of other mutualisms which vary in their biological intimacy (including ant-myrmecophyte, ant-extrafloral nectary, plant-pollinator and plant-seed disperser assemblages). We use a novel dataset sampled from leafflower trees (Phyllanthaceae: Phyllanthus s. l. [Glochidion]) and their pollinating leafflower moths (Lepidoptera: Epicephala) on three oceanic islands (French Polynesia) and compare it to equivalent published data from congeners on continental islands (Japan). We infer taxonomic diversity of leafflower moths using multilocus molecular phylogenetic analysis and examine several network structural properties: modularity (compartmentalization), reciprocality (symmetry) of specialization and algebraic connectivity. We find that most leafflower-moth networks are reciprocally specialized and modular, as hypothesized. However, we also find that two oceanic island networks differ in their modularity and reciprocal specialization from the others, as a result of a supergeneralist moth taxon which interacts with nine of 10 available hosts. Our results generally support the biological intimacy hypothesis, finding that leafflower-moth networks (usually) share a reciprocally specialized and modular structure with other intimate mutualisms such as ant-myrmecophyte symbioses, but unlike nonintimate mutualisms such as seed dispersal and nonintimate pollination. Additionally, we show that generalists-common in nonintimate mutualisms-can also evolve in intimate mutualisms, and that their effect is similar in both types of assemblages: once generalists emerge they reshape the network organization by connecting otherwise isolated modules. © 2018 The Authors. Journal of Animal Ecology © 2018 British Ecological Society.
Phylogenetic turnover along local environmental gradients in tropical forest communities.
Baldeck, C A; Kembel, S W; Harms, K E; Yavitt, J B; John, R; Turner, B L; Madawala, S; Gunatilleke, N; Gunatilleke, S; Bunyavejchewin, S; Kiratiprayoon, S; Yaacob, A; Supardi, M N N; Valencia, R; Navarrete, H; Davies, S J; Chuyong, G B; Kenfack, D; Thomas, D W; Dalling, J W
2016-10-01
While the importance of local-scale habitat niches in shaping tree species turnover along environmental gradients in tropical forests is well appreciated, relatively little is known about the influence of phylogenetic signal in species' habitat niches in shaping local community structure. We used detailed maps of the soil resource and topographic variation within eight 24-50 ha tropical forest plots combined with species phylogenies created from the APG III phylogeny to examine how phylogenetic beta diversity (indicating the degree of phylogenetic similarity of two communities) was related to environmental gradients within tropical tree communities. Using distance-based redundancy analysis we found that phylogenetic beta diversity, expressed as either nearest neighbor distance or mean pairwise distance, was significantly related to both soil and topographic variation in all study sites. In general, more phylogenetic beta diversity within a forest plot was explained by environmental variables this was expressed as nearest neighbor distance versus mean pairwise distance (3.0-10.3 % and 0.4-8.8 % of variation explained among plots, respectively), and more variation was explained by soil resource variables than topographic variables using either phylogenetic beta diversity metric. We also found that patterns of phylogenetic beta diversity expressed as nearest neighbor distance were consistent with previously observed patterns of niche similarity among congeneric species pairs in these plots. These results indicate the importance of phylogenetic signal in local habitat niches in shaping the phylogenetic structure of tropical tree communities, especially at the level of close phylogenetic neighbors, where similarity in habitat niches is most strongly preserved.
Badiane, Arnaud; Garcia-Porta, Joan; Červenka, Jan; Kratochvíl, Lukáš; Sindaco, Roberto; Robinson, Michael D; Morales, Hernan; Mazuch, Tomáš; Price, Thomas; Amat, Fèlix; Shobrak, Mohammed Y; Wilms, Thomas; Simó-Riudalbas, Marc; Ahmadzadeh, Faraham; Papenfuss, Theodore J; Cluchier, Alexandre; Viglione, Julien; Carranza, Salvador
2014-07-09
A molecular phylogeny of the sphaerodactylid geckos of the genus Pristurus is inferred based on an alignment of 1845 base pairs (bp) of concatenated mitochondrial (12S) and nuclear (acm4, cmos, rag1 and rag2) genes for 80 individuals, representing 18 of the 23-26 species, and the three subspecies of P. rupestris. The results indicate that P. rupestris is polyphyletic and includes two highly divergent clades: the eastern clade, found in coastal Iran and throughout the Hajar Mountain range in northern Oman and eastern UAE; and the western clade, distributed from central coastal Oman, through Yemen, Saudi Arabia and north to southern Jordan. Inferred haplotype networks for the four nuclear genes show that the eastern and western clades of "P. rupestris" are highly differentiated and do not share any alleles. Moreover, although the two clades are differentiated by a morphological multivariate analysis, no one character or set of characters was found to be diagnostic. Based on the molecular analysis of specimens from the type locality of P. rupestris rupestris, the name P. rupestris is applied to the eastern clade. The name that should apply to the western clade cannot be clarified until morphological and genetic data for "P. rupestris" is available from the vicinity of Bosaso, Somalia, and therefore we refer to it as Pristurus sp. 1. The phylogenetic tree of Pristurus supports the hypothesis that P. celerrimus is sister to all the other species in the analyses and that the Socotra Archipelago was independently colonized a minimum of two times.
Cheng, Kun; Rong, Xiaoying; Huang, Ying
2016-09-01
Homologous recombination is increasingly being recognized as a driving force in microbial evolution. However, recombination in streptomycetes, a rich source of diverse secondary metabolites, particularly among different species, remains minimally investigated. In this study, the largest sample of Streptomyces species to date, consisting of 142 type strains spanning the genus, with available sequences of 16S rRNA, atpD, gyrB, recA, rpoB and trpB genes, were collected and subjected to a comprehensive population genetic analysis to generate an overall estimate of the level of Streptomyces interspecies genetic exchange and its effect on the evolution of this genus. The results indicate frequent homologous recombination among Streptomyces species, which occurred three times more frequently and was nearly 14 times more important than point mutation in nucleotide sequence divergence (ρ/θw=3.10, r/m=13.74). As a result, a facilitating effect on the evolutionary process and confusion in phylogenetic relationships were observed, as well as a number of specific transfer events of the six gene fragments. A resultant phylogenetic network depicted extensive horizontal genetic exchange which decays clonality in streptomycetes. Moreover, seven evolutionary lineage groups were identified in the present sample in the Structure analysis, generally consistent with morphological and physiological data, and the contribution of recombination was detected to be varied among them. Our analyses demonstrated a reticulate evolution within Streptomyces due to the high level of interspecies gene exchange, which greatly challenges the traditional tree-shaped phylogeny in this genus and may advance our evolutionary understanding of a genuine Streptomyces species. Copyright © 2016 Elsevier Inc. All rights reserved.
Comparative Study of Lectin Domains in Model Species: New Insights into Evolutionary Dynamics
Van Holle, Sofie; De Schutter, Kristof; Eggermont, Lore; Tsaneva, Mariya; Dang, Liuyi; Van Damme, Els J. M.
2017-01-01
Lectins are present throughout the plant kingdom and are reported to be involved in diverse biological processes. In this study, we provide a comparative analysis of the lectin families from model species in a phylogenetic framework. The analysis focuses on the different plant lectin domains identified in five representative core angiosperm genomes (Arabidopsis thaliana, Glycine max, Cucumis sativus, Oryza sativa ssp. japonica and Oryza sativa ssp. indica). The genomes were screened for genes encoding lectin domains using a combination of Basic Local Alignment Search Tool (BLAST), hidden Markov models, and InterProScan analysis. Additionally, phylogenetic relationships were investigated by constructing maximum likelihood phylogenetic trees. The results demonstrate that the majority of the lectin families are present in each of the species under study. Domain organization analysis showed that most identified proteins are multi-domain proteins, owing to the modular rearrangement of protein domains during evolution. Most of these multi-domain proteins are widespread, while others display a lineage-specific distribution. Furthermore, the phylogenetic analyses reveal that some lectin families evolved to be similar to the phylogeny of the plant species, while others share a closer evolutionary history based on the corresponding protein domain architecture. Our results yield insights into the evolutionary relationships and functional divergence of plant lectins. PMID:28587095
Phylogenetic Analysis of Dengue Virus in Bangkalan, Madura Island, East Java Province, Indonesia.
Sucipto, Teguh Hari; Kotaki, Tomohiro; Mulyatno, Kris Cahyo; Churrotin, Siti; Labiqah, Amaliah; Soegijanto, Soegeng; Kameoka, Masanori
2018-01-01
Dengue virus (DENV) infection is a major health issue in tropical and subtropical areas. Indonesia is one of the biggest dengue endemic countries in the world. In the present study, the phylogenetic analysis of DENV in Bangkalan, Madura Island, Indonesia, was performed in order to obtain a clearer understanding of its dynamics in this country. A total of 359 blood samples from dengue-suspected patients were collected between 2012 and 2014. Serotyping was conducted using a multiplex Reverse Transcriptase-Polymerase Chain Reaction and a phylogenetic analysis of E gene sequences was performed using the Bayesian Markov chain Monte Carlo (MCMC) method. 17 out of 359 blood samples (4.7%) were positive for the isolation of DENV. Serotyping and the phylogenetic analysis revealed the predominance of DENV-1 genotype I (9/17, 52.9%), followed by DENV-2 Cosmopolitan type (7/17, 41.2%) and DENV-3 genotype I (1/17, 5.9%) . DENV-4 was not isolated. The Madura Island isolates showed high nucleotide similarity to other Indonesian isolates, indicating frequent virus circulation in Indonesia. The results of the present study highlight the importance of continuous viral surveillance in dengue endemic areas in order to obtain a clearer understanding of the dynamics of DENV in Indonesia.
Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D
2015-02-01
The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.
Computational biomechanics changes our view on insect head evolution.
Blanke, Alexander; Watson, Peter J; Holbrey, Richard; Fagan, Michael J
2017-02-08
Despite large-scale molecular attempts, the relationships of the basal winged insect lineages dragonflies, mayflies and neopterans, are still unresolved. Other data sources, such as morphology, suffer from unclear functional dependencies of the structures considered, which might mislead phylogenetic inference. Here, we assess this problem by combining for the first time biomechanics with phylogenetics using two advanced engineering techniques, multibody dynamics analysis and finite-element analysis, to objectively identify functional linkages in insect head structures which have been used traditionally to argue basal winged insect relationships. With a biomechanical model of unprecedented detail, we are able to investigate the mechanics of morphological characters under biologically realistic load, i.e. biting. We show that a range of head characters, mainly ridges, endoskeletal elements and joints, are indeed mechanically linked to each other. An analysis of character state correlation in a morphological data matrix focused on head characters shows highly significant correlation of these mechanically linked structures. Phylogenetic tree reconstruction under different data exclusion schemes based on the correlation analysis unambiguously supports a sistergroup relationship of dragonflies and mayflies. The combination of biomechanics and phylogenetics as it is proposed here could be a promising approach to assess functional dependencies in many organisms to increase our understanding of phenotypic evolution. © 2017 The Author(s).
Stanevičiūtė, Gražina; Stunžėnas, Virmantas; Petkevičiūtė, Romualda
2015-01-01
The family Echinostomatidae Looss, 1899 exhibits a substantial taxonomic diversity, morphological criteria adopted by different authors have resulted in its subdivision into an impressive number of subfamilies. The status of the subfamily Echinochasminae Odhner, 1910 was changed in various classifications. Genetic characteristics and phylogenetic analysis of four Echinostomatidae species - Echinochasmus sp., Echinochasmuscoaxatus Dietz, 1909, Stephanoprorapseudoechinata (Olsson, 1876) and Echinoparyphiummordwilkoi Skrjabin, 1915 were obtained to understand well enough the homogeneity of the Echinochasminae and phylogenetic relationships within the Echinostomatidae. Chromosome set and nuclear rDNA (ITS2 and 28S) sequences of parthenites of Echinochasmus sp. were studied. The karyotype of this species (2n=20, one pair of large bi-armed chromosomes and others are smaller-sized, mainly one-armed, chromosomes) differed from that previously described for two other representatives of the Echinochasminae, Echinochasmusbeleocephalus (von Linstow, 1893), 2n=14, and Episthmiumbursicola (Creplin, 1937), 2n=18. In phylogenetic trees based on ITS2 and 28S datasets, a well-supported subclade with Echinochasmus sp. and Stephanoprorapseudoechinata clustered with one well-supported clade together with Echinochasmusjaponicus Tanabe, 1926 (data only for 28S) and Echinochasmuscoaxatus. These results supported close phylogenetic relationships between Echinochasmus Dietz, 1909 and Stephanoprora Odhner, 1902. Phylogenetic analysis revealed a clear separation of related species of Echinostomatoidea restricted to prosobranch snails as first intermediate hosts, from other species of Echinostomatidae and Psilostomidae, developing in Lymnaeoidea snails as first intermediate hosts. According to the data based on rDNA phylogeny, it was supposed that evolution of parasitic flukes linked with first intermediate hosts. Digeneans parasitizing prosobranch snails showed higher dynamic of karyotype evolution provided by different chromosomal rearrangements including Robertsonian translocations and pericentric inversions than more stable karyotype of digenean worms parasitizing lymnaeoid pulmonate snails.
Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand
2009-12-29
Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.
Tjahjono, Elissa; Kirienko, Natalia V
2017-06-01
All living organisms exist in a precarious state of homeostasis that requires constant maintenance. A wide variety of stresses, including hypoxia, heat, and infection by pathogens perpetually threaten to imbalance this state. Organisms use a battery of defenses to mitigate damage and restore normal function. Previously, we described a Caenorhabditis elegans-Pseudomonas aeruginosa assay (Liquid Killing) in which toxicity to the host is dependent upon the secreted bacterial siderophore pyoverdine. Although pyoverdine is also indispensable for virulence in mammals, its cytological effects are unclear. We used genetics, transcriptomics, and a variety of pathogen and chemical exposure assays to study the interactions between P. aeruginosa and C. elegans. Although P. aeruginosa can kill C. elegans through at least 5 different mechanisms, the defense responses activated by Liquid Killing are specific and selective and have little in common with innate defense mechanisms against intestinal colonization. Intriguingly, the defense response utilizes the phylogenetically-conserved ESRE (Ethanol and Stress Response Element) network, which we and others have previously shown to mitigate damage from a variety of abiotic stresses. This is the first report of this networks involvement in innate immunity, and indicates that host innate immune responses overlap with responses to abiotic stresses. The upregulation of the ESRE network in C. elegans is mediated in part by a family of bZIP proteins (including ZIP-2, ZIP-4, CEBP-1, and CEBP-2) that have overlapping and unique functions. Our data convincingly show that, following exposure to P. aeruginosa, the ESRE defense network is activated by mitochondrial damage, and that mitochondrial damage also leads to ESRE activation in mammals. This establishes a role for ESRE in a phylogenetically-conserved mitochondrial surveillance system important for stress response and innate immunity.
Developmental Stage of Parasites Influences the Structure of Fish-Parasite Networks
Bellay, Sybelle; de Oliveira, Edson Fontes; Almeida-Neto, Mário; Lima Junior, Dilermando Pereira; Takemoto, Ricardo Massato; Luque, José Luis
2013-01-01
Specialized interactions tend to be more common in systems that require strong reciprocal adaptation between species, such as those observed between parasites and hosts. Parasites exhibit a high diversity of species and life history strategies, presenting host specificity which increases the complexity of these antagonistic systems. However, most studies are limited to the description of interactions between a few parasite and host species, which restricts our understanding of these systems as a whole. We investigated the effect of the developmental stage of the parasite on the structure of 30 metazoan fish-parasite networks, with an emphasis on the specificity of the interactions, connectance and modularity. We assessed the functional role of each species in modular networks and its interactions within and among the modules according to the developmental stage (larval and adult) and taxonomic group of the parasites. We observed that most parasite and host species perform a few interactions but that parasites at the larval stage tended to be generalists, increasing the network connectivity within and among modules. The parasite groups did not differ among each other in the number of interactions within and among the modules when considering only species at the larval stage. However, the same groups of adult individuals differed from each other in their interaction patterns, which were related to variations in the degree of host specificity at this stage. Our results show that the interaction pattern of fishes with parasites, such as acanthocephalans, cestodes, digeneans and nematodes, is more closely associated with their developmental stage than their phylogenetic history. This finding corroborates the hypothesis that the life history of parasites results in adaptations that cross phylogenetic boundaries. PMID:24124506
Developmental stage of parasites influences the structure of fish-parasite networks.
Bellay, Sybelle; de Oliveira, Edson Fontes; Almeida-Neto, Mário; Lima Junior, Dilermando Pereira; Takemoto, Ricardo Massato; Luque, José Luis
2013-01-01
Specialized interactions tend to be more common in systems that require strong reciprocal adaptation between species, such as those observed between parasites and hosts. Parasites exhibit a high diversity of species and life history strategies, presenting host specificity which increases the complexity of these antagonistic systems. However, most studies are limited to the description of interactions between a few parasite and host species, which restricts our understanding of these systems as a whole. We investigated the effect of the developmental stage of the parasite on the structure of 30 metazoan fish-parasite networks, with an emphasis on the specificity of the interactions, connectance and modularity. We assessed the functional role of each species in modular networks and its interactions within and among the modules according to the developmental stage (larval and adult) and taxonomic group of the parasites. We observed that most parasite and host species perform a few interactions but that parasites at the larval stage tended to be generalists, increasing the network connectivity within and among modules. The parasite groups did not differ among each other in the number of interactions within and among the modules when considering only species at the larval stage. However, the same groups of adult individuals differed from each other in their interaction patterns, which were related to variations in the degree of host specificity at this stage. Our results show that the interaction pattern of fishes with parasites, such as acanthocephalans, cestodes, digeneans and nematodes, is more closely associated with their developmental stage than their phylogenetic history. This finding corroborates the hypothesis that the life history of parasites results in adaptations that cross phylogenetic boundaries.
The power and pitfalls of HIV phylogenetics in public health.
Brooks, James I; Sandstrom, Paul A
2013-07-25
Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.
Inferring Epidemic Contact Structure from Phylogenetic Trees
Leventhal, Gabriel E.; Kouyos, Roger; Stadler, Tanja; von Wyl, Viktor; Yerly, Sabine; Böni, Jürg; Cellerai, Cristina; Klimkait, Thomas; Günthard, Huldrych F.; Bonhoeffer, Sebastian
2012-01-01
Contact structure is believed to have a large impact on epidemic spreading and consequently using networks to model such contact structure continues to gain interest in epidemiology. However, detailed knowledge of the exact contact structure underlying real epidemics is limited. Here we address the question whether the structure of the contact network leaves a detectable genetic fingerprint in the pathogen population. To this end we compare phylogenies generated by disease outbreaks in simulated populations with different types of contact networks. We find that the shape of these phylogenies strongly depends on contact structure. In particular, measures of tree imbalance allow us to quantify to what extent the contact structure underlying an epidemic deviates from a null model contact network and illustrate this in the case of random mixing. Using a phylogeny from the Swiss HIV epidemic, we show that this epidemic has a significantly more unbalanced tree than would be expected from random mixing. PMID:22412361
Muangkram, Yuttamol; Amano, Akira; Wajjwalku, Worawidh; Pinyopummintr, Tanu; Thongtip, Nikorn; Kaolim, Nongnid; Sukmak, Manakorn; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Maikaew, Umaporn; Thomas, Warisara; Polsrila, Kanda; Dongsaard, Kwanreaun; Sanannu, Saowaphang; Wattananorrasate, Anuwat
2017-07-01
The Asian tapir (Tapirus indicus) has been classified as Endangered on the IUCN Red List of Threatened Species (2008). Genetic diversity data provide important information for the management of captive breeding and conservation of this species. We analyzed mitochondrial control region (CR) sequences from 37 captive Asian tapirs in Thailand. Multiple alignments of the full-length CR sequences sized 1268 bp comprised three domains as described in other mammal species. Analysis of 16 parsimony-informative variable sites revealed 11 haplotypes. Furthermore, the phylogenetic analysis using median-joining network clearly showed three clades correlated with our earlier cytochrome b gene study in this endangered species. The repetitive motif is located between first and second conserved sequence blocks, similar to the Brazilian tapir. The highest polymorphic site was located in the extended termination associated sequences domain. The results could be applied for future genetic management based in captivity and wild that shows stable populations.
NASA Technical Reports Server (NTRS)
Woese, C. R.; Achenbach, L.; Rouviere, P.; Mandelco, L.
1991-01-01
A major and too little recognized source of artifact in phylogenetic analysis of molecular sequence data is compositional difference among sequences. The problem becomes particularly acute when alignments contain ribosomal RNAs from both mesophilic and thermophilic species. Among prokaryotes the latter are considerably higher in G + C content than the former, which often results in artificial clustering of thermophilic lineages and their being placed artificially deep in phylogenetic trees. In this communication we review archaeal phylogeny in the light of this consideration, focusing in particular on the phylogenetic position of the sulfate reducing species Archaeoglobus fulgidus, using both 16S rRNA and 23S rRNA sequences. The analysis shows clearly that the previously reported deep branching of the A. fulgidus lineage (very near the base of the euryarchaeal side of the archaeal tree) is incorrect, and that the lineage actually groups with a previously recognized unit that comprises the Methanomicrobiales and extreme halophiles.
Phylogenetic Diversity in the Macromolecular Composition of Microalgae
Finkel, Zoe V.; Follows, Mick J.; Liefer, Justin D.; Brown, Chris M.; Benner, Ina; Irwin, Andrew J.
2016-01-01
The elemental stoichiometry of microalgae reflects their underlying macromolecular composition and influences competitive interactions among species and their role in the food web and biogeochemistry. Here we provide a new estimate of the macromolecular composition of microalgae using a hierarchical Bayesian analysis of data compiled from the literature. The median macromolecular composition of nutrient-sufficient exponentially growing microalgae is 32.2% protein, 17.3% lipid, 15.0% carbohydrate, 17.3% ash, 5.7% RNA, 1.1% chlorophyll-a and 1.0% DNA as percent dry weight. Our analysis identifies significant phylogenetic differences in macromolecular composition undetected by previous studies due to small sample sizes and the large inherent variability in macromolecular pools. The phylogenetic differences in macromolecular composition lead to variations in carbon-to-nitrogen ratios that are consistent with independent observations. These phylogenetic differences in macromolecular and elemental composition reflect adaptations in cellular architecture and biochemistry; specifically in the cell wall, the light harvesting apparatus, and storage pools. PMID:27228080
Development of a Prognostic Marker for Lung Cancer Using Analysis of Tumor Evolution
2016-08-01
construct evolutionary trees , the characteristics of which will be used to predict whether a tumor will metastasize or not. We established a procedure for...of populations, the evolution of tumor cells within a tumor can be diagrammed on a phylogenetic tree . The more diverse a tumor’s phylogenetic tree ...individual tumor cells from the tumors of a training set of patients (half early stage, half late stage). We will reconstruct each tumor’s phylogenetic tree
Wonglersak, Rungtip; Cronk, Quentin; Percy, Diana
2017-01-01
Abstract Background The common nettle (Urtica dioica L.) is co-associated with willows (Salix spp.) in riparian habitats across Europe. We sampled the widespread nettle psyllid, Trioza urticae (Linné, 1758), from Urtica in willow habitats on a megatransect of Europe from the Aegean to the Arctic Ocean. The aim of this study was to use an unusually widespread insect to assess the influence of geographic distances and natural geographic barriers on patterns of genetic variation and haplotype distribution. New information Phylogeographic analysis using DNA sequences of two mtDNA regions, COI and cytB, shows that T. urticae specimens are organized into four regional groups (southern, central, northern and arctic). These groups are supported by both phylogenetic analysis (four geographically-based clades) and network analysis (four major haplotype groups). The boundary between southern and central groups corresponds to the Carpathian Mountains and the boundary between the central and northern groups corresponds to the Gulf of Finland. Overall these groups form a latitudinal cline in genetic diversity, which decreases with increasing latitude. PMID:28325977
Extracellular chloride signals collagen IV network assembly during basement membrane formation
Cummings, Christopher F.; Pedchenko, Vadim; Brown, Kyle L.; Colon, Selene; Rafi, Mohamed; Jones-Paris, Celestial; Pokydeshava, Elena; Liu, Min; Pastor-Pareja, Jose C.; Stothers, Cody; Ero-Tolliver, Isi A.; McCall, A. Scott; Vanacore, Roberto; Bhave, Gautam; Santoro, Samuel; Blackwell, Timothy S.; Zent, Roy; Pozzi, Ambra
2016-01-01
Basement membranes are defining features of the cellular microenvironment; however, little is known regarding their assembly outside cells. We report that extracellular Cl− ions signal the assembly of collagen IV networks outside cells by triggering a conformational switch within collagen IV noncollagenous 1 (NC1) domains. Depletion of Cl− in cell culture perturbed collagen IV networks, disrupted matrix architecture, and repositioned basement membrane proteins. Phylogenetic evidence indicates this conformational switch is a fundamental mechanism of collagen IV network assembly throughout Metazoa. Using recombinant triple helical protomers, we prove that NC1 domains direct both protomer and network assembly and show in Drosophila that NC1 architecture is critical for incorporation into basement membranes. These discoveries provide an atomic-level understanding of the dynamic interactions between extracellular Cl− and collagen IV assembly outside cells, a critical step in the assembly and organization of basement membranes that enable tissue architecture and function. Moreover, this provides a mechanistic framework for understanding the molecular pathobiology of NC1 domains. PMID:27216258
Ai, Dongmei; Huang, Ruocheng; Wen, Jin; Li, Chao; Zhu, Jiangping; Xia, Li Charlie
2017-01-25
Periodontitis is an inflammatory disease affecting the tissues supporting teeth (periodontium). Integrative analysis of metagenomic samples from multiple periodontitis studies is a powerful way to examine microbiota diversity and interactions within host oral cavity. A total of 43 subjects were recruited to participate in two previous studies profiling the microbial community of human subgingival plaque samples using shotgun metagenomic sequencing. We integrated metagenomic sequence data from those two studies, including six healthy controls, 14 sites representative of stable periodontitis, 16 sites representative of progressing periodontitis, and seven periodontal sites of unknown status. We applied phylogenetic diversity, differential abundance, and network analyses, as well as clustering, to the integrated dataset to compare microbiological community profiles among the different disease states. We found alpha-diversity, i.e., mean species diversity in sites or habitats at a local scale, to be the single strongest predictor of subjects' periodontitis status (P < 0.011). More specifically, healthy subjects had the highest alpha-diversity, while subjects with stable sites had the lowest alpha-diversity. From these results, we developed an alpha-diversity logistic model-based naive classifier able to perfectly predict the disease status of the seven subjects with unknown periodontal status (not used in training). Phylogenetic profiling resulted in the discovery of nine marker microbes, and these species are able to differentiate between stable and progressing periodontitis, achieving an accuracy of 94.4%. Finally, we found that the reduction of negatively correlated species is a notable signature of disease progression. Our results consistently show a strong association between the loss of oral microbiota diversity and the progression of periodontitis, suggesting that metagenomics sequencing and phylogenetic profiling are predictive of early periodontitis, leading to potential therapeutic intervention. Our results also support a keystone pathogen-mediated polymicrobial synergy and dysbiosis (PSD) model to explain the etiology of periodontitis. Apart from P. gingivalis, we identified three additional keystone species potentially mediating the progression of periodontitis progression based on pathogenic characteristics similar to those of known keystone pathogens.
Phylogenetic analysis of several Thermus strains from Rehai of Tengchong, Yunnan, China.
Lin, Lianbing; Zhang, Jie; Wei, Yunlin; Chen, Chaoyin; Peng, Qian
2005-10-01
Several Thermus strains were isolated from 10 hot springs of the Rehai geothermal area in Tengchong, Yunnan province. The diversity of Thermus strains was examined by sequencing the 16S rRNA genes and comparing their sequences. Phylogenetic analysis showed that the 16S rDNA sequences from the Rehai geothermal isolates form four branches in the phylogenetic tree and had greater than 95.9% similarity in the phylogroup. Secondary structure comparison also indicated that the 16S rRNA from the Rehai geothermal isolates have unique secondary structure characteristics in helix 6, helix 9, and helix 10 (reference to Escherichia coli). This research is the first attempt to reveal the diversity of Thermus strains that are distributed in the Rehai geothermal area.
PAL: an object-oriented programming library for molecular evolution and phylogenetics.
Drummond, A; Strimmer, K
2001-07-01
Phylogenetic Analysis Library (PAL) is a collection of Java classes for use in molecular evolution and phylogenetics. PAL provides a modular environment for the rapid construction of both special-purpose and general analysis programs. PAL version 1.1 consists of 145 public classes or interfaces in 13 packages, including classes for models of character evolution, maximum-likelihood estimation, and the coalescent, with a total of more than 27000 lines of code. The PAL project is set up as a collaborative project to facilitate contributions from other researchers. AVAILIABILTY: The program is free and is available at http://www.pal-project.org. It requires Java 1.1 or later. PAL is licensed under the GNU General Public License.
Stevenson, Pablo R.; Link, Andrés; González-Caro, Sebastian; Torres-Jiménez, María Fernanda
2015-01-01
Frugivory is a widespread mutualistic interaction in which frugivores obtain nutritional resources while favoring plant recruitment through their seed dispersal services. Nonetheless, how these complex interactions are organized in diverse communities, such as tropical forests, is not fully understood. In this study we evaluated the existence of plant-frugivore sub-assemblages and their phylogenetic organization in an undisturbed western Amazonian forest in Colombia. We also explored for potential keystone plants, based on network analyses and an estimate of the amount of fruit going from plants to frugivores. We carried out diurnal observations on 73 canopy plant species during a period of two years. During focal tree sampling, we recorded frugivore identity, the duration of each individual visit, and feeding rates. We did not find support for the existence of sub assemblages, such as specialized vs. generalized dispersal systems. Visitation rates on the vast majority of canopy species were associated with the relative abundance of frugivores, in which ateline monkeys (i.e. Lagothrix and Ateles) played the most important roles. All fruiting plants were visited by a variety of frugivores and the phylogenetic assemblage was random in more than 67% of the cases. In cases of aggregation, the plant species were consumed by only primates or only birds, and filters were associated with fruit protection and likely chemical content. Plants suggested as keystone species based on the amount of pulp going from plants to frugivores differ from those suggested based on network approaches. Our results suggest that in tropical forests most tree-frugivore interactions are generalized, and abundance should be taken into account when assessing the most important plants for frugivores. PMID:26492037
GeneBee-net: Internet-based server for analyzing biopolymers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brodsky, L.I.; Ivanov, V.V.; Nikolaev, V.K.
This work describes a network server for searching databanks of biopolymer structures and performing other biocomputing procedures; it is available via direct Internet connection. Basic server procedures are dedicated to homology (similarity) search of sequence and 3D structure of proteins. The homologies found could be used to build multiple alignments, predict protein and RNA secondary structure, and construct phylogenetic trees. In addition to traditional methods of sequence similarity search, the authors propose {open_quotes}non-matrix{close_quotes} (correlational) search. An analogous approach is used to identify regions of similar tertiary structure of proteins. Algorithm concepts and usage examples are presented for new methods. Servicemore » logic is based upon interaction of a client program and server procedures. The client program allows the compilation of queries and the processing of results of an analysis.« less
Winkler, Isaac S; Blaschke, Jeremy D; Davis, Daniel J; Stireman, John O; O'Hara, James E; Cerretti, Pierfilippo; Moulton, John K
2015-07-01
Molecular phylogenetic studies at all taxonomic levels often infer rapid radiation events based on short, poorly resolved internodes. While such rapid episodes of diversification are an important and widespread evolutionary phenomenon, much of this poor phylogenetic resolution may be attributed to the continuing widespread use of "traditional" markers (mitochondrial, ribosomal, and some nuclear protein-coding genes) that are often poorly suited to resolve difficult, higher-level phylogenetic problems. Here we reconstruct phylogenetic relationships among a representative set of taxa of the parasitoid fly family Tachinidae and related outgroups of the superfamily Oestroidea. The Tachinidae are one of the most species rich, yet evolutionarily recent families of Diptera, providing an ideal case study for examining the differential performance of loci in resolving phylogenetic relationships and the benefits of adding more loci to phylogenetic analyses. We assess the phylogenetic utility of nine genes including both traditional genes (e.g., CO1 mtDNA, 28S rDNA) and nuclear protein-coding genes newly developed for phylogenetic analysis. Our phylogenetic findings, based on a limited set of taxa, include: a close relationship between Tachinidae and the calliphorid subfamily Polleninae, monophyly of Tachinidae and the subfamilies Exoristinae and Dexiinae, subfamily groupings of Dexiinae+Phasiinae and Tachininae+Exoristinae, and robust phylogenetic placement of the somewhat enigmatic genera Strongygaster, Euthera, and Ceracia. In contrast to poor resolution and phylogenetic incongruence of "traditional genes," we find that a more selective set of highly informative genes is able to more precisely identify regions of the phylogeny that experienced rapid radiation of lineages, while more accurately depicting their phylogenetic context. Although much expanded taxon sampling is necessary to effectively assess the monophyly of and relationships among major tachinid lineages and their relatives, we show that a small number of well-chosen nuclear protein-coding genes can successfully resolve even difficult phylogenetic problems. Copyright © 2015 Elsevier Inc. All rights reserved.
[Short interspersed repetitive sequences (SINEs) and their use as a phylogenetic tool].
Kramerov, D A; Vasetskiĭ, N S
2009-01-01
The data on one of the most common repetitive elements of eukaryotic genomes, short interspersed elements (SINEs), are reviewed. Their structure, origin, and functioning in the genome are discussed. The variation and abundance of these neutral genomic markers makes them a convenient and reliable tool for phylogenetic analysis. The main methods of such analysis are presented, and the potential and limitations of this approach are discussed using specific examples.
Acremonium phylogenetic overview and revision of Gliomastix, Sarocladium, and Trichothecium.
Summerbell, R C; Gueidan, C; Schroers, H-J; de Hoog, G S; Starink, M; Rosete, Y Arocha; Guarro, J; Scott, J A
2011-01-01
Over 200 new sequences are generated for members of the genus Acremonium and related taxa including ribosomal small subunit sequences (SSU) for phylogenetic analysis and large subunit (LSU) sequences for phylogeny and DNA-based identification. Phylogenetic analysis reveals that within the Hypocreales, there are two major clusters containing multiple Acremonium species. One clade contains Acremonium sclerotigenum, the genus Emericellopsis, and the genus Geosmithia as prominent elements. The second clade contains the genera Gliomastixsensu stricto and Bionectria. In addition, there are numerous smaller clades plus two multi-species clades, one containing Acremonium strictum and the type species of the genus Sarocladium, and, as seen in the combined SSU/LSU analysis, one associated subclade containing Acremonium breve and related species plus Acremonium curvulum and related species. This sequence information allows the revision of three genera. Gliomastix is revived for five species, G. murorum, G. polychroma, G. tumulicola, G. roseogrisea, and G. masseei. Sarocladium is extended to include all members of the phylogenetically distinct A. strictum clade including the medically important A. kiliense and the protective maize endophyte A. zeae. Also included in Sarocladium are members of the phylogenetically delimited Acremonium bacillisporum clade, closely linked to the A. strictum clade. The genus Trichothecium is revised following the principles of unitary nomenclature based on the oldest valid anamorph or teleomorph name, and new combinations are made in Trichothecium for the tightly interrelated Acremonium crotocinigenum, Spicellum roseum, and teleomorph Leucosphaerinaindica. Outside the Hypocreales, numerous Acremonium-like species fall into the Plectosphaerellaceae, and A. atrogriseum falls into the Cephalothecaceae.
Kiwuwa-Muyingo, Sylvia; Nazziwa, Jamirah; Ssemwanga, Deogratius; Ilmonen, Pauliina; Ndembi, Nicaise; Parry, Chris; Kitandwe, Paul Kato; Gershim, Asiki; Mpendo, Juliet; Neilsen, Leslie; Seeley, Janet; Seppälä, Heikki; Lyagoba, Fred; Kamali, Anatoli; Kaleebu, Pontiano
2017-01-01
Background Fishing communities around Lake Victoria in sub-Saharan Africa have been characterised as a population at high risk of HIV-infection. Methods Using data from a cohort of HIV-positive individuals aged 13–49 years, enrolled from 5 fishing communities on Lake Victoria between 2009–2011, we sought to identify factors contributing to the epidemic and to understand the underlying structure of HIV transmission networks. Clinical and socio-demographic data were combined with HIV-1 phylogenetic analyses. HIV-1 gag-p24 and env-gp-41 sub-genomic fragments were amplified and sequenced from 283 HIV-1-infected participants. Phylogenetic clusters with ≥2 highly related sequences were defined as transmission clusters. Logistic regression models were used to determine factors associated with clustering. Results Altogether, 24% (n = 67/283) of HIV positive individuals with sequences fell within 34 phylogenetically distinct clusters in at least one gene region (either gag or env). Of these, 83% occurred either within households or within community; 8/34 (24%) occurred within household partnerships, and 20/34 (59%) within community. 7/12 couples (58%) within households clustered together. Individuals in clusters with potential recent transmission (11/34) were more likely to be younger 71% (15/21) versus 46% (21/46) in un-clustered individuals and had recently become resident in the community 67% (14/21) vs 48% (22/46). Four of 11 (36%) potential transmission clusters included incident-incident transmissions. Independently, clustering was less likely in HIV subtype D (adjusted Odds Ratio, aOR = 0.51 [95% CI 0.26–1.00]) than A and more likely in those living with an HIV-infected individual in the household (aOR = 6.30 [95% CI 3.40–11.68]). Conclusions A large proportion of HIV sexual transmissions occur within house-holds and within communities even in this key mobile population. The findings suggest localized HIV transmissions and hence a potential benefit for the test and treat approach even at a community level, coupled with intensified HIV counselling to identify early infections. PMID:29023474
Kiwuwa-Muyingo, Sylvia; Nazziwa, Jamirah; Ssemwanga, Deogratius; Ilmonen, Pauliina; Njai, Harr; Ndembi, Nicaise; Parry, Chris; Kitandwe, Paul Kato; Gershim, Asiki; Mpendo, Juliet; Neilsen, Leslie; Seeley, Janet; Seppälä, Heikki; Lyagoba, Fred; Kamali, Anatoli; Kaleebu, Pontiano
2017-01-01
Fishing communities around Lake Victoria in sub-Saharan Africa have been characterised as a population at high risk of HIV-infection. Using data from a cohort of HIV-positive individuals aged 13-49 years, enrolled from 5 fishing communities on Lake Victoria between 2009-2011, we sought to identify factors contributing to the epidemic and to understand the underlying structure of HIV transmission networks. Clinical and socio-demographic data were combined with HIV-1 phylogenetic analyses. HIV-1 gag-p24 and env-gp-41 sub-genomic fragments were amplified and sequenced from 283 HIV-1-infected participants. Phylogenetic clusters with ≥2 highly related sequences were defined as transmission clusters. Logistic regression models were used to determine factors associated with clustering. Altogether, 24% (n = 67/283) of HIV positive individuals with sequences fell within 34 phylogenetically distinct clusters in at least one gene region (either gag or env). Of these, 83% occurred either within households or within community; 8/34 (24%) occurred within household partnerships, and 20/34 (59%) within community. 7/12 couples (58%) within households clustered together. Individuals in clusters with potential recent transmission (11/34) were more likely to be younger 71% (15/21) versus 46% (21/46) in un-clustered individuals and had recently become resident in the community 67% (14/21) vs 48% (22/46). Four of 11 (36%) potential transmission clusters included incident-incident transmissions. Independently, clustering was less likely in HIV subtype D (adjusted Odds Ratio, aOR = 0.51 [95% CI 0.26-1.00]) than A and more likely in those living with an HIV-infected individual in the household (aOR = 6.30 [95% CI 3.40-11.68]). A large proportion of HIV sexual transmissions occur within house-holds and within communities even in this key mobile population. The findings suggest localized HIV transmissions and hence a potential benefit for the test and treat approach even at a community level, coupled with intensified HIV counselling to identify early infections.
Phylogeny of flowering plants by the chloroplast genome sequences: in search of a "lucky gene".
Logacheva, M D; Penin, A A; Samigullin, T H; Vallejo-Roman, C M; Antonov, A S
2007-12-01
One of the most complicated remaining problems of molecular-phylogenetic analysis is choosing an appropriate genome region. In an ideal case, such a region should have two specific properties: (i) results of analysis using this region should be similar to the results of multigene analysis using the maximal number of regions; (ii) this region should be arranged compactly and be significantly shorter than the multigene set. The second condition is necessary to facilitate sequencing and extension of taxons under analysis, the number of which is also crucial for molecular phylogenetic analysis. Such regions have been revealed for some groups of animals and have been designated as "lucky genes". We have carried out a computational experiment on analysis of 41 complete chloroplast genomes of flowering plants aimed at searching for a "lucky gene" for reconstruction of their phylogeny. It is shown that the phylogenetic tree inferred from a combination of translated nucleotide sequences of genes encoding subunits of plastid RNA polymerase is closest to the tree constructed using all protein coding sites of the chloroplast genome. The only node for which a contradiction is observed is unstable according to the different type analyses. For all the other genes or their combinations, the coincidence is significantly worse. The RNA polymerase genes are compactly arranged in the genome and are fourfold shorter than the total length of protein coding genes used for phylogenetic analysis. The combination of all necessary features makes this group of genes main candidates for the role of "lucky gene" in studying phylogeny of flowering plants.
Low, V L; Lim, P E; Chen, C D; Lim, Y A L; Tan, T K; Norma-Rashid, Y; Lee, H L; Sofian-Azirun, M
2014-06-01
The present study explored the intraspecific genetic diversity, dispersal patterns and phylogeographic relationships of Culex quinquefasciatus Say (Diptera: Culicidae) in Malaysia using reference data available in GenBank in order to reveal this species' phylogenetic relationships. A statistical parsimony network of 70 taxa aligned as 624 characters of the cytochrome c oxidase subunit I (COI) gene and 685 characters of the cytochrome c oxidase subunit II (COII) gene revealed three haplotypes (A1-A3) and four haplotypes (B1-B4), respectively. The concatenated sequences of both COI and COII genes with a total of 1309 characters revealed seven haplotypes (AB1-AB7). Analysis using tcs indicated that haplotype AB1 was the common ancestor and the most widespread haplotype in Malaysia. The genetic distance based on concatenated sequences of both COI and COII genes ranged from 0.00076 to 0.00229. Sequence alignment of Cx. quinquefasciatus from Malaysia and other countries revealed four haplotypes (AA1-AA4) by the COI gene and nine haplotypes (BB1-BB9) by the COII gene. Phylogenetic analyses demonstrated that Malaysian Cx. quinquefasciatus share the same genetic lineage as East African and Asian Cx. quinquefasciatus. This study has inferred the genetic lineages, dispersal patterns and hypothetical ancestral genotypes of Cx. quinquefasciatus. © 2013 The Royal Entomological Society.
Shiota, Seiji; Suzuki, Rumiko; Matsuo, Yuichi; Miftahussurur, Muhammad; Tran, Trang Thu Huyen; Binh, Tran Thanh; Yamaoka, Yoshio
2014-01-01
A recent report has shown that the phylogenetic origin of Helicobacter pylori based on multi-locus sequence typing (MLST) was significantly associated with the severity of gastritis in Colombia. However, the potential relationship between phylogenetic origin and clinical outcomes was not examined in that study. If the phylogenetic origin rather than virulence factors were truly associated with clinical outcomes, identifying a population at high risk for gastric cancer in Colombia would be relatively straightforward. In this study, we examined the phylogenetic origins of strains from gastric cancer and duodenal ulcer patients living in Bogota, Colombia. We included 35 gastric cancer patients and 31 duodenal ulcer patients, which are considered the variant outcomes. The genotypes of cagA and vacA were determined by polymerase chain reaction. The genealogy of these Colombian strains was analyzed by MLST. Bacterial population structure was analyzed using STRUCTURE software. H. pylori strains from gastric cancer and duodenal ulcer patients were scattered in the phylogenetic tree; thus, we did not detect any difference in phylogenetic distribution between gastric cancer and duodenal ulcer strains in the hpEurope group in Colombia. Sixty-six strains, with one exception, were classified as hpEurope irrespective of the cagA and vacA genotypes, and type of disease. STRUCTURE analysis revealed that Colombian hpEurope strains have a phylogenetic connection to Spanish strains. Our study showed that a phylogeographic origin determined by MLST was insufficient for distinguishing between gastric cancer and duodenal ulcer risk among hpEurope strains in the Andean region in Colombia. Our analysis also suggests that hpEurope strains in Colombia were primarily introduced by Spanish immigrants.
2012-01-01
Background Through next-generation sequencing, the amount of sequence data potentially available for phylogenetic analyses has increased exponentially in recent years. Simultaneously, the risk of incorporating ‘noisy’ data with misleading phylogenetic signal has also increased, and may disproportionately influence the topology of weakly supported nodes and lineages featuring rapid radiations and/or elevated rates of evolution. Results We investigated the influence of phylogenetic noise in large data sets by applying two fundamental strategies, variable site removal and long-branch exclusion, to the phylogenetic analysis of a full plastome alignment of 107 species of Pinus and six Pinaceae outgroups. While high overall phylogenetic resolution resulted from inclusion of all data, three historically recalcitrant nodes remained conflicted with previous analyses. Close investigation of these nodes revealed dramatically different responses to data removal. Whereas topological resolution and bootstrap support for two clades peaked with removal of highly variable sites, the third clade resolved most strongly when all sites were included. Similar trends were observed using long-branch exclusion, but patterns were neither as strong nor as clear. When compared to previous phylogenetic analyses of nuclear loci and morphological data, the most highly supported topologies seen in Pinus plastome analysis are congruent for the two clades gaining support from variable site removal and long-branch exclusion, but in conflict for the clade with highest support from the full data set. Conclusions These results suggest that removal of misleading signal in phylogenomic datasets can result not only in increased resolution for poorly supported nodes, but may serve as a tool for identifying erroneous yet highly supported topologies. For Pinus chloroplast genomes, removal of variable sites appears to be more effective than long-branch exclusion for clarifying phylogenetic hypotheses. PMID:22731878
Dornburg, Alex; Friedman, Matt; Near, Thomas J
2015-08-01
Elopomorpha is one of the three main clades of living teleost fishes and includes a range of disparate lineages including eels, tarpons, bonefishes, and halosaurs. Elopomorphs were among the first groups of fishes investigated using Hennigian phylogenetic methods and continue to be the object of intense phylogenetic scrutiny due to their economic significance, diversity, and crucial evolutionary status as the sister group of all other teleosts. While portions of the phylogenetic backbone for Elopomorpha are consistent between studies, the relationships among Albula, Pterothrissus, Notacanthiformes, and Anguilliformes remain contentious and difficult to evaluate. This lack of phylogenetic resolution is problematic as fossil lineages are often described and placed taxonomically based on an assumed sister group relationship between Albula and Pterothrissus. In addition, phylogenetic studies using morphological data that sample elopomorph fossil lineages often do not include notacanthiform or anguilliform lineages, potentially introducing a bias toward interpreting fossils as members of the common stem of Pterothrissus and Albula. Here we provide a phylogenetic analysis of DNA sequences sampled from multiple nuclear genes that include representative taxa from Albula, Pterothrissus, Notacanthiformes and Anguilliformes. We integrate our molecular dataset with a morphological character matrix that spans both living and fossil elopomorph lineages. Our results reveal substantial uncertainty in the placement of Pterothrissus as well as all sampled fossil lineages, questioning the stability of the taxonomy of fossil Elopomorpha. However, despite topological uncertainty, our integration of fossil lineages into a Bayesian time calibrated framework provides divergence time estimates for the clade that are consistent with previously published age estimates based on the elopomorph fossil record and molecular estimates resulting from traditional node-dating methods. Copyright © 2015 Elsevier Inc. All rights reserved.
Nosov, Nikita Yu; Krasnov, Yaroslav M.; Oglodin, Yevgeny G.; Kukleva, Lyubov M.; Guseva, Natalia P.; Kuznetsov, Alexander A.; Abdikarimov, Sabyrzhan T.; Dzhaparova, Aigul K.; Kutyrev, Vladimir V.
2017-01-01
Fifty six Yersinia pestis strains, isolated over the period of more than 50 years in three high-mountain foci of Kyrgyzstan (Tien Shan, Alai, and Talas), have been characterized by means of PCR and single nucleotide polymorphism (SNP) typing methods. Seven of these strains were also characterized by means of whole genome sequencing and genome-wide SNP phylogenetic analysis. It was found that forty two strains belong to 0.ANT2, 0.ANT3 and 0.ANT5 phylogenetic branches. From these, strains of 0.ANT2 and 0.ANT3 branches were earlier detected in China only, whereas 0.ANT5 phylogenetic branch was identified for Y. pestis phylogeny for the first time. According to the results of genome-wide SNP analysis, 0.ANT5 strains are ones of the most closely related to Y. pestis strain responsible for the Justinianic Plague. We have also found out that four of the studied strains belong to the phylogenetic branch 2.MED1, and ten strains from Talas high-mountain focus belong to the phylogenetic branch 0.PE4 (sub-branch 0.PE4t). Established diversity of Y. pestis strains and extensive dissemination of the strains pertaining to the 0.ANT branch confirm the antiquity of the mentioned above plague foci and suggest that strains of the 0.ANT branch, which serve as precursors for all highly virulent Y. pestis strains, had their origin in the Tien Shan mountains. PMID:29073248
Eroshenko, Galina A; Nosov, Nikita Yu; Krasnov, Yaroslav M; Oglodin, Yevgeny G; Kukleva, Lyubov M; Guseva, Natalia P; Kuznetsov, Alexander A; Abdikarimov, Sabyrzhan T; Dzhaparova, Aigul K; Kutyrev, Vladimir V
2017-01-01
Fifty six Yersinia pestis strains, isolated over the period of more than 50 years in three high-mountain foci of Kyrgyzstan (Tien Shan, Alai, and Talas), have been characterized by means of PCR and single nucleotide polymorphism (SNP) typing methods. Seven of these strains were also characterized by means of whole genome sequencing and genome-wide SNP phylogenetic analysis. It was found that forty two strains belong to 0.ANT2, 0.ANT3 and 0.ANT5 phylogenetic branches. From these, strains of 0.ANT2 and 0.ANT3 branches were earlier detected in China only, whereas 0.ANT5 phylogenetic branch was identified for Y. pestis phylogeny for the first time. According to the results of genome-wide SNP analysis, 0.ANT5 strains are ones of the most closely related to Y. pestis strain responsible for the Justinianic Plague. We have also found out that four of the studied strains belong to the phylogenetic branch 2.MED1, and ten strains from Talas high-mountain focus belong to the phylogenetic branch 0.PE4 (sub-branch 0.PE4t). Established diversity of Y. pestis strains and extensive dissemination of the strains pertaining to the 0.ANT branch confirm the antiquity of the mentioned above plague foci and suggest that strains of the 0.ANT branch, which serve as precursors for all highly virulent Y. pestis strains, had their origin in the Tien Shan mountains.
Worldwide phylogenetic relationship of avian poxviruses
Gyuranecz, Miklós; Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly
2013-01-01
Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we have expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g. starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy.
Worldwide Phylogenetic Relationship of Avian Poxviruses
Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly
2013-01-01
Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of the Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups, and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g., starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy. PMID:23408635
Lachance, Denis; Giguère, Isabelle; Séguin, Armand
2014-01-01
This research aimed to investigate the role of diverse transcription factors (TFs) and to delineate gene regulatory networks directly in conifers at a relatively high-throughput level. The approach integrated sequence analyses, transcript profiling, and development of a conifer-specific activation assay. Transcript accumulation profiles of 102 TFs and potential target genes were clustered to identify groups of coordinately expressed genes. Several different patterns of transcript accumulation were observed by profiling in nine different organs and tissues: 27 genes were preferential to secondary xylem both in stems and roots, and other genes were preferential to phelloderm and periderm or were more ubiquitous. A robust system has been established as a screening approach to define which TFs have the ability to regulate a given promoter in planta. Trans-activation or repression effects were observed in 30% of TF–candidate gene promoter combinations. As a proof of concept, phylogenetic analysis and expression and trans-activation data were used to demonstrate that two spruce NAC-domain proteins most likely play key roles in secondary vascular growth as observed in other plant species. This study tested many TFs from diverse families in a conifer tree species, which broadens the knowledge of promoter–TF interactions in wood development and enables comparisons of gene regulatory networks found in angiosperms and gymnosperms. PMID:24713992
Lin, Haijiang; He, Na; Zhou, Sujuan; Ding, Yingying; Qiu, Danhong; Zhang, Tiejun; Wong, Frank Y.
2013-01-01
Contact tracing, coupled with molecular epidemiologic investigation, is especially useful for identifying an infection with few cases in the population, such as human immunodeficiency virus (HIV) infection in China. No such research is available on Chinese men who have sex with men (MSM). From 2008 to 2010 in Taizhou Prefecture in China, every newly diagnosed HIV-infected MSM was invited to participate as an “index case” in a contact tracing survey by providing contact information for up to 8 sexual contacts, who themselves were approached to receive voluntary HIV counseling and testing. Those who tested HIV-positive were then subjected to another contact tracing survey. This process was repeated until no more sexual contacts were reported or tested positive. A total of 100 HIV-infected MSM served as “index cases,” including the initial 49 cases identified through routine surveillance programs and 51 cases from the present survey. Traced MSM exhibited little willingness to receive voluntary counseling and testing. CRF01_AE (HIV type 1) was the dominant subtype. Seven of 49 independent sexual networks were deemed HIV transmission clusters. Fear of stigma or discrimination may deter Chinese MSM from receiving voluntary counseling and testing. Nonetheless, the integration of behavioral network analysis and HIV phylogenetic analysis provides enhanced evidence for developing tailored prevention strategies for HIV-infected MSM. PMID:23348006
Herrnstadt, Corinna; Elson, Joanna L; Fahy, Eoin; Preston, Gwen; Turnbull, Douglass M; Anderson, Christen; Ghosh, Soumitra S; Olefsky, Jerrold M; Beal, M Flint; Davis, Robert E; Howell, Neil
2002-05-01
The evolution of the human mitochondrial genome is characterized by the emergence of ethnically distinct lineages or haplogroups. Nine European, seven Asian (including Native American), and three African mitochondrial DNA (mtDNA) haplogroups have been identified previously on the basis of the presence or absence of a relatively small number of restriction-enzyme recognition sites or on the basis of nucleotide sequences of the D-loop region. We have used reduced-median-network approaches to analyze 560 complete European, Asian, and African mtDNA coding-region sequences from unrelated individuals to develop a more complete understanding of sequence diversity both within and between haplogroups. A total of 497 haplogroup-associated polymorphisms were identified, 323 (65%) of which were associated with one haplogroup and 174 (35%) of which were associated with two or more haplogroups. Approximately one-half of these polymorphisms are reported for the first time here. Our results confirm and substantially extend the phylogenetic relationships among mitochondrial genomes described elsewhere from the major human ethnic groups. Another important result is that there were numerous instances both of parallel mutations at the same site and of reversion (i.e., homoplasy). It is likely that homoplasy in the coding region will confound evolutionary analysis of small sequence sets. By a linkage-disequilibrium approach, additional evidence for the absence of human mtDNA recombination is presented here.
Evaluating factors that predict the structure of a commensalistic epiphyte–phorophyte network
Sáyago, Roberto; Lopezaraiza-Mikel, Martha; Quesada, Mauricio; Álvarez-Añorve, Mariana Yolotl; Cascante-Marín, Alfredo; Bastida, Jesus Ma.
2013-01-01
A central issue in ecology is the understanding of the establishment of biotic interactions. We studied the factors that affect the assembly of the commensalistic interactions between vascular epiphytes and their host plants. We used an analytical approach that considers all individuals and species of epiphytic bromeliads and woody hosts and non-hosts at study plots. We built models of interaction probabilities among species to assess if host traits and abundance and spatial overlap of species predict the quantitative epiphyte–host network. Species abundance, species spatial overlap and host size largely predicted pairwise interactions and several network metrics. Wood density and bark texture of hosts also contributed to explain network structure. Epiphytes were more common on large hosts, on abundant woody species, with denser wood and/or rougher bark. The network had a low level of specialization, although several interactions were more frequent than expected by the models. We did not detect a phylogenetic signal on the network structure. The effect of host size on the establishment of epiphytes indicates that mature forests are necessary to preserve diverse bromeliad communities. PMID:23407832
USDA-ARS?s Scientific Manuscript database
Phylogenetic relatedness among ascomycetous yeast genera (subphylum Saccharomycotina, phylum Ascomycota) has been uncertain. In the present study, type species of 70 currently recognized genera are compared from divergence in the nearly entire nuclear gene sequences for large subunit rRNA, small sub...
Use of EST-SSR loci flanking regions for phylogenetic analysis of genus Arachis
USDA-ARS?s Scientific Manuscript database
All wild peanut collections in the genus Arachis were assigned to nine taxonomy sections on the bases of cross-compatibility and morphologic character clustering. These nine sections consist of 80 species from the most ancient to the most advanced, providing a diverse genetic resource for phylogenet...
USDA-ARS?s Scientific Manuscript database
Maruca vitrata Fabricius is a pantropical lepidopteran pest of legumes. Phylogenetic analysis of a mitochondrial cytochrome c oxidase-I gene (coxI) fragment indicates that three Maruca sp. mitochondrial lineages have unique geographic distributions [lineages 1 and 2: Australia, Taiwan, and West Afr...
Phylogenetic Relationships in Actinidia as Revealed by RAPD Analysis
Hongwen Huang; Zuozhou Li; Jianqiang Li; Thomas L. Kubiisiak; Desmond R. Lavne
2002-01-01
Phylogenetic relationships within the Actinidia were investigated using randomly amplified polymorphic DNA (RAPD) markers. DNAs from 10 taxa, including31 species encompassing all four sections and four series of the traditional subdivisions within the genus, were amplified using 22 preselected 10-mer oligonucieotide primers. A total 204 DNA bands...
USDA-ARS?s Scientific Manuscript database
Species of Epipolops Herrich-Schaeffer (Hemiptera: Geocoridae), comprising the largest genus of Pamphantinae, are among the most bizarre true bugs because of their striking morphology. To elucidate evolutionary morphology in Epipolops, a phylogenetic analysis was performed using 17 species and 36 ad...
Host specificity and phylogenetic relationships of chicken and turkey parvoviruses
USDA-ARS?s Scientific Manuscript database
Previous reports indicate that the newly discovered chicken parvoviruses (ChPV) and turkey parvoviruses (TuPV) are very similar to each other, yet they represent different species within a new genus of Parvoviridae. Currently, strain classification is based on the phylogenetic analysis of a 561 bas...
Liu, Jingjing; Wu, Weixiang; Chen, Chongjun; Sun, Faqian; Chen, Yingxu
2011-09-01
In order to obtain insight into the prokaryotic diversity and community in leachate sediment, a culture-independent DNA-based molecular phylogenetic approach was performed with archaeal and bacterial 16S rRNA gene clone libraries derived from leachate sediment of an aged landfill. A total of 59 archaeal and 283 bacterial rDNA phylotypes were identified in 425 archaeal and 375 bacterial analyzed clones. All archaeal clones distributed within two archaeal phyla of the Euryarchaeota and Crenarchaeota, and well-defined methanogen lineages, especially Methanosaeta spp., are the most numerically dominant species of the archaeal community. Phylogenetic analysis of the bacterial library revealed a variety of pollutant-degrading and biotransforming microorganisms, including 18 distinct phyla. A substantial fraction of bacterial clones showed low levels of similarity with any previously documented sequences and thus might be taxonomically new. Chemical characteristics and phylogenetic inferences indicated that (1) ammonium-utilizing bacteria might form consortia to alleviate or avoid the negative influence of high ammonium concentration on other microorganisms, and (2) members of the Crenarchaeota found in the sediment might be involved in ammonium oxidation. This study is the first to report the composition of the microbial assemblages and phylogenetic characteristics of prokaryotic populations extant in leachate sediment. Additional work on microbial activity and contaminant biodegradation remains to be explored.
Universal artifacts affect the branching of phylogenetic trees, not universal scaling laws.
Altaba, Cristian R
2009-01-01
The superficial resemblance of phylogenetic trees to other branching structures allows searching for macroevolutionary patterns. However, such trees are just statistical inferences of particular historical events. Recent meta-analyses report finding regularities in the branching pattern of phylogenetic trees. But is this supported by evidence, or are such regularities just methodological artifacts? If so, is there any signal in a phylogeny? In order to evaluate the impact of polytomies and imbalance on tree shape, the distribution of all binary and polytomic trees of up to 7 taxa was assessed in tree-shape space. The relationship between the proportion of outgroups and the amount of imbalance introduced with them was assessed applying four different tree-building methods to 100 combinations from a set of 10 ingroup and 9 outgroup species, and performing covariance analyses. The relevance of this analysis was explored taking 61 published phylogenies, based on nucleic acid sequences and involving various taxa, taxonomic levels, and tree-building methods. All methods of phylogenetic inference are quite sensitive to the artifacts introduced by outgroups. However, published phylogenies appear to be subject to a rather effective, albeit rather intuitive control against such artifacts. The data and methods used to build phylogenetic trees are varied, so any meta-analysis is subject to pitfalls due to their uneven intrinsic merits, which translate into artifacts in tree shape. The binary branching pattern is an imposition of methods, and seldom reflects true relationships in intraspecific analyses, yielding artifactual polytomies in short trees. Above the species level, the departure of real trees from simplistic random models is caused at least by two natural factors--uneven speciation and extinction rates; and artifacts such as choice of taxa included in the analysis, and imbalance introduced by outgroups and basal paraphyletic taxa. This artifactual imbalance accounts for tree shape convergence of large trees. There is no evidence for any universal scaling in the tree of life. Instead, there is a need for improved methods of tree analysis that can be used to discriminate the noise due to outgroups from the phylogenetic signal within the taxon of interest, and to evaluate realistic models of evolution, correcting the retrospective perspective and explicitly recognizing extinction as a driving force. Artifacts are pervasive, and can only be overcome through understanding the structure and biological meaning of phylogenetic trees. Catalan Abstract in Translation S1.
2011-01-01
Background We characterized variation and chemical composition of epicuticular hydrocarbons (CHCs) in the seven species of the Drosophila buzzatii cluster with gas chromatography/mass spectrometry. Despite the critical role of CHCs in providing resistance to desiccation and involvement in communication, such as courtship behavior, mating, and aggregation, few studies have investigated how CHC profiles evolve within and between species in a phylogenetic context. We analyzed quantitative differences in CHC profiles in populations of the D. buzzatii species cluster in order to assess the concordance of CHC differentiation with species divergence. Results Thirty-six CHC components were scored in single fly extracts with carbon chain lengths ranging from C29 to C39, including methyl-branched alkanes, n-alkenes, and alkadienes. Multivariate analysis of variance revealed that CHC amounts were significantly different among all species and canonical discriminant function (CDF) analysis resolved all species into distinct, non-overlapping groups. Significant intraspecific variation was found in different populations of D. serido suggesting that this taxon is comprised of at least two species. We summarized CHC variation using CDF analysis and mapped the first five CHC canonical variates (CVs) onto an independently derived period (per) gene + chromosome inversion + mtDNA COI gene for each sex. We found that the COI sequences were not phylogenetically informative due to introgression between some species, so only per + inversion data were used. Positive phylogenetic signal was observed mainly for CV1 when parsimony methods and the test for serial independence (TFSI) were used. These results changed when no outgroup species were included in the analysis and phylogenetic signal was then observed for female CV3 and/or CV4 and male CV4 and CV5. Finally, removal of divergent populations of D. serido significantly increased the amount of phylogenetic signal as up to four out of five CVs then displayed positive phylogenetic signal. Conclusions CHCs were conserved among species while quantitative differences in CHC profiles between populations and species were statistically significant. Most CHCs were species-, population-, and sex-specific. Mapping CHCs onto an independently derived phylogeny revealed that a significant portion of CHC variation was explained by species' systematic affinities indicating phylogenetic conservatism in the evolution of these hydrocarbon arrays, presumptive waterproofing compounds and courtship signals as in many other drosophilid species. PMID:21699713
Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.
Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T
2016-02-24
Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic methods for larger effect sizes. The Silva/UNITE-based ghost tree presented here can be easily integrated into existing fungal analysis pipelines to enhance the resolution of fungal community differences and improve understanding of these communities in built environments. The ghost-tree software package can also be used to develop phylogenetic trees for other marker gene sets that afford different taxonomic resolution, or for bridging genome trees with amplicon trees. ghost-tree is pip-installable. All source code, documentation, and test code are available under the BSD license at https://github.com/JTFouquier/ghost-tree .
Archaeal Diversity in Waters from Deep South African Gold Mines
Takai, Ken; Moser, Duane P.; DeFlaun, Mary; Onstott, Tullis C.; Fredrickson, James K.
2001-01-01
A culture-independent molecular analysis of archaeal communities in waters collected from deep South African gold mines was performed by performing a PCR-mediated terminal restriction fragment length polymorphism (T-RFLP) analysis of rRNA genes (rDNA) in conjunction with a sequencing analysis of archaeal rDNA clone libraries. The water samples used represented various environments, including deep fissure water, mine service water, and water from an overlying dolomite aquifer. T-RFLP analysis revealed that the ribotype distribution of archaea varied with the source of water. The archaeal communities in the deep gold mine environments exhibited great phylogenetic diversity; the majority of the members were most closely related to uncultivated species. Some archaeal rDNA clones obtained from mine service water and dolomite aquifer water samples were most closely related to environmental rDNA clones from surface soil (soil clones) and marine environments (marine group I [MGI]). Other clones exhibited intermediate phylogenetic affiliation between soil clones and MGI in the Crenarchaeota. Fissure water samples, derived from active or dormant geothermal environments, yielded archaeal sequences that exhibited novel phylogeny, including a novel lineage of Euryarchaeota. These results suggest that deep South African gold mines harbor novel archaeal communities distinct from those observed in other environments. Based on the phylogenetic analysis of archaeal strains and rDNA clones, including the newly discovered archaeal rDNA clones, the evolutionary relationship and the phylogenetic organization of the domain Archaea are reevaluated. PMID:11722932
Principal component analysis and the locus of the Fréchet mean in the space of phylogenetic trees.
Nye, Tom M W; Tang, Xiaoxian; Weyenberg, Grady; Yoshida, Ruriko
2017-12-01
Evolutionary relationships are represented by phylogenetic trees, and a phylogenetic analysis of gene sequences typically produces a collection of these trees, one for each gene in the analysis. Analysis of samples of trees is difficult due to the multi-dimensionality of the space of possible trees. In Euclidean spaces, principal component analysis is a popular method of reducing high-dimensional data to a low-dimensional representation that preserves much of the sample's structure. However, the space of all phylogenetic trees on a fixed set of species does not form a Euclidean vector space, and methods adapted to tree space are needed. Previous work introduced the notion of a principal geodesic in this space, analogous to the first principal component. Here we propose a geometric object for tree space similar to the [Formula: see text]th principal component in Euclidean space: the locus of the weighted Fréchet mean of [Formula: see text] vertex trees when the weights vary over the [Formula: see text]-simplex. We establish some basic properties of these objects, in particular showing that they have dimension [Formula: see text], and propose algorithms for projection onto these surfaces and for finding the principal locus associated with a sample of trees. Simulation studies demonstrate that these algorithms perform well, and analyses of two datasets, containing Apicomplexa and African coelacanth genomes respectively, reveal important structure from the second principal components.
Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells
Borel, Christelle; Mudge, Jonathan M.; Howald, Cédric; Foissac, Sylvain; Ucla, Catherine; Chrast, Jacqueline; Ribeca, Paolo; Martin, David; Murray, Ryan R.; Yang, Xinping; Ghamsari, Lila; Lin, Chenwei; Bell, Ian; Dumais, Erica; Drenkow, Jorg; Tress, Michael L.; Gelpí, Josep Lluís; Orozco, Modesto; Valencia, Alfonso; van Berkum, Nynke L.; Lajoie, Bryan R.; Vidal, Marc; Stamatoyannopoulos, John; Batut, Philippe; Dobin, Alex; Harrow, Jennifer; Hubbard, Tim; Dekker, Job; Frankish, Adam; Salehi-Ashtiani, Kourosh; Reymond, Alexandre; Antonarakis, Stylianos E.; Guigó, Roderic; Gingeras, Thomas R.
2012-01-01
The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5′ and 3′ transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network. PMID:22238572
Systems-level feedback regulation of cell cycle transitions in Ostreococcus tauri.
Kapuy, Orsolya; Vinod, P K; Bánhegyi, Gábor; Novák, Béla
2018-05-01
Ostreococcus tauri is the smallest free-living unicellular organism with one copy of each core cell cycle genes in its genome. There is a growing interest in this green algae due to its evolutionary origin. Since O. tauri is diverged early in the green lineage, relatively close to the ancestral eukaryotic cell, it might hold a key phylogenetic position in the eukaryotic tree of life. In this study, we focus on the regulatory network of its cell division cycle. We propose a mathematical modelling framework to integrate the existing knowledge of cell cycle network of O. tauri. We observe that feedback loop regulation of both G1/S and G2/M transitions in O. tauri is conserved, which can make the transition bistable. This is essential to make the transition irreversible as shown in other eukaryotic organisms. By performing sequence analysis, we also predict the presence of the Greatwall/PP2A pathway in the cell cycle of O. tauri. Since O. tauri cell cycle machinery is conserved, the exploration of the dynamical characteristic of the cell division cycle will help in further understanding the regulation of cell cycle in higher eukaryotes. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Network Analysis Reveals Ecological Links between N-Fixing Bacteria and Wood-Decaying Fungi
Hoppe, Björn; Kahl, Tiemo; Karasch, Peter; Wubet, Tesfaye; Bauhus, Jürgen; Buscot, François; Krüger, Dirk
2014-01-01
Nitrogen availability in dead wood is highly restricted and associations with N-fixing bacteria are thought to enable wood-decaying fungi to meet their nitrogen requirements for vegetative and generative growth. We assessed the diversity of nifH (dinitrogenase reductase) genes in dead wood of the common temperate tree species Fagus sylvatica and Picea abies from differently managed forest plots in Germany using molecular tools. By incorporating these genes into a large compilation of published nifH sequences and subsequent phylogenetic analyses of deduced proteins we verified the presence of diverse pools corresponding to functional nifH, almost all of which are new to science. The distribution of nifH genes strongly correlated with tree species and decay class, but not with forest management, while higher fungal fructification was correlated with decreasing nitrogen content of the dead wood and positively correlated with nifH diversity, especially during the intermediate stage of wood decay. Network analyses based on non-random species co-occurrence patterns revealed interactions among fungi and N-fixing bacteria in the dead wood and strongly indicate the occurrence of at least commensal relationships between these taxa. PMID:24505405
Network analysis reveals ecological links between N-fixing bacteria and wood-decaying fungi.
Hoppe, Björn; Kahl, Tiemo; Karasch, Peter; Wubet, Tesfaye; Bauhus, Jürgen; Buscot, François; Krüger, Dirk
2014-01-01
Nitrogen availability in dead wood is highly restricted and associations with N-fixing bacteria are thought to enable wood-decaying fungi to meet their nitrogen requirements for vegetative and generative growth. We assessed the diversity of nifH (dinitrogenase reductase) genes in dead wood of the common temperate tree species Fagus sylvatica and Picea abies from differently managed forest plots in Germany using molecular tools. By incorporating these genes into a large compilation of published nifH sequences and subsequent phylogenetic analyses of deduced proteins we verified the presence of diverse pools corresponding to functional nifH, almost all of which are new to science. The distribution of nifH genes strongly correlated with tree species and decay class, but not with forest management, while higher fungal fructification was correlated with decreasing nitrogen content of the dead wood and positively correlated with nifH diversity, especially during the intermediate stage of wood decay. Network analyses based on non-random species co-occurrence patterns revealed interactions among fungi and N-fixing bacteria in the dead wood and strongly indicate the occurrence of at least commensal relationships between these taxa.
Kweon, Ohgew; Kim, Seong-Jae; Blom, Jochen; Kim, Sung-Kwan; Kim, Bong-Soo; Baek, Dong-Heon; Park, Su Inn; Sutherland, John B; Cerniglia, Carl E
2015-02-14
The bacterial genus Mycobacterium is of great interest in the medical and biotechnological fields. Despite a flood of genome sequencing and functional genomics data, significant gaps in knowledge between genome and phenome seriously hinder efforts toward the treatment of mycobacterial diseases and practical biotechnological applications. In this study, we propose the use of systematic, comparative functional pan-genomic analysis to build connections between genomic dynamics and phenotypic evolution in polycyclic aromatic hydrocarbon (PAH) metabolism in the genus Mycobacterium. Phylogenetic, phenotypic, and genomic information for 27 completely genome-sequenced mycobacteria was systematically integrated to reconstruct a mycobacterial phenotype network (MPN) with a pan-genomic concept at a network level. In the MPN, mycobacterial phenotypes show typical scale-free relationships. PAH degradation is an isolated phenotype with the lowest connection degree, consistent with phylogenetic and environmental isolation of PAH degraders. A series of functional pan-genomic analyses provide conserved and unique types of genomic evidence for strong epistatic and pleiotropic impacts on evolutionary trajectories of the PAH-degrading phenotype. Under strong natural selection, the detailed gene gain/loss patterns from horizontal gene transfer (HGT)/deletion events hypothesize a plausible evolutionary path, an epistasis-based birth and pleiotropy-dependent death, for PAH metabolism in the genus Mycobacterium. This study generated a practical mycobacterial compendium of phenotypic and genomic changes, focusing on the PAH-degrading phenotype, with a pan-genomic perspective of the evolutionary events and the environmental challenges. Our findings suggest that when selection acts on PAH metabolism, only a small fraction of possible trajectories is likely to be observed, owing mainly to a combination of the ambiguous phenotypic effects of PAHs and the corresponding pleiotropy- and epistasis-dependent evolutionary adaptation. Evolutionary constraints on the selection of trajectories, like those seen in PAH-degrading phenotypes, are likely to apply to the evolution of other phenotypes in the genus Mycobacterium.
Urakawa, Hidetoshi; Tajima, Yoshiyuki; Numata, Yoshiyuki; Tsuneda, Satoshi
2008-01-01
The phylogenetic diversity and species richness of ammonia-oxidizing archaea (AOA) and bacteria (AOB) were examined with aquarium biofiltration systems. Species richness, deduced from rarefaction analysis, and diversity indices indicated that the phylogenetic diversity and species richness of AOA are greater than those of AOB; the diversity of AOA and of AOB is minimized in cold-water aquaria. This finding implies that temperature is a key factor influencing the population structure and diversity of AOA and AOB in aquarium biofiltration systems. PMID:18065610
Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis
2016-09-02
Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal and could be useful in guiding the choice of phylogenetic markers. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Enumerating all maximal frequent subtrees in collections of phylogenetic trees
2014-01-01
Background A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474
Enumerating all maximal frequent subtrees in collections of phylogenetic trees.
Deepak, Akshay; Fernández-Baca, David
2014-01-01
A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.
Lai, Qiliang; Liu, Yang; Yuan, Jun; Du, Juan; Wang, Liping; Sun, Fengqin; Shao, Zongze
2014-01-01
Thalassospira bacteria are widespread and have been isolated from various marine environments. Less is known about their genetic diversity and biogeography, as well as their role in marine environments, many of them cannot be discriminated merely using the 16S rRNA gene. To address these issues, in this report, the phylogenetic analysis of 58 strains from seawater and deep sea sediments were carried out using the multilocus sequence analysis (MLSA) based on acsA, aroE, gyrB, mutL, rpoD and trpB genes, and the DNA-DNA hybridization (DDH) and average nucleotide identity (ANI) based on genome sequences. The MLSA analysis demonstrated that the 58 strains were clearly separated into 15 lineages, corresponding to seven validly described species and eight potential novel species. The DDH and ANI values further confirmed the validity of the MLSA analysis and eight potential novel species. The MLSA interspecies gap of the genus Thalassospira was determined to be 96.16-97.12% sequence identity on the basis of the combined analyses of the DDH and MLSA, while the ANIm interspecies gap was 95.76-97.20% based on the in silico DDH analysis. Meanwhile, phylogenetic analyses showed that the Thalassospira bacteria exhibited distribution pattern to a certain degree according to geographic regions. Moreover, they clustered together according to the habitats depth. For short, the phylogenetic analyses and biogeography of the Thalassospira bacteria were systematically investigated for the first time. These results will be helpful to explore further their ecological role and adaptive evolution in marine environments.
Yuan, Jun; Du, Juan; Wang, Liping; Sun, Fengqin; Shao, Zongze
2014-01-01
Thalassospira bacteria are widespread and have been isolated from various marine environments. Less is known about their genetic diversity and biogeography, as well as their role in marine environments, many of them cannot be discriminated merely using the 16S rRNA gene. To address these issues, in this report, the phylogenetic analysis of 58 strains from seawater and deep sea sediments were carried out using the multilocus sequence analysis (MLSA) based on acsA, aroE, gyrB, mutL, rpoD and trpB genes, and the DNA-DNA hybridization (DDH) and average nucleotide identity (ANI) based on genome sequences. The MLSA analysis demonstrated that the 58 strains were clearly separated into 15 lineages, corresponding to seven validly described species and eight potential novel species. The DDH and ANI values further confirmed the validity of the MLSA analysis and eight potential novel species. The MLSA interspecies gap of the genus Thalassospira was determined to be 96.16–97.12% sequence identity on the basis of the combined analyses of the DDH and MLSA, while the ANIm interspecies gap was 95.76–97.20% based on the in silico DDH analysis. Meanwhile, phylogenetic analyses showed that the Thalassospira bacteria exhibited distribution pattern to a certain degree according to geographic regions. Moreover, they clustered together according to the habitats depth. For short, the phylogenetic analyses and biogeography of the Thalassospira bacteria were systematically investigated for the first time. These results will be helpful to explore further their ecological role and adaptive evolution in marine environments. PMID:25198177
Wang, Wei; Xia, Minxuan; Chen, Jie; Deng, Fenni; Yuan, Rui; Zhang, Xiaopei; Shen, Fafu
2016-12-01
The data presented in this paper is supporting the research article "Genome-Wide Analysis of Superoxide Dismutase Gene Family in Gossypium raimondii and G. arboreum" [1]. In this data article, we present phylogenetic tree showing dichotomy with two different clusters of SODs inferred by the Bayesian method of MrBayes (version 3.2.4), "Bayesian phylogenetic inference under mixed models" [2], Ramachandran plots of G. raimondii and G. arboreum SODs, the protein sequence used to generate 3D sructure of proteins and the template accession via SWISS-MODEL server, "SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information." [3] and motif sequences of SODs identified by InterProScan (version 4.8) with the Pfam database, "Pfam: the protein families database" [4].
Villano, Umbertina; Lo Presti, Alessandra; Equestre, Michele; Cella, Eleonora; Pisani, Giulio; Giovanetti, Marta; Bruni, Roberto; Tritarelli, Elena; Amicosante, Massimo; Grifoni, Alba; Scarcella, Carmelo; El-Hamad, Issa; Pezzoli, Maria Chiara; Angeletti, Silvia; Silvia, Angeletti; Ciccaglione, Anna Rita; Ciccozzi, Massimo
2015-07-25
Hepatitis B virus infection (HBV) is widespread and it is considered a major health problem worldwide. The global distribution of HBV varies significantly between countries and between regions of the world. Among the many factors contributing to the changing epidemiology of viral hepatitis, the movement of people within and between countries is a potentially important one. In Italy, the number of migrant individuals has been increasing during the past 25 years. HBV genotype D has been found throughout the world, although its highest prevalence is in the Mediterranean area, the Middle East and southern Asia. We describe the molecular epidemiology of HBV in a chronically infected population of migrants (living in Italy), by using the phylogenetic analysis. HBV-DNA was amplified and sequenced from 43 HBV chronically infected patients. Phylogenetic and evolutionary analysis were performed using both maximum Likelihood and Bayesian methods. Of the 43 HBV S gene isolates from migrants, 25 (58.1 %) were classified as D genotype. Maximum Likelihood analysis showed an intermixing between Moldavian and foreigners sequences mostly respect to Italian ones. Italian sequences clustered mostly together in a main clade separately from all others. The estimation of the time of the tree's root gave a mean value of 17 years ago, suggesting the origin of the tree back to 1992 year. The skyline plot showed that the number of infections softly increased until the early 2005s, after which reached a plateau. Comparing phylogenetic data to the migrants date of arrival in Italy, it should be possible that migrants arrived in Italy yet infected from their country of origin. In conclusion, this is the first paper where phylogenetic analysis and genetic evolution has been used to characterize HBV sub genotypes D1 circulation in a selected and homogenous group of migrants coming from a restricted area of Balkans and to approximately define the period of infection besides the migration date.
Hiras, Jennifer; Wu, Yu-Wei; Eichorst, Stephanie A.; ...
2015-09-01
Recent studies have expanded the phylum Chlorobi, demonstrating that the green sulfur bacteria (GSB), the original cultured representatives of the phylum, are a part of a larger lineage whose members have more diverse metabolic capabilities that overlap with members of the phylum Bacteroidetes. The 16S rRNA gene of an uncultivated clone, OPB56, distantly related to the phyla Chlorobi and Bacteroidetes, was recovered from Obsidian Pool in Yellowstone National Park; however, the detailed phylogeny and function of OPB56 and related clones have remained unknown. Culturing of thermophilic bacterial consortia from compost by adaptation to grow on ionic-liquid pretreated switchgrass provided amore » consortium in which one of the most abundant members, NICIL-2, clustered with OPB56-related clones. Phylogenetic analysis using the full-length 16S rRNA gene from NICIL-2 demonstrated that it was part of a monophyletic clade, referred to as OPB56, distinct from the Bacteroidetes and Chlorobi. A near complete draft genome ( > 95% complete) was recovered from metagenomic data from the culture adapted to grow on ionic-liquid pretreated switchgrass using an automated binning algorithm, and this genome was used for marker gene-based phylogenetic analysis and metabolic reconstruction. Six additional genomes related to NICIL-2 were reconstructed from metagenomic data sets obtained from thermal springs at Yellowstone National Park and Nevada Great Boiling Spring. In contrast to the 16S rRNA gene phylogenetic analysis, protein phylogenetic analysis was most consistent with the clustering of the Chlorobea, Ignavibacteria and OPB56 into a single phylum level clade. Metabolic reconstruction of NICIL-2 demonstrated a close linkage with the class Ignavibacteria and the family Rhodothermaceae, a deeply branching Bacteroidetes lineage. The combined phylogenetic and functional analysis of the NICIL-2 genome has refined the membership in the phylum Chlorobi and emphasized the close evolutionary and metabolic relationship between the phyla Chlorobi and the Bacteroidetes.« less
Hiras, Jennifer; Wu, Yu-Wei; Eichorst, Stephanie A; Simmons, Blake A; Singer, Steven W
2016-04-01
Recent studies have expanded the phylum Chlorobi, demonstrating that the green sulfur bacteria (GSB), the original cultured representatives of the phylum, are a part of a broader lineage whose members have more diverse metabolic capabilities that overlap with members of the phylum Bacteroidetes. The 16S rRNA gene of an uncultivated clone, OPB56, distantly related to the phyla Chlorobi and Bacteroidetes, was recovered from Obsidian Pool in Yellowstone National Park; however, the detailed phylogeny and function of OPB56 and related clones have remained unknown. Culturing of thermophilic bacterial consortia from compost by adaptation to grow on ionic-liquid pretreated switchgrass provided a consortium in which one of the most abundant members, NICIL-2, clustered with OPB56-related clones. Phylogenetic analysis using the full-length 16S rRNA gene from NICIL-2 demonstrated that it was part of a monophyletic clade, referred to as OPB56, distinct from the Bacteroidetes and Chlorobi. A near complete draft genome (>95% complete) was recovered from metagenomic data from the culture adapted to grow on ionic-liquid pretreated switchgrass using an automated binning algorithm, and this genome was used for marker gene-based phylogenetic analysis and metabolic reconstruction. Six additional genomes related to NICIL-2 were reconstructed from metagenomic data sets obtained from thermal springs at Yellowstone National Park and Nevada Great Boiling Spring. In contrast to the 16S rRNA gene phylogenetic analysis, protein phylogenetic analysis was most consistent with the clustering of the Chlorobea, Ignavibacteria and OPB56 into a single phylum level clade. Metabolic reconstruction of NICIL-2 demonstrated a close linkage with the class Ignavibacteria and the family Rhodothermaceae, a deeply branching Bacteroidetes lineage. The combined phylogenetic and functional analysis of the NICIL-2 genome has refined the membership in the phylum Chlorobi and emphasized the close evolutionary and metabolic relationship between the phyla Chlorobi and the Bacteroidetes.
Transmission of HIV in sexual networks in sub-Saharan Africa and Europe
NASA Astrophysics Data System (ADS)
van de Vijver, David A. M. C.; Prosperi, Mattia C. F.; Ramasco, José J.
2013-09-01
We are reviewing the literature regarding sexual networks and HIV transmission in sub-Saharan Africa and Europe. On Likoma Island in Malawi, a sexual network was reconstructed using a sociometric survey in which individuals named their sexual partners. The sexual network identified one giant component including half of all sexually active individuals. More than 25% of respondents were linked through independent chains of sexual relations. HIV was more common in the sparser regions of the network due to over-representation of groups with higher HIV prevalence. A study from KwaZulu-Natal in South-Africa collected egocentric data about sexual partners and found that new infections in women in a particular area was associated with the number of life-time partners in men. Data about sexual networks and HIV transmission are not reported in Europe. It is, however, found that the annual number of sexual partners follows a scale-free network. Phylogenetic studies that determine genetic relatedness between HIV isolates obtained from infected individuals, found that patients in the early stages of infections explain a high number of new infections. In conclusion, the limited information that is available suggest that sexual networks play a role in spread of HIV. Obtaining more information about sexual networks can be of benefit for modeling studies on HIV transmission and prevention.
Molecular phylogeography of the Andean alpine plant, Gunnera magellanica
NASA Astrophysics Data System (ADS)
Shimizu, M.; Fujii, N.; Ito, M.; Asakawa, T.; Nishida, H.; Suyama, C.; Ueda, K.
2015-12-01
To clarify the evolutionary history of Gunnera magellanica (Gunneraceae), an alpine plant of the Andes mountains, we performed molecular phylogeographic analyses based on the sequences of an internal transcribed spacer (ITS) of nuclear ribosomal DNA and four non-coding regions (trnH-psbA, trnL-trnF, atpB-rbcL, rpl16 intron) of chloroplast DNA. We investigated 3, 4, 4 and 11 populations in, Ecuador, Bolivia, Argentina, and Chile, respectively, and detected six ITS genotypes (Types A-F) in G. magellanica. Five genotypes (Types A-E) were observed in the northern Andes population (Ecuador and Bolivia); only one ITS genotype (Type F) was observed in the southern Andes population (Chile and Argentina). Phylogenetic analyses showed that the ITS genotypes of the northern and southern Andes populations form different clades with high bootstrap probability. Furthermore, network analysis, analysis of molecular variance, and spatial analysis of molecular variance showed that there were two major clusters (the northern and southern Andes populations) in this species. Furthermore, in chloroplast DNA analysis, three major clades (northern Andes, Chillan, and southern Andes) were inferred from phylogenetic analyses using four non-coding regions, a finding that was supported by the above three types of analysis. The Chillan clade is the northernmost population in the southern Andes populations. With the exception of the Chillan clade (Chillan population), results of nuclear DNA and chloroplast DNA analyses were consistent. Both markers showed that the northern and southern Andes populations of G. magellanica were genetically different from each other. This type of clear phylogeographical structure was supported by PERMUT analysis according to Pons & Petit (1995, 1996). Moreover, based on our preliminary estimation that is based on the ITS sequences, the northern and southern Andes clades diverged ~0.63-3 million years ago, during a period of upheaval in the Andes. This suggests that the populations of G. magellanica that were distributed along the Andes have been divided into the two local populations of the northern and southern Andes during the uplift of the Andes.
Phylogenetic Analysis of Klebsiella pneumoniae from Hospitalized Children, Pakistan.
Ejaz, Hasan; Wang, Nancy; Wilksch, Jonathan J; Page, Andrew J; Cao, Hanwei; Gujaran, Shruti; Keane, Jacqueline A; Lithgow, Trevor; Ul-Haq, Ikram; Dougan, Gordon; Strugnell, Richard A; Heinz, Eva
2017-11-01
Klebsiella pneumoniae shows increasing emergence of multidrug-resistant lineages, including strains resistant to all available antimicrobial drugs. We conducted whole-genome sequencing of 178 highly drug-resistant isolates from a tertiary hospital in Lahore, Pakistan. Phylogenetic analyses to place these isolates into global context demonstrate the expansion of multiple independent lineages, including K. quasipneumoniae.
USDA-ARS?s Scientific Manuscript database
Technical Abstract Here we present a dated phylogenetic tree of the neotropical palm genus Attalea (Arecaceae). We used six orthologs from the nuclear WRKY gene family across 98 accessions to address relationships among species and biogeographic hypotheses. Here we found that the formerly recognized...
Mitochondrial DNA haplogroup phylogeny of the dog: Proposal for a cladistic nomenclature.
Fregel, Rosa; Suárez, Nicolás M; Betancor, Eva; González, Ana M; Cabrera, Vicente M; Pestano, José
2015-05-01
Canis lupus familiaris mitochondrial DNA analysis has increased in recent years, not only for the purpose of deciphering dog domestication but also for forensic genetic studies or breed characterization. The resultant accumulation of data has increased the need for a normalized and phylogenetic-based nomenclature like those provided for human maternal lineages. Although a standardized classification has been proposed, haplotype names within clades have been assigned gradually without considering the evolutionary history of dog mtDNA. Moreover, this classification is based only on the D-loop region, proven to be insufficient for phylogenetic purposes due to its high number of recurrent mutations and the lack of relevant information present in the coding region. In this study, we design 1) a refined mtDNA cladistic nomenclature from a phylogenetic tree based on complete sequences, classifying dog maternal lineages into haplogroups defined by specific diagnostic mutations, and 2) a coding region SNP analysis that allows a more accurate classification into haplogroups when combined with D-loop sequencing, thus improving the phylogenetic information obtained in dog mitochondrial DNA studies. Copyright © 2015 Elsevier B.V. All rights reserved.
Iiyama, Kazuhiro; Otao, Masahiro; Mori, Kazuki; Mon, Hiroaki; Lee, Jae Man; Kusakabe, Takahiro; Tashiro, Kousuke; Asano, Shin-Ichiro; Yasunaga-Aoki, Chisa
2014-01-01
To determine the phylogenetic relationship among Paenibacillus species, putative replication origin regions were compared. In the rsmG-gyrA region, gene arrangements in Paenibacillus species were identical to those of Bacillus species, with the exception of an open reading frame (orf14) positioned between gyrB and gyrA, which was observed only in Paenibacillus species. The orf14 product was homologous to the endospore-associated proteins YheC and YheD of Bacillus subtilis. Phylogenetic analysis based on the YheCD proteins suggested that Orf14 could be categorized into the YheC group. In the Paenibacillus genome, DnaA box clusters were found in rpmH-dnaA and dnaA-dnaN intergenic regions, known as box regions C and R, respectively; this localization was similar to that observed in B. halodurans. A phylogenetic tree based on the nucleotide sequences of the whole replication origin regions suggested that P. popilliae, P. thiaminolyticus, and P. dendritiformis are closely related species.
Phylogenetic analysis reveals a scattered distribution of autumn colours
Archetti, Marco
2009-01-01
Background and Aims Leaf colour in autumn is rarely considered informative for taxonomy, but there is now growing interest in the evolution of autumn colours and different hypotheses are debated. Research efforts are hindered by the lack of basic information: the phylogenetic distribution of autumn colours. It is not known when and how autumn colours evolved. Methods Data are reported on the autumn colours of 2368 tree species belonging to 400 genera of the temperate regions of the world, and an analysis is made of their phylogenetic relationships in order to reconstruct the evolutionary origin of red and yellow in autumn leaves. Key Results Red autumn colours are present in at least 290 species (70 genera), and evolved independently at least 25 times. Yellow is present independently from red in at least 378 species (97 genera) and evolved at least 28 times. Conclusions The phylogenetic reconstruction suggests that autumn colours have been acquired and lost many times during evolution. This scattered distribution could be explained by hypotheses involving some kind of coevolutionary interaction or by hypotheses that rely on the need for photoprotection. PMID:19126636
Diversification of land plants: insights from a family-level phylogenetic analysis.
Fiz-Palacios, Omar; Schneider, Harald; Heinrichs, Jochen; Savolainen, Vincent
2011-11-21
Some of the evolutionary history of land plants has been documented based on the fossil record and a few broad-scale phylogenetic analyses, especially focusing on angiosperms and ferns. Here, we reconstructed phylogenetic relationships among all 706 families of land plants using molecular data. We dated the phylogeny using multiple fossils and a molecular clock technique. Applying various tests of diversification that take into account topology, branch length, numbers of extant species as well as extinction, we evaluated diversification rates through time. We also compared these diversification profiles against the distribution of the climate modes of the Phanerozoic. We found evidence for the radiations of ferns and mosses in the shadow of angiosperms coinciding with the rather warm Cretaceous global climate. In contrast, gymnosperms and liverworts show a signature of declining diversification rates during geological time periods of cool global climate. This broad-scale phylogenetic analysis helps to reveal the successive waves of diversification that made up the diversity of land plants we see today. Both warm temperatures and wet climate may have been necessary for the rise of the diversity under a successive lineage replacement scenario.
Takeo, Toshinori; Tanaka, Tetsuya; Matsubayashi, Makoto; Maeda, Hiroki; Kusakisako, Kodai; Matsui, Toshihiro; Mochizuki, Masami; Matsuo, Tomohide
2014-08-01
Previously, we characterized an undocumented strain of Eimeria krijgsmanni by morphological and biological features. Here, we present a detailed molecular phylogenetic analysis of this organism. Namely, 18S ribosomal RNA gene (rDNA) sequences of E. krijgsmanni were analyzed to incorporate this species into a comprehensive Eimeria phylogeny. As a result, partial 18S rDNA sequence from E. krijgsmanni was successfully determined, and two different types, Type A and Type B, that differed by 1 base pair were identified. E. krijgsmanni was originally isolated from a single oocyst, and thus the result show that the two types might have allelic sequence heterogeneity in the 18S rDNA. Based on phylogenetic analyses, the two types of E. krijgsmanni 18S rDNA formed one of two clades among murine Eimeria spp.; these Eimeria clades reflected morphological similarity among the Eimeria spp. This is the third molecular phylogenetic characterization of a murine Eimeria spp. in addition to E. falciformis and E. papillata. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
WANG, ZHANG-YANG; HONG, WEI-LONG; ZHU, ZHE-HUI; CHEN, YUN-HAO; YE, WEN-LE; CHU, GUANG-YU; LI, JIA-LIN; CHEN, BI-CHENG; XIA, PENG
2015-01-01
BK polyomavirus (BKV) is important pathogen for kidney transplant recipients, as it is frequently re-activated, leading to nephropathy. The aim of this study was to investigate the phylogenetic reconstruction and polymorphism of the VP2 gene in BKV isolated from Chinese kidney transplant recipients. Phylogenetic analysis was carried out in the VP2 region from 135 BKV-positive samples and 28 reference strains retrieved from GenBank. The unweighted pair-group method with arithmetic mean (UPGMA) grouped all strains into subtypes, but failed to subdivide strains into subgroups. Among the plasma and urine samples, all plasma (23/23) and 82 urine samples (82/95) were identified to contain subtype I; the other 10 urine samples contained subtype IV. A 86-bp fragment was identified as a highly conserved sequence. Following alignment with 36 published BKV sequences from China, 92 sites of polymorphism were identified, including 11 single nucleotide polymorphisms (SNPs) prevalent in Chinese individuals and 30 SNPs that were specific to the two predominant subtypes I and IV. The limitations of the VP2 gene segment in subgrouping were confirmed by phylogenetic analysis. The conserved sequence and polymorphism identified in this study may be helpful in the detection and genotyping of BKV. PMID:26640547
Pereira, Edson H L; Reis, Roberto E
2017-05-11
A phylogenetic study of the Loricariidae with emphasis on the Neoplecostominae is presented based on a maximum parsimony analysis of 268 phenotypic characters encompassing osteology, arthrology, and external morphology. Results support previous hypotheses of the monophyly of the Neoplecostominae and each of the included genera: Hirtella, Isbrueckerichthys, Kronichthys, Neoplecostomus, Pareiorhaphis, and Pareiorhina. In addition, previously undiscovered diversity was revealed within the subfamily as an additional genus-level taxon, herein described as Euryochus. Relationships among neoplecostomine genera are: (Kronichthys (Euryochus ((Hirtella + Pareiorhaphis) (Pareiorhina (Isbrueckerichthys + Neoplecostomus))))). Additional undescribed diversity was also detected among most neoplecostomine genera and the Hypoptopomatinae. In addition, recently discovered genera Nannoplecostomus and Microplecostomus were included in the analysis, and were identified as sequential sister-taxa to Neoplecostominae + Hypoptopomatinae, which are currently not included in any subfamily and regarded as incertae sedis in Loricariidae. The three species of Lithogenes were included in an encompassing phylogenetic analysis for the first time, and were identified as a monophyletic unit and sister group to all remaining loricariids. The other loricariid subfamilies were also corroborated as monophyletic, and presented the following interrelationships (Lithogeninae (Delturinae (Loricariinae (Hypostominae (Nannoplecostomus (Microplecostomus (Hypoptopomatinae + Neoplecostominae). The Neoplecostominae and its genera are phylogenetically diagnosed, and hypothesized relationships are compared to those of previous morphological and molecular phylogenetic studies.
Chaouch, Melek; Fathallah-Mili, Akila; Driss, Mehdi; Lahmadi, Ramzi; Ayari, Chiraz; Guizani, Ikram; Ben Said, Moncef; Benabderrazak, Souha
2013-03-01
Discrimination of the Old World Leishmania parasites is important for diagnosis and epidemiological studies of leishmaniasis. We have developed PCR assays that allow the discrimination between Leishmania major, Leishmania tropica and Leishmania infantum Tunisian species. The identification was performed by a simple PCR targeting cysteine protease B (cpb) gene copies. These PCR can be a routine molecular biology tools for discrimination of Leishmania spp. from different geographical origins and different clinical forms. Our assays can be an informative source for cpb gene studying concerning drug, diagnostics and vaccine research. The PCR products of the cpb gene and the N-acetylglucosamine-1-phosphate transferase (nagt) Leishmania gene were sequenced and aligned. Phylogenetic trees of Leishmania based cpb and nagt sequences are close in topology and present the classic distribution of Leishmania in the Old World. The phylogenetic analysis has enabled the characterization and identification of different strains, using both multicopy (cpb) and single copy (nagt) genes. Indeed, the cpb phylogenetic analysis allowed us to identify the Tunisian Leishmania killicki species, and a group which gathers the least evolved isolates of the Leishmania donovani complex, that was originated from East Africa. This clustering confirms the African origin for the visceralizing species of the L. donovani complex. Copyright © 2012 Elsevier B.V. All rights reserved.
Pan, Ting Shuang; Nie, Pin
2013-07-01
Acanthocephalans are a small group of obligate endoparasites. They and rotifers are recently placed in a group called Syndermata. However, phylogenetic relationships within classes of acanthocephalans, and between them and rotifers, have not been well resolved, possibly due to the lack of molecular data suitable for such analysis. In this study, the mitochondrial (mt) genome was sequenced from Pallisentis celatus (Van Cleave, 1928), an acanthocephalan in the class Eoacanthocephala, an intestinal parasite of rice-field eel, Monopterus albus (Zuiew, 1793), in China. The complete mt genome sequence of P. celatus is 13 855 bp long, containing 36 genes including 12 protein-coding genes, 22 transfer RNAs (tRNAs) and 2 ribosomal RNAs (rRNAs) as reported for other acanthocephalan species. All genes are encoded on the same strand and in the same direction. Phylogenetic analysis indicated that acanthocephalans are closely related with a clade containing bdelloids, which then correlates with the clade containing monogononts. The class Eoacanthocephala, containing P. celatus and Paratenuisentis ambiguus (Van Cleave, 1921) was closely related to the Palaeacanthocephala. It is thus indicated that acanthocephalans may be just clustered among groups of rotifers. However, the resolving of phylogenetic relationship among all classes of acanthocephalans and between them and rotifers may require further sampling and more molecular data.
Singh, Jitendra P; Singh, Ak; Bajpai, Anju; Ahmad, Iffat Zareen
2014-01-01
The Indian black berry (Syzygium cumini Skeels) has a great nutraceutical and medicinal properties. As in other fruit crops, the fruit characteristics are important attributes for differentiation were also determined for different accessions of S. cumini. The fruit weight, length, breadth, length: breadth ratio, pulp weight, pulp content, seed weight and pulp: seed ratio significantly varied in different accessions. Molecular characterization was carried out using PCR based RAPD technique. Out of 80 RAPD primers, only 18 primers produced stable polymorphisms that were used to examine the phylogenetic relationship. A sum of 207 loci were generated out of which 201 loci found polymorphic. The average genetic dissimilarity was 97 per cent among jamun accessions. The phylogenetic relationship was also determined by principal coordinates analysis (PCoA) that explained 46.95 per cent cumulative variance. The two-dimensional PCoA analysis showed grouping of the different accessions that were plotted into four sub-plots, representing clustering of accessions. The UPGMA (r = 0.967) and NJ (r = 0.987) dendrogram constructed based on the dissimilarity matrix revealed a good degree of fit with the cophenetic correlation value. The dendrogram grouped the accessions into three main clusters according to their eco-geographical regions which given useful insight into their phylogenetic relationships.
Acremonium phylogenetic overview and revision of Gliomastix, Sarocladium, and Trichothecium
Summerbell, R.C.; Gueidan, C.; Schroers, H-J.; de Hoog, G.S.; Starink, M.; Rosete, Y. Arocha; Guarro, J.; Scott, J.A.
2011-01-01
Over 200 new sequences are generated for members of the genus Acremonium and related taxa including ribosomal small subunit sequences (SSU) for phylogenetic analysis and large subunit (LSU) sequences for phylogeny and DNA-based identification. Phylogenetic analysis reveals that within the Hypocreales, there are two major clusters containing multiple Acremonium species. One clade contains Acremonium sclerotigenum, the genus Emericellopsis, and the genus Geosmithia as prominent elements. The second clade contains the genera Gliomastix sensu stricto and Bionectria. In addition, there are numerous smaller clades plus two multi-species clades, one containing Acremonium strictum and the type species of the genus Sarocladium, and, as seen in the combined SSU/LSU analysis, one associated subclade containing Acremonium breve and related species plus Acremonium curvulum and related species. This sequence information allows the revision of three genera. Gliomastix is revived for five species, G. murorum, G. polychroma, G. tumulicola, G. roseogrisea, and G. masseei. Sarocladium is extended to include all members of the phylogenetically distinct A. strictum clade including the medically important A. kiliense and the protective maize endophyte A. zeae. Also included in Sarocladium are members of the phylogenetically delimited Acremonium bacillisporum clade, closely linked to the A. strictum clade. The genus Trichothecium is revised following the principles of unitary nomenclature based on the oldest valid anamorph or teleomorph name, and new combinations are made in Trichothecium for the tightly interrelated Acremonium crotocinigenum, Spicellum roseum, and teleomorph Leucosphaerina indica. Outside the Hypocreales, numerous Acremonium-like species fall into the Plectosphaerellaceae, and A. atrogriseum falls into the Cephalothecaceae. PMID:21523192
Marinospirillum insulare sp. nov., a novel halophilic helical bacterium isolated from kusaya gravy.
Satomi, M; Kimura, B; Hayashi, M; Okuzumi, M; Fujii, T
2004-01-01
A novel species that belongs to the genus Marinospirillum is described on the basis of phenotypic characteristics, phylogenetic analysis of 16S rRNA and gyrB gene sequences and DNA-DNA hybridization. Four strains of helical, halophilic, Gram-negative, heterotrophic bacteria were isolated from kusaya gravy, which is fermented brine that is used for the production of traditional dried fish in the Izu Islands of Japan. All of the new isolates were motile by means of bipolar tuft flagella, of small cell size, coccoid-body-forming and aerophilic; it was concluded that they belong to the same bacterial species, based on DNA-DNA hybridization values (>70% DNA relatedness). DNA G+C contents of the new strains were 42-43 mol% and they had isoprenoid quinone Q-8 as the major component. Phylogenetic analysis of 16S rRNA gene sequences indicated that the new isolates were members of the genus Marinospirillum; sequence similarity of the new isolates to Marinospirillum minutulum, Marinospirillum megaterium and Marinospirillum alkaliphilum was 98.5, 98.2 and 95.2%, respectively. Phylogenetic analysis based on the gyrB gene indicated that the new isolates had enough phylogenetic distance from M. minutulum and M. megaterium to be regarded as different species, with 84.7 and 78.7% sequence similarity, respectively. DNA-DNA hybridization showed that the new isolates had <36% DNA relatedness to M. minutulum and M. megaterium, supporting the phylogenetic conclusion. Thus, a novel species is proposed: Marinospirillum insulare sp. nov. (type strain, KT=LMG 21802T=NBRC 100033T).
MÜller, Rodrigo Temp; Langer, Max Cardoso; Dias-da-Silva, SÉrgio
2018-03-07
Despite representing a key-taxon in dinosauromorph phylogeny, Lagerpertidae is one of the most obscure and enigmatic branches from the stem that leads to the dinosaurs. Recent new findings have greatly increased our knowledge about lagerpetids, but no phylogenetic analysis has so far included all known members of this group. Here, we present the most inclusive phylogenetic study so far conducted for Lagerpetidae. Phylogenetic analyses were performed based on three independent data matrixes. In two of them, Lagerpeton chanarensis Romer, 1971 is the sister taxon to all other known Lagerpetidae, whereas Ixalerpeton polesinensis Cabreira et al., 2016 is in a sister group relationship with a clade that includes PVSJ 883 and Dromomeron. Conversely, the other analysis supports an alternative topology, where I. polesinensis is the sister taxon to either L. chanarensis or all other Lagerpetidae. Although coeval and geographically close, I. polesinensis and PVSJ 883 do not form a clade exclusive of other lagerpetids. As previously suggested D. gigas Martínez, Apaldetti, Correa Abelín, 2016 is the sister taxon of D. romeri Irmis et al., 2007. The phylogenetic analyses also indicate that the earliest lagerpetids are restricted to southwestern Pangea, whereas later forms spread across the entire western portion of the supercontinent. Finally, quantification of the codified characters of our analysis reveals that Lagerpetidae is one of the poorest known among the Triassic dinosauromorph groups in terms of their anatomy, so that new discoveries of more complete specimens are awaited to establish a more robust phylogeny.
High-confidence prediction of global interactomes based on genome-wide coevolutionary networks
Juan, David; Pazos, Florencio; Valencia, Alfonso
2008-01-01
Interacting or functionally related protein families tend to have similar phylogenetic trees. Based on this observation, techniques have been developed to predict interaction partners. The observed degree of similarity between the phylogenetic trees of two proteins is the result of many different factors besides the actual interaction or functional relationship between them. Such factors influence the performance of interaction predictions. One aspect that can influence this similarity is related to the fact that a given protein interacts with many others, and hence it must adapt to all of them. Accordingly, the interaction or coadaptation signal within its tree is a composite of the influence of all of the interactors. Here, we introduce a new estimator of coevolution to overcome this and other problems. Instead of relying on the individual value of tree similarity between two proteins, we use the whole network of similarities between all of the pairs of proteins within a genome to reassess the similarity of that pair, thereby taking into account its coevolutionary context. We show that this approach offers a substantial improvement in interaction prediction performance, providing a degree of accuracy/coverage comparable with, or in some cases better than, that of experimental techniques. Moreover, important information on the structure, function, and evolution of macromolecular complexes can be inferred with this methodology. PMID:18199838
High-confidence prediction of global interactomes based on genome-wide coevolutionary networks.
Juan, David; Pazos, Florencio; Valencia, Alfonso
2008-01-22
Interacting or functionally related protein families tend to have similar phylogenetic trees. Based on this observation, techniques have been developed to predict interaction partners. The observed degree of similarity between the phylogenetic trees of two proteins is the result of many different factors besides the actual interaction or functional relationship between them. Such factors influence the performance of interaction predictions. One aspect that can influence this similarity is related to the fact that a given protein interacts with many others, and hence it must adapt to all of them. Accordingly, the interaction or coadaptation signal within its tree is a composite of the influence of all of the interactors. Here, we introduce a new estimator of coevolution to overcome this and other problems. Instead of relying on the individual value of tree similarity between two proteins, we use the whole network of similarities between all of the pairs of proteins within a genome to reassess the similarity of that pair, thereby taking into account its coevolutionary context. We show that this approach offers a substantial improvement in interaction prediction performance, providing a degree of accuracy/coverage comparable with, or in some cases better than, that of experimental techniques. Moreover, important information on the structure, function, and evolution of macromolecular complexes can be inferred with this methodology.
Mass extinctions drove increased global faunal cosmopolitanism on the supercontinent Pangaea.
Button, David J; Lloyd, Graeme T; Ezcurra, Martín D; Butler, Richard J
2017-10-10
Mass extinctions have profoundly impacted the evolution of life through not only reducing taxonomic diversity but also reshaping ecosystems and biogeographic patterns. In particular, they are considered to have driven increased biogeographic cosmopolitanism, but quantitative tests of this hypothesis are rare and have not explicitly incorporated information on evolutionary relationships. Here we quantify faunal cosmopolitanism using a phylogenetic network approach for 891 terrestrial vertebrate species spanning the late Permian through Early Jurassic. This key interval witnessed the Permian-Triassic and Triassic-Jurassic mass extinctions, the onset of fragmentation of the supercontinent Pangaea, and the origins of dinosaurs and many modern vertebrate groups. Our results recover significant increases in global faunal cosmopolitanism following both mass extinctions, driven mainly by new, widespread taxa, leading to homogenous 'disaster faunas'. Cosmopolitanism subsequently declines in post-recovery communities. These shared patterns in both biotic crises suggest that mass extinctions have predictable influences on animal distribution and may shed light on biodiversity loss in extant ecosystems.Mass extinctions are thought to produce 'disaster faunas', communities dominated by a small number of widespread species. Here, Button et al. develop a phylogenetic network approach to test this hypothesis and find that mass extinctions did increase faunal cosmopolitanism across Pangaea during the late Palaeozoic and early Mesozoic.
HIV Type 1 Transmission Networks Among Men Having Sex with Men and Heterosexuals in Kenya
Faria, Nuno Rodrigues; Hassan, Amin; Hamers, Raph L.; Mutua, Gaudensia; Anzala, Omu; Mandaliya, Kishor; Cane, Patricia; Berkley, James A.; Rinke de Wit, Tobias F.; Wallis, Carole; Graham, Susan M.; Price, Matthew A.; Coutinho, Roel A.; Sanders, Eduard J.
2014-01-01
Abstract We performed a molecular phylogenetic study on HIV-1 polymerase sequences of men who have sex with men (MSM) and heterosexual patient samples in Kenya to characterize any observed HIV-1 transmission networks. HIV-1 polymerase sequences were obtained from samples in Nairobi and coastal Kenya from 84 MSM, 226 other men, and 364 women from 2005 to 2010. Using Bayesian phylogenetics, we tested whether sequences clustered by sexual orientation and geographic location. In addition, we used trait diffusion analyses to identify significant epidemiological links and to quantify the number of transmissions between risk groups. Finally, we compared 84 MSM sequences with all HIV-1 sequences available online at GenBank. Significant clustering of sequences from MSM at both coastal Kenya and Nairobi was found, with evidence of HIV-1 transmission between both locations. Although a transmission pair between a coastal MSM and woman was confirmed, no significant HIV-1 transmission was evident between MSM and the comparison population for the predominant subtype A (60%). However, a weak but significant link was evident when studying all subtypes together. GenBank comparison did not reveal other important transmission links. Our data suggest infrequent intermingling of MSM and heterosexual HIV-1 epidemics in Kenya. PMID:23947948
Amendola, Antonella; Bianchi, Silvia; Frati, Elena R; Ciceri, Giulia; Faccini, Marino; Senatore, Sabrina; Colzani, Daniela; Lamberti, Anna; Baggieri, Melissa; Cereda, Danilo; Gramegna, Maria; Nicoletti, Loredana; Magurano, Fabio; Tanzi, Elisabetta
2017-08-17
A large measles outbreak has been ongoing in Milan and surrounding areas. From 1 March to 30 June 2017, 203 measles cases were laboratory-confirmed (108 sporadic cases and 95 related to 47 clusters). Phylogenetic analysis revealed the co-circulation of two different genotypes, D8 and B3. Both genotypes caused nosocomial clusters in two hospitals. The rapid analysis of epidemiological and phylogenetic data allowed effective surveillance and tracking of transmission pathways. This article is copyright of The Authors, 2017.
Amendola, Antonella; Bianchi, Silvia; Frati, Elena R; Ciceri, Giulia; Faccini, Marino; Senatore, Sabrina; Colzani, Daniela; Lamberti, Anna; Baggieri, Melissa; Cereda, Danilo; Gramegna, Maria; Nicoletti, Loredana; Magurano, Fabio; Tanzi, Elisabetta
2017-01-01
A large measles outbreak has been ongoing in Milan and surrounding areas. From 1 March to 30 June 2017, 203 measles cases were laboratory-confirmed (108 sporadic cases and 95 related to 47 clusters). Phylogenetic analysis revealed the co-circulation of two different genotypes, D8 and B3. Both genotypes caused nosocomial clusters in two hospitals. The rapid analysis of epidemiological and phylogenetic data allowed effective surveillance and tracking of transmission pathways. PMID:28840825
Yuan, Dongmei; Qin, Hanxiao; Zhang, Jianguo; Liao, Lin; Chen, Qiwei; Chen, Dali; Chen, Jianping
2017-02-01
Leishmaniasis is a worldwide epidemic disease caused by the genus Leishmania, which is still endemic in the west and northwest areas of China. Some viewpoints of the traditional taxonomy of Chinese Leishmania have been challenged by recent phylogenetic researches based on different molecular markers. However, the taxonomic positions and phylogenetic relationships of Chinese Leishmania isolates remain controversial, which need for more data and further analysis. In this study, the heat shock protein 70 (HSP70) gene and cytochrome b (cyt b) gene were used for phylogenetic analysis of Chinese Leishmania isolates from patients, dogs, gerbils, and sand flies in different geographic origins. Besides, for the interesting Leishmania sp. in China, the ultrastructure of three Chinese Leishmania sp. strains (MHOM/CN/90/SC10H2, SD, GL) were observed by transmission electron microscopy. Bayesian trees from HSP70 and cyt b congruently indicated that the 14 Chinese Leishmania isolates belong to three Leishmania species including L. donovani complex, L. gerbilli, and L. (Sauroleishmania) sp. Their identity further confirmed that the undescribed Leishmania species causing visceral Leishmaniasis (VL) in China is closely related to L. tarentolae. The phylogenetic results from HSP70 also suggested the classification of subspecies within L. donovani complex: KXG-918, KXG-927, KXG-Liu, KXG-Xu, 9044, SC6, and KXG-65 belong to L. donovani; Cy, WenChuan, and 801 were proposed to be L. infantum. Through transmission electron microscopy, unexpectedly, the Golgi apparatus were not observed in SC10H2, SD, and GL, which was similar to previous reports of reptilian Leishmania. The statistical analysis of microtubule counts separated SC10H2, SD, and GL as one group from any other reference strain (L. donovani MHOM/IN/80/DD8; L. tropica MHOM/SU/74/K27; L. gerbilli MRHO/CN/60/GERBILLI). The ultrastructural characteristics of Leishmania sp. partly lend support to the phylogenetic inference that Chinese Leishmania sp. is in close relationship with reptilian Leishmania.
Smith, Stephen A; Moore, Michael J; Brown, Joseph W; Yang, Ya
2015-08-05
The use of transcriptomic and genomic datasets for phylogenetic reconstruction has become increasingly common as researchers attempt to resolve recalcitrant nodes with increasing amounts of data. The large size and complexity of these datasets introduce significant phylogenetic noise and conflict into subsequent analyses. The sources of conflict may include hybridization, incomplete lineage sorting, or horizontal gene transfer, and may vary across the phylogeny. For phylogenetic analysis, this noise and conflict has been accommodated in one of several ways: by binning gene regions into subsets to isolate consistent phylogenetic signal; by using gene-tree methods for reconstruction, where conflict is presumed to be explained by incomplete lineage sorting (ILS); or through concatenation, where noise is presumed to be the dominant source of conflict. The results provided herein emphasize that analysis of individual homologous gene regions can greatly improve our understanding of the underlying conflict within these datasets. Here we examined two published transcriptomic datasets, the angiosperm group Caryophyllales and the aculeate Hymenoptera, for the presence of conflict, concordance, and gene duplications in individual homologs across the phylogeny. We found significant conflict throughout the phylogeny in both datasets and in particular along the backbone. While some nodes in each phylogeny showed patterns of conflict similar to what might be expected with ILS alone, the backbone nodes also exhibited low levels of phylogenetic signal. In addition, certain nodes, especially in the Caryophyllales, had highly elevated levels of strongly supported conflict that cannot be explained by ILS alone. This study demonstrates that phylogenetic signal is highly variable in phylogenomic data sampled across related species and poses challenges when conducting species tree analyses on large genomic and transcriptomic datasets. Further insight into the conflict and processes underlying these complex datasets is necessary to improve and develop adequate models for sequence analysis and downstream applications. To aid this effort, we developed the open source software phyparts ( https://bitbucket.org/blackrim/phyparts ), which calculates unique, conflicting, and concordant bipartitions, maps gene duplications, and outputs summary statistics such as internode certainy (ICA) scores and node-specific counts of gene duplications.
Maheux, Andrée F; Sellam, Adnane; Piché, Yves; Boissinot, Maurice; Pelletier, René; Boudreau, Dominique K; Picard, François J; Trépanier, Hélène; Boily, Marie-Josée; Ouellette, Marc; Roy, Paul H; Bergeron, Michel G
2016-12-01
Successful treatment of a Candida infection relies on 1) an accurate identification of the pathogenic fungus and 2) on its susceptibility to antifungal drugs. In the present study we investigated the level of correlation between phylogenetical evolution and susceptibility of pathogenic Candida spp. to antifungal drugs. For this, we compared a phylogenetic tree, assembled with the concatenated sequences (2475-bp) of the ATP2, TEF1, and TUF1 genes from 20 representative Candida species, with published minimal inhibitory concentrations (MIC) of the four principal antifungal drug classes commonly used in the treatment of candidiasis: polyenes, triazoles, nucleoside analogues, and echinocandins. The phylogenetic tree revealed three distinct phylogenetic clusters among Candida species. Species within a given phylogenetic cluster have generally similar susceptibility profiles to antifungal drugs and species within Clusters II and III were less sensitive to antifungal drugs than Cluster I species. These results showed that phylogenetical relationship between clusters and susceptibility to several antifungal drugs could be used to guide therapy when only species identification is available prior to information pertaining to its resistance profile. An extended study comprising a large panel of clinical samples should be conducted to confirm the efficiency of this approach in the treatment of candidiasis. Copyright © 2016. Published by Elsevier B.V.
Brankovics, Balázs; van Dam, Peter; Rep, Martijn; de Hoog, G Sybren; J van der Lee, Theo A; Waalwijk, Cees; van Diepeningen, Anne D
2017-09-18
The Fusarium oxysporum species complex (FOSC) contains several phylogenetic lineages. Phylogenetic studies identified two to three major clades within the FOSC. The mitochondrial sequences are highly informative phylogenetic markers, but have been mostly neglected due to technical difficulties. A total of 61 complete mitogenomes of FOSC strains were de novo assembled and annotated. Length variations and intron patterns support the separation of three phylogenetic species. The variable region of the mitogenome that is typical for the genus Fusarium shows two new variants in the FOSC. The variant typical for Fusarium is found in members of all three clades, while variant 2 is found in clades 2 and 3 and variant 3 only in clade 2. The extended set of loci analyzed using a new implementation of the genealogical concordance species recognition method support the identification of three phylogenetic species within the FOSC. Comparative analysis of the mitogenomes in the FOSC revealed ongoing mitochondrial recombination within, but not between phylogenetic species. The recombination indicates the presence of a parasexual cycle in F. oxysporum. The obstacles hindering the usage of the mitogenomes are resolved by using next generation sequencing and selective genome assemblers, such as GRAbB. Complete mitogenome sequences offer a stable basis and reference point for phylogenetic and population genetic studies.
Subbotin, S A; Vierstraete, A; De Ley, P; Rowe, J; Waeyenberge, L; Moens, M; Vanfleteren, J R
2001-10-01
The ITS1, ITS2, and 5.8S gene sequences of nuclear ribosomal DNA from 40 taxa of the family Heteroderidae (including the genera Afenestrata, Cactodera, Heterodera, Globodera, Punctodera, Meloidodera, Cryphodera, and Thecavermiculatus) were sequenced and analyzed. The ITS regions displayed high levels of sequence divergence within Heteroderinae and compared to outgroup taxa. Unlike recent findings in root knot nematodes, ITS sequence polymorphism does not appear to complicate phylogenetic analysis of cyst nematodes. Phylogenetic analyses with maximum-parsimony, minimum-evolution, and maximum-likelihood methods were performed with a range of computer alignments, including elision and culled alignments. All multiple alignments and phylogenetic methods yielded similar basic structure for phylogenetic relationships of Heteroderidae. The cyst-forming nematodes are represented by six main clades corresponding to morphological characters and host specialization, with certain clades assuming different positions depending on alignment procedure and/or method of phylogenetic inference. Hypotheses of monophyly of Punctoderinae and Heteroderinae are, respectively, strongly and moderately supported by the ITS data across most alignments. Close relationships were revealed between the Avenae and the Sacchari groups and between the Humuli group and the species H. salixophila within Heteroderinae. The Goettingiana group occupies a basal position within this subfamily. The validity of the genera Afenestrata and Bidera was tested and is discussed based on molecular data. We conclude that ITS sequence data are appropriate for studies of relationships within the different species groups and less so for recovery of more ancient speciations within Heteroderidae. Copyright 2001 Academic Press.
AGeNNT: annotation of enzyme families by means of refined neighborhood networks.
Kandlinger, Florian; Plach, Maximilian G; Merkl, Rainer
2017-05-25
Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an SSN provides strong clues to the specific function of enzymes constituting the different clusters. The Enzyme Function Initiative ( http://enzymefunction.org/ ) offers services that compute SSNs and GNNs. We have implemented AGeNNT that utilizes these services, albeit with datasets purged with respect to unspecific protein functions and overrepresented species. AGeNNT generates refined GNNs (rGNNs) that consist of cluster-nodes representing the sequences under study and Pfam-nodes representing enzyme functions encoded in the respective neighborhoods. For cluster-nodes, AGeNNT summarizes the phylogenetic relationships of the contributing species and a statistic indicates how unique nodes and GNs are within this rGNN. Pfam-nodes are annotated with additional features like GO terms describing protein function. For edges, the coverage is given, which is the relative number of neighborhoods containing the considered enzyme function (Pfam-node). AGeNNT is available at https://github.com/kandlinf/agennt . An rGNN is easier to interpret than a conventional GNN, which commonly contains proteins without enzymatic function and overly specific neighborhoods due to phylogenetic bias. The implemented filter routines and the statistic allow the user to identify those neighborhoods that are most indicative of a specific metabolic capacity. Thus, AGeNNT facilitates to distinguish and annotate functionally different members of enzyme families.
Functional Basis of Microorganism Classification.
Zhu, Chengsheng; Delmont, Tom O; Vogel, Timothy M; Bromberg, Yana
2015-08-01
Correctly identifying nearest "neighbors" of a given microorganism is important in industrial and clinical applications where close relationships imply similar treatment. Microbial classification based on similarity of physiological and genetic organism traits (polyphasic similarity) is experimentally difficult and, arguably, subjective. Evolutionary relatedness, inferred from phylogenetic markers, facilitates classification but does not guarantee functional identity between members of the same taxon or lack of similarity between different taxa. Using over thirteen hundred sequenced bacterial genomes, we built a novel function-based microorganism classification scheme, functional-repertoire similarity-based organism network (FuSiON; flattened to fusion). Our scheme is phenetic, based on a network of quantitatively defined organism relationships across the known prokaryotic space. It correlates significantly with the current taxonomy, but the observed discrepancies reveal both (1) the inconsistency of functional diversity levels among different taxa and (2) an (unsurprising) bias towards prioritizing, for classification purposes, relatively minor traits of particular interest to humans. Our dynamic network-based organism classification is independent of the arbitrary pairwise organism similarity cut-offs traditionally applied to establish taxonomic identity. Instead, it reveals natural, functionally defined organism groupings and is thus robust in handling organism diversity. Additionally, fusion can use organism meta-data to highlight the specific environmental factors that drive microbial diversification. Our approach provides a complementary view to cladistic assignments and holds important clues for further exploration of microbial lifestyles. Fusion is a more practical fit for biomedical, industrial, and ecological applications, as many of these rely on understanding the functional capabilities of the microbes in their environment and are less concerned with phylogenetic descent.
Functional Basis of Microorganism Classification
Zhu, Chengsheng; Delmont, Tom O.; Vogel, Timothy M.; Bromberg, Yana
2015-01-01
Correctly identifying nearest “neighbors” of a given microorganism is important in industrial and clinical applications where close relationships imply similar treatment. Microbial classification based on similarity of physiological and genetic organism traits (polyphasic similarity) is experimentally difficult and, arguably, subjective. Evolutionary relatedness, inferred from phylogenetic markers, facilitates classification but does not guarantee functional identity between members of the same taxon or lack of similarity between different taxa. Using over thirteen hundred sequenced bacterial genomes, we built a novel function-based microorganism classification scheme, functional-repertoire similarity-based organism network (FuSiON; flattened to fusion). Our scheme is phenetic, based on a network of quantitatively defined organism relationships across the known prokaryotic space. It correlates significantly with the current taxonomy, but the observed discrepancies reveal both (1) the inconsistency of functional diversity levels among different taxa and (2) an (unsurprising) bias towards prioritizing, for classification purposes, relatively minor traits of particular interest to humans. Our dynamic network-based organism classification is independent of the arbitrary pairwise organism similarity cut-offs traditionally applied to establish taxonomic identity. Instead, it reveals natural, functionally defined organism groupings and is thus robust in handling organism diversity. Additionally, fusion can use organism meta-data to highlight the specific environmental factors that drive microbial diversification. Our approach provides a complementary view to cladistic assignments and holds important clues for further exploration of microbial lifestyles. Fusion is a more practical fit for biomedical, industrial, and ecological applications, as many of these rely on understanding the functional capabilities of the microbes in their environment and are less concerned with phylogenetic descent. PMID:26317871
Genome composition and phylogeny of microbes predict their co-occurrence in the environment
2017-01-01
The genomic information of microbes is a major determinant of their phenotypic properties, yet it is largely unknown to what extent ecological associations between different species can be explained by their genome composition. To bridge this gap, this study introduces two new genome-wide pairwise measures of microbe-microbe interaction. The first (genome content similarity index) quantifies similarity in genome composition between two microbes, while the second (microbe-microbe functional association index) summarizes the topology of a protein functional association network built for a given pair of microbes and quantifies the fraction of network edges crossing organismal boundaries. These new indices are then used to predict co-occurrence between reference genomes from two 16S-based ecological datasets, accounting for phylogenetic relatedness of the taxa. Phylogenetic relatedness was found to be a strong predictor of ecological associations between microbes which explains about 10% of variance in co-occurrence data, but genome composition was found to be a strong predictor as well, it explains up to 4% the variance in co-occurrence when all genomic-based indices are used in combination, even after accounting for evolutionary relationships between the species. On their own, the metrics proposed here explain a larger proportion of variance than previously reported more complex methods that rely on metabolic network comparisons. In summary, results of this study indicate that microbial genomes do indeed contain detectable signal of organismal ecology, and the methods described in the paper can be used to improve mechanistic understanding of microbe-microbe interactions. PMID:28152007
A Public Health Model for the Molecular Surveillance of HIV Transmission in San Diego, California
May, Susanne; Tweeten, Samantha; Drumright, Lydia; Pacold, Mary E.; Kosakovsky Pond, Sergei L.; Pesano, Rick L.; Lie, Yolanda S.; Richman, Douglas D.; Frost, Simon D.W.; Woelk, Christopher H.; Little, Susan J.
2009-01-01
Background Current public health efforts often use molecular technologies to identify and contain communicable disease networks, but not for HIV. Here, we investigate how molecular epidemiology can be used to identify highly-related HIV networks within a population and how voluntary contact tracing of sexual partners can be used to selectively target these networks. Methods We evaluated the use of HIV-1 pol sequences obtained from participants of a community-recruited cohort (n=268) and a primary infection research cohort (n=369) to define highly related transmission clusters and the use of contact tracing to link other individuals (n=36) within these clusters. The presence of transmitted drug resistance was interpreted from the pol sequences (Calibrated Population Resistance v3.0). Results Phylogenetic clustering was conservatively defined when the genetic distance between any two pol sequences was <1%, which identified 34 distinct transmission clusters within the combined community-recruited and primary infection research cohorts containing 160 individuals. Although sequences from the epidemiologically-linked partners represented approximately 5% of the total sequences, they clustered with 60% of the sequences that clustered from the combined cohorts (O.R. 21.7; p=<0.01). Major resistance to at least one class of antiretroviral medication was found in 19% of clustering sequences. Conclusions Phylogenetic methods can be used to identify individuals who are within highly related transmission groups, and contact tracing of epidemiologically-linked partners of recently infected individuals can be used to link into previously-defined transmission groups. These methods could be used to implement selectively targeted prevention interventions. PMID:19098493
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation
Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.; ...
2016-11-24
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. A multitude of technologies, abstractions, and interpretive frameworks have emerged to answer the challenges presented by genome function and regulatory network inference. Here, we propose a new approach for producing biologically meaningful clusters of coexpressed genes, called Atomic Regulons (ARs), based on expression data, gene context, and functional relationships. We demonstrate this new approach by computing ARs for Escherichia coli, which we compare with the coexpressed gene clusters predicted by two prevalent existing methods: hierarchical clustering and k-meansmore » clustering. We test the consistency of ARs predicted by all methods against expected interactions predicted by the Context Likelihood of Relatedness (CLR) mutual information based method, finding that the ARs produced by our approach show better agreement with CLR interactions. We then apply our method to compute ARs for four other genomes: Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus. We compare the AR clusters from all genomes to study the similarity of coexpression among a phylogenetically diverse set of species, identifying subsystems that show remarkable similarity over wide phylogenetic distances. We also study the sensitivity of our method for computing ARs to the expression data used in the computation, showing that our new approach requires less data than competing approaches to converge to a near final configuration of ARs. We go on to use our sensitivity analysis to identify the specific experiments that lead most rapidly to the final set of ARs for E. coli. As a result, this analysis produces insights into improving the design of gene expression experiments.« less
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Faria, José P.; Davis, James J.; Edirisinghe, Janaka N.
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. A multitude of technologies, abstractions, and interpretive frameworks have emerged to answer the challenges presented by genome function and regulatory network inference. Here, we propose a new approach for producing biologically meaningful clusters of coexpressed genes, called Atomic Regulons (ARs), based on expression data, gene context, and functional relationships. We demonstrate this new approach by computing ARs for Escherichia coli, which we compare with the coexpressed gene clusters predicted by two prevalent existing methods: hierarchical clustering and k-meansmore » clustering. We test the consistency of ARs predicted by all methods against expected interactions predicted by the Context Likelihood of Relatedness (CLR) mutual information based method, finding that the ARs produced by our approach show better agreement with CLR interactions. We then apply our method to compute ARs for four other genomes: Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus. We compare the AR clusters from all genomes to study the similarity of coexpression among a phylogenetically diverse set of species, identifying subsystems that show remarkable similarity over wide phylogenetic distances. We also study the sensitivity of our method for computing ARs to the expression data used in the computation, showing that our new approach requires less data than competing approaches to converge to a near final configuration of ARs. We go on to use our sensitivity analysis to identify the specific experiments that lead most rapidly to the final set of ARs for E. coli. As a result, this analysis produces insights into improving the design of gene expression experiments.« less