Sample records for identify source-associated phylogenetic

  1. Phylogenetic congruence between subtropical trees and their associated fungi.

    PubMed

    Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao

    2016-12-01

    Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi in has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK , rbcL , atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.

  2. Identifiability of tree-child phylogenetic networks under a probabilistic recombination-mutation model of evolution.

    PubMed

    Francis, Andrew; Moulton, Vincent

    2018-06-07

    Phylogenetic networks are an extension of phylogenetic trees which are used to represent evolutionary histories in which reticulation events (such as recombination and hybridization) have occurred. A central question for such networks is that of identifiability, which essentially asks under what circumstances can we reliably identify the phylogenetic network that gave rise to the observed data? Recently, identifiability results have appeared for networks relative to a model of sequence evolution that generalizes the standard Markov models used for phylogenetic trees. However, these results are quite limited in terms of the complexity of the networks that are considered. In this paper, by introducing an alternative probabilistic model for evolution along a network that is based on some ground-breaking work by Thatte for pedigrees, we are able to obtain an identifiability result for a much larger class of phylogenetic networks (essentially the class of so-called tree-child networks). To prove our main theorem, we derive some new results for identifying tree-child networks combinatorially, and then adapt some techniques developed by Thatte for pedigrees to show that our combinatorial results imply identifiability in the probabilistic setting. We hope that the introduction of our new model for networks could lead to new approaches to reliably construct phylogenetic networks. Copyright © 2018 Elsevier Ltd. All rights reserved.

  3. Phylogenetic Conflict in Bears Identified by Automated Discovery of Transposable Element Insertions in Low-Coverage Genomes

    PubMed Central

    Gallus, Susanne; Janke, Axel

    2017-01-01

    Abstract Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. PMID:28985298

  4. Phylogenetic Conflict in Bears Identified by Automated Discovery of Transposable Element Insertions in Low-Coverage Genomes.

    PubMed

    Lammers, Fritjof; Gallus, Susanne; Janke, Axel; Nilsson, Maria A

    2017-10-01

    Phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high-throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) identified 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Analysis of single nucleotide substitutions in the flanking regions of the TEs shows that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, despite strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun, and sloth bear form a monophyletic clade, in which phylogenetic incongruence originates from incomplete lineage sorting. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it possible to confidently extract thousands of TE insertions even from low-coverage genomes (∼10×) of nonmodel organisms. This opens new possibilities for biologists to study phylogenies and evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Evolutionary lineages of marine snails identified using molecular phylogenetics and geometric morphometric analysis of shells.

    PubMed

    Vaux, Felix; Trewick, Steven A; Crampton, James S; Marshall, Bruce A; Beu, Alan G; Hills, Simon F K; Morgan-Richards, Mary

    2018-06-15

    The relationship between morphology and inheritance is of perennial interest in evolutionary biology and palaeontology. Using three marine snail genera Penion, Antarctoneptunea and Kelletia, we investigate whether systematics based on shell morphology accurately reflect evolutionary lineages indicated by molecular phylogenetics. Members of these gastropod genera have been a taxonomic challenge due to substantial variation in shell morphology, conservative radular and soft tissue morphology, few known ecological differences, and geographical overlap between numerous species. Sampling all sixteen putative taxa identified across the three genera, we infer mitochondrial and nuclear ribosomal DNA phylogenetic relationships within the group, and compare this to variation in adult shell shape and size. Results of phylogenetic analysis indicate that each genus is monophyletic, although the status of some phylogenetically derived and likely more recently evolved taxa within Penion is uncertain. The recently described species P. lineatus is supported by genetic evidence. Morphology, captured using geometric morphometric analysis, distinguishes the genera and matches the molecular phylogeny, although using the same dataset, species and phylogenetic subclades are not identified with high accuracy. Overall, despite abundant variation, we find that shell morphology accurately reflects genus-level classification and the corresponding deep phylogenetic splits identified in this group of marine snails. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. Tetrapods on the EDGE: Overcoming data limitations to identify phylogenetic conservation priorities

    PubMed Central

    Gray, Claudia L.; Wearn, Oliver R.; Owen, Nisha R.

    2018-01-01

    The scale of the ongoing biodiversity crisis requires both effective conservation prioritisation and urgent action. As extinction is non-random across the tree of life, it is important to prioritise threatened species which represent large amounts of evolutionary history. The EDGE metric prioritises species based on their Evolutionary Distinctiveness (ED), which measures the relative contribution of a species to the total evolutionary history of their taxonomic group, and Global Endangerment (GE), or extinction risk. EDGE prioritisations rely on adequate phylogenetic and extinction risk data to generate meaningful priorities for conservation. However, comprehensive phylogenetic trees of large taxonomic groups are extremely rare and, even when available, become quickly out-of-date due to the rapid rate of species descriptions and taxonomic revisions. Thus, it is important that conservationists can use the available data to incorporate evolutionary history into conservation prioritisation. We compared published and new methods to estimate missing ED scores for species absent from a phylogenetic tree whilst simultaneously correcting the ED scores of their close taxonomic relatives. We found that following artificial removal of species from a phylogenetic tree, the new method provided the closest estimates of their “true” ED score, differing from the true ED score by an average of less than 1%, compared to the 31% and 38% difference of the previous methods. The previous methods also substantially under- and over-estimated scores as more species were artificially removed from a phylogenetic tree. We therefore used the new method to estimate ED scores for all tetrapods. From these scores we updated EDGE prioritisation rankings for all tetrapod species with IUCN Red List assessments, including the first EDGE prioritisation for reptiles. Further, we identified criteria to identify robust priority species in an effort to further inform conservation action whilst

  7. An original phylogenetic approach identified mitochondrial haplogroup T1a1 as inversely associated with breast cancer risk in BRCA2 mutation carriers.

    PubMed

    Blein, Sophie; Bardel, Claire; Danjean, Vincent; McGuffog, Lesley; Healey, Sue; Barrowdale, Daniel; Lee, Andrew; Dennis, Joe; Kuchenbaecker, Karoline B; Soucy, Penny; Terry, Mary Beth; Chung, Wendy K; Goldgar, David E; Buys, Saundra S; Janavicius, Ramunas; Tihomirova, Laima; Tung, Nadine; Dorfling, Cecilia M; van Rensburg, Elizabeth J; Neuhausen, Susan L; Ding, Yuan Chun; Gerdes, Anne-Marie; Ejlertsen, Bent; Nielsen, Finn C; Hansen, Thomas Vo; Osorio, Ana; Benitez, Javier; Conejero, Raquel Andrés; Segota, Ena; Weitzel, Jeffrey N; Thelander, Margo; Peterlongo, Paolo; Radice, Paolo; Pensotti, Valeria; Dolcetti, Riccardo; Bonanni, Bernardo; Peissel, Bernard; Zaffaroni, Daniela; Scuvera, Giulietta; Manoukian, Siranoush; Varesco, Liliana; Capone, Gabriele L; Papi, Laura; Ottini, Laura; Yannoukakos, Drakoulis; Konstantopoulou, Irene; Garber, Judy; Hamann, Ute; Donaldson, Alan; Brady, Angela; Brewer, Carole; Foo, Claire; Evans, D Gareth; Frost, Debra; Eccles, Diana; Douglas, Fiona; Cook, Jackie; Adlard, Julian; Barwell, Julian; Walker, Lisa; Izatt, Louise; Side, Lucy E; Kennedy, M John; Tischkowitz, Marc; Rogers, Mark T; Porteous, Mary E; Morrison, Patrick J; Platte, Radka; Eeles, Ros; Davidson, Rosemarie; Hodgson, Shirley; Cole, Trevor; Godwin, Andrew K; Isaacs, Claudine; Claes, Kathleen; De Leeneer, Kim; Meindl, Alfons; Gehrig, Andrea; Wappenschmidt, Barbara; Sutter, Christian; Engel, Christoph; Niederacher, Dieter; Steinemann, Doris; Plendl, Hansjoerg; Kast, Karin; Rhiem, Kerstin; Ditsch, Nina; Arnold, Norbert; Varon-Mateeva, Raymonda; Schmutzler, Rita K; Preisler-Adams, Sabine; Markov, Nadja Bogdanova; Wang-Gohrke, Shan; de Pauw, Antoine; Lefol, Cédrick; Lasset, Christine; Leroux, Dominique; Rouleau, Etienne; Damiola, Francesca; Dreyfus, Hélène; Barjhoux, Laure; Golmard, Lisa; Uhrhammer, Nancy; Bonadona, Valérie; Sornin, Valérie; Bignon, Yves-Jean; Carter, Jonathan; Van Le, Linda; Piedmonte, Marion; DiSilvestro, Paul A; de la Hoya, Miguel; Caldes, Trinidad; Nevanlinna, Heli; Aittomäki, Kristiina; Jager, Agnes; van den Ouweland, Ans Mw; Kets, Carolien M; Aalfs, Cora M; van Leeuwen, Flora E; Hogervorst, Frans Bl; Meijers-Heijboer, Hanne Ej; Oosterwijk, Jan C; van Roozendaal, Kees Ep; Rookus, Matti A; Devilee, Peter; van der Luijt, Rob B; Olah, Edith; Diez, Orland; Teulé, Alex; Lazaro, Conxi; Blanco, Ignacio; Del Valle, Jesús; Jakubowska, Anna; Sukiennicki, Grzegorz; Gronwald, Jacek; Lubinski, Jan; Durda, Katarzyna; Jaworska-Bieniek, Katarzyna; Agnarsson, Bjarni A; Maugard, Christine; Amadori, Alberto; Montagna, Marco; Teixeira, Manuel R; Spurdle, Amanda B; Foulkes, William; Olswold, Curtis; Lindor, Noralane M; Pankratz, Vernon S; Szabo, Csilla I; Lincoln, Anne; Jacobs, Lauren; Corines, Marina; Robson, Mark; Vijai, Joseph; Berger, Andreas; Fink-Retter, Anneliese; Singer, Christian F; Rappaport, Christine; Kaulich, Daphne Geschwantler; Pfeiler, Georg; Tea, Muy-Kheng; Greene, Mark H; Mai, Phuong L; Rennert, Gad; Imyanitov, Evgeny N; Mulligan, Anna Marie; Glendon, Gord; Andrulis, Irene L; Tchatchou, Sandrine; Toland, Amanda Ewart; Pedersen, Inge Sokilde; Thomassen, Mads; Kruse, Torben A; Jensen, Uffe Birk; Caligo, Maria A; Friedman, Eitan; Zidan, Jamal; Laitman, Yael; Lindblom, Annika; Melin, Beatrice; Arver, Brita; Loman, Niklas; Rosenquist, Richard; Olopade, Olufunmilayo I; Nussbaum, Robert L; Ramus, Susan J; Nathanson, Katherine L; Domchek, Susan M; Rebbeck, Timothy R; Arun, Banu K; Mitchell, Gillian; Karlan, Beth Y; Lester, Jenny; Orsulic, Sandra; Stoppa-Lyonnet, Dominique; Thomas, Gilles; Simard, Jacques; Couch, Fergus J; Offit, Kenneth; Easton, Douglas F; Chenevix-Trench, Georgia; Antoniou, Antonis C; Mazoyer, Sylvie; Phelan, Catherine M; Sinilnikova, Olga M; Cox, David G

    2015-04-25

    Individuals carrying pathogenic mutations in the BRCA1 and BRCA2 genes have a high lifetime risk of breast cancer. BRCA1 and BRCA2 are involved in DNA double-strand break repair, DNA alterations that can be caused by exposure to reactive oxygen species, a main source of which are mitochondria. Mitochondrial genome variations affect electron transport chain efficiency and reactive oxygen species production. Individuals with different mitochondrial haplogroups differ in their metabolism and sensitivity to oxidative stress. Variability in mitochondrial genetic background can alter reactive oxygen species production, leading to cancer risk. In the present study, we tested the hypothesis that mitochondrial haplogroups modify breast cancer risk in BRCA1/2 mutation carriers. We genotyped 22,214 (11,421 affected, 10,793 unaffected) mutation carriers belonging to the Consortium of Investigators of Modifiers of BRCA1/2 for 129 mitochondrial polymorphisms using the iCOGS array. Haplogroup inference and association detection were performed using a phylogenetic approach. ALTree was applied to explore the reference mitochondrial evolutionary tree and detect subclades enriched in affected or unaffected individuals. We discovered that subclade T1a1 was depleted in affected BRCA2 mutation carriers compared with the rest of clade T (hazard ratio (HR) = 0.55; 95% confidence interval (CI), 0.34 to 0.88; P = 0.01). Compared with the most frequent haplogroup in the general population (that is, H and T clades), the T1a1 haplogroup has a HR of 0.62 (95% CI, 0.40 to 0.95; P = 0.03). We also identified three potential susceptibility loci, including G13708A/rs28359178, which has demonstrated an inverse association with familial breast cancer risk. This study illustrates how original approaches such as the phylogeny-based method we used can empower classical molecular epidemiological studies aimed at identifying association or risk modification effects.

  8. Phylogenetic Analyses of Armillaria Reveal at Least 15 Phylogenetic Lineages in China, Seven of Which Are Associated with Cultivated Gastrodia elata

    PubMed Central

    Guo, Ting; Wang, Han Chen; Xue, Wan Qiu; Zhao, Jun; Yang, Zhu L.

    2016-01-01

    Fungal species of Armillaria, which can act as plant pathogens and/or symbionts of the Chinese traditional medicinal herb Gastrodia elata (“Tianma”), are ecologically and economically important and have consequently attracted the attention of mycologists. However, their taxonomy has been highly dependent on morphological characterization and mating tests. In this study, we phylogenetically analyzed Chinese Armillaria samples using the sequences of the internal transcribed spacer region, translation elongation factor-1 alpha gene and beta-tubulin gene. Our data revealed at least 15 phylogenetic lineages of Armillaria from China, of which seven were newly discovered and two were recorded from China for the first time. Fourteen Chinese biological species of Armillaria, which were previously defined based on mating tests, could be assigned to the 15 phylogenetic lineages identified herein. Seven of the 15 phylogenetic lineages were found to be disjunctively distributed in different continents of the Northern Hemisphere, while eight were revealed to be endemic to certain continents. In addition, we found that seven phylogenetic lineages of Armillaria were used for the cultivation of Tianma, only two of which had been recorded to be associated with Tianma previously. We also illustrated that G. elata f. glauca (“Brown Tianma”) and G. elata f. elata (“Red Tianma”), two cultivars of Tianma grown in different regions of China, form symbiotic relationships with different phylogenetic lineages of Armillaria. These findings should aid the development of Tianma cultivation in China. PMID:27138686

  9. Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis.

    PubMed

    Stoltzfus, Arlin; O'Meara, Brian; Whitacre, Jamie; Mounce, Ross; Gillespie, Emily L; Kumar, Sudhir; Rosauer, Dan F; Vos, Rutger A

    2012-10-22

    Recently, various evolution-related journals adopted policies to encourage or require archiving of phylogenetic trees and associated data. Such attention to practices that promote sharing of data reflects rapidly improving information technology, and rapidly expanding potential to use this technology to aggregate and link data from previously published research. Nevertheless, little is known about current practices, or best practices, for publishing trees and associated data so as to promote re-use. Here we summarize results of an ongoing analysis of current practices for archiving phylogenetic trees and associated data, current practices of re-use, and current barriers to re-use. We find that the technical infrastructure is available to support rudimentary archiving, but the frequency of archiving is low. Currently, most phylogenetic knowledge is not easily re-used due to a lack of archiving, lack of awareness of best practices, and lack of community-wide standards for formatting data, naming entities, and annotating data. Most attempts at data re-use seem to end in disappointment. Nevertheless, we find many positive examples of data re-use, particularly those that involve customized species trees generated by grafting to, and pruning from, a much larger tree. The technologies and practices that facilitate data re-use can catalyze synthetic and integrative research. However, success will require engagement from various stakeholders including individual scientists who produce or consume shareable data, publishers, policy-makers, technology developers and resource-providers. The critical challenges for facilitating re-use of phylogenetic trees and associated data, we suggest, include: a broader commitment to public archiving; more extensive use of globally meaningful identifiers; development of user-friendly technology for annotating, submitting, searching, and retrieving data and their metadata; and development of a minimum reporting standard (MIAPA) indicating

  10. Phylogenetic characterization of culturable bacteria and fungi associated with tarballs from Betul beach, Goa, India.

    PubMed

    Shinde, Varsha Laxman; Meena, Ram Murti; Shenoy, Belle Damodara

    2018-03-01

    Tarballs are semisolid blobs of crude oil, normally formed due to weathering of crude-oil in the sea after any kind of oil spills. Microorganisms are believed to thrive on hydrocarbon-rich tarballs and possibly assist in biodegradation. The taxonomy of ecologically and economically important tarball-associated microbes, however, needs improvement as DNA-based identification and phylogenetic characterization have been scarcely incorporated into it. In this study, bacteria and fungi associated with tarballs from touristic Betul beach in Goa, India were isolated, followed by phylogenetic analyses of 16S rRNA gene and the ITS sequence-data to decipher their clustering patterns with closely-related taxa. The gene-sequence analyses identified phylogenetically diverse 20 bacterial genera belonging to the phyla Proteobacteria (14), Actinobacteria (3), Firmicutes (2) and Bacteroidetes (1), and 8 fungal genera belonging to the classes Eurotiomycetes (6), Sordariomycetes (1) and Leotiomycetes (1) associated with the Betul tarball samples. Future studies employing a polyphasic approach, including multigene sequence-data, are needed for species-level identification of culturable tarball-associated microbes. This paper also discusses potentials of tarball-associated microbes to degrade hydrocarbons. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. PCR-Internal Transcribed Spacer (ITS) genes sequencing and phylogenetic analysis of clinical and environmental Aspergillus species associated with HIV-TB co infected patients in a hospital in Abeokuta, southwestern Nigeria.

    PubMed

    Shittu, Olufunke Bolatito; Adelaja, Oluwabunmi Molade; Obuotor, Tolulope Mobolaji; Sam-Wobo, Sam Olufemi; Adenaike, Adeyemi Sunday

    2016-03-01

    Aspergillosis has been identified as one of the hospital acquired infections but the contribution of water and inhouse air as possible sources of Aspergillus infection in immunocompromised individuals like HIV-TB patients have not been studied in any hospital setting in Nigeria. To identify and investigate genetic relationship between clinical and environmental Aspergillus sp. associated with HIV-TB co infected patients. DNA extraction, purification, amplification and sequencing of Internal Transcribed Spacer (ITS) genes were performed using standard protocols. Similarity search using BLAST on NCBI was used for species identification and MEGA 5.0 was used for phylogenetic analysis. Analyses of sequenced ITS genes of selected fourteen (14) Aspergillus isolates identified in the GenBank database revealed Aspergillus niger (28.57%), A. tubingensis (7.14%), A. flavus (7.14%) and A. fumigatus (57.14%). Aspergillus in sputum of HIV patients were Aspergillus niger, A. fumigatus, A. tubingensis and A. flavus. Also, A. niger and A. fumigatus were identified from water and open-air. Phylogenetic analysis of sequences yielded genetic relatedness between clinical and environmental isolates. Water and air in health care settings in Nigeria are important sources of Aspergillus sp. for HIV-TB patients.

  12. A scalable method for identifying frequent subtrees in sets of large phylogenetic trees.

    PubMed

    Ramu, Avinash; Kahveci, Tamer; Burleigh, J Gordon

    2012-10-03

    We consider the problem of finding the maximum frequent agreement subtrees (MFASTs) in a collection of phylogenetic trees. Existing methods for this problem often do not scale beyond datasets with around 100 taxa. Our goal is to address this problem for datasets with over a thousand taxa and hundreds of trees. We develop a heuristic solution that aims to find MFASTs in sets of many, large phylogenetic trees. Our method works in multiple phases. In the first phase, it identifies small candidate subtrees from the set of input trees which serve as the seeds of larger subtrees. In the second phase, it combines these small seeds to build larger candidate MFASTs. In the final phase, it performs a post-processing step that ensures that we find a frequent agreement subtree that is not contained in a larger frequent agreement subtree. We demonstrate that this heuristic can easily handle data sets with 1000 taxa, greatly extending the estimation of MFASTs beyond current methods. Although this heuristic does not guarantee to find all MFASTs or the largest MFAST, it found the MFAST in all of our synthetic datasets where we could verify the correctness of the result. It also performed well on large empirical data sets. Its performance is robust to the number and size of the input trees. Overall, this method provides a simple and fast way to identify strongly supported subtrees within large phylogenetic hypotheses.

  13. A scalable method for identifying frequent subtrees in sets of large phylogenetic trees

    PubMed Central

    2012-01-01

    Background We consider the problem of finding the maximum frequent agreement subtrees (MFASTs) in a collection of phylogenetic trees. Existing methods for this problem often do not scale beyond datasets with around 100 taxa. Our goal is to address this problem for datasets with over a thousand taxa and hundreds of trees. Results We develop a heuristic solution that aims to find MFASTs in sets of many, large phylogenetic trees. Our method works in multiple phases. In the first phase, it identifies small candidate subtrees from the set of input trees which serve as the seeds of larger subtrees. In the second phase, it combines these small seeds to build larger candidate MFASTs. In the final phase, it performs a post-processing step that ensures that we find a frequent agreement subtree that is not contained in a larger frequent agreement subtree. We demonstrate that this heuristic can easily handle data sets with 1000 taxa, greatly extending the estimation of MFASTs beyond current methods. Conclusions Although this heuristic does not guarantee to find all MFASTs or the largest MFAST, it found the MFAST in all of our synthetic datasets where we could verify the correctness of the result. It also performed well on large empirical data sets. Its performance is robust to the number and size of the input trees. Overall, this method provides a simple and fast way to identify strongly supported subtrees within large phylogenetic hypotheses. PMID:23033843

  14. Molecular epidemiology and phylogenetic distribution of the Escherichia coli pks genomic island.

    PubMed

    Johnson, James R; Johnston, Brian; Kuskowski, Michael A; Nougayrede, Jean-Philippe; Oswald, Eric

    2008-12-01

    Epidemiological and phylogenetic associations of the pks genomic island of extraintestinal pathogenic Escherichia coli (ExPEC), which encodes the genotoxin colibactin, are incompletely defined. clbB and clbN (as markers for the 5' and 3' regions of the pks island, respectively), clbA and clbQ (as supplemental pks island markers), and 12 other putative ExPEC virulence genes were newly sought by PCR among 131 published E. coli isolates from hospitalized veterans (62 blood isolates and 69 fecal isolates). Blood and fecal isolates and clbB-positive and -negative isolates were compared for 66 newly and previously assessed traits. Among the 14 newly sought traits, clbB and clbN (colibactin polyketide synthesis system), hra (heat-resistant agglutinin), and vat (vacuolating toxin) were significantly associated with bacteremia. clbB and clbN identified a subset within phylogenetic group B2 with extremely high virulence scores and a high proportion of blood isolates. However, by multivariable analysis, other traits were more predictive of blood source than clbB and clbN were; indeed, among the newly sought traits, only pic significantly predicted bacteremia (negative association). By correspondence analysis, clbB and clbN were closely associated with group B2 and multiple B2-associated traits; by principal coordinate analysis, clbB and clbN partitioned the data set better than did blood versus fecal source. Thus, the pks island was significantly associated with bacteremia, multiple ExPEC-associated virulence genes, and group B2, and within group B2, it identified an especially high-virulence subset. This extends previous work regarding the pks island and supports investigation of the colibactin system as a potential therapeutic target.

  15. Multidrug- and Extensively Drug-Resistant Uropathogenic Escherichia coli Clinical Strains: Phylogenetic Groups Widely Associated with Integrons Maintain High Genetic Diversity.

    PubMed

    Ochoa, Sara A; Cruz-Córdova, Ariadnna; Luna-Pineda, Victor M; Reyes-Grajeda, Juan P; Cázares-Domínguez, Vicenta; Escalona, Gerardo; Sepúlveda-González, Ma Eugenia; López-Montiel, Fernanda; Arellano-Galindo, José; López-Martínez, Briceida; Parra-Ortega, Israel; Giono-Cerezo, Silvia; Hernández-Castro, Rigoberto; de la Rosa-Zamboni, Daniela; Xicohtencatl-Cortes, Juan

    2016-01-01

    In recent years, an increase of uropathogenic Escherichia coli (UPEC) strains with Multidrug-resistant (MDR) and Extensively Drug-resistant (XDR) profiles that complicate therapy for urinary tract infections (UTIs) has been observed and has directly impacted costs and extended hospital stays. The aim of this study was to determine MDR- and XDR-UPEC clinical strains, their virulence genes, their phylogenetic groups and to ascertain their relationship with integrons and genetic diversity. From a collection of 500 UPEC strains, 103 were selected with MDR and XDR characteristics. MDR-UPEC strains were mainly associated with phylogenetic groups D (54.87%) and B2 (39.02%) with a high percentage (≥70%) of several fimbrial genes ( ecpA, fimH, csgA , and papG II), an iron uptake gene ( chuA ), and a toxin gene ( hlyA ). In addition, a moderate frequency (40-70%) of other genes ( iutD, tosA , and bcs A) was observed. XDR-UPEC strains were predominantly associated with phylogenetic groups B2 (47.61%) and D (42.85%), which grouped with ≥80 virulence genes, including ecpA, fimH, csgA, papG II, iutD , and chuA . A moderate frequency (40-70%) of the tosA and hlyA genes was observed. The class 1 and 2 integrons that were identified in the MDR- and XDR-UPEC strains were associated with phylogenetic groups D, B2, and A, while the XDR-UPEC strains that were associated with phylogenetic groups B2, D, and A showed an extended-spectrum beta-lactamase (ESBL) phenotype. The modifying enzymes ( aad A1, aad B, aac C, ant 1, dfr A1, dfr A17, and aad A4) that were identified in the variable region of class 1 and 2 integrons from the MDR strains showed resistance to gentamycin (56.25 and 66.66%, respectively) and trimethoprim-sulfamethoxazole (84.61 and 66.66%, respectively). The MDR- and XDR-UPEC strains were distributed into seven clusters and were closely related to phylogenic groups B2 and D. The diversity analysis by PFGE showed 42.68% of clones of MDR-UPEC and no clonal association

  16. Phylogenetic and microsatellite markers for Tulasnella (Tulasnellaceae) mycorrhizal fungi associated with Australian orchids.

    PubMed

    Ruibal, Monica P; Peakall, Rod; Smith, Leon M; Linde, Celeste C

    2013-03-01

    Phylogenetic and microsatellite markers were developed for Tulasnella mycorrhizal fungi to investigate fungal species identity and diversity. These markers will be useful in future studies investigating the phylogenetic relationship of the fungal symbionts, specificity of orchid-mycorrhizal associations, and the role of mycorrhizae in orchid speciation within several orchid genera. • We generated partial genome sequences of two Tulasnella symbionts originating from Chiloglottis and Drakaea orchid species with 454 genome sequencing. Cross-genus transferability across mycorrhizal symbionts associated with multiple genera of Australian orchids (Arthrochilus, Chiloglottis, Drakaea, and Paracaleana) was found for seven phylogenetic loci. Five loci showed cross-transferability to Tulasnella from other orchid genera, and two to Sebacina. Furthermore, 11 polymorphic microsatellite loci were developed for Tulasnella from Chiloglottis. • Highly informative markers were obtained, allowing investigation of mycorrhizal diversity of Tulasnellaceae associated with a wide variety of terrestrial orchids in Australia and potentially worldwide.

  17. Phylogenetic diversity, host-specificity and community profiling of sponge-associated bacteria in the northern Gulf of Mexico.

    PubMed

    Erwin, Patrick M; Olson, Julie B; Thacker, Robert W

    2011-01-01

    Marine sponges can associate with abundant and diverse consortia of microbial symbionts. However, associated bacteria remain unexamined for the majority of host sponges and few studies use phylogenetic metrics to quantify symbiont community diversity. DNA fingerprinting techniques, such as terminal restriction fragment length polymorphisms (T-RFLP), might provide rapid profiling of these communities, but have not been explicitly compared to traditional methods. We investigated the bacterial communities associated with the marine sponges Hymeniacidon heliophila and Haliclona tubifera, a sympatric tunicate, Didemnum sp., and ambient seawater from the northern Gulf of Mexico by combining replicated clone libraries with T-RFLP analyses of 16S rRNA gene sequences. Clone libraries revealed that bacterial communities associated with the two sponges exhibited lower species richness and lower species diversity than seawater and tunicate assemblages, with differences in species composition among all four source groups. T-RFLP profiles clustered microbial communities by source; individual T-RFs were matched to the majority (80.6%) of clone library sequences, indicating that T-RFLP analysis can be used to rapidly profile these communities. Phylogenetic metrics of community diversity indicated that the two sponge-associated bacterial communities include dominant and host-specific bacterial lineages that are distinct from bacteria recovered from seawater, tunicates, and unrelated sponge hosts. In addition, a large proportion of the symbionts associated with H. heliophila were shared with distant, conspecific host populations in the southwestern Atlantic (Brazil). The low diversity and species-specific nature of bacterial communities associated with H. heliophila and H. tubifera represent a distinctly different pattern from other, reportedly universal, sponge-associated bacterial communities. Our replicated sampling strategy, which included samples that reflect the ambient

  18. Phylogenetic diversity and biodiversity indices on phylogenetic networks.

    PubMed

    Wicke, Kristina; Fischer, Mareike

    2018-04-01

    In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.

  19. Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

    2003-06-01

    OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally importantmore » for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.« less

  20. Frugivores bias seed-adult tree associations through nonrandom seed dispersal: a phylogenetic approach.

    PubMed

    Razafindratsima, Onja H; Dunham, Amy E

    2016-08-01

    Frugivores are the main seed dispersers in many ecosystems, such that behaviorally driven, nonrandom patterns of seed dispersal are a common process; but patterns are poorly understood. Characterizing these patterns may be essential for understanding spatial organization of fruiting trees and drivers of seed-dispersal limitation in biodiverse forests. To address this, we studied resulting spatial associations between dispersed seeds and adult tree neighbors in a diverse rainforest in Madagascar, using a temporal and phylogenetic approach. Data show that by using fruiting trees as seed-dispersal foci, frugivores bias seed dispersal under conspecific adults and under heterospecific trees that share dispersers and fruiting time with the dispersed species. Frugivore-mediated seed dispersal also resulted in nonrandom phylogenetic associations of dispersed seeds with their nearest adult neighbors, in nine out of the 16 months of our study. However, these nonrandom phylogenetic associations fluctuated unpredictably over time, ranging from clustered to overdispersed. The spatial and phylogenetic template of seed dispersal did not translate to similar patterns of association in adult tree neighborhoods, suggesting the importance of post-dispersal processes in structuring plant communities. Results suggest that frugivore-mediated seed dispersal is important for structuring early stages of plant-plant associations, setting the template for post-dispersal processes that influence ultimate patterns of plant recruitment. Importantly, if biased patterns of dispersal are common in other systems, frugivores may promote tree coexistence in biodiverse forests by limiting the frequency and diversity of heterospecific interactions of seeds they disperse. © 2016 by the Ecological Society of America.

  1. Exploring the Genomic Roadmap and Molecular Phylogenetics Associated with MODY Cascades Using Computational Biology.

    PubMed

    Chakraborty, Chiranjib; Bandyopadhyay, Sanghamitra; Doss, C George Priya; Agoramoorthy, Govindasamy

    2015-04-01

    Maturity onset diabetes of the young (MODY) is a metabolic and genetic disorder. It is different from type 1 and type 2 diabetes with low occurrence level (1-2%) among all diabetes. This disorder is a consequence of β-cell dysfunction. Till date, 11 subtypes of MODY have been identified, and all of them can cause gene mutations. However, very little is known about the gene mapping, molecular phylogenetics, and co-expression among MODY genes and networking between cascades. This study has used latest servers and software such as VarioWatch, ClustalW, MUSCLE, G Blocks, Phylogeny.fr, iTOL, WebLogo, STRING, and KEGG PATHWAY to perform comprehensive analyses of gene mapping, multiple sequences alignment, molecular phylogenetics, protein-protein network design, co-expression analysis of MODY genes, and pathway development. The MODY genes are located in chromosomes-2, 7, 8, 9, 11, 12, 13, 17, and 20. Highly aligned block shows Pro, Gly, Leu, Arg, and Pro residues are highly aligned in the positions of 296, 386, 437, 455, 456 and 598, respectively. Alignment scores inform us that HNF1A and HNF1B proteins have shown high sequence similarity among MODY proteins. Protein-protein network design shows that HNF1A, HNF1B, HNF4A, NEUROD1, PDX1, PAX4, INS, and GCK are strongly connected, and the co-expression analyses between MODY genes also show distinct association between HNF1A and HNF4A genes. This study has used latest tools of bioinformatics to develop a rapid method to assess the evolutionary relationship, the network development, and the associations among eleven MODY genes and cascades. The prediction of sequence conservation, molecular phylogenetics, protein-protein network and the association between the MODY cascades enhances opportunities to get more insights into the less-known MODY disease.

  2. Phylogenetically resolving epidemiologic linkage

    PubMed Central

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-01-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  3. Assessment of fecal pollution sources in a small northern-plains watershed using PCR and phylogenetic analyses of Bacteroidetes 16S rRNA gene

    USGS Publications Warehouse

    Lamendella, R.; Domingo, J.W.S.; Oerther, D.B.; Vogel, J.R.; Stoeckel, D.M.

    2007-01-01

    We evaluated the efficacy, sensitivity, host-specificity, and spatial/temporal dynamics of human- and ruminant-specific 16S rRNA gene Bacteroidetes markers used to assess the sources of fecal pollution in a fecally impacted watershed. Phylogenetic analyses of 1271 fecal and environmental 16S rRNA gene clones were also performed to study the diversity of Bacteroidetes in this watershed. The host-specific assays indicated that ruminant feces were present in 28-54% of the water samples and in all sampling seasons, with increasing frequency in downstream sites. The human-targeted assays indicated that only 3-5% of the water samples were positive for human fecal signals, although a higher percentage of human-associated signals (19-24%) were detected in sediment samples. Phylogenetic analysis indicated that 57% of all water clones clustered with yet-to-be-cultured Bacteroidetes species associated with sequences obtained from ruminant feces, further supporting the prevalence of ruminant contamination in this watershed. However, since several clusters contained sequences from multiple sources, future studies need to consider the potential cosmopolitan nature of these bacterial populations when assessing fecal pollution sources using Bacteroidetes markers. Moreover, additional data is needed in order to understand the distribution of Bacteroidetes host-specific markers and their relationship to water quality regulatory standards. ?? 2006 Federation of European Microbiological Societies.

  4. SigTree: A Microbial Community Analysis Tool to Identify and Visualize Significantly Responsive Branches in a Phylogenetic Tree.

    PubMed

    Stevens, John R; Jones, Todd R; Lefevre, Michael; Ganesan, Balasubramanian; Weimer, Bart C

    2017-01-01

    Microbial community analysis experiments to assess the effect of a treatment intervention (or environmental change) on the relative abundance levels of multiple related microbial species (or operational taxonomic units) simultaneously using high throughput genomics are becoming increasingly common. Within the framework of the evolutionary phylogeny of all species considered in the experiment, this translates to a statistical need to identify the phylogenetic branches that exhibit a significant consensus response (in terms of operational taxonomic unit abundance) to the intervention. We present the R software package SigTree , a collection of flexible tools that make use of meta-analysis methods and regular expressions to identify and visualize significantly responsive branches in a phylogenetic tree, while appropriately adjusting for multiple comparisons.

  5. Source environment feature related phylogenetic distribution pattern of anoxygenic photosynthetic bacteria as revealed by pufM analysis.

    PubMed

    Zeng, Yonghui; Jiao, Nianzhi

    2007-06-01

    Anoxygenic photosynthesis, performed primarily by anoxygenic photosynthetic bacteria (APB), has been supposed to arise on Earth more than 3 billion years ago. The long established APB are distributed in almost every corner where light can reach. However, the relationship between APB phylogeny and source environments has been largely unexplored. Here we retrieved the pufM sequences and related source information of 89 pufM containing species from the public database. Phylogenetic analysis revealed that horizontal gene transfer (HGT) most likely occurred within 11 out of a total 21 pufM subgroups, not only among species within the same class but also among species of different phyla or subphyla. A clear source environment feature related phylogenetic distribution pattern was observed, with all species from oxic habitats and those from anoxic habitats clustering into independent subgroups, respectively. HGT among ancient APB and subsequent long term evolution and adaptation to separated niches may have contributed to the coupling of environment and pufM phylogeny.

  6. ProtPhylo: identification of protein-phenotype and protein-protein functional associations via phylogenetic profiling.

    PubMed

    Cheng, Yiming; Perocchi, Fabiana

    2015-07-01

    ProtPhylo is a web-based tool to identify proteins that are functionally linked to either a phenotype or a protein of interest based on co-evolution. ProtPhylo infers functional associations by comparing protein phylogenetic profiles (co-occurrence patterns of orthology relationships) for more than 9.7 million non-redundant protein sequences from all three domains of life. Users can query any of 2048 fully sequenced organisms, including 1678 bacteria, 255 eukaryotes and 115 archaea. In addition, they can tailor ProtPhylo to a particular kind of biological question by choosing among four main orthology inference methods based either on pair-wise sequence comparisons (One-way Best Hits and Best Reciprocal Hits) or clustering of orthologous proteins across multiple species (OrthoMCL and eggNOG). Next, ProtPhylo ranks phylogenetic neighbors of query proteins or phenotypic properties using the Hamming distance as a measure of similarity between pairs of phylogenetic profiles. Candidate hits can be easily and flexibly prioritized by complementary clues on subcellular localization, known protein-protein interactions, membrane spanning regions and protein domains. The resulting protein list can be quickly exported into a csv text file for further analyses. ProtPhylo is freely available at http://www.protphylo.org. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Phylogenetic and microsatellite markers for Tulasnella (Tulasnellaceae) mycorrhizal fungi associated with Australian orchids1

    PubMed Central

    Ruibal, Monica P.; Peakall, Rod; Smith, Leon M.; Linde, Celeste C.

    2013-01-01

    • Premise of the study: Phylogenetic and microsatellite markers were developed for Tulasnella mycorrhizal fungi to investigate fungal species identity and diversity. These markers will be useful in future studies investigating the phylogenetic relationship of the fungal symbionts, specificity of orchid–mycorrhizal associations, and the role of mycorrhizae in orchid speciation within several orchid genera. • Methods and Results: We generated partial genome sequences of two Tulasnella symbionts originating from Chiloglottis and Drakaea orchid species with 454 genome sequencing. Cross-genus transferability across mycorrhizal symbionts associated with multiple genera of Australian orchids (Arthrochilus, Chiloglottis, Drakaea, and Paracaleana) was found for seven phylogenetic loci. Five loci showed cross-transferability to Tulasnella from other orchid genera, and two to Sebacina. Furthermore, 11 polymorphic microsatellite loci were developed for Tulasnella from Chiloglottis. • Conclusions: Highly informative markers were obtained, allowing investigation of mycorrhizal diversity of Tulasnellaceae associated with a wide variety of terrestrial orchids in Australia and potentially worldwide. PMID:25202528

  8. Phylogenetically resolving epidemiologic linkage

    DOE PAGES

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-02-22

    The use of phylogenetic trees in epidemiological investigations has become commonplace, but their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the truemore » transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. Moreover, we confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results.« less

  9. EvoDB: a database of evolutionary rate profiles, associated protein domains and phylogenetic trees for PFAM-A

    PubMed Central

    Ndhlovu, Andrew; Durand, Pierre M.; Hazelhurst, Scott

    2015-01-01

    The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. Database URL: http://www.bioinf.wits.ac.za/software/fire/evodb PMID:26140928

  10. EvoDB: a database of evolutionary rate profiles, associated protein domains and phylogenetic trees for PFAM-A.

    PubMed

    Ndhlovu, Andrew; Durand, Pierre M; Hazelhurst, Scott

    2015-01-01

    The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. © The Author(s) 2015. Published by Oxford University Press.

  11. Methamphetamine injecting is associated with phylogenetic clustering of hepatitis C virus infection among street-involved youth in Vancouver, Canada*

    PubMed Central

    Cunningham, Evan; Jacka, Brendan; DeBeck, Kora; Applegate, Tanya A; Harrigan, P. Richard; Krajden, Mel; Marshall, Brandon DL; Montaner, Julio; Lima, Viviane Dias; Olmstead, Andrea; Milloy, M-J; Wood, Evan; Grebely, Jason

    2015-01-01

    Background Among prospective cohorts of people who inject drugs (PWID), phylogenetic clustering of HCV infection has been observed. However, the majority of studies have included older PWID, representing distant transmission events. The aim of this study was to investigate phylogenetic clustering of HCV infection among a cohort of street-involved youth. Methods Data were derived from a prospective cohort of street-involved youth aged 14–26 recruited between 2005 and 2012 in Vancouver, Canada (At Risk Youth Study, ARYS). HCV RNA testing and sequencing (Core-E2) were performed on HCV positive participants. Phylogenetic trees were inferred using maximum likelihood methods and clusters were identified using ClusterPicker (Core-E2 without HVR1, 90% bootstrap threshold, 0.05 genetic distance threshold). Results Among 945 individuals enrolled in ARYS, 16% (n=149, 100% recent injectors) were HCV antibody positive at baseline interview (n=86) or seroconverted during follow-up (n=63). Among HCV antibody positive participants with available samples (n=131), 75% (n=98) had detectable HCV RNA and 66% (n=65, mean age 23, 58% with recent methamphetamine injection, 31% female, 3% HIV+) had available Core-E2 sequences. Of those with Core-E2 sequence, 14% (n=9) were in a cluster (one cluster of three) or pair (two pairs), with all reporting recent methamphetamine injection. Recent methamphetamine injection was associated with membership in a cluster or pair (P=0.009). Conclusion In this study of street-involved youth with HCV infection and recent injecting, 14% demonstrated phylogenetic clustering. Phylogenetic clustering was associated with recent methamphetamine injection, suggesting that methamphetamine drug injection may play an important role in networks of HCV transmission. PMID:25977204

  12. The phylogenetic roots of human lethal violence.

    PubMed

    Gómez, José María; Verdú, Miguel; González-Megías, Adela; Méndez, Marcos

    2016-10-13

    The psychological, sociological and evolutionary roots of conspecific violence in humans are still debated, despite attracting the attention of intellectuals for over two millennia. Here we propose a conceptual approach towards understanding these roots based on the assumption that aggression in mammals, including humans, has a significant phylogenetic component. By compiling sources of mortality from a comprehensive sample of mammals, we assessed the percentage of deaths due to conspecifics and, using phylogenetic comparative tools, predicted this value for humans. The proportion of human deaths phylogenetically predicted to be caused by interpersonal violence stood at 2%. This value was similar to the one phylogenetically inferred for the evolutionary ancestor of primates and apes, indicating that a certain level of lethal violence arises owing to our position within the phylogeny of mammals. It was also similar to the percentage seen in prehistoric bands and tribes, indicating that we were as lethally violent then as common mammalian evolutionary history would predict. However, the level of lethal violence has changed through human history and can be associated with changes in the socio-political organization of human populations. Our study provides a detailed phylogenetic and historical context against which to compare levels of lethal violence observed throughout our history.

  13. [Phylogenetic diversity of microorganisms associated with the deep-water sponge Baikalospongia intermedia].

    PubMed

    Kalyzhnaya, O V; Itskovich, V B

    2014-07-01

    The diversity of bacteria associated with deep-water sponge Baikalospongia intermedia was evaluated by sequence analysis of 16S rRNA genes from two sponge samples collected in Lake Baikal from depths of 550 and 1204 m. A total of 64 operational taxonomic units, belonging to nine bacterial phyla, Proteobacteria (classes Alphaproteobacteria,. Betaproteobacteria, Gammaproteobacteria, and Deltaproteobacteria), Actinobacteria, Planctomycetes, Cloroflexi, Verrucomicrobia, Acidobacteria, Chlorobi, and Nitrospirae, including candidate phylum WS5, were identified. Phylogenetic analysis showed that the examined communities contained phylotypes exhibiting homology to uncultured bacteria from different lake ecosystems, freshwater sediments, soil and geological formations. Moreover, a number of phylotypes were relative to psychrophilic, methane-oxidizing, sulfate-reducing bacteria, and to microorganisms resistant to the influence of heavy metals. It seems likely that the unusual habitation conditions of deep-water sponges contribute to the taxonomic diversity of associated bacteria and have an influence on the presence of functionally important microorganisms in bacterial communities.

  14. PyRAD: assembly of de novo RADseq loci for phylogenetic analyses.

    PubMed

    Eaton, Deren A R

    2014-07-01

    Restriction-site-associated genomic markers are a powerful tool for investigating evolutionary questions at the population level, but are limited in their utility at deeper phylogenetic scales where fewer orthologous loci are typically recovered across disparate taxa. While this limitation stems in part from mutations to restriction recognition sites that disrupt data generation, an additional source of data loss comes from the failure to identify homology during bioinformatic analyses. Clustering methods that allow for lower similarity thresholds and the inclusion of indel variation will perform better at assembling RADseq loci at the phylogenetic scale. PyRAD is a pipeline to assemble de novo RADseq loci with the aim of optimizing coverage across phylogenetic datasets. It uses a wrapper around an alignment-clustering algorithm, which allows for indel variation within and between samples, as well as for incomplete overlap among reads (e.g. paired-end). Here I compare PyRAD with the program Stacks in their performance analyzing a simulated RADseq dataset that includes indel variation. Indels disrupt clustering of homologous loci in Stacks but not in PyRAD, such that the latter recovers more shared loci across disparate taxa. I show through reanalysis of an empirical RADseq dataset that indels are a common feature of such data, even at shallow phylogenetic scales. PyRAD uses parallel processing as well as an optional hierarchical clustering method, which allows it to rapidly assemble phylogenetic datasets with hundreds of sampled individuals. Software is written in Python and freely available at http://www.dereneaton.com/software/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  15. Phylogenetic analysis of rubella viruses identified in Uganda, 2003-2012.

    PubMed

    Namuwulya, Prossy; Abernathy, Emily; Bukenya, Henry; Bwogi, Josephine; Tushabe, Phionah; Birungi, Molly; Seguya, Ronald; Kabaliisa, Theopista; Alibu, Vincent P; Kayondo, Jonathan K; Rivailler, Pierre; Icenogle, Joseph; Bakamutumaho, Barnabas

    2014-12-01

    Molecular data on rubella viruses are limited in Uganda despite the importance of congenital rubella syndrome (CRS). Routine rubella vaccination, while not administered currently in Uganda, is expected to begin by 2015. The World Health Organization recommends that countries without rubella vaccination programs assess the burden of rubella and CRS before starting a routine vaccination program. Uganda is already involved in integrated case-based surveillance, including laboratory testing to confirm measles and rubella, but molecular epidemiologic aspects of rubella circulation have so far not been documented in Uganda. Twenty throat swab or oral fluid samples collected from 12 districts during routine rash and fever surveillance between 2003 and 2012 were identified as rubella virus RNA positive and PCR products encompassing the region used for genotyping were sequenced. Phylogenetic analysis of the 20 sequences identified 19 genotype 1G viruses and 1 genotype 1E virus. Genotype-specific trees showed that the Uganda viruses belonged to specific clusters for both genotypes 1G and 1E and grouped with similar sequences from neighboring countries. Genotype 1G was predominant in Uganda. More epidemiological and molecular epidemiological data are required to determine if genotype 1E is also endemic in Uganda. The information obtained in this study will assist the immunization program in monitoring changes in circulating genotypes. © 2014 Wiley Periodicals, Inc.

  16. Phylogenetic constrains on mycorrhizal specificity in eight Dendrobium (Orchidaceae) species.

    PubMed

    Xing, Xiaoke; Ma, Xueting; Men, Jinxin; Chen, Yanhong; Guo, Shunxing

    2017-05-01

    Plant phylogeny constrains orchid mycorrhizal (OrM) fungal community composition in some orchids. Here, we investigated the structures of the OrM fungal communities of eight Dendrobium species in one niche to determine whether similarities in the OrM fungal communities correlated with the phylogeny of the host plants and whether the Dendrobium-OrM fungal interactions are phylogenetically conserved. A phylogeny based on DNA data was constructed for the eight coexisting Dendrobium species, and the OrM fungal communities were characterized by their roots. There were 31 different fungal lineages associated with the eight Dendrobium species. In total, 82.98% of the identified associations belonging to Tulasnellaceae, and a smaller proportion involved members of the unknown Basidiomycota (9.67%). Community analyses revealed that phylogenetically related Dendrobium tended to interact with a similar set of Tulasnellaceae fungi. The interactions between Dendrobium and Tulasnellaceae fungi were significantly influenced by the phylogenetic relationships among the Dendrobium species. Our results provide evidence that the mycorrhizal specificity in the eight coexisting Dendrobium species was phylogenetically conserved.

  17. Bayesian models for comparative analysis integrating phylogenetic uncertainty.

    PubMed

    de Villemereuil, Pierre; Wells, Jessie A; Edwards, Robert D; Blomberg, Simon P

    2012-06-28

    Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses

  18. Bayesian models for comparative analysis integrating phylogenetic uncertainty

    PubMed Central

    2012-01-01

    Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for

  19. Dimensional Reduction for the General Markov Model on Phylogenetic Trees.

    PubMed

    Sumner, Jeremy G

    2017-03-01

    We present a method of dimensional reduction for the general Markov model of sequence evolution on a phylogenetic tree. We show that taking certain linear combinations of the associated random variables (site pattern counts) reduces the dimensionality of the model from exponential in the number of extant taxa, to quadratic in the number of taxa, while retaining the ability to statistically identify phylogenetic divergence events. A key feature is the identification of an invariant subspace which depends only bilinearly on the model parameters, in contrast to the usual multi-linear dependence in the full space. We discuss potential applications including the computation of split (edge) weights on phylogenetic trees from observed sequence data.

  20. Ecosystem productivity is associated with bacterial phylogenetic distance in surface marine waters.

    PubMed

    Galand, Pierre E; Salter, Ian; Kalenitchenko, Dimitri

    2015-12-01

    Understanding the link between community diversity and ecosystem function is a fundamental aspect of ecology. Systematic losses in biodiversity are widely acknowledged but the impact this may exert on ecosystem functioning remains ambiguous. There is growing evidence of a positive relationship between species richness and ecosystem productivity for terrestrial macro-organisms, but similar links for marine micro-organisms, which help drive global climate, are unclear. Community manipulation experiments show both positive and negative relationships for microbes. These previous studies rely, however, on artificial communities and any links between the full diversity of active bacterial communities in the environment, their phylogenetic relatedness and ecosystem function remain hitherto unexplored. Here, we test the hypothesis that productivity is associated with diversity in the metabolically active fraction of microbial communities. We show in natural assemblages of active bacteria that communities containing more distantly related members were associated with higher bacterial production. The positive phylogenetic diversity-productivity relationship was independent of community diversity calculated as the Shannon index. From our long-term (7-year) survey of surface marine bacterial communities, we also found that similarly, productive communities had greater phylogenetic similarity to each other, further suggesting that the traits of active bacteria are an important predictor of ecosystem productivity. Our findings demonstrate that the evolutionary history of the active fraction of a microbial community is critical for understanding their role in ecosystem functioning. © 2015 John Wiley & Sons Ltd.

  1. Deconstructing the relationships between phylogenetic diversity and ecology: a case study on ecosystem functioning.

    PubMed

    Davies, T Jonathan; Urban, Mark C; Rayfield, Bronwyn; Cadotte, Marc W; Peres-Neto, Pedro R

    2016-09-01

    Recent studies have supported a link between phylogenetic diversity and various ecological properties including ecosystem function. However, such studies typically assume that phylogenetic branches of equivalent length are more or less interchangeable. Here we suggest that there is a need to consider not only branch lengths but also their placement on the phylogeny. We demonstrate how two common indices of network centrality can be used to describe the evolutionary distinctiveness of network elements (nodes and branches) on a phylogeny. If phylogenetic diversity enhances ecosystem function via complementarity and the representation of functional diversity, we would predict a correlation between evolutionary distinctiveness of network elements and their contribution to ecosystem process. In contrast, if one or a few evolutionary innovations play key roles in ecosystem function, the relationship between evolutionary distinctiveness and functional contribution may be weak or absent. We illustrate how network elements associated with high functional contribution can be identified from regressions between phylogenetic diversity and productivity using a well-known empirical data set on plant productivity from the Cedar Creek Long-Term Ecological Research. We find no association between evolutionary distinctiveness and ecosystem functioning, but we are able to identify phylogenetic elements associated with species of known high functional contribution within the Fabaceae. Our perspective provides a useful guide in the search for ecological traits linking diversity and ecosystem function, and suggests a more nuanced consideration of phylogenetic diversity is required in the conservation and biodiversity-ecosystem-function literature. © 2016 by the Ecological Society of America.

  2. Suprafamilial relationships among Rodentia and the phylogenetic effect of removing fast-evolving nucleotides in mitochondrial, exon and intron fragments.

    PubMed

    Montgelard, Claudine; Forty, Ellen; Arnal, Véronique; Matthee, Conrad A

    2008-11-26

    The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using approximately 7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics. Taxonomic sampling included multiple representatives of all five rodent suborders described. Fast-evolving positions for each dataset were identified individually using a discrete gamma rate category and sites belonging to the most rapidly evolving eighth gamma category were removed. Phylogenetic tree reconstructions were performed on individual and combined datasets using Parsimony, Bayesian, and partitioned Maximum Likelihood criteria. Removal of fast-evolving positions enhanced the phylogenetic signal to noise ratio but the improvement in resolution was not consistent across different data types. The results suggested that elimination of fastest sites only improved the support for nodes moderately affected by homoplasy (the deepest nodes for introns and more recent nodes for exons and mitochondrial genes). The present study based on eight DNA fragments supports a fully resolved higher level rodent phylogeny with moderate to significant nodal support. Two inter-suprafamilial associations emerged. The first comprised a monophyletic assemblage containing the Anomaluromorpha (Anomaluridae + Pedetidae) + Myomorpha (Muridae + Dipodidae) as sister clade to the Castorimorpha (Castoridae + Geomyoidea

  3. Suprafamilial relationships among Rodentia and the phylogenetic effect of removing fast-evolving nucleotides in mitochondrial, exon and intron fragments

    PubMed Central

    2008-01-01

    Background The number of rodent clades identified above the family level is contentious, and to date, no consensus has been reached on the basal evolutionary relationships among all rodent families. Rodent suprafamilial phylogenetic relationships are investigated in the present study using ~7600 nucleotide characters derived from two mitochondrial genes (Cytochrome b and 12S rRNA), two nuclear exons (IRBP and vWF) and four nuclear introns (MGF, PRKC, SPTBN, THY). Because increasing the number of nucleotides does not necessarily increase phylogenetic signal (especially if the data is saturated), we assess the potential impact of saturation for each dataset by removing the fastest-evolving positions that have been recognized as sources of inconsistencies in phylogenetics. Results Taxonomic sampling included multiple representatives of all five rodent suborders described. Fast-evolving positions for each dataset were identified individually using a discrete gamma rate category and sites belonging to the most rapidly evolving eighth gamma category were removed. Phylogenetic tree reconstructions were performed on individual and combined datasets using Parsimony, Bayesian, and partitioned Maximum Likelihood criteria. Removal of fast-evolving positions enhanced the phylogenetic signal to noise ratio but the improvement in resolution was not consistent across different data types. The results suggested that elimination of fastest sites only improved the support for nodes moderately affected by homoplasy (the deepest nodes for introns and more recent nodes for exons and mitochondrial genes). Conclusion The present study based on eight DNA fragments supports a fully resolved higher level rodent phylogeny with moderate to significant nodal support. Two inter-suprafamilial associations emerged. The first comprised a monophyletic assemblage containing the Anomaluromorpha (Anomaluridae + Pedetidae) + Myomorpha (Muridae + Dipodidae) as sister clade to the Castorimorpha

  4. Phylogenetic Diversity of Bacteria Associated with the Marine Sponge Rhopaloeides odorabile†

    PubMed Central

    Webster, Nicole S.; Wilson, Kate J.; Blackall, Linda L.; Hill, Russell T.

    2001-01-01

    Molecular techniques were employed to document the microbial diversity associated with the marine sponge Rhopaloeides odorabile. The phylogenetic affiliation of sponge-associated bacteria was assessed by 16S rRNA sequencing of cloned DNA fragments. Fluorescence in situ hybridization (FISH) was used to confirm the presence of the predominant groups indicated by 16S rDNA analysis. The community structure was extremely diverse with representatives of the Actinobacteria, low-G+C gram-positive bacteria, the β- and γ-subdivisions of the Proteobacteria, Cytophaga/Flavobacterium, green sulfur bacteria, green nonsulfur bacteria, planctomycetes, and other sequence types with no known close relatives. FISH probes revealed the spatial location of these bacteria within the sponge tissue, in some cases suggesting possible symbiotic functions. The high proportion of 16S rRNA sequences derived from novel actinomycetes is good evidence for the presence of an indigenous marine actinomycete assemblage in R. odorabile. High microbial diversity was inferred from low duplication of clones in a library with 70 representatives. Determining the phylogenetic affiliation of sponge-associated microorganisms by 16S rRNA analysis facilitated the rational selection of culture media and isolation conditions to target specific groups of well-represented bacteria for laboratory culture. Novel media incorporating sponge extracts were used to isolate bacteria not previously recovered from this sponge. PMID:11133476

  5. Phylogenetic Analysis of Local-Scale Tree Soil Associations in a Lowland Moist Tropical Forest

    PubMed Central

    Schreeg, Laura A.; Kress, W. John; Erickson, David L.; Swenson, Nathan G.

    2010-01-01

    Background Local plant-soil associations are commonly studied at the species-level, while associations at the level of nodes within a phylogeny have been less well explored. Understanding associations within a phylogenetic context, however, can improve our ability to make predictions across systems and can advance our understanding of the role of evolutionary history in structuring communities. Methodology/Principal Findings Here we quantified evolutionary signal in plant-soil associations using a DNA sequence-based community phylogeny and several soil variables (e.g., extractable phosphorus, aluminum and manganese, pH, and slope as a proxy for soil water). We used published plant distributional data from the 50-ha plot on Barro Colorado Island (BCI), Republic of Panamá. Our results suggest some groups of closely related species do share similar soil associations. Most notably, the node shared by Myrtaceae and Vochysiaceae was associated with high levels of aluminum, a potentially toxic element. The node shared by Apocynaceae was associated with high extractable phosphorus, a nutrient that could be limiting on a taxon specific level. The node shared by the large group of Laurales and Magnoliales was associated with both low extractable phosphorus and with steeper slope. Despite significant node-specific associations, this study detected little to no phylogeny-wide signal. We consider the majority of the ‘traits’ (i.e., soil variables) evaluated to fall within the category of ecological traits. We suggest that, given this category of traits, phylogeny-wide signal might not be expected while node-specific signals can still indicate phylogenetic structure with respect to the variable of interest. Conclusions Within the BCI forest dynamics plot, distributions of some plant taxa are associated with local-scale differences in soil variables when evaluated at individual nodes within the phylogenetic tree, but they are not detectable by phylogeny-wide signal. Trends

  6. Phylogenetic Diversity of Vibrio cholerae Associated with Endemic Cholera in Mexico from 1991 to 2008.

    PubMed

    Choi, Seon Young; Rashed, Shah M; Hasan, Nur A; Alam, Munirul; Islam, Tarequl; Sadique, Abdus; Johura, Fatema-Tuz; Eppinger, Mark; Ravel, Jacques; Huq, Anwar; Cravioto, Alejandro; Colwell, Rita R

    2016-03-15

    in Mexico prior to the 1990s, genetically diverse V. cholerae O1 strains were isolated between 1991 and 2008. Despite the lack of strong evidence, the notion that cholera was transmitted from Africa to Latin America has been proposed in the literature. In this study, we have applied whole-genome sequence analysis to a set of 124 V. cholerae strains, including six Mexican isolates, to determine their phylogenetic relationships. Phylogenetic analysis indicated the six V. cholerae O1 isolates belong to five phylogenetic clades: i.e., basal, nontoxigenic, classical, El Tor, and hybrid El Tor. Thus, the results of phylogenetic analysis, coupled with CTXϕ array and antibiotic susceptibility, do not support single-source transmission of cholera to Mexico from African countries. The association of indigenous populations of V. cholerae that has been observed in this study suggests it plays a significant role in the dynamics of cholera in Mexico. Copyright © 2016 Choi et al.

  7. Identifying metabolic enzymes with multiple types of association evidence

    PubMed Central

    Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M

    2006-01-01

    Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130

  8. Understanding phylogenetic incongruence: lessons from phyllostomid bats

    PubMed Central

    Dávalos, Liliana M; Cirranello, Andrea L; Geisler, Jonathan H; Simmons, Nancy B

    2012-01-01

    All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive

  9. Leveraging contemporary species introductions to test phylogenetic hypotheses of trait evolution.

    PubMed

    Lu-Irving, Patricia; Marx, Hannah E; Dlugosch, Katrina M

    2018-05-10

    Plant trait evolution is a topic of interest across disciplines and scales. Phylogenetic studies are powerful for generating hypotheses about the mechanisms that have shaped plant traits and their evolution. Introduced plants are a rich source of data on contemporary trait evolution. Introductions could provide especially useful tests of a variety of evolutionary hypotheses because the environments selecting on evolving traits are still present. We review phylogenetic and contemporary studies of trait evolution and identify areas of overlap and areas for further integration. Emerging tools which can promote integration include broadly focused repositories of trait data, and comparative models of trait evolution that consider both intra and interspecific variation. Copyright © 2018 Elsevier Ltd. All rights reserved.

  10. On the phylogenetic placement of human T cell leukemia virus type 1 sequences associated with an Andean mummy.

    PubMed

    Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A

    2006-03-01

    Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.

  11. Phylogenetic Analysis of Rubella Viruses Identified in Uganda, 2003–2012

    PubMed Central

    Namuwulya, Prossy; Abernathy, Emily; Bukenya, Henry; Bwogi, Josephine; Tushabe, Phionah; Birungi, Molly; Seguya, Ronald; Kabaliisa, Theopista; Alibu, Vincent P.; Kayondo, Jonathan K.; Rivailler, Pierre; Icenogle, Joseph; Bakamutumaho, Barnabas

    2014-01-01

    Molecular data on rubella viruses are limited in Uganda despite the importance of congenital rubella syndrome (CRS). Routine rubella vaccination, while not administered currently in Uganda, is expected to begin by 2015. The World Health Organization recommends that countries without rubella vaccination programs assess the burden of rubella and CRS before starting a routine vaccination program. Uganda is already involved in integrated case-based surveillance, including laboratory testing to confirm measles and rubella, but molecular epidemiologic aspects of rubella circulation have so far not been documented in Uganda. Twenty throat swab or oral fluid samples collected from 12 districts during routine rash and fever surveillance between 2003 and 2012 were identified as rubella virus RNA positive and PCR products encompassing the region used for genotyping were sequenced. Phylogenetic analysis of the 20 sequences identified 19 genotype 1G viruses and 1 genotype 1E virus. Genotype-specific trees showed that the Uganda viruses belonged to specific clusters for both genotypes 1G and 1E and grouped with similar sequences from neighboring countries. Genotype 1G was predominant in Uganda. More epidemiological and molecular epidemiological data are required to determine if genotype 1E is also endemic in Uganda. The information obtained in this study will assist the immunization program in monitoring changes in circulating genotypes. PMID:24700073

  12. Open Reading Frame Phylogenetic Analysis on the Cloud

    PubMed Central

    2013-01-01

    Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843

  13. [Characterization of Escherichia coli isolates derived from phylogenetic groups A and B1 causing extraintestinal infection].

    PubMed

    Moreno, Eva; Prats, Guillem; Planells, Irene; Planes, Ana M; Pérez, Teresa; Andreu, Antonia

    2006-10-01

    Escherichia coli isolates from the non-pathogenic phylogenetic groups A and B1 rarely cause extraintestinal infections. The aim of this study was to analyze 37 E. coli isolates pertaining to phylogenetic groups A and B1 and compare them with 37 E. coli isolates from group B2 and 31 from group D, which caused the same infections. Among 105 E. coli isolated from the urine of patients with cystitis and pyelonephritis and from the blood of patients with urinary-source and other-source bacteriemia, the E. coli phylogenetic groups, 15 virulence-associated genes, 7 O-antigens and fluoroquinolone resistance were analyzed. E. coli from groups A and B1 showed fewer virulence determinants (median 3.5) than E. coli from group B2 (8.6, P < 0.01) or D (5.3, P < .001); however, a subgroup containing 3 isolates from group A and 5 from B1 harbored 5 or more factors. E. coli from groups A/B1 were associated with resistance to fluoroquinolones (74%, P < .001), whereas E. coli from group B2 were associated with susceptibility to this antibiotic (76%, P = .003). E. coli from groups A/B1 were isolated significantly more frequently in patients with pyelonephritis or sepsis and local or general factors favoring infection, association not observed in patients with cystitis. Even though most of the E. coli isolates from phylogenetic groups A and B1 presented a low virulence potential, they were able to cause extraintestinal infections, particularly in compromised patients.

  14. Joint source based morphometry identifies linked gray and white matter group differences.

    PubMed

    Xu, Lai; Pearlson, Godfrey; Calhoun, Vince D

    2009-02-01

    We present a multivariate approach called joint source based morphometry (jSBM), to identify linked gray and white matter regions which differ between groups. In jSBM, joint independent component analysis (jICA) is used to decompose preprocessed gray and white matter images into joint sources and statistical analysis is used to determine the significant joint sources showing group differences and their relationship to other variables of interest (e.g. age or sex). The identified joint sources are groupings of linked gray and white matter regions with common covariation among subjects. In this study, we first provide a simulation to validate the jSBM approach. To illustrate our method on real data, jSBM is then applied to structural magnetic resonance imaging (sMRI) data obtained from 120 chronic schizophrenia patients and 120 healthy controls to identify group differences. JSBM identified four joint sources as significantly associated with schizophrenia. Linked gray-white matter regions identified in each of the joint sources included: 1) temporal--corpus callosum, 2) occipital/frontal--inferior fronto-occipital fasciculus, 3) frontal/parietal/occipital/temporal--superior longitudinal fasciculus and 4) parietal/frontal--thalamus. Age effects on all four joint sources were significant, but sex effects were significant only for the third joint source. Our findings demonstrate that jSBM can exploit the natural linkage between gray and white matter by incorporating them into a unified framework. This approach is applicable to a wide variety of problems to study linked gray and white matter group differences.

  15. Joint source based morphometry identifies linked gray and white matter group differences

    PubMed Central

    Xu, Lai; Pearlson, Godfrey; Calhoun, Vince D.

    2009-01-01

    We present a multivariate approach called joint source based morphometry (jSBM), to identify linked gray and white matter regions which differ between groups. In jSBM, joint independent component analysis (jICA) is used to decompose preprocessed gray and white matter images into joint sources and statistical analysis is used to determine the significant joint sources showing group differences and their relationship to other variables of interest (e.g. age or sex). The identified joint sources are groupings of linked gray and white matter regions with common covariation among subjects. In this study, we first provide a simulation to validate the jSBM approach. To illustrate our method on real data, jSBM is then applied to structural magnetic resonance imaging (sMRI) data obtained from 120 chronic schizophrenia patients and 120 healthy controls to identify group differences. JSBM identified four joint sources as significantly associated with schizophrenia. Linked gray–white matter regions identified in each of the joint sources included: 1) temporal — corpus callosum, 2) occipital/frontal — inferior fronto-occipital fasciculus, 3) frontal/parietal/occipital/temporal —superior longitudinal fasciculus and 4) parietal/frontal — thalamus. Age effects on all four joint sources were significant, but sex effects were significant only for the third joint source. Our findings demonstrate that jSBM can exploit the natural linkage between gray and white matter by incorporating them into a unified framework. This approach is applicable to a wide variety of problems to study linked gray and white matter group differences. PMID:18992825

  16. Phylogenetic factorization of compositional data yields lineage-level associations in microbiome datasets.

    PubMed

    Washburne, Alex D; Silverman, Justin D; Leff, Jonathan W; Bennett, Dominic J; Darcy, John L; Mukherjee, Sayan; Fierer, Noah; David, Lawrence A

    2017-01-01

    Marker gene sequencing of microbial communities has generated big datasets of microbial relative abundances varying across environmental conditions, sample sites and treatments. These data often come with putative phylogenies, providing unique opportunities to investigate how shared evolutionary history affects microbial abundance patterns. Here, we present a method to identify the phylogenetic factors driving patterns in microbial community composition. We use the method, "phylofactorization," to re-analyze datasets from the human body and soil microbial communities, demonstrating how phylofactorization is a dimensionality-reducing tool, an ordination-visualization tool, and an inferential tool for identifying edges in the phylogeny along which putative functional ecological traits may have arisen.

  17. Development of phylogenetic markers for Sebacina (Sebacinaceae) mycorrhizal fungi associated with Australian orchids.

    PubMed

    Ruibal, Monica P; Peakall, Rod; Foret, Sylvain; Linde, Celeste C

    2014-06-01

    To investigate fungal species identity and diversity in mycorrhizal fungi of order Sebacinales, we developed phylogenetic markers. These new markers will enable future studies investigating species delineation and phylogenetic relationships of the fungal symbionts and facilitate investigations into evolutionary interactions among Sebacina species and their orchid hosts. • We generated partial genome sequences for a Sebacina symbiont originating from Caladenia huegelii with 454 genome sequencing and from three symbionts from Eriochilus dilatatus and one from E. pulchellus using Illumina sequencing. Six nuclear and two mitochondrial loci showed high variability (10-31% parsimony informative sites) for Sebacinales mycorrhizal fungi across four genera of Australian orchids (Caladenia, Eriochilus, Elythranthera, and Glossodia). • We obtained highly informative DNA markers that will allow investigation of mycorrhizal diversity of Sebacinaceae fungi associated with terrestrial orchids in Australia and worldwide.

  18. The Diaporthe sojae species complex: Phylogenetic re-assessment of pathogens associated with soybean, cucurbits and other field crops.

    PubMed

    Udayanga, Dhanushka; Castlebury, Lisa A; Rossman, Amy Y; Chukeatirote, Ekachai; Hyde, Kevin D

    2015-05-01

    Phytopathogenic species of Diaporthe are associated with a number of soybean diseases including seed decay, pod and stem blight and stem canker and lead to considerable crop production losses worldwide. Accurate morphological identification of the species that cause these diseases has been difficult. In this study, we determined the phylogenetic relationships and species boundaries of Diaporthe longicolla, Diaporthe phaseolorum, Diaporthe sojae and closely related taxa. Species boundaries for this complex were determined based on combined phylogenetic analysis of five gene regions: partial sequences of calmodulin (CAL), beta-tubulin (TUB), histone-3 (HIS), translation elongation factor 1-α (EF1-α), and the nuclear ribosomal internal transcribed spacers (ITS). Phylogenetic analyses revealed that this large complex of taxa is comprised of soybean pathogens as well as species associated with herbaceous field crops and weeds. Diaporthe arctii, Diaporthe batatas, D. phaseolorum and D. sojae are epitypified. The seed decay pathogen D. longicolla was determined to be distinct from D. sojae. D. phaseolorum, originally associated with stem and leaf blight of Lima bean, was not found to be associated with soybean. A new species, Diaporthe ueckerae on Cucumis melo, is introduced with description and illustrations. Published by Elsevier Ltd.

  19. HIV infection and hepatitis C virus genotype 1a are associated with phylogenetic clustering among people with recently acquired hepatitis C virus infection.

    PubMed

    Bartlett, Sofia R; Jacka, Brendan; Bull, Rowena A; Luciani, Fabio; Matthews, Gail V; Lamoury, Francois M J; Hellard, Margaret E; Hajarizadeh, Behzad; Teutsch, Suzy; White, Bethany; Maher, Lisa; Dore, Gregory J; Lloyd, Andrew R; Grebely, Jason; Applegate, Tanya L

    2016-01-01

    The aim of this study was to identify factors associated with phylogenetic clustering among people with recently acquired hepatitis C virus (HCV) infection. Participants with available sample at time of HCV detection were selected from three studies; the Australian Trial in Acute Hepatitis C, the Hepatitis C Incidence and Transmission Study - Prison and Community. HCV RNA was extracted and Core to E2 region of HCV sequenced. Clusters were identified from maximum likelihood trees with 1000 bootstrap replicates using 90% bootstrap and 5% genetic distance threshold. Among 225 participants with available Core-E2 sequence (ATAHC, n=113; HITS-p, n=90; and HITS-c, n=22), HCV genotype prevalence was: G1a: 38% (n=86), G1b: 5% (n=12), G2a: 1% (n=2), G2b: 5% (n=11), G3a: 48% (n=109), G6a: 1% (n=2) and G6l 1% (n=3). Of participants included in phylogenetic trees, 22% of participants were in a pair/cluster (G1a-35%, 30/85, mean maximum genetic distance=0.031; G3a-11%, 12/106, mean maximum genetic distance=0.021; other genotypes-21%, 6/28, mean maximum genetic distance=0.023). Among HCV/HIV co-infected participants, 50% (18/36) were in a pair/cluster, compared to 16% (30/183) with HCV mono-infection (P=<0.001). Factors independently associated with phylogenetic clustering were HIV co-infection [vs. HCV mono-infection; adjusted odds ratio (AOR) 4.24; 95%CI 1.91, 9.39], and HCV G1a infection (vs. other HCV genotypes; AOR 3.33, 95%CI 0.14, 0.61).HCV treatment and prevention strategies, including enhanced antiviral therapy, should be optimised. The impact of targeting of HCV treatment as prevention to populations with higher phylogenetic clustering, such as those with HIV co-infection, could be explored through mathematical modelling. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. SICLE: a high-throughput tool for extracting evolutionary relationships from phylogenetic trees.

    PubMed

    DeBlasio, Dan F; Wisecaver, Jennifer H

    2016-01-01

    We present the phylogeny analysis software SICLE (Sister Clade Extractor), an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteriodetes, identifying 13 unique sister relationships to S. ruber across the 4,589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteriodetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.

  1. Novel Sources of Stripe Rust Resistance Identified by Genome-Wide Association Mapping in Ethiopian Durum Wheat (Triticum turgidum ssp. durum)

    PubMed Central

    Liu, Weizhen; Maccaferri, Marco; Rynearson, Sheri; Letta, Tesfaye; Zegeye, Habtemariam; Tuberosa, Roberto; Chen, Xianming; Pumphrey, Michael

    2017-01-01

    Stripe rust of wheat, caused by Puccinia striiformis f. sp. tritici (Pst), is a global concern for wheat production, and has been increasingly destructive in Ethiopia, as well as in the United States and in many other countries. As Ethiopia has a long history of stripe rust epidemics, its native wheat germplasm harbors potentially valuable resistance loci. Moreover, the Ethiopian germplasm has been historically underutilized in breeding of modern wheat worldwide and thus the resistance alleles from the Ethiopian germplasm represent potentially novel sources. The objective of this study was to identify loci conferring resistance to predominant Pst races in Ethiopia and the United States. Using a high-density 90 K wheat single nucleotide polymorphism array, a genome-wide association analysis (GWAS) was conducted on 182 durum wheat landrace accessions and contemporary varieties originating from Ethiopia. Landraces were detected to be more resistant at the seedling stage while cultivars were more resistant at the adult-plant stages. GWAS identified 68 loci associated with seedling resistance to one or more races. Six loci on chromosome arms 1AS, 1BS, 3AS, 4BL, and 5BL were associated with resistance against at least two races at the seedling stage, and five loci were previously undocumented. GWAS analysis of field resistance reactions identified 12 loci associated with resistance on chromosomes 1A, 1B, 2BS, 3BL, 4AL, 4B and 5AL, which were detected in at least two of six field screening nurseries at the adult-plant stage. Comparison with previously mapped resistance loci indicates that six of the 12 resistance loci are newly documented. This study reports effective sources of resistance to contemporary races in Ethiopia and the United States and reveals that Ethiopian durum wheat landraces are abundant in novel Pst resistance loci that may be transferred into adapted cultivars to provide resistance against Pst. PMID:28553306

  2. Identifiability and Identification of Trace Continuous Pollutant Source

    PubMed Central

    Qu, Hongquan; Liu, Shouwen; Pang, Liping; Hu, Tao

    2014-01-01

    Accidental pollution events often threaten people's health and lives, and a pollutant source is very necessary so that prompt remedial actions can be taken. In this paper, a trace continuous pollutant source identification method is developed to identify a sudden continuous emission pollutant source in an enclosed space. The location probability model is set up firstly, and then the identification method is realized by searching a global optimal objective value of the location probability. In order to discuss the identifiability performance of the presented method, a conception of a synergy degree of velocity fields is presented in order to quantitatively analyze the impact of velocity field on the identification performance. Based on this conception, some simulation cases were conducted. The application conditions of this method are obtained according to the simulation studies. In order to verify the presented method, we designed an experiment and identified an unknown source appearing in the experimental space. The result showed that the method can identify a sudden trace continuous source when the studied situation satisfies the application conditions. PMID:24892041

  3. Identifiability and identification of trace continuous pollutant source.

    PubMed

    Qu, Hongquan; Liu, Shouwen; Pang, Liping; Hu, Tao

    2014-01-01

    Accidental pollution events often threaten people's health and lives, and a pollutant source is very necessary so that prompt remedial actions can be taken. In this paper, a trace continuous pollutant source identification method is developed to identify a sudden continuous emission pollutant source in an enclosed space. The location probability model is set up firstly, and then the identification method is realized by searching a global optimal objective value of the location probability. In order to discuss the identifiability performance of the presented method, a conception of a synergy degree of velocity fields is presented in order to quantitatively analyze the impact of velocity field on the identification performance. Based on this conception, some simulation cases were conducted. The application conditions of this method are obtained according to the simulation studies. In order to verify the presented method, we designed an experiment and identified an unknown source appearing in the experimental space. The result showed that the method can identify a sudden trace continuous source when the studied situation satisfies the application conditions.

  4. SUNPLIN: simulation with uncertainty for phylogenetic investigations.

    PubMed

    Martins, Wellington S; Carmo, Welton C; Longo, Humberto J; Rosa, Thierson C; Rangel, Thiago F

    2013-11-15

    Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.

  5. False discovery rate control incorporating phylogenetic tree increases detection power in microbiome-wide multiple testing.

    PubMed

    Xiao, Jian; Cao, Hongyuan; Chen, Jun

    2017-09-15

    Next generation sequencing technologies have enabled the study of the human microbiome through direct sequencing of microbial DNA, resulting in an enormous amount of microbiome sequencing data. One unique characteristic of microbiome data is the phylogenetic tree that relates all the bacterial species. Closely related bacterial species have a tendency to exhibit a similar relationship with the environment or disease. Thus, incorporating the phylogenetic tree information can potentially improve the detection power for microbiome-wide association studies, where hundreds or thousands of tests are conducted simultaneously to identify bacterial species associated with a phenotype of interest. Despite much progress in multiple testing procedures such as false discovery rate (FDR) control, methods that take into account the phylogenetic tree are largely limited. We propose a new FDR control procedure that incorporates the prior structure information and apply it to microbiome data. The proposed procedure is based on a hierarchical model, where a structure-based prior distribution is designed to utilize the phylogenetic tree. By borrowing information from neighboring bacterial species, we are able to improve the statistical power of detecting associated bacterial species while controlling the FDR at desired levels. When the phylogenetic tree is mis-specified or non-informative, our procedure achieves a similar power as traditional procedures that do not take into account the tree structure. We demonstrate the performance of our method through extensive simulations and real microbiome datasets. We identified far more alcohol-drinking associated bacterial species than traditional methods. R package StructFDR is available from CRAN. chen.jun2@mayo.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  6. Visualizing phylogenetic tree landscapes.

    PubMed

    Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A

    2017-02-02

    Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods used to display at once in 2- or 3- dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2- and 3-dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2- and 3-dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient decent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three- mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. We demonstrate for all three mtDNA alignments that 3D

  7. treespace: Statistical exploration of landscapes of phylogenetic trees.

    PubMed

    Jombart, Thibaut; Kendall, Michelle; Almagro-Garcia, Jacob; Colijn, Caroline

    2017-11-01

    The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

  8. SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations

    PubMed Central

    2013-01-01

    Background Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408

  9. Phylogenetic congruence and ecological coherence in terrestrial Thaumarchaeota.

    PubMed

    Oton, Eduard Vico; Quince, Christopher; Nicol, Graeme W; Prosser, James I; Gubry-Rangin, Cécile

    2016-01-01

    Thaumarchaeota form a ubiquitously distributed archaeal phylum, comprising both the ammonia-oxidising archaea (AOA) and other archaeal groups in which ammonia oxidation has not been demonstrated (including Group 1.1c and Group 1.3). The ecology of AOA in terrestrial environments has been extensively studied using either a functional gene, encoding ammonia monooxygenase subunit A (amoA) or 16S ribosomal RNA (rRNA) genes, which show phylogenetic coherence with respect to soil pH. To test phylogenetic congruence between these two markers and to determine ecological coherence in all Thaumarchaeota, we performed high-throughput sequencing of 16S rRNA and amoA genes in 46 UK soils presenting 29 available contextual soil characteristics. Adaptation to pH and organic matter content reflected strong ecological coherence at various levels of taxonomic resolution for Thaumarchaeota (AOA and non-AOA), whereas nitrogen, total mineralisable nitrogen and zinc concentration were also important factors associated with AOA thaumarchaeotal community distribution. Other significant associations with environmental factors were also detected for amoA and 16S rRNA genes, reflecting different diversity characteristics between these two markers. Nonetheless, there was significant statistical congruence between the markers at fine phylogenetic resolution, supporting the hypothesis of low horizontal gene transfer between Thaumarchaeota. Group 1.1c Thaumarchaeota were also widely distributed, with two clusters predominating, particularly in environments with higher moisture content and organic matter, whereas a similar ecological pattern was observed for Group 1.3 Thaumarchaeota. The ecological and phylogenetic congruence identified is fundamental to understand better the life strategies, evolutionary history and ecosystem function of the Thaumarchaeota.

  10. Phylogenetic congruence and ecological coherence in terrestrial Thaumarchaeota

    PubMed Central

    Oton, Eduard Vico; Quince, Christopher; Nicol, Graeme W; Prosser, James I; Gubry-Rangin, Cécile

    2016-01-01

    Thaumarchaeota form a ubiquitously distributed archaeal phylum, comprising both the ammonia-oxidising archaea (AOA) and other archaeal groups in which ammonia oxidation has not been demonstrated (including Group 1.1c and Group 1.3). The ecology of AOA in terrestrial environments has been extensively studied using either a functional gene, encoding ammonia monooxygenase subunit A (amoA) or 16S ribosomal RNA (rRNA) genes, which show phylogenetic coherence with respect to soil pH. To test phylogenetic congruence between these two markers and to determine ecological coherence in all Thaumarchaeota, we performed high-throughput sequencing of 16S rRNA and amoA genes in 46 UK soils presenting 29 available contextual soil characteristics. Adaptation to pH and organic matter content reflected strong ecological coherence at various levels of taxonomic resolution for Thaumarchaeota (AOA and non-AOA), whereas nitrogen, total mineralisable nitrogen and zinc concentration were also important factors associated with AOA thaumarchaeotal community distribution. Other significant associations with environmental factors were also detected for amoA and 16S rRNA genes, reflecting different diversity characteristics between these two markers. Nonetheless, there was significant statistical congruence between the markers at fine phylogenetic resolution, supporting the hypothesis of low horizontal gene transfer between Thaumarchaeota. Group 1.1c Thaumarchaeota were also widely distributed, with two clusters predominating, particularly in environments with higher moisture content and organic matter, whereas a similar ecological pattern was observed for Group 1.3 Thaumarchaeota. The ecological and phylogenetic congruence identified is fundamental to understand better the life strategies, evolutionary history and ecosystem function of the Thaumarchaeota. PMID:26140533

  11. Unrealistic phylogenetic trees may improve phylogenetic footprinting.

    PubMed

    Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo

    2017-06-01

    The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. : martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  12. Enumerating all maximal frequent subtrees in collections of phylogenetic trees.

    PubMed

    Deepak, Akshay; Fernández-Baca, David

    2014-01-01

    A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.

  13. Phylogenetic relationships among arecoid palms (Arecaceae: Arecoideae)

    PubMed Central

    Baker, William J.; Norup, Maria V.; Clarkson, James J.; Couvreur, Thomas L. P.; Dowe, John L.; Lewis, Carl E.; Pintaud, Jean-Christophe; Savolainen, Vincent; Wilmot, Tomas; Chase, Mark W.

    2011-01-01

    Background and Aims The Arecoideae is the largest and most diverse of the five subfamilies of palms (Arecaceae/Palmae), containing >50 % of the species in the family. Despite its importance, phylogenetic relationships among Arecoideae are poorly understood. Here the most densely sampled phylogenetic analysis of Arecoideae available to date is presented. The results are used to test the current classification of the subfamily and to identify priority areas for future research. Methods DNA sequence data for the low-copy nuclear genes PRK and RPB2 were collected from 190 palm species, covering 103 (96 %) genera of Arecoideae. The data were analysed using the parsimony ratchet, maximum likelihood, and both likelihood and parsimony bootstrapping. Key Results and Conclusions Despite the recovery of paralogues and pseudogenes in a small number of taxa, PRK and RPB2 were both highly informative, producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. Simultaneous analyses of the combined data sets provided additional resolution and support. Two areas of incongruence between PRK and RPB2 were strongly supported by the bootstrap relating to the placement of tribes Chamaedoreeae, Iriarteeae and Reinhardtieae; the causes of this incongruence remain uncertain. The current classification within Arecoideae was strongly supported by the present data. Of the 14 tribes and 14 sub-tribes in the classification, only five sub-tribes from tribe Areceae (Basseliniinae, Linospadicinae, Oncospermatinae, Rhopalostylidinae and Verschaffeltiinae) failed to receive support. Three major higher level clades were strongly supported: (1) the RRC clade (Roystoneeae, Reinhardtieae and Cocoseae), (2) the POS clade (Podococceae, Oranieae and Sclerospermeae) and (3) the core arecoid clade (Areceae, Euterpeae, Geonomateae, Leopoldinieae, Manicarieae and Pelagodoxeae). However, new data sources are required to elucidate ambiguities that remain in phylogenetic

  14. Phylogenetic and population-based approaches to mitogenome variation do not support association with male infertility.

    PubMed

    Gómez-Carballa, Alberto; Pardo-Seco, Jacobo; Martinón-Torres, Federico; Salas, Antonio

    2017-03-01

    Infertility has a complex multifactorial etiology and a high prevalence worldwide. Several studies have pointed to variation in the mitochondrial DNA (mtDNA) molecule as a factor responsible for the different disease phenotypes related to infertility. We analyzed 53 mitogenomes of infertile males from Galicia (northwest Spain), and these haplotypes were meta-analyzed phylogenetically with 43 previously reported from Portugal. Taking advantage of the large amount of information available, we additionally carried out association tests between patient mtDNA single-nucleotide polymorphisms (mtSNPs) and haplogroups against Iberian matched controls retrieved from The 1000 Genomes Project and the literature. Phylogenetic and association analyses did not reveal evidence of association between mtSNPs/haplogroups and infertility. Ratios and patterns in patients of nonsynonymous/synonymous changes, and variation at homoplasmic, heteroplasmic and private variants, fall within expected values for healthy individuals. Moreover, the haplogroup background of patients was variable and fits well with patterns typically observed in healthy western Europeans. We did not find evidence of association of mtSNPs or haplogroups pointing to a role for mtDNA in male infertility. A thorough review of the literature on mtDNA variation and infertility revealed contradictory findings and methodological and theoretical problems that overall undermine previous positive findings.

  15. IDENTIFYING SOURCES OF HUMAN EXPOSURE

    EPA Science Inventory

    Air pollution from ambient sources continues to adversely impact human health in the United States. A fundamental goal for EPA is to implement air quality standards and regulations that reduce health risks associated with exposures to criteria pollutants and air toxics. However...

  16. Comparative Analysis of Begonia Plastid Genomes and Their Utility for Species-Level Phylogenetics

    PubMed Central

    Harrison, Nicola; Harrison, Richard J.

    2016-01-01

    Recent, rapid radiations make species-level phylogenetics difficult to resolve. We used a multiplexed, high-throughput sequencing approach to identify informative genomic regions to resolve phylogenetic relationships at low taxonomic levels in Begonia from a survey of sixteen species. A long-range PCR method was used to generate draft plastid genomes to provide a strong phylogenetic backbone, identify fast evolving regions and provide informative molecular markers for species-level phylogenetic studies in Begonia. PMID:27058864

  17. Enumerating all maximal frequent subtrees in collections of phylogenetic trees

    PubMed Central

    2014-01-01

    Background A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474

  18. The Evolutionary Ecology of Plant Disease: A Phylogenetic Perspective.

    PubMed

    Gilbert, Gregory S; Parker, Ingrid M

    2016-08-04

    An explicit phylogenetic perspective provides useful tools for phytopathology and plant disease ecology because the traits of both plants and microbes are shaped by their evolutionary histories. We present brief primers on phylogenetic signal and the analytical tools of phylogenetic ecology. We review the literature and find abundant evidence of phylogenetic signal in pathogens and plants for most traits involved in disease interactions. Plant nonhost resistance mechanisms and pathogen housekeeping functions are conserved at deeper phylogenetic levels, whereas molecular traits associated with rapid coevolutionary dynamics are more labile at branch tips. Horizontal gene transfer disrupts the phylogenetic signal for some microbial traits. Emergent traits, such as host range and disease severity, show clear phylogenetic signals. Therefore pathogen spread and disease impact are influenced by the phylogenetic structure of host assemblages. Phylogenetically rare species escape disease pressure. Phylogenetic tools could be used to develop predictive tools for phytosanitary risk analysis and reduce disease pressure in multispecies cropping systems.

  19. Refuting phylogenetic relationships

    PubMed Central

    Bucknam, James; Boucher, Yan; Bapteste, Eric

    2006-01-01

    Background Phylogenetic methods are philosophically grounded, and so can be philosophically biased in ways that limit explanatory power. This constitutes an important methodologic dimension not often taken into account. Here we address this dimension in the context of concatenation approaches to phylogeny. Results We discuss some of the limits of a methodology restricted to verificationism, the philosophy on which gene concatenation practices generally rely. As an alternative, we describe a software which identifies and focuses on impossible or refuted relationships, through a simple analysis of bootstrap bipartitions, followed by multivariate statistical analyses. We show how refuting phylogenetic relationships could in principle facilitate systematics. We also apply our method to the study of two complex phylogenies: the phylogeny of the archaea and the phylogeny of the core of genes shared by all life forms. While many groups are rejected, our results left open a possible proximity of N. equitans and the Methanopyrales, of the Archaea and the Cyanobacteria, and as well the possible grouping of the Methanobacteriales/Methanoccocales and Thermosplasmatales, of the Spirochaetes and the Actinobacteria and of the Proteobacteria and firmicutes. Conclusion It is sometimes easier (and preferable) to decide which species do not group together than which ones do. When possible topologies are limited, identifying local relationships that are rejected may be a useful alternative to classical concatenation approaches aiming to find a globally resolved tree on the basis of weak phylogenetic markers. Reviewers This article was reviewed by Mark Ragan, Eugene V Koonin and J Peter Gogarten. PMID:16956399

  20. Phylogenetic classification of bony fishes.

    PubMed

    Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo

    2017-07-06

    Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution

  1. Treelink: data integration, clustering and visualization of phylogenetic trees.

    PubMed

    Allende, Christian; Sohn, Erik; Little, Cedric

    2015-12-29

    Phylogenetic trees are central to a wide range of biological studies. In many of these studies, tree nodes need to be associated with a variety of attributes. For example, in studies concerned with viral relationships, tree nodes are associated with epidemiological information, such as location, age and subtype. Gene trees used in comparative genomics are usually linked with taxonomic information, such as functional annotations and events. A wide variety of tree visualization and annotation tools have been developed in the past, however none of them are intended for an integrative and comparative analysis. Treelink is a platform-independent software for linking datasets and sequence files to phylogenetic trees. The application allows an automated integration of datasets to trees for operations such as classifying a tree based on a field or showing the distribution of selected data attributes in branches and leafs. Genomic and proteonomic sequences can also be linked to the tree and extracted from internal and external nodes. A novel clustering algorithm to simplify trees and display the most divergent clades was also developed, where validation can be achieved using the data integration and classification function. Integrated geographical information allows ancestral character reconstruction for phylogeographic plotting based on parsimony and likelihood algorithms. Our software can successfully integrate phylogenetic trees with different data sources, and perform operations to differentiate and visualize those differences within a tree. File support includes the most popular formats such as newick and csv. Exporting visualizations as images, cluster outputs and genomic sequences is supported. Treelink is available as a web and desktop application at http://www.treelinkapp.com .

  2. Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.

    PubMed

    Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M

    2012-02-01

    The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).

  3. Use of phylogenetic and phenotypic analyses to identify nonhemolytic streptococci isolated from bacteremic patients.

    PubMed

    Hoshino, Tomonori; Fujiwara, Taku; Kilian, Mogens

    2005-12-01

    The aim of this study was to evaluate molecular and phenotypic methods for the identification of nonhemolytic streptococci. A collection of 148 strains consisting of 115 clinical isolates from cases of infective endocarditis, septicemia, and meningitis and 33 reference strains, including type strains of all relevant Streptococcus species, were examined. Identification was performed by phylogenetic analysis of nucleotide sequences of four housekeeping genes, ddl, gdh, rpoB, and sodA; by PCR analysis of the glucosyltransferase (gtf) gene; and by conventional phenotypic characterization and identification using two commercial kits, Rapid ID 32 STREP and STREPTOGRAM and the associated databases. A phylogenetic tree based on concatenated sequences of the four housekeeping genes allowed unequivocal differentiation of recognized species and was used as the reference. Analysis of single gene sequences revealed deviation clustering in eight strains (5.4%) due to homologous recombination with other species. This was particularly evident in S. sanguinis and in members of the anginosus group of streptococci. The rate of correct identification of the strains by both commercial identification kits was below 50% but varied significantly between species. The most significant problems were observed with S. mitis and S. oralis and 11 Streptococcus species described since 1991. Our data indicate that identification based on multilocus sequence analysis is optimal. As a more practical alternative we recommend identification based on sodA sequences with reference to a comprehensive set of sequences that is available for downloading from our server. An analysis of the species distribution of 107 nonhemolytic streptococci from bacteremic patients showed a predominance of S. oralis and S. anginosus with various underlying infections.

  4. Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

    PubMed Central

    Gao, Beile

    2012-01-01

    Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973

  5. Comparison of cluster-based and source-attribution methods for estimating transmission risk using large HIV sequence databases.

    PubMed

    Le Vu, Stéphane; Ratmann, Oliver; Delpech, Valerie; Brown, Alison E; Gill, O Noel; Tostevin, Anna; Fraser, Christophe; Volz, Erik M

    2018-06-01

    Phylogenetic clustering of HIV sequences from a random sample of patients can reveal epidemiological transmission patterns, but interpretation is hampered by limited theoretical support and statistical properties of clustering analysis remain poorly understood. Alternatively, source attribution methods allow fitting of HIV transmission models and thereby quantify aspects of disease transmission. A simulation study was conducted to assess error rates of clustering methods for detecting transmission risk factors. We modeled HIV epidemics among men having sex with men and generated phylogenies comparable to those that can be obtained from HIV surveillance data in the UK. Clustering and source attribution approaches were applied to evaluate their ability to identify patient attributes as transmission risk factors. We find that commonly used methods show a misleading association between cluster size or odds of clustering and covariates that are correlated with time since infection, regardless of their influence on transmission. Clustering methods usually have higher error rates and lower sensitivity than source attribution method for identifying transmission risk factors. But neither methods provide robust estimates of transmission risk ratios. Source attribution method can alleviate drawbacks from phylogenetic clustering but formal population genetic modeling may be required to estimate quantitative transmission risk factors. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  6. Isolation, Phylogenetic Analysis and Anti-infective Activity Screening of Marine Sponge-Associated Actinomycetes

    PubMed Central

    Abdelmohsen, Usama Ramadan; Pimentel-Elardo, Sheila M.; Hanora, Amro; Radwan, Mona; Abou-El-Ela, Soad H.; Ahmed, Safwat; Hentschel, Ute

    2010-01-01

    Terrestrial actinomycetes are noteworthy producers of a multitude of antibiotics, however the marine representatives are much less studied in this regard. In this study, 90 actinomycetes were isolated from 11 different species of marine sponges that had been collected from offshore Ras Mohamed (Egypt) and from Rovinj (Croatia). Phylogenetic characterization of the isolates based on 16S rRNA gene sequencing supported their assignment to 18 different actinomycete genera representing seven different suborders. Fourteen putatively novel species were identified based on sequence similarity values below 98.2% to other strains in the NCBI database. A putative new genus related to Rubrobacter was isolated on M1 agar that had been amended with sponge extract, thus highlighting the need for innovative cultivation protocols. Testing for anti-infective activities was performed against clinically relevant, Gram-positive (Enterococcus faecalis, Staphylococcus aureus) and Gram-negative (Escherichia coli, Pseudomonas aeruginosa) bacteria, fungi (Candida albicans) and human parasites (Leishmania major, Trypanosoma brucei). Bioactivities against these pathogens were documented for 10 actinomycete isolates. These results show a high diversity of actinomycetes associated with marine sponges as well as highlight their potential to produce anti-infective agents. PMID:20411105

  7. The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases.

    PubMed

    Côté, Richard G; Jones, Philip; Martens, Lennart; Kerrien, Samuel; Reisinger, Florian; Lin, Quan; Leinonen, Rasko; Apweiler, Rolf; Hermjakob, Henning

    2007-10-18

    Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and code examples are available at

  8. The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases

    PubMed Central

    Côté, Richard G; Jones, Philip; Martens, Lennart; Kerrien, Samuel; Reisinger, Florian; Lin, Quan; Leinonen, Rasko; Apweiler, Rolf; Hermjakob, Henning

    2007-01-01

    Background Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. Results We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. Conclusion We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and

  9. Phylogenetic Analysis and Classification of the Fungal bHLH Domain

    PubMed Central

    Sailsbery, Joshua K.; Atchley, William R.; Dean, Ralph A.

    2012-01-01

    The basic Helix-Loop-Helix (bHLH) domain is an essential highly conserved DNA-binding domain found in many transcription factors in all eukaryotic organisms. The bHLH domain has been well studied in the Animal and Plant Kingdoms but has yet to be characterized within Fungi. Herein, we obtained and evaluated the phylogenetic relationship of 490 fungal-specific bHLH containing proteins from 55 whole genome projects composed of 49 Ascomycota and 6 Basidiomycota organisms. We identified 12 major groupings within Fungi (F1–F12); identifying conserved motifs and functions specific to each group. Several classification models were built to distinguish the 12 groups and elucidate the most discerning sites in the domain. Performance testing on these models, for correct group classification, resulted in a maximum sensitivity and specificity of 98.5% and 99.8%, respectively. We identified 12 highly discerning sites and incorporated those into a set of rules (simplified model) to classify sequences into the correct group. Conservation of amino acid sites and phylogenetic analyses established that like plant bHLH proteins, fungal bHLH–containing proteins are most closely related to animal Group B. The models used in these analyses were incorporated into a software package, the source code for which is available at www.fungalgenomics.ncsu.edu. PMID:22114358

  10. Functional & phylogenetic diversity of copepod communities

    NASA Astrophysics Data System (ADS)

    Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.

    2016-02-01

    The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.

  11. Comparison of Microbial and Chemical Source Tracking Markers To Identify Fecal Contamination Sources in the Humber River (Toronto, Ontario, Canada) and Associated Storm Water Outfalls.

    PubMed

    Staley, Zachery R; Grabuski, Josey; Sverko, Ed; Edge, Thomas A

    2016-11-01

    Storm water runoff is a major source of pollution, and understanding the components of storm water discharge is essential to remediation efforts and proper assessment of risks to human and ecosystem health. In this study, culturable Escherichia coli and ampicillin-resistant E. coli levels were quantified and microbial source tracking (MST) markers (including markers for general Bacteroidales spp., human, ruminant/cow, gull, and dog) were detected in storm water outfalls and sites along the Humber River in Toronto, Ontario, Canada, and enumerated via endpoint PCR and quantitative PCR (qPCR). Additionally, chemical source tracking (CST) markers specific for human wastewater (caffeine, carbamazepine, codeine, cotinine, acetaminophen, and acesulfame) were quantified. Human and gull fecal sources were detected at all sites, although concentrations of the human fecal marker were higher, particularly in outfalls (mean outfall concentrations of 4.22 log 10 copies, expressed as copy numbers [CN]/100 milliliters for human and 0.46 log 10 CN/100 milliliters for gull). Higher concentrations of caffeine, acetaminophen, acesulfame, E. coli, and the human fecal marker were indicative of greater raw sewage contamination at several sites (maximum concentrations of 34,800 ng/liter, 5,120 ng/liter, 9,720 ng/liter, 5.26 log 10 CFU/100 ml, and 7.65 log 10 CN/100 ml, respectively). These results indicate pervasive sewage contamination at storm water outfalls and throughout the Humber River, with multiple lines of evidence identifying Black Creek and two storm water outfalls with prominent sewage cross-connection problems requiring remediation. Limited data are available on specific sources of pollution in storm water, though our results indicate the value of using both MST and CST methodologies to more reliably assess sewage contamination in impacted watersheds. Storm water runoff is one of the most prominent non-point sources of biological and chemical contaminants which can

  12. Comparison of Microbial and Chemical Source Tracking Markers To Identify Fecal Contamination Sources in the Humber River (Toronto, Ontario, Canada) and Associated Storm Water Outfalls

    PubMed Central

    Grabuski, Josey; Sverko, Ed; Edge, Thomas A.

    2016-01-01

    ABSTRACT Storm water runoff is a major source of pollution, and understanding the components of storm water discharge is essential to remediation efforts and proper assessment of risks to human and ecosystem health. In this study, culturable Escherichia coli and ampicillin-resistant E. coli levels were quantified and microbial source tracking (MST) markers (including markers for general Bacteroidales spp., human, ruminant/cow, gull, and dog) were detected in storm water outfalls and sites along the Humber River in Toronto, Ontario, Canada, and enumerated via endpoint PCR and quantitative PCR (qPCR). Additionally, chemical source tracking (CST) markers specific for human wastewater (caffeine, carbamazepine, codeine, cotinine, acetaminophen, and acesulfame) were quantified. Human and gull fecal sources were detected at all sites, although concentrations of the human fecal marker were higher, particularly in outfalls (mean outfall concentrations of 4.22 log10 copies, expressed as copy numbers [CN]/100 milliliters for human and 0.46 log10 CN/100 milliliters for gull). Higher concentrations of caffeine, acetaminophen, acesulfame, E. coli, and the human fecal marker were indicative of greater raw sewage contamination at several sites (maximum concentrations of 34,800 ng/liter, 5,120 ng/liter, 9,720 ng/liter, 5.26 log10 CFU/100 ml, and 7.65 log10 CN/100 ml, respectively). These results indicate pervasive sewage contamination at storm water outfalls and throughout the Humber River, with multiple lines of evidence identifying Black Creek and two storm water outfalls with prominent sewage cross-connection problems requiring remediation. Limited data are available on specific sources of pollution in storm water, though our results indicate the value of using both MST and CST methodologies to more reliably assess sewage contamination in impacted watersheds. IMPORTANCE Storm water runoff is one of the most prominent non-point sources of biological and chemical contaminants

  13. YBYRÁ facilitates comparison of large phylogenetic trees.

    PubMed

    Machado, Denis Jacob

    2015-07-01

    The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, specially for large data sets. The programs that comprises this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .

  14. Microbial network, phylogenetic diversity and community membership in the active layer across a permafrost thaw gradient.

    PubMed

    Mondav, Rhiannon; McCalley, Carmody K; Hodgkins, Suzanne B; Frolking, Steve; Saleska, Scott R; Rich, Virginia I; Chanton, Jeff P; Crill, Patrick M

    2017-08-01

    Biogenic production and release of methane (CH 4 ) from thawing permafrost has the potential to be a strong source of radiative forcing. We investigated changes in the active layer microbial community of three sites representative of distinct permafrost thaw stages at a palsa mire in northern Sweden. The palsa site (intact permafrost and low radiative forcing signature) had a phylogenetically clustered community dominated by Acidobacteria and Proteobacteria. The bog (thawing permafrost and low radiative forcing signature) had lower alpha diversity and midrange phylogenetic clustering, characteristic of ecosystem disturbance affecting habitat filtering. Hydrogenotrophic methanogens and Acidobacteria dominated the bog shifting from palsa-like to fen-like at the waterline. The fen (no underlying permafrost, high radiative forcing signature) had the highest alpha, beta and phylogenetic diversity, was dominated by Proteobacteria and Euryarchaeota and was significantly enriched in methanogens. The Mire microbial network was modular with module cores consisting of clusters of Acidobacteria, Euryarchaeota or Xanthomonodales. Loss of underlying permafrost with associated hydrological shifts correlated to changes in microbial composition, alpha, beta and phylogenetic diversity associated with a higher radiative forcing signature. These results support the complex role of microbial interactions in mediating carbon budget changes and climate feedback in response to climate forcing. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  15. Phylogenetics and Differentiation of Salmonella Newport Lineages by Whole Genome Sequencing

    PubMed Central

    Cao, Guojie; Meng, Jianghong; Strain, Errol; Stones, Robert; Pettengill, James; Zhao, Shaohua; McDermott, Patrick; Brown, Eric; Allard, Marc

    2013-01-01

    Salmonella Newport has ranked in the top three Salmonella serotypes associated with foodborne outbreaks from 1995 to 2011 in the United States. In the current study, we selected 26 S. Newport strains isolated from diverse sources and geographic locations and then conducted 454 shotgun pyrosequencing procedures to obtain 16–24 × coverage of high quality draft genomes for each strain. Comparative genomic analysis of 28 S. Newport strains (including 2 reference genomes) and 15 outgroup genomes identified more than 140,000 informative SNPs. A resulting phylogenetic tree consisted of four sublineages and indicated that S. Newport had a clear geographic structure. Strains from Asia were divergent from those from the Americas. Our findings demonstrated that analysis using whole genome sequencing data resulted in a more accurate picture of phylogeny compared to that using single genes or small sets of genes. We selected loci around the mutS gene of S. Newport to differentiate distinct lineages, including those between invH and mutS genes at the 3′ end of Salmonella Pathogenicity Island 1 (SPI-1), ste fimbrial operon, and Clustered, Regularly Interspaced, Short Palindromic Repeats (CRISPR) associated-proteins (cas). These genes in the outgroup genomes held high similarity with either S. Newport Lineage II or III at the same loci. S. Newport Lineages II and III have different evolutionary histories in this region and our data demonstrated genetic flow and homologous recombination events around mutS. The findings suggested that S. Newport Lineages II and III diverged early in the serotype evolution and have evolved largely independently. Moreover, we identified genes that could delineate sublineages within the phylogenetic tree and that could be used as potential biomarkers for trace-back investigations during outbreaks. Thus, whole genome sequencing data enabled us to better understand the genetic background of pathogenicity and evolutionary history of S. Newport and

  16. Environmental Sources of Bacteria Differentially Influence Host-Associated Microbial Dynamics.

    PubMed

    Cardona, Cesar; Lax, Simon; Larsen, Peter; Stephens, Brent; Hampton-Marcell, Jarrad; Edwardson, Christian F; Henry, Chris; Van Bonn, Bill; Gilbert, Jack A

    2018-01-01

    Host-associated microbial dynamics are influenced by dietary and immune factors, but how exogenous microbial exposure shapes host-microbe dynamics remains poorly characterized. To investigate this phenomenon, we characterized the skin, rectum, and respiratory tract-associated microbiota in four aquarium-housed dolphins daily over a period of 6 weeks, including administration of a probiotic during weeks 4 to 6. The environmental bacterial sources were also characterized, including the animals' human handlers, the aquarium air and water, and the dolphins' food supply. Continuous microbial exposure occurred between all sites, yet each environment maintained a characteristic microbiota, suggesting that the majority of exposure events do not result in colonization. Small changes in water physicochemistry had a significant but weak correlation with change in dolphin-associated bacterial richness but had no influence on phylogenetic diversity. Food and air microbiota were the richest and had the largest conditional influence on other microbiota in the absence of probiotics, but during probiotic administration, food alone had the largest influence on the stability of the dolphin microbiota. Our results suggest that respiratory tract and gastrointestinal epithelium interactions with air- and food-associated microbes had the biggest influence on host-microbiota dynamics, while other interactions, such as skin transmission, played only a minor role. Finally, direct oral stimulation with a foreign exogenous microbial source can have a profound effect on microbial stability. IMPORTANCE These results provide valuable insights into the ecological influence of exogenous microbial exposure, as well as laying the foundation for improving aquarium management practices. By comparing data for dolphins from aquaria that use natural versus artificial seawater, we demonstrate the potential influence of aquarium water disinfection procedures on dolphin microbial dynamics.

  17. The phylogenetic structure of plant-pollinator networks increases with habitat size and isolation.

    PubMed

    Aizen, Marcelo A; Gleiser, Gabriela; Sabatino, Malena; Gilarranz, Luis J; Bascompte, Jordi; Verdú, Miguel

    2016-01-01

    Similarity among species in traits related to ecological interactions is frequently associated with common ancestry. Thus, closely related species usually interact with ecologically similar partners, which can be reinforced by diverse co-evolutionary processes. The effect of habitat fragmentation on the phylogenetic signal in interspecific interactions and correspondence between plant and animal phylogenies is, however, unknown. Here, we address to what extent phylogenetic signal and co-phylogenetic congruence of plant-animal interactions depend on habitat size and isolation by analysing the phylogenetic structure of 12 pollination webs from isolated Pampean hills. Phylogenetic signal in interspecific interactions differed among webs, being stronger for flower-visiting insects than plants. Phylogenetic signal and overall co-phylogenetic congruence increased independently with hill size and isolation. We propose that habitat fragmentation would erode the phylogenetic structure of interaction webs. A decrease in phylogenetic signal and co-phylogenetic correspondence in plant-pollinator interactions could be associated with less reliable mutualism and erratic co-evolutionary change. © 2015 John Wiley & Sons Ltd/CNRS.

  18. Intra-urban biomonitoring: Source apportionment using tree barks to identify air pollution sources.

    PubMed

    Moreira, Tiana Carla Lopes; de Oliveira, Regiani Carvalho; Amato, Luís Fernando Lourenço; Kang, Choong-Min; Saldiva, Paulo Hilário Nascimento; Saiki, Mitiko

    2016-05-01

    It is of great interest to evaluate if there is a relationship between possible sources and trace elements using biomonitoring techniques. In this study, tree bark samples of 171 trees were collected using a biomonitoring technique in the inner city of São Paulo. The trace elements (Al, Ba, Ca, Cl, Cu, Fe, K, Mg, Mn, Na, P, Rb, S, Sr and Zn) were determined by the energy dispersive X-ray fluorescence (EDXRF) spectrometry. The Principal Component Analysis (PCA) was applied to identify the plausible sources associated with tree bark measurements. The greatest source was vehicle-induced non-tailpipe emissions derived mainly from brakes and tires wear-out and road dust resuspension (characterized with Al, Ba, Cu, Fe, Mn and Zn), which was explained by 27.1% of the variance, followed by cement (14.8%), sea salt (11.6%) and biomass burning (10%), and fossil fuel combustion (9.8%). We also verified that the elements related to vehicular emission showed different concentrations at different sites of the same street, which might be helpful for a new street classification according to the emission source. The spatial distribution maps of element concentrations were obtained to evaluate the different levels of pollution in streets and avenues. Results indicated that biomonitoring techniques using tree bark can be applied to evaluate dispersion of air pollution and provide reliable data for the further epidemiological studies. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. DendroPy: a Python library for phylogenetic computing.

    PubMed

    Sukumaran, Jeet; Holder, Mark T

    2010-06-15

    DendroPy is a cross-platform library for the Python programming language that provides for object-oriented reading, writing, simulation and manipulation of phylogenetic data, with an emphasis on phylogenetic tree operations. DendroPy uses a splits-hash mapping to perform rapid calculations of tree distances, similarities and shape under various metrics. It contains rich simulation routines to generate trees under a number of different phylogenetic and coalescent models. DendroPy's data simulation and manipulation facilities, in conjunction with its support of a broad range of phylogenetic data formats (NEXUS, Newick, PHYLIP, FASTA, NeXML, etc.), allow it to serve a useful role in various phyloinformatics and phylogeographic pipelines. The stable release of the library is available for download and automated installation through the Python Package Index site (http://pypi.python.org/pypi/DendroPy), while the active development source code repository is available to the public from GitHub (http://github.com/jeetsukumaran/DendroPy).

  20. Leveraging Diverse Data Sources to Identify and Describe U.S. Health Care Delivery Systems.

    PubMed

    Cohen, Genna R; Jones, David J; Heeringa, Jessica; Barrett, Kirsten; Furukawa, Michael F; Miller, Dan; Mutti, Anne; Reschovsky, James D; Machta, Rachel; Shortell, Stephen M; Fraze, Taressa; Rich, Eugene

    2017-12-15

    Health care delivery systems are a growing presence in the U.S., yet research is hindered by the lack of universally agreed-upon criteria to denote formal systems. A clearer understanding of how to leverage real-world data sources to empirically identify systems is a necessary first step to such policy-relevant research. We draw from our experience in the Agency for Healthcare Research and Quality's Comparative Health System Performance (CHSP) initiative to assess available data sources to identify and describe systems, including system members (for example, hospitals and physicians) and relationships among the members (for example, hospital ownership of physician groups). We highlight five national data sources that either explicitly track system membership or detail system relationships: (1) American Hospital Association annual survey of hospitals; (2) Healthcare Relational Services Databases; (3) SK&A Healthcare Databases; (4) Provider Enrollment, Chain, and Ownership System; and (5) Internal Revenue Service 990 forms. Each data source has strengths and limitations for identifying and describing systems due to their varied content, linkages across data sources, and data collection methods. In addition, although no single national data source provides a complete picture of U.S. systems and their members, the CHSP initiative will create an early model of how such data can be combined to compensate for their individual limitations. Identifying systems in a way that can be repeated over time and linked to a host of other data sources will support analysis of how different types of organizations deliver health care and, ultimately, comparison of their performance.

  1. What Is Nonconsensual Sex? Young Women Identify Sources of Coerced Sex.

    PubMed

    French, Bryana H; Neville, Helen A

    2016-04-12

    Extending the American Psychological Association (APA) report on the Sexualization of Girls, this study investigated how young women identified sources of coerced sex. Findings from three focus groups with 25 Black and White adolescent women uncovered a perceived overarching force that "pushed" them to have sex before they felt ready. Participants identified four domains of coerced sex: (a) Sociocultural Context, (b) Internalized Sexual Scripts, (c) Partner Manipulation of Sexual Scripts, and (d) Developmental Status. Coerced sex was a complex system consisting of cultural, peer, and internal messages that create pressures to engage in sexual activities. Future implications for research and practice are presented. © The Author(s) 2016.

  2. Phylogenetic comparison of rabies viruses from disease outbreaks on the Svalbard Islands.

    PubMed

    Johnson, N; Dicker, A; Mork, T; Marston, D A; Fooks, A R; Tryland, M; Fuglei, E; Müller, T

    2007-01-01

    Periodic wildlife rabies epizootics occur in Arctic regions. The original sources of these outbreaks are rarely identified. In 1980, a wildlife epizootic of rabies occurred on the previously rabies-free Svalbard Islands, Norway. After this outbreak of rabies in the arctic fox population (Alopex lagopus), only single cases have been reported from the Islands over the following two decades. Phylogenetic characterization of four viruses isolated from infected arctic foxes from Svalbard from three different time periods suggest that the source of these epizootics could have been migration of this species from the Russian mainland. Arctic fox migration has likely contributed to the establishment of another zoonotic disease, Echinococcus multilocularis, on Svalbard in recent years.

  3. CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis

    PubMed Central

    2011-01-01

    Background The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Results Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. Conclusions CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore. PMID:21496247

  4. CDAO-store: ontology-driven data integration for phylogenetic analysis.

    PubMed

    Chisham, Brandon; Wright, Ben; Le, Trung; Son, Tran Cao; Pontelli, Enrico

    2011-04-15

    The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore.

  5. Phylogenetic structure of European Salmonella Enteritidis outbreak correlates with national and international egg distribution network

    PubMed Central

    Inns, Thomas; Jombart, Thibaut; Ashton, Philip; Loman, Nicolas; Chatt, Carol; Messelhaeusser, Ute; Rabsch, Wolfgang; Simon, Sandra; Nikisins, Sergejs; Bernard, Helen; le Hello, Simon; Jourdan da-Silva, Nathalie; Kornschober, Christian; Mossong, Joel; Hawkey, Peter; de Pinna, Elizabeth; Grant, Kathie; Cleary, Paul

    2016-01-01

    Outbreaks of Salmonella Enteritidis have long been associated with contaminated poultry and eggs. In the summer of 2014 a large multi-national outbreak of Salmonella Enteritidis phage type 14b occurred with over 350 cases reported in the United Kingdom, Germany, Austria, France and Luxembourg. Egg supply network investigation and microbiological sampling identified the source to be a Bavarian egg producer. As part of the international investigation into the outbreak, over 400 isolates were sequenced including isolates from cases, implicated UK premises and eggs from the suspected source producer. We were able to show a clear statistical correlation between the topology of the UK egg distribution network and the phylogenetic network of outbreak isolates. This correlation can most plausibly be explained by different parts of the egg distribution network being supplied by eggs solely from independent premises of the Bavarian egg producer (Company X). Microbiological sampling from the source premises, traceback information and information on the interventions carried out at the egg production premises all supported this conclusion. The level of insight into the outbreak epidemiology provided by whole-genome sequencing (WGS) would not have been possible using traditional microbial typing methods. PMID:28348865

  6. Phylogenetic structure of European Salmonella Enteritidis outbreak correlates with national and international egg distribution network.

    PubMed

    Dallman, Tim; Inns, Thomas; Jombart, Thibaut; Ashton, Philip; Loman, Nicolas; Chatt, Carol; Messelhaeusser, Ute; Rabsch, Wolfgang; Simon, Sandra; Nikisins, Sergejs; Bernard, Helen; le Hello, Simon; Jourdan da-Silva, Nathalie; Kornschober, Christian; Mossong, Joel; Hawkey, Peter; de Pinna, Elizabeth; Grant, Kathie; Cleary, Paul

    2016-08-01

    Outbreaks of Salmonella Enteritidis have long been associated with contaminated poultry and eggs. In the summer of 2014 a large multi-national outbreak of Salmonella Enteritidis phage type 14b occurred with over 350 cases reported in the United Kingdom, Germany, Austria, France and Luxembourg. Egg supply network investigation and microbiological sampling identified the source to be a Bavarian egg producer. As part of the international investigation into the outbreak, over 400 isolates were sequenced including isolates from cases, implicated UK premises and eggs from the suspected source producer. We were able to show a clear statistical correlation between the topology of the UK egg distribution network and the phylogenetic network of outbreak isolates. This correlation can most plausibly be explained by different parts of the egg distribution network being supplied by eggs solely from independent premises of the Bavarian egg producer (Company X). Microbiological sampling from the source premises, traceback information and information on the interventions carried out at the egg production premises all supported this conclusion. The level of insight into the outbreak epidemiology provided by whole-genome sequencing (WGS) would not have been possible using traditional microbial typing methods.

  7. Phylogenetic and Functional Structure of Wintering Waterbird Communities Associated with Ecological Differences.

    PubMed

    Che, Xianli; Zhang, Min; Zhao, Yanyan; Zhang, Qiang; Quan, Qing; Møller, Anders; Zou, Fasheng

    2018-01-19

    Ecological differences may be related to community component divisions between Oriental (west) and Sino-Japanese (east) realms, and such differences may result in weak geographical breaks in migratory species that are highly mobile. Here, we conducted comparative phylogenetic and functional structure analyses of wintering waterbird communities in southern China across two realms and subsequently examined possible climate drivers of the observed patterns. An analysis based on such highly migratory species is particularly telling because migration is bound to reduce or completely eliminate any divergence between communities. Phylogenetic and functional structure of eastern communities showed over-dispersion while western communities were clustered. Basal phylogenetic and functional turnover of western communities was significant lower than that of eastern communities. The break between eastern and western communities was masked by these two realms. Geographic patterns were related to mean temperature changes and temperature fluctuations, suggesting that temperature may filter waterbird lineages and traits, thus underlying geographical community divisions. These results suggest phylogenetic and functional divisions in southern China, coinciding with biogeography. This study shows that temperature fluctuations constitute an essential mechanism shaping geographical divisions that have largely gone undetected previously, even under climate change.

  8. phylo-node: A molecular phylogenetic toolkit using Node.js.

    PubMed

    O'Halloran, Damien M

    2017-01-01

    Node.js is an open-source and cross-platform environment that provides a JavaScript codebase for back-end server-side applications. JavaScript has been used to develop very fast and user-friendly front-end tools for bioinformatic and phylogenetic analyses. However, no such toolkits are available using Node.js to conduct comprehensive molecular phylogenetic analysis. To address this problem, I have developed, phylo-node, which was developed using Node.js and provides a stable and scalable toolkit that allows the user to perform diverse molecular and phylogenetic tasks. phylo-node can execute the analysis and process the resulting outputs from a suite of software options that provides tools for read processing and genome alignment, sequence retrieval, multiple sequence alignment, primer design, evolutionary modeling, and phylogeny reconstruction. Furthermore, phylo-node enables the user to deploy server dependent applications, and also provides simple integration and interoperation with other Node modules and languages using Node inheritance patterns, and a customized piping module to support the production of diverse pipelines. phylo-node is open-source and freely available to all users without sign-up or login requirements. All source code and user guidelines are openly available at the GitHub repository: https://github.com/dohalloran/phylo-node.

  9. Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information

    PubMed Central

    McDonald, Daniel; Gonzalez, Antonio; Navas-Molina, Jose A.; Jiang, Lingjing; Xu, Zhenjiang Zech; Winker, Kevin; Kado, Deborah M.; Orwoll, Eric; Manary, Mark; Mirarab, Siavash

    2018-01-01

    ABSTRACT Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods. PMID:29719869

  10. Identifying selectively important amino acid positions associated with alternative habitat environments in fish mitochondrial genomes.

    PubMed

    Xia, Jun Hong; Li, Hong Lian; Zhang, Yong; Meng, Zi Ning; Lin, Hao Ran

    2018-05-01

    Fish species inhabitating seawater (SW) or freshwater (FW) habitats have to develop genetic adaptations to alternative environment factors, especially salinity. Functional consequences of the protein variations associated with habitat environments in fish mitochondrial genomes have not yet received much attention. We analyzed 829 complete fish mitochondrial genomes and compared the amino acid differences of 13 mitochondrial protein families between FW and SW fish groups. We identified 47 specificity determining sites (SDS) that associated with FW or SW environments from 12 mitochondrial protein families. Thirty-two (68%) of the SDS sites are hydrophobic, 13 (28%) are neutral, and the remaining sites are acidic or basic. Seven of those SDS from ND1, ND2 and ND5 were scored as probably damaging to the protein structures. Furthermore, phylogenetic tree based Bayes Empirical Bayes analysis also detected 63 positive sites associated with alternative habitat environments across ten mtDNA proteins. These signatures could be important for studying mitochondrial genetic variation relevant to fish physiology and ecology.

  11. Phylogenetic Network for European mtDNA

    PubMed Central

    Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

    2001-01-01

    The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229

  12. Using Seismic and Infrasonic Data to Identify Persistent Sources

    NASA Astrophysics Data System (ADS)

    Nava, S.; Brogan, R.

    2014-12-01

    Data from seismic and infrasound sensors were combined to aid in the identification of persistent sources such as mining-related explosions. It is of interest to operators of seismic networks to identify these signals in their event catalogs. Acoustic signals below the threshold of human hearing, in the frequency range of ~0.01 to 20 Hz are classified as infrasound. Persistent signal sources are useful as ground truth data for the study of atmospheric infrasound signal propagation, identification of manmade versus naturally occurring seismic sources, and other studies. By using signals emanating from the same location, propagation studies, for example, can be conducted using a variety of atmospheric conditions, leading to improvements to the modeling process for eventual use where the source is not known. We present results from several studies to identify ground truth sources using both seismic and infrasound data.

  13. Estimating phylogenetic trees from genome-scale data.

    PubMed

    Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V

    2015-12-01

    The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.

  14. A Checklist for Identifying Funding Sources for Assistive Technology.

    ERIC Educational Resources Information Center

    Menlove, Martell

    1996-01-01

    This article offers a systematically organized series of questions to identify funding sources for assistive technology for students with disabilities. A decision tree links the questions with funding sources. (DB)

  15. Phylogenetic comparative methods on phylogenetic networks with reticulations.

    PubMed

    Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile

    2018-04-25

    The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. All the methods developed here are available in the Julia package PhyloNetworks.

  16. Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales.

    PubMed

    Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter

    2017-12-01

    Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.

  17. Phylogenetics.

    PubMed

    Sleator, Roy D

    2011-04-01

    The recent rapid expansion in the DNA and protein databases, arising from large-scale genomic and metagenomic sequence projects, has forced significant development in the field of phylogenetics: the study of the evolutionary relatedness of the planet's inhabitants. Advances in phylogenetic analysis have greatly transformed our view of the landscape of evolutionary biology, transcending the view of the tree of life that has shaped evolutionary theory since Darwinian times. Indeed, modern phylogenetic analysis no longer focuses on the restricted Darwinian-Mendelian model of vertical gene transfer, but must also consider the significant degree of lateral gene transfer, which connects and shapes almost all living things. Herein, I review the major tree-building methods, their strengths, weaknesses and future prospects.

  18. Rearrangement moves on rooted phylogenetic networks

    PubMed Central

    Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine

    2017-01-01

    Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network—that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves—named rNNI and rSPR—reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results—separating the contributions of horizontal and vertical moves—we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide a

  19. Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

    PubMed

    Barr, W Andrew; Scott, Robert S

    2014-04-01

    In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, <5.6% of DFAs and <8.26% of PGLS analyses were significant. When characters contained phylogenetic signal and actual habitats were used, 33.27 to 45.07% of DFAs and <13.09% of PGLS analyses were significant. False Discovery Rate (FDR) corrections for multiple PGLS analyses reduced the rate of significance to <4.64%. In all cases using actual habitats and characters with phylogenetic signal, correct classification rates of DFAs exceeded random chance. In simulations involving phylogenetic signal in both predictor variables and predicted categories, PGLS with FDR was rarely significant, while DFA often was. In short, DFA offered no indication that differences between categories might be explained by phylogenetic signal, while PGLS did. As such, PGLS provides a valuable tool for testing the functional hypotheses at the heart of ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.

  20. PhyloExplorer: a web server to validate, explore and query phylogenetic trees

    PubMed Central

    Ranwez, Vincent; Clairon, Nicolas; Delsuc, Frédéric; Pourali, Saeed; Auberval, Nicolas; Diser, Sorel; Berry, Vincent

    2009-01-01

    Background Many important problems in evolutionary biology require molecular phylogenies to be reconstructed. Phylogenetic trees must then be manipulated for subsequent inclusion in publications or analyses such as supertree inference and tree comparisons. However, no tool is currently available to facilitate the management of tree collections providing, for instance: standardisation of taxon names among trees with respect to a reference taxonomy; selection of relevant subsets of trees or sub-trees according to a taxonomic query; or simply computation of descriptive statistics on the collection. Moreover, although several databases of phylogenetic trees exist, there is currently no easy way to find trees that are both relevant and complementary to a given collection of trees. Results We propose a tool to facilitate assessment and management of phylogenetic tree collections. Given an input collection of rooted trees, PhyloExplorer provides facilities for obtaining statistics describing the collection, correcting invalid taxon names, extracting taxonomically relevant parts of the collection using a dedicated query language, and identifying related trees in the TreeBASE database. Conclusion PhyloExplorer is a simple and interactive website implemented through underlying Python libraries and MySQL databases. It is available at: and the source code can be downloaded from: . PMID:19450253

  1. Towards an eco-phylogenetic framework for infectious disease ecology.

    PubMed

    Fountain-Jones, Nicholas M; Pearse, William D; Escobar, Luis E; Alba-Casals, Ana; Carver, Scott; Davies, T Jonathan; Kraberger, Simona; Papeş, Monica; Vandegrift, Kurt; Worsley-Tonks, Katherine; Craft, Meggan E

    2018-05-01

    Identifying patterns and drivers of infectious disease dynamics across multiple scales is a fundamental challenge for modern science. There is growing awareness that it is necessary to incorporate multi-host and/or multi-parasite interactions to understand and predict current and future disease threats better, and new tools are needed to help address this task. Eco-phylogenetics (phylogenetic community ecology) provides one avenue for exploring multi-host multi-parasite systems, yet the incorporation of eco-phylogenetic concepts and methods into studies of host pathogen dynamics has lagged behind. Eco-phylogenetics is a transformative approach that uses evolutionary history to infer present-day dynamics. Here, we present an eco-phylogenetic framework to reveal insights into parasite communities and infectious disease dynamics across spatial and temporal scales. We illustrate how eco-phylogenetic methods can help untangle the mechanisms of host-parasite dynamics from individual (e.g. co-infection) to landscape scales (e.g. parasite/host community structure). An improved ecological understanding of multi-host and multi-pathogen dynamics across scales will increase our ability to predict disease threats. © 2017 Cambridge Philosophical Society.

  2. Phylogenetic analysis of Helicobacter pylori cagA gene of Turkish isolates and the association with gastric pathology.

    PubMed

    Salih, Barik A; Bolek, Bora Kazim; Yildiz, Mehmet Taha; Arikan, Soykan

    2013-11-18

    The cagA gene is one of the important virulence factors of Helicobacter pylori. The diversity of cagA 5' conserved region is thought to reflect the phylogenetic relationships between different H. pylori isolates and their association with peptic ulceration. Significant geographical differences among isolates have been reported. The aim of this study is to compare Turkish H. pylori isolates with isolates from different geographical locations and to correlate the association with peptic ulceration. Total of 52 isolates of which 19 were Turkish and 33 from other geographic locations were studied. Gastric antral biopsies collected from 19 Turkish patients (Gastritis = 12, ulcer = 7) were used to amplify the cagA 5' region by PCR then followed by DNA sequencing. The phylogenetic tree displayed 3 groups: A) a mix of 2 sub-groups "Asian" and "African/Anatolian/Asian/European", B) "Anatolian/European" and C) "American-Indian". Turkish H. pylori isolates clustered in the mixed sub-group A were mostly from gastritis patients while those clustered in group B were from peptic ulcer patients. A phylogenetic tree constructed for our Turkish isolates detected distinctive features among those from gastritis and ulcer patients. We have found that 2/3 of the gastritis isolates were clustered alone while 1/3 was clustered together with the ulcer isolates. Several amino acids were found to be shared between the later groups but not with the first group of gastritis. This study provided an additional insight into the profile of our cagA gene which implies a relationship in geographic locations of the isolates.

  3. A phylogenetic road map to antimalarial Artemisia species.

    PubMed

    Pellicer, Jaume; Saslis-Lagoudakis, C Haris; Carrió, Esperança; Ernst, Madeleine; Garnatje, Teresa; Grace, Olwen M; Gras, Airy; Mumbrú, Màrius; Vallès, Joan; Vitales, Daniel; Rønsted, Nina

    2018-06-21

    The discovery of the antimalarial agent artemisinin is considered one of the most significant success stories of ethnopharmacological research in recent times. The isolation of artemisinin was inspired by the use of Artemisia annua in traditional Chinese medicine (TCM) and was awarded a Nobel Prize in 2015. Antimalarial activity has since been demonstrated for a range of other Artemisia species, suggesting that the genus could provide alternative sources of antimalarial treatments. Given the stunning diversity of the genus (c. 500 species), a prioritisation of taxa to be investigated for their likely antimalarial properties is required. Here we use a phylogenetic approach to explore the potential for identifying species more likely to possess antimalarial properties. Ethnobotanical data from literature reports is recorded for 117 species. Subsequent phylogenetically informed analysis was used to identify lineages in which there is an overrepresentation of species used to treat malarial symptoms, and which could therefore be high priority for further investigation of antimalarial activity. We show that these lineages indeed include several species with documented antimalarial activity. To further inform our approach, we use LC-MS/MS analysis to explore artemisinin content in fifteen species from both highlighted and not highlighted lineages. We detected artemisinin in nine species, in eight of them for the first time, doubling the number of Artemisia taxa known to content this molecule. Our findings indicate that artemisinin may be widespread across the genus, providing an accessible local resource outside the distribution area of Artemisia annua. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  4. A Manual to Identify Sources of Fluvial Sediment | Science ...

    EPA Pesticide Factsheets

    Sedimentation is one of the main causes of stream/river aquatic life use impairments in R3. Currently states lack standard guidance on appropriate tools available to quantify sediment sources and develop sediment budgets in TMDL Development. Methods for distinguishing sediment types for TMDL development will focus stream restoration and soil conservation efforts in strategic locations in a watershed and may better target appropriate BMPs to achieve sediment load reductions. Properly identifying sediment sources in a TMDL will also help focus NPDES permitting, stream restoration activities and other TMDL implementation efforts. This project will focus on developing a framework that will be published as a guidance document that outlines steps and approaches to identify the significant sources of fine-grained sediment in 303D listed watersheds. In this framework, the sediment-fingerprinting and sediment budget approaches will be emphasized. This project will focus on developing a framework that will be published as a guidance document that outlines steps and approaches to identify the significant sources of fine-grained sediment in 303D listed watersheds. In this framework, the sediment-fingerprinting and sediment budget approaches will be emphasized.

  5. Public Health Investigation of Two Outbreaks of Shiga Toxin-Producing Escherichia coli O157 Associated with Consumption of Watercress

    PubMed Central

    Dallman, Timothy J.; Launders, Naomi; Willis, Caroline; Byrne, Lisa; Jorgensen, Frieda; Eppinger, Mark; Adak, Goutam K.; Aird, Heather; Elviss, Nicola; Grant, Kathie A.; Morgan, Dilys; McLauchlin, Jim

    2015-01-01

    An increase in the number of cases of Shiga toxin-producing Escherichia coli (STEC) O157 phage type 2 (PT2) in England in September 2013 was epidemiologically linked to watercress consumption. Whole-genome sequencing (WGS) identified a phylogenetically related cluster of 22 cases (outbreak 1). The isolates comprising this cluster were not closely related to any other United Kingdom strain in the Public Health England WGS database, suggesting a possible imported source. A second outbreak of STEC O157 PT2 (outbreak 2) was identified epidemiologically following the detection of outbreak 1. Isolates associated with outbreak 2 were phylogenetically distinct from those in outbreak 1. Epidemiologically unrelated isolates on the same branch as the outbreak 2 cluster included those from human cases in England with domestically acquired infection and United Kingdom domestic cattle. Environmental sampling using PCR resulted in the isolation of STEC O157 PT2 from irrigation water at one implicated watercress farm, and WGS showed this isolate belonged to the same phylogenetic cluster as outbreak 2 isolates. Cattle were in close proximity to the watercress bed and were potentially the source of the second outbreak. Transfer of STEC from the field to the watercress bed may have occurred through wildlife entering the watercress farm or via runoff water. During this complex outbreak investigation, epidemiological studies, comprehensive testing of environmental samples, and the use of novel molecular methods proved invaluable in demonstrating that two simultaneous outbreaks of STEC O157 PT2 were both linked to the consumption of watercress but were associated with different sources of contamination. PMID:25841005

  6. Phylogenetic Copy-Number Factorization of Multiple Tumor Samples.

    PubMed

    Zaccaria, Simone; El-Kebir, Mohammed; Klau, Gunnar W; Raphael, Benjamin J

    2018-04-16

    Cancer is an evolutionary process driven by somatic mutations. This process can be represented as a phylogenetic tree. Constructing such a phylogenetic tree from genome sequencing data is a challenging task due to the many types of mutations in cancer and the fact that nearly all cancer sequencing is of a bulk tumor, measuring a superposition of somatic mutations present in different cells. We study the problem of reconstructing tumor phylogenies from copy-number aberrations (CNAs) measured in bulk-sequencing data. We introduce the Copy-Number Tree Mixture Deconvolution (CNTMD) problem, which aims to find the phylogenetic tree with the fewest number of CNAs that explain the copy-number data from multiple samples of a tumor. We design an algorithm for solving the CNTMD problem and apply the algorithm to both simulated and real data. On simulated data, we find that our algorithm outperforms existing approaches that either perform deconvolution/factorization of mixed tumor samples or build phylogenetic trees assuming homogeneous tumor samples. On real data, we analyze multiple samples from a prostate cancer patient, identifying clones within these samples and a phylogenetic tree that relates these clones and their differing proportions across samples. This phylogenetic tree provides a higher resolution view of copy-number evolution of this cancer than published analyses.

  7. Resolution of habitat-associated ecogenomic signatures in bacteriophage genomes and application to microbial source tracking.

    PubMed

    Ogilvie, Lesley A; Nzakizwanayo, Jonathan; Guppy, Fergus M; Dedi, Cinzia; Diston, David; Taylor, Huw; Ebdon, James; Jones, Brian V

    2018-04-01

    Just as the expansion in genome sequencing has revealed and permitted the exploitation of phylogenetic signals embedded in bacterial genomes, the application of metagenomics has begun to provide similar insights at the ecosystem level for microbial communities. However, little is known regarding this aspect of bacteriophage associated with microbial ecosystems, and if phage encode discernible habitat-associated signals diagnostic of underlying microbiomes. Here we demonstrate that individual phage can encode clear habitat-related 'ecogenomic signatures', based on relative representation of phage-encoded gene homologues in metagenomic data sets. Furthermore, we show the ecogenomic signature encoded by the gut-associated ɸB124-14 can be used to segregate metagenomes according to environmental origin, and distinguish 'contaminated' environmental metagenomes (subject to simulated in silico human faecal pollution) from uncontaminated data sets. This indicates phage-encoded ecological signals likely possess sufficient discriminatory power for use in biotechnological applications, such as development of microbial source tracking tools for monitoring water quality.

  8. The nuclear 18S ribosomal RNA gene as a source of phylogenetic information in the genus Taenia.

    PubMed

    Yan, Hongbin; Lou, Zhongzi; Li, Li; Ni, Xingwei; Guo, Aijiang; Li, Hongmin; Zheng, Yadong; Dyachenko, Viktor; Jia, Wanzhong

    2013-03-01

    Most species of the genus Taenia are of considerable medical and veterinary significance. In this study, complete nuclear 18S rRNA gene sequences were obtained from seven members of genus Taenia [Taenia multiceps, Taenia saginata, Taenia asiatica, Taenia solium, Taenia pisiformis, Taenia hydatigena, and Taenia taeniaeformis] and a phylogeny inferred using these sequences. Most of the variable sites fall within the variable regions, V1-V5. We show that sequences from the nuclear 18S ribosomal RNA gene have considerable promise as sources of phylogenetic information within the genus Taenia. Furthermore, given that almost all the variable sites lie within defined variable portions of that gene, it will be appropriate and economical to sequence only those regions for additional species of Taenia.

  9. Multi-locus phylogeny of dolphins in the subfamily Lissodelphininae: character synergy improves phylogenetic resolution

    PubMed Central

    Harlin-Cognato, April D; Honeycutt, Rodney L

    2006-01-01

    Background Dolphins of the genus Lagenorhynchus are anti-tropically distributed in temperate to cool waters. Phylogenetic analyses of cytochrome b sequences have suggested that the genus is polyphyletic; however, many relationships were poorly resolved. In this study, we present a combined-analysis phylogenetic hypothesis for Lagenorhynchus and members of the subfamily Lissodelphininae, which is derived from two nuclear and two mitochondrial data sets and the addition of 34 individuals representing 9 species. In addition, we characterize with parsimony and Bayesian analyses the phylogenetic utility and interaction of characters with statistical measures, including the utility of highly consistent (non-homoplasious) characters as a conservative measure of phylogenetic robustness. We also explore the effects of removing sources of character conflict on phylogenetic resolution. Results Overall, our study provides strong support for the monophyly of the subfamily Lissodelphininae and the polyphyly of the genus Lagenorhynchus. In addition, the simultaneous parsimony analysis resolved and/or improved resolution for 12 nodes including: (1) L. albirostris, L. acutus; (2) L. obscurus and L. obliquidens; and (3) L. cruciger and L. australis. In addition, the Bayesian analysis supported the monophyly of the Cephalorhynchus, and resolved ambiguities regarding the relationship of L. australis/L. cruciger to other members of the genus Lagenorhynchus. The frequency of highly consistent characters varied among data partitions, but the rate of evolution was consistent within data partitions. Although the control region was the greatest source of character conflict, removal of this data partition impeded phylogenetic resolution. Conclusion The simultaneous analysis approach produced a more robust phylogenetic hypothesis for Lagenorhynchus than previous studies, thus supporting a phylogenetic approach employing multiple data partitions that vary in overall rate of evolution. Even in

  10. Associations of Leaf Spectra with Genetic and Phylogenetic Variation in Oaks: Prospects for Remote Detection of Biodiversity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cavender-Bares, Jeannine; Meireles, Jose; Couture, John

    Species and phylogenetic lineages have evolved to differ in the way that they acquire and deploy resources, with consequences for their physiological, chemical and structural attributes, many of which can be detected using spectral reflectance form leaves. Recent technological advances for assessing optical properties of plants offer opportunities to detect functional traits of organisms and differentiate levels of biological organization across the tree of life. We connect leaf-level full range spectral data (400–2400 nm) of leaves to the hierarchical organization of plant diversity within the oak genus (Quercus) using field and greenhouse experiments in which environmental factors and plant agemore » are controlled. We show that spectral data significantly differentiate populations within a species and that spectral similarity is significantly associated with phylogenetic similarity among species. Furthermore, we show that hyperspectral information allows more accurate classification of taxa than spectrally-derived traits, which by definition are of lower dimensionality. Finally, model accuracy increases at higher levels in the hierarchical organization of plant diversity, such that we are able to better distinguish clades than species or populations. This pattern supports an evolutionary explanation for the degree of optical differentiation among plants and demonstrates potential for remote detection of genetic and phylogenetic diversity.« less

  11. Associations of Leaf Spectra with Genetic and Phylogenetic Variation in Oaks: Prospects for Remote Detection of Biodiversity

    DOE PAGES

    Cavender-Bares, Jeannine; Meireles, Jose; Couture, John; ...

    2016-03-09

    Species and phylogenetic lineages have evolved to differ in the way that they acquire and deploy resources, with consequences for their physiological, chemical and structural attributes, many of which can be detected using spectral reflectance form leaves. Recent technological advances for assessing optical properties of plants offer opportunities to detect functional traits of organisms and differentiate levels of biological organization across the tree of life. We connect leaf-level full range spectral data (400–2400 nm) of leaves to the hierarchical organization of plant diversity within the oak genus (Quercus) using field and greenhouse experiments in which environmental factors and plant agemore » are controlled. We show that spectral data significantly differentiate populations within a species and that spectral similarity is significantly associated with phylogenetic similarity among species. Furthermore, we show that hyperspectral information allows more accurate classification of taxa than spectrally-derived traits, which by definition are of lower dimensionality. Finally, model accuracy increases at higher levels in the hierarchical organization of plant diversity, such that we are able to better distinguish clades than species or populations. This pattern supports an evolutionary explanation for the degree of optical differentiation among plants and demonstrates potential for remote detection of genetic and phylogenetic diversity.« less

  12. Identifying sources of aeolian mineral dust: Present and past

    USGS Publications Warehouse

    Muhs, Daniel R; Prospero, Joseph M; Baddock, Matthew C; Gill, Thomas E

    2014-01-01

    Aeolian mineral dust is an important component of the Earth’s environmental systems, playing roles in the planetary radiation balance, as a source of fertilizer for biota in both terrestrial and marine realms and as an archive for understanding atmospheric circulation and paleoclimate in the geologic past. Crucial to understanding all of these roles of dust is the identification of dust sources. Here we review the methods used to identify dust sources active at present and in the past. Contemporary dust sources, produced by both glaciogenic and non-glaciogenic processes, can be readily identified by the use of Earth-orbiting satellites. These data show that present dust sources are concentrated in a global dust belt that encompasses large topographic basins in low-latitude arid and semiarid regions. Geomorphic studies indicate that specific point sources for dust in this zone include dry or ephemeral lakes, intermittent stream courses, dune fields, and some bedrock surfaces. Back-trajectory analyses are also used to identify dust sources, through modeling of wind fields and the movement of air parcels over periods of several days. Identification of dust sources from the past requires novel approaches that are part of the geologic toolbox of provenance studies. Identification of most dust sources of the past requires the use of physical, mineralogical, geochemical, and isotopic analyses of dust deposits. Physical properties include systematic spatial changes in dust deposit thickness and particle size away from a source. Mineralogy and geochemistry can pinpoint dust sources by clay mineral ratios and Sc-Th-La abundances, respectively. The most commonly used isotopic methods utilize isotopes of Nd, Sr, and Pb and have been applied extensively in dust archives of deep-sea cores, ice cores, and loess. All these methods have shown that dust sources have changed over time, with far more abundant dust supplies existing during glacial periods. Greater dust supplies in

  13. BIMLR: a method for constructing rooted phylogenetic networks from rooted phylogenetic trees.

    PubMed

    Wang, Juan; Guo, Maozu; Xing, Linlin; Che, Kai; Liu, Xiaoyan; Wang, Chunyu

    2013-09-15

    Rooted phylogenetic trees constructed from different datasets (e.g. from different genes) are often conflicting with one another, i.e. they cannot be integrated into a single phylogenetic tree. Phylogenetic networks have become an important tool in molecular evolution, and rooted phylogenetic networks are able to represent conflicting rooted phylogenetic trees. Hence, the development of appropriate methods to compute rooted phylogenetic networks from rooted phylogenetic trees has attracted considerable research interest of late. The CASS algorithm proposed by van Iersel et al. is able to construct much simpler networks than other available methods, but it is extremely slow, and the networks it constructs are dependent on the order of the input data. Here, we introduce an improved CASS algorithm, BIMLR. We show that BIMLR is faster than CASS and less dependent on the input data order. Moreover, BIMLR is able to construct much simpler networks than almost all other methods. BIMLR is available at http://nclab.hit.edu.cn/wangjuan/BIMLR/. © 2013 Elsevier B.V. All rights reserved.

  14. Recent Approaches to Estimate Associations Between Source-Specific Air Pollution and Health.

    PubMed

    Krall, Jenna R; Strickland, Matthew J

    2017-03-01

    Estimating health effects associated with source-specific exposure is important for better understanding how pollution impacts health and for developing policies to better protect public health. Although epidemiologic studies of sources can be informative, these studies are challenging to conduct because source-specific exposures (e.g., particulate matter from vehicles) often are not directly observed and must be estimated. We reviewed recent studies that estimated associations between pollution sources and health to identify methodological developments designed to address important challenges. Notable advances in epidemiologic studies of sources include approaches for (1) propagating uncertainty in source estimation into health effect estimates, (2) assessing regional and seasonal variability in emissions sources and source-specific health effects, and (3) addressing potential confounding in estimated health effects. Novel methodological approaches to address challenges in studies of pollution sources, particularly evaluation of source-specific health effects, are important for determining how source-specific exposure impacts health.

  15. The problem and promise of scale dependency in community phylogenetics.

    PubMed

    Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K

    2006-10-01

    The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.

  16. Is plant mitochondrial RNA editing a source of phylogenetic incongruence? An answer from in silico and in vivo data sets.

    PubMed

    Picardi, Ernesto; Quagliariello, Carla

    2008-03-26

    In plant mitochondria, the post-transcriptional RNA editing process converts C to U at a number of specific sites of the mRNA sequence and usually restores phylogenetically conserved codons and the encoded amino acid residues. Sites undergoing RNA editing evolve at a higher rate than sites not modified by the process. As a result, editing sites strongly affect the evolution of plant mitochondrial genomes, representing an important source of sequence variability and potentially informative characters. To date no clear and convincing evidence has established whether or not editing sites really affect the topology of reconstructed phylogenetic trees. For this reason, we investigated here the effect of RNA editing on the tree building process of twenty different plant mitochondrial gene sequences and by means of computer simulations. Based on our simulation study we suggest that the editing 'noise' in tree topology inference is mainly manifested at the cDNA level. In particular, editing sites tend to confuse tree topologies when artificial genomic and cDNA sequences are generated shorter than 500 bp and with an editing percentage higher than 5.0%. Similar results have been also obtained with genuine plant mitochondrial genes. In this latter instance, indeed, the topology incongruence increases when the editing percentage goes up from about 3.0 to 14.0%. However, when the average gene length is higher than 1,000 bp (rps3, matR and atp1) no differences in the comparison between inferred genomic and cDNA topologies could be detected. Our findings by the here reported in silico and in vivo computer simulation system seem to strongly suggest that editing sites contribute in the generation of misleading phylogenetic trees if the analyzed mitochondrial gene sequence is highly edited (higher than 3.0%) and reduced in length (shorter than 500 bp). In the current lack of direct experimental evidence the results presented here encourage, thus, the use of genomic mitochondrial

  17. Phyx: phylogenetic tools for unix.

    PubMed

    Brown, Joseph W; Walker, Joseph F; Smith, Stephen A

    2017-06-15

    The ease with which phylogenomic data can be generated has drastically escalated the computational burden for even routine phylogenetic investigations. To address this, we present phyx : a collection of programs written in C ++ to explore, manipulate, analyze and simulate phylogenetic objects (alignments, trees and MCMC logs). Modelled after Unix/GNU/Linux command line tools, individual programs perform a single task and operate on standard I/O streams that can be piped to quickly and easily form complex analytical pipelines. Because of the stream-centric paradigm, memory requirements are minimized (often only a single tree or sequence in memory at any instance), and hence phyx is capable of efficiently processing very large datasets. phyx runs on POSIX-compliant operating systems. Source code, installation instructions, documentation and example files are freely available under the GNU General Public License at https://github.com/FePhyFoFum/phyx. eebsmith@umich.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  18. Phyx: phylogenetic tools for unix

    PubMed Central

    Brown, Joseph W.; Walker, Joseph F.; Smith, Stephen A.

    2017-01-01

    Abstract Summary: The ease with which phylogenomic data can be generated has drastically escalated the computational burden for even routine phylogenetic investigations. To address this, we present phyx: a collection of programs written in C ++ to explore, manipulate, analyze and simulate phylogenetic objects (alignments, trees and MCMC logs). Modelled after Unix/GNU/Linux command line tools, individual programs perform a single task and operate on standard I/O streams that can be piped to quickly and easily form complex analytical pipelines. Because of the stream-centric paradigm, memory requirements are minimized (often only a single tree or sequence in memory at any instance), and hence phyx is capable of efficiently processing very large datasets. Availability and Implementation: phyx runs on POSIX-compliant operating systems. Source code, installation instructions, documentation and example files are freely available under the GNU General Public License at https://github.com/FePhyFoFum/phyx Contact: eebsmith@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28174903

  19. Phylogenetic analysis of Helicobacter pylori cagA gene of Turkish isolates and the association with gastric pathology

    PubMed Central

    2013-01-01

    Background The cagA gene is one of the important virulence factors of Helicobacter pylori. The diversity of cagA 5′ conserved region is thought to reflect the phylogenetic relationships between different H. pylori isolates and their association with peptic ulceration. Significant geographical differences among isolates have been reported. The aim of this study is to compare Turkish H. pylori isolates with isolates from different geographical locations and to correlate the association with peptic ulceration. Methods Total of 52 isolates of which 19 were Turkish and 33 from other geographic locations were studied. Gastric antral biopsies collected from 19 Turkish patients (Gastritis = 12, ulcer = 7) were used to amplify the cagA 5′ region by PCR then followed by DNA sequencing. Results The phylogenetic tree displayed 3 groups: A) a mix of 2 sub-groups “Asian” and “African/Anatolian/Asian/European”, B) “Anatolian/European” and C) “American-Indian”. Turkish H. pylori isolates clustered in the mixed sub-group A were mostly from gastritis patients while those clustered in group B were from peptic ulcer patients. A phylogenetic tree constructed for our Turkish isolates detected distinctive features among those from gastritis and ulcer patients. We have found that 2/3 of the gastritis isolates were clustered alone while 1/3 was clustered together with the ulcer isolates. Several amino acids were found to be shared between the later groups but not with the first group of gastritis. Conclusions This study provided an additional insight into the profile of our cagA gene which implies a relationship in geographic locations of the isolates. PMID:24245965

  20. Morphometric study of phylogenetic and ecologic signals in procyonid (mammalia: carnivora) endocasts.

    PubMed

    Ahrens, Heather E

    2014-12-01

    Endocasts provide a proxy for brain morphology but are rarely incorporated in phylogenetic analyses despite the potential for new suites of characters. The phylogeny of Procyonidae, a carnivoran family with relatively limited taxonomic diversity, is not well resolved because morphological and molecular data yield conflicting topologies. The presence of phylogenetic and ecologic signals in the endocasts of procyonids will be determined using three-dimensional geometric morphometrics. Endocasts of seven ingroup species and four outgroup species were digitally rendered and 21 landmarks were collected from the endocast surface. Two phylogenetic hypotheses of Procyonidae will be examined using methods testing for phylogenetic signal in morphometric data. In analyses of all taxa, there is significant phylogenetic signal in brain shape for both the morphological and molecular topologies. However, the analyses of ingroup taxa recover a significant phylogenetic signal for the morphological topology only. These results indicate support for the molecular outgroup topology, but not the ingroup topology given the brain shape data. Further examination of brain shape using principal components analysis and wireframe comparisons suggests procyonids possess more developed areas of the brain associated with motor control, spatial perception, and balance relative to the basal musteloid condition. Within Procyonidae, similar patterns of variation are present, and may be associated with increased arboreality in certain taxa. Thus, brain shape derived from endocasts may be used to test for phylogenetic signal and preliminary analyses suggest an association with behavior and ecology. © 2014 Wiley Periodicals, Inc.

  1. Plant traits determine the phylogenetic structure of arbuscular mycorrhizal fungal communities.

    PubMed

    López-García, Álvaro; Varela-Cervero, Sara; Vasar, Martti; Öpik, Maarja; Barea, José M; Azcón-Aguilar, Concepción

    2017-12-01

    Functional diversity in ecosystems has traditionally been studied using aboveground plant traits. Despite the known effect of plant traits on the microbial community composition, their effects on the microbial functional diversity are only starting to be assessed. In this study, the phylogenetic structure of arbuscular mycorrhizal (AM) fungal communities associated with plant species differing in life cycle and growth form, that is, plant life forms, was determined to unravel the effect of plant traits on the functional diversity of this fungal group. The results of the 454 pyrosequencing showed that the AM fungal community composition differed across plant life forms and this effect was dependent on the soil collection date. Plants with ruderal characteristics tended to associate with phylogenetically clustered AM fungal communities. By contrast, plants with resource-conservative traits associated with phylogenetically overdispersed AM fungal communities. Additionally, the soil collected in different seasons yielded AM fungal communities with different phylogenetic dispersion. In summary, we found that the phylogenetic structure, and hence the functional diversity, of AM fungal communities is dependent on plant traits. This finding adds value to the use of plant traits for the evaluation of belowground ecosystem diversity, functions and processes. © 2017 John Wiley & Sons Ltd.

  2. Placement of attine ant-associated Pseudonocardia in a global Pseudonocardia phylogeny (Pseudonocardiaceae, Actinomycetales): a test of two symbiont-association models

    PubMed Central

    Mueller, Ulrich G.; Ishak, Heather; Lee, Jung C.; Sen, Ruchira; Gutell, Robin R.

    2010-01-01

    We reconstruct the phylogenetic relationships within the bacterial genus Pseudonocardia to evaluate two models explaining how and why Pseudonocardia bacteria colonize the microbial communities on the integument of fungus-gardening ant species (Attini, Formicidae). The traditional Coevolution-Codivergence model views the integument-colonizing Pseudonocardia as mutualistic microbes that are largely vertically transmitted between ant generations and that supply antibiotics that specifically suppress the garden pathogen Escovopsis. The more recent Acquisition model views Pseudonocardia as part of a larger integumental microbe community that frequently colonizes the ant integument from environmental sources (e.g., soil, plant material). Under this latter model, ant-associated Pseudonocardia may have diverse ecological roles on the ant integument (possibly ranging from pathogenic, to commensal, to mutualistic) and are not necessarily related to Escovopsis suppression. We test distinct predictions of these two models regarding the phylogenetic proximity of ant-associated and environmental Pseudonocardia. We amassed 16S-rRNA gene sequence information for 87 attine-associated and 238 environmental Pseudonocardia, aligned the sequences with the help of RNA secondary structure modeling, and reconstructed phylogenetic relationships using a maximum-likelihood approach. We present 16S-rRNA secondary structure models of representative Pseudonocardia species to improve sequence alignments and identify sequencing errors. Our phylogenetic analyses reveal close affinities and even identical sequence matches between environmental Pseudonocardia and ant-associated Pseudonocardia, as well as nesting of environmental Pseudonocardia in subgroups that were previously thought to be specialized to associate only with attine ants. The great majority of ant associated Pseudonocardia are closely related to autotrophic Pseudonocardia and are placed in a large subgroup of Pseudonocardia that is

  3. Placement of attine ant-associated Pseudonocardia in a global Pseudonocardia phylogeny (Pseudonocardiaceae, Actinomycetales): a test of two symbiont-association models.

    PubMed

    Mueller, Ulrich G; Ishak, Heather; Lee, Jung C; Sen, Ruchira; Gutell, Robin R

    2010-08-01

    We reconstruct the phylogenetic relationships within the bacterial genus Pseudonocardia to evaluate two models explaining how and why Pseudonocardia bacteria colonize the microbial communities on the integument of fungus-gardening ant species (Attini, Formicidae). The traditional Coevolution-Codivergence model views the integument-colonizing Pseudonocardia as mutualistic microbes that are largely vertically transmitted between ant generations and that supply antibiotics that specifically suppress the garden pathogen Escovopsis. The more recent Acquisition model views Pseudonocardia as part of a larger integumental microbe community that frequently colonizes the ant integument from environmental sources (e.g., soil, plant material). Under this latter model, ant-associated Pseudonocardia may have diverse ecological roles on the ant integument (possibly ranging from pathogenic, to commensal, to mutualistic) and are not necessarily related to Escovopsis suppression. We test distinct predictions of these two models regarding the phylogenetic proximity of ant-associated and environmental Pseudonocardia. We amassed 16S-rRNA gene sequence information for 87 attine-associated and 238 environmental Pseudonocardia, aligned the sequences with the help of RNA secondary structure modeling, and reconstructed phylogenetic relationships using a maximum-likelihood approach. We present 16S-rRNA secondary structure models of representative Pseudonocardia species to improve sequence alignments and identify sequencing errors. Our phylogenetic analyses reveal close affinities and even identical sequence matches between environmental Pseudonocardia and ant-associated Pseudonocardia, as well as nesting of environmental Pseudonocardia in subgroups that were previously thought to be specialized to associate only with attine ants. The great majority of ant-associated Pseudonocardia are closely related to autotrophic Pseudonocardia and are placed in a large subgroup of Pseudonocardia that is

  4. Phylogenetic and ecological characteristics associated with thiaminase activity in Laurentian Great Lakes fishes

    USGS Publications Warehouse

    Riley, S.C.; Evans, A.N.

    2008-01-01

    Thiamine deficiency complex (TDC) causes mortality and sublethal effects in Great Lakes salmonines and results from low concentrations of egg thiamine that are thought to be caused by thiaminolytic enzymes (i.e., thiaminase) present in the diet. This complex has the potential to undermine efforts to restore lake trout Salvelinus namaycush and severely restrict salmonid production in the Great Lakes. Although thiaminase has been found in a variety of Great Lakes fishes, the ultimate source of thiaminase in Great Lakes fishes is currently unknown. We used logistic regression analysis to investigate relationships between thiaminase activity and phylogenetic or ecological characteristics of 39 Great Lakes fish species. The taxonomically more ancestral species were more likely to show thiaminase activity than the more derived species. Species that feed at lower trophic levels and occupy benthic habitats also appeared to be more likely to show thiaminase activity; these variables were correlated with taxonomy, which was the most important predictor of thiaminase activity. Further analyses of the relationship between quantitative measures of thiaminase activity and ecological characteristics of Great Lakes fish species would provide greater insight into potential sources and pathways of thiaminase in Great Lakes food webs. ?? Copyright by the American Fisheries Society 2008.

  5. Can we identify source lithology of basalt?

    PubMed

    Yang, Zong-Feng; Zhou, Jun-Hong

    2013-01-01

    The nature of source rocks of basaltic magmas plays a fundamental role in understanding the composition, structure and evolution of the solid earth. However, identification of source lithology of basalts remains uncertainty. Using a parameterization of multi-decadal melting experiments on a variety of peridotite and pyroxenite, we show here that a parameter called FC3MS value (FeO/CaO-3*MgO/SiO2, all in wt%) can identify most pyroxenite-derived basalts. The continental oceanic island basalt-like volcanic rocks (MgO>7.5%) (C-OIB) in eastern China and Mongolia are too high in the FC3MS value to be derived from peridotite source. The majority of the C-OIB in phase diagrams are equilibrium with garnet and clinopyroxene, indicating that garnet pyroxenite is the dominant source lithology. Our results demonstrate that many reputed evolved low magnesian C-OIBs in fact represent primary pyroxenite melts, suggesting that many previous geological and petrological interpretations of basalts based on the single peridotite model need to be reconsidered.

  6. Can we identify source lithology of basalt?

    PubMed Central

    Yang, Zong-Feng; Zhou, Jun-Hong

    2013-01-01

    The nature of source rocks of basaltic magmas plays a fundamental role in understanding the composition, structure and evolution of the solid earth. However, identification of source lithology of basalts remains uncertainty. Using a parameterization of multi-decadal melting experiments on a variety of peridotite and pyroxenite, we show here that a parameter called FC3MS value (FeO/CaO-3*MgO/SiO2, all in wt%) can identify most pyroxenite-derived basalts. The continental oceanic island basalt-like volcanic rocks (MgO>7.5%) (C-OIB) in eastern China and Mongolia are too high in the FC3MS value to be derived from peridotite source. The majority of the C-OIB in phase diagrams are equilibrium with garnet and clinopyroxene, indicating that garnet pyroxenite is the dominant source lithology. Our results demonstrate that many reputed evolved low magnesian C-OIBs in fact represent primary pyroxenite melts, suggesting that many previous geological and petrological interpretations of basalts based on the single peridotite model need to be reconsidered. PMID:23676779

  7. Prevalence, Associated Risk Factors, and Phylogenetic Analysis of Toxocara vitulorum Infection in Yaks on the Qinghai Tibetan Plateau, China

    PubMed Central

    Li, Kun; Lan, Yanfang; Luo, Houqiang; Zhang, Hui; Liu, Dongyu; Zhang, Lihong; Gui, Rui; Wang, Lei; Shahzad, Muhammad; Sizhu, Suolang; Li, Jiakui; Chamba, Yangzom

    2016-01-01

    Toxocara vitulorum has been rarely reported in yaks at high altitudes and remote areas of Sichuan Province of Tibetan Plateau of China. The current study was designed to investigate the prevalence, associated risk factors, and phylogenetic characteristics of T. vitulorum in yak calves on the Qinghai Tibetan plateau. Fecal samples were collected from 891 yak calves and were examined for the presence of T. vitulorum eggs by the McMaster technique. A multivariable logistic regression model was employed to explore variables potentially associated with exposure to T. vitulorum infection. T. vitulorum specimens were collected from the feces of yaks in Hongyuan of Sichuan Province, China. DNA was extracted from ascaris. After PCR amplification, the sequencing of ND1 gene was carried out and phylogenetic analyses was performed by MEGA 6.0 software. The results showed that 64 (20.1%; 95% CI 15.8–24.9%), 75 (17.2; 13.8–21.1), 29 (40.9; 29.3–53.2), and 5 (7.6; 2.5–16.8) yak calves were detected out to excrete T. vitulorum eggs in yak calve feces in Qinghai, Tibet, Sichuan, and Gansu, respectively. The present study revealed that high infection and mortality by T. vitulorum is wildly spread on the Qinghai Tibetan plateau, China by fecal examination. Geographical origin, ages, and fecal consistencies are the risk factors associated with T. vitulorum prevalence by logistic regression analysis. Molecular detection and phylogenetic analysis of ND1 gene of T. vitulorum indicated that T. vitulorum in the yak calves on the Qinghai Tibetan plateau are homologous to preveiously studies reported. PMID:27853122

  8. Genetic Diversity of Mycobacterium tuberculosis in Peru and Exploration of Phylogenetic Associations with Drug Resistance

    PubMed Central

    Sheen, Patricia; Couvin, David; Grandjean, Louis; Zimic, Mirko; Dominguez, Maria; Luna, Giannina; Gilman, Robert H.; Rastogi, Nalin; Moore, David A. J.

    2013-01-01

    Background There is limited available data on the strain diversity of M tuberculosis in Peru, though there may be interesting lessons to learn from a setting where multidrug resistant TB has emerged as a major problem despite an apparently well-functioning DOTS control programme. Methods Spoligotyping was undertaken on 794 strains of M tuberculosis collected between 1999 and 2005 from 553 community-based patients and 241 hospital-based HIV co-infected patients with pulmonary tuberculosis in Lima, Peru. Phylogenetic and epidemiologic analyses permitted identification of clusters and exploration of spoligotype associations with drug resistance. Results Mean patient age was 31.9 years, 63% were male and 30.4% were known to be HIV+. Rifampicin mono-resistance, isoniazid mono-resistance and multidrug resistance (MDR) were identified in 4.7%, 8.7% and 17.3% of strains respectively. Of 794 strains from 794 patients there were 149 different spoligotypes. Of these there were 27 strains (3.4%) with novel, unique orphan spoligotypes. 498 strains (62.7%) were clustered in the nine most common spoligotypes: 16.4% SIT 50 (clade H3), 12.3% SIT 53 (clade T1), 8.3% SIT 33 (LAM3), 7.4% SIT 42 (LAM9), 5.5% SIT 1 (Beijing), 3.9% SIT 47 (H1), 3.0% SIT 222 (clade unknown), 3.0% SIT1355 (LAM), and 2.8% SIT 92 (X3). Amongst HIV-negative community-based TB patients no associations were seen between drug resistance and specific spoligotypes; in contrast HIV-associated MDRTB, but not isoniazid or rifampicin mono-resistance, was associated with SIT42 and SIT53 strains. Conclusion Two spoligotypes were associated with MDR particularly amongst patients with HIV. The MDR-HIV association was significantly reduced after controlling for SIT42 and SIT53 status; residual confounding may explain the remaining apparent association. These data are suggestive of a prolonged, clonal, hospital-based outbreak of MDR disease amongst HIV patients but do not support a hypothesis of strain-specific propensity

  9. Community phylogenetics at the biogeographical scale: cold tolerance, niche conservatism and the structure of North American forests.

    PubMed

    Hawkins, Bradford A; Rueda, Marta; Rangel, Thiago F; Field, Richard; Diniz-Filho, José Alexandre F; Linder, Peter

    2014-01-01

    Aim The fossil record has led to a historical explanation for forest diversity gradients within the cool parts of the Northern Hemisphere, founded on a limited ability of woody angiosperm clades to adapt to mid-Tertiary cooling. We tested four predictions of how this should be manifested in the phylogenetic structure of 91,340 communities: (1) forests to the north should comprise species from younger clades (families) than forests to the south; (2) average cold tolerance at a local site should be associated with the mean family age (MFA) of species; (3) minimum temperature should account for MFA better than alternative environmental variables; and (4) traits associated with survival in cold climates should evolve under a niche conservatism constraint. Location The contiguous United States. Methods We extracted angiosperms from the US Forest Service's Forest Inventory and Analysis database. MFA was calculated by assigning age of the family to which each species belongs and averaging across the species in each community. We developed a phylogeny to identify phylogenetic signal in five traits: realized cold tolerance, seed size, seed dispersal mode, leaf phenology and height. Phylogenetic signal representation curves and phylogenetic generalized least squares were used to compare patterns of trait evolution against Brownian motion. Eleven predictors structured at broad or local scales were generated to explore relationships between environment and MFA using random forest and general linear models. Results Consistent with predictions, (1) southern communities comprise angiosperm species from older families than northern communities, (2) cold tolerance is the trait most strongly associated with local MFA, (3) minimum temperature in the coldest month is the environmental variable that best describes MFA, broad-scale variables being much stronger correlates than local-scale variables, and (4) the phylogenetic structures of cold tolerance and at least one other trait

  10. A Multiple-Tracer Approach for Identifying Sewage Sources to an Urban Stream System

    USGS Publications Warehouse

    Hyer, Kenneth Edward

    2007-01-01

    The presence of human-derived fecal coliform bacteria (sewage) in streams and rivers is recognized as a human health hazard. The source of these human-derived bacteria, however, is often difficult to identify and eliminate, because sewage can be delivered to streams through a variety of mechanisms, such as leaking sanitary sewers or private lateral lines, cross-connected pipes, straight pipes, sewer-line overflows, illicit dumping of septic waste, and vagrancy. A multiple-tracer study was conducted to identify site-specific sources of sewage in Accotink Creek, an urban stream in Fairfax County, Virginia, that is listed on the Commonwealth's priority list of impaired streams for violations of the fecal coliform bacteria standard. Beyond developing this multiple-tracer approach for locating sources of sewage inputs to Accotink Creek, the second objective of the study was to demonstrate how the multiple-tracer approach can be applied to other streams affected by sewage sources. The tracers used in this study were separated into indicator tracers, which are relatively simple and inexpensive to apply, and confirmatory tracers, which are relatively difficult and expensive to analyze. Indicator tracers include fecal coliform bacteria, surfactants, boron, chloride, chloride/bromide ratio, specific conductance, dissolved oxygen, turbidity, and water temperature. Confirmatory tracers include 13 organic compounds that are associated with human waste, including caffeine, cotinine, triclosan, a number of detergent metabolites, several fragrances, and several plasticizers. To identify sources of sewage to Accotink Creek, a detailed investigation of the Accotink Creek main channel, tributaries, and flowing storm drains was undertaken from 2001 to 2004. Sampling was conducted in a series of eight synoptic sampling events, each of which began at the most downstream site and extended upstream through the watershed and into the headwaters of each tributary. Using the synoptic

  11. Public Health Investigation of Two Outbreaks of Shiga Toxin-Producing Escherichia coli O157 Associated with Consumption of Watercress.

    PubMed

    Jenkins, Claire; Dallman, Timothy J; Launders, Naomi; Willis, Caroline; Byrne, Lisa; Jorgensen, Frieda; Eppinger, Mark; Adak, Goutam K; Aird, Heather; Elviss, Nicola; Grant, Kathie A; Morgan, Dilys; McLauchlin, Jim

    2015-06-15

    An increase in the number of cases of Shiga toxin-producing Escherichia coli (STEC) O157 phage type 2 (PT2) in England in September 2013 was epidemiologically linked to watercress consumption. Whole-genome sequencing (WGS) identified a phylogenetically related cluster of 22 cases (outbreak 1). The isolates comprising this cluster were not closely related to any other United Kingdom strain in the Public Health England WGS database, suggesting a possible imported source. A second outbreak of STEC O157 PT2 (outbreak 2) was identified epidemiologically following the detection of outbreak 1. Isolates associated with outbreak 2 were phylogenetically distinct from those in outbreak 1. Epidemiologically unrelated isolates on the same branch as the outbreak 2 cluster included those from human cases in England with domestically acquired infection and United Kingdom domestic cattle. Environmental sampling using PCR resulted in the isolation of STEC O157 PT2 from irrigation water at one implicated watercress farm, and WGS showed this isolate belonged to the same phylogenetic cluster as outbreak 2 isolates. Cattle were in close proximity to the watercress bed and were potentially the source of the second outbreak. Transfer of STEC from the field to the watercress bed may have occurred through wildlife entering the watercress farm or via runoff water. During this complex outbreak investigation, epidemiological studies, comprehensive testing of environmental samples, and the use of novel molecular methods proved invaluable in demonstrating that two simultaneous outbreaks of STEC O157 PT2 were both linked to the consumption of watercress but were associated with different sources of contamination. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  12. Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat.

    PubMed

    Harris, J Kirk; Caporaso, J Gregory; Walker, Jeffrey J; Spear, John R; Gold, Nicholas J; Robertson, Charles E; Hugenholtz, Philip; Goodrich, Julia; McDonald, Daniel; Knights, Dan; Marshall, Paul; Tufo, Henry; Knight, Rob; Pace, Norman R

    2013-01-01

    The microbial mats of Guerrero Negro (GN), Baja California Sur, Mexico historically were considered a simple environment, dominated by cyanobacteria and sulfate-reducing bacteria. Culture-independent rRNA community profiling instead revealed these microbial mats as among the most phylogenetically diverse environments known. A preliminary molecular survey of the GN mat based on only ∼1500 small subunit rRNA gene sequences discovered several new phylum-level groups in the bacterial phylogenetic domain and many previously undetected lower-level taxa. We determined an additional ∼119,000 nearly full-length sequences and 28,000 >200 nucleotide 454 reads from a 10-layer depth profile of the GN mat. With this unprecedented coverage of long sequences from one environment, we confirm the mat is phylogenetically stratified, presumably corresponding to light and geochemical gradients throughout the depth of the mat. Previous shotgun metagenomic data from the same depth profile show the same stratified pattern and suggest that metagenome properties may be predictable from rRNA gene sequences. We verify previously identified novel lineages and identify new phylogenetic diversity at lower taxonomic levels, for example, thousands of operational taxonomic units at the family-genus levels differ considerably from known sequences. The new sequences populate parts of the bacterial phylogenetic tree that previously were poorly described, but indicate that any comprehensive survey of GN diversity has only begun. Finally, we show that taxonomic conclusions are generally congruent between Sanger and 454 sequencing technologies, with the taxonomic resolution achieved dependent on the abundance of reference sequences in the relevant region of the rRNA tree of life.

  13. A review of criticisms of phylogenetic nomenclature: is taxonomic freedom the fundamental issue?

    PubMed

    Bryant, Harold N; Cantino, Philip D

    2002-02-01

    The proposal to implement a phylogenetic nomenclatural system governed by the PhyloCode), in which taxon names are defined by explicit reference to common descent, has met with strong criticism from some proponents of phylogenetic taxonomy (taxonomy based on the principle of common descent in which only clades and species are recognized). We examine these criticisms and find that some of the perceived problems with phylogenetic nomenclature are based on misconceptions, some are equally true of the current rank-based nomenclatural system, and some will be eliminated by implementation of the PhyloCode. Most of the criticisms are related to an overriding concern that, because the meanings of names are associated with phylogenetic pattern which is subject to change, the adoption of phylogenetic nomenclature will lead to increased instability in the content of taxa. This concern is associated with the fact that, despite the widespread adoption of the view that taxa are historical entities that are conceptualized based on ancestry, many taxonomists also conceptualize taxa based on their content. As a result, critics of phylogenetic nomenclature have argued that taxonomists should be free to emend the content of taxa without constraints imposed by nomenclatural decisions. However, in phylogenetic nomenclature the contents of taxa are determined, not by the taxonomist, but by the combination of the phylogenetic definition of the name and a phylogenetic hypothesis. Because the contents of taxa, once their names are defined, can no longer be freely modified by taxonomists, phylogenetic nomenclature is perceived as limiting taxonomic freedom. We argue that the form of taxonomic freedom inherent to phylogenetic nomenclature is appropriate to phylogenetic taxonomy in which taxa are considered historical entities that are discovered through phylogenetic analysis and are not human constructs.

  14. Evidence of two distinct phylogenetic lineages of dog rabies virus circulating in Cambodia.

    PubMed

    Mey, Channa; Metlin, Artem; Duong, Veasna; Ong, Sivuth; In, Sotheary; Horwood, Paul F; Reynes, Jean-Marc; Bourhy, Hervé; Tarantola, Arnaud; Buchy, Philippe

    2016-03-01

    This first extensive retrospective study of the molecular epidemiology of dog rabies in Cambodia included 149 rabies virus (RABV) entire nucleoprotein sequences obtained from 1998-2011. The sequences were analyzed in conjunction with RABVs from other Asian countries. Phylogenetic reconstruction confirmed the South-East Asian phylogenetic clade comprising viruses from Cambodia, Vietnam, Thailand, Laos and Myanmar. The present study represents the first attempt to classify the phylogenetic lineages inside this clade, resulting in the confirmation that all the Cambodian viruses belonged to the South-East Asian (SEA) clade. Three distinct phylogenetic lineages in the region were established with the majority of viruses from Cambodia closely related to viruses from Thailand, Laos and Vietnam, forming the geographically widespread phylogenetic lineage SEA1. A South-East Asian lineage SEA2 comprised two viruses from Cambodia was identified, which shared a common ancestor with RABVs originating from Laos. Viruses from Myanmar formed separate phylogenetic lineages within the major SEA clade. Bayesian molecular clock analysis suggested that the time to most recent common ancestor (TMRCA) of all Cambodian RABVs dated to around 1950. The TMRCA of the Cambodian SEA1 lineage was around 1964 and that of the SEA2 lineage was around 1953. The results identified three phylogenetically distinct and geographically separated lineages inside the earlier identified major SEA clade, covering at least five countries in the region. A greater understanding of the molecular epidemiology of rabies in South-East Asia is an important step to monitor progress on the efforts to control canine rabies in the region. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. Diversity, Physiochemical and Phylogenetic Analyses of Bacteria Isolated from Various Drinking Water Sources.

    PubMed

    Eid, Neveen H; Al Doghaither, Huda A; Kumosani, Taha A; Gull, Munazza

    2017-01-01

    To evaluate the indigenous bacterial strains of drinking water from the most commercial water types including bottled and filtered water that are currently used in Saudi Arabia. Thirty randomly selected commercial brands of bottled water were purchased from Saudi local markets. Moreover, samples from tap water and filtered water were collected in sterilized glass bottles and stored at 4°C. Biochemical analyses including pH, temperature, lactose fermentation test (LAC), indole test (IND), methyl red test (MR), Voges-Proskauer test (VP), urease test (URE), catalase test (CAT), aerobic and anaerobic test (Ae/An) were measured. Molecular identification and comparative sequence analyses were done by full length 16S rRNA gene sequences using gene bank databases and phylogenetic trees were constructed to see the closely related similarity index between bacterial strains. Among 30 water samples tested, 18 were found positive for bacterial growth. Molecular identification of four selected bacterial strains indicated the alarming presence of pathogenic bacteria Bacillus spp . in most common commercial types of drinking water used in Saudi Arabia. The lack of awareness about good sanitation, poor personal hygienic practices and failure of safe water management and supply are the important factors for poor drinking water quality in these sources, need to be addressed.

  16. Phylogenetic context determines the role of competition in adaptive radiation

    PubMed Central

    Tan, Jiaqi; Slattery, Matthew R.; Yang, Xian; Jiang, Lin

    2016-01-01

    Understanding ecological mechanisms regulating the evolution of biodiversity is of much interest to ecologists and evolutionary biologists. Adaptive radiation constitutes an important evolutionary process that generates biodiversity. Competition has long been thought to influence adaptive radiation, but the directionality of its effect and associated mechanisms remain ambiguous. Here, we report a rigorous experimental test of the role of competition on adaptive radiation using the rapidly evolving bacterium Pseudomonas fluorescens SBW25 interacting with multiple bacterial species that differed in their phylogenetic distance to the diversifying bacterium. We showed that the inhibitive effect of competitors on the adaptive radiation of P. fluorescens decreased as their phylogenetic distance increased. To explain this phylogenetic dependency of adaptive radiation, we linked the phylogenetic distance between P. fluorescens and its competitors to their niche and competitive fitness differences. Competitive fitness differences, which showed weak phylogenetic signal, reduced P. fluorescens abundance and thus diversification, whereas phylogenetically conserved niche differences promoted diversification. These results demonstrate the context dependency of competitive effects on adaptive radiation, and highlight the importance of past evolutionary history for ongoing evolutionary processes. PMID:27335414

  17. Phylogenetic resolution and habitat specificity of members of the Photobacterium phosphoreum species group.

    PubMed

    Ast, Jennifer C; Dunlap, Paul V

    2005-10-01

    Substantial ambiguity exists regarding the phylogenetic status of facultatively psychrophilic luminous bacteria identified as Photobacterium phosphoreum, a species thought to be widely distributed in the world's oceans and believed to be the specific bioluminescent light-organ symbiont of several deep-sea fishes. Members of the P. phosphoreum species group include luminous and non-luminous strains identified phenotypically from a variety of different habitats as well as phylogenetically defined lineages that appear to be evolutionarily distinct. To resolve this ambiguity and to begin developing a meaningful knowledge of the geographic distributions, habitats and symbiotic relationships of bacteria in the P. phosphoreum species group, we carried out a multilocus, fine-scale phylogenetic analysis based on sequences of the 16S rRNA, gyrB and luxABFE genes of many newly isolated luminous strains from symbiotic and saprophytic habitats, together with previously isolated luminous and non-luminous strains identified as P. phosphoreum from these and other habitats. Parsimony analysis unambiguously resolved three evolutionarily distinct clades, phosphoreum, iliopiscarium and kishitanii. The tight phylogenetic clustering within these clades and the distinct separation between them indicates they are different species, P. phosphoreum, Photobacterium iliopiscarium and the newly recognized 'Photobacterium kishitanii'. Previously reported non-luminous strains, which had been identified phenotypically as P. phosphoreum, resolved unambiguously as P. iliopiscarium, and all examined deep-sea fishes (specimens of families Chlorophthalmidae, Macrouridae, Moridae, Trachichthyidae and Acropomatidae) were found to harbour 'P. kishitanii', not P. phosphoreum, in their light organs. This resolution revealed also that 'P. kishitanii' is cosmopolitan in its geographic distribution. Furthermore, the lack of phylogenetic variation within 'P. kishitanii' indicates that this facultatively

  18. Sequence variation and phylogenetic analysis of envelope glycoprotein of hepatitis G virus.

    PubMed

    Lim, M Y; Fry, K; Yun, A; Chong, S; Linnen, J; Fung, K; Kim, J P

    1997-11-01

    A transfusion-transmissible agent provisionally designated hepatitis G virus (HGV) was recently identified. In this study, we examined the variability of the HGV genome by analysing sequences in the putative envelope region from 72 isolates obtained from diverse geographical sources. The 1561 nucleotide sequence of the E1/E2/NS2a region of HGV was determined from 12 isolates, and compared with three published sequences. The most variability was observed in 400 nucleotides at the N terminus of E2. We next analysed this 400 nucleotide envelope variable region (EV) from an additional 60 HGV isolates. This sequence varied considerably among the 75 isolates, with overall identity ranging from 79.3% to 99.5% at the nucleotide level, and from 83.5% to 100% at the amino acid level. However, hypervariable regions were not identified. Phylogenetic analyses indicated that the 75 HGV isolates belong to a single genotype. A single-tier distribution of evolutionary distances was observed among the 15 E1/E2/NS2a sequences and the 75 EV sequences. In contrast, 11 isolates of HCV were analysed and showed a three-tiered distribution, representing genotypes, subtypes, and isolates. The 75 isolates of HGV fell into four clusters on the phylogenetic tree. Tight geographical clustering was observed among the HGV isolates from Japan and Korea.

  19. Phylogenetic relationships and diversity of β-rhizobia associated with Mimosa species grown in Sishuangbanna, China.

    PubMed

    Liu, Xiao Yun; Wu, Wei; Wang, En Tao; Zhang, Bin; Macdermott, Jomo; Chen, Wen Xin

    2011-02-01

    In order to investigate the genetic diversity of rhizobia associated with various exotic and invasive species in tropical mainland China, 116 bacterial isolates were obtained from Mimosa root nodules collected from Sishuangbanna and Yuanjiang districts of Yunnan province. Isolated rhizobia were characterized by RFLP analysis of 16S rRNA genes, SDS-PAGE of whole-cell proteins and BOX-PCR. Most of the isolated strains were identified as β-rhizobia belonging to diverse populations of Burkholderia and Cupriavidus, and the phylogenetic relationships of their 16S rRNA gene sequences showed that they were closely related to one of four β-rhizobia species: Burkholderia phymatum, B. mimosarum, B. caribensis or Cupriavidus taiwanensis. Additionally, among the 116 isolates, 53 different whole-cell SDS-PAGE profiles and 30 distinct BOX-PCR genotypic patterns were detected, which demonstrated the genetic and phenotypic diversity found within these Burkholderia and Cupriavidus strains. To the best of our knowledge, this is the first report that β-rhizobia are extant and possibly widespread on the Chinese mainland and nodulate easily with Mimosa plants. We also find it especially interesting that this appears to be the first report from mainland China of Cupriavidus symbionts of Mimosa. These records enrich our knowledge and understanding of the geographical distribution and diversity of these bacteria.

  20. Investigating Salmonella Eko from Various Sources in Nigeria by Whole Genome Sequencing to Identify the Source of Human Infections

    PubMed Central

    Leekitcharoenphon, Pimlapas; Raufu, Ibrahim; Nielsen, Mette T.; Rosenqvist Lund, Birthe S.; Ameh, James A.; Ambali, Abdul G.; Sørensen, Gitte; Le Hello, Simon; Aarestrup, Frank M.; Hendriksen, Rene S.

    2016-01-01

    Twenty-six Salmonella enterica serovar Eko isolated from various sources in Nigeria were investigated by whole genome sequencing to identify the source of human infections. Diversity among the isolates was observed and camel and cattle were identified as the primary reservoirs and the most likely source of the human infections. PMID:27228329

  1. Identifying EGRET Sources

    NASA Technical Reports Server (NTRS)

    Schlegel, E.; Norris, Jay P. (Technical Monitor)

    2002-01-01

    This project was awarded funding from the CGRO program to support ROSAT and ground-based observations of unidentified sources from data obtained by the EGRET instrument on the Compton Gamma-Ray Observatory. The critical items in the project are the individual ROSAT observations that are used to cover the 99% error circle of the unidentified EGRET source. Each error circle is a degree or larger in diameter. Each ROSAT field is about 30 deg in diameter. Hence, a number (>4) of ROSAT pointings must be obtained for each EGRET source to cover the field. The scheduling of ROSAT observations is carried out to maximize the efficiency of the total schedule. As a result, each pointing is broken into one or more sub-pointings of various exposure times. This project was awarded ROSAT observing time for four unidentified EGRET sources, summarized in the table. The column headings are defined as follows: 'Coverings' = number of observations to cover the error circle; 'SubPtg' = total number of sub-pointings to observe all of the coverings; 'Rec'd' = number of individual sub-pointings received to date; 'CompFlds' = number of individual coverings for which the requested complete exposure has been received. Processing of the data can not occur until a complete exposure has been accumulated for each covering.

  2. Identification of fecal contamination sources in water using host-associated markers.

    PubMed

    Krentz, Corinne A; Prystajecky, Natalie; Isaac-Renton, Judith

    2013-03-01

    In British Columbia, Canada, drinking water is tested for total coliforms and Escherichia coli, but there is currently no routine follow-up testing to investigate fecal contamination sources in samples that test positive for indicator bacteria. Reliable microbial source tracking (MST) tools to rapidly test water samples for multiple fecal contamination markers simultaneously are currently lacking. The objectives of this study were (i) to develop a qualitative MST tool to identify fecal contamination from different host groups, and (ii) to evaluate the MST tool using water samples with evidence of fecal contamination. Singleplex and multiplex polymerase chain reaction (PCR) were used to test (i) water from polluted sites and (ii) raw and drinking water samples for presence of bacterial genetic markers associated with feces from humans, cattle, seagulls, pigs, chickens, and geese. The multiplex MST assay correctly identified suspected contamination sources in contaminated waterways, demonstrating that this test may have utility for heavily contaminated sites. Most raw and drinking water samples analyzed using singleplex PCR contained at least one host-associated marker. Singleplex PCR was capable of detecting host-associated markers in small sample volumes and is therefore a promising tool to further analyze water samples submitted for routine testing and provide information useful for water quality management.

  3. Approach to identifying pollutant source and matching flow field

    NASA Astrophysics Data System (ADS)

    Liping, Pang; Yu, Zhang; Hongquan, Qu; Tao, Hu; Wei, Wang

    2013-07-01

    Accidental pollution events often threaten people's health and lives, and it is necessary to identify a pollutant source rapidly so that prompt actions can be taken to prevent the spread of pollution. But this identification process is one of the difficulties in the inverse problem areas. This paper carries out some studies on this issue. An approach using single sensor information with noise was developed to identify a sudden continuous emission trace pollutant source in a steady velocity field. This approach first compares the characteristic distance of the measured concentration sequence to the multiple hypothetical measured concentration sequences at the sensor position, which are obtained based on a source-three-parameter multiple hypotheses. Then we realize the source identification by globally searching the optimal values with the objective function of the maximum location probability. Considering the large amount of computation load resulting from this global searching, a local fine-mesh source search method based on priori coarse-mesh location probabilities is further used to improve the efficiency of identification. Studies have shown that the flow field has a very important influence on the source identification. Therefore, we also discuss the impact of non-matching flow fields with estimation deviation on identification. Based on this analysis, a method for matching accurate flow field is presented to improve the accuracy of identification. In order to verify the practical application of the above method, an experimental system simulating a sudden pollution process in a steady flow field was set up and some experiments were conducted when the diffusion coefficient was known. The studies showed that the three parameters (position, emission strength and initial emission time) of the pollutant source in the experiment can be estimated by using the method for matching flow field and source identification.

  4. Spatial phylogenetics of the native California flora.

    PubMed

    Thornhill, Andrew H; Baldwin, Bruce G; Freyman, William A; Nosratinia, Sonia; Kling, Matthew M; Morueta-Holme, Naia; Madsen, Thomas P; Ackerly, David D; Mishler, Brent D

    2017-10-26

    California is a world floristic biodiversity hotspot where the terms neo- and paleo-endemism were first applied. Using spatial phylogenetics, it is now possible to evaluate biodiversity from an evolutionary standpoint, including discovering significant areas of neo- and paleo-endemism, by combining spatial information from museum collections and DNA-based phylogenies. Here we used a distributional dataset of 1.39 million herbarium specimens, a phylogeny of 1083 operational taxonomic units (OTUs) and 9 genes, and a spatial randomization test to identify regions of significant phylogenetic diversity, relative phylogenetic diversity, and phylogenetic endemism (PE), as well as to conduct a categorical analysis of neo- and paleo-endemism (CANAPE). We found (1) extensive phylogenetic clustering in the South Coast Ranges, southern Great Valley, and deserts of California; (2) significant concentrations of short branches in the Mojave and Great Basin Deserts and the South Coast Ranges and long branches in the northern Great Valley, Sierra Nevada foothills, and the northwestern and southwestern parts of the state; (3) significant concentrations of paleo-endemism in Northwestern California, the northern Great Valley, and western Sonoran Desert, and neo-endemism in the White-Inyo Range, northern Mojave Desert, and southern Channel Islands. Multiple analyses were run to observe the effects on significance patterns of using different phylogenetic tree topologies (uncalibrated trees versus time-calibrated ultrametric trees) and using different representations of OTU ranges (herbarium specimen locations versus species distribution models). These analyses showed that examining the geographic distributions of branch lengths in a statistical framework adds a new dimension to California floristics that, in comparison with climatic data, helps to illuminate causes of endemism. In particular, the concentration of significant PE in more arid regions of California extends previous ideas

  5. Teaching Molecular Phylogenetics through Investigating a Real-World Phylogenetic Problem

    ERIC Educational Resources Information Center

    Zhang, Xiaorong

    2012-01-01

    A phylogenetics exercise is incorporated into the "Introduction to biocomputing" course, a junior-level course at Savannah State University. This exercise is designed to help students learn important concepts and practical skills in molecular phylogenetics through solving a real-world problem. In this application, students are required to identify…

  6. Phylogenetically-informed priorities for amphibian conservation.

    PubMed

    Isaac, Nick J B; Redding, David W; Meredith, Helen M; Safi, Kamran

    2012-01-01

    The amphibian decline and extinction crisis demands urgent action to prevent further large numbers of species extinctions. Lists of priority species for conservation, based on a combination of species' threat status and unique contribution to phylogenetic diversity, are one tool for the direction and catalyzation of conservation action. We describe the construction of a near-complete species-level phylogeny of 5713 amphibian species, which we use to create a list of evolutionarily distinct and globally endangered species (EDGE list) for the entire class Amphibia. We present sensitivity analyses to test the robustness of our priority list to uncertainty in species' phylogenetic position and threat status. We find that both sources of uncertainty have only minor impacts on our 'top 100' list of priority species, indicating the robustness of the approach. By contrast, our analyses suggest that a large number of Data Deficient species are likely to be high priorities for conservation action from the perspective of their contribution to the evolutionary history.

  7. What is the danger of the anomaly zone for empirical phylogenetics?

    PubMed

    Huang, Huateng; Knowles, L Lacey

    2009-10-01

    The increasing number of observations of gene trees with discordant topologies in phylogenetic studies has raised awareness about the problems of incongruence between species trees and gene trees. Moreover, theoretical treatments focusing on the impact of coalescent variance on phylogenetic study have also identified situations where the most probable gene trees are ones that do not match the underlying species tree (i.e., anomalous gene trees [AGTs]). However, although the theoretical proof of the existence of AGTs is alarming, the actual risk that AGTs pose to empirical phylogenetic study is far from clear. Establishing the conditions (i.e., the branch lengths in a species tree) for which AGTs are possible does not address the critical issue of how prevalent they might be. Furthermore, theoretical characterization of the species trees for which AGTs may pose a problem (i.e., the anomaly zone or the species histories for which AGTs are theoretically possible) is based on consideration of just one source of variance that contributes to species tree and gene tree discord-gene lineage coalescence. Yet, empirical data contain another important stochastic component-mutational variance. Estimated gene trees will differ from the underlying gene trees (i.e., the actual genealogy) because of the random process of mutation. Here, we take a simulation approach to investigate the prevalence of AGTs, among estimated gene trees, thereby characterizing the boundaries of the anomaly zone taking into account both coalescent and mutational variances. We also determine the frequency of realized AGTs, which is critical to putting the theoretical work on AGTs into a realistic biological context. Two salient results emerge from this investigation. First, our results show that mutational variance can indeed expand the parameter space (i.e., the relative branch lengths in a species tree) where AGTs might be observed in empirical data. By exploring the underlying cause for the expanded

  8. 16S rRNA gene-based association study identified microbial taxa associated with pork intramuscular fat content in feces and cecum lumen.

    PubMed

    Fang, Shaoming; Xiong, Xingwei; Su, Ying; Huang, Lusheng; Chen, Congying

    2017-07-19

    Intramuscular fat (IMF) that deposits among muscle fibers or within muscle cells is an important meat quality trait in pigs. Previous studies observed the effects of dietary nutrients and additives on improving the pork IMF. Gut microbiome plays an important role in host metabolism and energy harvest. Whether gut microbiota exerts effect on IMF remains unknown. In this study, we investigated the microbial community structure of 500 samples from porcine cecum and feces using high-throughput 16S rRNA gene sequencing. We found that phylogenetic composition and potential function capacity of microbiome varied between two types of samples. Bacteria wide association study identified 119 OTUs significantly associated with IMF in the two types of samples (FDR < 0.1). Most of the IMF-associated OTUs belong to the bacteria related to polysaccharide degradation and amino acid metabolism (such as Prevotella, Treponema, Bacteroides and Clostridium). Potential function capacities related to metabolisms of carbohydrate, energy and amino acids, cell motility, and membrane transport were significantly associated with IMF content. FishTaco analysis suggested that the shifts of potential function capacities of microbiome associated with IMF might be caused by the IMF-associated microbial taxa. This study firstly evaluated the contribution of gut microbiome to porcine IMF content. The results presented a potential capacity for improving IMF through modulating gut microbiota.

  9. Phylogenetic characterization and in situ detection of bacterial communities associated with seahorses (Hippocampus guttulatus) in captivity.

    PubMed

    Balcázar, José L; Lee, Natuschka M; Pintado, José; Planas, Miquel

    2010-03-01

    Although there are several studies describing bacteria associated with marine fish, the bacterial composition associated with seahorses has not been extensively investigated since these studies have been restricted to the identification of bacterial pathogens. In this study, the phylogenetic affiliation of seahorse-associated bacteria was assessed by 16S rRNA gene sequencing of cloned DNA fragments. Fluorescence in situ hybridization (FISH) was used to confirm the presence of the predominant groups indicated by 16S rRNA analysis. Both methods revealed that Vibrionaceae was the dominant population in Artemia sp. (live prey) and intestinal content of the seahorses, while Rhodobacteraceae was dominant in water samples from the aquaculture system and cutaneous mucus of the seahorses. To our knowledge, this is the first time that bacterial communities associated with healthy seahorses in captivity have been described. Crown Copyright 2010. Published by Elsevier GmbH. All rights reserved.

  10. Identifying the TeV gamma-ray source MGRO J2228+61, FINALLY!

    NASA Astrophysics Data System (ADS)

    Aliu, Ester

    2012-09-01

    New VERITAS observations of MGRO J2228+61 allow us to associate its TeV emission with the enigmatic radio supernova remnant SNR G106.3+2.7. This remnant is part of a large complex that includes the Boomerang pulsar and nebula. The reduced field suggests that the TeV emission is not powered by the Boomerang, but instead associated with a much larger remnant. A recent SUZAKU X-ray observation of the smaller gamma-ray error box reveals two possible pulsar candidates. We propose short ACIS exposures to identify these sources to determine if one or both can be responsible for the gamma-ray emission. This will allow us to address the long standing problem on the nature of both MGRO J2228+61 and SNR G106.3+2.7.

  11. Nonbinary Tree-Based Phylogenetic Networks.

    PubMed

    Jetten, Laura; van Iersel, Leo

    2018-01-01

    Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.

  12. Phylogenetic and structural response of heterotrophic bacteria to dissolved organic matter of different chemical composition in a continuous culture study.

    PubMed

    Landa, M; Cottrell, M T; Kirchman, D L; Kaiser, K; Medeiros, P M; Tremblay, L; Batailler, N; Caparros, J; Catala, P; Escoubeyrou, K; Oriol, L; Blain, S; Obernosterer, I

    2014-06-01

    Dissolved organic matter (DOM) and heterotrophic bacteria are highly diverse components of the ocean system, and their interactions are key in regulating the biogeochemical cycles of major elements. How chemical and phylogenetic diversity are linked remains largely unexplored to date. To investigate interactions between bacterial diversity and DOM, we followed the response of natural bacterial communities to two sources of phytoplankton-derived DOM over six bacterial generation times in continuous cultures. Analyses of total hydrolysable neutral sugars and amino acids, and ultrahigh resolution mass spectrometry revealed large differences in the chemical composition of the two DOM sources. According to 454 pyrosequences of 16S ribosomal ribonucleic acid genes, diatom-derived DOM sustained higher levels of bacterial richness, evenness and phylogenetic diversity than cyanobacteria-derived DOM. These distinct community structures were, however, not associated with specific taxa. Grazing pressure affected bacterial community composition without changing the overall pattern of bacterial diversity levels set by DOM. Our results demonstrate that resource composition can shape several facets of bacterial diversity without influencing the phylogenetic composition of bacterial communities, suggesting functional redundancy at different taxonomic levels for the degradation of phytoplankton-derived DOM. © 2013 Society for Applied Microbiology and John Wiley & Sons Ltd.

  13. Disentangling the phylogenetic and ecological components of spider phenotypic variation.

    PubMed

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure.

  14. Disentangling the Phylogenetic and Ecological Components of Spider Phenotypic Variation

    PubMed Central

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure. PMID:24651264

  15. Increased phylogenetic resolution within the ecologically important Rhizopogon subgenus Amylopogon using 10 anonymous nuclear loci.

    PubMed

    Dowie, Nicholas J; Grubisha, Lisa C; Burton, Brent A; Klooster, Matthew R; Miller, Steven L

    2017-01-01

    Rhizopogon species are ecologically significant ectomycorrhizal fungi in conifer ecosystems. The importance of this system merits the development and utilization of a more robust set of molecular markers specifically designed to evaluate their evolutionary ecology. Anonymous nuclear loci (ANL) were developed for R. subgenus Amylopogon. Members of this subgenus occur throughout the United States and are exclusive fungal symbionts associated with Pterospora andromedea, a threatened mycoheterotrophic plant endemic to disjunct eastern and western regions of North America. Candidate ANL were developed from 454 shotgun pyrosequencing and assessed for positive amplification across targeted species, sequencing success, and recovery of phylogenetically informative sites. Ten ANL were successfully developed and were subsequently used to sequence representative taxa, herbaria holotype and paratype specimens in R. subgenus Amylopogon. Phylogenetic reconstructions were performed on individual and concatenated data sets by Bayesian inference and maximum likelihood methods. Phylogenetic analyses of these 10 ANL were compared with a phylogeny traditionally constructed using the universal fungal barcode nuc rDNA ITS1-5.8S-ITS2 region (ITS). The resulting ANL phylogeny was consistent with most of the species designations delineated by ITS. However, the ANL phylogeny provided much greater phylogenetic resolution, yielding new evidence for cryptic species within previously defined species of R. subgenus Amylopogon. Additionally, the rooted ANL phylogeny provided an alternate topology to the ITS phylogeny, which inferred a novel set of evolutionary relationships not identified in prior phylogenetic studies.

  16. A guide to phylogenetic metrics for conservation, community ecology and macroecology

    PubMed Central

    Cadotte, Marc W.; Carvalho, Silvia B.; Davies, T. Jonathan; Ferrier, Simon; Fritz, Susanne A.; Grenyer, Rich; Helmus, Matthew R.; Jin, Lanna S.; Mooers, Arne O.; Pavoine, Sandrine; Purschke, Oliver; Redding, David W.; Rosauer, Dan F.; Winter, Marten; Mazel, Florent

    2016-01-01

    ABSTRACT The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub‐disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub‐disciplines hampers potential meta‐analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo‐diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo‐diversity metrics based on their mathematical form within these three dimensions and identify ‘anchor’ representatives: for α‐diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. PMID:26785932

  17. Phylogenetic studies of transmission dynamics in generalized HIV epidemics: An essential tool where the burden is greatest?

    PubMed Central

    Dennis, Ann M.; Herbeck, Joshua T.; Brown, Andrew Leigh; Kellam, Paul; de Oliveira, Tulio; Pillay, Deenan; Fraser, Christophe; Cohen, Myron S.

    2014-01-01

    Efficient and effective HIV prevention measures for generalized epidemics in sub-Saharan Africa have not yet been validated at the population-level. Design and impact evaluation of such measures requires fine-scale understanding of local HIV transmission dynamics. The novel tools of HIV phylogenetics and molecular epidemiology may elucidate these transmission dynamics. Such methods have been incorporated into studies of concentrated HIV epidemics to identify proximate and determinant traits associated with ongoing transmission. However, applying similar phylogenetic analyses to generalized epidemics, including the design and evaluation of prevention trials, presents additional challenges. Here we review the scope of these methods and present examples of their use in concentrated epidemics in the context of prevention. Next, we describe the current uses for phylogenetics in generalized epidemics, and discuss their promise for elucidating transmission patterns and informing prevention trials. Finally, we review logistic and technical challenges inherent to large-scale molecular epidemiological studies of generalized epidemics, and suggest potential solutions. PMID:24977473

  18. Identifying Sources of Fecal Contamination in Streams Associated with Chicken Farms

    EPA Science Inventory

    Poultry is responsible for 44% of the total feces production in the U.S., followed by cattle and swine. The large U.S. production of feces poses a contamination risk for affected watersheds across the country. To aid in the identification of the sources of contamination, many D...

  19. Phylogenetic community structure: temporal variation in fish assemblage

    PubMed Central

    Santorelli, Sergio; Magnusson, William; Ferreira, Efrem; Caramaschi, Erica; Zuanon, Jansen; Amadio, Sidnéia

    2014-01-01

    Hypotheses about phylogenetic relationships among species allow inferences about the mechanisms that affect species coexistence. Nevertheless, most studies assume that phylogenetic patterns identified are stable over time. We used data on monthly samples of fish from a single lake over 10 years to show that the structure in phylogenetic assemblages varies over time and conclusions depend heavily on the time scale investigated. The data set was organized in guild structures and temporal scales (grouped at three temporal scales). Phylogenetic distance was measured as the mean pairwise distances (MPD) and as mean nearest-neighbor distance (MNTD). Both distances were based on counts of nodes. We compared the observed values of MPD and MNTD with values that were generated randomly using null model independent swap. A serial runs test was used to assess the temporal independence of indices over time. The phylogenetic pattern in the whole assemblage and the functional groups varied widely over time. Conclusions about phylogenetic clustering or dispersion depended on the temporal scales. Conclusions about the frequency with which biotic processes and environmental filters affect the local assembly do not depend only on taxonomic grouping and spatial scales. While these analyzes allow the assertion that all proposed patterns apply to the fish assemblages in the floodplain, the assessment of the relative importance of these processes, and how they vary depending on the temporal scale and functional group studied, cannot be determined with the effort commonly used. It appears that, at least in the system that we studied, the assemblages are forming and breaking continuously, resulting in various phylogeny-related structures that makes summarizing difficult. PMID:25360256

  20. IcyTree: rapid browser-based visualization for phylogenetic trees and networks

    PubMed Central

    2017-01-01

    Abstract Summary: IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. Availability and Implementation: IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree. Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. Contact: tgvaughan@gmail.com PMID:28407035

  1. IcyTree: rapid browser-based visualization for phylogenetic trees and networks.

    PubMed

    Vaughan, Timothy G

    2017-08-01

    IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree . Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. tgvaughan@gmail.com. © The Author(s) 2017. Published by Oxford University Press.

  2. Phylogenetic support for the Tropical Niche Conservatism Hypothesis despite the absence of a clear latitudinal species richness gradient in Yunnan's woody flora

    NASA Astrophysics Data System (ADS)

    Tang, G.; Zhang, M. G.; Liu, C.; Zhou, Z.; Chen, W.; Slik, J. W. F.

    2014-05-01

    The Tropical Niche Conservatism Hypothesis (TCH) tries to explain the generally observed latitudinal gradient of increasing species diversity towards the tropics. To date, few studies have used phylogenetic approaches to assess its validity, even though such methods are especially suited to detect changes in niche structure. We test the TCH using modeled distributions of 1898 woody species in Yunnan Province (southwest China) in combination with a family level phylogeny. Unlike predicted, species richness and phylogenetic diversity did not show a latitudinal gradient, but identified two high diversity zones, one in Northwest and one in South Yunnan. Despite this, the underlying residual phylogenetic diversity showed a clear decline away from the tropics, while the species composition became progressingly more phylogenetically clustered towards the North. These latitudinal changes were strongly associated with more extreme temperature variability and declining precipitation and soil water availability, especially during the dry season. Our results suggests that the climatically more extreme conditions outside the tropics require adaptations for successful colonization, most likely related to the plant hydraulic system, that have been acquired by only a limited number of phylogenetically closely related plant lineages. We emphasize the importance of phylogenetic approaches for testing the TCH.

  3. Assessing the relationships between phylogenetic and functional singularities in sharks (Chondrichthyes).

    PubMed

    Cachera, Marie; Le Loc'h, François

    2017-08-01

    The relationships between diversity and ecosystem functioning have become a major focus of science. A crucial issue is to estimate functional diversity, as it is intended to impact ecosystem dynamics and stability. However, depending on the ecosystem, it may be challenging or even impossible to directly measure ecological functions and thus functional diversity. Phylogenetic diversity was recently under consideration as a proxy for functional diversity. Phylogenetic diversity is indeed supposed to match functional diversity if functions are conservative traits along evolution. However, in case of adaptive radiation and/or evolutive convergence, a mismatch may appear between species phylogenetic and functional singularities. Using highly threatened taxa, sharks, this study aimed to explore the relationships between phylogenetic and functional diversities and singularities. Different statistical computations were used in order to test both methodological issue (phylogenetic reconstruction) and overall a theoretical questioning: the predictive power of phylogeny for function diversity. Despite these several methodological approaches, a mismatch between phylogeny and function was highlighted. This mismatch revealed that (i) functions are apparently nonconservative in shark species, and (ii) phylogenetic singularity is not a proxy for functional singularity. Functions appeared to be not conservative along the evolution of sharks, raising the conservational challenge to identify and protect both phylogenetic and functional singular species. Facing the current rate of species loss, it is indeed of major importance to target phylogenetically singular species to protect genetic diversity and also functionally singular species in order to maintain particular functions within ecosystem.

  4. Assessment of source-specific health effects associated with an unknown number of major sources of multiple air pollutants: a unified Bayesian approach.

    PubMed

    Park, Eun Sug; Hopke, Philip K; Oh, Man-Suk; Symanski, Elaine; Han, Daikwon; Spiegelman, Clifford H

    2014-07-01

    There has been increasing interest in assessing health effects associated with multiple air pollutants emitted by specific sources. A major difficulty with achieving this goal is that the pollution source profiles are unknown and source-specific exposures cannot be measured directly; rather, they need to be estimated by decomposing ambient measurements of multiple air pollutants. This estimation process, called multivariate receptor modeling, is challenging because of the unknown number of sources and unknown identifiability conditions (model uncertainty). The uncertainty in source-specific exposures (source contributions) as well as uncertainty in the number of major pollution sources and identifiability conditions have been largely ignored in previous studies. A multipollutant approach that can deal with model uncertainty in multivariate receptor models while simultaneously accounting for parameter uncertainty in estimated source-specific exposures in assessment of source-specific health effects is presented in this paper. The methods are applied to daily ambient air measurements of the chemical composition of fine particulate matter ([Formula: see text]), weather data, and counts of cardiovascular deaths from 1995 to 1997 for Phoenix, AZ, USA. Our approach for evaluating source-specific health effects yields not only estimates of source contributions along with their uncertainties and associated health effects estimates but also estimates of model uncertainty (posterior model probabilities) that have been ignored in previous studies. The results from our methods agreed in general with those from the previously conducted workshop/studies on the source apportionment of PM health effects in terms of number of major contributing sources, estimated source profiles, and contributions. However, some of the adverse source-specific health effects identified in the previous studies were not statistically significant in our analysis, which probably resulted because we

  5. Identifying sources of emerging organic contaminants in a mixed use watershed using principal components analysis.

    PubMed

    Karpuzcu, M Ekrem; Fairbairn, David; Arnold, William A; Barber, Brian L; Kaufenberg, Elizabeth; Koskinen, William C; Novak, Paige J; Rice, Pamela J; Swackhamer, Deborah L

    2014-01-01

    Principal components analysis (PCA) was used to identify sources of emerging organic contaminants in the Zumbro River watershed in Southeastern Minnesota. Two main principal components (PCs) were identified, which together explained more than 50% of the variance in the data. Principal Component 1 (PC1) was attributed to urban wastewater-derived sources, including municipal wastewater and residential septic tank effluents, while Principal Component 2 (PC2) was attributed to agricultural sources. The variances of the concentrations of cotinine, DEET and the prescription drugs carbamazepine, erythromycin and sulfamethoxazole were best explained by PC1, while the variances of the concentrations of the agricultural pesticides atrazine, metolachlor and acetochlor were best explained by PC2. Mixed use compounds carbaryl, iprodione and daidzein did not specifically group with either PC1 or PC2. Furthermore, despite the fact that caffeine and acetaminophen have been historically associated with human use, they could not be attributed to a single dominant land use category (e.g., urban/residential or agricultural). Contributions from septic systems did not clarify the source for these two compounds, suggesting that additional sources, such as runoff from biosolid-amended soils, may exist. Based on these results, PCA may be a useful way to broadly categorize the sources of new and previously uncharacterized emerging contaminants or may help to clarify transport pathways in a given area. Acetaminophen and caffeine were not ideal markers for urban/residential contamination sources in the study area and may need to be reconsidered as such in other areas as well.

  6. Mixed-up trees: the structure of phylogenetic mixtures.

    PubMed

    Matsen, Frederick A; Mossel, Elchanan; Steel, Mike

    2008-05-01

    In this paper, we apply new geometric and combinatorial methods to the study of phylogenetic mixtures. The focus of the geometric approach is to describe the geometry of phylogenetic mixture distributions for the two state random cluster model, which is a generalization of the two state symmetric (CFN) model. In particular, we show that the set of mixture distributions forms a convex polytope and we calculate its dimension; corollaries include a simple criterion for when a mixture of branch lengths on the star tree can mimic the site pattern frequency vector of a resolved quartet tree. Furthermore, by computing volumes of polytopes we can clarify how "common" non-identifiable mixtures are under the CFN model. We also present a new combinatorial result which extends any identifiability result for a specific pair of trees of size six to arbitrary pairs of trees. Next we present a positive result showing identifiability of rates-across-sites models. Finally, we answer a question raised in a previous paper concerning "mixed branch repulsion" on trees larger than quartet trees under the CFN model.

  7. Treetrimmer: a method for phylogenetic dataset size reduction.

    PubMed

    Maruyama, Shinichiro; Eveleigh, Robert J M; Archibald, John M

    2013-04-12

    With rapid advances in genome sequencing and bioinformatics, it is now possible to generate phylogenetic trees containing thousands of operational taxonomic units (OTUs) from a wide range of organisms. However, use of rigorous tree-building methods on such large datasets is prohibitive and manual 'pruning' of sequence alignments is time consuming and raises concerns over reproducibility. There is a need for bioinformatic tools with which to objectively carry out such pruning procedures. Here we present 'TreeTrimmer', a bioinformatics procedure that removes unnecessary redundancy in large phylogenetic datasets, alleviating the size effect on more rigorous downstream analyses. The method identifies and removes user-defined 'redundant' sequences, e.g., orthologous sequences from closely related organisms and 'recently' evolved lineage-specific paralogs. Representative OTUs are retained for more rigorous re-analysis. TreeTrimmer reduces the OTU density of phylogenetic trees without sacrificing taxonomic diversity while retaining the original tree topology, thereby speeding up downstream computer-intensive analyses, e.g., Bayesian and maximum likelihood tree reconstructions, in a reproducible fashion.

  8. Phylogenetic Diversity in the Macromolecular Composition of Microalgae

    PubMed Central

    Finkel, Zoe V.; Follows, Mick J.; Liefer, Justin D.; Brown, Chris M.; Benner, Ina; Irwin, Andrew J.

    2016-01-01

    The elemental stoichiometry of microalgae reflects their underlying macromolecular composition and influences competitive interactions among species and their role in the food web and biogeochemistry. Here we provide a new estimate of the macromolecular composition of microalgae using a hierarchical Bayesian analysis of data compiled from the literature. The median macromolecular composition of nutrient-sufficient exponentially growing microalgae is 32.2% protein, 17.3% lipid, 15.0% carbohydrate, 17.3% ash, 5.7% RNA, 1.1% chlorophyll-a and 1.0% DNA as percent dry weight. Our analysis identifies significant phylogenetic differences in macromolecular composition undetected by previous studies due to small sample sizes and the large inherent variability in macromolecular pools. The phylogenetic differences in macromolecular composition lead to variations in carbon-to-nitrogen ratios that are consistent with independent observations. These phylogenetic differences in macromolecular and elemental composition reflect adaptations in cellular architecture and biochemistry; specifically in the cell wall, the light harvesting apparatus, and storage pools. PMID:27228080

  9. Using an epiphytic moss to identify previously unknown sources of atmospheric cadmium pollution

    Treesearch

    Geoffrey H. Donovan; Sarah E. Jovan; Demetrios Gatziolis; Igor Burstyn; Yvonne L. Michael; Michael C. Amacher; Vicente J. Monleon

    2016-01-01

    Urban networks of air-quality monitors are often too widely spaced to identify sources of air pollutants, especially if they do not disperse far from emission sources. The objectives of this study were to test the use of moss bio-indicators to develop a fine-scale map of atmospherically-derived cadmium and to identify the sources of cadmium in a complex urban setting....

  10. The power and pitfalls of HIV phylogenetics in public health.

    PubMed

    Brooks, James I; Sandstrom, Paul A

    2013-07-25

    Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.

  11. Turnover of plant lineages shapes herbivore phylogenetic beta diversity along ecological gradients.

    PubMed

    Pellissier, Loïc; Ndiribe, Charlotte; Dubuis, Anne; Pradervand, Jean-Nicolas; Salamin, Nicolas; Guisan, Antoine; Rasmann, Sergio

    2013-05-01

    Understanding drivers of biodiversity patterns is of prime importance in this era of severe environmental crisis. More diverse plant communities have been postulated to represent a larger functional trait-space, more likely to sustain a diverse assembly of herbivore species. Here, we expand this hypothesis to integrate environmental, functional and phylogenetic variation of plant communities as factors explaining the diversity of lepidopteran assemblages along elevation gradients in the Swiss Western Alps. According to expectations, we found that the association between butterflies and their host plants is highly phylogenetically structured. Multiple regression analyses showed the combined effect of climate, functional traits and phylogenetic diversity in structuring butterfly communities. Furthermore, we provide the first evidence that plant phylogenetic beta diversity is the major driver explaining butterfly phylogenetic beta diversity. Along ecological gradients, the bottom up control of herbivore diversity is thus driven by phylogenetically structured turnover of plant traits as well as environmental variables. © 2013 Blackwell Publishing Ltd/CNRS.

  12. Cellulolytic Streptomyces Strains Associated with Herbivorous Insects Share a Phylogenetically Linked Capacity To Degrade Lignocellulose

    PubMed Central

    Book, Adam J.; Lewin, Gina R.; McDonald, Bradon R.; Takasuka, Taichi E.; Doering, Drew T.; Adams, Aaron S.; Blodgett, Joshua A. V.; Clardy, Jon; Raffa, Kenneth F.; Fox, Brian G.

    2014-01-01

    Actinobacteria in the genus Streptomyces are critical players in microbial communities that decompose complex carbohydrates in the soil, and these bacteria have recently been implicated in the deconstruction of plant polysaccharides for some herbivorous insects. Despite the importance of Streptomyces to carbon cycling, the extent of their plant biomass-degrading ability remains largely unknown. In this study, we compared four strains of Streptomyces isolated from insect herbivores that attack pine trees: DpondAA-B6 (SDPB6) from the mountain pine beetle, SPB74 from the southern pine beetle, and SirexAA-E (SACTE) and SirexAA-G from the woodwasp, Sirex noctilio. Biochemical analysis of secreted enzymes demonstrated that only two of these strains, SACTE and SDPB6, were efficient at degrading plant biomass. Genomic analyses indicated that SACTE and SDPB6 are closely related and that they share similar compositions of carbohydrate-active enzymes. Genome-wide proteomic and transcriptomic analyses revealed that the major exocellulases (GH6 and GH48), lytic polysaccharide monooxygenases (AA10), and mannanases (GH5) were conserved and secreted by both organisms, while the secreted endocellulases (GH5 and GH9 versus GH9 and GH12) were from diverged enzyme families. Together, these data identify two phylogenetically related insect-associated Streptomyces strains with high biomass-degrading activity and characterize key enzymatic similarities and differences used by these organisms to deconstruct plant biomass. PMID:24837391

  13. Study of Clinical Survival and Gene Expression in a Sample of Pancreatic Ductal Adenocarcinoma by Parsimony Phylogenetic Analysis.

    PubMed

    Nalbantoglu, Sinem; Abu-Asab, Mones; Tan, Ming; Zhang, Xuemin; Cai, Ling; Amri, Hakima

    2016-07-01

    Pancreatic ductal adenocarcinoma (PDAC) is one of the rapidly growing forms of pancreatic cancer with a poor prognosis and less than 5% 5-year survival rate. In this study, we characterized the genetic signatures and signaling pathways related to survival from PDAC, using a parsimony phylogenetic algorithm. We applied the parsimony phylogenetic algorithm to analyze the publicly available whole-genome in silico array analysis of a gene expression data set in 25 early-stage human PDAC specimens. We explain here that the parsimony phylogenetics is an evolutionary analytical method that offers important promise to uncover clonal (driver) and nonclonal (passenger) aberrations in complex diseases. In our analysis, parsimony and statistical analyses did not identify significant correlations between survival times and gene expression values. Thus, the survival rankings did not appear to be significantly different between patients for any specific gene (p > 0.05). Also, we did not find correlation between gene expression data and tumor stage in the present data set. While the present analysis was unable to identify in this relatively small sample of patients a molecular signature associated with pancreatic cancer prognosis, we suggest that future research and analyses with the parsimony phylogenetic algorithm in larger patient samples are worthwhile, given the devastating nature of pancreatic cancer and its early diagnosis, and the need for novel data analytic approaches. The future research practices might want to place greater emphasis on phylogenetics as one of the analytical paradigms, as our findings presented here are on the cusp of this shift, especially in the current era of Big Data and innovation policies advocating for greater data sharing and reanalysis.

  14. Identifying source populations for the reintroduction of the Eurasian beaver, Castor fiber L. 1758, into Britain: evidence from ancient DNA.

    PubMed

    Marr, Melissa M; Brace, Selina; Schreve, Danielle C; Barnes, Ian

    2018-02-09

    Establishing true phylogenetic relationships between populations is a critical consideration when sourcing individuals for translocation. This presents huge difficulties with threatened and endangered species that have become extirpated from large areas of their former range. We utilise ancient DNA (aDNA) to reconstruct the phylogenetic relationships of a keystone species which has become extinct in Britain, the Eurasian beaver Castor fiber. We sequenced seventeen 492 bp partial tRNAPro and control region sequences from Late Pleistocene and Holocene age beavers and included these in network, demographic and genealogy analyses. The mode of postglacial population expansion from refugia was investigated by employing tests of neutrality and a pairwise mismatch distribution analysis. We found evidence of a pre-Late Glacial Maximum ancestor for the Western C. fiber clade which experienced a rapid demographic expansion during the terminal Pleistocene to early Holocene period. Ancient British beavers were found to originate from the Western phylogroup but showed no phylogenetic affinity to any one modern relict population over another. Instead, we find that they formed part of a large, continuous, pan-Western European clade that harbored little internal substructure. Our study highlights the utility of aDNA in reconstructing population histories of extirpated species which has real-world implications for conservation planning.

  15. jsPhyloSVG: a javascript library for visualizing interactive and vector-based phylogenetic trees on the web.

    PubMed

    Smits, Samuel A; Ouverney, Cleber C

    2010-08-18

    Many software packages have been developed to address the need for generating phylogenetic trees intended for print. With an increased use of the web to disseminate scientific literature, there is a need for phylogenetic trees to be viewable across many types of devices and feature some of the interactive elements that are integral to the browsing experience. We propose a novel approach for publishing interactive phylogenetic trees. We present a javascript library, jsPhyloSVG, which facilitates constructing interactive phylogenetic trees from raw Newick or phyloXML formats directly within the browser in Scalable Vector Graphics (SVG) format. It is designed to work across all major browsers and renders an alternative format for those browsers that do not support SVG. The library provides tools for building rectangular and circular phylograms with integrated charting. Interactive features may be integrated and made to respond to events such as clicks on any element of the tree, including labels. jsPhyloSVG is an open-source solution for rendering dynamic phylogenetic trees. It is capable of generating complex and interactive phylogenetic trees across all major browsers without the need for plugins. It is novel in supporting the ability to interpret the tree inference formats directly, exposing the underlying markup to data-mining services. The library source code, extensive documentation and live examples are freely accessible at www.jsphylosvg.com.

  16. Identifying fecal sources in a selected catchment reach using multiple source-tracking tools

    USGS Publications Warehouse

    Vogel, J.R.; Stoeckel, D.M.; Lamendella, R.; Zelt, R.B.; Santo, Domingo J.W.; Walker, S.R.; Oerther, D.B.

    2007-01-01

    Given known limitations of current microbial source-tracking (MST) tools, emphasis on small, simple study areas may enhance interpretations of fecal contamination sources in streams. In this study, three MST tools - Escherichia coli repetitive element polymerase chain reaction (rep-PCR), coliphage typing, and Bacteroidales 16S rDNA host-associated markers - were evaluated in a selected reach of Plum Creek in sooth-central Nebraska. Water-quality samples were collected from six sites. One reach was selected for MST evaluation based on observed patterns of E. coli contamination. Despite high E. coli concentrations, coliphages were detected only once among water samples, precluding their use as a MST tool in this setting. Rep-PCR classification of E. coli isolates from both water and sediment samples supported the hypothesis that cattle and wildlife were dominant sources of fecal contamination, with minor contributions by horses and humans. Conversely, neither ruminant nor human sources were detected by Bacteroidales markers in most water samples. In bed sediment, ruminant- and human-associated Bacteroidales markers were detected throughout the interval from 0 to 0.3 m, with detections independent of E. coli concentrations in the sediment. Although results by E. coli-based and Bacteroidales-based MST methods led to similar interpretations, detection of Bacteroidales markers in sediment more commonly than in water indicates that different tools to track fecal contamination (in this case, tools based on Bacteroidales DNA and E. coli isolates) may have varying relevance to the more specific goal of tracking the sources of E. coli in watersheds. This is the first report of simultaneous, toolbox approach application of a library-based and marker-based MST analyses to lowing surface water. ?? ASA, CSSA, SSSA.

  17. Chemical Analyses of Wasp-Associated Streptomyces Bacteria Reveal a Prolific Potential for Natural Products Discovery

    PubMed Central

    Clardy, Jon; Currie, Cameron R.

    2011-01-01

    Identifying new sources for small molecule discovery is necessary to help mitigate the continuous emergence of antibiotic-resistance in pathogenic microbes. Recent studies indicate that one potentially rich source of novel natural products is Actinobacterial symbionts associated with social and solitary Hymenoptera. Here we test this possibility by examining two species of solitary mud dauber wasps, Sceliphron caementarium and Chalybion californicum. We performed enrichment isolations from 33 wasps and obtained more than 200 isolates of Streptomyces Actinobacteria. Chemical analyses of 15 of these isolates identified 11 distinct and structurally diverse secondary metabolites, including a novel polyunsaturated and polyoxygenated macrocyclic lactam, which we name sceliphrolactam. By pairing the 15 Streptomyces strains against a collection of fungi and bacteria, we document their antifungal and antibacterial activity. The prevalence and anti-microbial properties of Actinobacteria associated with these two solitary wasp species suggest the potential role of these Streptomyces as antibiotic-producing symbionts, potentially helping defend their wasp hosts from pathogenic microbes. Finding phylogenetically diverse and chemically prolific Actinobacteria from solitary wasps suggests that insect-associated Actinobacteria can provide a valuable source of novel natural products of pharmaceutical interest. PMID:21364940

  18. A guide to phylogenetic metrics for conservation, community ecology and macroecology.

    PubMed

    Tucker, Caroline M; Cadotte, Marc W; Carvalho, Silvia B; Davies, T Jonathan; Ferrier, Simon; Fritz, Susanne A; Grenyer, Rich; Helmus, Matthew R; Jin, Lanna S; Mooers, Arne O; Pavoine, Sandrine; Purschke, Oliver; Redding, David W; Rosauer, Dan F; Winter, Marten; Mazel, Florent

    2017-05-01

    The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub-disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub-disciplines hampers potential meta-analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo-diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo-diversity metrics based on their mathematical form within these three dimensions and identify 'anchor' representatives: for α-diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. © 2016 The Authors. Biological Reviews published by John Wiley © Sons Ltd on behalf of

  19. Phylogenetic Variation in the Silicon Composition of Plants

    PubMed Central

    HODSON, M. J.; WHITE, P. J.; MEAD, A.; BROADLEY, M. R.

    2005-01-01

    • Background and Aims Silicon (Si) in plants provides structural support and improves tolerance to diseases, drought and metal toxicity. Shoot Si concentrations are generally considered to be greater in monocotyledonous than in non-monocot plant species. The phylogenetic variation in the shoot Si concentration of plants reported in the primary literature has been quantified. • Methods Studies were identified which reported Si concentrations in leaf or non-woody shoot tissues from at least two plant species growing in the same environment. Each study contained at least one species in common with another study. • Key Results Meta-analysis of the data revealed that, in general, ferns, gymnosperms and angiosperms accumulated less Si in their shoots than non-vascular plant species and horsetails. Within angiosperms and ferns, differences in shoot Si concentration between species grouped by their higher-level phylogenetic position were identified. Within the angiosperms, species from the commelinoid monocot orders Poales and Arecales accumulated substantially more Si in their shoots than species from other monocot clades. • Conclusions A high shoot Si concentration is not a general feature of monocot species. Information on the phylogenetic variation in shoot Si concentration may provide useful palaeoecological and archaeological information, and inform studies of the biogeochemical cycling of Si and those of the molecular genetics of Si uptake and transport in plants. PMID:16176944

  20. Full-Sun observations for identifying the source of the slow solar wind

    PubMed Central

    Brooks, David H.; Ugarte-Urra, Ignacio; Warren, Harry P.

    2015-01-01

    Fast (>700 km s−1) and slow (~400 km s−1) winds stream from the Sun, permeate the heliosphere and influence the near-Earth environment. While the fast wind is known to emanate primarily from polar coronal holes, the source of the slow wind remains unknown. Here we identify possible sites of origin using a slow solar wind source map of the entire Sun, which we construct from specially designed, full-disk observations from the Hinode satellite, and a magnetic field model. Our map provides a full-Sun observation that combines three key ingredients for identifying the sources: velocity, plasma composition and magnetic topology and shows them as solar wind composition plasma outflowing on open magnetic field lines. The area coverage of the identified sources is large enough that the sum of their mass contributions can explain a significant fraction of the mass loss rate of the solar wind. PMID:25562705

  1. BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC

    PubMed Central

    Satija, Rahul; Novák, Ádám; Miklós, István; Lyngsø, Rune; Hein, Jotun

    2009-01-01

    Background We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden markov model algorithms to analyze up to four sequences. Results We present a novel approach, implemented in the software package BigFoot, for performing phylogenetic footprinting on greater numbers of sequences. We have developed a Markov chain Monte Carlo (MCMC) approach which samples both sequence alignments and locations of slowly evolving regions. We implement our method as an extension of the existing StatAlign software package and test it on well-annotated regions controlling the expression of the even-skipped gene in Drosophila and the α-globin gene in vertebrates. The results exhibit how adding additional sequences to the analysis has the potential to improve the accuracy of functional predictions, and demonstrate how BigFoot outperforms existing alignment-based phylogenetic footprinting techniques. Conclusion BigFoot extends a combined alignment and phylogenetic footprinting approach to analyze larger amounts of sequence data using MCMC. Our approach is robust to alignment error and uncertainty and can be applied to a variety of biological datasets. The source code and documentation are publicly available for download from PMID:19715598

  2. BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC.

    PubMed

    Satija, Rahul; Novák, Adám; Miklós, István; Lyngsø, Rune; Hein, Jotun

    2009-08-28

    We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden markov model algorithms to analyze up to four sequences. We present a novel approach, implemented in the software package BigFoot, for performing phylogenetic footprinting on greater numbers of sequences. We have developed a Markov chain Monte Carlo (MCMC) approach which samples both sequence alignments and locations of slowly evolving regions. We implement our method as an extension of the existing StatAlign software package and test it on well-annotated regions controlling the expression of the even-skipped gene in Drosophila and the alpha-globin gene in vertebrates. The results exhibit how adding additional sequences to the analysis has the potential to improve the accuracy of functional predictions, and demonstrate how BigFoot outperforms existing alignment-based phylogenetic footprinting techniques. BigFoot extends a combined alignment and phylogenetic footprinting approach to analyze larger amounts of sequence data using MCMC. Our approach is robust to alignment error and uncertainty and can be applied to a variety of biological datasets. The source code and documentation are publicly available for download from http://www.stats.ox.ac.uk/~satija/BigFoot/

  3. Metabolic Pathway Assignment of Plant Genes based on Phylogenetic Profiling–A Feasibility Study

    PubMed Central

    Weißenborn, Sandra; Walther, Dirk

    2017-01-01

    Despite many developed experimental and computational approaches, functional gene annotation remains challenging. With the rapidly growing number of sequenced genomes, the concept of phylogenetic profiling, which predicts functional links between genes that share a common co-occurrence pattern across different genomes, has gained renewed attention as it promises to annotate gene functions based on presence/absence calls alone. We applied phylogenetic profiling to the problem of metabolic pathway assignments of plant genes with a particular focus on secondary metabolism pathways. We determined phylogenetic profiles for 40,960 metabolic pathway enzyme genes with assigned EC numbers from 24 plant species based on sequence and pathway annotation data from KEGG and Ensembl Plants. For gene sequence family assignments, needed to determine the presence or absence of particular gene functions in the given plant species, we included data of all 39 species available at the Ensembl Plants database and established gene families based on pairwise sequence identities and annotation information. Aside from performing profiling comparisons, we used machine learning approaches to predict pathway associations from phylogenetic profiles alone. Selected metabolic pathways were indeed found to be composed of gene families of greater than expected phylogenetic profile similarity. This was particularly evident for primary metabolism pathways, whereas for secondary pathways, both the available annotation in different species as well as the abstraction of functional association via distinct pathways proved limiting. While phylogenetic profile similarity was generally not found to correlate with gene co-expression, direct physical interactions of proteins were reflected by a significantly increased profile similarity suggesting an application of phylogenetic profiling methods as a filtering step in the identification of protein-protein interactions. This feasibility study highlights the

  4. NAC transcription factor genes: genome-wide identification, phylogenetic, motif and cis-regulatory element analysis in pigeonpea (Cajanus cajan (L.) Millsp.).

    PubMed

    Satheesh, Viswanathan; Jagannadham, P Tej Kumar; Chidambaranathan, Parameswaran; Jain, P K; Srinivasan, R

    2014-12-01

    The NAC (NAM, ATAF and CUC) proteins are plant-specific transcription factors implicated in development and stress responses. In the present study 88 pigeonpea NAC genes were identified from the recently published draft genome of pigeonpea by using homology based and de novo prediction programmes. These sequences were further subjected to phylogenetic, motif and promoter analyses. In motif analysis, highly conserved motifs were identified in the NAC domain and also in the C-terminal region of the NAC proteins. A phylogenetic reconstruction using pigeonpea, Arabidopsis and soybean NAC genes revealed 33 putative stress-responsive pigeonpea NAC genes. Several stress-responsive cis-elements were identified through in silico analysis of the promoters of these putative stress-responsive genes. This analysis is the first report of NAC gene family in pigeonpea and will be useful for the identification and selection of candidate genes associated with stress tolerance.

  5. Phylogenetic trees in bioinformatics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Burr, Tom L

    2008-01-01

    Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use ofprobabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length, and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that fmding themore » best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.« less

  6. Phylogenetic relationship of Ornithobacterium rhinotracheale strains.

    PubMed

    DE Oca-Jimenez, Roberto Montes; Vega-Sanchez, Vicente; Morales-Erasto, Vladimir; Salgado-Miranda, Celene; Blackall, Patrick J; Soriano-Vargas, Edgardo

    2018-04-10

    The bacterium Ornithobacterium rhinotracheale is associated with respiratory disease in wild birds and poultry. In this study, the phylogenetic analysis of nine reference strains of O. rhinotracheale belonging to serovars A to I, and eight Mexican isolates belonging to serovar A, was performed. The analysis was extended to include available sequences from another 23 strains available in the public domain. The analysis showed that the 40 sequences formed six clusters, I to VI. All eight Mexican field isolates were placed in cluster I. One of the reference strains appears to present genetic diversity not previously recognized and was placed in a new genetic cluster. In conclusion, the phylogenetic analysis of O. rhinotracheale strains, based on the 16S rRNA gene, is a suitable tool for epidemiologic studies.

  7. Phylogenetically Diverse Burkholderia Associated with Midgut Crypts of Spurge Bugs, Dicranocephalus spp. (Heteroptera: Stenocephalidae).

    PubMed

    Kuechler, Stefan Martin; Matsuura, Yu; Dettner, Konrad; Kikuchi, Yoshitomo

    2016-06-25

    Diverse phytophagous heteropteran insects, commonly known as stinkbugs, are associated with specific gut symbiotic bacteria, which have been found in midgut cryptic spaces. Recent studies have revealed that members of the stinkbug families Coreidae and Alydidae of the superfamily Coreoidea are consistently associated with a specific group of the betaproteobacterial genus Burkholderia, called the "stinkbug-associated beneficial and environmental (SBE)" group, and horizontally acquire specific symbionts from the environment every generation. However, the symbiotic system of another coreoid family, Stenocephalidae remains undetermined. We herein investigated four species of the stenocephalid genus Dicranocephalus. Examinations via fluorescence in situ hybridization (FISH) and transmission electron microscopy (TEM) revealed the typical arrangement and ultrastructures of midgut crypts and gut symbionts. Cloning and molecular phylogenetic analyses of bacterial genes showed that the midgut crypts of all species are colonized by Burkholderia strains, which were further assigned to different subgroups of the genus Burkholderia. In addition to the SBE-group Burkholderia, a number of stenocephalid symbionts belonged to a novel clade containing B. sordidicola and B. udeis, suggesting a specific symbiont clade for the Stenocephalidae. The symbiotic systems of stenocephalid bugs may provide a unique opportunity to study the ongoing evolution of symbiont associations in the stinkbug-Burkholderia interaction.

  8. Phylogenetically Diverse Burkholderia Associated with Midgut Crypts of Spurge Bugs, Dicranocephalus spp. (Heteroptera: Stenocephalidae)

    PubMed Central

    Kuechler, Stefan Martin; Matsuura, Yu; Dettner, Konrad; Kikuchi, Yoshitomo

    2016-01-01

    Diverse phytophagous heteropteran insects, commonly known as stinkbugs, are associated with specific gut symbiotic bacteria, which have been found in midgut cryptic spaces. Recent studies have revealed that members of the stinkbug families Coreidae and Alydidae of the superfamily Coreoidea are consistently associated with a specific group of the betaproteobacterial genus Burkholderia, called the “stinkbug-associated beneficial and environmental (SBE)” group, and horizontally acquire specific symbionts from the environment every generation. However, the symbiotic system of another coreoid family, Stenocephalidae remains undetermined. We herein investigated four species of the stenocephalid genus Dicranocephalus. Examinations via fluorescence in situ hybridization (FISH) and transmission electron microscopy (TEM) revealed the typical arrangement and ultrastructures of midgut crypts and gut symbionts. Cloning and molecular phylogenetic analyses of bacterial genes showed that the midgut crypts of all species are colonized by Burkholderia strains, which were further assigned to different subgroups of the genus Burkholderia. In addition to the SBE-group Burkholderia, a number of stenocephalid symbionts belonged to a novel clade containing B. sordidicola and B. udeis, suggesting a specific symbiont clade for the Stenocephalidae. The symbiotic systems of stenocephalid bugs may provide a unique opportunity to study the ongoing evolution of symbiont associations in the stinkbug-Burkholderia interaction. PMID:27265344

  9. Identifying avian sources of faecal contamination using sterol analysis.

    PubMed

    Devane, Megan L; Wood, David; Chappell, Andrew; Robson, Beth; Webster-Brown, Jenny; Gilpin, Brent J

    2015-10-01

    Discrimination of the source of faecal pollution in water bodies is an important step in the assessment and mitigation of public health risk. One tool for faecal source tracking is the analysis of faecal sterols which are present in faeces of animals in a range of distinctive ratios. Published ratios are able to discriminate between human and herbivore mammal faecal inputs but are of less value for identifying pollution from wildfowl, which can be a common cause of elevated bacterial indicators in rivers and streams. In this study, the sterol profiles of 50 avian-derived faecal specimens (seagulls, ducks and chickens) were examined alongside those of 57 ruminant faeces and previously published sterol profiles of human wastewater, chicken effluent and animal meatwork effluent. Two novel sterol ratios were identified as specific to avian faecal scats, which, when incorporated into a decision tree with human and herbivore mammal indicative ratios, were able to identify sterols from avian-polluted waterways. For samples where the sterol profile was not consistent with herbivore mammal or human pollution, avian pollution is indicated when the ratio of 24-ethylcholestanol/(24-ethylcholestanol + 24-ethylcoprostanol + 24-ethylepicoprostanol) is ≥0.4 (avian ratio 1) and the ratio of cholestanol/(cholestanol + coprostanol + epicoprostanol) is ≥0.5 (avian ratio 2). When avian pollution is indicated, further confirmation by targeted PCR specific markers can be employed if greater confidence in the pollution source is required. A 66% concordance between sterol ratios and current avian PCR markers was achieved when 56 water samples from polluted waterways were analysed.

  10. Multi-particle inspection using associated particle sources

    DOEpatents

    Bingham, Philip R.; Mihalczo, John T.; Mullens, James A.; McConchie, Seth M.; Hausladen, Paul A.

    2016-02-16

    Disclosed herein are representative embodiments of methods, apparatus, and systems for performing combined neutron and gamma ray radiography. For example, one exemplary system comprises: a neutron source; a set of alpha particle detectors configured to detect alpha particles associated with neutrons generated by the neutron source; neutron detectors positioned to detect at least some of the neutrons generated by the neutron source; a gamma ray source; a set of verification gamma ray detectors configured to detect verification gamma rays associated with gamma rays generated by the gamma ray source; a set of gamma ray detectors configured to detect gamma rays generated by the gamma ray source; and an interrogation region located between the neutron source, the gamma ray source, the neutron detectors, and the gamma ray detectors.

  11. Autumn Algorithm-Computation of Hybridization Networks for Realistic Phylogenetic Trees.

    PubMed

    Huson, Daniel H; Linz, Simone

    2018-01-01

    A minimum hybridization network is a rooted phylogenetic network that displays two given rooted phylogenetic trees using a minimum number of reticulations. Previous mathematical work on their calculation has usually assumed the input trees to be bifurcating, correctly rooted, or that they both contain the same taxa. These assumptions do not hold in biological studies and "realistic" trees have multifurcations, are difficult to root, and rarely contain the same taxa. We present a new algorithm for computing minimum hybridization networks for a given pair of "realistic" rooted phylogenetic trees. We also describe how the algorithm might be used to improve the rooting of the input trees. We introduce the concept of "autumn trees", a nice framework for the formulation of algorithms based on the mathematics of "maximum acyclic agreement forests". While the main computational problem is hard, the run-time depends mainly on how different the given input trees are. In biological studies, where the trees are reasonably similar, our parallel implementation performs well in practice. The algorithm is available in our open source program Dendroscope 3, providing a platform for biologists to explore rooted phylogenetic networks. We demonstrate the utility of the algorithm using several previously studied data sets.

  12. Identifying potential sources of Sudan I contamination in Capsicum fruits over its growth period.

    PubMed

    Wu, Naiying; Gao, Wei; Zhou, Li; Lian, Yunhe; Li, Fengfei; Han, Wenjie

    2015-04-15

    Sudan dyes in spices are often assumed to arise from cross-contamination or malicious addition. Here, experiments were carried out to identify the potential source of Sudan I-IV in Capsicum fruits through investigation of their contents in native Capsicum tissues, soils and associated agronomic materials. Sudan II-IV was not detected in any of the tested samples. Sudan I was found in almost all samples except for the mulching film. Sudan I concentrations decreased from stems to leaves and then to fruits or roots. Sudan I levels in soils were significantly elevated by vegetation treatment. These results exclude the possibility of soil as the main source for Sudan I contamination in Capsicum fruits. Further study found out pesticide and fertilizer constitutes the major source of Sudan I contamination. This work represents a preliminary step for a detailed Sudan I assessment to support Capsicum management and protection in the studied region. Copyright © 2014 Elsevier Ltd. All rights reserved.

  13. Identifying Sources of Funding That Contribute to Scholastic Productivity in Academic Plastic Surgeons.

    PubMed

    Ruan, Qing Zhao; Cohen, Justin B; Baek, Yoonji; Chen, Austin D; Doval, Andres F; Singhal, Dhruv; Fukudome, Eugene Y; Lin, Samuel J; Lee, Bernard T

    2018-04-01

    Scholastic productivity has previously been shown to be positively associated with National Institute of Health (NIH) grants and industry funding. This study examines whether society, industry, or federal funding contributes toward academic productivity as measured by scholastic output of academic plastic surgeons. Institution Web sites were used to acquire academic attributes of full-time academic plastic surgeons. The Center for Medicare and Medicaid Services Open Payment database, NIH reporter, the Plastic Surgery Foundation (PSF), and American Association of Plastic Surgeons (AAPS) Web sites were accessed for funding and endowment details. Bibliometric data of each surgeon were then collected via Scopus to ascertain strengths of association with each source. Multiple linear regression analysis was used to identify significant contributors to high scholastic output. We identified 935 academic plastic surgeons with 94 (10.1%), 24 (2.6%), 724 (77.4%), and 62 (6.6%) receiving funding from PSF, AAPS, industry, and NIH, respectively. There were positive correlations in receiving NIH, PSF, and/or AAPS funding (P < 0.001), whereas industry funding was found to negatively associate with PSF (r = -0.75, P = 0.022) grants. The NIH R award was consistently found to be the most predictive of academic output across bibliometrics, followed by the AAPS academic scholarship award. Conventional measures of academic seniority remained predictive across all measures used. Our study demonstrates for the first time interactions between industry, federal, and association funding. The NIH R award was the strongest determinant of high scholastic productivity. Recognition through AAPS academic scholarships seemed to associate with subsequent success in NIH funding.

  14. Source-dependent and source-independent controls on plutonium oxidation state and colloid associations in groundwater.

    PubMed

    Buesseler, Ken O; Kaplan, Daniel I; Dai, Minhan; Pike, Steven

    2009-03-01

    Plutonium (Pu) was characterized for its isotopic composition, oxidation states, and association with colloids in groundwater samples near disposal basins in F-Area of the Savannah River Site and compared to similar samples collected six years earlier. Two sources of Pu were identified, the disposal basins, which contained a 24Pu/l39Pu isotopic signature consistent with weapons grade Pu, and 244Cm, a cocontaminant that is a progenitor radionuclide of 24Pu. 24Pu that originated primarily from 244Cm tended to be appreciably more oxidized (Pu(V/VI)), less associated with colloids (approximately 1 kDa - 0.2 microm), and more mobile than 239Pu, as suggested by our prior studies at this site. This is not evidence of isotope fractionation but rather "source-dependent" controls on 240Pu speciation which are processes that are not at equilibrium, i.e., processes that appear kinetically hindered. There were also "source-independent" controls on 239Pu speciation, which are those processes that follow thermodynamic equilibrium with their surroundings. For example, a groundwater pH increase in one well from 4.1 in 1998 to 6.1 in 2004 resulted in an order of magnitude decrease in groundwater 239Pu concentrations. Similarly, the fraction of 239Pu in the reduced Pu(III/IV) and colloidal forms increased systematically with decreases in redox condition in 2004 vs 1998. This research demonstrates the importance of source-dependent and source-independent controls on Pu speciation which would impact Pu mobility during changes in hydrological, chemical, or biological conditions on both seasonal and decadal time scales, and over short spatial scales. This implies more dynamic shifts in Pu speciation, colloids association, and transport in groundwater than commonly believed.

  15. Comparative phylogenetic analysis and transcriptional profiling of MADS-box gene family identified DAM and FLC-like genes in apple (Malusx domestica)

    PubMed Central

    Kumar, Gulshan; Arya, Preeti; Gupta, Khushboo; Randhawa, Vinay; Acharya, Vishal; Singh, Anil Kumar

    2016-01-01

    The MADS-box transcription factors play essential roles in various processes of plant growth and development. In the present study, phylogenetic analysis of 142 apple MADS-box proteins with that of other dicotyledonous species identified six putative Dormancy-Associated MADS-box (DAM) and four putative Flowering Locus C-like (FLC-like) proteins. In order to study the expression of apple MADS-box genes, RNA-seq analysis of 3 apical and 5 spur bud stages during dormancy, 6 flower stages and 7 fruit development stages was performed. The dramatic reduction in expression of two MdDAMs, MdMADS063 and MdMADS125 and two MdFLC-like genes, MdMADS135 and MdMADS136 during dormancy release suggests their role as flowering-repressors in apple. Apple orthologs of Arabidopsis genes, FLOWERING LOCUS T, FRIGIDA, SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 and LEAFY exhibit similar expression patterns as reported in Arabidopsis, suggesting functional conservation in floral signal integration and meristem determination pathways. Gene ontology enrichment analysis of predicted targets of DAM revealed their involvement in regulation of reproductive processes and meristematic activities, indicating functional conservation of SVP orthologs (DAM) in apple. This study provides valuable insights into the functions of MADS-box proteins during apple phenology, which may help in devising strategies to improve important traits in apple. PMID:26856238

  16. Comparative phylogenetic analysis and transcriptional profiling of MADS-box gene family identified DAM and FLC-like genes in apple (Malusx domestica).

    PubMed

    Kumar, Gulshan; Arya, Preeti; Gupta, Khushboo; Randhawa, Vinay; Acharya, Vishal; Singh, Anil Kumar

    2016-02-09

    The MADS-box transcription factors play essential roles in various processes of plant growth and development. In the present study, phylogenetic analysis of 142 apple MADS-box proteins with that of other dicotyledonous species identified six putative Dormancy-Associated MADS-box (DAM) and four putative Flowering Locus C-like (FLC-like) proteins. In order to study the expression of apple MADS-box genes, RNA-seq analysis of 3 apical and 5 spur bud stages during dormancy, 6 flower stages and 7 fruit development stages was performed. The dramatic reduction in expression of two MdDAMs, MdMADS063 and MdMADS125 and two MdFLC-like genes, MdMADS135 and MdMADS136 during dormancy release suggests their role as flowering-repressors in apple. Apple orthologs of Arabidopsis genes, FLOWERING LOCUS T, FRIGIDA, SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 and LEAFY exhibit similar expression patterns as reported in Arabidopsis, suggesting functional conservation in floral signal integration and meristem determination pathways. Gene ontology enrichment analysis of predicted targets of DAM revealed their involvement in regulation of reproductive processes and meristematic activities, indicating functional conservation of SVP orthologs (DAM) in apple. This study provides valuable insights into the functions of MADS-box proteins during apple phenology, which may help in devising strategies to improve important traits in apple.

  17. Phylogenetic Group Determination of Escherichia coli Isolated from Animals Samples

    PubMed Central

    Morcatti Coura, Fernanda; Diniz, Soraia de Araújo; Silva, Marcos Xavier; Mussi, Jamili Maria Suhet; Barbosa, Silvia Minharro; Lage, Andrey Pereira; Heinemann, Marcos Bryan

    2015-01-01

    This study analyzes the occurrence and distribution of phylogenetic groups of 391 strains of Escherichia coli isolated from poultry, cattle, and water buffalo. The frequency of the phylogroups was A = 19%, B1 = 57%, B2 = 2.3%, C = 4.6%, D = 2.8%, E = 11%, and F = 3.3%. Phylogroups A (P < 0.001) and F (P = 0.018) were associated with E. coli strains isolated from poultry, phylogroups B1 (P < 0.001) and E (P = 0.002) were associated with E. coli isolated from cattle, and phylogroups B2 (P = 0.003) and D (P = 0.017) were associated with E. coli isolated from water buffalo. This report demonstrated that some phylogroups are associated with the host analyzed and the results provide knowledge of the phylogenetic composition of E. coli from domestic animals. PMID:26421310

  18. New substitution models for rooting phylogenetic trees.

    PubMed

    Williams, Tom A; Heaps, Sarah E; Cherlin, Svetlana; Nye, Tom M W; Boys, Richard J; Embley, T Martin

    2015-09-26

    The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made. © 2015 The Authors.

  19. Using Multiple-Variable Matching to Identify Cultural Sources of Differential Item Functioning

    ERIC Educational Resources Information Center

    Wu, Amery D.; Ercikan, Kadriye

    2006-01-01

    Identifying the sources of differential item functioning (DIF) in international assessments is very challenging, because such sources are often nebulous and intertwined. Even though researchers frequently focus on test translation and content area, few actually go beyond these factors to investigate other cultural sources of DIF. This article…

  20. Identification of Tunisian Leishmania spp. by PCR amplification of cysteine proteinase B (cpb) genes and phylogenetic analysis.

    PubMed

    Chaouch, Melek; Fathallah-Mili, Akila; Driss, Mehdi; Lahmadi, Ramzi; Ayari, Chiraz; Guizani, Ikram; Ben Said, Moncef; Benabderrazak, Souha

    2013-03-01

    Discrimination of the Old World Leishmania parasites is important for diagnosis and epidemiological studies of leishmaniasis. We have developed PCR assays that allow the discrimination between Leishmania major, Leishmania tropica and Leishmania infantum Tunisian species. The identification was performed by a simple PCR targeting cysteine protease B (cpb) gene copies. These PCR can be a routine molecular biology tools for discrimination of Leishmania spp. from different geographical origins and different clinical forms. Our assays can be an informative source for cpb gene studying concerning drug, diagnostics and vaccine research. The PCR products of the cpb gene and the N-acetylglucosamine-1-phosphate transferase (nagt) Leishmania gene were sequenced and aligned. Phylogenetic trees of Leishmania based cpb and nagt sequences are close in topology and present the classic distribution of Leishmania in the Old World. The phylogenetic analysis has enabled the characterization and identification of different strains, using both multicopy (cpb) and single copy (nagt) genes. Indeed, the cpb phylogenetic analysis allowed us to identify the Tunisian Leishmania killicki species, and a group which gathers the least evolved isolates of the Leishmania donovani complex, that was originated from East Africa. This clustering confirms the African origin for the visceralizing species of the L. donovani complex. Copyright © 2012 Elsevier B.V. All rights reserved.

  1. Transforming phylogenetic networks: Moving beyond tree space.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

    2016-09-07

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.

  2. Phylogenetic effective sample size.

    PubMed

    Bartoszek, Krzysztof

    2016-10-21

    In this paper I address the question-how large is a phylogenetic sample? I propose a definition of a phylogenetic effective sample size for Brownian motion and Ornstein-Uhlenbeck processes-the regression effective sample size. I discuss how mutual information can be used to define an effective sample size in the non-normal process case and compare these two definitions to an already present concept of effective sample size (the mean effective sample size). Through a simulation study I find that the AICc is robust if one corrects for the number of species or effective number of species. Lastly I discuss how the concept of the phylogenetic effective sample size can be useful for biodiversity quantification, identification of interesting clades and deciding on the importance of phylogenetic correlations. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. Species Divergence and Phylogenetic Variation of Ecophysiological Traits in Lianas and Trees

    PubMed Central

    Rios, Rodrigo S.; Salgado-Luarte, Cristian; Gianoli, Ernesto

    2014-01-01

    The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [Amax], dark respiration rate [Rd], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that Rd evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for Rd, while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. Rd followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for Amax. Rd may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades. PMID:24914958

  4. A Format for Phylogenetic Placements

    PubMed Central

    Matsen, Frederick A.; Hoffman, Noah G.; Gallagher, Aaron; Stamatakis, Alexandros

    2012-01-01

    We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988

  5. A format for phylogenetic placements.

    PubMed

    Matsen, Frederick A; Hoffman, Noah G; Gallagher, Aaron; Stamatakis, Alexandros

    2012-01-01

    We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.

  6. Aujeszky's disease in red fox (Vulpes vulpes): phylogenetic analysis unravels an unexpected epidemiologic link.

    PubMed

    Caruso, Claudio; Dondo, Alessandro; Cerutti, Francesco; Masoero, Loretta; Rosamilia, Alfonso; Zoppi, Simona; D'Errico, Valeria; Grattarola, Carla; Acutis, Pier Luigi; Peletto, Simone

    2014-07-01

    We describe Aujeszky's disease in a female of red fox (Vulpes vulpes). Although wild boar (Sus scrofa) would be the expected source of infection, phylogenetic analysis suggested a domestic rather than a wild source of virus, underscoring the importance of biosecurity measures in pig farms to prevent contact with wild animals.

  7. Computational Tools for Parsimony Phylogenetic Analysis of Omics Data

    PubMed Central

    Salazar, Jose; Amri, Hakima; Noursi, David

    2015-01-01

    Abstract High-throughput assays from genomics, proteomics, metabolomics, and next generation sequencing produce massive omics datasets that are challenging to analyze in biological or clinical contexts. Thus far, there is no publicly available program for converting quantitative omics data into input formats to be used in off-the-shelf robust phylogenetic programs. To the best of our knowledge, this is the first report on creation of two Windows-based programs, OmicsTract and SynpExtractor, to address this gap. We note, as a way of introduction and development of these programs, that one particularly useful bioinformatics inferential modeling is the phylogenetic cladogram. Cladograms are multidimensional tools that show the relatedness between subgroups of healthy and diseased individuals and the latter's shared aberrations; they also reveal some characteristics of a disease that would not otherwise be apparent by other analytical methods. The OmicsTract and SynpExtractor were written for the respective tasks of (1) accommodating advanced phylogenetic parsimony analysis (through standard programs of MIX [from PHYLIP] and TNT), and (2) extracting shared aberrations at the cladogram nodes. OmicsTract converts comma-delimited data tables through assigning each data point into a binary value (“0” for normal states and “1” for abnormal states) then outputs the converted data tables into the proper input file formats for MIX or with embedded commands for TNT. SynapExtractor uses outfiles from MIX and TNT to extract the shared aberrations of each node of the cladogram, matching them with identifying labels from the dataset and exporting them into a comma-delimited file. Labels may be gene identifiers in gene-expression datasets or m/z values in mass spectrometry datasets. By automating these steps, OmicsTract and SynpExtractor offer a veritable opportunity for rapid and standardized phylogenetic analyses of omics data; their model can also be extended to next

  8. Association of fine particulate matter from different sources with daily mortality in six U.S. cities.

    PubMed Central

    Laden, F; Neas, L M; Dockery, D W; Schwartz, J

    2000-01-01

    Previously we reported that fine particle mass (particulate matter [less than and equal to] 2.5 microm; PM(2.5)), which is primarily from combustion sources, but not coarse particle mass, which is primarily from crustal sources, was associated with daily mortality in six eastern U.S. cities (1). In this study, we used the elemental composition of size-fractionated particles to identify several distinct source-related fractions of fine particles and examined the association of these fractions with daily mortality in each of the six cities. Using specific rotation factor analysis for each city, we identified a silicon factor classified as soil and crustal material, a lead factor classified as motor vehicle exhaust, a selenium factor representing coal combustion, and up to two additional factors. We extracted daily counts of deaths from National Center for Health Statistics records and estimated city-specific associations of mortality with each source factor by Poisson regression, adjusting for time trends, weather, and the other source factors. Combined effect estimates were calculated as the inverse variance weighted mean of the city-specific estimates. In the combined analysis, a 10 microg/m(3) increase in PM(2.5) from mobile sources accounted for a 3.4% increase in daily mortality [95% confidence interval (CI), 1.7-5.2%], and the equivalent increase in fine particles from coal combustion sources accounted for a 1.1% increase [CI, 0.3-2.0%). PM(2.5) crustal particles were not associated with daily mortality. These results indicate that combustion particles in the fine fraction from mobile and coal combustion sources, but not fine crustal particles, are associated with increased mortality. PMID:11049813

  9. Phylogenetic inertia and Darwin's higher law.

    PubMed

    Shanahan, Timothy

    2011-03-01

    The concept of 'phylogenetic inertia' is routinely deployed in evolutionary biology as an alternative to natural selection for explaining the persistence of characteristics that appear sub-optimal from an adaptationist perspective. However, in many of these contexts the precise meaning of 'phylogenetic inertia' and its relationship to selection are far from clear. After tracing the history of the concept of 'inertia' in evolutionary biology, I argue that treating phylogenetic inertia and natural selection as alternative explanations is mistaken because phylogenetic inertia is, from a Darwinian point of view, simply an expected effect of selection. Although Darwin did not discuss 'phylogenetic inertia,' he did assert the explanatory priority of selection over descent. An analysis of 'phylogenetic inertia' provides a perspective from which to assess Darwin's view. Copyright © 2010 Elsevier Ltd. All rights reserved.

  10. A phylogenetic perspective on the association between ants (Hymenoptera: Formicidae) and black yeasts (Ascomycota: Chaetothyriales).

    PubMed

    Vasse, Marie; Voglmayr, Hermann; Mayer, Veronika; Gueidan, Cécile; Nepel, Maximilian; Moreno, Leandro; de Hoog, Sybren; Selosse, Marc-André; McKey, Doyle; Blatrix, Rumsaïs

    2017-03-15

    The frequency and the geographical extent of symbiotic associations between ants and fungi of the order Chaetothyriales have been highlighted only recently. Using a phylogenetic approach based on seven molecular markers, we showed that ant-associated Chaetothyriales are scattered through the phylogeny of this order. There was no clustering according to geographical origin or to the taxonomy of the ant host. However, strains tended to be clustered according to the type of association with ants: strains from ant-made carton and strains from plant cavities occupied by ants ('domatia') rarely clustered together. Defining molecular operational taxonomic units (MOTUs) with an internal transcribed spacer sequence similarity cut-off of 99% revealed that a single MOTU could be composed of strains collected from various ant species and from several continents. Some ant-associated MOTUs also contained strains isolated from habitats other than ant-associated structures. Altogether, our results suggest that the degree of specialization of the interactions between ants and their fungal partners is highly variable. A better knowledge of the ecology of these interactions and a more comprehensive sampling of the fungal order are needed to elucidate the evolutionary history of mutualistic symbioses between ants and Chaetothyriales. © 2017 The Author(s).

  11. An approach to source characterization of tremor signals associated with eruptions and lahars

    NASA Astrophysics Data System (ADS)

    Kumagai, Hiroyuki; Mothes, Patricia; Ruiz, Mario; Maeda, Yuta

    2015-11-01

    Tremor signals are observed in association with eruption activity and lahar descents. Reduced displacement ( D R) derived from tremor signals has been used to quantify tremor sources. However, tremor duration is not considered in D R, which makes it difficult to compare D R values estimated for different tremor episodes. We propose application of the amplitude source location (ASL) method to characterize the sources of tremor signals. We used this method to estimate the tremor source location and source amplitude from high-frequency (5-10 Hz) seismic amplitudes under the assumption of isotropic S-wave radiation. We considered the source amplitude to be the maximum value during tremor. We estimated the cumulative source amplitude ( I s) as the offset value of the time-integrated envelope of the vertical seismogram of tremor corrected for geometrical spreading and medium attenuation in the 5-10-Hz band. For eruption tremor signals, we also estimated the cumulative source pressure ( I p) from an infrasonic envelope waveform corrected for geometrical spreading. We studied these parameters of tremor signals associated with eruptions and lahars and explosion events at Tungurahua volcano, Ecuador. We identified two types of eruption tremor at Tungurahua: noise-like inharmonic waveforms and harmonic oscillatory signals. We found that I s increased linearly with increasing source amplitude for lahar tremor signals and explosion events, but I s increased exponentially with increasing source amplitude for inharmonic eruption tremor signals. The source characteristics of harmonic eruption tremor signals differed from those of inharmonic tremor signals. We found a linear relation between I s and I p for both explosion events and eruption tremor. Because I p may be proportional to the total mass involved during an eruption episode, this linear relation suggests that I s may be useful to quantify eruption size. The I s values we estimated for inharmonic eruption tremor were

  12. Application of agglomerative clustering for analyzing phylogenetically on bacterium of saliva

    NASA Astrophysics Data System (ADS)

    Bustamam, A.; Fitria, I.; Umam, K.

    2017-07-01

    Analyzing population of Streptococcus bacteria is important since these species can cause dental caries, periodontal, halitosis (bad breath) and more problems. This paper will discuss the phylogenetically relation between the bacterium Streptococcus in saliva using a phylogenetic tree of agglomerative clustering methods. Starting with the bacterium Streptococcus DNA sequence obtained from the GenBank, then performed characteristic extraction of DNA sequences. The characteristic extraction result is matrix form, then performed normalization using min-max normalization and calculate genetic distance using Manhattan distance. Agglomerative clustering technique consisting of single linkage, complete linkage and average linkage. In this agglomerative algorithm number of group is started with the number of individual species. The most similar species is grouped until the similarity decreases and then formed a single group. Results of grouping is a phylogenetic tree and branches that join an established level of distance, that the smaller the distance the more the similarity of the larger species implementation is using R, an open source program.

  13. Arbuscular mycorrhizal fungal communities are phylogenetically clustered at small scales

    PubMed Central

    Horn, Sebastian; Caruso, Tancredi; Verbruggen, Erik; Rillig, Matthias C; Hempel, Stefan

    2014-01-01

    Next-generation sequencing technologies with markers covering the full Glomeromycota phylum were used to uncover phylogenetic community structure of arbuscular mycorrhizal fungi (AMF) associated with Festuca brevipila. The study system was a semi-arid grassland with high plant diversity and a steep environmental gradient in pH, C, N, P and soil water content. The AMF community in roots and rhizosphere soil were analyzed separately and consisted of 74 distinct operational taxonomic units (OTUs) in total. Community-level variance partitioning showed that the role of environmental factors in determining AM species composition was marginal when controlling for spatial autocorrelation at multiple scales. Instead, phylogenetic distance and spatial distance were major correlates of AMF communities: OTUs that were more closely related (and which therefore may have similar traits) were more likely to co-occur. This pattern was insensitive to phylogenetic sampling breadth. Given the minor effects of the environment, we propose that at small scales closely related AMF positively associate through biotic factors such as plant-AMF filtering and interactions within the soil biota. PMID:24824667

  14. Phylogenetic and Evolutionary Patterns in Microbial Carotenoid Biosynthesis Are Revealed by Comparative Genomics

    PubMed Central

    Klassen, Jonathan L.

    2010-01-01

    Background Carotenoids are multifunctional, taxonomically widespread and biotechnologically important pigments. Their biosynthesis serves as a model system for understanding the evolution of secondary metabolism. Microbial carotenoid diversity and evolution has hitherto been analyzed primarily from structural and biosynthetic perspectives, with the few phylogenetic analyses of microbial carotenoid biosynthetic proteins using either used limited datasets or lacking methodological rigor. Given the recent accumulation of microbial genome sequences, a reappraisal of microbial carotenoid biosynthetic diversity and evolution from the perspective of comparative genomics is warranted to validate and complement models of microbial carotenoid diversity and evolution based upon structural and biosynthetic data. Methodology/Principal Findings Comparative genomics were used to identify and analyze in silico microbial carotenoid biosynthetic pathways. Four major phylogenetic lineages of carotenoid biosynthesis are suggested composed of: (i) Proteobacteria; (ii) Firmicutes; (iii) Chlorobi, Cyanobacteria and photosynthetic eukaryotes; and (iv) Archaea, Bacteroidetes and two separate sub-lineages of Actinobacteria. Using this phylogenetic framework, specific evolutionary mechanisms are proposed for carotenoid desaturase CrtI-family enzymes and carotenoid cyclases. Several phylogenetic lineage-specific evolutionary mechanisms are also suggested, including: (i) horizontal gene transfer; (ii) gene acquisition followed by differential gene loss; (iii) co-evolution with other biochemical structures such as proteorhodopsins; and (iv) positive selection. Conclusions/Significance Comparative genomics analyses of microbial carotenoid biosynthetic proteins indicate a much greater taxonomic diversity then that identified based on structural and biosynthetic data, and divides microbial carotenoid biosynthesis into several, well-supported phylogenetic lineages not evident previously. This

  15. Effectiveness of source documents for identifying fatal occupational injuries: a synthesis of studies.

    PubMed

    Stout, N; Bell, C

    1991-06-01

    The complete and accurate identification of fatal occupational injuries among the US work force is an important first step in developing work injury prevention efforts. Numerous sources of information, such as death certificates, Workers' Compensation files, Occupational Safety and Health Administration (OSHA) files, medical examiner records, state health and labor department reports, and various combinations of these, have been used to identify cases of work-related fatal injuries. Recent studies have questioned the effectiveness of these sources for identifying such cases. At least 10 studies have used multiple sources to define the universe of fatal work injuries within a state and to determine the capture rates, or proportion of the universe identified, by each source. Results of these studies, which are not all available in published literature, are summarized here in a format that allows researchers to readily compare the ascertainment capabilities of the sources. The overall average capture rates of sources were as follows: death certificates, 81%; medical examiner records, 61%; Workers' Compensation reports, 57%; and OSHA reports 32%. Variations by state and value added through the use of multiple sources are presented and discussed. This meta-analysis of 10 state-based studies summarizes the effectiveness of various source documents for capturing cases of fatal occupational injuries to help researchers make informed decisions when designing occupational injury surveillance systems.

  16. Effectiveness of source documents for identifying fatal occupational injuries: a synthesis of studies.

    PubMed Central

    Stout, N; Bell, C

    1991-01-01

    BACKGROUND: The complete and accurate identification of fatal occupational injuries among the US work force is an important first step in developing work injury prevention efforts. Numerous sources of information, such as death certificates, Workers' Compensation files, Occupational Safety and Health Administration (OSHA) files, medical examiner records, state health and labor department reports, and various combinations of these, have been used to identify cases of work-related fatal injuries. Recent studies have questioned the effectiveness of these sources for identifying such cases. METHODS: At least 10 studies have used multiple sources to define the universe of fatal work injuries within a state and to determine the capture rates, or proportion of the universe identified, by each source. Results of these studies, which are not all available in published literature, are summarized here in a format that allows researchers to readily compare the ascertainment capabilities of the sources. RESULTS: The overall average capture rates of sources were as follows: death certificates, 81%; medical examiner records, 61%; Workers' Compensation reports, 57%; and OSHA reports 32%. Variations by state and value added through the use of multiple sources are presented and discussed. CONCLUSIONS: This meta-analysis of 10 state-based studies summarizes the effectiveness of various source documents for capturing cases of fatal occupational injuries to help researchers make informed decisions when designing occupational injury surveillance systems. PMID:1827569

  17. Phylogenetic Analysis Reveals a High Prevalence of Sporothrix brasiliensis in Feline Sporotrichosis Outbreaks

    PubMed Central

    Rodrigues, Anderson Messias; de Melo Teixeira, Marcus; de Hoog, G. Sybren; Schubach, Tânia Maria Pacheco; Pereira, Sandro Antonio; Fernandes, Geisa Ferreira; Bezerra, Leila Maria Lopes; Felipe, Maria Sueli; de Camargo, Zoilo Pires

    2013-01-01

    Sporothrix schenckii, previously assumed to be the sole agent of human and animal sporotrichosis, is in fact a species complex. Recently recognized taxa include S. brasiliensis, S. globosa, S. mexicana, and S. luriei, in addition to S. schenckii sensu stricto. Over the last decades, large epidemics of sporotrichosis occurred in Brazil due to zoonotic transmission, and cats were pointed out as key susceptible hosts. In order to understand the eco-epidemiology of feline sporotrichosis and its role in human sporotrichosis a survey was conducted among symptomatic cats. Prevalence and phylogenetic relationships among feline Sporothrix species were investigated by reconstructing their phylogenetic origin using the calmodulin (CAL) and the translation elongation factor-1 alpha (EF1α) loci in strains originated from Rio de Janeiro (RJ, n = 15), Rio Grande do Sul (RS, n = 10), Paraná (PR, n = 4), São Paulo (SP, n = 3) and Minas Gerais (MG, n = 1). Our results showed that S. brasiliensis is highly prevalent among cats (96.9%) with sporotrichosis, while S. schenckii was identified only once. The genotype of Sporothrix from cats was found identical to S. brasiliensis from human sources confirming that the disease is transmitted by cats. Sporothrix brasiliensis presented low genetic diversity compared to its sister taxon S. schenckii. No evidence of recombination in S. brasiliensis was found by split decomposition or PHI-test analysis, suggesting that S. brasiliensis is a clonal species. Strains recovered in states SP, MG and PR share the genotype of the RJ outbreak, different from the RS clone. The occurrence of separate genotypes among strains indicated that the Brazilian S. brasiliensis epidemic has at least two distinct sources. We suggest that cats represent a major host and the main source of cat and human S. brasiliensis infections in Brazil. PMID:23818999

  18. Phylogenetic analysis reveals a high prevalence of Sporothrix brasiliensis in feline sporotrichosis outbreaks.

    PubMed

    Rodrigues, Anderson Messias; de Melo Teixeira, Marcus; de Hoog, G Sybren; Schubach, Tânia Maria Pacheco; Pereira, Sandro Antonio; Fernandes, Geisa Ferreira; Bezerra, Leila Maria Lopes; Felipe, Maria Sueli; de Camargo, Zoilo Pires

    2013-01-01

    Sporothrix schenckii, previously assumed to be the sole agent of human and animal sporotrichosis, is in fact a species complex. Recently recognized taxa include S. brasiliensis, S. globosa, S. mexicana, and S. luriei, in addition to S. schenckii sensu stricto. Over the last decades, large epidemics of sporotrichosis occurred in Brazil due to zoonotic transmission, and cats were pointed out as key susceptible hosts. In order to understand the eco-epidemiology of feline sporotrichosis and its role in human sporotrichosis a survey was conducted among symptomatic cats. Prevalence and phylogenetic relationships among feline Sporothrix species were investigated by reconstructing their phylogenetic origin using the calmodulin (CAL) and the translation elongation factor-1 alpha (EF1α) loci in strains originated from Rio de Janeiro (RJ, n = 15), Rio Grande do Sul (RS, n = 10), Paraná (PR, n = 4), São Paulo (SP, n =3) and Minas Gerais (MG, n = 1). Our results showed that S. brasiliensis is highly prevalent among cats (96.9%) with sporotrichosis, while S. schenckii was identified only once. The genotype of Sporothrix from cats was found identical to S. brasiliensis from human sources confirming that the disease is transmitted by cats. Sporothrix brasiliensis presented low genetic diversity compared to its sister taxon S. schenckii. No evidence of recombination in S. brasiliensis was found by split decomposition or PHI-test analysis, suggesting that S. brasiliensis is a clonal species. Strains recovered in states SP, MG and PR share the genotype of the RJ outbreak, different from the RS clone. The occurrence of separate genotypes among strains indicated that the Brazilian S. brasiliensis epidemic has at least two distinct sources. We suggest that cats represent a major host and the main source of cat and human S. brasiliensis infections in Brazil.

  19. Species divergence and phylogenetic variation of ecophysiological traits in lianas and trees.

    PubMed

    Rios, Rodrigo S; Salgado-Luarte, Cristian; Gianoli, Ernesto

    2014-01-01

    The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [A(max)], dark respiration rate [R(d)], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that R(d) evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for R(d), while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. R(d) followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for A(max). R(d) may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades.

  20. Phylo.io: Interactive Viewing and Comparison of Large Phylogenetic Trees on the Web.

    PubMed

    Robinson, Oscar; Dylus, David; Dessimoz, Christophe

    2016-08-01

    Phylogenetic trees are pervasively used to depict evolutionary relationships. Increasingly, researchers need to visualize large trees and compare multiple large trees inferred for the same set of taxa (reflecting uncertainty in the tree inference or genuine discordance among the loci analyzed). Existing tree visualization tools are however not well suited to these tasks. In particular, side-by-side comparison of trees can prove challenging beyond a few dozen taxa. Here, we introduce Phylo.io, a web application to visualize and compare phylogenetic trees side-by-side. Its distinctive features are: highlighting of similarities and differences between two trees, automatic identification of the best matching rooting and leaf order, scalability to large trees, high usability, multiplatform support via standard HTML5 implementation, and possibility to store and share visualizations. The tool can be freely accessed at http://phylo.io and can easily be embedded in other web servers. The code for the associated JavaScript library is available at https://github.com/DessimozLab/phylo-io under an MIT open source license. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  1. Molecular Phylogenetics: Concepts for a Newcomer.

    PubMed

    Ajawatanawong, Pravech

    Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.

  2. Phylogenetically diverse macrophyte community promotes species diversity of mobile epi-benthic invertebrates

    NASA Astrophysics Data System (ADS)

    Nakamoto, Kenta; Hayakawa, Jun; Kawamura, Tomohiko; Kodama, Masafumi; Yamada, Hideaki; Kitagawa, Takashi; Watanabe, Yoshiro

    2018-07-01

    Various aspects of plant diversity such as species diversity and phylogenetic diversity enhance the species diversity of associated animals in terrestrial systems. In marine systems, however, the effects of macrophyte diversity on the species diversity of associated animals have received little attention. Here, we sampled in a subtropical seagrass-seaweed mixed bed to elucidate the effect of the macrophyte phylogenetic diversity based on the taxonomic relatedness as well as the macrophyte species diversity on species diversity of mobile epi-benthic invertebrates. Using regression analyses for each macrophyte parameter as well as multiple regression analyses, we found that the macrophyte phylogenetic diversity (taxonomic diversity index: Delta) positively influenced the invertebrate species richness and diversity index (H‧). Although the macrophyte species richness and H‧ also positively influenced the invertebrate species richness, the best fit model for invertebrate species richness did not include them, suggesting that the macrophyte species diversity indirectly influenced invertebrate species diversity. Possible explanations of the effects of macrophyte Delta on the invertebrate species diversity were the niche complementarity effect and the selection effect. This is the first study which demonstrates that macrophyte phylogenetic diversity has a strong effect on the species diversity of mobile epi-benthic invertebrates.

  3. pez: phylogenetics for the environmental sciences.

    PubMed

    Pearse, William D; Cadotte, Marc W; Cavender-Bares, Jeannine; Ives, Anthony R; Tucker, Caroline M; Walker, Steve C; Helmus, Matthew R

    2015-09-01

    pez is an R package that permits measurement, modelling and simulation of phylogenetic structure in ecological data. pez contains the first implementation of many methods in R, and aggregates existing data structures and methods into a single, coherent package. pez is released under the GPL v3 open-source license, available on the Internet from CRAN (http://cran.r-project.org). The package is under active development, and the authors welcome contributions (see http://github.com/willpearse/pez). will.pearse@gmail.com. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Phylogenetic Invariants for Metazoan Mitochondrial Genome Evolution.

    PubMed

    Sankoff; Blanchette

    1998-01-01

    The method of phylogenetic invariants was developed to apply to aligned sequence data generated, according to a stochastic substitution model, for N species related through an unknown phylogenetic tree. The invariants are functions of the probabilities of the observable N-tuples, which are identically zero, over all choices of branch length, for some trees. Evaluating the invariants associated with all possible trees, using observed N-tuple frequencies over all sequence positions, enables us to rapidly infer the generating tree. An aspect of evolution at the genomic level much studied recently is the rearrangements of gene order along the chromosome from one species to another. Instead of the substitutions responsible for sequence evolution, we examine the non-local processes responsible for genome rearrangements such as inversion of arbitrarily long segments of chromosomes. By treating the potential adjacency of each possible pair of genes as a position", an appropriate substitution" model can be recognized as governing the rearrangement process, and a probabilistically principled phylogenetic inference can be set up. We calculate the invariants for this process for N=5, and apply them to mitochondrial genome data from coelomate metazoans, showing how they resolve key aspects of branching order.

  5. Phylogenetic diversity and spatial distribution of the microbial community associated with the Caribbean deep-water sponge Polymastia cf. corticata by 16S rRNA, aprA, and amoA gene analysis.

    PubMed

    Meyer, Birte; Kuever, Jan

    2008-08-01

    Denaturing gradient gel electrophoresis (DGGE)-based analyses of 16S rRNA, aprA, and amoA genes demonstrated that a phylogenetically diverse and complex microbial community was associated with the Caribbean deep-water sponge Polymastia cf. corticata Ridley and Dendy, 1887. From the 38 archaeal and bacterial 16S rRNA phylotypes identified, 53% branched into the sponge-specific, monophyletic sequence clusters determined by previous studies (considering predominantly shallow-water sponge species), whereas 26% appeared to be P. cf. corticata specifically associated microorganisms ("specialists"); 21% of the phylotypes were confirmed to represent seawater- and sediment-derived proteobacterial species ("contaminants") acquired by filtration processes from the host environment. Consistently, the aprA and amoA gene-based analyses indicated the presence of environmentally derived sulfur- and ammonia-oxidizers besides putative sponge-specific sulfur-oxidizing Gammaproteobacteria and Alphaproteobacteria and a sulfate-reducing archaeon. A sponge-specific, endosymbiotic sulfur cycle as described for marine oligochaetes is proposed to be also present in P. cf. corticata. Overall, the results of this work support the recent studies that demonstrated the sponge species specificity of the associated microbial community while the biogeography of the host collection site has only a minor influence on the composition. In P. cf. corticata, the specificity of the sponge-microbe associations is even extended to the spatial distribution of the microorganisms within the sponge body; distinct bacterial populations were associated with the different tissue sections, papillae, outer and inner cortex, and choanosome. The local distribution of a phylotype within P. cf. corticata correlated with its (1) phylogenetic affiliation, (2) classification as sponge-specific or nonspecifically associated microorganism, and (3) potential ecological role in the host sponge.

  6. Source-specific pollution exposure and associations with pulmonary response in the Atlanta Commuters Exposure Studies.

    PubMed

    Krall, Jenna R; Ladva, Chandresh N; Russell, Armistead G; Golan, Rachel; Peng, Xing; Shi, Guoliang; Greenwald, Roby; Raysoni, Amit U; Waller, Lance A; Sarnat, Jeremy A

    2018-06-01

    Concentrations of traffic-related air pollutants are frequently higher within commuting vehicles than in ambient air. Pollutants found within vehicles may include those generated by tailpipe exhaust, brake wear, and road dust sources, as well as pollutants from in-cabin sources. Source-specific pollution, compared to total pollution, may represent regulation targets that can better protect human health. We estimated source-specific pollution exposures and corresponding pulmonary response in a panel study of commuters. We used constrained positive matrix factorization to estimate source-specific pollution factors and, subsequently, mixed effects models to estimate associations between source-specific pollution and pulmonary response. We identified four pollution factors that we named: crustal, primary tailpipe traffic, non-tailpipe traffic, and secondary. Among asthmatic subjects (N = 48), interquartile range increases in crustal and secondary pollution were associated with changes in lung function of -1.33% (95% confidence interval (CI): -2.45, -0.22) and -2.19% (95% CI: -3.46, -0.92) relative to baseline, respectively. Among non-asthmatic subjects (N = 51), non-tailpipe pollution was associated with pulmonary response only at 2.5 h post-commute. We found no significant associations between pulmonary response and primary tailpipe pollution. Health effects associated with traffic-related pollution may vary by source, and therefore some traffic pollution sources may require targeted interventions to protect health.

  7. Encoding phylogenetic trees in terms of weighted quartets.

    PubMed

    Grünewald, Stefan; Huber, Katharina T; Moulton, Vincent; Semple, Charles

    2008-04-01

    One of the main problems in phylogenetics is to develop systematic methods for constructing evolutionary or phylogenetic trees. For a set of species X, an edge-weighted phylogenetic X-tree or phylogenetic tree is a (graph theoretical) tree with leaf set X and no degree 2 vertices, together with a map assigning a non-negative length to each edge of the tree. Within phylogenetics, several methods have been proposed for constructing such trees that work by trying to piece together quartet trees on X, i.e. phylogenetic trees each having four leaves in X. Hence, it is of interest to characterise when a collection of quartet trees corresponds to a (unique) phylogenetic tree. Recently, Dress and Erdös provided such a characterisation for binary phylogenetic trees, that is, phylogenetic trees all of whose internal vertices have degree 3. Here we provide a new characterisation for arbitrary phylogenetic trees.

  8. Measures of phylogenetic differentiation provide robust and complementary insights into microbial communities.

    PubMed

    Parks, Donovan H; Beiko, Robert G

    2013-01-01

    High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.

  9. Internet-accessible DNA sequence database for identifying fusaria from human and animal infections.

    PubMed

    O'Donnell, Kerry; Sutton, Deanna A; Rinaldi, Michael G; Sarver, Brice A J; Balajee, S Arunmozhi; Schroers, Hans-Josef; Summerbell, Richard C; Robert, Vincent A R G; Crous, Pedro W; Zhang, Ning; Aoki, Takayuki; Jung, Kyongyong; Park, Jongsun; Lee, Yong-Hwan; Kang, Seogchan; Park, Bongsoo; Geiser, David M

    2010-10-01

    Because less than one-third of clinically relevant fusaria can be accurately identified to species level using phenotypic data (i.e., morphological species recognition), we constructed a three-locus DNA sequence database to facilitate molecular identification of the 69 Fusarium species associated with human or animal mycoses encountered in clinical microbiology laboratories. The database comprises partial sequences from three nuclear genes: translation elongation factor 1α (EF-1α), the largest subunit of RNA polymerase (RPB1), and the second largest subunit of RNA polymerase (RPB2). These three gene fragments can be amplified by PCR and sequenced using primers that are conserved across the phylogenetic breadth of Fusarium. Phylogenetic analyses of the combined data set reveal that, with the exception of two monotypic lineages, all clinically relevant fusaria are nested in one of eight variously sized and strongly supported species complexes. The monophyletic lineages have been named informally to facilitate communication of an isolate's clade membership and genetic diversity. To identify isolates to the species included within the database, partial DNA sequence data from one or more of the three genes can be used as a BLAST query against the database which is Web accessible at FUSARIUM-ID (http://isolate.fusariumdb.org) and the Centraalbureau voor Schimmelcultures (CBS-KNAW) Fungal Biodiversity Center (http://www.cbs.knaw.nl/fusarium). Alternatively, isolates can be identified via phylogenetic analysis by adding sequences of unknowns to the DNA sequence alignment, which can be downloaded from the two aforementioned websites. The utility of this database should increase significantly as members of the clinical microbiology community deposit in internationally accessible culture collections (e.g., CBS-KNAW or the Fusarium Research Center) cultures of novel mycosis-associated fusaria, along with associated, corrected sequence chromatograms and data, so that the

  10. Using Social Media to Identify Sources of Healthy Food in Urban Neighborhoods.

    PubMed

    Gomez-Lopez, Iris N; Clarke, Philippa; Hill, Alex B; Romero, Daniel M; Goodspeed, Robert; Berrocal, Veronica J; Vinod Vydiswaran, V G; Veinot, Tiffany C

    2017-06-01

    An established body of research has used secondary data sources (such as proprietary business databases) to demonstrate the importance of the neighborhood food environment for multiple health outcomes. However, documenting food availability using secondary sources in low-income urban neighborhoods can be particularly challenging since small businesses play a crucial role in food availability. These small businesses are typically underrepresented in national databases, which rely on secondary sources to develop data for marketing purposes. Using social media and other crowdsourced data to account for these smaller businesses holds promise, but the quality of these data remains unknown. This paper compares the quality of full-line grocery store information from Yelp, a crowdsourced content service, to a "ground truth" data set (Detroit Food Map) and a commercially-available dataset (Reference USA) for the greater Detroit area. Results suggest that Yelp is more accurate than Reference USA in identifying healthy food stores in urban areas. Researchers investigating the relationship between the nutrition environment and health may consider Yelp as a reliable and valid source for identifying sources of healthy food in urban environments.

  11. Choosing and Using Introns in Molecular Phylogenetics

    PubMed Central

    Creer, Simon

    2007-01-01

    Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984

  12. The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation

    PubMed Central

    Roger, Andrew J; Hug, Laura A

    2006-01-01

    Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of a multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. Furthermore, estimated dates of divergence vary hugely depending on the methods

  13. On the Shapley Value of Unrooted Phylogenetic Trees.

    PubMed

    Wicke, Kristina; Fischer, Mareike

    2018-01-17

    The Shapley value, a solution concept from cooperative game theory, has recently been considered for both unrooted and rooted phylogenetic trees. Here, we focus on the Shapley value of unrooted trees and first revisit the so-called split counts of a phylogenetic tree and the Shapley transformation matrix that allows for the calculation of the Shapley value from the edge lengths of a tree. We show that non-isomorphic trees may have permutation-equivalent Shapley transformation matrices and permutation-equivalent null spaces. This implies that estimating the split counts associated with a tree or the Shapley values of its leaves does not suffice to reconstruct the correct tree topology. We then turn to the use of the Shapley value as a prioritization criterion in biodiversity conservation and compare it to a greedy solution concept. Here, we show that for certain phylogenetic trees, the Shapley value may fail as a prioritization criterion, meaning that the diversity spanned by the top k species (ranked by their Shapley values) cannot approximate the total diversity of all n species.

  14. Tree-Based Unrooted Phylogenetic Networks.

    PubMed

    Francis, A; Huber, K T; Moulton, V

    2018-02-01

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.

  15. Phylogenetic diversity of plants alters the effect of species richness on invertebrate herbivory

    PubMed Central

    2013-01-01

    Long-standing ecological theory proposes that diverse communities of plants should experience a decrease in herbivory. Yet previous empirical examinations of this hypothesis have revealed that plant species richness increases herbivory in just as many systems as it decreases it. In this study, I ask whether more insight into the role of plant diversity in promoting or suppressing herbivory can be gained by incorporating information about the evolutionary history of species in a community. In an old field system in southern Ontario, I surveyed communities of plants and measured levels of leaf damage on 27 species in 38 plots. I calculated a measure of phylogenetic diversity (PSE) that encapsulates information about the amount of evolutionary history represented in each of the plots and looked for a relationship between levels of herbivory and both species richness and phylogenetic diversity using a generalized linear mixed model (GLMM) that could account for variation in herbivory levels between species. I found that species richness was positively associated with herbivore damage at the plot-level, in keeping with the results from several other recent studies on this question. On the other hand, phylogenetic diversity was associated with decreased herbivory. Importantly, there was also an interaction between species richness and phylogenetic diversity, such that plots with the highest levels of herbivory were plots which had many species but only if those species tended to be closely related to one another. I propose that these results are the consequence of interactions with herbivores whose diets are phylogenetically specialized (for which I introduce the term cladophage), and how phylogenetic diversity may alter their realized host ranges. These results suggest that incorporating a phylogenetic perspective can add valuable additional insight into the role of plant diversity in explaining or predicting levels of herbivory at a whole-community scale. PMID:23825795

  16. On Identifying the Sound Sources in a Turbulent Flow

    NASA Technical Reports Server (NTRS)

    Goldstein, M. E.

    2008-01-01

    A space-time filtering approach is used to divide an unbounded turbulent flow into its radiating and non-radiating components. The result is then used to clarify a number of issues including the possibility of identifying the sources of the sound in such flows. It is also used to investigate the efficacy of some of the more recent computational approaches.

  17. TreeVector: scalable, interactive, phylogenetic trees for the web.

    PubMed

    Pethica, Ralph; Barker, Gary; Kovacs, Tim; Gough, Julian

    2010-01-28

    Phylogenetic trees are complex data forms that need to be graphically displayed to be human-readable. Traditional techniques of plotting phylogenetic trees focus on rendering a single static image, but increases in the production of biological data and large-scale analyses demand scalable, browsable, and interactive trees. We introduce TreeVector, a Scalable Vector Graphics-and Java-based method that allows trees to be integrated and viewed seamlessly in standard web browsers with no extra software required, and can be modified and linked using standard web technologies. There are now many bioinformatics servers and databases with a range of dynamic processes and updates to cope with the increasing volume of data. TreeVector is designed as a framework to integrate with these processes and produce user-customized phylogenies automatically. We also address the strengths of phylogenetic trees as part of a linked-in browsing process rather than an end graphic for print. TreeVector is fast and easy to use and is available to download precompiled, but is also open source. It can also be run from the web server listed below or the user's own web server. It has already been deployed on two recognized and widely used database Web sites.

  18. A Nondestructive Method to Identify POP Contamination Sources in Omnivorous Seabirds.

    PubMed

    Michielsen, Rosanne J; Shamoun-Baranes, Judy; Parsons, John R; Kraak, Michiel H S

    2018-03-13

    Persistent organic pollutants (POPs) are present in almost all environments due to their high bioaccumulation potential. Especially species that adapted to human activities, like gulls, might be exposed to harmful concentrations of these chemicals. The nature and degree of the exposure to POPs greatly vary between individual gulls, due to their diverse foraging behavior and specialization in certain foraging tactics. Therefore, in order clarify the effect of POP-contaminated areas on gull populations, it is important to identify the sources of POP contamination in individual gulls. Conventional sampling methods applied when studying POP contamination are destructive and ethically undesired. The aim of this literature review was to evaluate the potential of using feathers as a nondestructive method to determine sources of POP contamination in individual gulls. The reviewed data showed that high concentrations of PCBs and PBDEs in feathers together with a large proportion of less bioaccumulative congeners may indicate that the contamination originates from landfills. Low PCB and PBDE concentrations in feathers and a large proportion of more bioaccumulative congeners could indicate that the contamination originates from marine prey. We propose a nondestructive approach to identify the source of contamination in individual gulls based on individual contamination levels and PCB and PBDE congener profiles in feathers. Despite some uncertainties that might be reduced by future research, we conclude that especially when integrated with other methods like GPS tracking and the analysis of stable isotopic signatures, identifying the source of POP contamination based on congener profiles in feathers could become a powerful nondestructive method.

  19. Nodal distances for rooted phylogenetic trees.

    PubMed

    Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel

    2010-08-01

    Dissimilarity measures for (possibly weighted) phylogenetic trees based on the comparison of their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem, by splitting in a suitable way each path length between two taxa into two lengths. We prove that the resulting splitted path lengths matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces M(n)(R) of real-valued n x n matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using L(p) metrics on M(n)(R), with p [epsilon] R(>0).

  20. Molecular phylogenetic reconstruction of the endemic Asian salamander family Hynobiidae (Amphibia, Caudata).

    PubMed

    Weisrock, David W; Macey, J Robert; Matsui, Masafumi; Mulcahy, Daniel G; Papenfuss, Theodore J

    2013-01-01

    The salamander family Hynobiidae contains over 50 species and has been the subject of a number of molecular phylogenetic investigations aimed at reconstructing branches across the entire family. In general, studies using the greatest amount of sequence data have used reduced taxon sampling, while the study with the greatest taxon sampling has used a limited sequence data set. Here, we provide insights into the phylogenetic history of the Hynobiidae using both dense taxon sampling and a large mitochondrial DNA sequence data set. We report exclusive new mitochondrial DNA data of 2566 aligned bases (with 151 excluded sites, of included sites 1157 are variable with 957 parsimony informative). This is sampled from two genic regions encoding a 12S-16S region (the 3' end of 12S rRNA, tRNA(VAI), and the 5' end of 16S rRNA), and a ND2-COI region (ND2, tRNA(Trp), tRNA(Ala), tRNA(Asn), the origin for light strand replication--O(L), tRNA(Cys), tRNAT(Tyr), and the 5' end of COI). Analyses using parsimony, Bayesian, and maximum likelihood optimality criteria produce similar phylogenetic trees, with discordant branches generally receiving low levels of branch support. Monophyly of the Hynobiidae is strongly supported across all analyses, as is the sister relationship and deep divergence between the genus Onychodactylus with all remaining hynobiids. Within this latter grouping our phylogenetic results identify six clades that are relatively divergent from one another, but for which there is minimal support for their phylogenetic placement. This includes the genus Batrachuperus, the genus Hynobius, the genus Pachyhynobius, the genus Salamandrella, a clade containing the genera Ranodon and Paradactylodon, and a clade containing the genera Liua and Pseudohynobius. This latter clade receives low bootstrap support in the parsimony analysis, but is consistent across all three analytical methods. Our results also clarify a number of well-supported relationships within the larger

  1. Phylogenetic mixtures and linear invariants for equal input models.

    PubMed

    Casanellas, Marta; Steel, Mike

    2017-04-01

    The reconstruction of phylogenetic trees from molecular sequence data relies on modelling site substitutions by a Markov process, or a mixture of such processes. In general, allowing mixed processes can result in different tree topologies becoming indistinguishable from the data, even for infinitely long sequences. However, when the underlying Markov process supports linear phylogenetic invariants, then provided these are sufficiently informative, the identifiability of the tree topology can be restored. In this paper, we investigate a class of processes that support linear invariants once the stationary distribution is fixed, the 'equal input model'. This model generalizes the 'Felsenstein 1981' model (and thereby the Jukes-Cantor model) from four states to an arbitrary number of states (finite or infinite), and it can also be described by a 'random cluster' process. We describe the structure and dimension of the vector spaces of phylogenetic mixtures and of linear invariants for any fixed phylogenetic tree (and for all trees-the so called 'model invariants'), on any number n of leaves. We also provide a precise description of the space of mixtures and linear invariants for the special case of [Formula: see text] leaves. By combining techniques from discrete random processes and (multi-) linear algebra, our results build on a classic result that was first established by James Lake (Mol Biol Evol 4:167-191, 1987).

  2. galaxie--CGI scripts for sequence identification through automated phylogenetic analysis.

    PubMed

    Nilsson, R Henrik; Larsson, Karl-Henrik; Ursing, Björn M

    2004-06-12

    The prevalent use of similarity searches like BLAST to identify sequences and species implicitly assumes the reference database to be of extensive sequence sampling. This is often not the case, restraining the correctness of the outcome as a basis for sequence identification. Phylogenetic inference outperforms similarity searches in retrieving correct phylogenies and consequently sequence identities, and a project was initiated to design a freely available script package for sequence identification through automated Web-based phylogenetic analysis. Three CGI scripts were designed to facilitate qualified sequence identification from a Web interface. Query sequences are aligned to pre-made alignments or to alignments made by ClustalW with entries retrieved from a BLAST search. The subsequent phylogenetic analysis is based on the PHYLIP package for inferring neighbor-joining and parsimony trees. The scripts are highly configurable. A service installation and a version for local use are found at http://andromeda.botany.gu.se/galaxiewelcome.html and http://galaxie.cgb.ki.se

  3. The use of source memory to identify one's own episodic confusion errors.

    PubMed

    Smith, S M; Tindell, D R; Pierce, B H; Gilliland, T R; Gerkens, D R

    2001-03-01

    In 4 category cued recall experiments, participants falsely recalled nonlist common members, a semantic confusion error. Errors were more likely if critical nonlist words were presented on an incidental task, causing source memory failures called episodic confusion errors. Participants could better identify the source of falsely recalled words if they had deeply processed the words on the incidental task. For deep but not shallow processing, participants could reliably include or exclude incidentally shown category members in recall. The illusion that critical items actually appeared on categorized lists was diminished but not eradicated when participants identified episodic confusion errors post hoc among their own recalled responses; participants often believed that critical items had been on both the incidental task and the study list. Improved source monitoring can potentially mitigate episodic (but not semantic) confusion errors.

  4. Gulls identified as major source of fecal pollution in coastal waters: a microbial source tracking study.

    PubMed

    Araújo, Susana; Henriques, Isabel S; Leandro, Sérgio Miguel; Alves, Artur; Pereira, Anabela; Correia, António

    2014-02-01

    Gulls were reported as sources of fecal pollution in coastal environments and potential vectors of human infections. Microbial source tracking (MST) methods were rarely tested to identify this pollution origin. This study was conducted to ascertain the source of water fecal contamination in the Berlenga Island, Portugal. A total of 169 Escherichia coli isolates from human sewage, 423 isolates from gull feces and 334 water isolates were analyzed by BOX-PCR. An average correct classification of 79.3% was achieved. When an 85% similarity cutoff was applied 24% of water isolates were present in gull feces against 2.7% detected in sewage. Jackknifing resulted in 29.3% of water isolates classified as gull, and 10.8% classified as human. Results indicate that gulls constitute a major source of water contamination in the Berlenga Island. This study validated a methodology to differentiate human and gull fecal pollution sources in a real case of a contaminated beach. © 2013.

  5. Extinction in Phylogenetics and Biogeography: From Timetrees to Patterns of Biotic Assemblage

    PubMed Central

    Meseguer, Andrea S.

    2016-01-01

    Global climate change and its impact on biodiversity levels have made extinction a relevant topic in biological research. Yet, until recently, extinction has received less attention in macroevolutionary studies than speciation; the reason is the difficulty to infer an event that actually eliminates rather than creates new taxa. For example, in biogeography, extinction has often been seen as noise, introducing homoplasy in biogeographic relationships, rather than a pattern-generating process. The molecular revolution and the possibility to integrate time into phylogenetic reconstructions have allowed studying extinction under different perspectives. Here, we review phylogenetic (temporal) and biogeographic (spatial) approaches to the inference of extinction and the challenges this process poses for reconstructing evolutionary history. Specifically, we focus on the problem of discriminating between alternative high extinction scenarios using time trees with only extant taxa, and on the confounding effect introduced by asymmetric spatial extinction – different rates of extinction across areas – in biogeographic inference. Finally, we identify the most promising avenues of research in both fields, which include the integration of additional sources of evidence such as the fossil record or environmental information in birth–death models and biogeographic reconstructions, the development of new models that tie extinction rates to phenotypic or environmental variation, or the implementation within a Bayesian framework of parametric non-stationary biogeographic models. PMID:27047538

  6. A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.

    PubMed

    Takahashi, Tetsumi; Moreno, Edmundo

    2015-12-01

    The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group were diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. Selection and paucity of phylogenetic signal challenge the utility of alpha-tubulin in reconstruction of evolutionary history of free-living litostomateans (Protista, Ciliophora).

    PubMed

    Rajter, Ľubomír; Vďačný, Peter

    2018-05-12

    The class Litostomatea represents a highly diverse but monophyletic group, uniting both free-living and endosymbiotic ciliates. Ribosomal RNA genes and ITS-region sequences helped to recognize and define the main litostomatean lineages, but did not provide enough phylogenetic signal to unambiguously resolve their interrelationships. In this study, we attempted to improve the resolution among main free-living predatory lineages by adding the gene coding for alpha-tubulin. However, our phylogenetic analyses challenged the performance of alpha-tubulin in reconstruction of evolutionary history of free-living litostomateans. We identified several mutually interconnected problems associated with the ciliate alpha-tubulin gene: the paucity of phylogenetic signal, molecular homoplasies and non-neutral evolution. Positive selection may generate molecular homoplasies (parallel evolution), while negative selection may cause a small number of changes and hence little phylogenetic informativness. Both problems were encountered in nucleotide and amino acid alpha-tubulin alignments, indicating an action of various selective pressures. Taking into account the involvement of alpha-tubulin in many essential biological processes, this protein could be so strongly affected by purifying selection that it even might have become an inappropriate molecular marker for reconstruction of phylogenetic relationships. Therefore, a great caution should be paid when tubulin genes are included in phylogenetic and/or phylogenomic analyses. Copyright © 2018 Elsevier Inc. All rights reserved.

  8. On Tree-Based Phylogenetic Networks.

    PubMed

    Zhang, Louxin

    2016-07-01

    A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.

  9. Distributed watershed modeling of design storms to identify nonpoint source loading areas

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Endreny, T.A.; Wood, E.F.

    1999-03-01

    Watershed areas that generate nonpoint source (NPS) polluted runoff need to be identified prior to the design of basin-wide water quality projects. Current watershed-scale NPS models lack a variable source area (VSA) hydrology routine, and are therefore unable to identify spatially dynamic runoff zones. The TOPLATS model used a watertable-driven VSA hydrology routine to identify runoff zones in a 17.5 km{sup 2} agricultural watershed in central Oklahoma. Runoff areas were identified in a static modeling framework as a function of prestorm watertable depth and also in a dynamic modeling framework by simulating basin response to 2, 10, and 25 yrmore » return period 6 h design storms. Variable source area expansion occurred throughout the duration of each 6 h storm and total runoff area increased with design storm intensity. Basin-average runoff rates of 1 mm h{sup {minus}1} provided little insight into runoff extremes while the spatially distributed analysis identified saturation excess zones with runoff rates equaling effective precipitation. The intersection of agricultural landcover areas with these saturation excess runoff zones targeted the priority potential NPS runoff zones that should be validated with field visits. These intersected areas, labeled as potential NPS runoff zones, were mapped within the watershed to demonstrate spatial analysis options available in TOPLATS for managing complex distributions of watershed runoff. TOPLATS concepts in spatial saturation excess runoff modelling should be incorporated into NPS management models.« less

  10. Amine control for DUV lithography: identifying hidden sources

    NASA Astrophysics Data System (ADS)

    Kishkovich, Oleg P.; Larson, Carl E.

    2000-06-01

    The impact of airborne basic molecular contamination (MB) on the performance of chemically amplified (CA) resist systems has been a long standing problem. Low ppb levels of MB may be sufficient for robust 0.25 micrometer lithography with today's advanced CA resist systems combined with adequate chemical air filtration. However, with minimum CD targets heading below 150 nm, the introduction of new resist chemistries for Next Generation Lithography, and the trend towards thinner resists, the impact of MB at low and sub-ppb levels again becomes a critical manufacturing issue. Maximizing process control at aggressive feature sizes requires that the level of MB be maintained below a certain limit, which depends on such parameters as the sensitivity of the CA resist, the type of production tools, product mix, and process characteristics. Three approaches have been identified to reduce the susceptibility of CA resists to MB: effective chemical air filtration, modifications to resist chemistry/processing and cleanroom protocols involving MB monitoring and removal of MB sources from the fab. The final MB concentration depends on the effectiveness of filtration resources and on the total pollution originating from different sources in and out of the cleanroom. There are many well-documented sources of MB. Among these are: ambient air; polluted exhaust from other manufacturing areas re-entering the cleanroom through make-up air handlers; manufacturing process chemicals containing volatile molecular bases; certain cleanroom construction materials, such as paint and ceiling tiles; and volatile, humidifier system boiler additives (corrosion inhibitors), such as morpholine, cyclohexylamine, and dimethylaminoethanol. However, there is also an indeterminate number of other 'hidden' pollution sources, which are neither obvious nor well-documented. None of these sources are new, but they had little impact on earlier semiconductor manufacturing processes because the contamination

  11. Contribution of WUSCHEL-related homeobox (WOX) genes to identify the phylogenetic relationships among Petunia species

    PubMed Central

    Segatto, Ana Lúcia Anversa; Thompson, Claudia Elizabeth; Freitas, Loreta Brandão

    2016-01-01

    Abstract Developmental genes are believed to contribute to major changes during plant evolution, from infrageneric to higher levels. Due to their putative high sequence conservation, developmental genes are rarely used as molecular markers, and few studies including these sequences at low taxonomic levels exist. WUSCHEL-related homeobox genes (WOX) are transcription factors exclusively present in plants and are involved in developmental processes. In this study, we characterized the infrageneric genetic variation of Petunia WOX genes. We obtained phylogenetic relationships consistent with other phylogenies based on nuclear markers, but with higher statistical support, resolution in terminals, and compatibility with flower morphological changes. PMID:27768156

  12. Phylogenetic systematics of the genus Echinococcus (Cestoda: Taeniidae).

    PubMed

    Nakao, Minoru; Lavikainen, Antti; Yanagida, Tetsuya; Ito, Akira

    2013-11-01

    Echinococcosis is a serious helminthic zoonosis in humans, livestock and wildlife. The pathogenic organisms are members of the genus Echinococcus (Cestoda: Taeniidae). Life cycles of Echinococcus spp. are consistently dependent on predator-prey association between two obligate mammalian hosts. Carnivores (canids and felids) serve as definitive hosts for adult tapeworms and their herbivore prey (ungulates, rodents and lagomorphs) as intermediate hosts for metacestode larvae. Humans are involved as an accidental host for metacestode infections. The metacestodes develop in various internal organs, particularly in liver and lungs. Each metacestode of Echinococcus spp. has an organotropism and a characteristic form known as an unilocular (cystic), alveolar or polycystic hydatid. Recent molecular phylogenetic studies have demonstrated that the type species, Echinococcus granulosus, causing cystic echinococcosis is a cryptic species complex. Therefore, the orthodox taxonomy of Echinococcus established from morphological criteria has been revised from the standpoint of phylogenetic systematics. Nine valid species including newly resurrected taxa are recognised as a result of the revision. This review summarises the recent advances in the phylogenetic systematics of Echinococcus, together with the historical backgrounds and molecular epidemiological aspects of each species. A new phylogenetic tree inferred from the mitochondrial genomes of all valid Echinococcus spp. is also presented. The taxonomic nomenclature for Echinococcus oligarthrus is shown to be incorrect and this name should be replaced with Echinococcus oligarthra. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.

  13. Molecular, phylogenetic and comparative genomic analysis of the cytokinin oxidase/dehydrogenase gene family in the Poaceae.

    PubMed

    Mameaux, Sabine; Cockram, James; Thiel, Thomas; Steuernagel, Burkhard; Stein, Nils; Taudien, Stefan; Jack, Peter; Werner, Peter; Gray, John C; Greenland, Andy J; Powell, Wayne

    2012-01-01

    The genomes of cereals such as wheat (Triticum aestivum) and barley (Hordeum vulgare) are large and therefore problematic for the map-based cloning of agronomicaly important traits. However, comparative approaches within the Poaceae permit transfer of molecular knowledge between species, despite their divergence from a common ancestor sixty million years ago. The finding that null variants of the rice gene cytokinin oxidase/dehydrogenase 2 (OsCKX2) result in large yield increases provides an opportunity to explore whether similar gains could be achieved in other Poaceae members. Here, phylogenetic, molecular and comparative analyses of CKX families in the sequenced grass species rice, brachypodium, sorghum, maize and foxtail millet, as well as members identified from the transcriptomes/genomes of wheat and barley, are presented. Phylogenetic analyses define four Poaceae CKX clades. Comparative analyses showed that CKX phylogenetic groupings can largely be explained by a combination of local gene duplication, and the whole-genome duplication event that predates their speciation. Full-length OsCKX2 homologues in barley (HvCKX2.1, HvCKX2.2) and wheat (TaCKX2.3, TaCKX2.4, TaCKX2.5) are characterized, with comparative analysis at the DNA, protein and genetic/physical map levels suggesting that true CKX2 orthologs have been identified. Furthermore, our analysis shows CKX2 genes in barley and wheat have undergone a Triticeae-specific gene-duplication event. Finally, by identifying ten of the eleven CKX genes predicted to be present in barley by comparative analyses, we show that next-generation sequencing approaches can efficiently determine the gene space of large-genome crops. Together, this work provides the foundation for future functional investigation of CKX family members within the Poaceae. © 2011 National Institute of Agricultural Botany (NIAB). Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell

  14. Optical Spectra of Four Objects Identified with Variable Radio Sources

    NASA Astrophysics Data System (ADS)

    Chavushyan, V.; Mujica, R.; Gorshkov, A. G.; Konnikova, V. K.; Mingaliev, M. G.

    2000-06-01

    We obtained optical spectra of four objects identified with variable radio sources. Three objects (0029+0554, 0400+0550, 2245+0500) were found to be quasars with redshifts of 1.314, 0.761, and 1.091. One object (2349+0534) has a continuum spectrum characteristic of BL Lac objects. We analyze spectra of the radio sources in the range 0.97-21.7 GHz for the epoch 1997 and in the range 3.9-11.1 GHz for the epoch 1990, as well as the pattern of variability of their flux densities on time scales of 1.5 and 7 years.

  15. Recovery and phylogenetic diversity of culturable fungi associated with marine sponges Clathrina luteoculcitella and Holoxea sp. in the South China Sea.

    PubMed

    Ding, Bo; Yin, Ying; Zhang, Fengli; Li, Zhiyong

    2011-08-01

    Sponge-associated fungi represent an important source of marine natural products, but little is known about the fungal diversity and the relationship of sponge-fungal association, especially no research on the fungal diversity in the South China Sea sponge has been reported. In this study, a total of 111 cultivable fungi strains were isolated from two South China Sea sponges Clathrina luteoculcitella and Holoxea sp. using eight different media. Thirty-two independent representatives were selected for analysis of phylogenetic diversity according to ARDRA and morphological characteristics. The culturable fungal communities consisted of at least 17 genera within ten taxonomic orders of two phyla (nine orders of the phylum Ascomycota and one order of the phylum Basidiomycota) including some potential novel marine fungi. Particularly, eight genera of Apiospora, Botryosphaeria, Davidiella, Didymocrea, Lentomitella, Marasmius, Pestalotiopsis, and Rhizomucor were isolated from sponge for the first time. Sponge C. luteoculcitella has greater culturable fungal diversity than sponge Holoxea sp. Five genera of Aspergillus, Davidiella, Fusarium, Paecilomyces, and Penicillium were isolated from both sponges, while 12 genera of Apiospora, Botryosphaeria, Candida, Marasmius, Cladosporium, Didymocrea, Hypocrea, Lentomitella, Nigrospora, Pestalotiopsis, Rhizomucor, and Scopulariopsis were isolated from sponge C. luteoculcitella only. Order Eurotiales especially genera Penicillium, Aspergillus, and order Hypocreales represented the dominant culturable fungi in these two South China Sea sponges. Nigrospora oryzae strain PF18 isolated from sponge C. luteoculcitella showed a strong and broad spectrum antimicrobial activities suggesting the potential for antimicrobial compounds production.

  16. Evaluation of PCB sources and releases for identifying priorities to reduce PCBs in Washington State (USA).

    PubMed

    Davies, Holly; Delistraty, Damon

    2016-02-01

    Polychlorinated biphenyls (PCBs) are ubiquitously distributed in the environment and produce multiple adverse effects in humans and wildlife. As a result, the purpose of our study was to characterize PCB sources in anthropogenic materials and releases to the environment in Washington State (USA) in order to formulate recommendations to reduce PCB exposures. Methods included review of relevant publications (e.g., open literature, industry studies and reports, federal and state government databases), scaling of PCB sources from national or county estimates to state estimates, and communication with industry associations and private and public utilities. Recognizing high associated uncertainty due to incomplete data, we strived to provide central tendency estimates for PCB sources. In terms of mass (high to low), PCB sources include lamp ballasts, caulk, small capacitors, large capacitors, and transformers. For perspective, these sources (200,000-500,000 kg) overwhelm PCBs estimated to reside in the Puget Sound ecosystem (1500 kg). Annual releases of PCBs to the environment (high to low) are attributed to lamp ballasts (400-1500 kg), inadvertent generation by industrial processes (900 kg), caulk (160 kg), small capacitors (3-150 kg), large capacitors (10-80 kg), pigments and dyes (0.02-31 kg), and transformers (<2 kg). Recommendations to characterize the extent of PCB distribution and decrease exposures include assessment of PCBs in buildings (e.g., schools) and replacement of these materials, development of Best Management Practices (BMPs) to contain PCBs, reduction of inadvertent generation of PCBs in consumer products, expansion of environmental monitoring and public education, and research to identify specific PCB congener profiles in human tissues.

  17. Phylogenetic affinity of tree shrews to Glires is attributed to fast evolution rate.

    PubMed

    Lin, Jiannan; Chen, Guangfeng; Gu, Liang; Shen, Yuefeng; Zheng, Meizhu; Zheng, Weisheng; Hu, Xinjie; Zhang, Xiaobai; Qiu, Yu; Liu, Xiaoqing; Jiang, Cizhong

    2014-02-01

    Previous phylogenetic analyses have led to incongruent evolutionary relationships between tree shrews and other suborders of Euarchontoglires. What caused the incongruence remains elusive. In this study, we identified 6845 orthologous genes between seventeen placental mammals. Tree shrews and Primates were monophyletic in the phylogenetic trees derived from the first or/and second codon positions whereas tree shrews and Glires formed a monophyly in the trees derived from the third or all codon positions. The same topology was obtained in the phylogeny inference using the slowly and fast evolving genes, respectively. This incongruence was likely attributed to the fast substitution rate in tree shrews and Glires. Notably, sequence GC content only was not informative to resolve the controversial phylogenetic relationships between tree shrews, Glires, and Primates. Finally, estimation in the confidence of the tree selection strongly supported the phylogenetic affiliation of tree shrews to Primates as a monophyly. Copyright © 2013 Elsevier Inc. All rights reserved.

  18. Determinants of plant community assembly in a mosaic of landscape units in central Amazonia: ecological and phylogenetic perspectives.

    PubMed

    Umaña, María Natalia; Norden, Natalia; Cano, Angela; Stevenson, Pablo R

    2012-01-01

    The Amazon harbours one of the richest ecosystems on Earth. Such diversity is likely to be promoted by plant specialization, associated with the occurrence of a mosaic of landscape units. Here, we integrate ecological and phylogenetic data at different spatial scales to assess the importance of habitat specialization in driving compositional and phylogenetic variation across the Amazonian forest. To do so, we evaluated patterns of floristic dissimilarity and phylogenetic turnover, habitat association and phylogenetic structure in three different landscape units occurring in terra firme (Hilly and Terrace) and flooded forests (Igapó). We established two 1-ha tree plots in each of these landscape units at the Caparú Biological Station, SW Colombia, and measured edaphic, topographic and light variables. At large spatial scales, terra firme forests exhibited higher levels of species diversity and phylodiversity than flooded forests. These two types of forests showed conspicuous differences in species and phylogenetic composition, suggesting that environmental sorting due to flood is important, and can go beyond the species level. At a local level, landscape units showed floristic divergence, driven both by geographical distance and by edaphic specialization. In terms of phylogenetic structure, Igapó forests showed phylogenetic clustering, whereas Hilly and Terrace forests showed phylogenetic evenness. Within plots, however, local communities did not show any particular trend. Overall, our findings suggest that flooded forests, characterized by stressful environments, impose limits to species occurrence, whereas terra firme forests, more environmentally heterogeneous, are likely to provide a wider range of ecological conditions and therefore to bear higher diversity. Thus, Amazonia should be considered as a mosaic of landscape units, where the strength of habitat association depends upon their environmental properties.

  19. Determinants of Plant Community Assembly in a Mosaic of Landscape Units in Central Amazonia: Ecological and Phylogenetic Perspectives

    PubMed Central

    Umaña, María Natalia; Norden, Natalia; Cano, Ángela; Stevenson, Pablo R.

    2012-01-01

    The Amazon harbours one of the richest ecosystems on Earth. Such diversity is likely to be promoted by plant specialization, associated with the occurrence of a mosaic of landscape units. Here, we integrate ecological and phylogenetic data at different spatial scales to assess the importance of habitat specialization in driving compositional and phylogenetic variation across the Amazonian forest. To do so, we evaluated patterns of floristic dissimilarity and phylogenetic turnover, habitat association and phylogenetic structure in three different landscape units occurring in terra firme (Hilly and Terrace) and flooded forests (Igapó). We established two 1-ha tree plots in each of these landscape units at the Caparú Biological Station, SW Colombia, and measured edaphic, topographic and light variables. At large spatial scales, terra firme forests exhibited higher levels of species diversity and phylodiversity than flooded forests. These two types of forests showed conspicuous differences in species and phylogenetic composition, suggesting that environmental sorting due to flood is important, and can go beyond the species level. At a local level, landscape units showed floristic divergence, driven both by geographical distance and by edaphic specialization. In terms of phylogenetic structure, Igapó forests showed phylogenetic clustering, whereas Hilly and Terrace forests showed phylogenetic evenness. Within plots, however, local communities did not show any particular trend. Overall, our findings suggest that flooded forests, characterized by stressful environments, impose limits to species occurrence, whereas terra firme forests, more environmentally heterogeneous, are likely to provide a wider range of ecological conditions and therefore to bear higher diversity. Thus, Amazonia should be considered as a mosaic of landscape units, where the strength of habitat association depends upon their environmental properties. PMID:23028844

  20. Trait-based assembly and phylogenetic structure in northeast Pacific rockfish assemblages.

    PubMed

    Ingram, Travis; Shurin, Jonathan B

    2009-09-01

    If natural communities are assembled according to deterministic rules, coexisting species will represent a nonrandom subset of the potential species pool. We tested for signatures of assembly rules in the distribution of species' traits in Pacific rockfish (Sebastes spp.) assemblages. We used morphology, dietary niche (estimated with stable nitrogen isotopes), and distribution data to identify traits that relate to local-scale resource use (the alpha-niche) and to environmental gradients (the beta-niche). We showed that gill raker morphology was related to trophic position (an alpha-niche axis), while relative eye size was associated with depth habitat (a beta-niche axis). We therefore hypothesized that, within assemblages of coexisting rockfish species, the gill raker trait would be overdispersed (evenly spaced) due to limiting similarity, while relative eye size would be clustered due to environmental filtering. We examined the evolutionary relatedness of coexisting species to ask whether phylogenetic community structure and trait distributions gave similar indications about the roles of assembly processes. We tested the trait distributions and phylogenetic structure of 30 published rockfish assemblages against a null model of random community assembly. As predicted, the gill raker trait tended to be more evenly spaced than expected by chance, as did overall body size, while relative eye size was more clustered than expected. Phylogenetic community structure appeared to reflect historical dispersal and speciation and did not provide consistent support for assembly rules. Our results indicate that rockfish community assembly is nonrandom with regard to species' traits and show how distinguishing traits related to the alpha- and beta-niches and incorporating functional morphology can provide for powerful tests of assembly rules.

  1. Spatial patterns of phylogenetic diversity.

    PubMed

    Morlon, Hélène; Schwilk, Dylan W; Bryant, Jessica A; Marquet, Pablo A; Rebelo, Anthony G; Tauss, Catherine; Bohannan, Brendan J M; Green, Jessica L

    2011-02-01

    Ecologists and conservation biologists have historically used species-area and distance-decay relationships as tools to predict the spatial distribution of biodiversity and the impact of habitat loss on biodiversity. These tools treat each species as evolutionarily equivalent, yet the importance of species' evolutionary history in their ecology and conservation is becoming increasingly evident. Here, we provide theoretical predictions for phylogenetic analogues of the species-area and distance-decay relationships. We use a random model of community assembly and a spatially explicit flora dataset collected in four Mediterranean-type regions to provide theoretical predictions for the increase in phylogenetic diversity - the total phylogenetic branch-length separating a set of species - with increasing area and the decay in phylogenetic similarity with geographic separation. These developments may ultimately provide insights into the evolution and assembly of biological communities, and guide the selection of protected areas. © 2010 Blackwell Publishing Ltd/CNRS.

  2. Phylogenetic analysis of honey bee behavioral evolution.

    PubMed

    Raffiudin, Rika; Crozier, Ross H

    2007-05-01

    DNA sequences from three mitochondrial (rrnL, cox2, nad2) and one nuclear gene (itpr) from all 9 known honey bee species (Apis), a 10th possible species, Apis dorsata binghami, and three outgroup species (Bombus terrestris, Melipona bicolor and Trigona fimbriata) were used to infer Apis phylogenetic relationships using Bayesian analysis. The dwarf honey bees were confirmed as basal, and the giant and cavity-nesting species to be monophyletic. All nodes were strongly supported except that grouping Apis cerana with A. nigrocincta. Two thousand post-burnin trees from the phylogenetic analysis were used in a Bayesian comparative analysis to explore the evolution of dance type, nest structure, comb structure and dance sound within Apis. The ancestral honey bee species was inferred with high support to have nested in the open, and to have more likely than not had a silent vertical waggle dance and a single comb. The common ancestor of the giant and cavity-dwelling bees is strongly inferred to have had a buzzing vertical directional dance. All pairwise combinations of characters showed strong association, but the multiple comparisons problem reduces the ability to infer associations between states between characters. Nevertheless, a buzzing dance is significantly associated with cavity-nesting, several vertical combs, and dancing vertically, a horizontal dance is significantly associated with a nest with a single comb wrapped around the support, and open nesting with a single pendant comb and a silent waggle dance.

  3. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing.

    PubMed

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education.

  4. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing

    PubMed Central

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J.; O'Donnell, Kerry; Geiser, David M.; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  5. Morphological, molecular and phylogenetic analyses of Diplotriaena bargusinica Skrjabin, 1917 (Nematoda: Diplotriaenidae).

    PubMed

    Dutra Vieira, Thainá; Pegoraro de Macedo, Marcia Raquel; Fedatto Bernardon, Fabiana; Müller, Gertrud

    2017-10-01

    The nematode Diplotriaena bargusinica is a bird air sac parasite, and its taxonomy is based mainly on morphological and morphometric characteristics. Increasing knowledge of genetic information variability has spurred the use of DNA markers in conjunction with morphological data for inferring phylogenetic relationships in different taxa. Considering the potential of molecular biology in taxonomy, this study presents the morphological and molecular characterization of D. bargusinica, and establishes the phylogenetic position of the nematode in Spirurina. Twenty partial sequences of the 18S region of D. bargusinica rDNA were generated. Phylogenetic trees were obtained through the Maximum Likelihood and Bayesian Inference methods where both had similar topology. The group Diplotriaenoidea is monophyletic and the topologies generated corroborate the phylogenetic studies based on traditional and previously performed molecular taxonomy. This study is the first to generate molecular data associated with the morphology of the species. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Phylogenetic overdispersion of plant species in southern Brazilian savannas.

    PubMed

    Silva, I A; Batalha, M A

    2009-08-01

    Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutionary processes. Based on these assumptions, we may infer mechanisms of contemporary coexistence from the phylogenetic relationships of the species in a community. We studied the phylogenetic structure of plant communities in four cerrado sites, in southeastern Brazil. We calculated two raw phylogenetic distances among the species sampled. We estimated the phylogenetic structure by comparing the observed phylogenetic distances to the distribution of phylogenetic distances in null communities. We obtained null communities by randomizing the phylogenetic relationships of the regional pool of species. We found a phylogenetic overdispersion of the cerrado species. Phylogenetic overdispersion has several explanations, depending on the phylogenetic history of traits and contemporary ecological interactions. However, based on coexistence models between grasses and trees, density-dependent ecological forces, and the evolutionary history of the cerrado flora, we argue that the phylogenetic overdispersion of cerrado species is predominantly due to competitive interactions, herbivores and pathogen attacks, and ecological speciation. Future studies will need to include information on the phylogenetic history of plant traits.

  7. SERAPHIM: studying environmental rasters and phylogenetically informed movements.

    PubMed

    Dellicour, Simon; Rose, Rebecca; Faria, Nuno R; Lemey, Philippe; Pybus, Oliver G

    2016-10-15

    SERAPHIM ("Studying Environmental Rasters and PHylogenetically Informed Movements") is a suite of computational methods developed to study phylogenetic reconstructions of spatial movement in an environmental context. SERAPHIM extracts the spatio-temporal information contained in estimated phylogenetic trees and uses this information to calculate summary statistics of spatial spread and to visualize dispersal history. Most importantly, SERAPHIM enables users to study the impact of customized environmental variables on the spread of the study organism. Specifically, given an environmental raster, SERAPHIM computes environmental "weights" for each phylogeny branch, which represent the degree to which the environmental variable impedes (or facilitates) lineage movement. Correlations between movement duration and these environmental weights are then assessed, and the statistical significances of these correlations are evaluated using null distributions generated by a randomization procedure. SERAPHIM can be applied to any phylogeny whose nodes are annotated with spatial and temporal information. At present, such phylogenies are most often found in the field of emerging infectious diseases, but will become increasingly common in other biological disciplines as population genomic data grows. SERAPHIM 1.0 is freely available from http://evolve.zoo.ox.ac.uk/ R package, source code, example files, tutorials and a manual are also available from this website. simon.dellicour@kuleuven.be or oliver.pybus@zoo.ox.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Considering common sources of exposure in association studies - Urinary benzophenone-3 and DEHP metabolites are associated with altered thyroid hormone balance in the NHANES 2007-2008.

    PubMed

    Kim, Sujin; Kim, Sunmi; Won, Sungho; Choi, Kyungho

    2017-10-01

    Epidemiological studies have shown that thyroid hormone balances can be disrupted by chemical exposure. However, many association studies have often failed to consider multiple chemicals with possible common sources of exposure, rendering their conclusions less reliable. In the 2007-2008 National Health and Nutrition Examination Survey (NHANES) from the U.S.A., urinary levels of environmental phenols, parabens, and phthalate metabolites as well as serum thyroid hormones were measured in a general U.S. population (≥12years old, n=1829). Employing these data, first, the chemicals or their metabolites associated with thyroid hormone measures were identified. Then, the chemicals/metabolites with possible common exposure sources were included in the analytical model to test the sensitivities of their association with thyroid hormone levels. Benzophenone-3 (BP-3), bisphenol A (BPA), and a metabolite of di(2-ethylhexyl) phthalate (DEHP) were identified as significant determinants of decreased serum thyroid hormones. However, significant positive correlations were detected (p-value<0.05, r=0.23 to 0.45) between these chemicals/metabolites, which suggests that they might share similar exposure sources. In the subsequent sensitivity analysis, which included the chemicals/metabolite with potentially similar exposure sources in the model, we found that urinary BP-3 and DEHP exposure were associated with decreased thyroid hormones among the general population but BPA exposure was not. In association studies, the presence of possible common exposure sources should be considered to circumvent possible false-positive conclusions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Phylogenetic relationships among zooxanthellae (Symbiodinium) associated to excavating sponges (Cliona spp.) reveal an unexpected lineage in the Caribbean.

    PubMed

    Granados, C; Camargo, C; Zea, S; Sánchez, J A

    2008-11-01

    Phylogenetic relationships of symbiotic dinoflagellate lineages, distributed in all tropical and subtropical seas, suggest strategies for long distance dispersal but at the same time strong host specialization. Zooxanthellae (Symbiodinium: Dinophyta), which are associated to diverse shallow-water cnidarians, also engage in symbioses with some sponge species of the genus Cliona. In the Caribbean, zooxanthellae-bearing Cliona has recently become abundant due to global warming, overfishing, and algae abundance. Using molecular techniques, the symbionts from five excavating species (Clionacaribbaea, C. tenuis, C. varians, C. aprica and C. laticavicola) from the southern and southwestern Caribbean were surveyed. Several DNA sequence regions were used in order to confirm zooxanthellae identity; 18S rDNA, domain V of chloroplast large subunit (cp23S), internal transcribed spacer 2 (ITS2), and ITS2 secondary structure. Sequence analyses corroborated the presence of three zooxanthellae clades: A, B, and G. Presence of clades A and B in common boring sponges of the Caribbean fit with the general pattern of the province. The discovery of clade G for the first time in any organism of the Atlantic Ocean leads us to consider this unusual finding as a phylogenetic relict through common ancestors of sponge clades or an invasion of the sponge from the Indo-Pacific.

  10. Different relationships between temporal phylogenetic turnover and phylogenetic similarity and in two forests were detected by a new null model.

    PubMed

    Huang, Jian-Xiong; Zhang, Jian; Shen, Yong; Lian, Ju-yu; Cao, Hong-lin; Ye, Wan-hui; Wu, Lin-fang; Bin, Yue

    2014-01-01

    Ecologists have been monitoring community dynamics with the purpose of understanding the rates and causes of community change. However, there is a lack of monitoring of community dynamics from the perspective of phylogeny. We attempted to understand temporal phylogenetic turnover in a 50 ha tropical forest (Barro Colorado Island, BCI) and a 20 ha subtropical forest (Dinghushan in southern China, DHS). To obtain temporal phylogenetic turnover under random conditions, two null models were used. The first shuffled names of species that are widely used in community phylogenetic analyses. The second simulated demographic processes with careful consideration on the variation in dispersal ability among species and the variations in mortality both among species and among size classes. With the two models, we tested the relationships between temporal phylogenetic turnover and phylogenetic similarity at different spatial scales in the two forests. Results were more consistent with previous findings using the second null model suggesting that the second null model is more appropriate for our purposes. With the second null model, a significantly positive relationship was detected between phylogenetic turnover and phylogenetic similarity in BCI at a 10 m×10 m scale, potentially indicating phylogenetic density dependence. This relationship in DHS was significantly negative at three of five spatial scales. This could indicate abiotic filtering processes for community assembly. Using variation partitioning, we found phylogenetic similarity contributed to variation in temporal phylogenetic turnover in the DHS plot but not in BCI plot. The mechanisms for community assembly in BCI and DHS vary from phylogenetic perspective. Only the second null model detected this difference indicating the importance of choosing a proper null model.

  11. Identification and phylogenetic analysis of contagious ecthyma virus from camels (Camelus dromedarius) in Iran.

    PubMed

    Oryan, Ahmad; Mosadeghhesari, Mahboobe; Zibaee, Saeed; Mohammadi, Ali

    2017-03-24

    Contagious ecthyma is a highly contagious disease affecting domestic and wild ruminants such as sheep, goats and camels. The identification and characterisation of a parapoxvirus (PPV) infecting camels is described here. The virus was detected in dromedary camels (Camelus dromedarius) from Kerman and Shiraz in Iran. PPV-specific amplification by polymerase chain reaction (PCR) further confirmed that the disease was associated with PPV infection. Phylogenetic analysis of ORF011 (B2L) gene sequences showed 99.79% and 82.13% similarity of the PPV identified in this study with the Jodhpur isolate and the bovine papular stomatitis virus (BPSV) isolates (CE41), respectively. Moreover, phylogenetic analysis of the ORF045 gene indicated that the Shiraz sample was in all probability closely related to VR634 and to F00.120R and PCPV776. In conclusion, the results suggest that camel PPV (CPPV) is a likely cause of contagious ecthyma in dromedary camels in Iran.

  12. A manual to identify sources of fluvial sediment

    USGS Publications Warehouse

    Gellis, Allen C.; Fitzpatrick, Faith A.; Schubauer-Berigan, Joseph

    2016-01-01

    Sediment is an important pollutant of concern that can degrade and alter aquatic habitat. A sediment budget is an accounting of the sources, storage, and export of sediment over a defined spatial and temporal scale. This manual focuses on field approaches to estimate a sediment budget. We also highlight the sediment fingerprinting approach to attribute sediment to different watershed sources. Determining the sources and sinks of sediment is important in developing strategies to reduce sediment loads to water bodies impaired by sediment. Therefore, this manual can be used when developing a sediment TMDL requiring identification of sediment sources.The manual takes the user through the seven necessary steps to construct a sediment budget:Decision-making for watershed scale and time period of interestFamiliarization with the watershed by conducting a literature review, compiling background information and maps relevant to study questions, conducting a reconnaissance of the watershedDeveloping partnerships with landowners and jurisdictionsCharacterization of watershed geomorphic settingDevelopment of a sediment budget designData collectionInterpretation and construction of the sediment budgetGenerating products (maps, reports, and presentations) to communicate findings.Sediment budget construction begins with examining the question(s) being asked and whether a sediment budget is necessary to answer these question(s). If undertaking a sediment budget analysis is a viable option, the next step is to define the spatial scale of the watershed and the time scale needed to answer the question(s). Of course, we understand that monetary constraints play a big role in any decision.Early in the sediment budget development process, we suggest getting to know your watershed by conducting a reconnaissance and meeting with local stakeholders. The reconnaissance aids in understanding the geomorphic setting of the watershed and potential sources of sediment. Identifying the potential

  13. Identifying Dust Sources by Positive Matrix Factorization (PMF)

    NASA Astrophysics Data System (ADS)

    Engelbrecht, Johann P.

    2010-05-01

    elemental species was modeled by PMF. A five factor solution identified three soil factors, a silicate soil, limestone soil, and a gypsum soil, as well as a salt factor and an anthropogenic metal factor. Similarly, a set of 362 quartz filter samples analyzed for 10 selected chemical species was modeled by PMF. A five factor solution provided a limestone-gypsum soil, diesel combustion, secondary ammonium sulfate, salt and agricultural-burnpit combustion source type. Examples of time series plots of PMF factor contributions for each of six sampling sites (Balad, Baghdad, Tallil, Tikrit, Taji, and Al Asad) will be discussed. Engelbrecht , J. P., McDonald, E. V., Gillies, J. A., Jayanty, R. K. M., Casuccio, G., and Gertler, A. W., 2009, Characterizing mineral dusts and other aerosols from the Middle East - Part 1: Ambient sampling: Inhalation Toxicology, v. 21, p. 297-326.

  14. Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree

    NASA Astrophysics Data System (ADS)

    Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.

    2017-02-01

    Representation is a very important communication tool to communicate scientific concepts. Biologists produce phylogenetic representation to express their understanding of evolutionary relationships. The phylogenetic tree is visual representation depict a hypothesis about the evolutionary relationship and widely used in the biological sciences. Phylogenetic tree currently growing for many disciplines in biology. Consequently, learning about phylogenetic tree become an important part of biological education and an interesting area for biology education research. However, research showed many students often struggle with interpreting the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The method of this study is a descriptive method. In this study, we used questionnaires, interviews, multiple choice and open-ended questions, reflective journals and observations. The findings showed students experiencing difficulties, especially in constructing a phylogenetic tree. The students’ responds indicated that main reasons for difficulties in constructing a phylogenetic tree are difficult to placing taxa in a phylogenetic tree based on the data provided so that the phylogenetic tree constructed does not describe the actual evolutionary relationship (incorrect relatedness). Students also have difficulties in determining the sister group, character synapomorphy, autapomorphy from data provided (character table) and comparing among phylogenetic tree. According to them building the phylogenetic tree is more difficult than reading the phylogenetic tree. Finding this studies provide information to undergraduate instructor and students to overcome learning difficulties of reading and constructing phylogenetic tree.

  15. PHYLOGENETIC ANALYSIS OF THE GENOME OF AN ENTERITIS-ASSOCIATED BOTTLENOSE DOLPHIN MASTADENOVIRUS SUPPORTS A CLADE INFECTING THE CETARTIODACTYLA.

    PubMed

    Standorf, Kali; Cortés-Hinojosa, Galaxia; Venn-Watson, Stephanie; Rivera, Rebecca; Archer, Linda L; Wellehan, James F X

    2018-01-01

    :  Adenoviruses are nonenveloped, double-stranded DNA viruses, known to infect members of all tetrapod classes, with a similarity between phylogenies of hosts and viruses observed. We characterized bottlenose dolphin adenovirus 2 (BdAdV-2) found in a bottlenose dolphin ( Tursiops truncatus) with enteritis. Virions were seen by negative staining electron microscopy of feces. Initial sequences obtained using conserved PCR primers were expanded using primer walking techniques, and the complete coding sequence was obtained. Phylogenetic analyses were consistent with coevolution of this virus and its bottlenose dolphin host, placing BdAdV-2 into a monophyletic group with other mastadenoviruses of Cetartiodactyla. When considering the low guanine/cytosine (G/C) content of BdAdV-2 with the phylogenetic data, this virus may represent a host-jumping event from another member of Cetartiodactyla. Analysis of partial polymerase indicated that bottlenose dolphin adenovirus 1, previously identified in Spain, and BdAdV-2 are sister taxa with harbor porpoise adenovirus 1, forming a cetacean clade. Bottlenose dolphin adenovirus 2 includes a highly divergent fiber gene. Two genes homologous to the dUTPase superfamily are also present which could play a role in enabling viral replication in nondividing cells. We used sequence data to develop a probe hybridization quantitative PCR assay specific to BdAdV-2 with a limit of detection of 10 copies.

  16. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    PubMed

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository ( http://bitbucket.org/osiris_phylogenetics/pia/ ) and we demonstrate PIA on a publicly-accessible web server ( http://galaxy-dev.cnsi.ucsb.edu/pia/ ). Our new

  17. Constructing phylogenetic trees using interacting pathways.

    PubMed

    Wan, Peng; Che, Dongsheng

    2013-01-01

    Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches of constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to accumulation of huge amounts of biological data, which in turn changed the way of biological studies in various aspects. In this paper, we report our approach of building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering on two domains of organisms-eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.

  18. Transmission clustering among newly diagnosed HIV patients in Chicago, 2008 to 2011: using phylogenetics to expand knowledge of regional HIV transmission patterns

    PubMed Central

    Lubelchek, Ronald J.; Hoehnen, Sarah C.; Hotton, Anna L.; Kincaid, Stacey L.; Barker, David E.; French, Audrey L.

    2014-01-01

    Introduction HIV transmission cluster analyses can inform HIV prevention efforts. We describe the first such assessment for transmission clustering among HIV patients in Chicago. Methods We performed transmission cluster analyses using HIV pol sequences from newly diagnosed patients presenting to Chicago’s largest HIV clinic between 2008 and 2011. We compared sequences via progressive pairwise alignment, using neighbor joining to construct an un-rooted phylogenetic tree. We defined clusters as >2 sequences among which each sequence had at least one partner within a genetic distance of ≤ 1.5%. We used multivariable regression to examine factors associated with clustering and used geospatial analysis to assess geographic proximity of phylogenetically clustered patients. Results We compared sequences from 920 patients; median age 35 years; 75% male; 67% Black, 23% Hispanic; 8% had a Rapid Plasma Reagin (RPR) titer ≥ 1:16 concurrent with their HIV diagnosis. We had HIV transmission risk data for 54%; 43% identified as men who have sex with men (MSM). Phylogenetic analysis demonstrated 123 patients (13%) grouped into 26 clusters, the largest having 20 members. In multivariable regression, age < 25, Black race, MSM status, male gender, higher HIV viral load, and RPR ≥ 1:16 associated with clustering. We did not observe geographic grouping of genetically clustered patients. Discussion Our results demonstrate high rates of HIV transmission clustering, without local geographic foci, among young Black MSM in Chicago. Applied prospectively, phylogenetic analyses could guide prevention efforts and help break the cycle of transmission. PMID:25321182

  19. Landscape patterns in rainforest phylogenetic signal: isolated islands of refugia or structured continental distributions?

    PubMed

    Kooyman, Robert M; Rossetto, Maurizio; Sauquet, Hervé; Laffan, Shawn W

    2013-01-01

    Identify patterns of change in species distributions, diversity, concentrations of evolutionary history, and assembly of Australian rainforests. We used the distribution records of all known rainforest woody species in Australia across their full continental extent. These were analysed using measures of species richness, phylogenetic diversity (PD), phylogenetic endemism (PE) and phylogenetic structure (net relatedness index; NRI). Phylogenetic structure was assessed using both continental and regional species pools. To test the influence of growth-form, freestanding and climbing plants were analysed independently, and in combination. Species richness decreased along two generally orthogonal continental axes, corresponding with wet to seasonally dry and tropical to temperate habitats. The PE analyses identified four main areas of substantially restricted phylogenetic diversity, including parts of Cape York, Wet Tropics, Border Ranges, and Tasmania. The continental pool NRI results showed evenness (species less related than expected by chance) in groups of grid cells in coastally aligned areas of species rich tropical and sub-tropical rainforest, and in low diversity moist forest areas in the south-east of the Great Dividing Range and in Tasmania. Monsoon and drier vine forests, and moist forests inland from upland refugia showed phylogenetic clustering, reflecting lower diversity and more relatedness. Signals for evenness in Tasmania and clustering in northern monsoon forests weakened in analyses using regional species pools. For climbing plants, values for NRI by grid cell showed strong spatial structuring, with high diversity and PE concentrated in moist tropical and subtropical regions. Concentrations of rainforest evolutionary history (phylo-diversity) were patchily distributed within a continuum of species distributions. Contrasting with previous concepts of rainforest community distribution, our findings of continuous distributions and continental

  20. Landscape Patterns in Rainforest Phylogenetic Signal: Isolated Islands of Refugia or Structured Continental Distributions?

    PubMed Central

    Kooyman, Robert M.; Rossetto, Maurizio; Sauquet, Hervé; Laffan, Shawn W.

    2013-01-01

    Objectives Identify patterns of change in species distributions, diversity, concentrations of evolutionary history, and assembly of Australian rainforests. Methods We used the distribution records of all known rainforest woody species in Australia across their full continental extent. These were analysed using measures of species richness, phylogenetic diversity (PD), phylogenetic endemism (PE) and phylogenetic structure (net relatedness index; NRI). Phylogenetic structure was assessed using both continental and regional species pools. To test the influence of growth-form, freestanding and climbing plants were analysed independently, and in combination. Results Species richness decreased along two generally orthogonal continental axes, corresponding with wet to seasonally dry and tropical to temperate habitats. The PE analyses identified four main areas of substantially restricted phylogenetic diversity, including parts of Cape York, Wet Tropics, Border Ranges, and Tasmania. The continental pool NRI results showed evenness (species less related than expected by chance) in groups of grid cells in coastally aligned areas of species rich tropical and sub-tropical rainforest, and in low diversity moist forest areas in the south-east of the Great Dividing Range and in Tasmania. Monsoon and drier vine forests, and moist forests inland from upland refugia showed phylogenetic clustering, reflecting lower diversity and more relatedness. Signals for evenness in Tasmania and clustering in northern monsoon forests weakened in analyses using regional species pools. For climbing plants, values for NRI by grid cell showed strong spatial structuring, with high diversity and PE concentrated in moist tropical and subtropical regions. Conclusions/Significance Concentrations of rainforest evolutionary history (phylo-diversity) were patchily distributed within a continuum of species distributions. Contrasting with previous concepts of rainforest community distribution, our findings of

  1. Inferring Phylogenetic Networks Using PhyloNet.

    PubMed

    Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay

    2018-07-01

    PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyzes and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.

  2. The space of ultrametric phylogenetic trees.

    PubMed

    Gavryushkin, Alex; Drummond, Alexei J

    2016-08-21

    The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  3. Phylesystem: a git-based data store for community-curated phylogenetic estimates.

    PubMed

    McTavish, Emily Jane; Hinchliff, Cody E; Allman, James F; Brown, Joseph W; Cranston, Karen A; Holder, Mark T; Rees, Jonathan A; Smith, Stephen A

    2015-09-01

    Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data files often require some editing (e.g. rerooting) to improve the accuracy and reusability of the phylogenetic statements. Furthermore, establishing the mapping between tip labels used in a tree and taxa in a single common taxonomy dramatically improves the ability of other researchers to reuse phylogenetic estimates. As the process of curating a published phylogenetic estimate is not error-free, retaining a full record of the provenance of edits to a tree is crucial for openness, allowing editors to receive credit for their work and making errors introduced during curation easier to correct. Here, we report the development of software infrastructure to support the open curation of phylogenetic data by the community of biologists. The backend of the system provides an interface for the standard database operations of creating, reading, updating and deleting records by making commits to a git repository. The record of the history of edits to a tree is preserved by git's version control features. Hosting this data store on GitHub (http://github.com/) provides open access to the data store using tools familiar to many developers. We have deployed a server running the 'phylesystem-api', which wraps the interactions with git and GitHub. The Open Tree of Life project has also developed and deployed a JavaScript application that uses the phylesystem-api and other web services to enable input and curation of published phylogenetic statements. Source code for the web service layer is available at https://github.com/OpenTreeOfLife/phylesystem-api. The data store can be cloned from: https://github.com/OpenTreeOfLife/phylesystem. A web application that uses the phylesystem web services is deployed

  4. Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.

    PubMed

    Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T

    2016-02-24

    methods for larger effect sizes. The Silva/UNITE-based ghost tree presented here can be easily integrated into existing fungal analysis pipelines to enhance the resolution of fungal community differences and improve understanding of these communities in built environments. The ghost-tree software package can also be used to develop phylogenetic trees for other marker gene sets that afford different taxonomic resolution, or for bridging genome trees with amplicon trees. ghost-tree is pip-installable. All source code, documentation, and test code are available under the BSD license at https://github.com/JTFouquier/ghost-tree .

  5. The ethnobotany of psychoactive plant use: a phylogenetic perspective

    PubMed Central

    2016-01-01

    Psychoactive plants contain chemicals that presumably evolved as allelochemicals but target certain neuronal receptors when consumed by humans, altering perception, emotion and cognition. These plants have been used since ancient times as medicines and in the context of religious rituals for their various psychoactive effects (e.g., as hallucinogens, stimulants, sedatives). The ubiquity of psychoactive plants in various cultures motivates investigation of the commonalities among these plants, in which a phylogenetic framework may be insightful. A phylogeny of culturally diverse psychoactive plant taxa was constructed with their psychotropic effects and affected neurotransmitter systems mapped on the phylogeny. The phylogenetic distribution shows multiple evolutionary origins of psychoactive families. The plant families Myristicaceae (e.g., nutmeg), Papaveraceae (opium poppy), Cactaceae (peyote), Convolvulaceae (morning glory), Solanaceae (tobacco), Lamiaceae (mints), Apocynaceae (dogbane) have a disproportionate number of psychoactive genera with various indigenous groups using geographically disparate members of these plant families for the same psychoactive effect, an example of cultural convergence. Pharmacological traits related to hallucinogenic and sedative potential are phylogenetically conserved within families. Unrelated families that exert similar psychoactive effects also modulate similar neurotransmitter systems (i.e., mechanistic convergence). However, pharmacological mechanisms for stimulant effects were varied even within families suggesting that stimulant chemicals may be more evolutionarily labile than those associated with hallucinogenic and sedative effects. Chemically similar psychoactive chemicals may also exist in phylogenetically unrelated lineages, suggesting convergent evolution or differential gene regulation of a common metabolic pathway. Our study has shown that phylogenetic analysis of traditionally used psychoactive plants suggests

  6. The ethnobotany of psychoactive plant use: a phylogenetic perspective.

    PubMed

    Alrashedy, Nashmiah Aid; Molina, Jeanmaire

    2016-01-01

    Psychoactive plants contain chemicals that presumably evolved as allelochemicals but target certain neuronal receptors when consumed by humans, altering perception, emotion and cognition. These plants have been used since ancient times as medicines and in the context of religious rituals for their various psychoactive effects (e.g., as hallucinogens, stimulants, sedatives). The ubiquity of psychoactive plants in various cultures motivates investigation of the commonalities among these plants, in which a phylogenetic framework may be insightful. A phylogeny of culturally diverse psychoactive plant taxa was constructed with their psychotropic effects and affected neurotransmitter systems mapped on the phylogeny. The phylogenetic distribution shows multiple evolutionary origins of psychoactive families. The plant families Myristicaceae (e.g., nutmeg), Papaveraceae (opium poppy), Cactaceae (peyote), Convolvulaceae (morning glory), Solanaceae (tobacco), Lamiaceae (mints), Apocynaceae (dogbane) have a disproportionate number of psychoactive genera with various indigenous groups using geographically disparate members of these plant families for the same psychoactive effect, an example of cultural convergence. Pharmacological traits related to hallucinogenic and sedative potential are phylogenetically conserved within families. Unrelated families that exert similar psychoactive effects also modulate similar neurotransmitter systems (i.e., mechanistic convergence). However, pharmacological mechanisms for stimulant effects were varied even within families suggesting that stimulant chemicals may be more evolutionarily labile than those associated with hallucinogenic and sedative effects. Chemically similar psychoactive chemicals may also exist in phylogenetically unrelated lineages, suggesting convergent evolution or differential gene regulation of a common metabolic pathway. Our study has shown that phylogenetic analysis of traditionally used psychoactive plants suggests

  7. Application of classification-tree methods to identify nitrate sources in ground water

    USGS Publications Warehouse

    Spruill, T.B.; Showers, W.J.; Howe, S.S.

    2002-01-01

    A study was conducted to determine if nitrate sources in ground water (fertilizer on crops, fertilizer on golf courses, irrigation spray from hog (Sus scrofa) wastes, and leachate from poultry litter and septic systems) could be classified with 80% or greater success. Two statistical classification-tree models were devised from 48 water samples containing nitrate from five source categories. Model I was constructed by evaluating 32 variables and selecting four primary predictor variables (??15N, nitrate to ammonia ratio, sodium to potassium ratio, and zinc) to identify nitrate sources. A ??15N value of nitrate plus potassium 18.2 indicated inorganic or soil organic N. A nitrate to ammonia ratio 575 indicated nitrate from golf courses. A sodium to potassium ratio 3.2 indicated spray or poultry wastes. A value for zinc 2.8 indicated poultry wastes. Model 2 was devised by using all variables except ??15N. This model also included four variables (sodium plus potassium, nitrate to ammonia ratio, calcium to magnesium ratio, and sodium to potassium ratio) to distinguish categories. Both models were able to distinguish all five source categories with better than 80% overall success and with 71 to 100% success in individual categories using the learning samples. Seventeen water samples that were not used in model development were tested using Model 2 for three categories, and all were correctly classified. Classification-tree models show great potential in identifying sources of contamination and variables important in the source-identification process.

  8. Basic Helix-Loop-Helix Transcription Factor Gene Family Phylogenetics and Nomenclature

    PubMed Central

    Skinner, Michael K.; Rawls, Alan; Wilson-Rawls, Jeanne; Roalson, Eric H.

    2010-01-01

    A phylogenetic analysis of the basic helix-loop-helix (bHLH) gene superfamily was performed using seven different species (human, mouse, rat, worm, fly, yeast, and plant Arabidopsis) and involving over 600 bHLH genes [1]. All bHLH genes were identified in the genomes of the various species, including expressed sequence tags, and the entire coding sequence was used in the analysis. Nearly 15% of the gene family has been updated or added since the original publication. A super-tree involving six clades and all structural relationships was established and is now presented for four of the species. The wealth of functional data available for members of the bHLH gene superfamily provides us with the opportunity to use this exhaustive phylogenetic tree to predict potential functions of uncharacterized members of the family. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique elements of the evolution and functional relationships of the different genes in the bHLH gene family. PMID:20219281

  9. BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics.

    PubMed

    Ayres, Daniel L; Darling, Aaron; Zwickl, Derrick J; Beerli, Peter; Holder, Mark T; Lewis, Paul O; Huelsenbeck, John P; Ronquist, Fredrik; Swofford, David L; Cummings, Michael P; Rambaut, Andrew; Suchard, Marc A

    2012-01-01

    Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting models of evolution, these approaches remain computationally demanding. High-throughput sequencing can yield data for thousands of taxa, but scaling to such problems using serial computing often necessitates the use of nonstatistical or approximate approaches. The recent emergence of graphics processing units (GPUs) provides an opportunity to leverage their excellent floating-point computational performance to accelerate statistical phylogenetic inference. A specialized library for phylogenetic calculation would allow existing software packages to make more effective use of available computer hardware, including GPUs. Adoption of a common library would also make it easier for other emerging computing architectures, such as field programmable gate arrays, to be used in the future. We present BEAGLE, an application programming interface (API) and library for high-performance statistical phylogenetic inference. The API provides a uniform interface for performing phylogenetic likelihood calculations on a variety of compute hardware platforms. The library includes a set of efficient implementations and can currently exploit hardware including GPUs using NVIDIA CUDA, central processing units (CPUs) with Streaming SIMD Extensions and related processor supplementary instruction sets, and multicore CPUs via OpenMP. To demonstrate the advantages of a common API, we have incorporated the library into several popular phylogenetic software packages. The BEAGLE library is free open source software licensed under the Lesser GPL and available from http://beagle-lib.googlecode.com. An example client program is available as public domain software.

  10. Comparative metagenomic, phylogenetic and physiological analyses of soil microbial communities across nitrogen gradients.

    PubMed

    Fierer, Noah; Lauber, Christian L; Ramirez, Kelly S; Zaneveld, Jesse; Bradford, Mark A; Knight, Rob

    2012-05-01

    Terrestrial ecosystems are receiving elevated inputs of nitrogen (N) from anthropogenic sources and understanding how these increases in N availability affect soil microbial communities is critical for predicting the associated effects on belowground ecosystems. We used a suite of approaches to analyze the structure and functional characteristics of soil microbial communities from replicated plots in two long-term N fertilization experiments located in contrasting systems. Pyrosequencing-based analyses of 16S rRNA genes revealed no significant effects of N fertilization on bacterial diversity, but significant effects on community composition at both sites; copiotrophic taxa (including members of the Proteobacteria and Bacteroidetes phyla) typically increased in relative abundance in the high N plots, with oligotrophic taxa (mainly Acidobacteria) exhibiting the opposite pattern. Consistent with the phylogenetic shifts under N fertilization, shotgun metagenomic sequencing revealed increases in the relative abundances of genes associated with DNA/RNA replication, electron transport and protein metabolism, increases that could be resolved even with the shallow shotgun metagenomic sequencing conducted here (average of 75 000 reads per sample). We also observed shifts in the catabolic capabilities of the communities across the N gradients that were significantly correlated with the phylogenetic and metagenomic responses, indicating possible linkages between the structure and functioning of soil microbial communities. Overall, our results suggest that N fertilization may, directly or indirectly, induce a shift in the predominant microbial life-history strategies, favoring a more active, copiotrophic microbial community, a pattern that parallels the often observed replacement of K-selected with r-selected plant species with elevated N.

  11. Undergraduate Students’ Initial Ability in Understanding Phylogenetic Tree

    NASA Astrophysics Data System (ADS)

    Sa'adah, S.; Hidayat, T.; Sudargo, Fransisca

    2017-04-01

    The Phylogenetic tree is a visual representation depicts a hypothesis about the evolutionary relationship among taxa. Evolutionary experts use this representation to evaluate the evidence for evolution. The phylogenetic tree is currently growing for many disciplines in biology. Consequently, learning about the phylogenetic tree has become an important part of biological education and an interesting area of biology education research. Skill to understanding and reasoning of the phylogenetic tree, (called tree thinking) is an important skill for biology students. However, research showed many students have difficulty in interpreting, constructing, and comparing among the phylogenetic tree, as well as experiencing a misconception in the understanding of the phylogenetic tree. Students are often not taught how to reason about evolutionary relationship depicted in the diagram. Students are also not provided with information about the underlying theory and process of phylogenetic. This study aims to investigate the initial ability of undergraduate students in understanding and reasoning of the phylogenetic tree. The research method is the descriptive method. Students are given multiple choice questions and an essay that representative by tree thinking elements. Each correct answer made percentages. Each student is also given questionnaires. The results showed that the undergraduate students’ initial ability in understanding and reasoning phylogenetic tree is low. Many students are not able to answer questions about the phylogenetic tree. Only 19 % undergraduate student who answered correctly on indicator evaluate the evolutionary relationship among taxa, 25% undergraduate student who answered correctly on indicator applying concepts of the clade, 17% undergraduate student who answered correctly on indicator determines the character evolution, and only a few undergraduate student who can construct the phylogenetic tree.

  12. Improved Maximum Parsimony Models for Phylogenetic Networks.

    PubMed

    Van Iersel, Leo; Jones, Mark; Scornavacca, Celine

    2018-05-01

    Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.

  13. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  14. Plunging hands into the mushroom jar: a phylogenetic framework for Lyophyllaceae (Agaricales, Basidiomycota).

    PubMed

    Bellanger, J-M; Moreau, P-A; Corriol, G; Bidaud, A; Chalange, R; Dudova, Z; Richard, F

    2015-04-01

    During the last two decades, the unprecedented development of molecular phylogenetic tools has propelled an opportunity to revisit the fungal kingdom under an evolutionary perspective. Mycology has been profoundly changed but a sustained effort to elucidate large sections of the astonishing fungal diversity is still needed. Here we fill this gap in the case of Lyophyllaceae, a species-rich and ecologically diversified family of mushrooms. Assembly and genealogical concordance multigene phylogenetic analysis of a large dataset that includes original, vouchered material from expert field mycologists reveal the phylogenetic topology of the family, from higher (generic) to lower (species) levels. A comparative analysis of the most widely used phylogenetic markers in Fungi indicates that the nuc rDNA region encompassing the internal transcribed spacers 1 and 2, along with the 5.8S rDNA (ITS) and portions of the genes for RNA polymerase II second largest subunit (RPB2) is the most performing combination to resolve the broadest range of taxa within Lyophyllaceae. Eleven distinct evolutionary lineages are identified, that display partial overlap with traditional genera as well as with the phylogenetic framework previously proposed for the family. Eighty phylogenetic species are delineated, which shed light on a large number of morphological concepts, including rare and poorly documented ones. Probing these novel phylogenetic species to the barcoding method of species limit delineation, indicates that the latter method fully resolves Lyophyllaceae species, except in one clade. This case study provides the first comprehensive phylogenetic overview of Lyophyllaceae, a necessary step towards a taxonomical, ecological and nomenclatural revision of this family of mushrooms. It also proposes a set of methodological guidelines that may be of relevance for future taxonomic works in other groups of Fungi.

  15. Phylogenetic Factor Analysis.

    PubMed

    Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A

    2018-05-01

    Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.

  16. Evaluation of atpB nucleotide sequences for phylogenetic studies of ferns and other pteridophytes.

    PubMed

    Wolf, P

    1997-10-01

    Inferring basal relationships among vascular plants poses a major challenge to plant systematists. The divergence events that describe these relationships occurred long ago and considerable homoplasy has since accrued for both molecular and morphological characters. A potential solution is to examine phylogenetic analyses from multiple data sets. Here I present a new source of phylogenetic data for ferns and other pteridophytes. I sequenced the chloroplast gene atpB from 23 pteridophyte taxa and used maximum parsimony to infer relationships. A 588-bp region of the gene appeared to contain a statistically significant amount of phylogenetic signal and the resulting trees were largely congruent with similar analyses of nucleotide sequences from rbcL. However, a combined analysis of atpB plus rbcL produced a better resolved tree than did either data set alone. In the shortest trees, leptosporangiate ferns formed a monophyletic group. Also, I detected a well-supported clade of Psilotaceae (Psilotum and Tmesipteris) plus Ophioglossaceae (Ophioglossum and Botrychium). The demonstrated utility of atpB suggests that sequences from this gene should play a role in phylogenetic analyses that incorporate data from chloroplast genes, nuclear genes, morphology, and fossil data.

  17. Phylogenetic marker development for target enrichment from transcriptome and genome skim data: the pipeline and its application in southern African Oxalis (Oxalidaceae)

    Treesearch

    Roswitha Schmickl; Aaron Liston; Vojtěch Zeisek; Kenneth Oberlander; Kevin Weitemier; Shannon C. K. Straub; Richard C. Cronn; Léanne L. Dreyer; Jan Suda

    2016-01-01

    Phylogenetics benefits from using a large number of putatively independent nuclear loci and their combination with other sources of information, such as the plastid and mitochondrial genomes. To facilitate the selection of orthologous low-copy nuclear (LCN) loci for phylogenetics in nonmodel organisms, we created an automated and interactive script to select hundreds...

  18. Probabilistic Graphical Model Representation in Phylogenetics

    PubMed Central

    Höhna, Sebastian; Heath, Tracy A.; Boussau, Bastien; Landis, Michael J.; Ronquist, Fredrik; Huelsenbeck, John P.

    2014-01-01

    Recent years have seen a rapid expansion of the model space explored in statistical phylogenetics, emphasizing the need for new approaches to statistical model representation and software development. Clear communication and representation of the chosen model is crucial for: (i) reproducibility of an analysis, (ii) model development, and (iii) software design. Moreover, a unified, clear and understandable framework for model representation lowers the barrier for beginners and nonspecialists to grasp complex phylogenetic models, including their assumptions and parameter/variable dependencies. Graphical modeling is a unifying framework that has gained in popularity in the statistical literature in recent years. The core idea is to break complex models into conditionally independent distributions. The strength lies in the comprehensibility, flexibility, and adaptability of this formalism, and the large body of computational work based on it. Graphical models are well-suited to teach statistical models, to facilitate communication among phylogeneticists and in the development of generic software for simulation and statistical inference. Here, we provide an introduction to graphical models for phylogeneticists and extend the standard graphical model representation to the realm of phylogenetics. We introduce a new graphical model component, tree plates, to capture the changing structure of the subgraph corresponding to a phylogenetic tree. We describe a range of phylogenetic models using the graphical model framework and introduce modules to simplify the representation of standard components in large and complex models. Phylogenetic model graphs can be readily used in simulation, maximum likelihood inference, and Bayesian inference using, for example, Metropolis–Hastings or Gibbs sampling of the posterior distribution. [Computation; graphical models; inference; modularization; statistical phylogenetics; tree plate.] PMID:24951559

  19. aes, the gene encoding the esterase B in Escherichia coli, is a powerful phylogenetic marker of the species.

    PubMed

    Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand

    2009-12-29

    Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.

  20. Heritable Bovine Rumen Bacteria Are Phylogenetically Related and Correlated with the Cow’s Capacity To Harvest Energy from Its Feed

    PubMed Central

    Sasson, Goor; Kruger Ben-Shabat, Sheerli; Seroussi, Eyal; Doron-Faigenboim, Adi; Shterzer, Naama; Yaacoby, Shamay; Berg Miller, Margret E.; White, Bryan A.; Halperin, Eran

    2017-01-01

    ABSTRACT Ruminants sustain a long-lasting obligatory relationship with their rumen microbiome dating back 50 million years. In this unique host-microbiome relationship, the host’s ability to digest its feed is completely dependent on its coevolved microbiome. This extraordinary alliance raises questions regarding the dependent relationship between ruminants’ genetics and physiology and the rumen microbiome structure, composition, and metabolism. To elucidate this relationship, we examined the association of host genetics with the phylogenetic and functional composition of the rumen microbiome. We accomplished this by studying a population of 78 Holstein-Friesian dairy cows, using a combination of rumen microbiota data and other phenotypes from each animal with genotypic data from a subset of 47 animals. We identified 22 operational taxonomic units (OTUs) whose abundances were associated with rumen metabolic traits and host physiological traits and which showed measurable heritability. The abundance patterns of these microbes can explain high proportions of variance in rumen metabolism and many of the host physiological attributes such as its energy-harvesting efficiency. Interestingly, these OTUs shared higher phylogenetic similarity between themselves than expected by chance, suggesting occupation of a specific ecological niche within the rumen ecosystem. The findings presented here suggest that ruminant genetics and physiology are correlated with microbiome structure and that host genetics may shape the microbiome landscape by enriching for phylogenetically related taxa that may occupy a unique niche. PMID:28811339

  1. Human-mediated loss of phylogenetic and functional diversity in coral reef fishes.

    PubMed

    D'agata, Stéphanie; Mouillot, David; Kulbicki, Michel; Andréfouët, Serge; Bellwood, David R; Cinner, Joshua E; Cowman, Peter F; Kronen, Mecki; Pinca, Silvia; Vigliola, Laurent

    2014-03-03

    Beyond the loss of species richness, human activities may also deplete the breadth of evolutionary history (phylogenetic diversity) and the diversity of roles (functional diversity) carried out by species within communities, two overlooked components of biodiversity. Both are, however, essential to sustain ecosystem functioning and the associated provision of ecosystem services, particularly under fluctuating environmental conditions. We quantified the effect of human activities on the taxonomic, phylogenetic, and functional diversity of fish communities in coral reefs, while teasing apart the influence of biogeography and habitat along a gradient of human pressure across the Pacific Ocean. We detected nonlinear relationships with significant breaking points in the impact of human population density on phylogenetic and functional diversity of parrotfishes, at 25 and 15 inhabitants/km(2), respectively, while parrotfish species richness decreased linearly along the same population gradient. Over the whole range, species richness decreased by 11.7%, while phylogenetic and functional diversity dropped by 35.8% and 46.6%, respectively. Our results call for caution when using species richness as a benchmark for measuring the status of ecosystems since it appears to be less responsive to variation in human population densities than its phylogenetic and functional counterparts, potentially imperiling the functioning of coral reef ecosystems. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. [Phylogenetic analysis of closely related Leuconostoc citreum species based on partial housekeeping genes].

    PubMed

    Lv, Qiang; Chen, Ming; Xu, Haiyan; Song, Yuqin; Sun, Zhihong; Dan, Tong; Sun, Tiansong

    2013-07-04

    Using the 16S rRNA, dnaA, murC and pyrG gene sequences, we identified the phylogenetic relationship among closely related Leuconostoc citreum species. Seven Leu. citreum strains originally isolated from sourdough were characterized by PCR methods to amplify the dnaA, murC and pyrG gene sequences, which were determined to assess the suitability as phylogenetic markers. Then, we estimated the genetic distance and constructed the phylogenetic trees including 16S rRNA and above mentioned three housekeeping genes combining with published corresponding sequences. By comparing the phylogenetic trees, the topology of three housekeeping genes trees were consistent with that of 16S rRNA gene. The homology of closely related Leu. citreum species among dnaA, murC, pyrG and 16S rRNA gene sequences were different, ranged from75.5% to 97.2%, 50.2% to 99.7%, 65.0% to 99.8% and 98.5% 100%, respectively. The phylogenetic relationship of three housekeeping genes sequences were highly consistent with the results of 16S rRNA gene sequence, while the genetic distance of these housekeeping genes were extremely high than 16S rRNA gene. Consequently, the dnaA, murC and pyrG gene are suitable for classification and identification closely related Leu. citreum species.

  3. Microbial source tracking: a tool for identifying sources of microbial contamination in the food chain.

    PubMed

    Fu, Ling-Lin; Li, Jian-Rong

    2014-01-01

    The ability to trace fecal indicators and food-borne pathogens to the point of origin has major ramifications for food industry, food regulatory agencies, and public health. Such information would enable food producers and processors to better understand sources of contamination and thereby take corrective actions to prevent transmission. Microbial source tracking (MST), which currently is largely focused on determining sources of fecal contamination in waterways, is also providing the scientific community tools for tracking both fecal bacteria and food-borne pathogens contamination in the food chain. Approaches to MST are commonly classified as library-dependent methods (LDMs) or library-independent methods (LIMs). These tools will have widespread applications, including the use for regulatory compliance, pollution remediation, and risk assessment. These tools will reduce the incidence of illness associated with food and water. Our aim in this review is to highlight the use of molecular MST methods in application to understanding the source and transmission of food-borne pathogens. Moreover, the future directions of MST research are also discussed.

  4. Contributions to the Nutrient Toolbox: Identifying Drivers, Nutrient Sources, and Attribution of Exceedances

    EPA Science Inventory

    Nutrients are a leading cause of impairments in the United States, and as a result tools are needed to identify drivers of nutrients and response variables (such as chlorophyll a), nutrient sources, and identify causes of exceedances of water quality thresholds. This presentatio...

  5. A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.

    PubMed

    Rajan, Vaibhav

    2013-03-01

    Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, automated methods are necessary to handle the much larger data sets being prepared today. In this study, we introduce the concept of subsplits and demonstrate their use in extracting phylogenetic signal from alignments. We design a clustering approach for alignment masking where each cluster contains similar columns-similarity being defined on the basis of compatible subsplits; our approach then identifies noisy clusters and eliminates them. Trees inferred from the columns in the retained clusters are found to be topologically closer to the reference trees. We test our method on numerous standard benchmarks (both synthetic and biological data sets) and compare its performance with other methods of alignment masking. We find that our method can eliminate sites more accurately than other methods, particularly on divergent data, and can improve the topologies of the inferred trees in likelihood-based analyses. Software available upon request from the author.

  6. Screen and clean: a tool for identifying interactions in genome-wide association studies.

    PubMed

    Wu, Jing; Devlin, Bernie; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn

    2010-04-01

    Epistasis could be an important source of risk for disease. How interacting loci might be discovered is an open question for genome-wide association studies (GWAS). Most researchers limit their statistical analyses to testing individual pairwise interactions (i.e., marginal tests for association). A more effective means of identifying important predictors is to fit models that include many predictors simultaneously (i.e., higher-dimensional models). We explore a procedure called screen and clean (SC) for identifying liability loci, including interactions, by using the lasso procedure, which is a model selection tool for high-dimensional regression. We approach the problem by using a varying dictionary consisting of terms to include in the model. In the first step the lasso dictionary includes only main effects. The most promising single-nucleotide polymorphisms (SNPs) are identified using a screening procedure. Next the lasso dictionary is adjusted to include these main effects and the corresponding interaction terms. Again, promising terms are identified using lasso screening. Then significant terms are identified through the cleaning process. Implementation of SC for GWAS requires algorithms to explore the complex model space induced by the many SNPs genotyped and their interactions. We propose and explore a set of algorithms and find that SC successfully controls Type I error while yielding good power to identify risk loci and their interactions. When the method is applied to data obtained from the Wellcome Trust Case Control Consortium study of Type 1 Diabetes it uncovers evidence supporting interaction within the HLA class II region as well as within Chromosome 12q24.

  7. Increased competition does not lead to increased phylogenetic overdispersion in a native grassland.

    PubMed

    Bennett, Jonathan A; Lamb, Eric G; Hall, Jocelyn C; Cardinal-McTeague, Warren M; Cahill, James F

    2013-09-01

    That competition is stronger among closely related species and leads to phylogenetic overdispersion is a common assumption in community ecology. However, tests of this assumption are rare and field-based experiments lacking. We tested the relationship between competition, the degree of relatedness, and overdispersion among plants experimentally and using a field survey in a native grassland. Relatedness did not affect competition, nor was competition associated with phylogenetic overdispersion. Further, there was only weak evidence for increased overdispersion at spatial scales where plants are likely to compete. These results challenge traditional theory, but are consistent with recent theories regarding the mechanisms of plant competition and its potential effect on phylogenetic structure. We suggest that specific conditions related to the form of competition and trait conservatism must be met for competition to cause phylogenetic overdispersion. Consequently, overdispersion as a result of competition is likely to be rare in natural communities. © 2013 John Wiley & Sons Ltd/CNRS.

  8. Cross-validation to select Bayesian hierarchical models in phylogenetics.

    PubMed

    Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C

    2016-05-26

    Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.

  9. Identifying sources of heterogeneity in capture probabilities: An example using the Great Tit Parus major

    USGS Publications Warehouse

    Senar, J.C.; Conroy, M.J.; Carrascal, L.M.; Domenech, J.; Mozetich, I.; Uribe, F.

    1999-01-01

    Heterogeneous capture probabilities are a common problem in many capture-recapture studies. Several methods of detecting the presence of such heterogeneity are currently available, and stratification of data has been suggested as the standard method to avoid its effects. However, few studies have tried to identify sources of heterogeneity, or whether there are interactions among sources. The aim of this paper is to suggest an analytical procedure to identify sources of capture heterogeneity. We use data on the sex and age of Great Tits captured in baited funnel traps, at two localities differing in average temperature. We additionally use 'recapture' data obtained by videotaping at feeder (with no associated trap), where the tits ringed with different colours were recorded. This allowed us to test whether individuals in different classes (age, sex and condition) are not trapped because of trap shyness or because o a reduced use of the bait. We used logistic regression analysis of the capture probabilities to test for the effects of age, sex, condition, location and 'recapture method. The results showed a higher recapture probability in the colder locality. Yearling birds (either males or females) had the highest recapture prob abilities, followed by adult males, while adult females had the lowest recapture probabilities. There was no effect of the method of 'recapture' (trap or video tape), which suggests that adult females are less often captured in traps no because of trap-shyness but because of less dependence on supplementary food. The potential use of this methodological approach in other studies is discussed.

  10. Identifying sources of fugitive emissions in industrial facilities using trajectory statistical methods

    NASA Astrophysics Data System (ADS)

    Brereton, Carol A.; Johnson, Matthew R.

    2012-05-01

    Fugitive pollutant sources from the oil and gas industry are typically quite difficult to find within industrial plants and refineries, yet they are a significant contributor of global greenhouse gas emissions. A novel approach for locating fugitive emission sources using computationally efficient trajectory statistical methods (TSM) has been investigated in detailed proof-of-concept simulations. Four TSMs were examined in a variety of source emissions scenarios developed using transient CFD simulations on the simplified geometry of an actual gas plant: potential source contribution function (PSCF), concentration weighted trajectory (CWT), residence time weighted concentration (RTWC), and quantitative transport bias analysis (QTBA). Quantitative comparisons were made using a correlation measure based on search area from the source(s). PSCF, CWT and RTWC could all distinguish areas near major sources from the surroundings. QTBA successfully located sources in only some cases, even when provided with a large data set. RTWC, given sufficient domain trajectory coverage, distinguished source areas best, but otherwise could produce false source predictions. Using RTWC in conjunction with CWT could overcome this issue as well as reduce sensitivity to noise in the data. The results demonstrate that TSMs are a promising approach for identifying fugitive emissions sources within complex facility geometries.

  11. Fourier transform inequalities for phylogenetic trees.

    PubMed

    Matsen, Frederick A

    2009-01-01

    Phylogenetic invariants are not the only constraints on site-pattern frequency vectors for phylogenetic trees. A mutation matrix, by its definition, is the exponential of a matrix with non-negative off-diagonal entries; this positivity requirement implies non-trivial constraints on the site-pattern frequency vectors. We call these additional constraints "edge-parameter inequalities". In this paper, we first motivate the edge-parameter inequalities by considering a pathological site-pattern frequency vector corresponding to a quartet tree with a negative internal edge. This site-pattern frequency vector nevertheless satisfies all of the constraints described up to now in the literature. We next describe two complete sets of edge-parameter inequalities for the group-based models; these constraints are square-free monomial inequalities in the Fourier transformed coordinates. These inequalities, along with the phylogenetic invariants, form a complete description of the set of site-pattern frequency vectors corresponding to bona fide trees. Said in mathematical language, this paper explicitly presents two finite lists of inequalities in Fourier coordinates of the form "monomial < or = 1", each list characterizing the phylogenetically relevant semialgebraic subsets of the phylogenetic varieties.

  12. Classification of Phylogenetic Profiles for Protein Function Prediction: An SVM Approach

    NASA Astrophysics Data System (ADS)

    Kotaru, Appala Raju; Joshi, Ramesh C.

    Predicting the function of an uncharacterized protein is a major challenge in post-genomic era due to problems complexity and scale. Having knowledge of protein function is a crucial link in the development of new drugs, better crops, and even the development of biochemicals such as biofuels. Recently numerous high-throughput experimental procedures have been invented to investigate the mechanisms leading to the accomplishment of a protein’s function and Phylogenetic profile is one of them. Phylogenetic profile is a way of representing a protein which encodes evolutionary history of proteins. In this paper we proposed a method for classification of phylogenetic profiles using supervised machine learning method, support vector machine classification along with radial basis function as kernel for identifying functionally linked proteins. We experimentally evaluated the performance of the classifier with the linear kernel, polynomial kernel and compared the results with the existing tree kernel. In our study we have used proteins of the budding yeast saccharomyces cerevisiae genome. We generated the phylogenetic profiles of 2465 yeast genes and for our study we used the functional annotations that are available in the MIPS database. Our experiments show that the performance of the radial basis kernel is similar to polynomial kernel is some functional classes together are better than linear, tree kernel and over all radial basis kernel outperformed the polynomial kernel, linear kernel and tree kernel. In analyzing these results we show that it will be feasible to make use of SVM classifier with radial basis function as kernel to predict the gene functionality using phylogenetic profiles.

  13. Phylogenetic search through partial tree mixing

    PubMed Central

    2012-01-01

    Background Recent advances in sequencing technology have created large data sets upon which phylogenetic inference can be performed. Current research is limited by the prohibitive time necessary to perform tree search on a reasonable number of individuals. This research develops new phylogenetic algorithms that can operate on tens of thousands of species in a reasonable amount of time through several innovative search techniques. Results When compared to popular phylogenetic search algorithms, better trees are found much more quickly for large data sets. These algorithms are incorporated in the PSODA application available at http://dna.cs.byu.edu/psoda Conclusions The use of Partial Tree Mixing in a partition based tree space allows the algorithm to quickly converge on near optimal tree regions. These regions can then be searched in a methodical way to determine the overall optimal phylogenetic solution. PMID:23320449

  14. Phylogenetic Analysis of Seven WRKY Genes across the Palm Subtribe Attaleinae (Arecaceae) Identifies Syagrus as Sister Group of the Coconut

    PubMed Central

    Meerow, Alan W.; Noblick, Larry; Borrone, James W.; Couvreur, Thomas L. P.; Mauro-Herrera, Margarita; Hahn, William J.; Kuhn, David N.; Nakamura, Kyoko; Oleas, Nora H.; Schnell, Raymond J.

    2009-01-01

    Background The Cocoseae is one of 13 tribes of Arecaceae subfam. Arecoideae, and contains a number of palms with significant economic importance, including the monotypic and pantropical Cocos nucifera L., the coconut, the origins of which have been one of the “abominable mysteries” of palm systematics for decades. Previous studies with predominantly plastid genes weakly supported American ancestry for the coconut but ambiguous sister relationships. In this paper, we use multiple single copy nuclear loci to address the phylogeny of the Cocoseae subtribe Attaleinae, and resolve the closest extant relative of the coconut. Methodology/Principal Findings We present the results of combined analysis of DNA sequences of seven WRKY transcription factor loci across 72 samples of Arecaceae tribe Cocoseae subtribe Attaleinae, representing all genera classified within the subtribe, and three outgroup taxa with maximum parsimony, maximum likelihood, and Bayesian approaches, producing highly congruent and well-resolved trees that robustly identify the genus Syagrus as sister to Cocos and resolve novel and well-supported relationships among the other genera of the Attaleinae. We also address incongruence among the gene trees with gene tree reconciliation analysis, and assign estimated ages to the nodes of our tree. Conclusions/Significance This study represents the as yet most extensive phylogenetic analyses of Cocoseae subtribe Attaleinae. We present a well-resolved and supported phylogeny of the subtribe that robustly indicates a sister relationship between Cocos and Syagrus. This is not only of biogeographic interest, but will also open fruitful avenues of inquiry regarding evolution of functional genes useful for crop improvement. Establishment of two major clades of American Attaleinae occurred in the Oligocene (ca. 37 MYBP) in Eastern Brazil. The divergence of Cocos from Syagrus is estimated at 35 MYBP. The biogeographic and morphological congruence that we see for

  15. BEAGLE: An Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics

    PubMed Central

    Ayres, Daniel L.; Darling, Aaron; Zwickl, Derrick J.; Beerli, Peter; Holder, Mark T.; Lewis, Paul O.; Huelsenbeck, John P.; Ronquist, Fredrik; Swofford, David L.; Cummings, Michael P.; Rambaut, Andrew; Suchard, Marc A.

    2012-01-01

    Abstract Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting models of evolution, these approaches remain computationally demanding. High-throughput sequencing can yield data for thousands of taxa, but scaling to such problems using serial computing often necessitates the use of nonstatistical or approximate approaches. The recent emergence of graphics processing units (GPUs) provides an opportunity to leverage their excellent floating-point computational performance to accelerate statistical phylogenetic inference. A specialized library for phylogenetic calculation would allow existing software packages to make more effective use of available computer hardware, including GPUs. Adoption of a common library would also make it easier for other emerging computing architectures, such as field programmable gate arrays, to be used in the future. We present BEAGLE, an application programming interface (API) and library for high-performance statistical phylogenetic inference. The API provides a uniform interface for performing phylogenetic likelihood calculations on a variety of compute hardware platforms. The library includes a set of efficient implementations and can currently exploit hardware including GPUs using NVIDIA CUDA, central processing units (CPUs) with Streaming SIMD Extensions and related processor supplementary instruction sets, and multicore CPUs via OpenMP. To demonstrate the advantages of a common API, we have incorporated the library into several popular phylogenetic software packages. The BEAGLE library is free open source software licensed under the Lesser GPL and available from http://beagle-lib.googlecode.com. An example client program is available as public domain software. PMID:21963610

  16. Phylogenetic ctDNA analysis depicts early stage lung cancer evolution

    PubMed Central

    Abbosh, Christopher; Birkbak, Nicolai J.; Wilson, Gareth A.; Jamal-Hanjani, Mariam; Constantin, Tudor; Salari, Raheleh; Le Quesne, John; Moore, David A; Veeriah, Selvaraju; Rosenthal, Rachel; Marafioti, Teresa; Kirkizlar, Eser; Watkins, Thomas B K; McGranahan, Nicholas; Ward, Sophia; Martinson, Luke; Riley, Joan; Fraioli, Francesco; Al Bakir, Maise; Grönroos, Eva; Zambrana, Francisco; Endozo, Raymondo; Bi, Wenya Linda; Fennessy, Fiona M.; Sponer, Nicole; Johnson, Diana; Laycock, Joanne; Shafi, Seema; Czyzewska-Khan, Justyna; Rowan, Andrew; Chambers, Tim; Matthews, Nik; Turajlic, Samra; Hiley, Crispin; Lee, Siow Ming; Forster, Martin D.; Ahmad, Tanya; Falzon, Mary; Borg, Elaine; Lawrence, David; Hayward, Martin; Kolvekar, Shyam; Panagiotopoulos, Nikolaos; Janes, Sam M; Thakrar, Ricky; Ahmed, Asia; Blackhall, Fiona; Summers, Yvonne; Hafez, Dina; Naik, Ashwini; Ganguly, Apratim; Kareht, Stephanie; Shah, Rajesh; Joseph, Leena; Quinn, Anne Marie; Crosbie, Phil; Naidu, Babu; Middleton, Gary; Langman, Gerald; Trotter, Simon; Nicolson, Marianne; Remmen, Hardy; Kerr, Keith; Chetty, Mahendran; Gomersall, Lesley; Fennell, Dean; Nakas, Apostolos; Rathinam, Sridhar; Anand, Girija; Khan, Sajid; Russell, Peter; Ezhil, Veni; Ismail, Babikir; Irvin-sellers, Melanie; Prakash, Vineet; Lester, Jason; Kornaszewska, Malgorzata; Attanoos, Richard; Adams, Haydn; Davies, Helen; Oukrif, Dahmane; Akarca, Ayse U; Hartley, John A; Lowe, Helen L; Lock, Sara; Iles, Natasha; Bell, Harriet; Ngai, Yenting; Elgar, Greg; Szallasi, Zoltan; Schwarz, Roland F; Herrero, Javier; Stewart, Aengus; Quezada, Sergio A; Peggs, Karl S.; Van Loo, Peter; Dive, Caroline; Lin, Jimmy; Rabinowitz, Matthew; Aerts, Hugo JWL; Hackshaw, Allan; Shaw, Jacqui A; Zimmermann, Bernhard G.; Swanton, Charles

    2017-01-01

    Summary The early detection of relapse following primary surgery for non-small cell lung cancer and the characterization of emerging subclones seeding metastatic sites might offer new therapeutic approaches to limit tumor recurrence. The potential to non-invasively track tumor evolutionary dynamics in ctDNA of early-stage lung cancer is not established. Here we conduct a tumour-specific phylogenetic approach to ctDNA profiling in the first 100 TRACERx (TRAcking non-small cell lung Cancer Evolution through therapy (Rx)) study participants, including one patient co-recruited to the PEACE (Posthumous Evaluation of Advanced Cancer Environment) post-mortem study. We identify independent predictors of ctDNA release and perform tumor volume limit of detection analyses. Through blinded profiling of post-operative plasma, we observe evidence of adjuvant chemotherapy resistance and identify patients destined to experience recurrence of their lung cancer. Finally, we show that phylogenetic ctDNA profiling tracks the subclonal nature of lung cancer relapse and metastases, providing a new approach for ctDNA driven therapeutic studies PMID:28445469

  17. Phylogenetic ctDNA analysis depicts early-stage lung cancer evolution.

    PubMed

    Abbosh, Christopher; Birkbak, Nicolai J; Wilson, Gareth A; Jamal-Hanjani, Mariam; Constantin, Tudor; Salari, Raheleh; Le Quesne, John; Moore, David A; Veeriah, Selvaraju; Rosenthal, Rachel; Marafioti, Teresa; Kirkizlar, Eser; Watkins, Thomas B K; McGranahan, Nicholas; Ward, Sophia; Martinson, Luke; Riley, Joan; Fraioli, Francesco; Al Bakir, Maise; Grönroos, Eva; Zambrana, Francisco; Endozo, Raymondo; Bi, Wenya Linda; Fennessy, Fiona M; Sponer, Nicole; Johnson, Diana; Laycock, Joanne; Shafi, Seema; Czyzewska-Khan, Justyna; Rowan, Andrew; Chambers, Tim; Matthews, Nik; Turajlic, Samra; Hiley, Crispin; Lee, Siow Ming; Forster, Martin D; Ahmad, Tanya; Falzon, Mary; Borg, Elaine; Lawrence, David; Hayward, Martin; Kolvekar, Shyam; Panagiotopoulos, Nikolaos; Janes, Sam M; Thakrar, Ricky; Ahmed, Asia; Blackhall, Fiona; Summers, Yvonne; Hafez, Dina; Naik, Ashwini; Ganguly, Apratim; Kareht, Stephanie; Shah, Rajesh; Joseph, Leena; Marie Quinn, Anne; Crosbie, Phil A; Naidu, Babu; Middleton, Gary; Langman, Gerald; Trotter, Simon; Nicolson, Marianne; Remmen, Hardy; Kerr, Keith; Chetty, Mahendran; Gomersall, Lesley; Fennell, Dean A; Nakas, Apostolos; Rathinam, Sridhar; Anand, Girija; Khan, Sajid; Russell, Peter; Ezhil, Veni; Ismail, Babikir; Irvin-Sellers, Melanie; Prakash, Vineet; Lester, Jason F; Kornaszewska, Malgorzata; Attanoos, Richard; Adams, Haydn; Davies, Helen; Oukrif, Dahmane; Akarca, Ayse U; Hartley, John A; Lowe, Helen L; Lock, Sara; Iles, Natasha; Bell, Harriet; Ngai, Yenting; Elgar, Greg; Szallasi, Zoltan; Schwarz, Roland F; Herrero, Javier; Stewart, Aengus; Quezada, Sergio A; Peggs, Karl S; Van Loo, Peter; Dive, Caroline; Lin, C Jimmy; Rabinowitz, Matthew; Aerts, Hugo J W L; Hackshaw, Allan; Shaw, Jacqui A; Zimmermann, Bernhard G; Swanton, Charles

    2017-04-26

    The early detection of relapse following primary surgery for non-small-cell lung cancer and the characterization of emerging subclones, which seed metastatic sites, might offer new therapeutic approaches for limiting tumour recurrence. The ability to track the evolutionary dynamics of early-stage lung cancer non-invasively in circulating tumour DNA (ctDNA) has not yet been demonstrated. Here we use a tumour-specific phylogenetic approach to profile the ctDNA of the first 100 TRACERx (Tracking Non-Small-Cell Lung Cancer Evolution Through Therapy (Rx)) study participants, including one patient who was also recruited to the PEACE (Posthumous Evaluation of Advanced Cancer Environment) post-mortem study. We identify independent predictors of ctDNA release and analyse the tumour-volume detection limit. Through blinded profiling of postoperative plasma, we observe evidence of adjuvant chemotherapy resistance and identify patients who are very likely to experience recurrence of their lung cancer. Finally, we show that phylogenetic ctDNA profiling tracks the subclonal nature of lung cancer relapse and metastasis, providing a new approach for ctDNA-driven therapeutic studies.

  18. Insect pathogenicity in plant-beneficial pseudomonads: phylogenetic distribution and comparative genomics.

    PubMed

    Flury, Pascale; Aellen, Nora; Ruffner, Beat; Péchy-Tarr, Maria; Fataar, Shakira; Metla, Zane; Dominguez-Ferreras, Ana; Bloemberg, Guido; Frey, Joachim; Goesmann, Alexander; Raaijmakers, Jos M; Duffy, Brion; Höfte, Monica; Blom, Jochen; Smits, Theo H M; Keel, Christoph; Maurhofer, Monika

    2016-10-01

    Bacteria of the genus Pseudomonas occupy diverse environments. The Pseudomonas fluorescens group is particularly well-known for its plant-beneficial properties including pathogen suppression. Recent observations that some strains of this group also cause lethal infections in insect larvae, however, point to a more versatile ecology of these bacteria. We show that 26 P. fluorescens group strains, isolated from three continents and covering three phylogenetically distinct sub-clades, exhibited different activities toward lepidopteran larvae, ranging from lethal to avirulent. All strains of sub-clade 1, which includes Pseudomonas chlororaphis and Pseudomonas protegens, were highly insecticidal regardless of their origin (animals, plants). Comparative genomics revealed that strains in this sub-clade possess specific traits allowing a switch between plant- and insect-associated lifestyles. We identified 90 genes unique to all highly insecticidal strains (sub-clade 1) and 117 genes common to all strains of sub-clade 1 and present in some moderately insecticidal strains of sub-clade 3. Mutational analysis of selected genes revealed the importance of chitinase C and phospholipase C in insect pathogenicity. The study provides insight into the genetic basis and phylogenetic distribution of traits defining insecticidal activity in plant-beneficial pseudomonads. Strains with potent dual activity against plant pathogens and herbivorous insects have great potential for use in integrated pest management for crops.

  19. Insect pathogenicity in plant-beneficial pseudomonads: phylogenetic distribution and comparative genomics

    PubMed Central

    Flury, Pascale; Aellen, Nora; Ruffner, Beat; Péchy-Tarr, Maria; Fataar, Shakira; Metla, Zane; Dominguez-Ferreras, Ana; Bloemberg, Guido; Frey, Joachim; Goesmann, Alexander; Raaijmakers, Jos M; Duffy, Brion; Höfte, Monica; Blom, Jochen; Smits, Theo H M; Keel, Christoph; Maurhofer, Monika

    2016-01-01

    Bacteria of the genus Pseudomonas occupy diverse environments. The Pseudomonas fluorescens group is particularly well-known for its plant-beneficial properties including pathogen suppression. Recent observations that some strains of this group also cause lethal infections in insect larvae, however, point to a more versatile ecology of these bacteria. We show that 26 P. fluorescens group strains, isolated from three continents and covering three phylogenetically distinct sub-clades, exhibited different activities toward lepidopteran larvae, ranging from lethal to avirulent. All strains of sub-clade 1, which includes Pseudomonas chlororaphis and Pseudomonas protegens, were highly insecticidal regardless of their origin (animals, plants). Comparative genomics revealed that strains in this sub-clade possess specific traits allowing a switch between plant- and insect-associated lifestyles. We identified 90 genes unique to all highly insecticidal strains (sub-clade 1) and 117 genes common to all strains of sub-clade 1 and present in some moderately insecticidal strains of sub-clade 3. Mutational analysis of selected genes revealed the importance of chitinase C and phospholipase C in insect pathogenicity. The study provides insight into the genetic basis and phylogenetic distribution of traits defining insecticidal activity in plant-beneficial pseudomonads. Strains with potent dual activity against plant pathogens and herbivorous insects have great potential for use in integrated pest management for crops. PMID:26894448

  20. Is invasion success of Australian trees mediated by their native biogeography, phylogenetic history, or both?

    PubMed

    Miller, Joseph T; Hui, Cang; Thornhill, Andrew; Gallien, Laure; Le Roux, Johannes J; Richardson, David M

    2016-12-30

    For a plant species to become invasive it has to progress along the introduction-naturalization-invasion (INI) continuum which reflects the joint direction of niche breadth. Identification of traits that correlate with and drive species invasiveness along the continuum is a major focus of invasion biology. If invasiveness is underlain by heritable traits, and if such traits are phylogenetically conserved, then we would expect non-native species with different introduction status (i.e. position along the INI continuum) to show phylogenetic signal. This study uses two clades that contain a large number of invasive tree species from the genera Acacia and Eucalyptus to test whether geographic distribution and a novel phylogenetic conservation method can predict which species have been introduced, became naturalized, and invasive. Our results suggest that no underlying phylogenetic signal underlie the introduction status for both groups of trees, except for introduced acacias. The more invasive acacia clade contains invasive species that have smoother geographic distributions and are more marginal in the phylogenetic network. The less invasive eucalyptus group contains invasive species that are more clustered geographically, more centrally located in the phylogenetic network and have phylogenetic distances between invasive and non-invasive species that are trending toward the mean pairwise distance. This suggests that highly invasive groups may be identified because they have invasive species with smoother and faster expanding native distributions and are located more to the edges of phylogenetic networks than less invasive groups. Published by Oxford University Press on behalf of the Annals of Botany Company.

  1. How does cognition evolve? Phylogenetic comparative psychology

    PubMed Central

    Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria

    2014-01-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850

  2. Phylogenetic diversity measures based on Hill numbers.

    PubMed

    Chao, Anne; Chiu, Chun-Huo; Jost, Lou

    2010-11-27

    We propose a parametric class of phylogenetic diversity (PD) measures that are sensitive to both species abundance and species taxonomic or phylogenetic distances. This work extends the conventional parametric species-neutral approach (based on 'effective number of species' or Hill numbers) to take into account species relatedness, and also generalizes the traditional phylogenetic approach (based on 'total phylogenetic length') to incorporate species abundances. The proposed measure quantifies 'the mean effective number of species' over any time interval of interest, or the 'effective number of maximally distinct lineages' over that time interval. The product of the measure and the interval length quantifies the 'branch diversity' of the phylogenetic tree during that interval. The new measures generalize and unify many existing measures and lead to a natural definition of taxonomic diversity as a special case. The replication principle (or doubling property), an important requirement for species-neutral diversity, is generalized to PD. The widely used Rao's quadratic entropy and the phylogenetic entropy do not satisfy this essential property, but a simple transformation converts each to our measures, which do satisfy the property. The proposed approach is applied to forest data for interpreting the effects of thinning.

  3. How does cognition evolve? Phylogenetic comparative psychology.

    PubMed

    MacLean, Evan L; Matthews, Luke J; Hare, Brian A; Nunn, Charles L; Anderson, Rindy C; Aureli, Filippo; Brannon, Elizabeth M; Call, Josep; Drea, Christine M; Emery, Nathan J; Haun, Daniel B M; Herrmann, Esther; Jacobs, Lucia F; Platt, Michael L; Rosati, Alexandra G; Sandel, Aaron A; Schroepfer, Kara K; Seed, Amanda M; Tan, Jingzhi; van Schaik, Carel P; Wobber, Victoria

    2012-03-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.

  4. Phylogenetic assessment of heterotrophic bacteria from a water distribution system using 16S rDNA sequencing.

    PubMed

    Tokajian, Sima T; Hashwa, Fuad A; Hancock, Ian C; Zalloua, Pierre A

    2005-04-01

    Determination of a heterotrophic plate count (HPC) for drinking-water samples alone is not enough to assess possible health hazards associated with sudden changes in the bacterial count. Speciation is very crucial to determine whether the population includes pathogens and (or) opportunistic pathogens. Most of the isolates recovered from drinking water samples could not be allocated to a specific phylogenetic branch based on the use of conventional diagnostic methods. The present study had to use phylogenetic analysis, which was simplified by determining and using the first 500-bp sequence of the 16S rDNA, to successfully identify the type and species of bacteria found in the samples. Gram-positive bacteria alpha-, beta-, and gamma-Proteobacteria were found to be the major groups representing the heterotrophic bacteria in drinking water. The study also revealed that the presence of sphingomonads in drinking water supplies may be much more common than has been reported so far and thus further studies are merited. The intermittent mode of supply, mainly characterized by water stagnation and flow interruption associated possibly with biofilm detachment, raised the possibility that the studied bacterial populations in such systems represented organisms coming from 2 different niches, the biofilm and the water column.

  5. Probabilistic analysis showing that a combination of bacteroides and methanobrevibacter source tracking markers is effective for identifying waters contaminated by human fecal pollution

    USGS Publications Warehouse

    Johnston, Christopher; Byappanahalli, Muruleedhara N.; Gibson, Jacqueline MacDonald; Ufnar, Jennifer A.; Whitman, Richard L.; Stewart, Jill R.

    2013-01-01

    Microbial source tracking assays to identify sources of waterborne contamination typically target genetic markers of host-specific microorganisms. However, no bacterial marker has been shown to be 100% host-specific, and cross-reactivity has been noted in studies evaluating known source samples. Using 485 challenge samples from 20 different human and animal fecal sources, this study evaluated microbial source tracking markers including the Bacteroides HF183 16S rRNA, M. smithii nifH, and Enterococcus esp gene targets that have been proposed as potential indicators of human fecal contamination. Bayes' Theorem was used to calculate the conditional probability that these markers or a combination of markers can correctly identify human sources of fecal pollution. All three human-associated markers were detected in 100% of the sewage samples analyzed. Bacteroides HF183 was the most effective marker for determining whether contamination was specifically from a human source, and greater than 98% certainty that contamination was from a human source was shown when both Bacteroides HF183 and M. smithii nifH markers were present. A high degree of certainty was attained even in cases where the prior probability of human fecal contamination was as low as 8.5%. The combination of Bacteroides HF183 and M. smithii nifH source tracking markers can help identify surface waters impacted by human fecal contamination, information useful for prioritizing restoration activities or assessing health risks from exposure to contaminated waters.

  6. PHYLOGEOrec: A QGIS plugin for spatial phylogeographic reconstruction from phylogenetic tree and geographical information data

    NASA Astrophysics Data System (ADS)

    Nashrulloh, Maulana Malik; Kurniawan, Nia; Rahardi, Brian

    2017-11-01

    The increasing availability of genetic sequence data associated with explicit geographic and environment (including biotic and abiotic components) information offers new opportunities to study the processes that shape biodiversity and its patterns. Developing phylogeography reconstruction, by integrating phylogenetic and biogeographic knowledge, provides richer and deeper visualization and information on diversification events than ever before. Geographical information systems such as QGIS provide an environment for spatial modeling, analysis, and dissemination by which phylogenetic models can be explicitly linked with their associated spatial data, and subsequently, they will be integrated with other related georeferenced datasets describing the biotic and abiotic environment. We are introducing PHYLOGEOrec, a QGIS plugin for building spatial phylogeographic reconstructions constructed from phylogenetic tree and geographical information data based on QGIS2threejs. By using PHYLOGEOrec, researchers can integrate existing phylogeny and geographical information data, resulting in three-dimensional geographic visualizations of phylogenetic trees in the Keyhole Markup Language (KML) format. Such formats can be overlaid on a map using QGIS and finally, spatially viewed in QGIS by means of a QGIS2threejs engine for further analysis. KML can also be viewed in reputable geobrowsers with KML-support (i.e., Google Earth).

  7. Efficiently Identifying Significant Associations in Genome-wide Association Studies

    PubMed Central

    Eskin, Eleazar

    2013-01-01

    Abstract Over the past several years, genome-wide association studies (GWAS) have implicated hundreds of genes in common disease. More recently, the GWAS approach has been utilized to identify regions of the genome that harbor variation affecting gene expression or expression quantitative trait loci (eQTLs). Unlike GWAS applied to clinical traits, where only a handful of phenotypes are analyzed per study, in eQTL studies, tens of thousands of gene expression levels are measured, and the GWAS approach is applied to each gene expression level. This leads to computing billions of statistical tests and requires substantial computational resources, particularly when applying novel statistical methods such as mixed models. We introduce a novel two-stage testing procedure that identifies all of the significant associations more efficiently than testing all the single nucleotide polymorphisms (SNPs). In the first stage, a small number of informative SNPs, or proxies, across the genome are tested. Based on their observed associations, our approach locates the regions that may contain significant SNPs and only tests additional SNPs from those regions. We show through simulations and analysis of real GWAS datasets that the proposed two-stage procedure increases the computational speed by a factor of 10. Additionally, efficient implementation of our software increases the computational speed relative to the state-of-the-art testing approaches by a factor of 75. PMID:24033261

  8. Evolution of four gene families with patchy phylogenetic distributions: influx of genes into protist genomes

    PubMed Central

    Andersson, Jan O; Hirt, Robert P; Foster, Peter G; Roger, Andrew J

    2006-01-01

    Background Lateral gene transfer (LGT) in eukaryotes from non-organellar sources is a controversial subject in need of further study. Here we present gene distribution and phylogenetic analyses of the genes encoding the hybrid-cluster protein, A-type flavoprotein, glucosamine-6-phosphate isomerase, and alcohol dehydrogenase E. These four genes have a limited distribution among sequenced prokaryotic and eukaryotic genomes and were previously implicated in gene transfer events affecting eukaryotes. If our previous contention that these genes were introduced by LGT independently into the diplomonad and Entamoeba lineages were true, we expect that the number of putative transfers and the phylogenetic signal supporting LGT should be stable or increase, rather than decrease, when novel eukaryotic and prokaryotic homologs are added to the analyses. Results The addition of homologs from phagotrophic protists, including several Entamoeba species, the pelobiont Mastigamoeba balamuthi, and the parabasalid Trichomonas vaginalis, and a large quantity of sequences from genome projects resulted in an apparent increase in the number of putative transfer events affecting all three domains of life. Some of the eukaryotic transfers affect a wide range of protists, such as three divergent lineages of Amoebozoa, represented by Entamoeba, Mastigamoeba, and Dictyostelium, while other transfers only affect a limited diversity, for example only the Entamoeba lineage. These observations are consistent with a model where these genes have been introduced into protist genomes independently from various sources over a long evolutionary time. Conclusion Phylogenetic analyses of the updated datasets using more sophisticated phylogenetic methods, in combination with the gene distribution analyses, strengthened, rather than weakened, the support for LGT as an important mechanism affecting the evolution of these gene families. Thus, gene transfer seems to be an on-going evolutionary mechanism by

  9. Comparative genomic analysis and phylogenetic position of Theileria equi

    PubMed Central

    2012-01-01

    Background Transmission of arthropod-borne apicomplexan parasites that cause disease and result in death or persistent infection represents a major challenge to global human and animal health. First described in 1901 as Piroplasma equi, this re-emergent apicomplexan parasite was renamed Babesia equi and subsequently Theileria equi, reflecting an uncertain taxonomy. Understanding mechanisms by which apicomplexan parasites evade immune or chemotherapeutic elimination is required for development of effective vaccines or chemotherapeutics. The continued risk of transmission of T. equi from clinically silent, persistently infected equids impedes the goal of returning the U. S. to non-endemic status. Therefore comparative genomic analysis of T. equi was undertaken to: 1) identify genes contributing to immune evasion and persistence in equid hosts, 2) identify genes involved in PBMC infection biology and 3) define the phylogenetic position of T. equi relative to sequenced apicomplexan parasites. Results The known immunodominant proteins, EMA1, 2 and 3 were discovered to belong to a ten member gene family with a mean amino acid identity, in pairwise comparisons, of 39%. Importantly, the amino acid diversity of EMAs is distributed throughout the length of the proteins. Eight of the EMA genes were simultaneously transcribed. As the agents that cause bovine theileriosis infect and transform host cell PBMCs, we confirmed that T. equi infects equine PBMCs, however, there is no evidence of host cell transformation. Indeed, a number of genes identified as potential manipulators of the host cell phenotype are absent from the T. equi genome. Comparative genomic analysis of T. equi revealed the phylogenetic positioning relative to seven apicomplexan parasites using deduced amino acid sequences from 150 genes placed it as a sister taxon to Theileria spp. Conclusions The EMA family does not fit the paradigm for classical antigenic variation, and we propose a novel model describing the

  10. A novel prophage identified in strains from Salmonella enterica serovar Enteritidis is a phylogenetic signature of the lineage ST-1974

    PubMed Central

    D'Alessandro, Bruno; Pérez Escanda, Victoria; Balestrazzi, Lucía; Iriarte, Andrés; Pickard, Derek; Yim, Lucía; Chabalgoity, José Alejandro; Betancor, Laura

    2018-01-01

    Salmonella enterica serovar Enteritidis is a major agent of foodborne diseases worldwide. In Uruguay, this serovar was almost negligible until the mid 1990s but since then it has become the most prevalent. Previously, we characterized a collection of strains isolated from 1988 to 2005 and found that the two oldest strains were the most genetically divergent. In order to further characterize these strains, we sequenced and annotated eight genomes including those of the two oldest isolates. We report on the identification and characterization of a novel 44 kbp Salmonella prophage found exclusively in these two genomes. Sequence analysis reveals that the prophage is a mosaic, with homologous regions in different Salmonella prophages. It contains 60 coding sequences, including two genes, gogB and sseK3, involved in virulence and modulation of host immune response. Analysis of serovar Enteritidis genomes available in public databases confirmed that this prophage is absent in most of them, with the exception of a group of 154 genomes. All 154 strains carrying this prophage belong to the same sequence type (ST-1974), suggesting that its acquisition occurred in a common ancestor. We tested this by phylogenetic analysis of 203 genomes representative of the intraserovar diversity. The ST-1974 forms a distinctive monophyletic lineage, and the newly described prophage is a phylogenetic signature of this lineage that could be used as a molecular marker. The phylogenetic analysis also shows that the major ST (ST-11) is polyphyletic and might have given rise to almost all other STs, including ST-1974. PMID:29509137

  11. pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree

    PubMed Central

    2010-01-01

    Background Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. Results This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Conclusions Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service. PMID:21034504

  12. The utility of DNA sequences of an intron from the beta-fibrinogen gene in phylogenetic analysis of woodpeckers (Aves: Picidae).

    PubMed

    Prychitko, T M; Moore, W S

    1997-10-01

    Estimating phylogenies from DNA sequence data has become the major methodology of molecular phylogenetics. To date, molecular phylogenetics of the vertebrates has been very dependent on mtDNA, but studies involving mtDNA are limited because the several genes comprising the mt-genome are inherited as a single linkage group. The only apparent solution to this problem is to sequence additional genes, each representing a distinct linkage group, so that the resultant gene trees provide independent estimates of the species tree. There exists the need to find novel gene sequences which contain enough phylogenetic information to resolve relationships between closely related species. A possible source is the nuclear-encoded introns, because they evolve more rapidly than exons. We designed primers to amplify and sequence the 7 intron from the beta-fibrinogen gene for a recently evolved group, the woodpeckers. We sequenced the entire intron for 10 specimens representing five species. Nucleotide substitutions are randomly distributed along the length of the intron, suggesting selective neutrality. A preliminary analysis indicates that the phylogenetic signal in the intron is as strong as that in the mitochondrial encoded cytochrome b (cyt b) gene. The topology of the beta-fibrinogen tree is identical to that of the cyt b tree. This analysis demonstrates the ability of the 7 intron of beta-fibrinogen to provide well resolved, independent gene trees for recently evolved groups and establishes it as a source of sequences to be used in other phylogenetic studies. Copyright 1997 Academic Press

  13. Phylogenetic species identification in Rattus highlights rapid radiation and morphological similarity of New Guinean species.

    PubMed

    Robins, Judith H; Tintinger, Vernon; Aplin, Ken P; Hingston, Melanie; Matisoo-Smith, Elizabeth; Penny, David; Lavery, Shane D

    2014-01-01

    The genus Rattus is highly speciose, the taxonomy is complex, and individuals are often difficult to identify to the species level. Previous studies have demonstrated the usefulness of phylogenetic approaches to identification in Rattus but some species, especially among the endemics of the New Guinean region, showed poor resolution. Possible reasons for this are simple misidentification, incomplete gene lineage sorting, hybridization, and phylogenetically distinct lineages that are unrecognised taxonomically. To assess these explanations we analysed 217 samples, representing nominally 25 Rattus species, collected in New Guinea, Asia, Australia and the Pacific. To reduce misidentification problems we sequenced museum specimens from earlier morphological studies and recently collected tissues from samples with associated voucher specimens. We also reassessed vouchers from previously sequenced specimens. We inferred combined and separate phylogenies from two mitochondrial DNA regions comprising 550 base pair D-loop sequences and both long (655 base pair) and short (150 base pair) cytochrome oxidase I sequences. Our phylogenetic species identification for 17 species was consistent with morphological designations and current taxonomy thus reinforcing the usefulness of this approach. We reduced misidentifications and consequently the number of polyphyletic species in our phylogenies but the New Guinean Rattus clades still exhibited considerable complexity. Only three of our eight New Guinean species were monophyletic. We found good evidence for either incomplete mitochondrial lineage sorting or hybridization between species within two pairs, R. leucopus/R. cf. verecundus and R. steini/R. praetor. Additionally, our results showed that R. praetor, R. niobe and R. verecundus each likely encompass more than one species. Our study clearly points to the need for a revised taxonomy of the rats of New Guinea, based on broader sampling and informed by both morphology and

  14. Phylogenetic Species Identification in Rattus Highlights Rapid Radiation and Morphological Similarity of New Guinean Species

    PubMed Central

    Robins, Judith H.; Tintinger, Vernon; Aplin, Ken P.; Hingston, Melanie; Matisoo-Smith, Elizabeth; Penny, David; Lavery, Shane D.

    2014-01-01

    The genus Rattus is highly speciose, the taxonomy is complex, and individuals are often difficult to identify to the species level. Previous studies have demonstrated the usefulness of phylogenetic approaches to identification in Rattus but some species, especially among the endemics of the New Guinean region, showed poor resolution. Possible reasons for this are simple misidentification, incomplete gene lineage sorting, hybridization, and phylogenetically distinct lineages that are unrecognised taxonomically. To assess these explanations we analysed 217 samples, representing nominally 25 Rattus species, collected in New Guinea, Asia, Australia and the Pacific. To reduce misidentification problems we sequenced museum specimens from earlier morphological studies and recently collected tissues from samples with associated voucher specimens. We also reassessed vouchers from previously sequenced specimens. We inferred combined and separate phylogenies from two mitochondrial DNA regions comprising 550 base pair D-loop sequences and both long (655 base pair) and short (150 base pair) cytochrome oxidase I sequences. Our phylogenetic species identification for 17 species was consistent with morphological designations and current taxonomy thus reinforcing the usefulness of this approach. We reduced misidentifications and consequently the number of polyphyletic species in our phylogenies but the New Guinean Rattus clades still exhibited considerable complexity. Only three of our eight New Guinean species were monophyletic. We found good evidence for either incomplete mitochondrial lineage sorting or hybridization between species within two pairs, R. leucopus/R. cf. verecundus and R. steini/R. praetor. Additionally, our results showed that R. praetor, R. niobe and R. verecundus each likely encompass more than one species. Our study clearly points to the need for a revised taxonomy of the rats of New Guinea, based on broader sampling and informed by both morphology and

  15. Sequence comparison of phoR, gyrB, groEL, and cheA genes as phylogenetic markers for distinguishing Bacillus amyloliquefaciens and B. subtilis and for identifying Bacillus strain B29.

    PubMed

    Yu, C; Jin, J; Meng, L-Q; Xia, H-H; Yuan, H-F; Wang, J; Yu, D-S; Zhao, X-Y; Sha, C-Q

    2017-05-20

    Given the close genetic relationship between Bacillus amyloliquefaciens and B. subtilis, distinguishing the two solely based on their physiological and biochemical characteristics and 16S rRNA sequences is difficult. Molecular identification was used to discover suitable genes for distinguishing the two bacteria, and to identify the bio-controlling strain B29, due to molecular identification has been paid more and more attention. The similarity of four genes, cheA, gyrB, groEL and phoR, of the two species was compared by the software BLASTN and MAGA, and phylogenetic tree was constructed. The B29 strain was re-identified by using the screened genes. The similarities of the four genes, gyrB, groEL, cheA and phoR, of the two species were 93-95%, 82-84%, 76-78% and 76-77%, respectively. The homologies of the four genes of the strain B29 and the strains of B. amyloliquefaciens strains were more than 95%. We determined how well the phoR and cheA genes could be used to differentiate B. amyloliquefacien and B. subtilis. The previously isolated biological control strain B29, initially classified as B. subtilis, was re-classified as B. amyloliquefaciens. Our data indicate that other than the phoR gene, the cheA gene might be a useful phylogenetic marker for differentiating B. subtilis and B. amyloliquefaciens.

  16. The Escherichia coli phylogenetic group B2 with integrons prevails in childhood recurrent urinary tract infections.

    PubMed

    Kõljalg, Siiri; Truusalu, Kai; Stsepetova, Jelena; Pai, Kristiine; Vainumäe, Inga; Sepp, Epp; Mikelsaar, Marika

    2014-05-01

    The aim of our study was to characterize the phylogenetic groups of Escherichia coli, antibiotic resistance, and containment of class 1 integrons in the first attack of pyelonephritis and in subsequent recurrences in young children. Altogether, 89 urine E. coli isolates from 41 children with urinary tract infection (UTI) were studied for prevalence and persistence of phylogenetic groups by pulsed-field gel electrophoresis (PFGE), antibacterial resistance by minimal inhibitory concentrations (MIC) and class 1 integrons by PCR. Phylogenetic group B2 was most common (57%), followed by D (20%), A (18%) and B1 (5%). Overall resistance to betalactams was 61%, trimethoprim-sulfamethoxazole 28%, and was not associated with phylogenetic groups. According to PFGE, the same clonal strain persisted in 77% of patients. The persistence was detected most often in phylogenetic group B2 (70%). Phylogenetic group B2 more often contained class 1 integrons than group A. Integron positive strains had higher MIC values of cefuroxime, cefotaxime, and gentamicin. In conclusion, phylogenetic group B2 was the most common cause of the first episode of pyelonephritis, as well as in case of the persistence of the same strain and contained frequently class 1 integrons in childhood recurrent UTI. An overall frequent betalactam resistance was equally distributed among phylogenetic groups. © 2013 APMIS. Published by John Wiley & Sons Ltd.

  17. Phylogenetic relationship and species delimitation of matsutake and allied species based on multilocus phylogeny and haplotype analyses.

    PubMed

    Ota, Yuko; Yamanaka, Takashi; Murata, Hitoshi; Neda, Hitoshi; Ohta, Akira; Kawai, Masataka; Yamada, Akiyoshi; Konno, Miki; Tanaka, Chihiro

    2012-01-01

    Tricholoma matsutake (S. Ito & S. Imai) Singer and its allied species are referred to as matsutake worldwide and are the most economically important edible mushrooms in Japan. They are widely distributed in the northern hemisphere and established an ectomycorrhizal relationship with conifer and broadleaf trees. To clarify relationships among T. matsutake and its allies, and to delimit phylogenetic species, we analyzed multilocus datasets (ITS, megB1, tef, gpd) with samples that were correctly identified based on morphological characteristics. Phylogenetic analyses clearly identified four major groups: matsutake, T. bakamatsutake, T. fulvocastaneum and T. caligatum; the latter three species were outside the matsutake group. The haplotype analyses and median-joining haplotype network analyses showed that the matsutake group included four closely related but clearly distinct taxa (T. matsutake, T. anatolicum, Tricholoma sp. from Mexico and T. magnivelare) from different geographical regions; these were considered to be distinct phylogenetic species.

  18. Identifying constituent spectra sources in multispectral images to quantify and locate cervical neoplasia

    NASA Astrophysics Data System (ADS)

    Baker, Kevin C.; Bambot, Shabbir

    2011-02-01

    Optical spectroscopy has been shown to be an effective method for detecting neoplasia. Guided Therapeutics has developed LightTouch, a non invasive device that uses a combination of reflectance and fluorescence spectroscopy for identifying early cancer of the human cervix. The combination of the multispectral information from the two spectroscopic modalities has been shown to be an effective method to screen for cervical cancer. There has however been a relative paucity of work in identifying the individual spectral components that contribute to the measured fluorescence and reflectance spectra. This work aims to identify the constituent source spectra and their concentrations. We used non-negative matrix factorization (NNMF) numerical methods to decompose the mixed multispectral data into the constituent spectra and their corresponding concentrations. NNMF is an iterative approach that factorizes the measured data into non-negative factors. The factors are chosen to minimize the root-mean-squared residual error. NNMF has shown promise for feature extraction and identification in the fields of text mining and spectral data analysis. Since both the constituent source spectra and their corresponding concentrations are assumed to be non-negative by nature NNMF is a reasonable approach to deconvolve the measured multispectral data. Supervised learning methods were then used to determine which of the constituent spectra sources best predict the amount of neoplasia. The constituent spectra sources found to best predict neoplasia were then compared with spectra of known biological chromophores.

  19. Palaeoproteomic evidence identifies archaic hominins associated with the Châtelperronian at the Grotte du Renne

    PubMed Central

    Welker, Frido; Hajdinjak, Mateja; Talamo, Sahra; Jaouen, Klervia; Dannemann, Michael; David, Francine; Julien, Michèle; Meyer, Matthias; Barnes, Ian; Brace, Selina; Kamminga, Pepijn; Fischer, Roman; Kessler, Benedikt M.; Stewart, John R.; Pääbo, Svante; Collins, Matthew J.; Hublin, Jean-Jacques

    2016-01-01

    In Western Europe, the Middle to Upper Paleolithic transition is associated with the disappearance of Neandertals and the spread of anatomically modern humans (AMHs). Current chronological, behavioral, and biological models of this transitional period hinge on the Châtelperronian technocomplex. At the site of the Grotte du Renne, Arcy-sur-Cure, morphological Neandertal specimens are not directly dated but are contextually associated with the Châtelperronian, which contains bone points and beads. The association between Neandertals and this “transitional” assemblage has been controversial because of the lack either of a direct hominin radiocarbon date or of molecular confirmation of the Neandertal affiliation. Here we provide further evidence for a Neandertal–Châtelperronian association at the Grotte du Renne through biomolecular and chronological analysis. We identified 28 additional hominin specimens through zooarchaeology by mass spectrometry (ZooMS) screening of morphologically uninformative bone specimens from Châtelperronian layers at the Grotte du Renne. Next, we obtain an ancient hominin bone proteome through liquid chromatography-MS/MS analysis and error-tolerant amino acid sequence analysis. Analysis of this palaeoproteome allows us to provide phylogenetic and physiological information on these ancient hominin specimens. We distinguish Late Pleistocene clades within the genus Homo based on ancient protein evidence through the identification of an archaic-derived amino acid sequence for the collagen type X, alpha-1 (COL10α1) protein. We support this by obtaining ancient mtDNA sequences, which indicate a Neandertal ancestry for these specimens. Direct accelerator mass spectometry radiocarbon dating and Bayesian modeling confirm that the hominin specimens date to the Châtelperronian at the Grotte du Renne. PMID:27638212

  20. Palaeoproteomic evidence identifies archaic hominins associated with the Châtelperronian at the Grotte du Renne.

    PubMed

    Welker, Frido; Hajdinjak, Mateja; Talamo, Sahra; Jaouen, Klervia; Dannemann, Michael; David, Francine; Julien, Michèle; Meyer, Matthias; Kelso, Janet; Barnes, Ian; Brace, Selina; Kamminga, Pepijn; Fischer, Roman; Kessler, Benedikt M; Stewart, John R; Pääbo, Svante; Collins, Matthew J; Hublin, Jean-Jacques

    2016-10-04

    In Western Europe, the Middle to Upper Paleolithic transition is associated with the disappearance of Neandertals and the spread of anatomically modern humans (AMHs). Current chronological, behavioral, and biological models of this transitional period hinge on the Châtelperronian technocomplex. At the site of the Grotte du Renne, Arcy-sur-Cure, morphological Neandertal specimens are not directly dated but are contextually associated with the Châtelperronian, which contains bone points and beads. The association between Neandertals and this "transitional" assemblage has been controversial because of the lack either of a direct hominin radiocarbon date or of molecular confirmation of the Neandertal affiliation. Here we provide further evidence for a Neandertal-Châtelperronian association at the Grotte du Renne through biomolecular and chronological analysis. We identified 28 additional hominin specimens through zooarchaeology by mass spectrometry (ZooMS) screening of morphologically uninformative bone specimens from Châtelperronian layers at the Grotte du Renne. Next, we obtain an ancient hominin bone proteome through liquid chromatography-MS/MS analysis and error-tolerant amino acid sequence analysis. Analysis of this palaeoproteome allows us to provide phylogenetic and physiological information on these ancient hominin specimens. We distinguish Late Pleistocene clades within the genus Homo based on ancient protein evidence through the identification of an archaic-derived amino acid sequence for the collagen type X, alpha-1 (COL10α1) protein. We support this by obtaining ancient mtDNA sequences, which indicate a Neandertal ancestry for these specimens. Direct accelerator mass spectometry radiocarbon dating and Bayesian modeling confirm that the hominin specimens date to the Châtelperronian at the Grotte du Renne.

  1. Application of community phylogenetic approaches to understand gene expression: differential exploration of venom gene space in predatory marine gastropods.

    PubMed

    Chang, Dan; Duda, Thomas F

    2014-06-05

    Predatory marine gastropods of the genus Conus exhibit substantial variation in venom composition both within and among species. Apart from mechanisms associated with extensive turnover of gene families and rapid evolution of genes that encode venom components ('conotoxins'), the evolution of distinct conotoxin expression patterns is an additional source of variation that may drive interspecific differences in the utilization of species' 'venom gene space'. To determine the evolution of expression patterns of venom genes of Conus species, we evaluated the expression of A-superfamily conotoxin genes of a set of closely related Conus species by comparing recovered transcripts of A-superfamily genes that were previously identified from the genomes of these species. We modified community phylogenetics approaches to incorporate phylogenetic history and disparity of genes and their expression profiles to determine patterns of venom gene space utilization. Less than half of the A-superfamily gene repertoire of these species is expressed, and only a few orthologous genes are coexpressed among species. Species exhibit substantially distinct expression strategies, with some expressing sets of closely related loci ('under-dispersed' expression of available genes) while others express sets of more disparate genes ('over-dispersed' expression). In addition, expressed genes show higher dN/dS values than either unexpressed or ancestral genes; this implies that expression exposes genes to selection and facilitates rapid evolution of these genes. Few recent lineage-specific gene duplicates are expressed simultaneously, suggesting that expression divergence among redundant gene copies may be established shortly after gene duplication. Our study demonstrates that venom gene space is explored differentially by Conus species, a process that effectively permits the independent and rapid evolution of venoms in these species.

  2. Selecting Species Traits for Biomonitoring Applications in light of Phylogenetic Relationships among Lotic Insects

    NASA Astrophysics Data System (ADS)

    Poff, N.; Vieira, N. K.; Simmons, M. P.; Olden, J. D.; Kondratieff, B. C.; Finn, D. S.

    2005-05-01

    The use of species traits as indicators of environmental disturbance is being considered for biomonitoring programs globally. As such, methods to select relevant and informative traits for inclusion in biometrics need to be developed. In this research, we identified 20 traits of aquatic insects within six trait groups: morphology, mobility, life-history strategy, thermal tolerance, feeding guild and ecology (e.g., habitat preference). We constructed phylogenetic trees for 1) all lotic insect species of North America and 2) all Ephemeroptera, Plecoptera and Trichoptera species based on morphology- and molecular-based analyses and classifications. We then measured variability (i.e., plasticity) of the 20 traits and six trait groups across the two phylogenetic trees. Traits with higher degrees of plasticity indicated traits that were less phylogenetically constrained, and were considered informative for biomonitoring purposes. Thermal tolerance, rheophily, body size at maturity and feeding guild showed the highest plasticity across both phylogenetic trees. Two mobility traits, occurrence in drift and adult dispersal distance, showed moderate plasticity. By contrast, adult exiting ability, degree of attachment, adult lifespan and body shape showed low variability and were thus less informative. Plastic species traits that are less phylogenetically constrained may be most useful in detecting community change along environmental gradients.

  3. Airborne Quercus pollen in SW Spain: Identifying favourable conditions for atmospheric transport and potential source areas.

    PubMed

    Maya-Manzano, José María; Fernández-Rodríguez, Santiago; Smith, Matt; Tormo-Molina, Rafael; Reynolds, Andrew M; Silva-Palacios, Inmaculada; Gonzalo-Garijo, Ángela; Sadyś, Magdalena

    2016-11-15

    The pollen grains of Quercus spp. (oak trees) are allergenic. This study investigates airborne Quercus pollen in SW Spain with the aim identifying favourable conditions for atmospheric transport and potential sources areas. Two types of Quercus distribution maps were produced. Airborne Quercus pollen concentrations were measured at three sites located in the Extremadura region (SW Spain) for 3 consecutive years. The seasonal occurrence of Quercus pollen in the air was investigated, as well as days with pollen concentrations ≥80Pm(-3). The distance that Quercus pollen can be transported in appreciable numbers was calculated using clusters of back trajectories representing the air mass movement above the source areas (oak woodlands), and by using a state-of-the-art dispersion model. The two main potential sources of Quercus airborne pollen captured in SW Spain are Q. ilex subsp. ballota and Q. suber. The minimum distances between aerobiological stations and Quercus woodlands have been estimated as: 40km (Plasencia), 66km (Don Benito), 62km (Zafra) from the context of this study. Daily mean Quercus pollen concentration can exceed 1,700Pm(-3), levels reached not less than 24 days in a single year. High Quercus pollen concentration were mostly associated with moderate wind speed events (6-10ms(-1)), whereas that a high wind speed (16-20ms(-1)) seems to be associated with low concentrations. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. Identifying risk sources of air contamination by polycyclic aromatic hydrocarbons.

    PubMed

    Huzlik, Jiri; Bozek, Frantisek; Pawelczyk, Adam; Licbinsky, Roman; Naplavova, Magdalena; Pondelicek, Michael

    2017-09-01

    This article is directed to determining concentrations of polycyclic aromatic hydrocarbons (PAHs), which are sorbed to solid particles in the air. Pollution sources were identified on the basis of the ratio of benzo[ghi]perylene (BghiPe) to benzo[a]pyrene (BaP). Because various important information is lost by determining the simple ratio of concentrations, least squares linear regression (classic ordinary least squares regression), reduced major axis, orthogonal regression, and Kendall-Theil robust diagnostics were utilized for identification. Statistical evaluation using all aforementioned methods demonstrated different ratios of the monitored PAHs in the intervals examined during warmer and colder periods. Analogous outputs were provided by comparing gradients of the emission factors acquired from the measured concentrations of BghiPe and BaP in motor vehicle exhaust gases. Based on these outputs, it was possible plausibly to state that the influence of burning organic fuels in heating stoves is prevalent in colder periods whereas in warmer periods transport was the exclusive source because other sources of PAH emissions were not found in the examined locations. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. A Functional-Phylogenetic Classification System for Transmembrane Solute Transporters

    PubMed Central

    Saier, Milton H.

    2000-01-01

    A comprehensive classification system for transmembrane molecular transporters has been developed and recently approved by the transport panel of the nomenclature committee of the International Union of Biochemistry and Molecular Biology. This system is based on (i) transporter class and subclass (mode of transport and energy coupling mechanism), (ii) protein phylogenetic family and subfamily, and (iii) substrate specificity. Almost all of the more than 250 identified families of transporters include members that function exclusively in transport. Channels (115 families), secondary active transporters (uniporters, symporters, and antiporters) (78 families), primary active transporters (23 families), group translocators (6 families), and transport proteins of ill-defined function or of unknown mechanism (51 families) constitute distinct categories. Transport mode and energy coupling prove to be relatively immutable characteristics and therefore provide primary bases for classification. Phylogenetic grouping reflects structure, function, mechanism, and often substrate specificity and therefore provides a reliable secondary basis for classification. Substrate specificity and polarity of transport prove to be more readily altered during evolutionary history and therefore provide a tertiary basis for classification. With very few exceptions, a phylogenetic family of transporters includes members that function by a single transport mode and energy coupling mechanism, although a variety of substrates may be transported, sometimes with either inwardly or outwardly directed polarity. In this review, I provide cross-referencing of well-characterized constituent transporters according to (i) transport mode, (ii) energy coupling mechanism, (iii) phylogenetic grouping, and (iv) substrates transported. The structural features and distribution of recognized family members throughout the living world are also evaluated. The tabulations should facilitate familial and functional

  6. Factors shaping bacterial phylogenetic and functional diversity in coastal waters of the NW Mediterranean Sea

    NASA Astrophysics Data System (ADS)

    Boras, Julia A.; Vaqué, Dolors; Maynou, Francesc; Sà, Elisabet L.; Weinbauer, Markus G.; Sala, Maria Montserrat

    2015-03-01

    To evaluate the main factors shaping bacterioplankton phylogenetic and functional diversity in marine coastal waters, we carried out a two-year study based on a monthly sampling in Blanes Bay (NW Mediterranean). We expected the key factors driving bacterial diversity to be (1) temperature and nutrient concentration, together with chlorophyll a concentration as an indicator of phytoplankton biomass and, hence, a carbon source for bacteria (here called bottom-up factors), and (2) top-down pressure (virus- and protist-mediated mortality of bacteria). Phylogenetic diversity was analyzed by denaturing gradient gel electrophoresis (DGGE) of 16S rRNA. Functional diversity was assessed by using monomeric carbon sources in Biolog EcoPlates and by determining the activity of six extracellular enzymes. Our results indicate that the bacterial phylogenetic and functional diversity in this coastal system is shaped mainly by bottom-up factors. A dendrogram analysis of the DGGE banding patterns revealed three main sample clusters. Two clusters differed significantly in temperature, nitrate and chlorophyll a concentration, and the third was characterized by the highest losses of bacterial production due to viral lysis detected over the whole study period. Protistan grazing had no effect on bacterial functional diversity, since there were no correlations between protist-mediated mortality (PMM) and extracellular enzyme activities, and utilization of only two out of the 31 carbon sources (N-acetyl-D-glucosamine and α-cyclodextrin) was correlated with PMM. In contrast, virus-mediated mortality correlated with changes in the percentage of use of four carbon sources, and also with specific leu-aminopeptidase and β-glucosidase activity. This suggests that viral lysate provides a pool of labile carbon sources, presumably including amino acids and glucose, which may inhibit proteolytic and glucosidic activity. Our results indicate that bottom-up factors play a more important role than

  7. Phylogenetic diversity, functional trait diversity and extinction: avoiding tipping points and worst-case losses.

    PubMed

    Faith, Daniel P

    2015-02-19

    The phylogenetic diversity measure, ('PD'), measures the relative feature diversity of different subsets of taxa from a phylogeny. At the level of feature diversity, PD supports the broad goal of biodiversity conservation to maintain living variation and option values. PD calculations at the level of lineages and features include those integrating probabilities of extinction, providing estimates of expected PD. This approach has known advantages over the evolutionarily distinct and globally endangered (EDGE) methods. Expected PD methods also have limitations. An alternative notion of expected diversity, expected functional trait diversity, relies on an alternative non-phylogenetic model and allows inferences of diversity at the level of functional traits. Expected PD also faces challenges in helping to address phylogenetic tipping points and worst-case PD losses. Expected PD may not choose conservation options that best avoid worst-case losses of long branches from the tree of life. We can expand the range of useful calculations based on expected PD, including methods for identifying phylogenetic key biodiversity areas. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  8. Phylogenetic diversity, functional trait diversity and extinction: avoiding tipping points and worst-case losses

    PubMed Central

    Faith, Daniel P.

    2015-01-01

    The phylogenetic diversity measure, (‘PD’), measures the relative feature diversity of different subsets of taxa from a phylogeny. At the level of feature diversity, PD supports the broad goal of biodiversity conservation to maintain living variation and option values. PD calculations at the level of lineages and features include those integrating probabilities of extinction, providing estimates of expected PD. This approach has known advantages over the evolutionarily distinct and globally endangered (EDGE) methods. Expected PD methods also have limitations. An alternative notion of expected diversity, expected functional trait diversity, relies on an alternative non-phylogenetic model and allows inferences of diversity at the level of functional traits. Expected PD also faces challenges in helping to address phylogenetic tipping points and worst-case PD losses. Expected PD may not choose conservation options that best avoid worst-case losses of long branches from the tree of life. We can expand the range of useful calculations based on expected PD, including methods for identifying phylogenetic key biodiversity areas. PMID:25561672

  9. Paralogues of nuclear ribosomal genes conceal phylogenetic signals within the invasive Asian fish tapeworm lineage: evidence from next generation sequencing data.

    PubMed

    Brabec, Jan; Kuchta, Roman; Scholz, Tomáš; Littlewood, D Timothy J

    2016-08-01

    Complete mitochondrial genomes and nuclear rRNA operons of eight geographically distinct isolates of the Asian fish tapeworm Schyzocotyle acheilognathi (syn. Bothriocephalus acheilognathi), representing the parasite's global diversity spanning four continents, were fully characterised using an Illumina sequencing platform. This cestode species represents an extreme example of a highly invasive, globally distributed pathogen of veterinary importance with exceptionally low host specificity unseen elsewhere within the parasitic flatworms. In addition to eight specimens of S. acheilognathi, we fully characterised its closest known relative and the only congeneric species, Schyzocotyle nayarensis, from cyprinids in the Indian subcontinent. Since previous nucleotide sequence data on the Asian fish tapeworm were restricted to a single molecular locus of questionable phylogenetic utility-the nuclear rRNA genes-separating internal transcribed spacers-the mitogenomic data presented here offer a unique opportunity to gain the first detailed insights into both the intraspecific phylogenetic relationships and population genetic structure of the parasite, providing key baseline information for future research in the field. Additionally, we identify a previously unnoticed source of error and demonstrate the limited utility of the nuclear rRNA sequences, including the internal transcribed spacers that has likely misled most of the previous molecular phylogenetic and population genetic estimates on the Asian fish tapeworm. Copyright © 2016 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.

  10. Phylogenetic Systematics, Biogeography, and Ecology of the Electric Fish Genus Brachyhypopomus (Ostariophysi: Gymnotiformes)

    PubMed Central

    de Santana, Carlos David; Waddell, Joseph C.; Lovejoy, Nathan R.

    2016-01-01

    A species-level phylogenetic reconstruction of the Neotropical bluntnose knifefish genus Brachyhypopomus (Gymnotiformes, Hypopomidae) is presented, based on 60 morphological characters, approximately 1100 base pairs of the mitochondrial cytb gene, and approximately 1000 base pairs of the nuclear rag2 gene. The phylogeny includes 28 species of Brachyhypopomus and nine outgroup species from nine other gymnotiform genera, including seven in the superfamily Rhamphichthyoidea (Hypopomidae and Rhamphichthyidae). Parsimony and Bayesian total evidence phylogenetic analyses confirm the monophyly of the genus, and identify nine robust species groups. Homoplastic osteological characters associated with diminutive body size and occurrence in small stream habitats, including loss of squamation and simplifications of the skeleton, appear to mislead a phylogenetic analysis based on morphological characters alone–resulting in the incorrect placing of Microsternarchus + Racenisia in a position deeply nested within Brachyhypopomus. Consideration of geographical distribution in light of the total evidence phylogeny indicates an origin for Brachyhypopomus in Greater Amazonia (the superbasin comprising the Amazon, Orinoco and major Guiana drainages), with subsequent dispersal and vicariance in peripheral basins, including the La Plata, the São Francisco, and trans-Andean basins of northwest South America and Central America. The ancestral habitat of Brachyhypopomus likely resembled the normoxic, low-conductivity terra firme stream system occupied by many extant species, and the genus has subsequently occupied a wide range of terra firme and floodplain habitats including low- and high-conductivity systems, and normoxic and hypoxic systems. Adaptations for impedance matching to high conductivity, and/or for air breathing in hypoxic systems have attended these habitat transitions. Several species of Brachyhypopomus are eurytopic with respect to habitat occupancy and these generally

  11. The Probability of a Gene Tree Topology within a Phylogenetic Network with Applications to Hybridization Detection

    PubMed Central

    Yu, Yun; Degnan, James H.; Nakhleh, Luay

    2012-01-01

    Gene tree topologies have proven a powerful data source for various tasks, including species tree inference and species delimitation. Consequently, methods for computing probabilities of gene trees within species trees have been developed and widely used in probabilistic inference frameworks. All these methods assume an underlying multispecies coalescent model. However, when reticulate evolutionary events such as hybridization occur, these methods are inadequate, as they do not account for such events. Methods that account for both hybridization and deep coalescence in computing the probability of a gene tree topology currently exist for very limited cases. However, no such methods exist for general cases, owing primarily to the fact that it is currently unknown how to compute the probability of a gene tree topology within the branches of a phylogenetic network. Here we present a novel method for computing the probability of gene tree topologies on phylogenetic networks and demonstrate its application to the inference of hybridization in the presence of incomplete lineage sorting. We reanalyze a Saccharomyces species data set for which multiple analyses had converged on a species tree candidate. Using our method, though, we show that an evolutionary hypothesis involving hybridization in this group has better support than one of strict divergence. A similar reanalysis on a group of three Drosophila species shows that the data is consistent with hybridization. Further, using extensive simulation studies, we demonstrate the power of gene tree topologies at obtaining accurate estimates of branch lengths and hybridization probabilities of a given phylogenetic network. Finally, we discuss identifiability issues with detecting hybridization, particularly in cases that involve extinction or incomplete sampling of taxa. PMID:22536161

  12. Phylogenetic lineages in the Botryosphaeriales: a systematic and evolutionary framework

    PubMed Central

    Slippers, B.; Boissin, E.; Phillips, A.J.L.; Groenewald, J.Z.; Lombard, L.; Wingfield, M.J.; Postma, A.; Burgess, T.; Crous, P.W.

    2013-01-01

    The order Botryosphaeriales represents several ecologically diverse fungal families that are commonly isolated as endophytes or pathogens from various woody hosts. The taxonomy of members of this order has been strongly influenced by sequence-based phylogenetics, and the abandonment of dual nomenclature. In this study, the phylogenetic relationships of the genera known from culture are evaluated based on DNA sequence data for six loci (SSU, LSU, ITS, EF1, BT, mtSSU). The results make it possible to recognise a total of six families. Other than the Botryosphaeriaceae (17 genera), Phyllostictaceae (Phyllosticta) and Planistromellaceae (Kellermania), newly introduced families include Aplosporellaceae (Aplosporella and Bagnisiella), Melanopsaceae (Melanops), and Saccharataceae (Saccharata). Furthermore, the evolution of morphological characters in the Botryosphaeriaceae were investigated via analysis of phylogeny-trait association. None of the traits presented a significant phylogenetic signal, suggesting that conidial and ascospore pigmentation, septation and appendages evolved more than once in the family. Molecular clock dating on radiations within the Botryosphaeriales based on estimated mutation rates of the rDNA SSU locus, suggests that the order originated in the Cretaceous period around 103 (45-188) mya, with most of the diversification in the Tertiary period. This coincides with important periods of radiation and spread of the main group of plants that these fungi infect, namely woody Angiosperms. The resulting host-associations and distribution could have influenced the diversification of these fungi. Taxonomic novelties: New families - Aplosporellaceae Slippers, Boissin & Crous, Melanopsaceae Phillips, Slippers, Boissin & Crous, Saccharataceae Slippers, Boissin & Crous. PMID:24302789

  13. Effects of Phylogenetic Tree Style on Student Comprehension

    NASA Astrophysics Data System (ADS)

    Dees, Jonathan Andrew

    Phylogenetic trees are powerful tools of evolutionary biology that have become prominent across the life sciences. Consequently, learning to interpret and reason from phylogenetic trees is now an essential component of biology education. However, students often struggle to understand these diagrams, even after explicit instruction. One factor that has been observed to affect student understanding of phylogenetic trees is style (i.e., diagonal or bracket). The goal of this dissertation research was to systematically explore effects of style on student interpretations and construction of phylogenetic trees in the context of an introductory biology course. Before instruction, students were significantly more accurate with bracket phylogenetic trees for a variety of interpretation and construction tasks. Explicit instruction that balanced the use of diagonal and bracket phylogenetic trees mitigated some, but not all, style effects. After instruction, students were significantly more accurate for interpretation tasks involving taxa relatedness and construction exercises when using the bracket style. Based on this dissertation research and prior studies on style effects, I advocate for introductory biology instructors to use only the bracket style. Future research should examine causes of style effects and variables other than style to inform the development of research-based instruction that best supports student understanding of phylogenetic trees.

  14. Combining Ordinary Kriging with wind directions to identify sources of industrial odors in Portland, Oregon.

    PubMed

    Eckmann, Ted C; Wright, Samantha G; Simpson, Logan K; Walker, Joe L; Kolmes, Steven A; Houck, James E; Velasquez, Sandra C

    2018-01-01

    This study combines Ordinary Kriging, odor monitoring, and wind direction data to demonstrate how these elements can be applied to identify the source of an industrial odor. The specific case study used as an example of how to address this issue was the University Park neighborhood of Portland, Oregon (USA) where residents frequently complain about industrial odors, and suspect the main source to be a nearby Daimler Trucks North America LLC manufacturing plant. We collected 19,665 odor observations plus 105,120 wind measurements, using an automated weather station to measure winds in the area at five-minute intervals, logging continuously from December 2014 through November 2015, while we also measured odors at 19 locations, three times per day, using methods from the American Society of the International Association for Testing and Materials. Our results quantify how winds vary with season and time of day when industrial odors were observed versus when they were not observed, while also mapping spatiotemporal patterns in these odors using Ordinary Kriging. Our analyses show that industrial odors were detected most frequently to the northwest of the Daimler plant, mostly when winds blew from the southeast, suggesting Daimler's facility is a likely source for much of this odor.

  15. Combining Ordinary Kriging with wind directions to identify sources of industrial odors in Portland, Oregon

    PubMed Central

    Kolmes, Steven A.; Houck, James E.; Velasquez, Sandra C.

    2018-01-01

    This study combines Ordinary Kriging, odor monitoring, and wind direction data to demonstrate how these elements can be applied to identify the source of an industrial odor. The specific case study used as an example of how to address this issue was the University Park neighborhood of Portland, Oregon (USA) where residents frequently complain about industrial odors, and suspect the main source to be a nearby Daimler Trucks North America LLC manufacturing plant. We collected 19,665 odor observations plus 105,120 wind measurements, using an automated weather station to measure winds in the area at five-minute intervals, logging continuously from December 2014 through November 2015, while we also measured odors at 19 locations, three times per day, using methods from the American Society of the International Association for Testing and Materials. Our results quantify how winds vary with season and time of day when industrial odors were observed versus when they were not observed, while also mapping spatiotemporal patterns in these odors using Ordinary Kriging. Our analyses show that industrial odors were detected most frequently to the northwest of the Daimler plant, mostly when winds blew from the southeast, suggesting Daimler’s facility is a likely source for much of this odor. PMID:29385136

  16. Genome-wide association study identifies 74 loci associated with educational attainment

    PubMed Central

    Okbay, Aysu; Beauchamp, Jonathan P.; Fontana, Mark A.; Lee, James J.; Pers, Tune H.; Rietveld, Cornelius A.; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S. Fleur W.; Oskarsson, Sven; Pickrell, Joseph K.; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S.; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H.; Concas, Maria Pina; Derringer, Jaime; Furlotte, Nicholas A.; Galesloot, Tessel E.; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M.; Harris, Sarah E.; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E.; Kaasik, Kadri; Kalafati, Ioanna P.; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J.; de Leeuw, Christiaan; Lind, Penelope A.; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B.; van der Most, Peter J.; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J.; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E.; Shi, Jianxin; Smith, Albert V.; Poot, Raymond A.; Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z.; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E.; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A.; Campbell, Harry; Cappuccio, Francesco P.; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans68, David M.; Faul, Jessica D.; Feitosa, Mary F.; Forstner, Andreas J.; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V.; Harris, Tamara B.; Heath, Andrew C.; Hocking, Lynne J.; Holliday, Elizabeth G.; Homuth, Georg; Horan, Michael A.; Hottenga, Jouke-Jan; de Jager, Philip L.; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika A.; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A.L.M.; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T.; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J.; Lebreton, Maël P.; Levinson, Douglas F.; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.M.; Loukola, Anu; Madden, Pamela A.; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E.; Marques-Vidal, Pedro; Meddens, Gerardus A.; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W.; Myhre, Ronny; Nelson, Christopher P.; Nyholt, Dale R.; Ollier, William E.R.; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L.; Petrovic, Katja E.; Porteous, David J.; Räikkönen, Katri; Ring, Susan M.; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R.; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J.; Smith, Blair H.; Smith, Jennifer A.; Staessen, Jan A.; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D.; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J.A.; Venturini, Cristina; Vinkhuyzen, Anna A.E.; Völker, Uwe; Völzke, Henry; Vonk, Judith M.; Vozzi, Diego; Waage, Johannes; Ware, Erin B.; Willemsen, Gonneke; Attia, John R.; Bennett, David A.; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I.; Borecki, Ingrid B.; Bultmann, Ute; Chabris, Christopher F.; Cucca, Francesco; Cusi, Daniele; Deary, Ian J.; Dedoussis, George V.; van Duijn, Cornelia M.; Eriksson, Johan G.; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V.; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J.F.; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A.; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G.; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L.R.; Lehtimäki, Terho; Lehrer, Steven F.; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W.J.H.; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A.; Samani, Nilesh J.; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I.A.; Spector, Tim D.; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tiemeier, Henning; Tung, Joyce Y.; Uitterlinden, André G.; Vitart, Veronique; Vollenweider, Peter; Weir, David R.; Wilson, James F.; Wright, Alan F.; Conley, Dalton C.; Krueger, Robert F.; Smith, George Davey; Hofman, Albert; Laibson, David I.; Medland, Sarah E.; Meyer, Michelle N.; Yang, Jian; Johannesson, Magnus; Visscher, Peter M.; Esko, Tõnu; Koellinger, Philipp D.; Cesarini, David; Benjamin, Daniel J.

    2016-01-01

    Summary Educational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease. PMID:27225129

  17. Genome-wide association study identifies 74 loci associated with educational attainment.

    PubMed

    Okbay, Aysu; Beauchamp, Jonathan P; Fontana, Mark Alan; Lee, James J; Pers, Tune H; Rietveld, Cornelius A; Turley, Patrick; Chen, Guo-Bo; Emilsson, Valur; Meddens, S Fleur W; Oskarsson, Sven; Pickrell, Joseph K; Thom, Kevin; Timshel, Pascal; de Vlaming, Ronald; Abdellaoui, Abdel; Ahluwalia, Tarunveer S; Bacelis, Jonas; Baumbach, Clemens; Bjornsdottir, Gyda; Brandsma, Johannes H; Pina Concas, Maria; Derringer, Jaime; Furlotte, Nicholas A; Galesloot, Tessel E; Girotto, Giorgia; Gupta, Richa; Hall, Leanne M; Harris, Sarah E; Hofer, Edith; Horikoshi, Momoko; Huffman, Jennifer E; Kaasik, Kadri; Kalafati, Ioanna P; Karlsson, Robert; Kong, Augustine; Lahti, Jari; van der Lee, Sven J; deLeeuw, Christiaan; Lind, Penelope A; Lindgren, Karl-Oskar; Liu, Tian; Mangino, Massimo; Marten, Jonathan; Mihailov, Evelin; Miller, Michael B; van der Most, Peter J; Oldmeadow, Christopher; Payton, Antony; Pervjakova, Natalia; Peyrot, Wouter J; Qian, Yong; Raitakari, Olli; Rueedi, Rico; Salvi, Erika; Schmidt, Börge; Schraut, Katharina E; Shi, Jianxin; Smith, Albert V; Poot, Raymond A; St Pourcain, Beate; Teumer, Alexander; Thorleifsson, Gudmar; Verweij, Niek; Vuckovic, Dragana; Wellmann, Juergen; Westra, Harm-Jan; Yang, Jingyun; Zhao, Wei; Zhu, Zhihong; Alizadeh, Behrooz Z; Amin, Najaf; Bakshi, Andrew; Baumeister, Sebastian E; Biino, Ginevra; Bønnelykke, Klaus; Boyle, Patricia A; Campbell, Harry; Cappuccio, Francesco P; Davies, Gail; De Neve, Jan-Emmanuel; Deloukas, Panos; Demuth, Ilja; Ding, Jun; Eibich, Peter; Eisele, Lewin; Eklund, Niina; Evans, David M; Faul, Jessica D; Feitosa, Mary F; Forstner, Andreas J; Gandin, Ilaria; Gunnarsson, Bjarni; Halldórsson, Bjarni V; Harris, Tamara B; Heath, Andrew C; Hocking, Lynne J; Holliday, Elizabeth G; Homuth, Georg; Horan, Michael A; Hottenga, Jouke-Jan; de Jager, Philip L; Joshi, Peter K; Jugessur, Astanand; Kaakinen, Marika A; Kähönen, Mika; Kanoni, Stavroula; Keltigangas-Järvinen, Liisa; Kiemeney, Lambertus A L M; Kolcic, Ivana; Koskinen, Seppo; Kraja, Aldi T; Kroh, Martin; Kutalik, Zoltan; Latvala, Antti; Launer, Lenore J; Lebreton, Maël P; Levinson, Douglas F; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C M; Loukola, Anu; Madden, Pamela A; Mägi, Reedik; Mäki-Opas, Tomi; Marioni, Riccardo E; Marques-Vidal, Pedro; Meddens, Gerardus A; McMahon, George; Meisinger, Christa; Meitinger, Thomas; Milaneschi, Yusplitri; Milani, Lili; Montgomery, Grant W; Myhre, Ronny; Nelson, Christopher P; Nyholt, Dale R; Ollier, William E R; Palotie, Aarno; Paternoster, Lavinia; Pedersen, Nancy L; Petrovic, Katja E; Porteous, David J; Räikkönen, Katri; Ring, Susan M; Robino, Antonietta; Rostapshova, Olga; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sanders, Alan R; Sarin, Antti-Pekka; Schmidt, Helena; Scott, Rodney J; Smith, Blair H; Smith, Jennifer A; Staessen, Jan A; Steinhagen-Thiessen, Elisabeth; Strauch, Konstantin; Terracciano, Antonio; Tobin, Martin D; Ulivi, Sheila; Vaccargiu, Simona; Quaye, Lydia; van Rooij, Frank J A; Venturini, Cristina; Vinkhuyzen, Anna A E; Völker, Uwe; Völzke, Henry; Vonk, Judith M; Vozzi, Diego; Waage, Johannes; Ware, Erin B; Willemsen, Gonneke; Attia, John R; Bennett, David A; Berger, Klaus; Bertram, Lars; Bisgaard, Hans; Boomsma, Dorret I; Borecki, Ingrid B; Bültmann, Ute; Chabris, Christopher F; Cucca, Francesco; Cusi, Daniele; Deary, Ian J; Dedoussis, George V; van Duijn, Cornelia M; Eriksson, Johan G; Franke, Barbara; Franke, Lude; Gasparini, Paolo; Gejman, Pablo V; Gieger, Christian; Grabe, Hans-Jörgen; Gratten, Jacob; Groenen, Patrick J F; Gudnason, Vilmundur; van der Harst, Pim; Hayward, Caroline; Hinds, David A; Hoffmann, Wolfgang; Hyppönen, Elina; Iacono, William G; Jacobsson, Bo; Järvelin, Marjo-Riitta; Jöckel, Karl-Heinz; Kaprio, Jaakko; Kardia, Sharon L R; Lehtimäki, Terho; Lehrer, Steven F; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; Metspalu, Andres; Pendleton, Neil; Penninx, Brenda W J H; Perola, Markus; Pirastu, Nicola; Pirastu, Mario; Polasek, Ozren; Posthuma, Danielle; Power, Christine; Province, Michael A; Samani, Nilesh J; Schlessinger, David; Schmidt, Reinhold; Sørensen, Thorkild I A; Spector, Tim D; Stefansson, Kari; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tiemeier, Henning; Tung, Joyce Y; Uitterlinden, André G; Vitart, Veronique; Vollenweider, Peter; Weir, David R; Wilson, James F; Wright, Alan F; Conley, Dalton C; Krueger, Robert F; Davey Smith, George; Hofman, Albert; Laibson, David I; Medland, Sarah E; Meyer, Michelle N; Yang, Jian; Johannesson, Magnus; Visscher, Peter M; Esko, Tõnu; Koellinger, Philipp D; Cesarini, David; Benjamin, Daniel J

    2016-05-26

    Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 genome-wide significant loci associated with the number of years of schooling completed. Single-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric diseases.

  18. Phylogenetic patterns and the adaptive evolution of osmoregulation in fiddler crabs (Brachyura, Uca)

    PubMed Central

    Faria, Samuel Coelho; Provete, Diogo Borges; Thurman, Carl Leo

    2017-01-01

    Salinity is the primary driver of osmoregulatory evolution in decapods, and may have influenced their diversification into different osmotic niches. In semi-terrestrial crabs, hyper-osmoregulatory ability favors sojourns into burrows and dilute media, and provides a safeguard against hemolymph dilution; hypo-osmoregulatory ability underlies emersion capability and a life more removed from water sources. However, most comparative studies have neglected the roles of the phylogenetic and environmental components of inter-specific physiological variation, hindering evaluation of phylogenetic patterns and the adaptive nature of osmoregulatory evolution. Semi-terrestrial fiddler crabs (Uca) inhabit fresh to hyper-saline waters, with species from the Americas occupying higher intertidal habitats than Indo-west Pacific species mainly found in the low intertidal zone. Here, we characterize numerous osmoregulatory traits in all ten fiddler crabs found along the Atlantic coast of Brazil, and we employ phylogenetic comparative methods using 24 species to test for: (i) similarities of osmoregulatory ability among closely related species; (ii) salinity as a driver of osmoregulatory evolution; (iii) correlation between salt uptake and secretion; and (iv) adaptive peaks in osmoregulatory ability in the high intertidal American lineages. Our findings reveal that osmoregulation in Uca exhibits strong phylogenetic patterns in salt uptake traits. Salinity does not correlate with hyper/hypo-regulatory abilities, but drives hemolymph osmolality at ambient salinities. Osmoregulatory traits have evolved towards three adaptive peaks, revealing a significant contribution of hyper/hypo-regulatory ability in the American clades. Thus, during the evolutionary history of fiddler crabs, salinity has driven some of the osmoregulatory transformations that underpin habitat diversification, although others are apparently constrained phylogenetically. PMID:28182764

  19. A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

    PubMed

    Shen, Xing-Xing; Salichos, Leonidas; Rokas, Antonis

    2016-09-02

    Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based, and 9 tree-based) with each other and with three measures of phylogenetic signal in two assembled data sets of 2,832 yeast and 2,002 mammalian genes. We found that most gene properties, such as evolutionary rate (measured through the percent average of pairwise identity across taxa) and total tree length, were highly correlated with each other. Similarly, several gene properties, such as gene alignment length, Guanine-Cytosine content, and the proportion of tree distance on internal branches divided by relative composition variability (treeness/RCV), were strongly correlated with phylogenetic signal. Analysis of partial correlations between gene properties and phylogenetic signal in which gene evolutionary rate and alignment length were simultaneously controlled, showed similar patterns of correlations, albeit weaker in strength. Examination of the relative importance of each gene property on phylogenetic signal identified gene alignment length, alongside with number of parsimony-informative sites and variable sites, as the most important predictors. Interestingly, the subsets of gene properties that optimally predicted phylogenetic signal differed considerably across our three phylogenetic measures and two data sets; however, gene alignment length and RCV were consistently included as predictors of all three phylogenetic measures in both yeasts and mammals. These results suggest that a handful of sequence-based gene properties are reliable predictors of phylogenetic signal

  20. Phylogenetically informed logic relationships improve detection of biological network organization

    PubMed Central

    2011-01-01

    Background A "phylogenetic profile" refers to the presence or absence of a gene across a set of organisms, and it has been proven valuable for understanding gene functional relationships and network organization. Despite this success, few studies have attempted to search beyond just pairwise relationships among genes. Here we search for logic relationships involving three genes, and explore its potential application in gene network analyses. Results Taking advantage of a phylogenetic matrix constructed from the large orthologs database Roundup, we invented a method to create balanced profiles for individual triplets of genes that guarantee equal weight on the different phylogenetic scenarios of coevolution between genes. When we applied this idea to LAPP, the method to search for logic triplets of genes, the balanced profiles resulted in significant performance improvement and the discovery of hundreds of thousands more putative triplets than unadjusted profiles. We found that logic triplets detected biological network organization and identified key proteins and their functions, ranging from neighbouring proteins in local pathways, to well separated proteins in the whole pathway, and to the interactions among different pathways at the system level. Finally, our case study suggested that the directionality in a logic relationship and the profile of a triplet could disclose the connectivity between the triplet and surrounding networks. Conclusion Balanced profiles are superior to the raw profiles employed by traditional methods of phylogenetic profiling in searching for high order gene sets. Gene triplets can provide valuable information in detection of biological network organization and identification of key genes at different levels of cellular interaction. PMID:22172058

  1. High School Students' Learning and Perceptions of Phylogenetics of Flowering Plants

    ERIC Educational Resources Information Center

    Bokor, Julie R.; Landis, Jacob B.; Crippen, Kent J.

    2014-01-01

    Basic phylogenetics and associated "tree thinking" are often minimized or excluded in formal school curricula. Informal settings provide an opportunity to extend the K-12 school curriculum, introducing learners to new ideas, piquing interest in science, and fostering scientific literacy. Similarly, university researchers participating in…

  2. Genome-wide association studies to identify rice salt-tolerance markers.

    PubMed

    Patishtan, Juan; Hartley, Tom N; Fonseca de Carvalho, Raquel; Maathuis, Frans J M

    2018-05-01

    Salinity is an ever increasing menace that affects agriculture worldwide. Crops such as rice are salt sensitive, but its degree of susceptibility varies widely between cultivars pointing to extensive genetic diversity that can be exploited to identify genes and proteins that are relevant in the response of rice to salt stress. We used a diversity panel of 306 rice accessions and collected phenotypic data after short (6 h), medium (7 d) and long (30 d) salinity treatment (50 mm NaCl). A genome-wide association study (GWAS) was subsequently performed, which identified around 1200 candidate genes from many functional categories, but this was treatment period dependent. Further analysis showed the presence of cation transporters and transcription factors with a known role in salinity tolerance and those that hitherto were not known to be involved in salt stress. Localization analysis of single nucleotide polymorphisms (SNPs) showed the presence of several hundred non-synonymous SNPs (nsSNPs) in coding regions and earmarked specific genomic regions with increased numbers of nsSNPs. It points to components of the ubiquitination pathway as important sources of genetic diversity that could underpin phenotypic variation in stress tolerance. © 2017 John Wiley & Sons Ltd.

  3. The ovary structure and oogenesis in the basal crustaceans and hexapods. Possible phylogenetic significance.

    PubMed

    Jaglarz, Mariusz K; Kubrakiewicz, Janusz; Bilinski, Szczepan M

    2014-07-01

    Recent large-scale phylogenetic analyses of exclusively molecular or combined molecular and morphological characters support a close relationship between Crustacea and Hexapoda. The growing consensus on this phylogenetic link is reflected in uniting both taxa under the name Pancrustacea or Tetraconata. Several recent molecular phylogenies have also indicated that the monophyletic hexapods should be nested within paraphyletic crustaceans. However, it is still contentious exactly which crustacean taxon is the sister group to Hexapoda. Among the favored candidates are Branchiopoda, Malacostraca, Remipedia and Xenocarida (Remipedia + Cephalocarida). In this context, we review morphological and ultrastructural features of the ovary architecture and oogenesis in these crustacean groups in search of traits potentially suitable for phylogenetic considerations. We have identified a suite of morphological characters which may prove useful in further comparative studies. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Phylogenetic characterization of Canine Parvovirus VP2 partial sequences from symptomatic dogs samples.

    PubMed

    Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A

    2016-01-01

    The aim of the present study was to detect canine parvovirus (CPV) from faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR) followed by VP2 gene partial sequencing and molecular characterization of circulating strains in Lithuania. Eleven clinically and antigen-tested positive dog faecal samples, collected during the period of 2014-2015, were investigated by using PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of investigated Lithuanian CPV VP2 sequences were closely associated with CPV 2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. The four of eleven investigated Lithuanian dogs with CPV infection symptoms were vaccinated with CPV-2 vaccine, but their VP2 sequences were phylogenetically distantly associated with CPV vaccine strains VP2 sequences (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences had monophyletic relations among the close geographically associated samples, but five of them were rather divergent (1.0% less sequence similarity). The one Lithuanian CPV VP2 sequence was closely related with CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conservative and phylogenetically low associated with most commonly used CPV vaccine strains.

  5. Phylogenetically conserved resource partitioning in the coastal microbial loop

    DOE PAGES

    Bryson, Samuel; Li, Zhou; Chavez, Francisco; ...

    2017-08-11

    Resource availability influences marine microbial community structure, suggesting that population-specific resource partitioning defines discrete niches. Identifying how resources are partitioned among populations, thereby characterizing functional guilds within the communities, remains a challenge for microbial ecologists. We used proteomic stable isotope probing (SIP) and NanoSIMS analysis of phylogenetic microarrays (Chip-SIP) along with 16S rRNA gene amplicon and metagenomic sequencing to characterize the assimilation of six 13C-labeled common metabolic substrates and changes in the microbial community structure within surface water collected from Monterey Bay, CA. Both sequencing approaches indicated distinct substrate-specific community shifts. However, observed changes in relative abundance for individual populationsmore » did not correlate well with directly measured substrate assimilation. The complementary SIP techniques identified assimilation of all six substrates by diverse taxa, but also revealed differential assimilation of substrates into protein and ribonucleotide biomass between taxa. Substrate assimilation trends indicated significantly conserved resource partitioning among populations within the Flavobacteriia, Alphaproteobacteria and Gammaproteobacteria classes, suggesting that functional guilds within marine microbial communities are phylogenetically cohesive. However, populations within these classes exhibited heterogeneity in biosynthetic activity, which distinguished high-activity copiotrophs from low-activity oligotrophs. These results indicate distinct growth responses between populations that is not apparent by genome sequencing alone.« less

  6. Phylogenetically conserved resource partitioning in the coastal microbial loop

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bryson, Samuel; Li, Zhou; Chavez, Francisco

    Resource availability influences marine microbial community structure, suggesting that population-specific resource partitioning defines discrete niches. Identifying how resources are partitioned among populations, thereby characterizing functional guilds within the communities, remains a challenge for microbial ecologists. We used proteomic stable isotope probing (SIP) and NanoSIMS analysis of phylogenetic microarrays (Chip-SIP) along with 16S rRNA gene amplicon and metagenomic sequencing to characterize the assimilation of six 13C-labeled common metabolic substrates and changes in the microbial community structure within surface water collected from Monterey Bay, CA. Both sequencing approaches indicated distinct substrate-specific community shifts. However, observed changes in relative abundance for individual populationsmore » did not correlate well with directly measured substrate assimilation. The complementary SIP techniques identified assimilation of all six substrates by diverse taxa, but also revealed differential assimilation of substrates into protein and ribonucleotide biomass between taxa. Substrate assimilation trends indicated significantly conserved resource partitioning among populations within the Flavobacteriia, Alphaproteobacteria and Gammaproteobacteria classes, suggesting that functional guilds within marine microbial communities are phylogenetically cohesive. However, populations within these classes exhibited heterogeneity in biosynthetic activity, which distinguished high-activity copiotrophs from low-activity oligotrophs. These results indicate distinct growth responses between populations that is not apparent by genome sequencing alone.« less

  7. Phylogenetically conserved resource partitioning in the coastal microbial loop

    PubMed Central

    Bryson, Samuel; Li, Zhou; Chavez, Francisco; Weber, Peter K; Pett-Ridge, Jennifer; Hettich, Robert L; Pan, Chongle; Mayali, Xavier; Mueller, Ryan S

    2017-01-01

    Resource availability influences marine microbial community structure, suggesting that population-specific resource partitioning defines discrete niches. Identifying how resources are partitioned among populations, thereby characterizing functional guilds within the communities, remains a challenge for microbial ecologists. We used proteomic stable isotope probing (SIP) and NanoSIMS analysis of phylogenetic microarrays (Chip-SIP) along with 16S rRNA gene amplicon and metagenomic sequencing to characterize the assimilation of six 13C-labeled common metabolic substrates and changes in the microbial community structure within surface water collected from Monterey Bay, CA. Both sequencing approaches indicated distinct substrate-specific community shifts. However, observed changes in relative abundance for individual populations did not correlate well with directly measured substrate assimilation. The complementary SIP techniques identified assimilation of all six substrates by diverse taxa, but also revealed differential assimilation of substrates into protein and ribonucleotide biomass between taxa. Substrate assimilation trends indicated significantly conserved resource partitioning among populations within the Flavobacteriia, Alphaproteobacteria and Gammaproteobacteria classes, suggesting that functional guilds within marine microbial communities are phylogenetically cohesive. However, populations within these classes exhibited heterogeneity in biosynthetic activity, which distinguished high-activity copiotrophs from low-activity oligotrophs. These results indicate distinct growth responses between populations that is not apparent by genome sequencing alone. PMID:28800138

  8. Using tree diversity to compare phylogenetic heuristics.

    PubMed

    Sul, Seung-Jin; Matthews, Suzanne; Williams, Tiffani L

    2009-04-29

    Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees-especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest.

  9. Identifying Genetic Sources of Phenotypic Heterogeneity in Orofacial Clefts by Targeted Sequencing.

    PubMed

    Carlson, Jenna C; Taub, Margaret A; Feingold, Eleanor; Beaty, Terri H; Murray, Jeffrey C; Marazita, Mary L; Leslie, Elizabeth J

    2017-07-17

    Orofacial clefts (OFCs), including nonsyndromic cleft lip with or without cleft palate (NSCL/P), are common birth defects. NSCL/P is highly heterogeneous with multiple phenotypic presentations. Two common subtypes of NSCL/P are cleft lip (CL) and cleft lip with cleft palate (CLP) which have different population prevalence. Similarly, NSCL/P can be divided into bilateral and unilateral clefts, with unilateral being the most common. Individuals with unilateral NSCL/P are more likely to be affected on the left side of the upper lip, but right side affection also occurs. Moreover, NSCL/P is twice as common in males as in females. The goal of this study is to discover genetic variants that have different effects in case subgroups. We conducted both common variant and rare variant analyses in 1034 individuals of Asian ancestry with NSCL/P, examining four sources of heterogeneity within CL/P: cleft type, sex, laterality, and side. We identified several regions associated with subtype differentiation: cleft type differences in 8q24 (p = 1.00 × 10 -4 ), laterality differences in IRF6, a gene previously implicated with wound healing (p = 2.166 × 10 -4 ), sex differences and side of unilateral CL differences in FGFR2 (p = 3.00 × 10 -4 ; p = 6.00 × 10 -4 ), and sex differences in VAX1 (p < 1.00 × 10 -4 ) among others. Many of the regions associated with phenotypic modification were either adjacent to or overlapping functional elements based on ENCODE chromatin marks and published craniofacial enhancers. We have identified multiple common and rare variants as potential phenotypic modifiers of NSCL/P, and suggest plausible elements responsible for phenotypic heterogeneity, further elucidating the complex genetic architecture of OFCs. Birth Defects Research 109:1030-1038, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  10. Effects of phylogenetic reconstruction method on the robustness of species delimitation using single-locus data

    PubMed Central

    Tang, Cuong Q; Humphreys, Aelys M; Fontaneto, Diego; Barraclough, Timothy G; Paradis, Emmanuel

    2014-01-01

    Coalescent-based species delimitation methods combine population genetic and phylogenetic theory to provide an objective means for delineating evolutionarily significant units of diversity. The generalised mixed Yule coalescent (GMYC) and the Poisson tree process (PTP) are methods that use ultrametric (GMYC or PTP) or non-ultrametric (PTP) gene trees as input, intended for use mostly with single-locus data such as DNA barcodes. Here, we assess how robust the GMYC and PTP are to different phylogenetic reconstruction and branch smoothing methods. We reconstruct over 400 ultrametric trees using up to 30 different combinations of phylogenetic and smoothing methods and perform over 2000 separate species delimitation analyses across 16 empirical data sets. We then assess how variable diversity estimates are, in terms of richness and identity, with respect to species delimitation, phylogenetic and smoothing methods. The PTP method generally generates diversity estimates that are more robust to different phylogenetic methods. The GMYC is more sensitive, but provides consistent estimates for BEAST trees. The lower consistency of GMYC estimates is likely a result of differences among gene trees introduced by the smoothing step. Unresolved nodes (real anomalies or methodological artefacts) affect both GMYC and PTP estimates, but have a greater effect on GMYC estimates. Branch smoothing is a difficult step and perhaps an underappreciated source of bias that may be widespread among studies of diversity and diversification. Nevertheless, careful choice of phylogenetic method does produce equivalent PTP and GMYC diversity estimates. We recommend simultaneous use of the PTP model with any model-based gene tree (e.g. RAxML) and GMYC approaches with BEAST trees for obtaining species hypotheses. PMID:25821577

  11. Phylogenetic evidence for cladogenetic polyploidization in land plants.

    PubMed

    Zhan, Shing H; Drori, Michal; Goldberg, Emma E; Otto, Sarah P; Mayrose, Itay

    2016-07-01

    Polyploidization is a common and recurring phenomenon in plants and is often thought to be a mechanism of "instant speciation". Whether polyploidization is associated with the formation of new species (cladogenesis) or simply occurs over time within a lineage (anagenesis), however, has never been assessed systematically. We tested this hypothesis using phylogenetic and karyotypic information from 235 plant genera (mostly angiosperms). We first constructed a large database of combined sequence and chromosome number data sets using an automated procedure. We then applied likelihood models (ClaSSE) that estimate the degree of synchronization between polyploidization and speciation events in maximum likelihood and Bayesian frameworks. Our maximum likelihood analysis indicated that 35 genera supported a model that includes cladogenetic transitions over a model with only anagenetic transitions, whereas three genera supported a model that incorporates anagenetic transitions over one with only cladogenetic transitions. Furthermore, the Bayesian analysis supported a preponderance of cladogenetic change in four genera but did not support a preponderance of anagenetic change in any genus. Overall, these phylogenetic analyses provide the first broad confirmation that polyploidization is temporally associated with speciation events, suggesting that it is indeed a major speciation mechanism in plants, at least in some genera. © 2016 Botanical Society of America.

  12. Borrelia sp. phylogenetically different from Lyme disease- and relapsing fever-related Borrelia spp. in Amblyomma varanense from Python reticulatus.

    PubMed

    Trinachartvanit, Wachareeporn; Hirunkanokpun, Supanee; Sudsangiem, Ronnayuth; Lijuan, Wanwisa; Boonkusol, Duangjai; Baimai, Visut; Ahantarig, Arunee

    2016-06-24

    Species of the genus Borrelia are causative agents of Lyme disease and relapsing fever. Lyme disease is the most commonly reported vector-borne disease in the northern hemisphere. However, in some parts of the world Lyme borreliosis and relapsing fever may be caused by novel Borrelia genotypes. Herein, we report the presence of a Borrelia sp. in an Amblyomma varanense collected from Python reticulatus. Ticks were collected from snakes, identified to species level and examined by PCR for the presence of Borrelia spp. flaB and 16S rRNA genes. Phylogenetic trees were constructed using the neighbour-joining method. Three A. varanense ticks collected from P. reticulatus were positive for a unique Borrelia sp., which was phylogenetically divergent from both Lyme disease- and relapsing fever-associated Borrelia spp. The results of this study suggest for the first time that there is a Borrelia sp. in A. varanense tick in the snake P. reticulatus that might be novel.

  13. Community phylogenetic diversity of cyanobacterial mats associated with geothermal springs along a tropical intertidal gradient.

    PubMed

    Jing, Hongmei; Lacap, Donnabella C; Lau, Chui Yim; Pointing, Stephen B

    2006-04-01

    The 16S rRNA gene-defined bacterial diversity of tropical intertidal geothermal vents subject to varying degrees of seawater inundation was investigated. Shannon-Weaver diversity estimates of clone library-derived sequences revealed that the hottest pools located above the mean high-water mark that did not experience seawater inundation were most diverse, followed by those that were permanently submerged below the mean low-water mark. Pools located in the intertidal were the least biodiverse, and this is attributed to the fluctuating conditions caused by periodic seawater inundation rather than physicochemical conditions per se. Phylogenetic analysis revealed that a ubiquitous Oscillatoria-like phylotype accounted for 83% of clones. Synechococcus-like phylotypes were also encountered at each location, whilst others belonging to the Chroococcales, Oscillatoriales, and other non-phototrophic bacteria occurred only at specific locations along the gradient. All cyanobacterial phylotypes displayed highest phylogenetic affinity to terrestrial thermophilic counterparts rather than marine taxa.

  14. Threat Diversity Will Erode Mammalian Phylogenetic Diversity in the Near Future

    PubMed Central

    Jono, Clémentine M. A.; Pavoine, Sandrine

    2012-01-01

    To reduce the accelerating rate of phylogenetic diversity loss, many studies have searched for mechanisms that could explain why certain species are at risk, whereas others are not. In particular, it has been demonstrated that species might be affected by both extrinsic threat factors as well as intrinsic biological traits that could render a species more sensitive to extinction; here, we focus on extrinsic factors. Recently, the International Union for Conservation of Nature developed a new classification of threat types, including climate change, urbanization, pollution, agriculture and aquaculture, and harvesting/hunting. We have used this new classification to analyze two main factors that could explain the expected future loss of mammalian phylogenetic diversity: 1. differences in the type of threats that affect mammals and 2. differences in the number of major threats that accumulate for a single species. Our results showed that Cetartiodactyla, Diprotodontia, Monotremata, Perissodactyla, Primates, and Proboscidea could lose a high proportion of their current phylogenetic diversity in the coming decades. In contrast, Chiroptera, Didelphimorphia, and Rodentia could lose less phylogenetic diversity than expected if extinctions were random. Some mammalian clades, including Marsupiala, Chiroptera, and a subclade of Primates, are affected by particular threat types, most likely due solely to their geographic locations and associations with particular habitats. However, regardless of the geography, habitat, and taxon considered, it is not the threat type, but the threat diversity that determines the extinction risk for species and clades. Thus, some mammals might be randomly located in areas subjected to a large diversity of threats; they might also accumulate detrimental traits that render them sensitive to different threats, which is a characteristic that could be associated with large body size. Any action reducing threat diversity is expected to have a

  15. Ecomorphology and phylogenetic risk: Implications for habitat reconstruction using fossil bovids.

    PubMed

    Scott, Robert S; Barr, W Andrew

    2014-08-01

    Reconstructions of paleohabitats are necessary aids in understanding hominin evolution. The morphology of species from relevant sites, understood in terms of functional relationships to habitat (termed ecomorphology), offers a direct link to habitat. Bovids are a speciose radiation that includes many habitat specialists and are abundant in the fossil record. Thus, bovids are extremely common in ecomorphological analyses. However, bovid phylogeny and habitat preference are related, which raises the possibility that analyses linking habitat with morphology are not 'taxon free' but 'taxon-dependent.' Here we analyze eight relative dimensions and one shape index of the metatarsal for a sample of 72 bovid species and one antilocaprid. The selected variables have been previously shown to have strong associations with habitat and to have functional explanations for these associations. Phylogenetic generalized least squares analyses of these variables, including habitat and size, resulted in estimates for the parameter lambda (used to model phylogenetic signal) varying from zero to one. Thus, while phylogeny, morphology, and habitat all march together among the bovids, the odds that phylogeny confounds ecomorphological analyses may vary depending on particular morphological characteristics. While large values of lambda do not necessarily indicate that habitat differences are unimportant drivers of morphology, we consider the low value of lambda for relative metatarsal width suggestive that conclusions about habitat built on observations of this particular morphology carry with them less 'phylogenetic risk.' We suggest that the way forward for ecomorphology is grounded in functionally relevant observations and careful consideration of phylogeny designed to bracket probable habitat preferences appropriately. Separate consideration of different morphological variables may help to determine the level of 'phylogenetic risk' attached to conclusions linking habitat and morphology

  16. Coping styles and its association with sources of stress in undergraduate medical students.

    PubMed

    Cherkil, Sandhya; Gardens, Seby J; Soman, Deepak Kuttikatt

    2013-10-01

    The two ubiquitous factors that have been identified in medical courses to underlie mental health are stress and different coping styles adopted to combat stress. To find the association between coping styles and stress in undergraduate medical students. A medical college in Central Kerala. A cross-sectional study design was adopted. Source and Severity of Stress Scale, Medical Student Version, was used to assess the source and nature of stress. Brief Cope was used to find out the coping styles adopted. The statistical analysis was done using Statistical Package for Social Sciences version 20 and SAS. Chi-square analysis was used to find the association between coping styles and stress domains and with the overall stress score. There is a significant positive association between overall stress score and coping styles (P=0.001) of 'Negative cope', 'Blame', and 'Humor'. 'Positive cope' and 'Religion' has significant positive association with 'Academics' (P=0.047) and 'self Expectations' (P=0.009). 'Blame' (P<0.001) has very high significant positive association with 'Academics', 'self expectation', and 'Relationships'. Very high significant positive association is further found between 'Humor' (P<0.001) and 'self expectations', 'Living conditions', and 'Health and Value conflict'. 'substance Use' is positively associated in high significance to 'Health and Value conflict' (P<0.001). The outcome of the study emphasizes the need for stress management techniques in the medical school.

  17. A phylogenetic transform enhances analysis of compositional microbiota data.

    PubMed

    Silverman, Justin D; Washburne, Alex D; Mukherjee, Sayan; David, Lawrence A

    2017-02-15

    Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities.

  18. Phylogenetic Diversity of NTT Nucleotide Transport Proteins in Free-Living and Parasitic Bacteria and Eukaryotes

    PubMed Central

    Major, Peter; Embley, T. Martin

    2017-01-01

    Plasma membrane-located nucleotide transport proteins (NTTs) underpin the lifestyle of important obligate intracellular bacterial and eukaryotic pathogens by importing energy and nucleotides from infected host cells that the pathogens can no longer make for themselves. As such their presence is often seen as a hallmark of an intracellular lifestyle associated with reductive genome evolution and loss of primary biosynthetic pathways. Here, we investigate the phylogenetic distribution of NTT sequences across the domains of cellular life. Our analysis reveals an unexpectedly broad distribution of NTT genes in both host-associated and free-living prokaryotes and eukaryotes. We also identify cases of within-bacteria and bacteria-to-eukaryote horizontal NTT transfer, including into the base of the oomycetes, a major clade of parasitic eukaryotes. In addition to identifying sequences that retain the canonical NTT structure, we detected NTT gene fusions with HEAT-repeat and cyclic nucleotide binding domains in Cyanobacteria, pathogenic Chlamydiae and Oomycetes. Our results suggest that NTTs are versatile functional modules with a much wider distribution and a broader range of potential roles than has previously been appreciated. PMID:28164241

  19. Phylogenetic Properties of RNA Viruses

    PubMed Central

    Pompei, Simone; Loreto, Vittorio; Tria, Francesca

    2012-01-01

    A new word, phylodynamics, was coined to emphasize the interconnection between phylogenetic properties, as observed for instance in a phylogenetic tree, and the epidemic dynamics of viruses, where selection, mediated by the host immune response, and transmission play a crucial role. The challenges faced when investigating the evolution of RNA viruses call for a virtuous loop of data collection, data analysis and modeling. This already resulted both in the collection of massive sequences databases and in the formulation of hypotheses on the main mechanisms driving qualitative differences observed in the (reconstructed) evolutionary patterns of different RNA viruses. Qualitatively, it has been observed that selection driven by the host immune response induces an uneven survival ability among co-existing strains. As a consequence, the imbalance level of the phylogenetic tree is manifestly more pronounced if compared to the case when the interaction with the host immune system does not play a central role in the evolutive dynamics. While many imbalance metrics have been introduced, reliable methods to discriminate in a quantitative way different level of imbalance are still lacking. In our work, we reconstruct and analyze the phylogenetic trees of six RNA viruses, with a special emphasis on the human Influenza A virus, due to its relevance for vaccine preparation as well as for the theoretical challenges it poses due to its peculiar evolutionary dynamics. We focus in particular on topological properties. We point out the limitation featured by standard imbalance metrics, and we introduce a new methodology with which we assign the correct imbalance level of the phylogenetic trees, in agreement with the phylodynamics of the viruses. Our thorough quantitative analysis allows for a deeper understanding of the evolutionary dynamics of the considered RNA viruses, which is crucial in order to provide a valuable framework for a quantitative assessment of theoretical

  20. Identifying Disease Associated miRNAs Based on Protein Domains.

    PubMed

    Qin, Gui-Min; Li, Rui-Yi; Zhao, Xing-Ming

    2016-01-01

    MicroRNAs (miRNAs) are a class of small endogenous non-coding genes, acting as regulators in the post-transcriptional processes. Recently, the miRNAs are found to be widely involved in different types of diseases. Therefore, the identification of disease associated miRNAs can help understand the mechanisms that underlie the disease and identify new biomarkers. However, it is not easy to identify the miRNAs related to diseases due to its extensive involvements in various biological processes. In this work, we present a new approach to identify disease associated miRNAs based on domains, the functional and structural blocks of proteins. The results on real datasets demonstrate that our method can effectively identify disease related miRNAs with high precision.

  1. Visualizing Phylogenetic Treespace Using Cartographic Projections

    NASA Astrophysics Data System (ADS)

    Sundberg, Kenneth; Clement, Mark; Snell, Quinn

    Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-Hard problem. The size of the data sets which can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could lead to the feasible analysis of more data sets. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger datasets.

  2. Relating phylogenetic trees to transmission trees of infectious disease outbreaks.

    PubMed

    Ypma, Rolf J F; van Ballegooijen, W Marijn; Wallinga, Jacco

    2013-11-01

    Transmission events are the fundamental building blocks of the dynamics of any infectious disease. Much about the epidemiology of a disease can be learned when these individual transmission events are known or can be estimated. Such estimations are difficult and generally feasible only when detailed epidemiological data are available. The genealogy estimated from genetic sequences of sampled pathogens is another rich source of information on transmission history. Optimal inference of transmission events calls for the combination of genetic data and epidemiological data into one joint analysis. A key difficulty is that the transmission tree, which describes the transmission events between infected hosts, differs from the phylogenetic tree, which describes the ancestral relationships between pathogens sampled from these hosts. The trees differ both in timing of the internal nodes and in topology. These differences become more pronounced when a higher fraction of infected hosts is sampled. We show how the phylogenetic tree of sampled pathogens is related to the transmission tree of an outbreak of an infectious disease, by the within-host dynamics of pathogens. We provide a statistical framework to infer key epidemiological and mutational parameters by simultaneously estimating the phylogenetic tree and the transmission tree. We test the approach using simulations and illustrate its use on an outbreak of foot-and-mouth disease. The approach unifies existing methods in the emerging field of phylodynamics with transmission tree reconstruction methods that are used in infectious disease epidemiology.

  3. Seroepidemiology and phylogenetic characterisation of measles virus in Ireland, 2004-2013.

    PubMed

    O' Riordan, Bernadette; Carr, Michael J; Connell, Jeff; Dunford, Linda; Hall, William W; Hassan, Jaythoon

    2014-08-01

    Ireland is classified as an area of high measles incidence. A World Health Organisation-European Region strategic plan exists for measles elimination by 2015. To retrospectively investigate measles outbreaks using all patient samples (sera and oral fluid) received for measles laboratory diagnosis and characterise the genetic diversity of circulating measles genotypes in Ireland. 704 cases of acute measles infection as determined by the presence of measles specific IgM in sera and oral fluids were confirmed at the National Virus Reference Laboratory. Measles positive samples (n=116) were examined by genotyping, sequence analysis and phylogenetic characterisation. Three measles outbreaks occurred over the study period: 2004, 2009/2010 and 2011. Measles IgM positivity ranged from 22-29% in outbreak years to 5-10% in the intervening years. Age profile analysis revealed that whereas individuals >10 years accounted for only 8% of cases in the 2004 outbreak, this increased to 33% and 29% in the 2009/2010 and 2011 outbreaks, respectively. The <1 year cohort accounted for 18-20% of cases in all outbreaks. Phylogenetic analysis demonstrated both indigenous transmission and also importation events. Clade D viruses were exclusively found circulating in Ireland, with autochthonous transmission of diverse genotype D4 strains associated with large outbreaks across Europe. More recently, genotype D8 was identified and these were associated with importation events. This study provides a comprehensive genetic analysis of circulating measles genotypes in Ireland and discriminated between indigenous and imported viral strains. Notably, an increase in laboratory-confirmed measles cases in the greater than 10 years of age group was seen over the study period. This information is valuable to inform vaccination strategies with a focus on those populations who remain susceptible to measles infection. Copyright © 2014 Elsevier B.V. All rights reserved.

  4. Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees.

    PubMed

    Martínez-Aquino, Andrés

    2016-08-01

    Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host-parasite. In the past 2 decades, algorithms have been developed for cophylogetenic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetical inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, the advances in phylogenetic biogeography applying methods based on parametric process models and combined Bayesian approaches, can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a "compass" when "walking" through jungles of tangled phylogenetic trees.

  5. Identifying organic aerosol sources by comparing functional group composition in chamber and atmospheric particles

    PubMed Central

    Russell, Lynn M.; Bahadur, Ranjit; Ziemann, Paul J.

    2011-01-01

    Measurements of submicron particles by Fourier transform infrared spectroscopy in 14 campaigns in North America, Asia, South America, and Europe were used to identify characteristic organic functional group compositions of fuel combustion, terrestrial vegetation, and ocean bubble bursting sources, each of which often accounts for more than a third of organic mass (OM), and some of which is secondary organic aerosol (SOA) from gas-phase precursors. The majority of the OM consists of alkane, carboxylic acid, hydroxyl, and carbonyl groups. The organic functional groups formed from combustion and vegetation emissions are similar to the secondary products identified in chamber studies. The near absence of carbonyl groups in the observed SOA associated with combustion is consistent with alkane rather than aromatic precursors, and the absence of organonitrate groups can be explained by their hydrolysis in humid ambient conditions. The remote forest observations have ratios of carboxylic acid, organic hydroxyl, and nonacid carbonyl groups similar to those observed for isoprene and monoterpene chamber studies, but in biogenic aerosols transported downwind of urban areas the formation of esters replaces the acid and hydroxyl groups and leaves only nonacid carbonyl groups. The carbonyl groups in SOA associated with vegetation emissions provides striking evidence for the mechanism of esterification as the pathway for possible oligomerization reactions in the atmosphere. Forest fires include biogenic emissions that produce SOA with organic components similar to isoprene and monoterpene chamber studies, also resulting in nonacid carbonyl groups in SOA. PMID:21317360

  6. Double-coronal X-Ray and Microwave Sources Associated with a Magnetic Breakout Solar Eruption

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Yao; Wu, Zhao; Zhao, Di

    Double-coronal hard X-ray (HXR) sources are believed to be critical observational evidence of bi-directional energy release through magnetic reconnection in large-scale current sheets in solar flares. Here, we present a study on double-coronal sources observed in both HXR and microwave regimes, revealing new characteristics distinct from earlier reports. This event is associated with a footpoint-occulted X1.3-class flare (2014 April 25, starting at 00:17 UT) and a coronal mass ejection that were likely triggered by the magnetic breakout process, with the lower source extending upward from the top of the partially occulted flare loops and the upper source co-incident with rapidlymore » squeezing-in side lobes (at a speed of ∼250 km s{sup −1} on both sides). The upper source can be identified at energies as high as 70–100 keV. The X-ray upper source is characterized by flux curves that differ from those of the lower source, a weak energy dependence of projected centroid altitude above 20 keV, a shorter duration, and an HXR photon spectrum slightly harder than those of the lower source. In addition, the microwave emission at 34 GHz also exhibits a similar double-source structure and the microwave spectra at both sources are in line with gyrosynchrotron emission given by non-thermal energetic electrons. These observations, especially the co-incidence of the very-fast squeezing-in motion of side lobes and the upper source, indicate that the upper source is associated with (and possibly caused by) this fast motion of arcades. This sheds new light on the origin of the corona double-source structure observed in both HXRs and microwaves.« less

  7. A novel transient structure with phylogenetic implications found in ratite spermatids

    PubMed Central

    2013-01-01

    Background A novel transient structure was observed in the spermatids of three ratite species using transmission electron microscopy. Results The structure first appeared at the circular manchette stage of sperm development, was most prominent during the longitudinal manchette phase and disappeared abruptly prior to spermiation. It was composed of regularly-spaced finger-like projections which were closely associated with the outer nuclear membrane, giving the nucleus a cogwheel-like appearance. The projections were approximately 30 nm long and 14 nm wide. Although a similar structure has been described in certain lizard and crocodile species, this is the first report of a similar structure in the developing spermatids of birds. Conclusions The potential value of non-traditional characters, such as spermiogenesis and sperm ultrastructure, as phylogenetic markers has recently been advocated. The morphologically unique structure found in ratite spermatids provides additional evidence of a possible phylogenetic link between the reptiles and birds. It also endorses the basal positioning of the ratites as a monophyletic group within the avian phylogenetic tree. PMID:23705947

  8. Folding and unfolding phylogenetic trees and networks.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Steel, Mike; Wu, Taoyang

    2016-12-01

    Phylogenetic networks are rooted, labelled directed acyclic graphswhich are commonly used to represent reticulate evolution. There is a close relationship between phylogenetic networks and multi-labelled trees (MUL-trees). Indeed, any phylogenetic network N can be "unfolded" to obtain a MUL-tree U(N) and, conversely, a MUL-tree T can in certain circumstances be "folded" to obtain aphylogenetic network F(T) that exhibits T. In this paper, we study properties of the operations U and F in more detail. In particular, we introduce the class of stable networks, phylogenetic networks N for which F(U(N)) is isomorphic to N, characterise such networks, and show that they are related to the well-known class of tree-sibling networks. We also explore how the concept of displaying a tree in a network N can be related to displaying the tree in the MUL-tree U(N). To do this, we develop aphylogenetic analogue of graph fibrations. This allows us to view U(N) as the analogue of the universal cover of a digraph, and to establish a close connection between displaying trees in U(N) and reconciling phylogenetic trees with networks.

  9. Identifying equivalent sound sources from aeroacoustic simulations using a numerical phased array

    NASA Astrophysics Data System (ADS)

    Pignier, Nicolas J.; O'Reilly, Ciarán J.; Boij, Susann

    2017-04-01

    An application of phased array methods to numerical data is presented, aimed at identifying equivalent flow sound sources from aeroacoustic simulations. Based on phased array data extracted from compressible flow simulations, sound source strengths are computed on a set of points in the source region using phased array techniques assuming monopole propagation. Two phased array techniques are used to compute the source strengths: an approach using a Moore-Penrose pseudo-inverse and a beamforming approach using dual linear programming (dual-LP) deconvolution. The first approach gives a model of correlated sources for the acoustic field generated from the flow expressed in a matrix of cross- and auto-power spectral values, whereas the second approach results in a model of uncorrelated sources expressed in a vector of auto-power spectral values. The accuracy of the equivalent source model is estimated by computing the acoustic spectrum at a far-field observer. The approach is tested first on an analytical case with known point sources. It is then applied to the example of the flow around a submerged air inlet. The far-field spectra obtained from the source models for two different flow conditions are in good agreement with the spectra obtained with a Ffowcs Williams-Hawkings integral, showing the accuracy of the source model from the observer's standpoint. Various configurations for the phased array and for the sources are used. The dual-LP beamforming approach shows better robustness to changes in the number of probes and sources than the pseudo-inverse approach. The good results obtained with this simulation case demonstrate the potential of the phased array approach as a modelling tool for aeroacoustic simulations.

  10. Worldwide Phylogenetic Relationship of Avian Poxviruses

    PubMed Central

    Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of the Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups, and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g., starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy. PMID:23408635

  11. Worldwide phylogenetic relationship of avian poxviruses

    USGS Publications Warehouse

    Gyuranecz, Miklós; Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we have expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g. starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy.

  12. Diversity of Kale (Brassica oleracea var. sabellica): Glucosinolate Content and Phylogenetic Relationships.

    PubMed

    Hahn, Christoph; Müller, Anja; Kuhnert, Nikolai; Albach, Dirk

    2016-04-27

    Recently, kale has become popular due to nutritive components beneficial for human health. It is an important source of phytochemicals such as glucosinolates that trigger associated cancer-preventive activity. However, nutritional value varies among glucosinolates and among cultivars. Here, we start a systematic determination of the content of five glucosinolates in 25 kale varieties and 11 non-kale Brassica oleracea cultivars by HPLC-DAD-ESI-MS(n) and compare the profiles with results from the analysis of SNPs derived from a KASP genotyping assay. Our results demonstrate that the glucosinolate levels differ markedly among varieties of different origin. Comparison of the phytochemical data with phylogenetic relationships revealed that the common name kale refers to at least three different groups. German, American, and Italian kales differ morphologically and phytochemically. Landraces do not show outstanding glucosinolate levels. Our results demonstrate the diversity of kale and the importance of preserving a broad genepool for future breeding purposes.

  13. Culturable bacterial communities associated to Brazilian Oscarella species (Porifera: Homoscleromorpha) and their antagonistic interactions.

    PubMed

    Laport, Marinella Silva; Bauwens, Mathieu; de Oliveira Nunes, Suzanne; Willenz, Philippe; George, Isabelle; Muricy, Guilherme

    2017-04-01

    Sponges offer an excellent model to investigate invertebrate-microorganism interactions. Furthermore, bacteria associated with marine sponges represent a rich source of bioactive metabolites. The aim of this study was to characterize the bacteria inhabiting a genus of sponges, Oscarella, and their potentiality for antimicrobial production. Bacterial isolates were recovered from different Oscarella specimens, among which 337 were phylogenetically identified. The culturable community was dominated by Proteobacteria and Firmicutes, and Vibrio was the most frequently isolated genus, followed by Shewanella. When tested for antimicrobial production, bacteria of the 12 genera isolated were capable of producing antimicrobial substances. The majority of strains were involved in antagonistic interactions and inhibitory activities were also observed against bacteria of medical importance. It was more pronounced in some isolated genera (Acinetobacter, Bacillus, Photobacterium, Shewanella and Vibrio). These findings suggest that chemical antagonism could play a significant role in shaping bacterial communities within Oscarella, a genus classified as low-microbial abundance sponge. Moreover, the identified strains may contribute to the search for new sources of antimicrobial substances, an important strategy for developing therapies to treat infections caused by multidrug-resistant bacteria. This study was the first to investigate the diversity and antagonistic activity of bacteria isolated from Oscarella spp. It highlights the biotechnological potential of sponge-associated bacteria.

  14. Deciphering the recent phylogenetic expansion of the originally deeply rooted Mycobacterium tuberculosis lineage 7.

    PubMed

    Yimer, Solomon A; Namouchi, Amine; Zegeye, Ephrem Debebe; Holm-Hansen, Carol; Norheim, Gunnstein; Abebe, Markos; Aseffa, Abraham; Tønjum, Tone

    2016-06-30

    A deeply rooted phylogenetic lineage of Mycobacterium tuberculosis (M. tuberculosis) termed lineage 7 was discovered in Ethiopia. Whole genome sequencing of 30 lineage 7 strains from patients in Ethiopia was performed. Intra-lineage genome variation was defined and unique characteristics identified with a focus on genes involved in DNA repair, recombination and replication (3R genes). More than 800 mutations specific to M. tuberculosis lineage 7 strains were identified. The proportion of non-synonymous single nucleotide polymorphisms (nsSNPs) in 3R genes was higher after the recent expansion of M. tuberculosis lineage 7 strain started. The proportion of nsSNPs in genes involved in inorganic ion transport and metabolism was significantly higher before the expansion began. A total of 22346 bp deletions were observed. Lineage 7 strains also exhibited a high number of mutations in genes involved in carbohydrate transport and metabolism, transcription, energy production and conversion. We have identified unique genomic signatures of the lineage 7 strains. The high frequency of nsSNP in 3R genes after the phylogenetic expansion may have contributed to recent variability and adaptation. The abundance of mutations in genes involved in inorganic ion transport and metabolism before the expansion period may indicate an adaptive response of lineage 7 strains to enable survival, potentially under environmental stress exposure. As lineage 7 strains originally were phylogenetically deeply rooted, this may indicate fundamental adaptive genomic pathways affecting the fitness of M. tuberculosis as a species.

  15. Associating Fast Radio Bursts with Extragalactic Radio Sources: General Methodology and a Search for a Counterpart to FRB 170107

    NASA Astrophysics Data System (ADS)

    Eftekhari, T.; Berger, E.; Williams, P. K. G.; Blanchard, P. K.

    2018-06-01

    The discovery of a repeating fast radio burst (FRB) has led to the first precise localization, an association with a dwarf galaxy, and the identification of a coincident persistent radio source. However, further localizations are required to determine the nature of FRBs, the sources powering them, and the possibility of multiple populations. Here we investigate the use of associated persistent radio sources to establish FRB counterparts, taking into account the localization area and the source flux density. Due to the lower areal number density of radio sources compared to faint optical sources, robust associations can be achieved for less precise localizations as compared to direct optical host galaxy associations. For generally larger localizations that preclude robust associations, the number of candidate hosts can be reduced based on the ratio of radio-to-optical brightness. We find that confident associations with sources having a flux density of ∼0.01–1 mJy, comparable to the luminosity of the persistent source associated with FRB 121102 over the redshift range z ≈ 0.1–1, require FRB localizations of ≲20″. We demonstrate that even in the absence of a robust association, constraints can be placed on the luminosity of an associated radio source as a function of localization and dispersion measure (DM). For DM ≈1000 pc cm‑3, an upper limit comparable to the luminosity of the FRB 121102 persistent source can be placed if the localization is ≲10″. We apply our analysis to the case of the ASKAP FRB 170107, using optical and radio observations of the localization region. We identify two candidate hosts based on a radio-to-optical brightness ratio of ≳100. We find that if one of these is indeed associated with FRB 170107, the resulting radio luminosity (1029‑ 4 × 1030 erg s‑1 Hz‑1, as constrained from the DM value) is comparable to the luminosity of the FRB 121102 persistent source.

  16. Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution

    PubMed Central

    Kendall, Michelle; Colijn, Caroline

    2016-01-01

    Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Key words: phylogenetics, evolution, tree metrics, genetics, sequencing. PMID:27343287

  17. Determinants of HIV Phylogenetic Clustering in Chicago Among Young Black Men Who Have Sex With Men From the uConnect Cohort.

    PubMed

    Morgan, Ethan; Nyaku, Amesika N; DʼAquila, Richard T; Schneider, John A

    2017-07-01

    Phylogenetic analysis determines similarities among HIV genetic sequences from persons infected with HIV, identifying clusters of transmission. We determined characteristics associated with both membership in an HIV transmission cluster and the number of clustered sequences among a cohort of young black men who have sex with men (YBMSM) in Chicago. Pairwise genetic distances of HIV-1 pol sequences were collected during 2013-2016. Potential transmission ties were identified among HIV-infected persons whose sequences were ≤1.5% genetically distant. Putative transmission pairs were defined as ≥1 tie to another sequence. We then determined demographic and risk attributes associated with both membership in an HIV transmission cluster and the number of ties to the sequences from other persons in the cluster. Of 86 available sequences, 31 (36.0%) were tied to ≥1 other sequence. Through multivariable analyses, we determined that those who reported symptoms of depression and those who had a higher number of confidants in their network had significantly decreased odds of membership in transmission clusters. We found that those who had unstable housing and who reported heavy marijuana use had significantly more ties to other individuals within transmission clusters, whereas those identifying as bisexual, those participating in group sex, and those with higher numbers of sexual partners had significantly fewer ties. This study demonstrates the potential for combining phylogenetic and individual and network attributes to target HIV control efforts to persons with potentially higher transmission risk, as well as suggesting some unappreciated specific predictors of transmission risk among YBMSM in Chicago for future study.

  18. Ixodes ricinus Tick Lipocalins: Identification, Cloning, Phylogenetic Analysis and Biochemical Characterization

    PubMed Central

    Beaufays, Jérôme; Adam, Benoît; Decrem, Yves; Prévôt, Pierre-Paul; Santini, Sébastien; Brasseur, Robert; Brossard, Michel; Lins, Laurence

    2008-01-01

    Background During their blood meal, ticks secrete a wide variety of proteins that interfere with their host's defense mechanisms. Among these proteins, lipocalins play a major role in the modulation of the inflammatory response. Methodology/Principal Findings Screening a cDNA library in association with RT-PCR and RACE methodologies allowed us to identify 14 new lipocalin genes in the salivary glands of the Ixodes ricinus hard tick. A computational in-depth structural analysis confirmed that LIRs belong to the lipocalin family. These proteins were called LIR for “Lipocalin from I. ricinus” and numbered from 1 to 14 (LIR1 to LIR14). According to their percentage identity/similarity, LIR proteins may be assigned to 6 distinct phylogenetic groups. The mature proteins have calculated pM and pI varying from 21.8 kDa to 37.2 kDa and from 4.45 to 9.57 respectively. In a western blot analysis, all recombinant LIRs appeared as a series of thin bands at 50–70 kDa, suggesting extensive glycosylation, which was experimentally confirmed by treatment with N-glycosidase F. In addition, the in vivo expression analysis of LIRs in I. ricinus, examined by RT-PCR, showed homogeneous expression profiles for certain phylogenetic groups and relatively heterogeneous profiles for other groups. Finally, we demonstrated that LIR6 codes for a protein that specifically binds leukotriene B4. Conclusions/Significance This work confirms that, regarding their biochemical properties, expression profile, and sequence signature, lipocalins in Ixodes hard tick genus, and more specifically in the Ixodes ricinus species, are segregated into distinct phylogenetic groups suggesting potential distinct function. This was particularly demonstrated by the ability of LIR6 to scavenge leukotriene B4. The other LIRs did not bind any of the ligands tested, such as 5-hydroxytryptamine, ADP, norepinephrine, platelet activating factor, prostaglandins D2 and E2, and finally leukotrienes B4 and C4. PMID:19096708

  19. Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

    PubMed

    Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark

    2003-07-04

    The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.

  20. Towards measles elimination: Phylogenetic analysis of measles viruses in Turkey (2012-2013) and identification of genotype D8.

    PubMed

    Kalaycioglu, Atila T; Yolbakan, Sultan; Guldemir, Dilek; Korukluoglu, Gulay; Coskun, Aslihan; Cosgun, Yasemin; Durmaz, Riza

    2016-11-01

    Molecular characterization of different measles virus (MV) strains is essential to combat the disease. Sixty measles MV strains were obtained from throat swabs or urine of patients in Turkey between 2012 and 2013 and characterized. MV RNA sequences (n = 60) were analysed for 456 nucleotides representing hypervariable domain of the nucleoprotein (N) gene. Of the 60 strains analysed 53 were the D8 genotype, 6 were B3, 1 was D4, and 1 was A. This report describes MV genotype D8 that was involved in a measles outbreak in Turkey. Sequences of most genotype D8 strains (n = 51) were identical to the sequence of variant D8-Frankfurt-Main, which has been associated with outbreaks throughout Europe. Despite the lack of epidemiologic information, a phylogenetic analysis suggested that the genotype D8 MV may have been brought to Turkey from elsewhere. Phylogenetic and epidemiological findings suggested that strains identified in tourists and associated with importation included one strain of genotype D8, one strain of genotype B3, and one strain of genotype D4. These findings from the 2012 to 2013 outbreak in Turkey confirm that pockets of unimmunised individuals are making the country susceptible to measles outbreaks. To prevent further outbreaks, deliberate and sustained effort must be made to reach, and immunise susceptible age groups. Towards measles elimination process, continued molecular surveillance of measles strains in Turkey will help identify transmission patterns of virus and evaluate vaccination efforts. J. Med. Virol. 88:1867-1873, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  1. Identifying Emergency Department Patients at Low Risk for a Variceal Source of Upper Gastrointestinal Hemorrhage.

    PubMed

    Klein, Lauren R; Money, Joel; Maharaj, Kaveesh; Robinson, Aaron; Lai, Tarissa; Driver, Brian E

    2017-11-01

    Assessing the likelihood of a variceal versus nonvariceal source of upper gastrointestinal bleeding (UGIB) guides therapy, but can be difficult to determine on clinical grounds. The objective of this study was to determine if there are easily ascertainable clinical and laboratory findings that can identify a patient as low risk for a variceal source of hemorrhage. This was a retrospective cohort study of adult ED patients with UGIB between January 2008 and December 2014 who had upper endoscopy performed during hospitalization. Clinical and laboratory data were abstracted from the medical record. The source of the UGIB was defined as variceal or nonvariceal based on endoscopic reports. Binary recursive partitioning was utilized to create a clinical decision rule. The rule was internally validated and test characteristics were calculated with 1,000 bootstrap replications. A total of 719 patients were identified; mean age was 55 years and 61% were male. There were 71 (10%) patients with a variceal UGIB identified on endoscopy. Binary recursive partitioning yielded a two-step decision rule (platelet count > 200 × 10 9 /L and an international normalized ratio [INR] < 1.3), which identified patients who were low risk for a variceal source of hemorrhage. For the bootstrapped samples, the rule performed with 97% sensitivity (95% confidence interval [CI] = 91%-100%) and 49% specificity (95% CI = 44%-53%). Although this derivation study must be externally validated before widespread use, patients presenting to the ED with an acute UGIB with platelet count of >200 × 10 9 /L and an INR of <1.3 may be at very low risk for a variceal source of their upper gastrointestinal hemorrhage. © 2017 by the Society for Academic Emergency Medicine.

  2. Predicting rates of interspecific interaction from phylogenetic trees.

    PubMed

    Nuismer, Scott L; Harmon, Luke J

    2015-01-01

    Integrating phylogenetic information can potentially improve our ability to explain species' traits, patterns of community assembly, the network structure of communities, and ecosystem function. In this study, we use mathematical models to explore the ecological and evolutionary factors that modulate the explanatory power of phylogenetic information for communities of species that interact within a single trophic level. We find that phylogenetic relationships among species can influence trait evolution and rates of interaction among species, but only under particular models of species interaction. For example, when interactions within communities are mediated by a mechanism of phenotype matching, phylogenetic trees make specific predictions about trait evolution and rates of interaction. In contrast, if interactions within a community depend on a mechanism of phenotype differences, phylogenetic information has little, if any, predictive power for trait evolution and interaction rate. Together, these results make clear and testable predictions for when and how evolutionary history is expected to influence contemporary rates of species interaction. © 2014 John Wiley & Sons Ltd/CNRS.

  3. Phylogenetic reconstruction and polymorphism analysis of BK virus VP2 gene isolated from renal transplant recipients in China

    PubMed Central

    WANG, ZHANG-YANG; HONG, WEI-LONG; ZHU, ZHE-HUI; CHEN, YUN-HAO; YE, WEN-LE; CHU, GUANG-YU; LI, JIA-LIN; CHEN, BI-CHENG; XIA, PENG

    2015-01-01

    BK polyomavirus (BKV) is important pathogen for kidney transplant recipients, as it is frequently re-activated, leading to nephropathy. The aim of this study was to investigate the phylogenetic reconstruction and polymorphism of the VP2 gene in BKV isolated from Chinese kidney transplant recipients. Phylogenetic analysis was carried out in the VP2 region from 135 BKV-positive samples and 28 reference strains retrieved from GenBank. The unweighted pair-group method with arithmetic mean (UPGMA) grouped all strains into subtypes, but failed to subdivide strains into subgroups. Among the plasma and urine samples, all plasma (23/23) and 82 urine samples (82/95) were identified to contain subtype I; the other 10 urine samples contained subtype IV. A 86-bp fragment was identified as a highly conserved sequence. Following alignment with 36 published BKV sequences from China, 92 sites of polymorphism were identified, including 11 single nucleotide polymorphisms (SNPs) prevalent in Chinese individuals and 30 SNPs that were specific to the two predominant subtypes I and IV. The limitations of the VP2 gene segment in subgrouping were confirmed by phylogenetic analysis. The conserved sequence and polymorphism identified in this study may be helpful in the detection and genotyping of BKV. PMID:26640547

  4. Polymorphism and phylogenetic species delimitation in filamentous fungi from predominant mycobiota in withered grapes.

    PubMed

    Lorenzini, M; Cappello, M S; Logrieco, A; Zapparoli, G

    2016-12-05

    Filamentous fungi are the main pathogens of withered grapes destined for passito wine production. Knowledge of which species inhabit these post-harvest fruits and their pathogenicity is essential in order to develop strategies to control infection, but is still scarce. This study investigated the predominant mycobiota of withered grapes through a cultivation-dependent approach. Strain and species heterogeneity was evidenced on examining isolates collected over three consecutive years. Colony morphology and PCR-restriction fragment length polymorphism (PCR-RFLP) analysis revealed the occurrence of several phenotypes and haplotypes, respectively. Strains were phylogenetically analyzed based on sequence typing of different genes or regions (e.g. calmodulin, β-tubulin and internal transcribed spacer region). Beside the most common necrotrophic-saprophytic species of Penicillium, Aspergillus, Alternaria and Botrytis species responsible for fruit rot, other saprobic species were identified (e.g. Trichoderma atroviride, Sarocladium terricola, Arthrinium arundinis and Diaporthe eres) generally not associated with post-harvest fruit diseases. Species such as Penicillium ubiquetum, Cladosporium pseudocladosporioides, Lichtheimia ramosa, Sarocladium terricola, Diaporthe nobilis, Bipolaris secalis, Paraconiothyrium fuckelii and Galactomyces reessii that had never previously been isolated from grapevine or grape were also identified. Moreover, it was not possible to assign a species to some isolates, while some members of Didymosphaeriaceae and Didymellaceae remained unclassified even at genus level. This study provides insights into the diversity of the epiphytic fungi inhabiting withered grapes and evidences the importance of their identification to understand the causes of fruit diseases. Finally, phylogenetic species delimitation furnished data of interest to fungal taxonomy. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Identifying natural and anthropogenic sources of metals in urban and rural soils using GIS-based data, PCA, and spatial interpolation

    PubMed Central

    Davis, Harley T.; Aelion, C. Marjorie; McDermott, Suzanne; Lawson, Andrew B.

    2009-01-01

    Determining sources of neurotoxic metals in rural and urban soils is important for mitigating human exposure. Surface soil from four areas with significant clusters of mental retardation and developmental delay (MR/DD) in children, and one control site were analyzed for nine metals and characterized by soil type, climate, ecological region, land use and industrial facilities using readily-available GIS-based data. Kriging, principal component analysis (PCA) and cluster analysis (CA) were used to identify commonalities of metal distribution. Three MR/DD areas (one rural and two urban) had similar soil types and significantly higher soil metal concentrations. PCA and CA results suggested that Ba, Be and Mn were consistently from natural sources; Pb and Hg from anthropogenic sources; and As, Cr, Cu, and Ni from both sources. Arsenic had low commonality estimates, was highly associated with a third PCA factor, and had a complex distribution, complicating mitigation strategies to minimize concentrations and exposures. PMID:19361902

  6. FluReF, an automated flu virus reassortment finder based on phylogenetic trees.

    PubMed

    Yurovsky, Alisa; Moret, Bernard M E

    2011-01-01

    Reassortments are events in the evolution of the genome of influenza (flu), whereby segments of the genome are exchanged between different strains. As reassortments have been implicated in major human pandemics of the last century, their identification has become a health priority. While such identification can be done "by hand" on a small dataset, researchers and health authorities are building up enormous databases of genomic sequences for every flu strain, so that it is imperative to develop automated identification methods. However, current methods are limited to pairwise segment comparisons. We present FluReF, a fully automated flu virus reassortment finder. FluReF is inspired by the visual approach to reassortment identification and uses the reconstructed phylogenetic trees of the individual segments and of the full genome. We also present a simple flu evolution simulator, based on the current, source-sink, hypothesis for flu cycles. On synthetic datasets produced by our simulator, FluReF, tuned for a 0% false positive rate, yielded false negative rates of less than 10%. FluReF corroborated two new reassortments identified by visual analysis of 75 Human H3N2 New York flu strains from 2005-2008 and gave partial verification of reassortments found using another bioinformatics method. FluReF finds reassortments by a bottom-up search of the full-genome and segment-based phylogenetic trees for candidate clades--groups of one or more sampled viruses that are separated from the other variants from the same season. Candidate clades in each tree are tested to guarantee confidence values, using the lengths of key edges as well as other tree parameters; clades with reassortments must have validated incongruencies among segment trees. FluReF demonstrates robustness of prediction for geographically and temporally expanded datasets, and is not limited to finding reassortments with previously collected sequences. The complete source code is available from http://lcbb.epfl.ch/software.html.

  7. Maximizing the phylogenetic diversity of seed banks.

    PubMed

    Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard

    2015-04-01

    Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. © 2014 Society for Conservation Biology.

  8. What is the phylogenetic signal limit from mitogenomes? The reconciliation between mitochondrial and nuclear data in the Insecta class phylogeny

    PubMed Central

    2011-01-01

    Background Efforts to solve higher-level evolutionary relationships within the class Insecta by using mitochondrial genomic data are hindered due to fast sequence evolution of several groups, most notably Hymenoptera, Strepsiptera, Phthiraptera, Hemiptera and Thysanoptera. Accelerated rates of substitution on their sequences have been shown to have negative consequences in phylogenetic inference. In this study, we tested several methodological approaches to recover phylogenetic signal from whole mitochondrial genomes. As a model, we used two classical problems in insect phylogenetics: The relationships within Paraneoptera and within Holometabola. Moreover, we assessed the mitochondrial phylogenetic signal limits in the deeper Eumetabola dataset, and we studied the contribution of individual genes. Results Long-branch attraction (LBA) artefacts were detected in all the datasets. Methods using Bayesian inference outperformed maximum likelihood approaches, and LBA was avoided in Paraneoptera and Holometabola when using protein sequences and the site-heterogeneous mixture model CAT. The better performance of this method was evidenced by resulting topologies matching generally accepted hypotheses based on nuclear and/or morphological data, and was confirmed by cross-validation and simulation analyses. Using the CAT model, the order Strepsiptera was recovered as sister to Coleoptera for the first time using mitochondrial sequences, in agreement with recent results based on large nuclear and morphological datasets. Also the Hymenoptera-Mecopterida association was obtained, leaving Coleoptera and Strepsiptera as the basal groups of the holometabolan insects, which coincides with one of the two main competing hypotheses. For the Paraneroptera, the currently accepted non-monophyly of Homoptera was documented as a phylogenetic novelty for mitochondrial data. However, results were not satisfactory when exploring the entire Eumetabola, revealing the limits of the phylogenetic

  9. Acremonium phylogenetic overview and revision of Gliomastix, Sarocladium, and Trichothecium

    PubMed Central

    Summerbell, R.C.; Gueidan, C.; Schroers, H-J.; de Hoog, G.S.; Starink, M.; Rosete, Y. Arocha; Guarro, J.; Scott, J.A.

    2011-01-01

    Over 200 new sequences are generated for members of the genus Acremonium and related taxa including ribosomal small subunit sequences (SSU) for phylogenetic analysis and large subunit (LSU) sequences for phylogeny and DNA-based identification. Phylogenetic analysis reveals that within the Hypocreales, there are two major clusters containing multiple Acremonium species. One clade contains Acremonium sclerotigenum, the genus Emericellopsis, and the genus Geosmithia as prominent elements. The second clade contains the genera Gliomastix sensu stricto and Bionectria. In addition, there are numerous smaller clades plus two multi-species clades, one containing Acremonium strictum and the type species of the genus Sarocladium, and, as seen in the combined SSU/LSU analysis, one associated subclade containing Acremonium breve and related species plus Acremonium curvulum and related species. This sequence information allows the revision of three genera. Gliomastix is revived for five species, G. murorum, G. polychroma, G. tumulicola, G. roseogrisea, and G. masseei. Sarocladium is extended to include all members of the phylogenetically distinct A. strictum clade including the medically important A. kiliense and the protective maize endophyte A. zeae. Also included in Sarocladium are members of the phylogenetically delimited Acremonium bacillisporum clade, closely linked to the A. strictum clade. The genus Trichothecium is revised following the principles of unitary nomenclature based on the oldest valid anamorph or teleomorph name, and new combinations are made in Trichothecium for the tightly interrelated Acremonium crotocinigenum, Spicellum roseum, and teleomorph Leucosphaerina indica. Outside the Hypocreales, numerous Acremonium-like species fall into the Plectosphaerellaceae, and A. atrogriseum falls into the Cephalothecaceae. PMID:21523192

  10. Acremonium phylogenetic overview and revision of Gliomastix, Sarocladium, and Trichothecium.

    PubMed

    Summerbell, R C; Gueidan, C; Schroers, H-J; de Hoog, G S; Starink, M; Rosete, Y Arocha; Guarro, J; Scott, J A

    2011-01-01

    Over 200 new sequences are generated for members of the genus Acremonium and related taxa including ribosomal small subunit sequences (SSU) for phylogenetic analysis and large subunit (LSU) sequences for phylogeny and DNA-based identification. Phylogenetic analysis reveals that within the Hypocreales, there are two major clusters containing multiple Acremonium species. One clade contains Acremonium sclerotigenum, the genus Emericellopsis, and the genus Geosmithia as prominent elements. The second clade contains the genera Gliomastixsensu stricto and Bionectria. In addition, there are numerous smaller clades plus two multi-species clades, one containing Acremonium strictum and the type species of the genus Sarocladium, and, as seen in the combined SSU/LSU analysis, one associated subclade containing Acremonium breve and related species plus Acremonium curvulum and related species. This sequence information allows the revision of three genera. Gliomastix is revived for five species, G. murorum, G. polychroma, G. tumulicola, G. roseogrisea, and G. masseei. Sarocladium is extended to include all members of the phylogenetically distinct A. strictum clade including the medically important A. kiliense and the protective maize endophyte A. zeae. Also included in Sarocladium are members of the phylogenetically delimited Acremonium bacillisporum clade, closely linked to the A. strictum clade. The genus Trichothecium is revised following the principles of unitary nomenclature based on the oldest valid anamorph or teleomorph name, and new combinations are made in Trichothecium for the tightly interrelated Acremonium crotocinigenum, Spicellum roseum, and teleomorph Leucosphaerinaindica. Outside the Hypocreales, numerous Acremonium-like species fall into the Plectosphaerellaceae, and A. atrogriseum falls into the Cephalothecaceae.

  11. Assessment of phylogenetic sensitivity for reconstructing HIV-1 epidemiological relationships.

    PubMed

    Beloukas, Apostolos; Magiorkinis, Emmanouil; Magiorkinis, Gkikas; Zavitsanou, Asimina; Karamitros, Timokratis; Hatzakis, Angelos; Paraskevis, Dimitrios

    2012-06-01

    Phylogenetic analysis has been extensively used as a tool for the reconstruction of epidemiological relations for research or for forensic purposes. It was our objective to assess the sensitivity of different phylogenetic methods and various phylogenetic programs to reconstruct epidemiological links among HIV-1 infected patients that is the probability to reveal a true transmission relationship. Multiple datasets (90) were prepared consisting of HIV-1 sequences in protease (PR) and partial reverse transcriptase (RT) sampled from patients with documented epidemiological relationship (target population), and from unrelated individuals (control population) belonging to the same HIV-1 subtype as the target population. Each dataset varied regarding the number, the geographic origin and the transmission risk groups of the sequences among the control population. Phylogenetic trees were inferred by neighbor-joining (NJ), maximum likelihood heuristics (hML) and Bayesian methods. All clusters of sequences belonging to the target population were correctly reconstructed by NJ and Bayesian methods receiving high bootstrap and posterior probability (PP) support, respectively. On the other hand, TreePuzzle failed to reconstruct or provide significant support for several clusters; high puzzling step support was associated with the inclusion of control sequences from the same geographic area as the target population. In contrary, all clusters were correctly reconstructed by hML as implemented in PhyML 3.0 receiving high bootstrap support. We report that under the conditions of our study, hML using PhyML, NJ and Bayesian methods were the most sensitive for the reconstruction of epidemiological links mostly from sexually infected individuals. Copyright © 2012 Elsevier B.V. All rights reserved.

  12. Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.

    PubMed

    Kendall, Michelle; Colijn, Caroline

    2016-10-01

    Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. phylogenetics, evolution, tree metrics, genetics, sequencing. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. Panoramic Electrophysiological Mapping but not Electrogram Morphology Identifies Stable Sources for Human Atrial Fibrillation

    PubMed Central

    Narayan, Sanjiv M.; Shivkumar, Kalyanam; Krummen, David E.; Miller, John M.; Rappel, Wouter-Jan

    2013-01-01

    Background The foundation for successful arrhythmia ablation is the mapping of electric propagation to identify underlying mechanisms. In atrial fibrillation (AF), however, mapping is difficult so that ablation has often targeted electrogram features, with mixed results. We hypothesized that wide field-of-view (panoramic) mapping of both atria would identify causal mechanisms for AF and allow interpretation of local electrogram features, including complex fractionated atrial electrograms (CFAE). Methods and Results Contact mapping was performed using biatrial multipolar catheters in 36 AF subjects (29 persistent). Stable AF rotors (spiral waves) or focal sources were seen in 35 of 36 cases and targeted for ablation (focal impulse and rotor modulation) before pulmonary vein isolation. In 31 of 36 subjects (86.1%), AF acutely terminated (n=20; 16 to sinus rhythm) or organized (n=11; 19±8% slowing) with 2.5 minutes focal impulse and rotor modulation (interquartile range, 1.0–3.1) at one source, defined as the primary source. Subjects exhibited 2.1±1.0 concurrent AF sources of which the primary, by phase mapping, precessed in limited areas (persistent 2.5±1.7 versus paroxysmal 1.7±0.5 cm2; P=0.30). Notably, source regions showed mixed electrogram amplitudes and CFAE grades that did not differ from surrounding atrium (P=NS). AF sources were not consistently surrounded by CFAE (P=0.67). Conclusions Stable rotors and focal sources for human AF were revealed by contact panoramic mapping (focal impulse and rotor modulation mapping), but not by electrogram footprints. AF sources precessed within areas of ≈2 cm2, with diverse voltage characteristics poorly correlated with CFAE. Most CFAE sites lie remote from AF sources and are not suitable targets for catheter ablation of AF. PMID:23392583

  14. Taxonomic and Phylogenetic Determinants of Functional Composition of Bolivian Bat Assemblages

    PubMed Central

    Aguirre, Luis F.; Montaño-Centellas, Flavia A.; Gavilanez, M. Mercedes; Stevens, Richard D.

    2016-01-01

    Understanding diversity patterns and the potential mechanisms driving them is a fundamental goal in ecology. Examination of different dimensions of biodiversity can provide insights into the relative importance of different processes acting upon biotas to shape communities. Unfortunately, patterns of diversity are still poorly understood in hyper-diverse tropical countries. Here, we assess spatial variation of taxonomic, functional and phylogenetic diversity of bat assemblages in one of the least studied Neotropical countries, Bolivia, and determine whether changes in biodiversity are explained by the replacement of species or functional groups, or by differences in richness (i.e., gain or loss of species or functional groups). Further, we evaluate the contribution of phylogenetic and taxonomic changes in the resulting patterns of functional diversity of bats. Using well-sampled assemblages from published studies we examine noctilionoid bats at ten study sites across five ecoregions in Bolivia. Bat assemblages differed from each other in all dimensions of biodiversity considered; however, diversity patterns for each dimension were likely structured by different mechanisms. Within ecoregions, differences were largely explained by species richness, suggesting that the gain or loss of species or functional groups (as opposed to replacement) was driving dissimilarity patterns. Overall, our results suggest that whereas evolutionary processes (i.e., historical connection and dispersal routes across Bolivia) create a template of diversity patterns across the country, ecological mechanisms modify these templates, decoupling the observed patterns of functional, taxonomic and phylogenetic diversity in Bolivian bats. Our results suggests that elevation represents an important source of variability among diversity patterns for each dimension of diversity considered. Further, we found that neither phylogenetic nor taxonomic diversity can fully account for patterns of functional

  15. Taxonomic and Phylogenetic Determinants of Functional Composition of Bolivian Bat Assemblages.

    PubMed

    Aguirre, Luis F; Montaño-Centellas, Flavia A; Gavilanez, M Mercedes; Stevens, Richard D

    2016-01-01

    Understanding diversity patterns and the potential mechanisms driving them is a fundamental goal in ecology. Examination of different dimensions of biodiversity can provide insights into the relative importance of different processes acting upon biotas to shape communities. Unfortunately, patterns of diversity are still poorly understood in hyper-diverse tropical countries. Here, we assess spatial variation of taxonomic, functional and phylogenetic diversity of bat assemblages in one of the least studied Neotropical countries, Bolivia, and determine whether changes in biodiversity are explained by the replacement of species or functional groups, or by differences in richness (i.e., gain or loss of species or functional groups). Further, we evaluate the contribution of phylogenetic and taxonomic changes in the resulting patterns of functional diversity of bats. Using well-sampled assemblages from published studies we examine noctilionoid bats at ten study sites across five ecoregions in Bolivia. Bat assemblages differed from each other in all dimensions of biodiversity considered; however, diversity patterns for each dimension were likely structured by different mechanisms. Within ecoregions, differences were largely explained by species richness, suggesting that the gain or loss of species or functional groups (as opposed to replacement) was driving dissimilarity patterns. Overall, our results suggest that whereas evolutionary processes (i.e., historical connection and dispersal routes across Bolivia) create a template of diversity patterns across the country, ecological mechanisms modify these templates, decoupling the observed patterns of functional, taxonomic and phylogenetic diversity in Bolivian bats. Our results suggests that elevation represents an important source of variability among diversity patterns for each dimension of diversity considered. Further, we found that neither phylogenetic nor taxonomic diversity can fully account for patterns of functional

  16. Phylogenetic Diversity of Vibrio cholerae Associated with Endemic Cholera in Mexico from 1991 to 2008

    PubMed Central

    Choi, Seon Young; Rashed, Shah M.; Hasan, Nur A.; Alam, Munirul; Islam, Tarequl; Sadique, Abdus; Johura, Fatema-Tuz; Eppinger, Mark; Huq, Anwar; Cravioto, Alejandro

    2016-01-01

    ABSTRACT An outbreak of cholera occurred in 1991 in Mexico, where it had not been reported for more than a century and is now endemic. Vibrio cholerae O1 prototype El Tor and classical strains coexist with altered El Tor strains (1991 to 1997). Nontoxigenic (CTX−) V. cholerae El Tor dominated toxigenic (CTX+) strains (2001 to 2003), but V. cholerae CTX+ variant El Tor was isolated during 2004 to 2008, outcompeting CTX− V. cholerae. Genomes of six Mexican V. cholerae O1 strains isolated during 1991 to 2008 were sequenced and compared with both contemporary and archived strains of V. cholerae. Three were CTX+ El Tor, two were CTX− El Tor, and the remaining strain was a CTX+ classical isolate. Whole-genome sequence analysis showed the six isolates belonged to five distinct phylogenetic clades. One CTX− isolate is ancestral to the 6th and 7th pandemic CTX+ V. cholerae isolates. The other CTX− isolate joined with CTX− non-O1/O139 isolates from Haiti and seroconverted O1 isolates from Brazil and Amazonia. One CTX+ isolate was phylogenetically placed with the sixth pandemic classical clade and the V. cholerae O395 classical reference strain. Two CTX+ El Tor isolates possessing intact Vibrio seventh pandemic island II (VSP-II) are related to hybrid El Tor isolates from Mozambique and Bangladesh. The third CTX+ El Tor isolate contained West African-South American (WASA) recombination in VSP-II and showed relatedness to isolates from Peru and Brazil. Except for one isolate, all Mexican isolates lack SXT/R391 integrative conjugative elements (ICEs) and sensitivity to selected antibiotics, with one isolate resistant to streptomycin. No isolates were related to contemporary isolates from Asia, Africa, or Haiti, indicating phylogenetic diversity. PMID:26980836

  17. Phylogenetic diversity of bacteria associated with Paleolithic paintings and surrounding rock walls in two Spanish caves (Llonín and La Garma).

    PubMed

    Schabereiter-Gurtner, Claudia; Saiz-Jimenez, Cesareo; Piñar, Guadalupe; Lubitz, Werner; Rölleke, Sabine

    2004-02-01

    Bacterial diversity in caves is still rarely investigated using culture-independent techniques. In the present study, bacterial communities on Paleolithic paintings and surrounding rock walls in two Spanish caves (Llonín and La Garma) were analyzed, using 16S rDNA-based denaturing gradient gel electrophoresis community fingerprinting and phylogenetic analyses without prior cultivation. Results revealed complex bacterial communities consisting of a high number of novel 16S rDNA sequence types and indicated a high biodiversity of lithotrophic and heterotrophic bacteria. Identified bacteria were related to already cultured bacteria (39 clones) and to environmental 16S rDNA clones (46 clones). The nearest phylogenetic relatives were members of the Proteobacteria (41.1%), of the Acidobacterium division (16.5%), Actinobacteria (20%), Firmicutes (10.6%), of the Cytophaga/Flexibacter/Bacteroides division (5.9%), Nitrospira group (3.5%), green non-sulfur bacteria (1.2%), and candidate WS3 division (1.2%). Thirteen of these clones were most closely related to those obtained from the previous studies on Tito Bustillo Cave. The comparison of the present data with the data obtained previously from Altamira and Tito Bustillo Caves revealed similarities in the bacterial community components, especially in the high abundance of the Acidobacteria and Rhizobiaceae, and in the presence of bacteria related to ammonia and sulfur oxidizers.

  18. Phenome-Wide Association Study of Rheumatoid Arthritis Subgroups Identifies Association between Seronegative Disease and Fibromyalgia

    PubMed Central

    Doss, Jayanth; Mo, Huan; Carroll, Robert J.; Crofford, Leslie J.; Denny, Joshua C.

    2016-01-01

    Objective The differences between seronegative and seropositive rheumatoid arthritis (RA) have not been widely reported. We performed electronic health record (EHR)-based phenome-wide association studies (PheWAS) to identify disease associations in seropositive and seronegative RA. Methods A validated algorithm identified RA subjects from the de-identified EHR. Serotypes were determined by values of rheumatoid factor (RF) and anti-cyclic citrullinated peptide antibody (ACPA). We tested EHR-derived phenotypes using PheWAS comparing seropositive RA against seronegative RA, yielding disease associations. PheWAS was also performed on RF-positive versus RF-negative subjects and ACPA-positive versus ACPA-negative subjects. Following PheWAS, select phenotypes were then manually reviewed and fibromyalgia was specifically evaluated using a validated algorithm. Results There were 2199 individuals identified with RA and either RF or ACPA testing. Of these, 1382 (63%) were seropositive. Seronegative RA was associated with “Myalgia and Myositis” (odds ratio [OR] 2.1, P=3.7×10−10) and back pain. A manual record review showed 80% of Myalgia and Myositis codes were used for fibromyalgia, and follow-up with a specific EHR algorithm for fibromyalgia confirmed that seronegative RA was associated with fibromyalgia (OR=1.8, P=4.0×10−6). Seropositive RA was associated with Chronic Airway Obstruction (OR=2.2, P=1.4×10−4) and tobacco use (OR=2.2, P=7.0×10−4). Conclusion This PheWAS in RA patients identifies a strong association between seronegativity and fibromyalgia. It also affirms relationships between seropositivity with chronic airway obstruction and seropositivity with tobacco use. These findings demonstrate the utility of the PheWAS approach to discover novel phenotype associations within different subgroups of a disease. PMID:27589350

  19. Phenome-Wide Association Study of Rheumatoid Arthritis Subgroups Identifies Association Between Seronegative Disease and Fibromyalgia.

    PubMed

    Doss, Jayanth; Mo, Huan; Carroll, Robert J; Crofford, Leslie J; Denny, Joshua C

    2017-02-01

    The differences between seronegative and seropositive rheumatoid arthritis (RA) have not been widely reported. We performed electronic health record (EHR)-based phenome-wide association studies (PheWAS) to identify disease associations in seropositive and seronegative RA. A validated algorithm identified RA subjects from the de-identified version of the Vanderbilt University Medical Center EHR. Serotypes were determined by rheumatoid factor (RF) and anti-cyclic citrullinated peptide antibody (ACPA) values. We tested EHR-derived phenotypes using PheWAS comparing seropositive RA and seronegative RA, yielding disease associations. PheWAS was also performed in RF-positive versus RF-negative subjects and ACPA-positive versus ACPA-negative subjects. Following PheWAS, select phenotypes were then manually reviewed, and fibromyalgia was specifically evaluated using a validated algorithm. A total of 2,199 RA individuals with either RF or ACPA testing were identified. Of these, 1,382 patients (63%) were classified as seropositive. Seronegative RA was associated with myalgia and myositis (odds ratio [OR] 2.1, P = 3.7 × 10 -10 ) and back pain. A manual review of the health record showed that among subjects coded for Myalgia and Myositis, ∼80% had fibromyalgia. Follow-up with a specific EHR algorithm for fibromyalgia confirmed that seronegative RA was associated with fibromyalgia (OR 1.8, P = 4.0 × 10 -6 ). Seropositive RA was associated with chronic airway obstruction (OR 2.2, P = 1.4 × 10 -4 ) and tobacco use (OR 2.2, P = 7.0 × 10 -4 ). This PheWAS of RA patients identifies a strong association between seronegativity and fibromyalgia. It also affirms relationships between seropositivity and chronic airway obstruction and between seropositivity and tobacco use. These findings demonstrate the utility of the PheWAS approach to discover novel phenotype associations within different subgroups of a disease. © 2016, American College of Rheumatology.

  20. Evaluation of Bayesian approaches to identify DDT source contributions to soils in Southeast China.

    PubMed

    Zeng, Faming; Yang, Dan; Xing, Xinli; Qi, Shihua

    2017-06-01

    Dicofol application may be an important source to elevate the dichlorodiphenyltrichloroethane (DDT) residues to soils in Fujian, Southeast China, after the technical DDT was banned, which left DDT residues from the historical application. The DDT residues varied geographically, corresponding to the varied potential sources of DDT. In this study, a novel approach based on the Bayesian method (BM) was developed to identify the source contributions of DDT to soils, composed with both historical DDT and dicofol. The Naive Bayesian classifier was used basing on the subset of the samples, which were determined by chemical analysis independent of the Bayesian approach. The results show that BM (95%) was higher than that using the ratio of o, p'-/p, p'-DDT (84%) to identify DDT source contributions. High detection rate (97%) of dicofol (p, p'-OH-DDT) was observed in the subset, showing dicofol application influenced the DDX levels in soils in Fujian. However, the contribution from historical technical DDT source was greater than that from dicofol in Fujian, indicating historical technical DDT was still an important pollution source to soils. In addition, both the DDX (DDT isomers and derivatives) level and dicofol contribution in non-agricultural soils were higher than other agricultural land uses, especially in hilly regions, the potential cause may be the atmospheric transport of dicofol type DDT, after spraying during daytime, or regional difference on production and application. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. Phylogenetic turnover along local environmental gradients in tropical forest communities.

    PubMed

    Baldeck, C A; Kembel, S W; Harms, K E; Yavitt, J B; John, R; Turner, B L; Madawala, S; Gunatilleke, N; Gunatilleke, S; Bunyavejchewin, S; Kiratiprayoon, S; Yaacob, A; Supardi, M N N; Valencia, R; Navarrete, H; Davies, S J; Chuyong, G B; Kenfack, D; Thomas, D W; Dalling, J W

    2016-10-01

    While the importance of local-scale habitat niches in shaping tree species turnover along environmental gradients in tropical forests is well appreciated, relatively little is known about the influence of phylogenetic signal in species' habitat niches in shaping local community structure. We used detailed maps of the soil resource and topographic variation within eight 24-50 ha tropical forest plots combined with species phylogenies created from the APG III phylogeny to examine how phylogenetic beta diversity (indicating the degree of phylogenetic similarity of two communities) was related to environmental gradients within tropical tree communities. Using distance-based redundancy analysis we found that phylogenetic beta diversity, expressed as either nearest neighbor distance or mean pairwise distance, was significantly related to both soil and topographic variation in all study sites. In general, more phylogenetic beta diversity within a forest plot was explained by environmental variables this was expressed as nearest neighbor distance versus mean pairwise distance (3.0-10.3 % and 0.4-8.8 % of variation explained among plots, respectively), and more variation was explained by soil resource variables than topographic variables using either phylogenetic beta diversity metric. We also found that patterns of phylogenetic beta diversity expressed as nearest neighbor distance were consistent with previously observed patterns of niche similarity among congeneric species pairs in these plots. These results indicate the importance of phylogenetic signal in local habitat niches in shaping the phylogenetic structure of tropical tree communities, especially at the level of close phylogenetic neighbors, where similarity in habitat niches is most strongly preserved.

  2. Short-wavelength sensitive opsin (SWS1) as a new marker for vertebrate phylogenetics

    PubMed Central

    van Hazel, Ilke; Santini, Francesco; Müller, Johannes; Chang, Belinda SW

    2006-01-01

    Background Vertebrate SWS1 visual pigments mediate visual transduction in response to light at short wavelengths. Due to their importance in vision, SWS1 genes have been isolated from a surprisingly wide range of vertebrates, including lampreys, teleosts, amphibians, reptiles, birds, and mammals. The SWS1 genes exhibit many of the characteristics of genes typically targeted for phylogenetic analyses. This study investigates both the utility of SWS1 as a marker for inferring vertebrate phylogenetic relationships, and the characteristics of the gene that contribute to its phylogenetic utility. Results Phylogenetic analyses of vertebrate SWS1 genes produced topologies that were remarkably congruent with generally accepted hypotheses of vertebrate evolution at both higher and lower taxonomic levels. The few exceptions were generally associated with areas of poor taxonomic sampling, or relationships that have been difficult to resolve using other molecular markers. The SWS1 data set was characterized by a substantial amount of among-site rate variation, and a relatively unskewed substitution rate matrix, even when the data were partitioned into different codon sites and individual taxonomic groups. Although there were nucleotide biases in some groups at third positions, these biases were not convergent across different taxonomic groups. Conclusion Our results suggest that SWS1 may be a good marker for vertebrate phylogenetics due to the variable yet consistent patterns of sequence evolution exhibited across fairly wide taxonomic groups. This may result from constraints imposed by the functional role of SWS1 pigments in visual transduction. PMID:17107620

  3. Phylogenetic framework for coevolutionary studies: a compass for exploring jungles of tangled trees

    PubMed Central

    2016-01-01

    Abstract Phylogenetics is used to detect past evolutionary events, from how species originated to how their ecological interactions with other species arose, which can mirror cophylogenetic patterns. Cophylogenetic reconstructions uncover past ecological relationships between taxa through inferred coevolutionary events on trees, for example, codivergence, duplication, host-switching, and loss. These events can be detected by cophylogenetic analyses based on nodes and the length and branching pattern of the phylogenetic trees of symbiotic associations, for example, host–parasite. In the past 2 decades, algorithms have been developed for cophylogetenic analyses and implemented in different software, for example, statistical congruence index and event-based methods. Based on the combination of these approaches, it is possible to integrate temporal information into cophylogenetical inference, such as estimates of lineage divergence times between 2 taxa, for example, hosts and parasites. Additionally, the advances in phylogenetic biogeography applying methods based on parametric process models and combined Bayesian approaches, can be useful for interpreting coevolutionary histories in a scenario of biogeographical area connectivity through time. This article briefly reviews the basics of parasitology and provides an overview of software packages in cophylogenetic methods. Thus, the objective here is to present a phylogenetic framework for coevolutionary studies, with special emphasis on groups of parasitic organisms. Researchers wishing to undertake phylogeny-based coevolutionary studies can use this review as a “compass” when “walking” through jungles of tangled phylogenetic trees. PMID:29491928

  4. Identifying (subsurface) anthropogenic heat sources that influence temperature in the drinking water distribution system

    NASA Astrophysics Data System (ADS)

    Agudelo-Vera, Claudia M.; Blokker, Mirjam; de Kater, Henk; Lafort, Rob

    2017-09-01

    The water temperature in the drinking water distribution system and at customers' taps approaches the surrounding soil temperature at a depth of 1 m. Water temperature is an important determinant of water quality. In the Netherlands drinking water is distributed without additional residual disinfectant and the temperature of drinking water at customers' taps is not allowed to exceed 25 °C. In recent decades, the urban (sub)surface has been getting more occupied by various types of infrastructures, and some of these can be heat sources. Only recently have the anthropogenic sources and their influence on the underground been studied on coarse spatial scales. Little is known about the urban shallow underground heat profile on small spatial scales, of the order of 10 m × 10 m. Routine water quality samples at the tap in urban areas have shown up locations - so-called hotspots - in the city, with relatively high soil temperatures - up to 7 °C warmer - compared to the soil temperatures in the surrounding rural areas. Yet the sources and the locations of these hotspots have not been identified. It is expected that with climate change during a warm summer the soil temperature in the hotspots can be above 25 °C. The objective of this paper is to find a method to identify heat sources and urban characteristics that locally influence the soil temperature. The proposed method combines mapping of urban anthropogenic heat sources, retrospective modelling of the soil temperature, analysis of water temperature measurements at the tap, and extensive soil temperature measurements. This approach provided insight into the typical range of the variation of the urban soil temperature, and it is a first step to identifying areas with potential underground heat stress towards thermal underground management in cities.

  5. Phylogenetic diversity of bacterial communities in bovine rumen as affected by diets and microenvironments.

    PubMed

    Kim, Minseok; Morrison, Mark; Yu, Zhongtang

    2011-09-01

    Phylogenetic analysis was conducted to examine ruminal bacteria in two ruminal fractions (adherent fraction vs. liquid fraction) collected from cattle fed with two different diets: forage alone vs. forage plus concentrate. One hundred forty-four 16S rRNA gene (rrs) sequences were obtained from clone libraries constructed from the four samples. These rrs sequences were assigned to 116 different operational taxonomic units (OTUs) defined at 0.03 phylogenetic distance. Most of these OTUs could not be assigned to any known genus. The phylum Firmicutes was represented by approximately 70% of all the sequences. By comparing to the OTUs already documented in the rumen, 52 new OTUs were identified. UniFrac, SONS, and denaturing gradient gel electrophoresis analyses revealed difference in diversity between the two fractions and between the two diets. This study showed that rrs sequences recovered from small clone libraries can still help identify novel species-level OTUs.

  6. Phylogenetic Distribution of CRISPR-Cas Systems in Antibiotic-Resistant Pseudomonas aeruginosa.

    PubMed

    van Belkum, Alex; Soriaga, Leah B; LaFave, Matthew C; Akella, Srividya; Veyrieras, Jean-Baptiste; Barbu, E Magda; Shortridge, Dee; Blanc, Bernadette; Hannum, Gregory; Zambardi, Gilles; Miller, Kristofer; Enright, Mark C; Mugnier, Nathalie; Brami, Daniel; Schicklin, Stéphane; Felderman, Martina; Schwartz, Ariel S; Richardson, Toby H; Peterson, Todd C; Hubby, Bolyn; Cady, Kyle C

    2015-11-24

    Pseudomonas aeruginosa is an antibiotic-refractory pathogen with a large genome and extensive genotypic diversity. Historically, P. aeruginosa has been a major model system for understanding the molecular mechanisms underlying type I clustered regularly interspaced short palindromic repeat (CRISPR) and CRISPR-associated protein (CRISPR-Cas)-based bacterial immune system function. However, little information on the phylogenetic distribution and potential role of these CRISPR-Cas systems in molding the P. aeruginosa accessory genome and antibiotic resistance elements is known. Computational approaches were used to identify and characterize CRISPR-Cas systems within 672 genomes, and in the process, we identified a previously unreported and putatively mobile type I-C P. aeruginosa CRISPR-Cas system. Furthermore, genomes harboring noninhibited type I-F and I-E CRISPR-Cas systems were on average ~300 kb smaller than those without a CRISPR-Cas system. In silico analysis demonstrated that the accessory genome (n = 22,036 genes) harbored the majority of identified CRISPR-Cas targets. We also assembled a global spacer library that aided the identification of difficult-to-characterize mobile genetic elements within next-generation sequencing (NGS) data and allowed CRISPR typing of a majority of P. aeruginosa strains. In summary, our analysis demonstrated that CRISPR-Cas systems play an important role in shaping the accessory genomes of globally distributed P. aeruginosa isolates. P. aeruginosa is both an antibiotic-refractory pathogen and an important model system for type I CRISPR-Cas bacterial immune systems. By combining the genome sequences of 672 newly and previously sequenced genomes, we were able to provide a global view of the phylogenetic distribution, conservation, and potential targets of these systems. This analysis identified a new and putatively mobile P. aeruginosa CRISPR-Cas subtype, characterized the diverse distribution of known CRISPR-inhibiting genes, and

  7. Paracas dust storms: Sources, trajectories and associated meteorological conditions

    NASA Astrophysics Data System (ADS)

    Briceño-Zuluaga, F.; Castagna, A.; Rutllant, J. A.; Flores-Aqueveque, V.; Caquineau, S.; Sifeddine, A.; Velazco, F.; Gutierrez, D.; Cardich, J.

    2017-09-01

    Dust storms that develop along the Pisco-Ica desert in Southern Peru, locally known as ;Paracas; winds have ecological, health and economic repercussions. Here we identify dust sources through MODIS (Moderate Resolution Imaging Spectroradiometer) imagery and analyze HYSPLIT (Hybrid Single Particles Lagrangian Integrated Trajectory) model trajectories and dispersion patterns, along with concomitant synoptic-scale meteorological conditions from National Centers for Environmental Prediction/National Center for Atmospheric Research reanalysis (NCEP/NCAR). Additionally, surface pressure data from the hourly METeorological Aerodrome Report (METAR) at Arica (18.5°S, 70.3°W) and Pisco (13.7°S, 76.2°W) were used to calculate Alongshore (sea-level) Pressure Gradient (APG) anomalies during Paracas dust storms, their duration and associated wind-speeds and wind directions. This study provides a review on the occurrence and strength of the Paracas dust storms as reported in the Pisco airfield for five-year period and their correspondence with MODIS true-color imagery in terms of dust-emission source areas. Our results show that most of the particle fluxes moving into the Ica-Pisco desert area during Paracas wind events originate over the coastal zone, where strong winds forced by steep APGs develop as the axis of a deep mid-troposphere trough sets in along north-central Chile. Direct relationships between Paracas wind intensity, number of active dust-emission sources and APGs are also documented, although the scarcity of simultaneous METAR/MODIS data for clearly observed MODIS dust plumes prevents any significant statistical inference. Synoptic-scale meteorological composites from NCEP/NCAR reanalysis data show that Paracas wind events (steep APGs) are mostly associated with the strengthening of anticyclonic conditions in northern Chile, that can be attributed to cold air advection associated with the incoming trough. Compared to the MODIS images, HYSPLIT outputs were able

  8. Characterization of phylogenetically diverse astroviruses of marine mammals.

    PubMed

    Rivera, Rebecca; Nollens, Hendrik H; Venn-Watson, Stephanie; Gulland, Frances M D; Wellehan, James F X

    2010-01-01

    Astroviruses are small, non-enveloped, positive-stranded RNA viruses. Previously studied mammalian astroviruses have been associated with diarrhoeal disease. Knowledge of astrovirus diversity is very limited, with only six officially recognized astrovirus species from mammalian hosts and, in addition, one human and some bat astroviruses were recently described. We used consensus PCR techniques for initial identification of five astroviruses of marine mammals: three from California sea lions (Zalophus californianus), one from a Steller sea lion (Eumetopias jubatus) and one from a bottlenose dolphin (Tursiops truncatus). Bayesian and maximum-likelihood phylogenetic analysis found that these viruses showed significant diversity at a level consistent with novel species. Astroviruses that we identified from marine mammals were found across the mamastrovirus tree and did not form a monophyletic group. Recombination analysis found that a recombination event may have occurred between a human and a California sea lion astrovirus, suggesting that both lineages may have been capable of infecting the same host at one point. The diversity found amongst marine mammal astroviruses and their similarity to terrestrial astroviruses suggests that the marine environment plays an important role in astrovirus ecology.

  9. TreSpEx—Detection of Misleading Signal in Phylogenetic Reconstructions Based on Tree Information

    PubMed Central

    Struck, Torsten H

    2014-01-01

    Phylogenies of species or genes are commonplace nowadays in many areas of comparative biological studies. However, for phylogenetic reconstructions one must refer to artificial signals such as paralogy, long-branch attraction, saturation, or conflict between different datasets. These signals might eventually mislead the reconstruction even in phylogenomic studies employing hundreds of genes. Unfortunately, there has been no program allowing the detection of such effects in combination with an implementation into automatic process pipelines. TreSpEx (Tree Space Explorer) now combines different approaches (including statistical tests), which utilize tree-based information like nodal support or patristic distances (PDs) to identify misleading signals. The program enables the parallel analysis of hundreds of trees and/or predefined gene partitions, and being command-line driven, it can be integrated into automatic process pipelines. TreSpEx is implemented in Perl and supported on Linux, Mac OS X, and MS Windows. Source code, binaries, and additional material are freely available at http://www.annelida.de/research/bioinformatics/software.html. PMID:24701118

  10. Phylogenetic analysis and antifouling potentials of culturable fungi in mangrove sediments from Techeng Isle, China.

    PubMed

    Zhang, Xiao-Yong; Fu, Wen; Chen, Xiao; Yan, Mu-Ting; Huang, Xian-De; Bao, Jie

    2018-06-09

    To search for more microbial resources for screening environment-friendly antifoulants, we investigated the phylogenetic diversity and antifouling potentials of culturable fungi in mangrove sediments from Techeng Isle, China. A total of 176 isolates belonging to 57 fungal taxa were recovered and identified. The high levels of diversity and abundance of mangrove fungi from Techeng Isle were in accordance with previous studies on fungi from other mangrove ecosystems. Fifteen of the 176 isolates demonstrated high divergence (87-93%) from the known fungal taxa in GenBank. Moreover, 26 isolates recorded in mangrove ecosystems for the first time. These results suggested that mangrove sediments from Techeng Isle harbored some new fungal communities compared with other mangrove ecosystems. The antifouling activity of 57 representative isolates (belonging to 57 different fungal taxa) was tested against three marine bacteria (Loktanella hongkongensis, Micrococcus luteus and Pseudoalteromonas piscida) and two marine macrofoulers (bryozoan Bugula neritina and barnacle Balanus amphitrite). Approximately 40% of the tested isolates displayed distinct antifouling activity. Furthermore, 17 fungal isolates were found to display strong or a wide spectrum of antifouling activity in this study, suggesting that these isolates deserve further study as potential sources of novel antifouling metabolites. To our knowledge, this is the first report on the investigation of the phylogenetic diversity and antifouling potential of culturable fungi in mangrove sediments from Techeng Isle, China. These results contribute to our knowledge of mangrove fungi and further increases the pool of fungi available for natural bioactive product screening.

  11. Identifying Source Water and Flow Paths in a Semi-Arid Watershed

    NASA Astrophysics Data System (ADS)

    Gulvin, C. J.; Miller, S. N.

    2016-12-01

    Processes controlling water delivery to perennial streams in the semi-arid mountain west are poorly understood, yet necessary to characterize water distribution across the landscape and better protect and manage diminishing water resources. Stream water chemistry profiling and hydrograph separation using stable isotopes can help identify source waters. Weekly stream water samples tested for stable water isotope fractionations, and major cations and anions at seven sites collocated with continuously recording stream depth gauges within a small watershed in southeastern Wyoming is a necessary first-step to identifying seasonally changing source water and flow paths. Sample results will help establish appropriate end members for a mixing analysis, as well as, characterize flow path heterogeneity, transit time distributions, and landscape selectively features. Hourly stream sampling during late-summer thunderstorms and rapid spring melt will help demonstrate if and how stream discharge change is affected by the two different events. Soil water and water extracted from tree xylem will help resolve how water is partitioned in the first 10m of the subsurface. In the face of land use change and a growing demand for water in the area, understanding how the water in small mountain streams is sustained is crucial for the future of agriculture, municipal water supplies, and countless ecosystem services.

  12. ["Long-branch Attraction" artifact in phylogenetic reconstruction].

    PubMed

    Li, Yi-Wei; Yu, Li; Zhang, Ya-Ping

    2007-06-01

    Phylogenetic reconstruction among various organisms not only helps understand their evolutionary history but also reveal several fundamental evolutionary questions. Understanding of the evolutionary relationships among organisms establishes the foundation for the investigations of other biological disciplines. However, almost all the widely used phylogenetic methods have limitations which fail to eliminate systematic errors effectively, preventing the reconstruction of true organismal relationships. "Long-branch Attraction" (LBA) artifact is one of the most disturbing factors in phylogenetic reconstruction. In this review, the conception and analytic method as well as the avoidance strategy of LBA were summarized. In addition, several typical examples were provided. The approach to avoid and resolve LBA artifact has been discussed.

  13. Identifying sources of Pb pollution in urban soils by means of MC-ICP-MS and TOF-SIMS.

    PubMed

    Rodríguez-Seijo, Andrés; Arenas-Lago, Daniel; Andrade, María Luisa; Vega, Flora A

    2015-05-01

    Lead pollution was evaluated in 17 urban soils from parks and gardens in the city of Vigo (NW Spain). The Pb isotope ratios ((207)Pb/(206)Pb, (208)Pb/(204)Pb, (206)Pb/(204)Pb and (208)Pb/(206)Pb) were determined after being measured by MC-ICP-MS. The association of the isotopes ((204)Pb, (206)Pb, (207)Pb and (208)Pb) with the different components of the soil was studied using TOF-SIMS. The isotopic ranges obtained for the samples were between 1.116 and 1.203 ((206)Pb/(207)Pb), 2.044-2.143 ((208)Pb/(206)Pb), 37.206-38.608 ((208)Pb/(204)Pb), 15.5482-15.6569 ((207)Pb/(204)Pb) and 17.357-18.826 ((206)Pb/(204)Pb). The application of the three-end-member model indicates that the Pb derived from petrol is the main source of Pb in the soils (43.51% on average), followed by natural or geogenic Pb (39.12%) and industrial emissions (17.37%). The emissions derived from coal combustion do not appear to influence the content of Pb in the soil. TOF-SIMS images show that the Pb mainly interacts with organic matter. This technique contributes to the understanding of the association of anthropogenic Pb with the components of the soil, as well as the particle size of these associations, thus allowing the possible sources of Pb to be identified.

  14. High School Students’ Learning and Perceptions of Phylogenetics of Flowering Plants

    PubMed Central

    Landis, Jacob B.; Crippen, Kent J.

    2014-01-01

    Basic phylogenetics and associated “tree thinking” are often minimized or excluded in formal school curricula. Informal settings provide an opportunity to extend the K–12 school curriculum, introducing learners to new ideas, piquing interest in science, and fostering scientific literacy. Similarly, university researchers participating in science, technology, engineering, and mathematics (STEM) outreach activities increase awareness of college and career options and highlight interdisciplinary fields of science research and augment the science curriculum. To aid in this effort, we designed a 6-h module in which students utilized 12 flowering plant species to generate morphological and molecular phylogenies using biological techniques and bioinformatics tools. The phylogenetics module was implemented with 83 high school students during a weeklong university STEM immersion program and aimed to increase student understanding of phylogenetics and coevolution of plants and pollinators. Student response reflected positive engagement and learning gains as evidenced through content assessments, program evaluation surveys, and program artifacts. We present the results of the first year of implementation and discuss modifications for future use in our immersion programs as well as in multiple course settings at the high school and undergraduate levels. PMID:25452488

  15. Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis

    PubMed Central

    Gibbs, S; Collard, M; Wood, B

    2002-01-01

    This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence. PMID:11833653

  16. Soft-tissue anatomy of the extant hominoids: a review and phylogenetic analysis.

    PubMed

    Gibbs, S; Collard, M; Wood, B

    2002-01-01

    This paper reports the results of a literature search for information about the soft-tissue anatomy of the extant non-human hominoid genera, Pan, Gorilla, Pongo and Hylobates, together with the results of a phylogenetic analysis of these data plus comparable data for Homo. Information on the four extant non-human hominoid genera was located for 240 out of the 1783 soft-tissue structures listed in the Nomina Anatomica. Numerically these data are biased so that information about some systems (e.g. muscles) and some regions (e.g. the forelimb) are over-represented, whereas other systems and regions (e.g. the veins and the lymphatics of the vascular system, the head region) are either under-represented or not represented at all. Screening to ensure that the data were suitable for use in a phylogenetic analysis reduced the number of eligible soft-tissue structures to 171. These data, together with comparable data for modern humans, were converted into discontinuous character states suitable for phylogenetic analysis and then used to construct a taxon-by-character matrix. This matrix was used in two tests of the hypothesis that soft-tissue characters can be relied upon to reconstruct hominoid phylogenetic relationships. In the first, parsimony analysis was used to identify cladograms requiring the smallest number of character state changes. In the second, the phylogenetic bootstrap was used to determine the confidence intervals of the most parsimonious clades. The parsimony analysis yielded a single most parsimonious cladogram that matched the molecular cladogram. Similarly the bootstrap analysis yielded clades that were compatible with the molecular cladogram; a (Homo, Pan) clade was supported by 95% of the replicates, and a (Gorilla, Pan, Homo) clade by 96%. These are the first hominoid morphological data to provide statistically significant support for the clades favoured by the molecular evidence.

  17. Archaeal phylogeny: reexamination of the phylogenetic position of Archaeoglobus fulgidus in light of certain composition-induced artifacts

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Achenbach, L.; Rouviere, P.; Mandelco, L.

    1991-01-01

    A major and too little recognized source of artifact in phylogenetic analysis of molecular sequence data is compositional difference among sequences. The problem becomes particularly acute when alignments contain ribosomal RNAs from both mesophilic and thermophilic species. Among prokaryotes the latter are considerably higher in G + C content than the former, which often results in artificial clustering of thermophilic lineages and their being placed artificially deep in phylogenetic trees. In this communication we review archaeal phylogeny in the light of this consideration, focusing in particular on the phylogenetic position of the sulfate reducing species Archaeoglobus fulgidus, using both 16S rRNA and 23S rRNA sequences. The analysis shows clearly that the previously reported deep branching of the A. fulgidus lineage (very near the base of the euryarchaeal side of the archaeal tree) is incorrect, and that the lineage actually groups with a previously recognized unit that comprises the Methanomicrobiales and extreme halophiles.

  18. Prioritizing Populations for Conservation Using Phylogenetic Networks

    PubMed Central

    Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.

    2014-01-01

    In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451

  19. Integrated analyses using RNA-Seq data reveal viral genomes, single nucleotide variations, the phylogenetic relationship, and recombination for Apple stem grooving virus.

    PubMed

    Jo, Yeonhwa; Choi, Hoseong; Kim, Sang-Min; Kim, Sun-Lim; Lee, Bong Choon; Cho, Won Kyong

    2016-08-09

    Next-generation sequencing (NGS) provides many possibilities for plant virology research. In this study, we performed integrated analyses using plant transcriptome data for plant virus identification using Apple stem grooving virus (ASGV) as an exemplar virus. We used 15 publicly available transcriptome libraries from three different studies, two mRNA-Seq studies and a small RNA-Seq study. We de novo assembled nearly complete genomes of ASGV isolates Fuji and Cuiguan from apple and pear transcriptomes, respectively, and identified single nucleotide variations (SNVs) of ASGV within the transcriptomes. We demonstrated the application of NGS raw data to confirm viral infections in the plant transcriptomes. In addition, we compared the usability of two de novo assemblers, Trinity and Velvet, for virus identification and genome assembly. A phylogenetic tree revealed that ASGV and Citrus tatter leaf virus (CTLV) are the same virus, which was divided into two clades. Recombination analyses identified six recombination events from 21 viral genomes. Taken together, our in silico analyses using NGS data provide a successful application of plant transcriptomes to reveal extensive information associated with viral genome assembly, SNVs, phylogenetic relationships, and genetic recombination.

  20. From symmetry to asymmetry: Phylogenetic patterns of asymmetry variation in animals and their evolutionary significance

    PubMed Central

    Palmer, A. Richard

    1996-01-01

    Phylogenetic analyses of asymmetry variation offer a powerful tool for exploring the interplay between ontogeny and evolution because (i) conspicuous asymmetries exist in many higher metazoans with widely varying modes of development, (ii) patterns of bilateral variation within species may identify genetically and environmentally triggered asymmetries, and (iii) asymmetries arising at different times during development may be more sensitive to internal cytoplasmic inhomogeneities compared to external environmental stimuli. Using four broadly comparable asymmetry states (symmetry, antisymmetry, dextral, and sinistral), and two stages at which asymmetry appears developmentally (larval and postlarval), I evaluated relations between ontogenetic and phylogenetic patterns of asymmetry variation. Among 140 inferred phylogenetic transitions between asymmetry states, recorded from 11 classes in five phyla, directional asymmetry (dextral or sinistral) evolved directly from symmetrical ancestors proportionally more frequently among larval asymmetries. In contrast, antisymmetry, either as an end state or as a transitional stage preceding directional asymmetry, was confined primarily to postlarval asymmetries. The ontogenetic origin of asymmetry thus significantly influences its subsequent evolution. Furthermore, because antisymmetry typically signals an environmentally triggered asymmetry, the phylogenetic transition from antisymmetry to directional asymmetry suggests that many cases of laterally fixed asymmetries evolved via genetic assimilation. PMID:8962039

  1. Homoplasious colony morphology and mito-nuclear phylogenetic discordance among Eastern Pacific octocorals.

    PubMed

    Ament-Velásquez, Sandra L; Breedy, Odalisca; Cortés, Jorge; Guzman, Hector M; Wörheide, Gert; Vargas, Sergio

    2016-05-01

    Octocorals are a diverse and ecologically important group of cnidarians. However, the phylogenetic relationships of many octocoral groups are not well understood and are based mostly on mitochondrial sequence data. In addition, the discovery and description of new gorgonian species displaying unusual or intermediate morphologies and uncertain phylogenetic affinities further complicates the study of octocoral systematics and raises questions about the role played by processes such as plasticity, crypsis, and convergence in the evolution of this group of organisms. Here, we use nuclear (i.e. 28S rDNA) and mitochondrial (mtMutS) markers and a sample of Eastern Pacific gorgonians thought to be remarkable from a morphological point of view to shed light on the morphological diversification among these organisms. Our study reveals the loss of the anastomosed colony morphology in two unrelated lineages of the seafan genus Pacifigorgia and offers strong evidence for the independent evolution of a whip-like morphology in two lineages of Eastern Pacific Leptogorgia. Additionally, our data revealed one instance of mito-nuclear discordance in the genera Leptogorgia and Eugorgia, which may be the results of incomplete lineage sorting or ancient hybridization-introgression events. Our study stresses the importance of comprehensive taxonomic sampling and the use of independent sources of evidence to address the phylogenetic relationships and clarifying the evolution of octocorals. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Phylogenetic structure of soil bacterial communities predicts ecosystem functioning.

    PubMed

    Pérez-Valera, Eduardo; Goberna, Marta; Verdú, Miguel

    2015-05-01

    Quantifying diversity with phylogeny-informed metrics helps understand the effects of diversity on ecosystem functioning (EF). The sign of these effects remains controversial because phylogenetic diversity and taxonomic identity may interactively influence EF. Positive relationships, traditionally attributed to complementarity effects, seem unimportant in natural soil bacterial communities. Negative relationships could be attributed to fitness differences leading to the overrepresentation of few productive clades, a mechanism recently invoked to assemble soil bacteria communities. We tested in two ecosystems contrasting in terms of environmental heterogeneity whether two metrics of phylogenetic community structure, a simpler measure of phylogenetic diversity (NRI) and a more complex metric incorporating taxonomic identity (PCPS), correctly predict microbially mediated EF. We show that the relationship between phylogenetic diversity and EF depends on the taxonomic identity of the main coexisting lineages. Phylogenetic diversity was negatively related to EF in soils where a marked fertility gradient exists and a single and productive clade (Proteobacteria) outcompete other clades in the most fertile plots. However, phylogenetic diversity was unrelated to EF in soils where the fertility gradient is less marked and Proteobacteria coexist with other abundant lineages. Including the taxonomic identity of bacterial lineages in metrics of phylogenetic community structure allows the prediction of EF in both ecosystems. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Conservation Action Based on Threatened Species Capture Taxonomic and Phylogenetic Richness in Breeding and Wintering Populations of Central Asian Birds

    PubMed Central

    Schweizer, Manuel; Ayé, Raffael; Kashkarov, Roman; Roth, Tobias

    2014-01-01

    Although phylogenetic diversity has been suggested to be relevant from a conservation point of view, its role is still limited in applied nature conservation. Recently, the practice of investing conservation resources based on threatened species was identified as a reason for the slow integration of phylogenetic diversity in nature conservation planning. One of the main arguments is based on the observation that threatened species are not evenly distributed over the phylogenetic tree. However this argument seems to dismiss the fact that conservation action is a spatially explicit process, and even if threatened species are not evenly distributed over the phylogenetic tree, the occurrence of threatened species could still indicate areas with above average phylogenetic diversity and consequently could protect phylogenetic diversity. Here we aim to study the selection of important bird areas in Central Asia, which were nominated largely based on the presence of threatened bird species. We show that although threatened species occurring in Central Asia do not capture phylogenetically more distinct species than expected by chance, the current spatially explicit conservation approach of selecting important bird areas covers above average taxonomic and phylogenetic diversity of breeding and wintering birds. We conclude that the spatially explicit processes of conservation actions need to be considered in the current discussion of whether new prioritization methods are needed to complement conservation action based on threatened species. PMID:25337861

  4. A phylogenetic transform enhances analysis of compositional microbiota data

    PubMed Central

    Silverman, Justin D; Washburne, Alex D; Mukherjee, Sayan; David, Lawrence A

    2017-01-01

    Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities. DOI: http://dx.doi.org/10.7554/eLife.21887.001 PMID:28198697

  5. Prokaryotic diversity, composition structure, and phylogenetic analysis of microbial communities in leachate sediment ecosystems.

    PubMed

    Liu, Jingjing; Wu, Weixiang; Chen, Chongjun; Sun, Faqian; Chen, Yingxu

    2011-09-01

    In order to obtain insight into the prokaryotic diversity and community in leachate sediment, a culture-independent DNA-based molecular phylogenetic approach was performed with archaeal and bacterial 16S rRNA gene clone libraries derived from leachate sediment of an aged landfill. A total of 59 archaeal and 283 bacterial rDNA phylotypes were identified in 425 archaeal and 375 bacterial analyzed clones. All archaeal clones distributed within two archaeal phyla of the Euryarchaeota and Crenarchaeota, and well-defined methanogen lineages, especially Methanosaeta spp., are the most numerically dominant species of the archaeal community. Phylogenetic analysis of the bacterial library revealed a variety of pollutant-degrading and biotransforming microorganisms, including 18 distinct phyla. A substantial fraction of bacterial clones showed low levels of similarity with any previously documented sequences and thus might be taxonomically new. Chemical characteristics and phylogenetic inferences indicated that (1) ammonium-utilizing bacteria might form consortia to alleviate or avoid the negative influence of high ammonium concentration on other microorganisms, and (2) members of the Crenarchaeota found in the sediment might be involved in ammonium oxidation. This study is the first to report the composition of the microbial assemblages and phylogenetic characteristics of prokaryotic populations extant in leachate sediment. Additional work on microbial activity and contaminant biodegradation remains to be explored.

  6. Chemical screening method for the rapid identification of microbial sources of marine invertebrate-associated metabolites.

    PubMed

    Berrue, Fabrice; Withers, Sydnor T; Haltli, Brad; Withers, Jo; Kerr, Russell G

    2011-03-21

    Marine invertebrates have proven to be a rich source of secondary metabolites. The growing recognition that marine microorganisms associated with invertebrate hosts are involved in the biosynthesis of secondary metabolites offers new alternatives for the discovery and development of marine natural products. However, the discovery of microorganisms producing secondary metabolites previously attributed to an invertebrate host poses a significant challenge. This study describes an efficient chemical screening method utilizing a 96-well plate-based bacterial cultivation strategy to identify and isolate microbial producers of marine invertebrate-associated metabolites.

  7. A Genome-Wide Association Study Identifies Genetic Variants Associated with Mathematics Ability

    PubMed Central

    Chen, Huan; Gu, Xiao-hong; Zhou, Yuxi; Ge, Zeng; Wang, Bin; Siok, Wai Ting; Wang, Guoqing; Huen, Michael; Jiang, Yuyang; Tan, Li-Hai; Sun, Yimin

    2017-01-01

    Mathematics ability is a complex cognitive trait with polygenic heritability. Genome-wide association study (GWAS) has been an effective approach to investigate genetic components underlying mathematic ability. Although previous studies reported several candidate genetic variants, none of them exceeded genome-wide significant threshold in general populations. Herein, we performed GWAS in Chinese elementary school students to identify potential genetic variants associated with mathematics ability. The discovery stage included 494 and 504 individuals from two independent cohorts respectively. The replication stage included another cohort of 599 individuals. In total, 28 of 81 candidate SNPs that met validation criteria were further replicated. Combined meta-analysis of three cohorts identified four SNPs (rs1012694, rs11743006, rs17778739 and rs17777541) of SPOCK1 gene showing association with mathematics ability (minimum p value 5.67 × 10−10, maximum β −2.43). The SPOCK1 gene is located on chromosome 5q31.2 and encodes a highly conserved glycoprotein testican-1 which was associated with tumor progression and prognosis as well as neurogenesis. This is the first study to report genome-wide significant association of individual SNPs with mathematics ability in general populations. Our preliminary results further supported the role of SPOCK1 during neurodevelopment. The genetic complexities underlying mathematics ability might contribute to explain the basis of human cognition and intelligence at genetic level. PMID:28155865

  8. A Genome-Wide Association Study Identifies Genetic Variants Associated with Mathematics Ability.

    PubMed

    Chen, Huan; Gu, Xiao-Hong; Zhou, Yuxi; Ge, Zeng; Wang, Bin; Siok, Wai Ting; Wang, Guoqing; Huen, Michael; Jiang, Yuyang; Tan, Li-Hai; Sun, Yimin

    2017-02-03

    Mathematics ability is a complex cognitive trait with polygenic heritability. Genome-wide association study (GWAS) has been an effective approach to investigate genetic components underlying mathematic ability. Although previous studies reported several candidate genetic variants, none of them exceeded genome-wide significant threshold in general populations. Herein, we performed GWAS in Chinese elementary school students to identify potential genetic variants associated with mathematics ability. The discovery stage included 494 and 504 individuals from two independent cohorts respectively. The replication stage included another cohort of 599 individuals. In total, 28 of 81 candidate SNPs that met validation criteria were further replicated. Combined meta-analysis of three cohorts identified four SNPs (rs1012694, rs11743006, rs17778739 and rs17777541) of SPOCK1 gene showing association with mathematics ability (minimum p value 5.67 × 10 -10 , maximum β -2.43). The SPOCK1 gene is located on chromosome 5q31.2 and encodes a highly conserved glycoprotein testican-1 which was associated with tumor progression and prognosis as well as neurogenesis. This is the first study to report genome-wide significant association of individual SNPs with mathematics ability in general populations. Our preliminary results further supported the role of SPOCK1 during neurodevelopment. The genetic complexities underlying mathematics ability might contribute to explain the basis of human cognition and intelligence at genetic level.

  9. Let them fall where they may: congruence analysis in massive phylogenetically messy data sets.

    PubMed

    Leigh, Jessica W; Schliep, Klaus; Lopez, Philippe; Bapteste, Eric

    2011-10-01

    Interest in congruence in phylogenetic data has largely focused on issues affecting multicellular organisms, and animals in particular, in which the level of incongruence is expected to be relatively low. In addition, assessment methods developed in the past have been designed for reasonably small numbers of loci and scale poorly for larger data sets. However, there are currently over a thousand complete genome sequences available and of interest to evolutionary biologists, and these sequences are predominantly from microbial organisms, whose molecular evolution is much less frequently tree-like than that of multicellular life forms. As such, the level of incongruence in these data is expected to be high. We present a congruence method that accommodates both very large numbers of genes and high degrees of incongruence. Our method uses clustering algorithms to identify subsets of genes based on similarity of phylogenetic signal. It involves only a single phylogenetic analysis per gene, and therefore, computation time scales nearly linearly with the number of genes in the data set. We show that our method performs very well with sets of sequence alignments simulated under a wide variety of conditions. In addition, we present an analysis of core genes of prokaryotes, often assumed to have been largely vertically inherited, in which we identify two highly incongruent classes of genes. This result is consistent with the complexity hypothesis.

  10. A program to compute the soft Robinson-Foulds distance between phylogenetic networks.

    PubMed

    Lu, Bingxin; Zhang, Louxin; Leong, Hon Wai

    2017-03-14

    Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both the problems are NP-complete. A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson-Foulds distance between phylogenetic networks. Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, the distribution of the Soft Robinson-Foulds distance between phylogenetic networks is demonstrated to be unlikely normal by our simulation data.

  11. Auditing Associative Relations across Two Knowledge Sources

    PubMed Central

    Vizenor, Lowell T.; Bodenreider, Olivier; McCray, Alexa T.

    2009-01-01

    Objectives This paper proposes a novel semantic method for auditing associative relations in biomedical terminologies. We tested our methodology on two Unified Medical Language System (UMLS) knowledge sources. Methods We use the UMLS semantic groups as high-level representations of the domain and range of relationships in the Metathesaurus and in the Semantic Network. A mapping created between Metathesaurus relationships and Semantic Network relationships forms the basis for comparing the signatures of a given Metathesaurus relationship to the signatures of the semantic relationship to which it is mapped. The consistency of Metathesaurus relations is studied for each relationship. Results Of the 177 associative relationships in the Metathesaurus, 84 (48%) exhibit a high degree of consistency with the corresponding Semantic Network relationships. Overall, 63% of the 1.8M associative relations in the Metathesaurus are consistent with relations in the Semantic Network. Conclusion The semantics of associative relationships in biomedical terminologies should be defined explicitly by their developers. The Semantic Network would benefit from being extended with new relationships and with new relations for some existing relationships. The UMLS editing environment could take advantage of the correspondence established between relationships in the Metathesaurus and the Semantic Network. Finally, the auditing method also yielded useful information for refining the mapping of associative relationships between the two sources. PMID:19475724

  12. Searching for Compact Radio Sources Associated with UCH ii Regions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Masqué, Josep M.; Trinidad, Miguel A.; Rodríguez-Rico, Carlos A.

    Ultra-compact (UC)H ii regions represent a very early stage of massive star formation. The structure and evolution of these regions are not yet fully understood. Interferometric observations showed in recent years that compact sources of uncertain nature are associated with some UCH ii regions. To examine this, we carried out VLA 1.3 cm observations in the A configuration of selected UCH ii regions in order to report additional cases of compact sources embedded in UCH ii regions. With these observations, we find 13 compact sources that are associated with 9 UCH ii regions. Although we cannot establish an unambiguous naturemore » for the newly detected sources, we assess some of their observational properties. According to the results, we can distinguish between two types of compact sources. One type corresponds to sources that are probably deeply embedded in the dense ionized gas of the UCH ii region. These sources are photoevaporated by the exciting star of the region and will last for 10{sup 4}–10{sup 5} years. They may play a crucial role in the evolution of the UCH ii region as the photoevaporated material could replenish the expanding plasma and might provide a solution to the so-called lifetime problem of these regions. The second type of compact sources is not associated with the densest ionized gas of the region. A few of these sources appear resolved and may be photoevaporating objects such as those of the first type, but with significantly lower mass depletion rates. The remaining sources of this second type appear unresolved, and their properties are varied. We speculate on the similarity between the sources of the second type and those of the Orion population of radio sources.« less

  13. Inferring influenza global transmission networks without complete phylogenetic information

    PubMed Central

    Aris-Brosou, Stéphane

    2014-01-01

    Influenza is one of the most severe respiratory infections affecting humans throughout the world, yet the dynamics of its global transmission network are still contentious. Here, I describe a novel combination of phylogenetics, time series, and graph theory to analyze 14.25 years of data stratified in space and in time, focusing on the main target of the human immune response, the hemagglutinin gene. While bypassing the complete phylogenetic inference of huge data sets, the method still extracts information suggesting that waves of genetic or of nucleotide diversity circulate continuously around the globe for subtypes that undergo sustained transmission over several seasons, such as H3N2 and pandemic H1N1/09, while diversity of prepandemic H1N1 viruses had until 2009 a noncontinuous transmission pattern consistent with a source/sink model. Irrespective of the shift in the structure of H1N1 diversity circulation with the emergence of the pandemic H1N1/09 strain, US prevalence peaks during the winter months when genetic diversity is at its lowest. This suggests that a dominant strain is generally responsible for epidemics and that monitoring genetic and/or nucleotide diversity in real time could provide public health agencies with an indirect estimate of prevalence. PMID:24665342

  14. Two C++ Libraries for Counting Trees on a Phylogenetic Terrace.

    PubMed

    Biczok, R; Bozsoky, P; Eisenmann, P; Ernst, J; Ribizel, T; Scholz, F; Trefzer, A; Weber, F; Hamann, M; Stamatakis, A

    2018-05-08

    The presence of terraces in phylogenetic tree space, that is, a potentially large number of distinct tree topologies that have exactly the same analytical likelihood score, was first described by Sanderson et al. (2011). However, popular software tools for maximum likelihood and Bayesian phylogenetic inference do not yet routinely report, if inferred phylogenies reside on a terrace, or not. We believe, this is due to the lack of an efficient library to (i) determine if a tree resides on a terrace, (ii) calculate how many trees reside on a terrace, and (iii) enumerate all trees on a terrace. In our bioinformatics practical that is set up as a programming contest we developed two efficient and independent C++ implementations of the SUPERB algorithm by Constantinescu and Sankoff (1995) for counting and enumerating trees on a terrace. Both implementations yield exactly the same results, are more than one order of magnitude faster, and require one order of magnitude less memory than a previous 3rd party python implementation. The source codes are available under GNU GPL at https://github.com/terraphast. Alexandros.Stamatakis@h-its.org. Supplementary data are available at Bioinformatics online.

  15. Arcobacter in Lake Erie beach waters: an emerging gastrointestinal pathogen linked with human-associated fecal contamination.

    PubMed

    Lee, Cheonghoon; Agidi, Senyo; Marion, Jason W; Lee, Jiyoung

    2012-08-01

    The genus Arcobacter has been associated with human illness and fecal contamination by humans and animals. To better characterize the health risk posed by this emerging waterborne pathogen, we investigated the occurrence of Arcobacter spp. in Lake Erie beach waters. During the summer of 2010, water samples were collected 35 times from the Euclid, Villa Angela, and Headlands (East and West) beaches, located along Ohio's Lake Erie coast. After sample concentration, Arcobacter was quantified by real-time PCR targeting the Arcobacter 23S rRNA gene. Other fecal genetic markers (Bacteroides 16S rRNA gene [HuBac], Escherichia coli uidA gene, Enterococcus 23S rRNA gene, and tetracycline resistance genes) were also assessed. Arcobacter was detected frequently at all beaches, and both the occurrence and densities of Arcobacter spp. were higher at the Euclid and Villa Angela beaches (with higher levels of fecal contamination) than at the East and West Headlands beaches. The Arcobacter density in Lake Erie beach water was significantly correlated with the human-specific fecal marker HuBac according to Spearman's correlation analysis (r = 0.592; P < 0.001). Phylogenetic analysis demonstrated that most of the identified Arcobacter sequences were closely related to Arcobacter cryaerophilus, which is known to cause gastrointestinal diseases in humans. Since human-pathogenic Arcobacter spp. are linked to human-associated fecal sources, it is important to identify and manage the human-associated contamination sources for the prevention of Arcobacter-associated public health risks at Lake Erie beaches.

  16. Chagas disease vector blood meal sources identified by protein mass spectrometry

    PubMed Central

    Keller, Judith I.; Ballif, Bryan A.; St. Clair, Riley M.; Vincent, James J.; Monroy, M. Carlota

    2017-01-01

    Chagas disease is a complex vector borne parasitic disease involving blood feeding Triatominae (Hemiptera: Reduviidae) insects, also known as kissing bugs, and the vertebrates they feed on. This disease has tremendous impacts on millions of people and is a global health problem. The etiological agent of Chagas disease, Trypanosoma cruzi (Kinetoplastea: Trypanosomatida: Trypanosomatidae), is deposited on the mammalian host in the insect’s feces during a blood meal, and enters the host’s blood stream through mucous membranes or a break in the skin. Identifying the blood meal sources of triatomine vectors is critical in understanding Chagas disease transmission dynamics, can lead to identification of other vertebrates important in the transmission cycle, and aids management decisions. The latter is particularly important as there is little in the way of effective therapeutics for Chagas disease. Several techniques, mostly DNA-based, are available for blood meal identification. However, further methods are needed, particularly when sample conditions lead to low-quality DNA or to assess the risk of human cross-contamination. We demonstrate a proteomics-based approach, using liquid chromatography tandem mass spectrometry (LC-MS/MS) to identify host-specific hemoglobin peptides for blood meal identification in mouse blood control samples and apply LC-MS/MS for the first time to Triatoma dimidiata insect vectors, tracing blood sources to species. In contrast to most proteins, hemoglobin, stabilized by iron, is incredibly stable even being preserved through geologic time. We compared blood stored with and without an anticoagulant and examined field-collected insect specimens stored in suboptimal conditions such as at room temperature for long periods of time. To our knowledge, this is the first study using LC-MS/MS on field-collected arthropod disease vectors to identify blood meal composition, and where blood meal identification was confirmed with more traditional DNA

  17. Phylogenetic system and zoogeography of the Plecoptera.

    PubMed

    Zwick, P

    2000-01-01

    Information about the phylogenetic relationships of Plecoptera is summarized. The few characters supporting monophyly of the order are outlined. Several characters of possible significance for the search for the closest relatives of the stoneflies are discussed, but the sister-group of the order remains unknown. Numerous characters supporting the presently recognized phylogenetic system of Plecoptera are presented, alternative classifications are discussed, and suggestions for future studies are made. Notes on zoogeography are appended. The order as such is old (Permian fossils), but phylogenetic relationships and global distribution patterns suggest that evolution of the extant suborders started with the breakup of Pangaea. There is evidence of extensive recent speciation in all parts of the world.

  18. Epidemiological study of phylogenetic transmission clusters in a local HIV-1 epidemic reveals distinct differences between subtype B and non-B infections.

    PubMed

    Chalmet, Kristen; Staelens, Delfien; Blot, Stijn; Dinakis, Sylvie; Pelgrom, Jolanda; Plum, Jean; Vogelaers, Dirk; Vandekerckhove, Linos; Verhofstede, Chris

    2010-09-07

    The number of HIV-1 infected individuals in the Western world continues to rise. More in-depth understanding of regional HIV-1 epidemics is necessary for the optimal design and adequate use of future prevention strategies. The use of a combination of phylogenetic analysis of HIV sequences, with data on patients' demographics, infection route, clinical information and laboratory results, will allow a better characterization of individuals responsible for local transmission. Baseline HIV-1 pol sequences, obtained through routine drug-resistance testing, from 506 patients, newly diagnosed between 2001 and 2009, were used to construct phylogenetic trees and identify transmission-clusters. Patients' demographics, laboratory and clinical data, were retrieved anonymously. Statistical analysis was performed to identify subtype-specific and transmission-cluster-specific characteristics. Multivariate analysis showed significant differences between the 59.7% of individuals with subtype B infection and the 40.3% non-B infected individuals, with regard to route of transmission, origin, infection with Chlamydia (p = 0.01) and infection with Hepatitis C virus (p = 0.017). More and larger transmission-clusters were identified among the subtype B infections (p < 0.001). Overall, in multivariate analysis, clustering was significantly associated with Caucasian origin, infection through homosexual contact and younger age (all p < 0.001). Bivariate analysis additionally showed a correlation between clustering and syphilis (p < 0.001), higher CD4 counts (p = 0.002), Chlamydia infection (p = 0.013) and primary HIV (p = 0.017). Combination of phylogenetics with demographic information, laboratory and clinical data, revealed that HIV-1 subtype B infected Caucasian men-who-have-sex-with-men with high prevalence of sexually transmitted diseases, account for the majority of local HIV-transmissions. This finding elucidates observed epidemiological trends through molecular analysis, and

  19. Phylogenetic relationships in Cortinarius, section Calochroi, inferred from nuclear DNA sequences

    PubMed Central

    Garnica, Sigisfredo; Weiß, Michael; Oertel, Bernhard; Ammirati, Joseph; Oberwinkler, Franz

    2009-01-01

    Background Section Calochroi is one of the most species-rich lineages in the genus Cortinarius (Agaricales, Basidiomycota) and is widely distributed across boreo-nemoral areas, with some extensions into meridional zones. Previous phylogenetic studies of Calochroi (incl. section Fulvi) have been geographically restricted; therefore, phylogenetic and biogeographic relationships within this lineage at a global scale have been largely unknown. In this study, we obtained DNA sequences from a nearly complete taxon sampling of known species from Europe, Central America and North America. We inferred intra- and interspecific phylogenetic relationships as well as major morphological evolutionary trends within section Calochroi based on 576 ITS sequences, 230 ITS + 5.8S + D1/D2 sequences, and a combined dataset of ITS + 5.8S + D1/D2 and RPB1 sequences of a representative subsampling of 58 species. Results More than 100 species were identified by integrating DNA sequences with morphological, macrochemical and ecological data. Cortinarius section Calochroi was consistently resolved with high branch support into at least seven major lineages: Calochroi, Caroviolacei, Dibaphi, Elegantiores, Napi, Pseudoglaucopodes and Splendentes; whereas Rufoolivacei and Sulfurini appeared polyphyletic. A close relationship between Dibaphi, Elegantiores, Napi and Splendentes was consistently supported. Combinations of specific morphological, pigmentation and molecular characters appear useful in circumscribing clades. Conclusion Our analyses demonstrate that Calochroi is an exclusively northern hemispheric lineage, where species follow their host trees throughout their natural ranges within and across continents. Results of this study contribute substantially to defining European species in this group and will help to either identify or to name new species occurring across the northern hemisphere. Major groupings are in partial agreement with earlier morphology-based and molecular phylogenetic

  20. Cophenetic metrics for phylogenetic trees, after Sokal and Rohlf.

    PubMed

    Cardona, Gabriel; Mir, Arnau; Rosselló, Francesc; Rotger, Lucía; Sánchez, David

    2013-01-16

    Phylogenetic tree comparison metrics are an important tool in the study of evolution, and hence the definition of such metrics is an interesting problem in phylogenetics. In a paper in Taxon fifty years ago, Sokal and Rohlf proposed to measure quantitatively the difference between a pair of phylogenetic trees by first encoding them by means of their half-matrices of cophenetic values, and then comparing these matrices. This idea has been used several times since then to define dissimilarity measures between phylogenetic trees but, to our knowledge, no proper metric on weighted phylogenetic trees with nested taxa based on this idea has been formally defined and studied yet. Actually, the cophenetic values of pairs of different taxa alone are not enough to single out phylogenetic trees with weighted arcs or nested taxa. For every (rooted) phylogenetic tree T, let its cophenetic vectorφ(T) consist of all pairs of cophenetic values between pairs of taxa in T and all depths of taxa in T. It turns out that these cophenetic vectors single out weighted phylogenetic trees with nested taxa. We then define a family of cophenetic metrics dφ,p by comparing these cophenetic vectors by means of Lp norms, and we study, either analytically or numerically, some of their basic properties: neighbors, diameter, distribution, and their rank correlation with each other and with other metrics. The cophenetic metrics can be safely used on weighted phylogenetic trees with nested taxa and no restriction on degrees, and they can be computed in O(n2) time, where n stands for the number of taxa. The metrics dφ,1 and dφ,2 have positive skewed distributions, and they show a low rank correlation with the Robinson-Foulds metric and the nodal metrics, and a very high correlation with each other and with the splitted nodal metrics. The diameter of dφ,p, for p⩾1 , is in O(n(p+2)/p), and thus for low p they are more discriminative, having a wider range of values.

  1. Ensemble classification for identifying neighbourhood sources of fugitive dust and associations with observed PM10

    NASA Astrophysics Data System (ADS)

    Khuluse-Makhanya, Sibusisiwe; Stein, Alfred; Breytenbach, André; Gxumisa, Athi; Dudeni-Tlhone, Nontembeko; Debba, Pravesh

    2017-10-01

    In urban areas the deterioration of air quality as a result of fugitive dust receives less attention than the more prominent traffic and industrial emissions. We assessed whether fugitive dust emission sources in the neighbourhood of an air quality monitor are predictors of ambient PM10 concentrations on days characterized by strong local winds. An ensemble maximum likelihood method is developed for land cover mapping in the vicinity of an air quality station using SPOT 6 multi-spectral images. The ensemble maximum likelihood classifier is developed through multiple training iterations for improved accuracy of the bare soil class. Five primary land cover classes are considered, namely built-up areas, vegetation, bare soil, water and 'mixed bare soil' which denotes areas where soil is mixed with either vegetation or synthetic materials. Preliminary validation of the ensemble classifier for the bare soil class results in an accuracy range of 65-98%. Final validation of all classes results in an overall accuracy of 78%. Next, cluster analysis and a varying intercepts regression model are used to assess the statistical association between land cover, a fugitive dust emissions proxy and observed PM10. We found that land cover patterns in the neighbourhood of an air quality station are significant predictors of observed average PM10 concentrations on days when wind speeds are conducive for dust emissions. This study concludes that in the absence of an emissions inventory for ambient particulate matter, PM10 emitted from dust reservoirs can be statistically accounted for by land cover characteristics. This supports the use of land cover data for improved prediction of PM10 at locations without air quality monitoring stations.

  2. Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

    PubMed Central

    Dees, Jonathan; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa relatedness on phylogenetic trees, to measure the prevalence of correct taxa-relatedness interpretations, and to determine how student reasoning and correctness change in response to instruction and over time. Counting synapomorphies and nodes between taxa were the most common forms of incorrect reasoning, which presents a pedagogical dilemma concerning labeled synapomorphies on phylogenetic trees. Students also independently generated an alternative form of correct reasoning using monophyletic groups, the use of which decreased in popularity over time. Approximately half of all students were able to correctly interpret taxa relatedness on phylogenetic trees, and many memorized correct reasoning without understanding its application. Broad initial instruction that allowed students to generate inferences on their own contributed very little to phylogenetic tree understanding, while targeted instruction on evolutionary relationships improved understanding to some extent. Phylogenetic trees, which can directly affect student understanding of evolution, appear to offer introductory biology instructors a formidable pedagogical challenge. PMID:25452489

  3. Occurrence of bacteriophages infecting Aeromonas, Enterobacter, and Klebsiella in water and association with contamination sources in Thailand.

    PubMed

    Wangkahad, Bencharong; Bosup, Suchada; Mongkolsuk, Skorn; Sirikanchana, Kwanrawee

    2015-06-01

    The co-residence of bacteriophages and their bacterial hosts in humans, animals, and environmental sources directed the use of bacteriophages to track the origins of the pathogenic bacteria that can be found in contaminated water. The objective of this study was to enumerate bacteriophages of Aeromonas caviae (AecaKS148), Enterobacter sp. (EnspKS513), and Klebsiella pneumoniae (KlpnKS648) in water and evaluate their association with contamination sources (human vs. animals). Bacterial host strains were isolated from untreated wastewater in Bangkok, Thailand. A double-layer agar technique was used to detect bacteriophages. All three bacteriophages were detected in polluted canal samples, with likely contamination from human wastewater, whereas none was found in non-polluted river samples. AecaKS148 was found to be associated with human fecal sources, while EnspKS513 and KlpnKS648 seemed to be equally prevalent in both human and animal fecal sources. Both bacteriophages were also present in polluted canals that could receive contamination from other fecal sources or the environment. In conclusion, all three bacteriophages were successfully monitored in Bangkok, Thailand. This study provided an example of bacteriophages for potential use as source identifiers of pathogen contamination. The results from this study will assist in controlling sources of pathogen contamination, especially in developing countries.

  4. Potential use of ionic species for identifying source land-uses of stormwater runoff.

    PubMed

    Lee, Dong Hoon; Kim, Jin Hwi; Mendoza, Joseph A; Lee, Chang-Hee; Kang, Joo-Hyon

    2017-02-01

    Identifying critical land-uses or source areas is important to prioritize resources for cost-effective stormwater management. This study investigated the use of information on ionic composition as a fingerprint to identify the source land-use of stormwater runoff. We used 12 ionic species in stormwater runoff monitored for a total of 20 storm events at five sites with different land-use compositions during the 2012-2014 wet seasons. A stepwise forward discriminant function analysis (DFA) with the jack-knifed cross validation approach was used to select ionic species that better discriminate the land-use of its source. Of the 12 ionic species, 9 species (K + , Mg 2+ , Na + , NH 4 + , Br - , Cl - , F - , NO 2 - , and SO 4 2- ) were selected for better performance of the DFA. The DFA successfully differentiated stormwater samples from urban, rural, and construction sites using concentrations of the ionic species (70%, 95%, and 91% of correct classification, respectively). Over 80% of the new data cases were correctly classified by the trained DFA model. When applied to data cases from a mixed land-use catchment and downstream, the DFA model showed the greater impact of urban areas and rural areas respectively in the earlier and later parts of a storm event.

  5. A methodological investigation of hominoid craniodental morphology and phylogenetics.

    PubMed

    Bjarnason, Alexander; Chamberlain, Andrew T; Lockwood, Charles A

    2011-01-01

    The evolutionary relationships of extant great apes and humans have been largely resolved by molecular studies, yet morphology-based phylogenetic analyses continue to provide conflicting results. In order to further investigate this discrepancy we present bootstrap clade support of morphological data based on two quantitative datasets, one dataset consisting of linear measurements of the whole skull from 5 hominoid genera and the second dataset consisting of 3D landmark data from the temporal bone of 5 hominoid genera, including 11 sub-species. Using similar protocols for both datasets, we were able to 1) compare distance-based phylogenetic methods to cladistic parsimony of quantitative data converted into discrete character states, 2) vary outgroup choice to observe its effect on phylogenetic inference, and 3) analyse male and female data separately to observe the effect of sexual dimorphism on phylogenies. Phylogenetic analysis was sensitive to methodological decisions, particularly outgroup selection, where designation of Pongo as an outgroup and removal of Hylobates resulted in greater congruence with the proposed molecular phylogeny. The performance of distance-based methods also justifies their use in phylogenetic analysis of morphological data. It is clear from our analyses that hominoid phylogenetics ought not to be used as an example of conflict between the morphological and molecular, but as an example of how outgroup and methodological choices can affect the outcome of phylogenetic analysis. Copyright © 2010 Elsevier Ltd. All rights reserved.

  6. Sorting through the chaff, nDNA gene trees for phylogenetic inference and hybrid identification of annual sunflowers (Helianthus sect. Helianthus).

    PubMed

    Moody, Michael L; Rieseberg, Loren H

    2012-07-01

    The annual sunflowers (Helianthus sect. Helianthus) present a formidable challenge for phylogenetic inference because of ancient hybrid speciation, recent introgression, and suspected issues with deep coalescence. Here we analyze sequence data from 11 nuclear DNA (nDNA) genes for multiple genotypes of species within the section to (1) reconstruct the phylogeny of this group, (2) explore the utility of nDNA gene trees for detecting hybrid speciation and introgression; and (3) test an empirical method of hybrid identification based on the phylogenetic congruence of nDNA gene trees from tightly linked genes. We uncovered considerable topological heterogeneity among gene trees with or without three previously identified hybrid species included in the analyses, as well as a general lack of reciprocal monophyly of species. Nonetheless, partitioned Bayesian analyses provided strong support for the reciprocal monophyly of all species except H. annuus (0.89 PP), the most widespread and abundant annual sunflower. Previous hypotheses of relationships among taxa were generally strongly supported (1.0 PP), except among taxa typically associated with H. annuus, apparently due to the paraphyly of the latter in all gene trees. While the individual nDNA gene trees provided a useful means for detecting recent hybridization, identification of ancient hybridization was problematic for all ancient hybrid species, even when linkage was considered. We discuss biological factors that affect the efficacy of phylogenetic methods for hybrid identification.

  7. Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach

    PubMed Central

    Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John

    2014-01-01

    phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723

  8. A nuclear phylogenetic analysis: SNPs, indels and SSRs deliver new insights into the relationships in the 'true citrus fruit trees' group (Citrinae, Rutaceae) and the origin of cultivated species.

    PubMed

    Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    Despite differences in morphology, the genera representing 'true citrus fruit trees' are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial 'species' of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between 'true citrus fruit trees' were clarified. Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the

  9. Genetic association analysis identifies variants associated with disease progression in primary sclerosing cholangitis.

    PubMed

    Alberts, Rudi; de Vries, Elisabeth M G; Goode, Elizabeth C; Jiang, Xiaojun; Sampaziotis, Fotis; Rombouts, Krista; Böttcher, Katrin; Folseraas, Trine; Weismüller, Tobias J; Mason, Andrew L; Wang, Weiwei; Alexander, Graeme; Alvaro, Domenico; Bergquist, Annika; Björkström, Niklas K; Beuers, Ulrich; Björnsson, Einar; Boberg, Kirsten Muri; Bowlus, Christopher L; Bragazzi, Maria C; Carbone, Marco; Chazouillères, Olivier; Cheung, Angela; Dalekos, Georgios; Eaton, John; Eksteen, Bertus; Ellinghaus, David; Färkkilä, Martti; Festen, Eleonora A M; Floreani, Annarosa; Franceschet, Irene; Gotthardt, Daniel Nils; Hirschfield, Gideon M; Hoek, Bart van; Holm, Kristian; Hohenester, Simon; Hov, Johannes Roksund; Imhann, Floris; Invernizzi, Pietro; Juran, Brian D; Lenzen, Henrike; Lieb, Wolfgang; Liu, Jimmy Z; Marschall, Hanns-Ulrich; Marzioni, Marco; Melum, Espen; Milkiewicz, Piotr; Müller, Tobias; Pares, Albert; Rupp, Christian; Rust, Christian; Sandford, Richard N; Schramm, Christoph; Schreiber, Stefan; Schrumpf, Erik; Silverberg, Mark S; Srivastava, Brijesh; Sterneck, Martina; Teufel, Andreas; Vallier, Ludovic; Verheij, Joanne; Vila, Arnau Vich; Vries, Boudewijn de; Zachou, Kalliopi; Chapman, Roger W; Manns, Michael P; Pinzani, Massimo; Rushbrook, Simon M; Lazaridis, Konstantinos N; Franke, Andre; Anderson, Carl A; Karlsen, Tom H; Ponsioen, Cyriel Y; Weersma, Rinse K

    2017-08-04

    Primary sclerosing cholangitis (PSC) is a genetically complex, inflammatory bile duct disease of largely unknown aetiology often leading to liver transplantation or death. Little is known about the genetic contribution to the severity and progression of PSC. The aim of this study is to identify genetic variants associated with PSC disease progression and development of complications. We collected standardised PSC subphenotypes in a large cohort of 3402 patients with PSC. After quality control, we combined 130 422 single nucleotide polymorphisms of all patients-obtained using the Illumina immunochip-with their disease subphenotypes. Using logistic regression and Cox proportional hazards models, we identified genetic variants associated with binary and time-to-event PSC subphenotypes. We identified genetic variant rs853974 to be associated with liver transplant-free survival (p=6.07×10 -9 ). Kaplan-Meier survival analysis showed a 50.9% (95% CI 41.5% to 59.5%) transplant-free survival for homozygous AA allele carriers of rs853974 compared with 72.8% (95% CI 69.6% to 75.7%) for GG carriers at 10 years after PSC diagnosis. For the candidate gene in the region, RSPO3 , we demonstrated expression in key liver-resident effector cells, such as human and murine cholangiocytes and human hepatic stellate cells. We present a large international PSC cohort, and report genetic loci associated with PSC disease progression. For liver transplant-free survival, we identified a genome-wide significant signal and demonstrated expression of the candidate gene RSPO3 in key liver-resident effector cells. This warrants further assessments of the role of this potential key PSC modifier gene. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  10. A phylogenetic perspective on the individual species-area relationship in temperate and tropical tree communities.

    PubMed

    Yang, Jie; Swenson, Nathan G; Cao, Min; Chuyong, George B; Ewango, Corneille E N; Howe, Robert; Kenfack, David; Thomas, Duncan; Wolf, Amy; Lin, Luxiang

    2013-01-01

    Ecologists have historically used species-area relationships (SARs) as a tool to understand the spatial distribution of species. Recent work has extended SARs to focus on individual-level distributions to generate individual species area relationships (ISARs). The ISAR approach quantifies whether individuals of a species tend have more or less species richness surrounding them than expected by chance. By identifying richness 'accumulators' and 'repellers', respectively, the ISAR approach has been used to infer the relative importance of abiotic and biotic interactions and neutrality. A clear limitation of the SAR and ISAR approaches is that all species are treated as evolutionarily independent and that a large amount of work has now shown that local tree neighborhoods exhibit non-random phylogenetic structure given the species richness. Here, we use nine tropical and temperate forest dynamics plots to ask: (i) do ISARs change predictably across latitude?; (ii) is the phylogenetic diversity in the neighborhood of species accumulators and repellers higher or lower than that expected given the observed species richness?; and (iii) do species accumulators, repellers distributed non-randomly on the community phylogenetic tree? The results indicate no clear trend in ISARs from the temperate zone to the tropics and that the phylogenetic diversity surrounding the individuals of species is generally only non-random on very local scales. Interestingly the distribution of species accumulators and repellers was non-random on the community phylogenies suggesting the presence of phylogenetic signal in the ISAR across latitude.

  11. Identifying Rhodamine Dye Plume Sources in Near-Shore Oceanic Environments by Integration of Chemical and Visual Sensors

    PubMed Central

    Tian, Yu; Kang, Xiaodong; Li, Yunyi; Li, Wei; Zhang, Aiqun; Yu, Jiangchen; Li, Yiping

    2013-01-01

    This article presents a strategy for identifying the source location of a chemical plume in near-shore oceanic environments where the plume is developed under the influence of turbulence, tides and waves. This strategy includes two modules: source declaration (or identification) and source verification embedded in a subsumption architecture. Algorithms for source identification are derived from the moth-inspired plume tracing strategies based on a chemical sensor. The in-water test missions, conducted in November 2002 at San Clemente Island (California, USA) in June 2003 in Duck (North Carolina, USA) and in October 2010 at Dalian Bay (China), successfully identified the source locations after autonomous underwater vehicles tracked the rhodamine dye plumes with a significant meander over 100 meters. The objective of the verification module is to verify the declared plume source using a visual sensor. Because images taken in near shore oceanic environments are very vague and colors in the images are not well-defined, we adopt a fuzzy color extractor to segment the color components and recognize the chemical plume and its source by measuring color similarity. The source verification module is tested by images taken during the CPT missions. PMID:23507823

  12. Comprehensive phylogenetic analysis of bacterial reverse transcriptases.

    PubMed

    Toro, Nicolás; Nisa-Martínez, Rafael

    2014-01-01

    Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology.

  13. Comprehensive Phylogenetic Analysis of Bacterial Reverse Transcriptases

    PubMed Central

    Toro, Nicolás; Nisa-Martínez, Rafael

    2014-01-01

    Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology. PMID:25423096

  14. Taxonomic revision and phylogenetic analyses of rubber powdery mildew fungi.

    PubMed

    Liyanage, K K; Khan, Sehroon; Brooks, Siraprapa; Mortimer, Peter E; Karunarathna, Samantha C; Xu, Jianchu; Hyde, Kevin D

    2017-04-01

    Powdery mildew is a fungal disease that infects a wide range of plants, including rubber trees, which results in a reduction of latex yields of up to 45%. The causal agent of powdery mildew of rubber was first described as Oidium heveae, but later morpho-molecular research suggested that in the past, O. heveae has been confused with Erysiphe quercicola. However, it is still under debate whether the causal agent should be classified as a species of the genus Erysiphe emend. or Golovinomyces and Podosphaera, respectively. Therefore, the aim of this study was to undertake the morpho-molecular characterization of powdery mildew species associated with rubber trees, thus resolving these taxonomic issues. Morphological observation under light and scanning electron microscopes (SEM) clearly identified two morphotypes of the rubber powdery mildew. With the support of morphological and phylogenetic data, one of the two morphotypes was identified as the asexual morph of E. quercicola, while the second morphotype is still insufficiently known and according to the morphological results obtained we assume that it might belong to the genus Golovinomyces. More collections and additional molecular data are required for final conclusions regarding the exact taxonomic position of the second morphotype of rubber powdery mildew and its relation to the name O. heveae. The haplotype analysis identified eight haplotype groups of E. quercicola indicating the high genetic diversity of the species. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Climate Change Impacts on the Tree of Life: Changes in Phylogenetic Diversity Illustrated for Acropora Corals

    PubMed Central

    Faith, Daniel P.; Richards, Zoe T.

    2012-01-01

    The possible loss of whole branches from the tree of life is a dramatic, but under-studied, biological implication of climate change. The tree of life represents an evolutionary heritage providing both present and future benefits to humanity, often in unanticipated ways. Losses in this evolutionary (evo) life-support system represent losses in “evosystem” services, and are quantified using the phylogenetic diversity (PD) measure. High species-level biodiversity losses may or may not correspond to high PD losses. If climate change impacts are clumped on the phylogeny, then loss of deeper phylogenetic branches can mean disproportionately large PD loss for a given degree of species loss. Over time, successive species extinctions within a clade each may imply only a moderate loss of PD, until the last species within that clade goes extinct, and PD drops precipitously. Emerging methods of “phylogenetic risk analysis” address such phylogenetic tipping points by adjusting conservation priorities to better reflect risk of such worst-case losses. We have further developed and explored this approach for one of the most threatened taxonomic groups, corals. Based on a phylogenetic tree for the corals genus Acropora, we identify cases where worst-case PD losses may be avoided by designing risk-averse conservation priorities. We also propose spatial heterogeneity measures changes to assess possible changes in the geographic distribution of corals PD. PMID:24832524

  16. Climate-driven extinctions shape the phylogenetic structure of temperate tree floras.

    PubMed

    Eiserhardt, Wolf L; Borchsenius, Finn; Plum, Christoffer M; Ordonez, Alejandro; Svenning, Jens-Christian

    2015-03-01

    When taxa go extinct, unique evolutionary history is lost. If extinction is selective, and the intrinsic vulnerabilities of taxa show phylogenetic signal, more evolutionary history may be lost than expected under random extinction. Under what conditions this occurs is insufficiently known. We show that late Cenozoic climate change induced phylogenetically selective regional extinction of northern temperate trees because of phylogenetic signal in cold tolerance, leading to significantly and substantially larger than random losses of phylogenetic diversity (PD). The surviving floras in regions that experienced stronger extinction are phylogenetically more clustered, indicating that non-random losses of PD are of increasing concern with increasing extinction severity. Using simulations, we show that a simple threshold model of survival given a physiological trait with phylogenetic signal reproduces our findings. Our results send a strong warning that we may expect future assemblages to be phylogenetically and possibly functionally depauperate if anthropogenic climate change affects taxa similarly. © 2015 John Wiley & Sons Ltd/CNRS.

  17. Increased phylogenetic resolution using target enrichment in Rubus

    USDA-ARS?s Scientific Manuscript database

    Phylogenetic analyses in Rubus L. have been challenging due to polyploidy, hybridization, and apomixis within the genus. Wide morphological diversity occurs within and between species, contributing to challenges at lower and higher systematic levels. Phylogenetic inferences to date have been based o...

  18. Whole genome sequence phylogenetic analysis of four Mexican rabies viruses isolated from cattle.

    PubMed

    Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F

    2017-08-01

    Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive to rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced in an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil, and the French Guyana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting common source of infection. The phylogenetic analysis shows that the type of host is different for the different regions in the American Continent; rabies is more related to bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Interpreting the universal phylogenetic tree

    NASA Technical Reports Server (NTRS)

    Woese, C. R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist.

  20. The Biogeography of Deep Time Phylogenetic Reticulation.

    PubMed

    Burbrink, Frank T; Gehara, Marcelo

    2018-03-09

    Most phylogenies are typically represented as purely bifurcating. However, as genomic data has become more common in phylogenetic studies, it is not unusual to find reticulation among terminal lineages or among internal nodes (deep time reticulation; DTR). In these situations, gene flow must have happened in the same or adjacent geographic areas for these DTRs to have occurred and therefore biogeographic reconstruction should provide similar area estimates for parental nodes, provided extinction or dispersal has not eroded these patterns. We examine the phylogeny of the widely distributed New World kingsnakes (Lampropeltis), determine if DTR is present in this group, and estimate the ancestral area for reticulation. Importantly, we develop a new method that uses coalescent simulations in a machine learning framework to show conclusively that this phylogeny is best represented as reticulating at deeper time. Using joint probabilities of ancestral area reconstructions on the bifurcating parental lineages from the reticulating node, we show that this reticulation likely occurred in northwestern Mexico/southwestern US and subsequently led to the diversification of the Mexican kingsnakes. This region has been previously identified as an area important for understanding speciation and secondary contact with gene flow in snakes and other squamates. This research shows that phylogenetic reticulation is common, even in well-studied groups, and that the geographic scope of ancient hybridization is recoverable.