Science.gov

Sample records for accurate phylogenetic classification

  1. Accurate phylogenetic classification of DNA fragments based on sequence composition

    SciTech Connect

    McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis; Hugenholtz, Philip; Rigoutsos, Isidore

    2006-05-01

    Metagenome studies have retrieved vast amounts of sequence out of a variety of environments, leading to novel discoveries and great insights into the uncultured microbial world. Except for very simple communities, diversity makes sequence assembly and analysis a very challenging problem. To understand the structure and function of microbial communities, a taxonomic characterization of the obtained sequence fragments is highly desirable, yet currently limited mostly to those sequences that contain phylogenetic marker genes. We show that for clades at the rank of domain down to genus, sequence composition allows the very accurate phylogenetic characterization of genomic sequence. We developed a composition-based classifier, PhyloPythia, for de novo phylogenetic sequence characterization and have trained it on a data set of 340 genomes. By extensive evaluation experiments we show that the method is accurate across all taxonomic ranks considered, even for sequences that originate from novel organisms and are as short as 1 kb. Application to two metagenome datasets obtained from samples of phosphorus-removing sludge showed that the method allows the accurate classification at genus level of most sequence fragments from the dominant populations, while at the same time correctly characterizing even larger parts of the samples at higher taxonomic levels.
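
The idea of classifying a fragment by its sequence composition can be illustrated with a minimal sketch. This is not PhyloPythia: the abstract does not describe the classifier's internals, so the sketch swaps in a simple nearest-centroid rule over normalized k-mer frequencies. The function names (`kmer_profile`, `centroid`, `classify`), the clade labels, and the toy sequences are all hypothetical.

```python
from collections import Counter
from itertools import product
import math


def kmer_profile(seq, k=4):
    """Return normalized k-mer frequencies, ordered over all 4**k DNA k-mers."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only count k-mers made of A, C, G, T (ambiguous bases are ignored).
    total = sum(counts[m] for m in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[m] / total for m in kmers]


def centroid(profiles):
    """Average several equal-length profiles position by position."""
    n = len(profiles)
    return [sum(column) / n for column in zip(*profiles)]


def classify(fragment, centroids, k=4):
    """Assign the fragment to the label whose centroid is closest (Euclidean)."""
    profile = kmer_profile(fragment, k)
    return min(centroids, key=lambda label: math.dist(profile, centroids[label]))


# Toy training data (made up for illustration; real use would need genomes).
training = {
    "AT_rich_clade": ["ATATTTAAATATTAAATTTATAAT", "TTTAAATATATTTAATAAATATTA"],
    "GC_rich_clade": ["GCGGCCGCGGGCCCGCGGCGCCGG", "CCGGGCGCGCCCGGCGGGCCGCGC"],
}
centroids = {
    label: centroid([kmer_profile(s, 2) for s in seqs])
    for label, seqs in training.items()
}
predicted = classify("GGCCGCGCGGCC", centroids, k=2)
```

With real data one would use longer fragments and a larger k; the short sequences and k=2 here only keep the example small.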

  2. Concepts of Classification and Taxonomy Phylogenetic Classification

    NASA Astrophysics Data System (ADS)

    Fraix-Burnet, D.

    2016-05-01

    Phylogenetic approaches to classification have been heavily developed in biology by bioinformaticians. But these techniques have applications in other fields, in particular in linguistics. Their main characteristic is to search for relationships between the objects or species under study, instead of grouping them by similarity. They are thus rather well suited for any kind of evolutionary objects. For nearly fifteen years, astrocladistics has explored the use of Maximum Parsimony (or cladistics) for astronomical objects like galaxies or globular clusters. In this lesson we will learn how it works.

  3. Towards an integrated phylogenetic classification of the Tremellomycetes.

    PubMed

    Liu, X-Z; Wang, Q-M; Göker, M; Groenewald, M; Kachalkin, A V; Lumbsch, H T; Millanes, A M; Wedin, M; Yurkov, A M; Boekhout, T; Bai, F-Y

    2015-06-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained. PMID:26955199

  4. Towards an integrated phylogenetic classification of the Tremellomycetes

    PubMed Central

    Liu, X.-Z.; Wang, Q.-M.; Göker, M.; Groenewald, M.; Kachalkin, A.V.; Lumbsch, H.T.; Millanes, A.M.; Wedin, M.; Yurkov, A.M.; Boekhout, T.; Bai, F.-Y.

    2016-01-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained. PMID:26955199

  5. Phylogenetic classification and the universal tree.

    PubMed

    Doolittle, W F

    1999-06-25

    From comparative analyses of the nucleotide sequences of genes encoding ribosomal RNAs and several proteins, molecular phylogeneticists have constructed a "universal tree of life," taking it as the basis for a "natural" hierarchical classification of all living things. Although confidence in some of the tree's early branches has recently been shaken, new approaches could still resolve many methodological uncertainties. More challenging is evidence that most archaeal and bacterial genomes (and the inferred ancestral eukaryotic nuclear genome) contain genes from multiple sources. If "chimerism" or "lateral gene transfer" cannot be dismissed as trivial in extent or limited to special categories of genes, then no hierarchical universal classification can be taken as natural. Molecular phylogeneticists will have failed to find the "true tree," not because their methods are inadequate or because they have chosen the wrong genes, but because the history of life cannot properly be represented as a tree. However, taxonomies based on molecular sequences will remain indispensable, and understanding of the evolutionary process will ultimately be enriched, not impoverished. PMID:10381871

  6. A Functional-Phylogenetic Classification System for Transmembrane Solute Transporters

    PubMed Central

    Saier, Milton H.

    2000-01-01

    A comprehensive classification system for transmembrane molecular transporters has been developed and recently approved by the transport panel of the nomenclature committee of the International Union of Biochemistry and Molecular Biology. This system is based on (i) transporter class and subclass (mode of transport and energy coupling mechanism), (ii) protein phylogenetic family and subfamily, and (iii) substrate specificity. Almost all of the more than 250 identified families of transporters include members that function exclusively in transport. Channels (115 families), secondary active transporters (uniporters, symporters, and antiporters) (78 families), primary active transporters (23 families), group translocators (6 families), and transport proteins of ill-defined function or of unknown mechanism (51 families) constitute distinct categories. Transport mode and energy coupling prove to be relatively immutable characteristics and therefore provide primary bases for classification. Phylogenetic grouping reflects structure, function, mechanism, and often substrate specificity and therefore provides a reliable secondary basis for classification. Substrate specificity and polarity of transport prove to be more readily altered during evolutionary history and therefore provide a tertiary basis for classification. With very few exceptions, a phylogenetic family of transporters includes members that function by a single transport mode and energy coupling mechanism, although a variety of substrates may be transported, sometimes with either inwardly or outwardly directed polarity. In this review, I provide cross-referencing of well-characterized constituent transporters according to (i) transport mode, (ii) energy coupling mechanism, (iii) phylogenetic grouping, and (iv) substrates transported. The structural features and distribution of recognized family members throughout the living world are also evaluated. The tabulations should facilitate familial and functional

  7. A higher-level phylogenetic classification of the Fungi.

    PubMed

    Hibbett, David S; Binder, Manfred; Bischoff, Joseph F; Blackwell, Meredith; Cannon, Paul F; Eriksson, Ove E; Huhndorf, Sabine; James, Timothy; Kirk, Paul M; Lücking, Robert; Thorsten Lumbsch, H; Lutzoni, François; Matheny, P Brandon; McLaughlin, David J; Powell, Martha J; Redhead, Scott; Schoch, Conrad L; Spatafora, Joseph W; Stalpers, Joost A; Vilgalys, Rytas; Aime, M Catherine; Aptroot, André; Bauer, Robert; Begerow, Dominik; Benny, Gerald L; Castlebury, Lisa A; Crous, Pedro W; Dai, Yu-Cheng; Gams, Walter; Geiser, David M; Griffith, Gareth W; Gueidan, Cécile; Hawksworth, David L; Hestmark, Geir; Hosaka, Kentaro; Humber, Richard A; Hyde, Kevin D; Ironside, Joseph E; Kõljalg, Urmas; Kurtzman, Cletus P; Larsson, Karl-Henrik; Lichtwardt, Robert; Longcore, Joyce; Miadlikowska, Jolanta; Miller, Andrew; Moncalvo, Jean-Marc; Mozley-Standridge, Sharon; Oberwinkler, Franz; Parmasto, Erast; Reeb, Valérie; Rogers, Jack D; Roux, Claude; Ryvarden, Leif; Sampaio, José Paulo; Schüssler, Arthur; Sugiyama, Junta; Thorn, R Greg; Tibell, Leif; Untereiner, Wendy A; Walker, Christopher; Wang, Zheng; Weir, Alex; Weiss, Michael; White, Merlin M; Winka, Katarina; Yao, Yi-Jian; Zhang, Ning

    2007-05-01

    A comprehensive phylogenetic classification of the kingdom Fungi is proposed, with reference to recent molecular phylogenetic analyses, and with input from diverse members of the fungal taxonomic community. The classification includes 195 taxa, down to the level of order, of which 16 are described or validated here: Dikarya subkingdom nov.; Chytridiomycota, Neocallimastigomycota phyla nov.; Monoblepharidomycetes, Neocallimastigomycetes class. nov.; Eurotiomycetidae, Lecanoromycetidae, Mycocaliciomycetidae subclass. nov.; Acarosporales, Corticiales, Baeomycetales, Candelariales, Gloeophyllales, Melanosporales, Trechisporales, Umbilicariales ords. nov. The clade containing Ascomycota and Basidiomycota is classified as subkingdom Dikarya, reflecting the putative synapomorphy of dikaryotic hyphae. The most dramatic shifts in the classification relative to previous works concern the groups that have traditionally been included in the Chytridiomycota and Zygomycota. The Chytridiomycota is retained in a restricted sense, with Blastocladiomycota and Neocallimastigomycota representing segregate phyla of flagellated Fungi. Taxa traditionally placed in Zygomycota are distributed among Glomeromycota and several subphyla incertae sedis, including Mucoromycotina, Entomophthoromycotina, Kickxellomycotina, and Zoopagomycotina. Microsporidia are included in the Fungi, but no further subdivision of the group is proposed. Several genera of 'basal' Fungi of uncertain position are not placed in any higher taxa, including Basidiobolus, Caulochytrium, Olpidium, and Rozella. PMID:17572334

  8. Accurate reconstruction of insertion-deletion histories by statistical phylogenetics.

    PubMed

    Westesson, Oscar; Lunter, Gerton; Paten, Benedict; Holmes, Ian

    2012-01-01

    The Multiple Sequence Alignment (MSA) is a computational abstraction that represents a partial summary either of indel history, or of structural similarity. Taking the former view (indel history), it is possible to use formal automata theory to generalize the phylogenetic likelihood framework for finite substitution models (Dayhoff's probability matrices and Felsenstein's pruning algorithm) to arbitrary-length sequences. In this paper, we report results of a simulation-based benchmark of several methods for reconstruction of indel history. The methods tested include a relatively new algorithm for statistical marginalization of MSAs that sums over a stochastically-sampled ensemble of the most probable evolutionary histories. For mammalian evolutionary parameters on several different trees, the single most likely history sampled by our algorithm appears less biased than histories reconstructed by other MSA methods. The algorithm can also be used for alignment-free inference, where the MSA is explicitly summed out of the analysis. As an illustration of our method, we discuss reconstruction of the evolutionary histories of human protein-coding genes. PMID:22536326

  9. The ABC of ABCS: a phylogenetic and functional classification of ABC systems in living organisms.

    PubMed

    Dassa, E; Bouige, P

    2001-01-01

    ATP binding cassette (ABC) systems constitute one of the most abundant superfamilies of proteins. They are involved not only in the transport of a wide variety of substances, but also in many cellular processes and in their regulation. In this paper, we made a comparative analysis of the properties of ABC systems and we provide a phylogenetic and functional classification. This analysis will be helpful to accurately annotate ABC systems discovered during the sequencing of the genome of living organisms and to identify the partners of the ABC ATPases. PMID:11421270

  10. Phylogenetic and functional classification of ATP-binding cassette (ABC) systems.

    PubMed

    Bouige, Philippe; Laurent, David; Piloyan, Linda; Dassa, Elie

    2002-10-01

    ATP binding cassette (ABC) systems constitute one of the most abundant superfamilies of proteins. They are involved not only in the transport of a wide variety of substances, but also in many cellular processes and in their regulation. In this paper, we made a comparative analysis of the properties of ABC systems and we provide a phylogenetic and functional classification. This analysis will be helpful to accurately annotate ABC systems discovered during the sequencing of the genome of living organisms and to identify the partners of the ABC ATPases. PMID:12370001

  11. Phylogenetics, classification, and biogeography of the treefrogs (Amphibia: Anura: Arboranae).

    PubMed

    Duellman, William E; Marion, Angela B; Hedges, S Blair

    2016-01-01

    A phylogenetic analysis of sequences from 503 species of hylid frogs and four outgroup taxa resulted in 16,128 aligned sites of 19 genes. The molecular data were subjected to a maximum likelihood analysis that resulted in a new phylogenetic tree of treefrogs. A conservative new classification based on the tree has (1) three families composing an unranked taxon, Arboranae, (2) nine subfamilies (five resurrected, one new), and (3) six resurrected generic names and five new generic names. Using the results of a maximum likelihood timetree, times of divergence were determined. For the most part these times of divergence correlated well with historical geologic events. The arboranan frogs originated in South America in the Late Mesozoic or Early Cenozoic. The family Pelodryadidae diverged from its South American relative, Phyllomedusidae, in the Eocene and invaded Australia via Antarctica. There were two dispersals from South America to North America in the Paleogene. One lineage was the ancestral stock of Acris and its relatives, whereas the other lineage, subfamily Hylinae, differentiated into a myriad of genera in Middle America. PMID:27394762

  12. Accurate molecular classification of cancer using simple rules

    PubMed Central

    Wang, Xiaosheng; Gotoh, Osamu

    2009-01-01

    Background One intractable problem with using microarray data analysis for cancer classification is how to reduce the extremely high-dimensionality gene feature data to remove the effects of noise. Feature selection is often used to address this problem by selecting informative genes from among thousands or tens of thousands of genes. However, most of the existing methods of microarray-based cancer classification utilize too many genes to achieve accurate classification, which often hampers the interpretability of the models. For a better understanding of the classification results, it is desirable to develop simpler rule-based models with as few marker genes as possible. Methods We screened a small number of informative single genes and gene pairs on the basis of their depended degrees proposed in rough sets. Applying the decision rules induced by the selected genes or gene pairs, we constructed cancer classifiers. We tested the efficacy of the classifiers by leave-one-out cross-validation (LOOCV) of training sets and classification of independent test sets. Results We applied our methods to five cancerous gene expression datasets: leukemia (acute lymphoblastic leukemia [ALL] vs. acute myeloid leukemia [AML]), lung cancer, prostate cancer, breast cancer, and leukemia (ALL vs. mixed-lineage leukemia [MLL] vs. AML). Accurate classification outcomes were obtained by utilizing just one or two genes. Some genes that correlated closely with the pathogenesis of relevant cancers were identified. In terms of both classification performance and algorithm simplicity, our approach outperformed or at least matched existing methods. Conclusion In cancerous gene expression datasets, a small number of genes, even one or two if selected correctly, is capable of achieving an ideal cancer classification effect. This finding also means that very simple rules may perform well for cancerous class prediction. PMID:19874631
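
The evaluation scheme named in this abstract, leave-one-out cross-validation of a one-gene rule, can be sketched as follows. This is an illustration under stated assumptions, not the authors' method: gene selection by rough-set "depended degree" is replaced by a plain threshold rule on a single gene, exactly two classes are assumed, and the expression values and function names (`fit_rule`, `predict`, `loocv_accuracy`) are hypothetical.

```python
def fit_rule(values, labels):
    """Fit a one-gene rule (threshold, low_label, high_label) on training data.

    Assumes at most two classes. Falls back to the majority class when no
    threshold can be placed.
    """
    classes = sorted(set(labels))
    xs = sorted(set(values))
    if len(classes) < 2 or len(xs) < 2:
        majority = max(classes, key=labels.count)
        return (float("inf"), majority, majority)
    a, b = classes
    best = None
    # Candidate thresholds are midpoints between consecutive distinct values.
    for t in ((x + y) / 2 for x, y in zip(xs, xs[1:])):
        for low, high in ((a, b), (b, a)):
            correct = sum((high if v > t else low) == y
                          for v, y in zip(values, labels))
            if best is None or correct > best[0]:
                best = (correct, t, low, high)
    return best[1:]


def predict(rule, value):
    """Apply the rule: high_label above the threshold, low_label otherwise."""
    threshold, low, high = rule
    return high if value > threshold else low


def loocv_accuracy(values, labels):
    """Hold out each sample once, fit on the rest, and score the held-out one."""
    hits = 0
    for i in range(len(values)):
        rule = fit_rule(values[:i] + values[i + 1:], labels[:i] + labels[i + 1:])
        hits += predict(rule, values[i]) == labels[i]
    return hits / len(values)


# Made-up expression levels of one gene in eight samples (illustration only).
expression = [2.1, 1.8, 2.4, 2.0, 7.9, 8.3, 7.5, 8.8]
labels = ["ALL"] * 4 + ["AML"] * 4
accuracy = loocv_accuracy(expression, labels)
```

Because the rule is refitted in every fold without the held-out sample, the resulting accuracy reflects how the rule generalizes rather than how well it memorizes the training set.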

  13. Phylogenetic classification of Cordyceps and the clavicipitaceous fungi

    PubMed Central

    Sung, Gi-Ho; Hywel-Jones, Nigel L.; Sung, Jae-Mo; Luangsa-ard, J. Jennifer; Shrestha, Bhushan; Spatafora, Joseph W.

    2007-01-01

    Cordyceps, comprising over 400 species, was historically classified in the Clavicipitaceae, based on cylindrical asci, thickened ascus apices and filiform ascospores, which often disarticulate into part-spores. Cordyceps was characterized by the production of well-developed often stipitate stromata and an ecology as a pathogen of arthropods and Elaphomyces with infrageneric classifications emphasizing arrangement of perithecia, ascospore morphology and host affiliation. To refine the classification of Cordyceps and the Clavicipitaceae, the phylogenetic relationships of 162 taxa were estimated based on analyses consisting of five to seven loci, including the nuclear ribosomal small and large subunits (nrSSU and nrLSU), the elongation factor 1α (tef1), the largest and the second largest subunits of RNA polymerase II (rpb1 and rpb2), β-tubulin (tub), and mitochondrial ATP6 (atp6). Our results strongly support the existence of three clavicipitaceous clades and reject the monophyly of both Cordyceps and Clavicipitaceae. Most diagnostic characters used in current classifications of Cordyceps (e.g., arrangement of perithecia, ascospore fragmentation, etc.) were not supported as being phylogenetically informative; the characters that were most consistent with the phylogeny were texture, pigmentation and morphology of stromata. Therefore, we revise the taxonomy of Cordyceps and the Clavicipitaceae to be consistent with the multi-gene phylogeny. The family Cordycipitaceae is validated based on the type of Cordyceps, C. militaris, and includes most Cordyceps species that possess brightly coloured, fleshy stromata. The new family Ophiocordycipitaceae is proposed based on Ophiocordyceps Petch, which we emend. The majority of species in this family produce darkly pigmented, tough to pliant stromata that often possess aperithecial apices. The new genus Elaphocordyceps is proposed for a subclade of the Ophiocordycipitaceae, which includes all species of Cordyceps that parasitize

  14. Towards a phylogenetic classification of Leptothecata (Cnidaria, Hydrozoa)

    PubMed Central

    Maronna, Maximiliano M.; Miranda, Thaís P.; Peña Cantero, Álvaro L.; Barbeitos, Marcos S.; Marques, Antonio C.

    2016-01-01

    Leptothecata are hydrozoans whose hydranths are covered by perisarc and gonophores and whose medusae bear gonads on their radial canals. They develop complex polypoid colonies and exhibit considerable morphological variation among species with respect to growth, defensive structures and mode of development. For instance, several lineages within this order have lost the medusa stage. Depending on the author, traditional taxonomy in hydrozoans may be either polyp- or medusa-oriented. Therefore, the absence of the latter stage in some lineages may lead to very different classification schemes. Molecular data have proved useful in elucidating this taxonomic challenge. We analyzed a super matrix of new and published rRNA gene sequences (16S, 18S and 28S), employing newly proposed methods to measure branch support and improve phylogenetic signal. Our analysis recovered new clades not recognized by traditional taxonomy and corroborated some recently proposed taxa. We offer a thorough taxonomic revision of the Leptothecata, erecting new orders, suborders, infraorders and families. We also discuss the origination and diversification dynamics of the group from a macroevolutionary perspective. PMID:26821567

  15. Towards a phylogenetic classification of Leptothecata (Cnidaria, Hydrozoa).

    PubMed

    Maronna, Maximiliano M; Miranda, Thaís P; Peña Cantero, Álvaro L; Barbeitos, Marcos S; Marques, Antonio C

    2016-01-01

    Leptothecata are hydrozoans whose hydranths are covered by perisarc and gonophores and whose medusae bear gonads on their radial canals. They develop complex polypoid colonies and exhibit considerable morphological variation among species with respect to growth, defensive structures and mode of development. For instance, several lineages within this order have lost the medusa stage. Depending on the author, traditional taxonomy in hydrozoans may be either polyp- or medusa-oriented. Therefore, the absence of the latter stage in some lineages may lead to very different classification schemes. Molecular data have proved useful in elucidating this taxonomic challenge. We analyzed a super matrix of new and published rRNA gene sequences (16S, 18S and 28S), employing newly proposed methods to measure branch support and improve phylogenetic signal. Our analysis recovered new clades not recognized by traditional taxonomy and corroborated some recently proposed taxa. We offer a thorough taxonomic revision of the Leptothecata, erecting new orders, suborders, infraorders and families. We also discuss the origination and diversification dynamics of the group from a macroevolutionary perspective. PMID:26821567

  16. A phylogenetic analysis of the mycoplasmas: basis for their classification.

    PubMed Central

    Weisburg, W G; Tully, J G; Rose, D L; Petzel, J P; Oyaizu, H; Yang, D; Mandelco, L; Sechrest, J; Lawrence, T G; Van Etten, J

    1989-01-01

    Small-subunit rRNA sequences were determined for almost 50 species of mycoplasmas and their walled relatives, providing the basis for a phylogenetic systematic analysis of these organisms. Five groups of mycoplasmas per se were recognized (provisional names are given): the hominis group (which included species such as Mycoplasma hominis, Mycoplasma lipophilum, Mycoplasma pulmonis, and Mycoplasma neurolyticum), the pneumoniae group (which included species such as Mycoplasma pneumoniae and Mycoplasma muris), the spiroplasma group (which included species such as Mycoplasma mycoides, Spiroplasma citri, and Spiroplasma apis), the anaeroplasma group (which encompassed the anaeroplasmas and acholeplasmas), and a group known to contain only the isolated species Asteroleplasma anaerobium. In addition to these five mycoplasma groups, a sixth group of variously named gram-positive, walled organisms (which included lactobacilli, clostridia, and other organisms) was also included in the overall phylogenetic unit. In each of these six primary groups, subgroups were readily recognized and defined. Although the phylogenetic units identified by rRNA comparisons are difficult to recognize on the basis of mutually exclusive phenotypic characters alone, phenotypic justification can be given a posteriori for a number of them. PMID:2592342

  17. Automatic classification and accurate size measurement of blank mask defects

    NASA Astrophysics Data System (ADS)

    Bhamidipati, Samir; Paninjath, Sankaranarayanan; Pereira, Mark; Buck, Peter

    2015-07-01

    complexity of defects encountered. The variety arises due to factors such as defect nature, size, shape and composition; and the optical phenomena occurring around the defect. This paper focuses on preliminary characterization results, in terms of classification and size estimation, obtained by Calibre MDPAutoClassify tool on a variety of mask blank defects. It primarily highlights the challenges faced in achieving the results with reference to the variety of defects observed on blank mask substrates and the underlying complexities which make accurate defect size measurement an important and challenging task.

  18. Accurate mobile malware detection and classification in the cloud.

    PubMed

    Wang, Xiaolei; Yang, Yuexiang; Zeng, Yingzhi

    2015-01-01

    As the dominant smartphone operating system, Android has attracted the attention of malware authors and researchers alike. The number of types of Android malware is increasing rapidly despite the considerable number of proposed malware analysis systems. In this paper, by taking advantage of the low false-positive rate of misuse detection and the ability of anomaly detection to detect zero-day malware, we propose a novel hybrid detection system based on a new open-source framework, CuckooDroid, which enables the use of Cuckoo Sandbox's features to analyze Android malware through dynamic and static analysis. Our proposed system mainly consists of two parts: an anomaly detection engine that detects abnormal apps through dynamic analysis, and a signature detection engine that detects and classifies known malware by combining static and dynamic analysis. We evaluate our system using 5560 malware samples and 6000 benign samples. Experiments show that our anomaly detection engine with dynamic analysis is capable of detecting zero-day malware with a low false negative rate (1.16 %) and an acceptable false positive rate (1.30 %); it is worth noting that our signature detection engine with hybrid analysis can accurately classify malware samples with an average positive rate of 98.94 %. Considering the intensive computing resources required by the static and dynamic analysis, our proposed detection system should be deployed off-device, such as in the cloud. App store markets and ordinary users can access our detection system for malware detection through a cloud service. PMID:26543718

  19. Phylogenetic classification of yeasts and related taxa within Pucciniomycotina.

    PubMed

    Wang, Q-M; Yurkov, A M; Göker, M; Lumbsch, H T; Leavitt, S D; Groenewald, M; Theelen, B; Liu, X-Z; Boekhout, T; Bai, F-Y

    2015-06-01

    Most small genera containing yeast species in the Pucciniomycotina (Basidiomycota, Fungi) are monophyletic, whereas larger genera including Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces are polyphyletic. With the implementation of the "One Fungus = One Name" nomenclatural principle these polyphyletic genera were revised. Nine genera, namely Bannoa, Cystobasidiopsis, Colacogloea, Kondoa, Erythrobasidium, Rhodotorula, Sporobolomyces, Sakaguchia and Sterigmatomyces, were emended to include anamorphic and teleomorphic species based on the results obtained by a multi-gene phylogenetic analysis, phylogenetic network analyses, branch length-based methods, as well as morphological, physiological and biochemical comparisons. A new class Spiculogloeomycetes is proposed to accommodate the order Spiculogloeales. The new families Buckleyzymaceae with Buckleyzyma gen. nov., Chrysozymaceae with Chrysozyma gen. nov., Microsporomycetaceae with Microsporomyces gen. nov., Ruineniaceae with Ruinenia gen. nov., Symmetrosporaceae with Symmetrospora gen. nov., Colacogloeaceae and Sakaguchiaceae are proposed. The new genera Bannozyma, Buckleyzyma, Fellozyma, Hamamotoa, Hasegawazyma, Jianyunia, Rhodosporidiobolus, Oberwinklerozyma, Phenoliferia, Pseudobensingtonia, Pseudohyphozyma, Sampaiozyma, Slooffia, Spencerozyma, Trigonosporomyces, Udeniozyma, Vonarxula, Yamadamyces and Yunzhangia are proposed to accommodate species segregated from the genera Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces. Ballistosporomyces is emended and reintroduced to include three Sporobolomyces species of the sasicola clade. A total of 111 new combinations are proposed in this study. PMID:26951631

  20. Phylogenetic classification of yeasts and related taxa within Pucciniomycotina

    PubMed Central

    Wang, Q.-M.; Yurkov, A.M.; Göker, M.; Lumbsch, H.T.; Leavitt, S.D.; Groenewald, M.; Theelen, B.; Liu, X.-Z.; Boekhout, T.; Bai, F.-Y.

    2016-01-01

    Most small genera containing yeast species in the Pucciniomycotina (Basidiomycota, Fungi) are monophyletic, whereas larger genera including Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces are polyphyletic. With the implementation of the “One Fungus = One Name” nomenclatural principle these polyphyletic genera were revised. Nine genera, namely Bannoa, Cystobasidiopsis, Colacogloea, Kondoa, Erythrobasidium, Rhodotorula, Sporobolomyces, Sakaguchia and Sterigmatomyces, were emended to include anamorphic and teleomorphic species based on the results obtained by a multi-gene phylogenetic analysis, phylogenetic network analyses, branch length-based methods, as well as morphological, physiological and biochemical comparisons. A new class Spiculogloeomycetes is proposed to accommodate the order Spiculogloeales. The new families Buckleyzymaceae with Buckleyzyma gen. nov., Chrysozymaceae with Chrysozyma gen. nov., Microsporomycetaceae with Microsporomyces gen. nov., Ruineniaceae with Ruinenia gen. nov., Symmetrosporaceae with Symmetrospora gen. nov., Colacogloeaceae and Sakaguchiaceae are proposed. The new genera Bannozyma, Buckleyzyma, Fellozyma, Hamamotoa, Hasegawazyma, Jianyunia, Rhodosporidiobolus, Oberwinklerozyma, Phenoliferia, Pseudobensingtonia, Pseudohyphozyma, Sampaiozyma, Slooffia, Spencerozyma, Trigonosporomyces, Udeniozyma, Vonarxula, Yamadamyces and Yunzhangia are proposed to accommodate species segregated from the genera Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces. Ballistosporomyces is emended and reintroduced to include three Sporobolomyces species of the sasicola clade. A total of 111 new combinations are proposed in this study. PMID:26951631

  1. Phylogenetic classification of Aureobasidium pullulans strains for production of pullulan and xylanase

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This study tests the hypothesis that phylogenetic classification can predict whether A. pullulans strains will produce useful levels of the commercial polysaccharide, pullulan, or the valuable enzyme, xylanase. To test this hypothesis, 19 strains of A. pullulans with previously described phenotypes...

  2. Molecular classification and phylogenetic relationships of selected edible Basidiomycetes species.

    PubMed

    Avin, Farhat Ahmadi; Bhassu, Subha; Shin, Tan Yee; Sabaratnam, Vikineswary

    2012-07-01

    Morphological identification of edible mushrooms can sometimes prove troublesome, because phenotypic variation in fungi can be affected by substrate and environmental factors. One of the most important problems for mushroom breeders is the lack of a systematic consensus tool to distinguish different species, which are sometimes morphologically identical. Basidiomycetes, as one of the largest groups of edible mushrooms, have become more important in recent times for their medicinal and nutritional properties. Partial rDNA sequences, including the Internal Transcribed Spacer I-5.8S rDNA-Internal Transcribed Spacer II, were used in this study for molecular identification and assessment of phylogenetic relationships between selected edible species of the Basidiomycetes. Phylogenetic trees showed five distinct clades, each clade belonging to a separate family group. The first clade included all the species belonging to the Pleurotaceae (Pleurotus spp.) family; similarly, the second, third, fourth, and fifth clades consist of species from the Agaricaceae (Agaricus sp.), Lyophyllaceae (Hypsizygus sp.), Marasmiaceae (Lentinula edodes sp.) and Physalacriaceae (Flammulina velutipes sp.) families, respectively. Moreover, different species of each family were clearly placed in a distinct sub-cluster and a total of 13 species were taken for analysis. Species differentiation was re-confirmed by AMOVA analysis (among the populations: 99.67%; within: 0.33%), nucleotide divergence, haplotyping and P value. Polymorphism occurred throughout the ITS regions due to insertion-deletion and point mutations, and can be clearly differentiated within the families as well as genera. Moreover, this study proves that the sequence of the ITS region is a superior molecular DNA barcode for taxonomic identification of Basidiomycetes. PMID:22327649

  3. Accurate, Rapid Taxonomic Classification of Fungal Large-Subunit rRNA Genes

    PubMed Central

    Liu, Kuan-Liang; Porras-Alfaro, Andrea; Eichorst, Stephanie A.

    2012-01-01

    Taxonomic and phylogenetic fingerprinting based on sequence analysis of gene fragments from the large-subunit rRNA (LSU) gene or the internal transcribed spacer (ITS) region is becoming an integral part of fungal classification. The lack of an accurate and robust classification tool trained by a validated sequence database for taxonomic placement of fungal LSU genes is a severe limitation in taxonomic analysis of fungal isolates or large data sets obtained from environmental surveys. Using a hand-curated set of 8,506 fungal LSU gene fragments, we determined the performance characteristics of a naïve Bayesian classifier across multiple taxonomic levels and compared the classifier performance to that of a sequence similarity-based (BLASTN) approach. The naïve Bayesian classifier was computationally more rapid (>460-fold with our system) than the BLASTN approach, and it provided equal or superior classification accuracy. Classifier accuracies were compared using sequence fragments of 100 bp and 400 bp and two different PCR primer anchor points to mimic sequence read lengths commonly obtained using current high-throughput sequencing technologies. Accuracy was higher with 400-bp sequence reads than with 100-bp reads. It was also significantly affected by sequence location across the 1,400-bp test region. The highest accuracy was obtained across either the D1 or D2 variable region. The naïve Bayesian classifier provides an effective and rapid means to classify fungal LSU sequences from large environmental surveys. The training set and tool are publicly available through the Ribosomal Database Project (http://rdp.cme.msu.edu/classifier/classifier.jsp). PMID:22194300
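    The word-based naive Bayesian approach described in this abstract can be illustrated with a minimal sketch. It is not the RDP implementation: the word length (k = 3), the training sequences, the taxon names and the smoothing constants are all assumptions chosen to keep the example small.

```python
import math
from collections import Counter, defaultdict


def kmers(seq, k=3):
    """Return the set of overlapping k-letter words found in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def train(labelled_seqs, k=3):
    """Count, per taxon, how many training sequences contain each word."""
    word_counts = defaultdict(Counter)
    totals = Counter()
    for taxon, seq in labelled_seqs:
        totals[taxon] += 1
        word_counts[taxon].update(kmers(seq, k))
    return word_counts, totals


def classify(seq, model, k=3):
    """Return the taxon with the highest summed log word probability."""
    word_counts, totals = model
    best_taxon, best_score = None, -math.inf
    for taxon, n in totals.items():
        # Simple smoothing (an assumption) so unseen words do not give log(0).
        score = sum(
            math.log((word_counts[taxon][word] + 0.5) / (n + 1))
            for word in kmers(seq, k)
        )
        if score > best_score:
            best_taxon, best_score = taxon, score
    return best_taxon


# Hypothetical toy training set; real use would need curated reference sequences.
training = [
    ("TaxonA", "ACGTACGTACGT"),
    ("TaxonA", "ACGTACGAACGT"),
    ("TaxonB", "GGGCCCGGGCCC"),
    ("TaxonB", "GGGCCCGGTCCC"),
]
model = train(training)
print(classify("ACGTACG", model))
print(classify("GGGCC", model))
```

    Each query is scored against every taxon independently, which is what makes this kind of classifier fast compared with aligning the query against a whole database.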

  4. Phylogenetics.

    PubMed

    Sleator, Roy D

    2011-04-01

    The recent rapid expansion in the DNA and protein databases, arising from large-scale genomic and metagenomic sequence projects, has forced significant development in the field of phylogenetics: the study of the evolutionary relatedness of the planet's inhabitants. Advances in phylogenetic analysis have greatly transformed our view of the landscape of evolutionary biology, transcending the view of the tree of life that has shaped evolutionary theory since Darwinian times. Indeed, modern phylogenetic analysis no longer focuses on the restricted Darwinian-Mendelian model of vertical gene transfer, but must also consider the significant degree of lateral gene transfer, which connects and shapes almost all living things. Herein, I review the major tree-building methods, their strengths, weaknesses and future prospects. PMID:21249334

  5. The phagotrophic origin of eukaryotes and phylogenetic classification of Protozoa.

    PubMed

    Cavalier-Smith, T

    2002-03-01

    ancestrally biciliate clade, named 'bikonts'. The apparently conflicting rRNA and protein trees can be reconciled with each other and this ultrastructural interpretation if long-branch distortions, some mechanistically explicable, are allowed for. Bikonts comprise two groups: corticoflagellates, with a younger anterior cilium, no centrosomal cone and ancestrally a semi-rigid cell cortex with a microtubular band on either side of the posterior mature centriole; and Rhizaria [a new infrakingdom comprising Cercozoa (now including Ascetosporea classis nov.), Retaria phylum nov., Heliozoa and Apusozoa phylum nov.], having a centrosomal cone or radiating microtubules and two microtubular roots and a soft surface, frequently with reticulopodia. Corticoflagellates comprise photokaryotes (Plantae and chromalveolates, both ancestrally with cortical alveoli) and Excavata (a new protozoan infrakingdom comprising Loukozoa, Discicristata and Archezoa, ancestrally with three microtubular roots). All basal eukaryotic radiations were of mitochondrial aerobes; hydrogenosomes evolved polyphyletically from mitochondria long afterwards, the persistence of their double envelope long after their genomes disappeared being a striking instance of membrane heredity. I discuss the relationship between the 13 protozoan phyla recognized here and revise higher protozoan classification by updating as subkingdoms Lankester's 1878 division of Protozoa into Corticata (Excavata, Alveolata; with prominent cortical microtubules and ancestrally localized cytostome--the Parabasalia probably secondarily internalized the cytoskeleton) and Gymnomyxa [infrakingdoms Sarcomastigota (Choanozoa, Amoebozoa) and Rhizaria; both ancestrally with a non-cortical cytoskeleton of radiating singlet microtubules and a relatively soft cell surface with diffused feeding]. As the eukaryote root almost certainly lies within Gymnomyxa, probably among the Sarcomastigota, Corticata are derived. 
Following the single symbiogenetic origin of

  6. Structural and phylogenetic basis for the classification of group III phospholipase A2.

    PubMed

    Hariprasad, Gururao; Srinivasan, Alagiri; Singh, Reema

    2013-09-01

    Secretory phospholipase A2 (PLA2) catalyses the hydrolysis of the sn-2 position of glycerophospholipids to liberate arachidonic acid, a precursor of eicosanoids, which are known mediators of inflammation. The group III PLA2 enzymes are present in a wide array of organisms across many species with completely different functions. A detailed analysis of the structure and evolutionary proximity amongst the enzymes was carried out for a meaningful classification of this group. Fifty protein sequences from different species of the group were considered for detailed sequence, structural and phylogenetic studies. In addition to the conservation of the calcium binding motif and the catalytic histidine, the sequences exhibit specific 'amino acid signatures'. Structural analysis reveals that these enzymes have a conserved globular structure with species-specific variations seen at the active site, calcium binding loop, hydrophobic channel, the C-terminal domain and the quaternary conformational state. Character- and distance-based phylogenetic analyses of these sequences are in accordance with the structural features. The outcomes of the structural and phylogenetic analysis lay a convincing platform for the classification of the group III PLA2s into (IA) venomous insects; (IB) non-venomous insects; (II) mammals; (IIIA) gila monsters; (IIIB) reptiles, amphibians, fishes, sea anemones and liver fluke, and (IV) scorpions. This classification also helps in understanding structure-function relationships and enzyme-substrate specificity, and in designing potent inhibitors against the drug target isoforms. PMID:23793742

  7. Phylogeny and phylogenetic classification of the antbirds, ovenbirds, woodcreepers, and allies (Aves: Passeriformes: Infraorder Furnariides)

    USGS Publications Warehouse

    Moyle, R.G.; Chesser, R.T.; Brumfield, R.T.; Tello, J.G.; Marchese, D.J.; Cracraft, J.

    2009-01-01

    The infraorder Furnariides is a diverse group of suboscine passerine birds comprising a substantial component of the Neotropical avifauna. The included species encompass a broad array of morphologies and behaviours, making them appealing for evolutionary studies, but the size of the group (ca. 600 species) has limited well-sampled higher-level phylogenetic studies. Using DNA sequence data from the nuclear RAG-1 and RAG-2 exons, we undertook a phylogenetic analysis of the Furnariides sampling 124 (more than 88%) of the genera. Basal relationships among family-level taxa differed depending on phylogenetic method, but all topologies had little nodal support, mirroring the results from earlier studies in which discerning relationships at the base of the radiation was also difficult. In contrast, branch support for family-rank taxa and for many relationships within those clades was generally high. Our results support the Melanopareidae and Grallariidae as distinct from the Rhinocryptidae and Formicariidae, respectively. Within the Furnariides our data contradict some recent phylogenetic hypotheses and suggest that further study is needed to resolve these discrepancies. Of the few genera represented by multiple species, several were not monophyletic, indicating that additional systematic work remains within furnariine families and must include dense taxon sampling. We use this study as a basis for proposing a new phylogenetic classification for the group and in the process erect new family-group names for clades having high branch support across methods. © 2009 The Willi Hennig Society.

  8. Molecular phylogenetic perspectives for character classification and convergence: Framing some issues with nematode vulval appendages and telotylenchid tail termini

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Characters flagged as convergent based on newer molecular phylogenetic trees inform both practical identification and more esoteric classification. Nematode morphological characters such as lateral lines, bullae and laciniae are quite independent structures from those similarly named in other organi...

  9. How Accurate and Robust Are the Phylogenetic Estimates of Austronesian Language Relationships?

    PubMed Central

    Greenhill, Simon J.; Drummond, Alexei J.; Gray, Russell D.

    2010-01-01

    We recently used computational phylogenetic methods on lexical data to test between two scenarios for the peopling of the Pacific. Our analyses of lexical data supported a pulse-pause scenario of Pacific settlement in which the Austronesian speakers originated in Taiwan around 5,200 years ago and rapidly spread through the Pacific in a series of expansion pulses and settlement pauses. We claimed that there was high congruence between traditional language subgroups and those observed in the language phylogenies, and that the estimated age of the Austronesian expansion at 5,200 years ago was consistent with the archaeological evidence. However, the congruence between the language phylogenies and the evidence from historical linguistics was not quantitatively assessed using tree comparison metrics. The robustness of the divergence time estimates to different calibration points was also not investigated exhaustively. Here we address these limitations by using a systematic tree comparison metric to calculate the similarity between the Bayesian phylogenetic trees and the subgroups proposed by historical linguistics, and by re-estimating the age of the Austronesian expansion using only the most robust calibrations. The results show that the Austronesian language phylogenies are highly congruent with the traditional subgroupings, and the date estimates are robust even when calculated using a restricted set of historical calibrations. PMID:20224774
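    The abstract does not name the tree comparison metric it uses. As a stand-in, the sketch below counts the clades that two rooted trees do not share (a Robinson-Foulds-style distance); the nested-tuple tree encoding and the language labels A to D are hypothetical.

```python
def clades(tree):
    """Return (leaf set, set of clades) for a rooted tree given as nested tuples."""
    if isinstance(tree, str):
        # A single leaf contributes no informative clade of its own.
        return frozenset([tree]), set()
    leaves = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found |= child_clades
    found.add(leaves)
    return leaves, found


def clade_distance(tree_a, tree_b):
    """Number of clades present in exactly one of the two trees."""
    _, clades_a = clades(tree_a)
    _, clades_b = clades(tree_b)
    return len(clades_a ^ clades_b)


# Hypothetical trees over four placeholder languages.
inferred = ((("A", "B"), "C"), "D")
traditional = ((("A", "C"), "B"), "D")
print(clade_distance(inferred, traditional))
print(clade_distance(inferred, inferred))
```

    A distance of zero means every subgroup in one tree also appears in the other; larger values indicate more conflicting subgroups.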

  10. Molecular Phylogenetic Evaluation of Classification and Scenarios of Character Evolution in Calcareous Sponges (Porifera, Class Calcarea)

    PubMed Central

    Voigt, Oliver; Wülfing, Eilika; Wörheide, Gert

    2012-01-01

    Calcareous sponges (Phylum Porifera, Class Calcarea) are known to be taxonomically difficult. Previous molecular studies have revealed many discrepancies between classically recognized taxa and the observed relationships at the order, family and genus levels; these inconsistencies question underlying hypotheses regarding the evolution of certain morphological characters. Therefore, we extended the available taxa and character set by sequencing the complete small subunit (SSU) rDNA and the almost complete large subunit (LSU) rDNA of additional key species and complemented this dataset by substantially increasing the length of available LSU sequences. Phylogenetic analyses provided new hypotheses about the relationships of Calcarea and about the evolution of certain morphological characters. We tested our phylogeny against competing phylogenetic hypotheses presented by previous classification systems. Our data reject the current order-level classification by again finding non-monophyletic Leucosolenida, Clathrinida and Murrayonida. In the subclass Calcinea, we recovered a clade that includes all species with a cortex, which is largely consistent with the previously proposed order Leucettida. Other orders that had been rejected in the current system were not found, but could not be rejected in our tests either. We found several additional families and genera polyphyletic: the families Leucascidae and Leucaltidae and the genus Leucetta in Calcinea, and in Calcaronea the family Amphoriscidae and the genus Ute. Our phylogeny also provided support for the vaguely suspected close relationship of several members of Grantiidae with giant cortical diactines to members of Heteropiidae. Similarly, our analyses revealed several unexpected affinities, such as a sister group relationship between Leucettusa (Leucaltidae) and Leucettidae and between Leucascandra (Jenkinidae) and Sycon carteri (Sycettidae). According to our results, the taxonomy of Calcarea is in desperate need of a

  11. Phylogenetic systematics and a revised generic classification of anthidiine bees (Hymenoptera: Megachilidae).

    PubMed

    Litman, Jessica R; Griswold, Terry; Danforth, Bryan N

    2016-07-01

    The bee tribe Anthidiini (Hymenoptera: Megachilidae) is a large, cosmopolitan group of solitary bees that exhibit intriguing nesting behavior. We present the first molecular-based phylogenetic analysis of relationships within Anthidiini using model-based methods and a large, multi-locus dataset (five nuclear genes, 5081 base pairs), as well as a combined analysis using our molecular dataset in conjunction with a previously published morphological matrix. We discuss the evolution of nesting behavior in Anthidiini and the relationship between nesting material and female mandibular morphology. Following an examination of the morphological characters historically used to recognize anthidiine genera, we recommend the use of a molecular-based phylogenetic backbone to define taxonomic groups prior to the assignment of diagnostic morphological characters for these groups. Finally, our results reveal the paraphyly of numerous genera and have significant consequences for anthidiine classification. In order to promote a classification system based on stable, monophyletic clades, we hereby make the following changes to Michener's (2007) classification: The subgenera Afranthidium (Zosteranthidium) Michener and Griswold, 1994, Afranthidium (Branthidium) Pasteels, 1969 and Afranthidium (Immanthidium) Pasteels, 1969 are moved into the genus Pseudoanthidium, thus forming the new combinations Pseudoanthidium (Zosteranthidium), Pseudoanthidium (Branthidium), and Pseudoanthidium (Immanthidium). The genus Neanthidium Pasteels, 1969 is also moved into the genus Pseudoanthidium, thus forming the new combination Pseudoanthidium (Neanthidium). Based on morphological characters shared with our new definition of the genus Pseudoanthidium, the subgenus Afranthidium (Mesanthidiellum) Pasteels, 1969 and the genus Gnathanthidium Pasteels, 1969 are also moved into the genus Pseudoanthidium, thus forming the new combinations Pseudoanthidium (Mesanthidiellum) and Pseudoanthidium (Gnathanthidium

  12. Molecular phylogenetic evaluation of classification and scenarios of character evolution in calcareous sponges (Porifera, Class Calcarea).

    PubMed

    Voigt, Oliver; Wülfing, Eilika; Wörheide, Gert

    2012-01-01

    Calcareous sponges (Phylum Porifera, Class Calcarea) are known to be taxonomically difficult. Previous molecular studies have revealed many discrepancies between classically recognized taxa and the observed relationships at the order, family and genus levels; these inconsistencies question underlying hypotheses regarding the evolution of certain morphological characters. Therefore, we extended the available taxa and character set by sequencing the complete small subunit (SSU) rDNA and the almost complete large subunit (LSU) rDNA of additional key species and complemented this dataset by substantially increasing the length of available LSU sequences. Phylogenetic analyses provided new hypotheses about the relationships of Calcarea and about the evolution of certain morphological characters. We tested our phylogeny against competing phylogenetic hypotheses presented by previous classification systems. Our data reject the current order-level classification by again finding non-monophyletic Leucosolenida, Clathrinida and Murrayonida. In the subclass Calcinea, we recovered a clade that includes all species with a cortex, which is largely consistent with the previously proposed order Leucettida. Other orders that had been rejected in the current system were not found, but could not be rejected in our tests either. We found several additional families and genera polyphyletic: the families Leucascidae and Leucaltidae and the genus Leucetta in Calcinea, and in Calcaronea the family Amphoriscidae and the genus Ute. Our phylogeny also provided support for the vaguely suspected close relationship of several members of Grantiidae with giant cortical diactines to members of Heteropiidae. Similarly, our analyses revealed several unexpected affinities, such as a sister group relationship between Leucettusa (Leucaltidae) and Leucettidae and between Leucascandra (Jenkinidae) and Sycon carteri (Sycettidae). According to our results, the taxonomy of Calcarea is in desperate need of a

  13. Cloning, in Vitro expression, and novel phylogenetic classification of a channel catfish estrogen receptor

    USGS Publications Warehouse

    Xia, Z.; Patino, R.; Gale, W.L.; Maule, A.G.; Densmore, L.D.

    1999-01-01

    We obtained two channel catfish estrogen receptor (ccER) cDNA from liver of female fish using RT–PCR. The two fragments were identical in sequence except that the smaller one had an out-of-frame deletion in the E domain, suggesting the existence of ccER splice variants. The larger fragment was used to screen a cDNA library from liver of a prepubescent female. A cDNA was obtained that encoded a 581-amino-acid ER with a deduced molecular weight of 63.8 kDa. Extracts of COS-7 cells transfected with ccER cDNA bound estrogen with high affinity (Kd = 4.7 nM) and specificity. Maximum parsimony and Neighbor Joining analyses were used to generate a phylogenetic classification of ccER on the basis of 18 full-length ER sequences. The tree suggested the existence of two major ER branches. One branch contained two clearly divergent clades which included all piscine ER (except Japanese eel ER) and all tetrapod ERα, respectively. The second major branch contained the eel ER and the mammalian ERβ. The high degree of divergence between the eel ER and mammalian ERβ suggested that they also represent distinct piscine and tetrapod ER. These data suggest that ERα and ERβ are present throughout vertebrates and that these two major ER types evolved by duplication of an ancestral ER gene. Sequence alignments with other members of the nuclear hormone receptor superfamily indicated the presence of 8 amino acids in the E domain that align exclusively among ER. Four of these amino acids have not received prior research attention and their function is unknown. The novel finding of putative ER splice variants in a nonmammalian vertebrate and the novel phylogenetic classification of ER offer new perspectives in understanding the diversification and function of ER.

  14. Accurate cortical tissue classification on MRI by modeling cortical folding patterns.

    PubMed

    Kim, Hosung; Caldairou, Benoit; Hwang, Ji-Wook; Mansi, Tommaso; Hong, Seok-Jun; Bernasconi, Neda; Bernasconi, Andrea

    2015-09-01

    Accurate tissue classification is a crucial prerequisite to MRI morphometry. Automated methods based on intensity histograms constructed from the entire volume are challenged by regional intensity variations due to local radiofrequency artifacts as well as disparities in tissue composition, laminar architecture and folding patterns. The current work proposes a novel anatomy-driven method in which parcels conforming to cortical folding were regionally extracted from the brain. Each parcel is subsequently classified using nonparametric mean shift clustering. Evaluation was carried out on manually labeled images from two datasets acquired at 3.0 Tesla (n = 15) and 1.5 Tesla (n = 20). In both datasets, we observed high tissue classification accuracy of the proposed method (Dice index >97.6% at 3.0 Tesla, and >89.2% at 1.5 Tesla). Moreover, our method consistently outperformed state-of-the-art classification routines available in SPM8 and FSL-FAST, as well as a recently proposed local classifier that partitions the brain into cubes. Contour-based analyses localized more accurate white matter-gray matter (GM) interface classification of the proposed framework compared to the other algorithms, particularly in central and occipital cortices that generally display bright GM due to their high degree of myelination. Excellent accuracy was maintained, even in the absence of correction for intensity inhomogeneity. The presented anatomy-driven local classification algorithm may significantly improve cortical boundary definition, with possible benefits for morphometric inference and biomarker discovery. PMID:26037453
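    The Dice index quoted in this abstract measures the overlap between an automated labeling and a manual reference. A minimal sketch of the calculation follows; the flat label lists and the tissue codes ("GM", "WM", "CSF") are made up for illustration and stand in for voxel-wise label volumes.

```python
def dice_index(predicted, reference, label):
    """Dice overlap for one tissue label: 2|A ∩ B| / (|A| + |B|)."""
    pred = {i for i, value in enumerate(predicted) if value == label}
    ref = {i for i, value in enumerate(reference) if value == label}
    if not pred and not ref:
        # Convention chosen here: two empty segmentations agree perfectly.
        return 1.0
    return 2 * len(pred & ref) / (len(pred) + len(ref))


# Hypothetical six-voxel example.
automatic = ["GM", "GM", "WM", "WM", "CSF", "GM"]
manual = ["GM", "WM", "WM", "WM", "CSF", "GM"]
for tissue in ("GM", "WM", "CSF"):
    print(tissue, dice_index(automatic, manual, tissue))
```

    A value of 1.0 means the two labelings agree on every voxel of that tissue, and 0.0 means they share none.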

  15. Accurate crop classification using hierarchical genetic fuzzy rule-based systems

    NASA Astrophysics Data System (ADS)

    Topaloglou, Charalampos A.; Mylonas, Stelios K.; Stavrakoudis, Dimitris G.; Mastorocostas, Paris A.; Theocharis, John B.

    2014-10-01

    This paper investigates the effectiveness of an advanced classification system for accurate crop classification using very high resolution (VHR) satellite imagery. Specifically, a recently proposed genetic fuzzy rule-based classification system (GFRBCS) is employed, namely, the Hierarchical Rule-based Linguistic Classifier (HiRLiC). HiRLiC's model comprises a small set of simple IF-THEN fuzzy rules, easily interpretable by humans. One of its most important attributes is that its learning algorithm requires minimum user interaction, since the most important learning parameters affecting the classification accuracy are determined by the learning algorithm automatically. HiRLiC is applied in a challenging crop classification task, using a SPOT5 satellite image over an intensively cultivated area in a lake-wetland ecosystem in northern Greece. A rich set of higher-order spectral and textural features is derived from the initial bands of the (pan-sharpened) image, resulting in an input space comprising 119 features. The experimental analysis proves that HiRLiC compares favorably to other interpretable classifiers of the literature, both in terms of structural complexity and classification accuracy. Its testing accuracy was very close to that obtained by complex state-of-the-art classification systems, such as the support vector machines (SVM) and random forest (RF) classifiers. Nevertheless, visual inspection of the derived classification maps shows that HiRLiC is characterized by higher generalization properties, providing more homogeneous classifications than the competitors. Moreover, the runtime required to produce the thematic map was orders of magnitude lower than that of the competitors.

  16. Highly accurate recognition of human postures and activities through classification with rejection.

    PubMed

    Tang, Wenlong; Sazonov, Edward S

    2014-01-01

    Monitoring of postures and activities is used in many clinical and research applications, some of which may require highly reliable posture and activity recognition with desired accuracy well above the 99% mark. This paper suggests a method for performing highly accurate recognition of postures and activities from data collected by a wearable shoe monitor (SmartShoe) through classification with rejection. Signals from pressure and acceleration sensors embedded in SmartShoe are used either as raw sensor data or after feature extraction. A support vector machine (SVM) and a multilayer perceptron (MLP) are used to implement classification with rejection. Unreliable observations are rejected by measuring the distance from the decision boundary and eliminating those observations that fall below the rejection threshold. The results show a significant improvement (from 97.3% ± 2.3% to 99.8% ± 0.1%) in the classification accuracy after the rejection, using MLP with raw sensor data and rejecting 31.6% of observations. The results also demonstrate that MLP outperformed the SVM, and the classification accuracy based on raw sensor data was higher than the accuracy based on extracted features. The proposed approach will be especially beneficial in applications where high accuracy of recognition is desired while not all observations need to be assigned a class label. PMID:24403429
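    The rejection rule described in this abstract (discard observations that lie too close to the decision boundary) can be sketched for the simplest case of a linear boundary. The weights, bias, threshold and sample points below are hypothetical, and a linear boundary is an assumption made for brevity; the study itself used SVM and MLP classifiers on SmartShoe sensor data.

```python
import math


def signed_distance(x, weights, bias):
    """Signed Euclidean distance from point x to the hyperplane w.x + b = 0."""
    norm = math.sqrt(sum(w * w for w in weights))
    return (sum(w * xi for w, xi in zip(weights, x)) + bias) / norm


def classify_with_rejection(x, weights, bias, threshold):
    """Return +1 or -1, or None when x is too close to the boundary to trust."""
    distance = signed_distance(x, weights, bias)
    if abs(distance) < threshold:
        return None  # rejected: no class label is assigned
    return 1 if distance > 0 else -1


# Hypothetical boundary and observations.
weights, bias, threshold = [3.0, 4.0], 0.0, 1.0
for point in ([3.0, 4.0], [0.3, 0.4], [-3.0, -4.0]):
    print(point, classify_with_rejection(point, weights, bias, threshold))
```

    Raising the threshold rejects more observations and labels only the clearer cases, which is the trade-off between coverage and accuracy reported above.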

  17. Toward a stable classification of genera within the Entolomataceae: a phylogenetic re-evaluation of the Rhodocybe-Clitopilus clade.

    PubMed

    Kluting, Kerri L; Baroni, Timothy J; Bergemann, Sarah E

    2014-01-01

    Despite the recent molecular systematic analyses of the Entolomataceae (Agaricales, Basidiomycota), a robust classification of genera supported by morphological and phylogenetic evidence remains unresolved for this cosmopolitan family of pink-spored fungi. Here, a phylogenetic analysis for one of the two major clades (Rhodocybe-Clitopilus) was conducted using three nuclear protein-coding gene regions, the mitochondrial ATP synthase subunit 6 (atp6), the nuclear RNA polymerase subunit II (rpb2) and the nuclear translation elongation factor subunit 1-α (tef1). Five monophyletic groups are resolved with strong statistical support and a set of morphological features for delineation of genera is presented. In the revised classification proposed here, Clitopilus is retained, Rhodocybe is emended, two genera previously accepted as synonyms of Rhodocybe (Clitopilopsis and Rhodophana) are resurrected and Clitocella is described as new. PMID:24987124

  18. Towards a formal genealogical classification of the Lezgian languages (North Caucasus): testing various phylogenetic methods on lexical data.

    PubMed

    Kassian, Alexei

    2015-01-01

    A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456
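    The abstract mentions Levenshtein distances as part of its phonetic-similarity procedure. The sketch below shows only the plain edit distance between two strings, not the consonant-class method itself; the example words are arbitrary.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions or substitutions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


def normalized_distance(a, b):
    """Edit distance scaled to [0, 1] by the longer word's length."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


print(levenshtein("kitten", "sitting"))
print(normalized_distance("abc", "abd"))
```

    Scaling by word length is one common way to make distances comparable across wordlist items of different lengths; it is a choice made for this sketch.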

  19. Towards a Formal Genealogical Classification of the Lezgian Languages (North Caucasus): Testing Various Phylogenetic Methods on Lexical Data

    PubMed Central

    Kassian, Alexei

    2015-01-01

    A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456
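    Among the distance-based methods listed in this abstract, UPGMA is the simplest to illustrate: it repeatedly merges the two clusters with the smallest average pairwise distance. The sketch below is a toy version under stated assumptions; the lect names A to D and their distances are invented, and node height is taken as half the inter-cluster distance.

```python
from itertools import combinations


def upgma_merges(leaf_distances):
    """Return the UPGMA merge order as (cluster_1, cluster_2, node_height) tuples."""
    dist = {}
    leaves = set()
    for (a, b), value in leaf_distances.items():
        dist[frozenset((a, b))] = value
        leaves.update((a, b))
    clusters = [frozenset([leaf]) for leaf in sorted(leaves)]

    def average(c1, c2):
        # Unweighted average over all leaf pairs drawn from the two clusters.
        total = sum(dist[frozenset((x, y))] for x in c1 for y in c2)
        return total / (len(c1) * len(c2))

    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: average(*pair))
        merges.append((set(c1), set(c2), average(c1, c2) / 2))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return merges


# Hypothetical lexical distances between four placeholder lects.
distances = {
    ("A", "B"): 2, ("A", "C"): 6, ("B", "C"): 6,
    ("A", "D"): 10, ("B", "D"): 10, ("C", "D"): 10,
}
for left, right, height in upgma_merges(distances):
    print(sorted(left), sorted(right), height)
```

    Recomputing averages from the leaf distances at every step is slow for large inputs but keeps the definition of the method visible.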

  20. Phylogenetic classification of Aureobasidium pullulans strains for production of feruloyl esterase

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective was to phylogenetically classify diverse strains of A. pullulans and determine their production of feruloyl esterase. Seventeen strains from the A. pullulans literature were phylogenetically classified. Phenotypic traits of color variation and endo-ß-1,4-xylanase overproduction were as...

  1. Molecular and morphological data supporting phylogenetic reconstruction of the genus Goniothalamus (Annonaceae), including a reassessment of previous infrageneric classifications

    PubMed Central

    Tang, Chin Cheung; Thomas, Daniel C.; Saunders, Richard M.K.

    2015-01-01

    Data is presented in support of a phylogenetic reconstruction of the species-rich early-divergent angiosperm genus Goniothalamus (Annonaceae) (Tang et al., Mol. Phylogenetic Evol., 2015) [1], inferred using chloroplast DNA (cpDNA) sequences. The data includes a list of primers for amplification and sequencing for nine cpDNA regions: atpB-rbcL, matK, ndhF, psbA-trnH, psbM-trnD, rbcL, trnL-F, trnS-G, and ycf1, the voucher information and molecular data (GenBank accession numbers) of 67 ingroup Goniothalamus accessions and 14 outgroup accessions selected from across the tribe Annoneae, and aligned data matrices for each gene region. We also present our Bayesian phylogenetic reconstructions for Goniothalamus, with information on previous infrageneric classifications superimposed to enable an evaluation of monophyly, together with a taxon-character data matrix (with 15 morphological characters scored for 66 Goniothalamus species and seven other species from the tribe Annoneae that are shown to be phylogenetically correlated). PMID:26286044

  2. A classification of the Chloridoideae (Poaceae) based on multi-gene phylogenetic trees.

    PubMed

    Peterson, Paul M; Romaschenko, Konstantin; Johnson, Gabriel

    2010-05-01

    We conducted a molecular phylogenetic study of the subfamily Chloridoideae using six plastid DNA sequences (ndhA intron, ndhF, rps16-trnK, rps16 intron, rps3, and rpl32-trnL) and a single nuclear ITS DNA sequence. Our large original data set includes 246 species (17.3%) representing 95 genera (66%) of the grasses currently placed in the Chloridoideae. The maximum likelihood and Bayesian analysis of DNA sequences provides strong support for the monophyly of the Chloridoideae; followed by, in order of divergence: a Triraphideae clade with Neyraudia sister to Triraphis; an Eragrostideae clade with the Cotteinae (includes Cottea and Enneapogon) sister to the Uniolinae (includes Entoplocamia, Tetrachne, and Uniola), and a terminal Eragrostidinae clade of Ectrosia, Harpachne, and Psammagrostis embedded in a polyphyletic Eragrostis; a Zoysieae clade with Urochondra sister to a Zoysiinae (Zoysia) clade, and a terminal Sporobolinae clade that includes Spartina, Calamovilfa, Pogoneura, and Crypsis embedded in a polyphyletic Sporobolus; and a very large terminal Cynodonteae clade that includes 13 monophyletic subtribes. The Cynodonteae includes, in alphabetical order: Aeluropodinae (Aeluropus); Boutelouinae (Bouteloua); Eleusininae (includes Apochiton, Astrebla with Schoenefeldia embedded, Austrochloris, Brachyachne, Chloris, Cynodon with Brachyachne embedded in part, Eleusine, Enteropogon with Eustachys embedded in part, Eustachys, Chrysochloa, Coelachyrum, Leptochloa with Dinebra embedded, Lepturus, Lintonia, Microchloa, Saugetia, Schoenefeldia, Sclerodactylon, Tetrapogon, and Trichloris); Hilariinae (Hilaria); Monanthochloinae (includes Distichlis, Monanthochloe, and Reederochloa); Muhlenbergiinae (Muhlenbergia with Aegopogon, Bealia, Blepharoneuron, Chaboissaea, Lycurus, Pereilema, Redfieldia, Schaffnerella, and Schedonnardus all embedded); Orcuttiinae (includes Orcuttia and Tuctoria); Pappophorinae (includes Neesiochloa and Pappophorum); Scleropogoninae (includes

  3. GPD: a graph pattern diffusion kernel for accurate graph classification with applications in cheminformatics.

    PubMed

    Smalter, Aaron; Huan, Jun Luke; Jia, Yi; Lushington, Gerald

    2010-01-01

    Graph data mining is an active research area. Graphs are general modeling tools to organize information from heterogeneous sources and have been applied in many scientific, engineering, and business fields. With the fast accumulation of graph data, building highly accurate predictive models for graph data emerges as a new challenge that has not been fully explored in the data mining community. In this paper, we demonstrate a novel technique called graph pattern diffusion (GPD) kernel. Our idea is to leverage existing frequent pattern discovery methods and to explore the application of kernel classifier (e.g., support vector machine) in building highly accurate graph classification. In our method, we first identify all frequent patterns from a graph database. We then map subgraphs to graphs in the graph database and use a process we call "pattern diffusion" to label nodes in the graphs. Finally, we designed a graph alignment algorithm to compute the inner product of two graphs. We have tested our algorithm using a number of chemical structure data. The experimental results demonstrate that our method is significantly better than competing methods such as those kernel functions based on paths, cycles, and subgraphs. PMID:20431140

  4. Phylogenetic systematics and a revised generic classification of anthidiine bees (Hymenoptera: Megachilidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The bee tribe Anthidiini (Hymenoptera: Megachilidae) is a large, cosmopolitan group of solitary bees that exhibit intriguing nesting behavior. We present the first molecular-based phylogenetic analysis of relationships within Anthidiini using model based methods and a large, multi-locus dataset (fiv...

  5. Addition of wsp sequences to the Wolbachia phylogenetic tree and stability of the classification.

    PubMed

    Pintureau, B; Chaudier, S; Lassablière, F; Charles, H; Grenier, S

    2000-10-01

    Wolbachia are symbiotic bacteria altering reproductive characters of numerous arthropods. Their most recent phylogeny and classification are based on sequences of the wsp gene. We sequenced wsp gene from six Wolbachia strains infecting six Trichogramma species that live as egg parasitoids on many insects. This allows us to test the effect of the addition of sequences on the Wolbachia phylogeny and to check the classification of Wolbachia infecting Trichogramma. The six Wolbachia studied are classified in the B supergroup. They confirm the monophyletic structure of the B Wolbachia in Trichogramma but introduce small differences in the Wolbachia classification. Modifications include the definition of a new group, Sem, for Wolbachia of T. semblidis and the merging of the two closely related groups, Sib and Kay. Specific primers were determined and tested for the Sem group. PMID:11040288

  6. Phylogenetic classification of the frog pathogen Amphibiothecum (Dermosporidium) penneri based on small ribosomal subunit sequencing

    USGS Publications Warehouse

    Feldman, S.H.; Wimsatt, J.H.; Green, D.E.

    2005-01-01

    We determined 1,600 base pairs of DNA sequence in the 18S small ribosomal subunit from two geographically distinct isolates of Dermosporidium penneri. Maximum likelihood and parsimony analysis of these sequences place D. penneri in the order Dermocystida of the class Mesomycetozoea. The 18S rRNA sequences from these two isolates only differ within a single region of 16 contiguous nucleotides. Based on the distant phylogenetic relationship of these organisms to Amphibiocystidium ranae and similarity to Sphaerothecum destruens we propose the organism be renamed Amphibiothecum penneri.

  7. Chemical classification of cattle. 2. Phylogenetic tree and specific status of the Zebu.

    PubMed

    Manwell, C; Baker, C M

    1980-01-01

    Phylogenetic trees for the ten major breed groups of cattle were constructed by Farris's (1972) maximum parsimony method, or Fitch & Margoliash's (1967) method, which averages out the deviation over the entire assemblage. Both techniques yield essentially identical trees. The phylogenetic tree for the ten major cattle breed groups can be superimposed on a map of Europe and western Asia, the root of the tree being close to the 'fertile crescent' in Asia Minor, believed to be a primary centre of bovine domestication. For some but not all protein variants there is a cline of gene frequencies as one proceeds from the British Isles and northwest Europe towards southeast Europe and Asia Minor, with the most extreme gene frequencies in the Zebu breeds of India. It is not clear to what extent the observed clines are primary or secondary, i.e., consequent to the initial migrations of cattle towards the end of the Pleistocene or consequent to the many migrations of man with his domesticated cattle. Such clines as exist are not in themselves sufficient to prove either selection versus genetic drift or to establish taxonomic ranking. Contrary to some suggestions in the literature, the biochemical evidence supports Linnaeus's original conclusions: Bos taurus and Bos indicus are distinct species. PMID:7458002

  8. From learning taxonomies to phylogenetic learning: Integration of 16S rRNA gene data into FAME-based bacterial classification

    PubMed Central

    2010-01-01

    Background: Machine learning techniques have been shown to improve bacterial species classification based on fatty acid methyl ester (FAME) data. Nonetheless, FAME analysis has a limited resolution for discrimination of bacteria at the species level. In this paper, we approach the species classification problem from a taxonomic point of view. Such a taxonomy or tree is typically obtained by applying clustering algorithms on FAME data or on 16S rRNA gene data. The knowledge gained from the tree can then be used to evaluate FAME-based classifiers, resulting in a novel framework for bacterial species classification. Results: In view of learning in a taxonomic framework, we consider two types of trees. First, a FAME tree is constructed with a supervised divisive clustering algorithm. Subsequently, based on 16S rRNA gene sequence analysis, phylogenetic trees are inferred by the NJ and UPGMA methods. In this second approach, the species classification problem is based on the combination of two different types of data. Herein, 16S rRNA gene sequence data is used for phylogenetic tree inference and the corresponding binary tree splits are learned based on FAME data. We call this learning approach 'phylogenetic learning'. Supervised Random Forest models are developed to train the classification tasks in a stratified cross-validation setting. In this way, better classification results are obtained for species that are typically hard to distinguish by a single or flat multi-class classification model. Conclusions: FAME-based bacterial species classification is successfully evaluated in a taxonomic framework. Although the proposed approach does not improve the overall accuracy compared to flat multi-class classification, it has some distinct advantages. First, it has better capabilities for distinguishing species on which flat multi-class classification fails. Secondly, the hierarchical classification structure allows to easily evaluate and visualize the resolution of FAME data for

  9. Archaeal-eubacterial mergers in the origin of Eukarya: phylogenetic classification of life

    NASA Technical Reports Server (NTRS)

    Margulis, L.

    1996-01-01

    A symbiosis-based phylogeny leads to a consistent, useful classification system for all life. "Kingdoms" and "Domains" are replaced by biological names for the most inclusive taxa: Prokarya (bacteria) and Eukarya (symbiosis-derived nucleated organisms). The earliest Eukarya, anaerobic mastigotes, hypothetically originated from permanent whole-cell fusion between members of Archaea (e.g., Thermoplasma-like organisms) and of Eubacteria (e.g., Spirochaeta-like organisms). Molecular biology, life-history, and fossil record evidence support the reunification of bacteria as Prokarya while subdividing Eukarya into uniquely defined subtaxa: Protoctista, Animalia, Fungi, and Plantae.

  10. Archaeal-eubacterial mergers in the origin of Eukarya: phylogenetic classification of life.

    PubMed

    Margulis, L

    1996-02-01

    A symbiosis-based phylogeny leads to a consistent, useful classification system for all life. "Kingdoms" and "Domains" are replaced by biological names for the most inclusive taxa: Prokarya (bacteria) and Eukarya (symbiosis-derived nucleated organisms). The earliest Eukarya, anaerobic mastigotes, hypothetically originated from permanent whole-cell fusion between members of Archaea (e.g., Thermoplasma-like organisms) and of Eubacteria (e.g., Spirochaeta-like organisms). Molecular biology, life-history, and fossil record evidence support the reunification of bacteria as Prokarya while subdividing Eukarya into uniquely defined subtaxa: Protoctista, Animalia, Fungi, and Plantae. PMID:8577716

  11. Archaeal-eubacterial mergers in the origin of Eukarya: phylogenetic classification of life.

    PubMed Central

    Margulis, L

    1996-01-01

    A symbiosis-based phylogeny leads to a consistent, useful classification system for all life. "Kingdoms" and "Domains" are replaced by biological names for the most inclusive taxa: Prokarya (bacteria) and Eukarya (symbiosis-derived nucleated organisms). The earliest Eukarya, anaerobic mastigotes, hypothetically originated from permanent whole-cell fusion between members of Archaea (e.g., Thermoplasma-like organisms) and of Eubacteria (e.g., Spirochaeta-like organisms). Molecular biology, life-history, and fossil record evidence support the reunification of bacteria as Prokarya while subdividing Eukarya into uniquely defined subtaxa: Protoctista, Animalia, Fungi, and Plantae. PMID:8577716

  12. Assignment of Calibration Information to Deeper Phylogenetic Nodes is More Effective in Obtaining Precise and Accurate Divergence Time Estimates.

    PubMed

    Mello, Beatriz; Schrago, Carlos G

    2014-01-01

    Divergence time estimation has become an essential tool for understanding macroevolutionary events. Molecular dating aims to obtain reliable inferences, which, within a statistical framework, means jointly increasing the accuracy and precision of estimates. Bayesian dating methods exhibit the property of a linear relationship between uncertainty and estimated divergence dates. This relationship occurs even if the number of sites approaches infinity and places a limit on the maximum precision of node ages. However, how the placement of calibration information may affect the precision of divergence time estimates remains an open question. In this study, relying on simulated and empirical data, we investigated how the location of calibration within a phylogeny affects the accuracy and precision of time estimates. We found that calibration priors set at median and deep phylogenetic nodes were associated with higher precision values compared to analyses involving calibration at the shallowest node. The results were independent of the tree symmetry. An empirical mammalian dataset produced results that were consistent with those generated by the simulated sequences. Assigning time information to the deeper nodes of a tree is crucial to guarantee the accuracy and precision of divergence times. This finding highlights the importance of the appropriate choice of outgroups in molecular dating. PMID:24855333

  13. Deceptive Desmas: Molecular Phylogenetics Suggests a New Classification and Uncovers Convergent Evolution of Lithistid Demosponges

    PubMed Central

    Schuster, Astrid; Erpenbeck, Dirk; Pisera, Andrzej; Hooper, John; Bryce, Monika; Fromont, Jane; Wörheide, Gert

    2015-01-01

    Reconciling the fossil record with molecular phylogenies to enhance the understanding of animal evolution is a challenging task, especially for taxa with a mostly poor fossil record, such as sponges (Porifera). ‘Lithistida’, a polyphyletic group of recent and fossil sponges, are an exception as they provide the richest fossil record among demosponges. Lithistids, currently encompassing 13 families, 41 genera and >300 recent species, are defined by the common possession of peculiar siliceous spicules (desmas) that characteristically form rigid articulated skeletons. Their phylogenetic relationships are to a large extent unresolved and there has been no (taxonomically) comprehensive analysis to formally reallocate lithistid taxa to their closest relatives. This study, based on the most comprehensive molecular and morphological investigation of ‘lithistid’ demosponges to date, corroborates some previous weakly-supported hypotheses, and provides novel insights into the evolutionary relationships of the previous ‘order Lithistida’. Based on molecular data (partial mtDNA CO1 and 28S rDNA sequences), we show that 8 out of 13 ‘Lithistida’ families belong to the order Astrophorida, whereas Scleritodermidae and Siphonidiidae form a separate monophyletic clade within Tetractinellida. Most lithistid astrophorids are dispersed between different clades of the Astrophorida and we propose to formally reallocate them, respectively. Corallistidae, Theonellidae and Phymatellidae are monophyletic, whereas the families Pleromidae and Scleritodermidae are polyphyletic. Family Desmanthidae is polyphyletic and groups within Halichondriidae – we formally propose a reallocation. The sister group relationship of the family Vetulinidae to Spongillida is confirmed and we propose here for the first time to include Vetulina into a new Order Sphaerocladina. Megascleres and microscleres possibly evolved and/or were lost several times independently in different

  14. Accurate multi-source forest species mapping using the multiple spectral-spatial classification approach

    NASA Astrophysics Data System (ADS)

    Stavrakoudis, Dimitris; Gitas, Ioannis; Karydas, Christos; Kolokoussis, Polychronis; Karathanassi, Vassilia

    2015-10-01

    This paper proposes an efficient methodology for combining multiple remotely sensed imagery, in order to increase the classification accuracy in complex forest species mapping tasks. The proposed scheme follows a decision fusion approach, whereby each image is first classified separately by means of a pixel-wise Fuzzy-Output Support Vector Machine (FO-SVM) classifier. Subsequently, the multiple results are fused according to the so-called multiple spectral-spatial classifier using the minimum spanning forest (MSSC-MSF) approach, which constitutes an effective post-regularization procedure for enhancing the result of a single pixel-based classification. For this purpose, the original MSSC-MSF has been extended in order to handle multiple classifications. In particular, the fuzzy outputs of the pixel-based classifiers are stacked and used to grow the MSF, whereas the markers are also determined considering both classifications. The proposed methodology has been tested on a challenging forest species mapping task in northern Greece, considering a multispectral (GeoEye) and a hyper-spectral (CASI) image. The pixel-wise classifications resulted in overall accuracies (OA) of 68.71% for the GeoEye and 77.95% for the CASI images, respectively. Both of them are characterized by high levels of speckle noise. Applying the proposed multi-source MSSC-MSF fusion, the OA climbs to 90.86%, which is attributed both to the ability of MSSC-MSF to tackle the salt-and-pepper effect, as well as the fact that the fusion approach exploits the relative advantages of both information sources.

  15. Phylogenetic analysis, genomic diversity and classification of M class gene segments of turkey reoviruses.

    PubMed

    Mor, Sunil K; Marthaler, Douglas; Verma, Harsha; Sharafeldin, Tamer A; Jindal, Naresh; Porter, Robert E; Goyal, Sagar M

    2015-03-23

    From 2011 to 2014, 13 turkey arthritis reoviruses (TARVs) were isolated from cases of swollen hock joints in 2-18-week-old turkeys. In addition, two isolates from similar cases of turkey arthritis were received from another laboratory. Eight turkey enteric reoviruses (TERVs) isolated from fecal samples of turkeys were also used for comparison. The aims of this study were to characterize turkey reovirus (TRV) based on complete M class genome segments and to determine genetic diversity within TARVs in comparison to TERVs and chicken reoviruses (CRVs). Nucleotide (nt) cut off values of 84%, 83% and 85% for the M1, M2 and M3 gene segments were proposed and used for genotype classification, generating 5, 7, and 3 genotypes, respectively. Using these nt cut off values, we propose M class genotype constellations (GCs) for avian reoviruses. Of the seven GCs, GC1 and GC3 were shared between the TARVs and TERVs, indicating possible reassortment between turkey and chicken reoviruses. The TARVs and TERVs were divided into three GCs, and GC2 was unique to TARVs and TERVs. The proposed new GC approach should be useful in identifying reassortant viruses, which may ultimately be used in the design of a universal vaccine against both chicken and turkey reoviruses. PMID:25655814
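
    The cut-off based genotype assignment described in this record can be illustrated with a small sketch. It assumes pre-aligned, equal-length sequences and groups any two sequences whose nucleotide identity is at or above the cut-off (single-linkage grouping). The abstract does not state the grouping rule, so that rule, the function names, and the toy sequences are all assumptions made for illustration.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of matching positions between two aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be aligned, equal-length and non-empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def assign_genotypes(seqs: dict, cutoff: float) -> dict:
    """Group sequences linked by identity >= cutoff (single linkage)."""
    names = list(seqs)
    parent = {n: n for n in names}

    def find(n):
        # Follow parent links to the group representative, compressing the path.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if percent_identity(seqs[a], seqs[b]) >= cutoff:
                parent[find(a)] = find(b)

    labels, result = {}, {}
    for n in names:
        root = find(n)
        labels.setdefault(root, len(labels) + 1)  # number groups in input order
        result[n] = labels[root]
    return result


# Toy 20-nt "segments" (hypothetical); 84 is the M1 cut-off quoted above.
toy = {"s1": "A" * 20, "s2": "A" * 18 + "CC", "s3": "G" * 20}
genotypes = assign_genotypes(toy, cutoff=84.0)
```

    Here s1 and s2 share 90% identity and fall into one genotype, while s3 forms its own.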

  16. Classification algorithms with multi-modal data fusion could accurately distinguish neuromyelitis optica from multiple sclerosis.

    PubMed

    Eshaghi, Arman; Riyahi-Alam, Sadjad; Saeedi, Roghayyeh; Roostaei, Tina; Nazeri, Arash; Aghsaei, Aida; Doosti, Rozita; Ganjgahi, Habib; Bodini, Benedetta; Shakourirad, Ali; Pakravan, Manijeh; Ghana'ati, Hossein; Firouznia, Kavous; Zarei, Mojtaba; Azimi, Amir Reza; Sahraian, Mohammad Ali

    2015-01-01

    Neuromyelitis optica (NMO) exhibits substantial similarities to multiple sclerosis (MS) in clinical manifestations and imaging results and has long been considered a variant of MS. With the advent of a specific biomarker in NMO, known as anti-aquaporin 4, this assumption has changed; however, the differential diagnosis remains challenging and it is still not clear whether a combination of neuroimaging and clinical data could be used to aid clinical decision-making. Computer-aided diagnosis is a rapidly evolving process that holds great promise to facilitate objective differential diagnoses of disorders that show similar presentations. In this study, we aimed to use a powerful method for multi-modal data fusion, known as a multi-kernel learning and performed automatic diagnosis of subjects. We included 30 patients with NMO, 25 patients with MS and 35 healthy volunteers and performed multi-modal imaging with T1-weighted high resolution scans, diffusion tensor imaging (DTI) and resting-state functional MRI (fMRI). In addition, subjects underwent clinical examinations and cognitive assessments. We included 18 a priori predictors from neuroimaging, clinical and cognitive measures in the initial model. We used 10-fold cross-validation to learn the importance of each modality, train and finally test the model performance. The mean accuracy in differentiating between MS and NMO was 88%, where visible white matter lesion load, normal appearing white matter (DTI) and functional connectivity had the most important contributions to the final classification. In a multi-class classification problem we distinguished between all of 3 groups (MS, NMO and healthy controls) with an average accuracy of 84%. In this classification, visible white matter lesion load, functional connectivity, and cognitive scores were the 3 most important modalities. 
    Our work provides preliminary evidence that computational tools can be used to help make an objective differential diagnosis of NMO and MS
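
    The 10-fold cross-validation mentioned in this record can be sketched as an index split. The version below is a plain, unshuffled k-fold partition; whether the study shuffled or stratified its folds is not stated in the abstract, and the function name is a hypothetical one chosen for this illustration.

```python
def k_fold_splits(n_samples: int, k: int):
    """Yield (train_indices, test_indices) for k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)  # spread the remainder
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


# Group sizes reported in the abstract: 30 NMO, 25 MS, 35 healthy volunteers.
n_subjects = 30 + 25 + 35
splits = list(k_fold_splits(n_subjects, 10))
```

    With 90 subjects each fold holds out 9 for testing and trains on the remaining 81; in practice the subject order would usually be shuffled first.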

  17. Classification algorithms with multi-modal data fusion could accurately distinguish neuromyelitis optica from multiple sclerosis

    PubMed Central

    Eshaghi, Arman; Riyahi-Alam, Sadjad; Saeedi, Roghayyeh; Roostaei, Tina; Nazeri, Arash; Aghsaei, Aida; Doosti, Rozita; Ganjgahi, Habib; Bodini, Benedetta; Shakourirad, Ali; Pakravan, Manijeh; Ghana'ati, Hossein; Firouznia, Kavous; Zarei, Mojtaba; Azimi, Amir Reza; Sahraian, Mohammad Ali

    2015-01-01

    Neuromyelitis optica (NMO) exhibits substantial similarities to multiple sclerosis (MS) in clinical manifestations and imaging results and has long been considered a variant of MS. With the advent of a specific biomarker in NMO, known as anti-aquaporin 4, this assumption has changed; however, the differential diagnosis remains challenging and it is still not clear whether a combination of neuroimaging and clinical data could be used to aid clinical decision-making. Computer-aided diagnosis is a rapidly evolving process that holds great promise to facilitate objective differential diagnoses of disorders that show similar presentations. In this study, we aimed to use a powerful method for multi-modal data fusion, known as a multi-kernel learning and performed automatic diagnosis of subjects. We included 30 patients with NMO, 25 patients with MS and 35 healthy volunteers and performed multi-modal imaging with T1-weighted high resolution scans, diffusion tensor imaging (DTI) and resting-state functional MRI (fMRI). In addition, subjects underwent clinical examinations and cognitive assessments. We included 18 a priori predictors from neuroimaging, clinical and cognitive measures in the initial model. We used 10-fold cross-validation to learn the importance of each modality, train and finally test the model performance. The mean accuracy in differentiating between MS and NMO was 88%, where visible white matter lesion load, normal appearing white matter (DTI) and functional connectivity had the most important contributions to the final classification. In a multi-class classification problem we distinguished between all of 3 groups (MS, NMO and healthy controls) with an average accuracy of 84%. In this classification, visible white matter lesion load, functional connectivity, and cognitive scores were the 3 most important modalities. 
    Our work provides preliminary evidence that computational tools can be used to help make an objective differential diagnosis of NMO and MS

  18. Towards a phylogenetic generic classification of Thelypteridaceae: Additional sampling suggests alterations of neotropical taxa and further study of paleotropical genera.

    PubMed

    Almeida, Thaís Elias; Hennequin, Sabine; Schneider, Harald; Smith, Alan R; Batista, João Aguiar Nogueira; Ramalho, Aline Joseph; Proite, Karina; Salino, Alexandre

    2016-01-01

    Thelypteridaceae is one of the largest fern families, having about 950 species and a cosmopolitan distribution but with most species occurring in tropical and subtropical regions. Its generic classification remains controversial, with different authors recognizing from one up to 32 genera. Phylogenetic relationships within the family have not been exhaustively studied, but previous studies have confirmed the monophyly of the lineage. Thus far, sampling has been inadequate for establishing a robust hypothesis of infrafamilial relationships within the family. In order to understand phylogenetic relationships within Thelypteridaceae and thus to improve generic reclassification, we expand the molecular sampling, including new samples of Old World taxa and, especially, many additional neotropical representatives. We also explore the monophyly of exclusively or mostly neotropical genera Amauropelta, Goniopteris, Meniscium, and Steiropteris. Our sampling includes 68 taxa and 134 newly generated sequences from two plastid genomic regions (rps4-trnS and trnL-trnF), plus 73 rps4 and 72 trnL-trnF sequences from GenBank. These data resulted in a concatenated matrix of 1980 molecular characters for 149 taxa. The combined data set was analyzed using maximum parsimony and Bayesian inference of phylogeny. Our results are consistent with the general topological structure found in previous studies, including two main lineages within the family: phegopteroid and thelypteroid. The thelypteroid lineage comprises two clades; one of these included the segregates Metathelypteris, Coryphopteris, and Amauropelta (including part of Parathelypteris), whereas the other comprises all segregates of Cyclosorus s.l., such as Goniopteris, Meniscium, and Steiropteris (including Thelypteris polypodioides, previously incertae sedis). The three mainly neotropical segregates were found to be monophyletic but nested in a broadly defined Cyclosorus. The fourth mainly neotropical segregate, Amauropelta

  19. Use B-factor related features for accurate classification between protein binding interfaces and crystal packing contacts

    PubMed Central

    2014-01-01

    Background: Distinction between true protein interactions and crystal packing contacts is important for structural bioinformatics studies to respond to the need of accurate classification of the rapidly increasing protein structures. There are many unannotated crystal contacts and there also exist false annotations in this rapidly expanding volume of data. Previous tools have been proposed to address this problem. However, challenging issues still remain, such as low performance when the training and test data contain mixed interfaces having diverse sizes of contact areas. Methods and results: The B factor is a measure that quantifies the vibrational motion of an atom, a more relevant feature than interface size for characterizing protein binding. We propose to use three features related to B factor for the classification between biological interfaces and crystal packing contacts. The first feature is the sum of the normalized B factors of the interfacial atoms in the contact area, the second is the average of the interfacial B factor per residue in the chain, and the third is the average number of interfacial atoms with a negative normalized B factor per residue in the chain. We investigate the distribution properties of these basic features and a compound feature on four datasets of biological binding and crystal packing, and on a protein binding-only dataset with known binding affinity. We also compare the cross-dataset classification performance of these features with existing methods and with interface area, a widely used feature and the most effective one. The results demonstrate that our features outperform the interface area approach and the existing prediction methods remarkably for many tests on all of these datasets. Conclusions: The proposed B factor related features are more effective than interface area to distinguish crystal packing from biological binding interfaces. 
    Our computational methods have a potential for large-scale and accurate identification of biological
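
    The three B-factor features listed in this record can be sketched as follows. The abstract does not say how B factors are normalized or whether the second feature uses raw or normalized values, so this sketch assumes a z-score over all atoms of the chain and normalized values throughout. The function name and the toy numbers are hypothetical.

```python
from statistics import mean, pstdev


def b_factor_features(b_factors, interfacial, n_residues):
    """Return the three features for one chain.

    b_factors   : per-atom B factors for the whole chain
    interfacial : per-atom flags, True if the atom lies in the contact area
    n_residues  : number of residues in the chain
    """
    mu = mean(b_factors)
    sigma = pstdev(b_factors)
    if sigma == 0 or n_residues <= 0:
        raise ValueError("need varying B factors and at least one residue")
    # Assumed normalization: z-score over the chain.
    z = [(b - mu) / sigma for b in b_factors]
    iface = [zi for zi, flag in zip(z, interfacial) if flag]

    f1 = sum(iface)                                       # sum over interfacial atoms
    f2 = f1 / n_residues                                  # interfacial B factor per residue
    f3 = sum(1 for zi in iface if zi < 0) / n_residues    # negative-z atoms per residue
    return f1, f2, f3


# Toy chain: four atoms, the two with the lowest B factors at the interface.
f1, f2, f3 = b_factor_features([10.0, 20.0, 30.0, 40.0],
                               [True, True, False, False],
                               n_residues=2)
```

    In this toy case both interfacial atoms are more ordered than the chain average, so the first two features are negative and the third equals 1.0.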

  20. Two fast and accurate heuristic RBF learning rules for data classification.

    PubMed

    Rouhani, Modjtaba; Javan, Dawood S

    2016-03-01

    This paper presents new Radial Basis Function (RBF) learning methods for classification problems. The proposed methods use some heuristics to determine the spreads, the centers and the number of hidden neurons of the network in such a way that higher efficiency is achieved with fewer neurons, while the learning algorithm remains fast and simple. To keep the network size limited, neurons are added to the network recursively until a termination condition is met. Each neuron covers some of the training data. The termination condition is to cover all training data or to reach the maximum number of neurons. In each step, the center and spread of the new neuron are selected based on maximization of its coverage. Maximization of the coverage of the neurons leads to a network with fewer neurons and hence lower VC dimension and better generalization properties. Using the power exponential distribution function as the activation function of the hidden neurons, and in the light of the new learning approaches, it is proved that all data become linearly separable in the space of hidden layer outputs, which implies that there exist linear output layer weights with zero training error. The proposed methods are applied to some well-known datasets and the simulation results, compared with SVM and some other leading RBF learning methods, show their satisfactory and comparable performance. PMID:26797472
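
    The coverage-driven growth loop described in this record can be sketched as a greedy procedure. The abstract does not give the actual heuristics, so this sketch assumes that candidate centers are training points and that a neuron's spread is the distance to the nearest point of a different class; the power exponential activation and the output layer are not modeled. All names are hypothetical.

```python
import math


def greedy_rbf_neurons(points, labels, max_neurons):
    """Add neurons one at a time, each maximizing newly covered training points."""
    uncovered = set(range(len(points)))
    neurons = []  # list of (center, spread, class_label)
    while uncovered and len(neurons) < max_neurons:
        best = None
        for c in sorted(uncovered):
            # Assumed spread: distance to the nearest point of another class.
            other = [math.dist(points[c], points[k])
                     for k in range(len(points)) if labels[k] != labels[c]]
            spread = min(other) if other else math.inf
            covered = {c} | {k for k in uncovered
                             if labels[k] == labels[c]
                             and math.dist(points[c], points[k]) < spread}
            if best is None or len(covered) > len(best[2]):
                best = (c, spread, covered)
        center, spread, covered = best
        neurons.append((points[center], spread, labels[center]))
        uncovered -= covered  # stop when everything is covered
    return neurons


# Toy 2-D data with two well-separated classes (hypothetical).
pts = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0)]
lab = [0, 0, 0, 1, 1]
net = greedy_rbf_neurons(pts, lab, max_neurons=10)
```

    On this toy set two neurons suffice, one per class, which mirrors the stated goal of covering all training data with few hidden units.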

  1. TIPP: taxonomic identification and phylogenetic profiling

    PubMed Central

    Nguyen, Nam-phuong; Mirarab, Siavash; Liu, Bo; Pop, Mihai; Warnow, Tandy

    2014-01-01

    Motivation: Abundance profiling (also called ‘phylogenetic profiling’) is a crucial step in understanding the diversity of a metagenomic sample, and one of the basic techniques used for this is taxonomic identification of the metagenomic reads. Results: We present taxon identification and phylogenetic profiling (TIPP), a new marker-based taxon identification and abundance profiling method. TIPP combines SATé-enabled phylogenetic placement, a phylogenetic placement method, with statistical techniques to control the classification precision and recall, and results in improved abundance profiles. TIPP is highly accurate even in the presence of high indel errors and novel genomes, and matches or improves on previous approaches, including NBC, mOTU, PhymmBL, MetaPhyler and MetaPhlAn. Availability and implementation: Software and supplementary materials are available at http://www.cs.utexas.edu/users/phylo/software/sepp/tipp-submission/. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25359891

  2. A comprehensive multilocus phylogeny of the Neotropical cotingas (Cotingidae, Aves) with a comparative evolutionary analysis of breeding system and plumage dimorphism and a revised phylogenetic classification.

    PubMed

    Berv, Jacob S; Prum, Richard O

    2014-12-01

    The Neotropical cotingas (Cotingidae: Aves) are a group of passerine birds that are characterized by extreme diversity in morphology, ecology, breeding system, and behavior. Here, we present a comprehensive phylogeny of the Neotropical cotingas based on six nuclear and mitochondrial loci (∼7500 bp) for a sample of 61 cotinga species in all 25 genera, and 22 species of suboscine outgroups. Our taxon sample more than doubles the number of cotinga species studied in previous analyses, and allows us to test the monophyly of the cotingas as well as their intrageneric relationships with high resolution. We analyze our genetic data using a Bayesian species tree method, and concatenated Bayesian and maximum likelihood methods, and present a highly supported phylogenetic hypothesis. We confirm the monophyly of the cotingas, and present the first phylogenetic evidence for the relationships of Phibalura flavirostris as the sister group to Ampelion and Doliornis, and the paraphyly of Lipaugus with respect to Tijuca. In addition, we resolve the diverse radiations within the Cotinga, Lipaugus, Pipreola, and Procnias genera. We find no support for Darwin's (1871) hypothesis that the increase in sexual selection associated with polygynous breeding systems drives the evolution of color dimorphism in the cotingas, at least when analyzed at a broad categorical scale. Finally, we present a new comprehensive phylogenetic classification of all cotinga species. PMID:25234241

  3. Phylogenetics, ancestral state reconstruction, and a new infrafamilial classification of the pantropical Ochnaceae (Medusagynaceae, Ochnaceae s.str., Quiinaceae) based on five DNA regions.

    PubMed

    Schneider, Julio V; Bissiengou, Pulcherie; Amaral, Maria do Carmo E; Tahir, Ali; Fay, Michael F; Thines, Marco; Sosef, Marc S M; Zizka, Georg; Chatrou, Lars W

    2014-09-01

    Ochnaceae s.str. (Malpighiales) are a pantropical family of about 500 species and 27 genera of almost exclusively woody plants. Infrafamilial classification and relationships have been controversial partially due to the lack of a robust phylogenetic framework. Including all genera except Indosinia and Perissocarpa and DNA sequence data for five DNA regions (ITS, matK, ndhF, rbcL, trnL-F), we provide for the first time a nearly complete molecular phylogenetic analysis of Ochnaceae s.l. resolving most of the phylogenetic backbone of the family. Based on this, we present a new classification of Ochnaceae s.l., with Medusagynoideae and Quiinoideae included as subfamilies and the former subfamilies Ochnoideae and Sauvagesioideae recognized at the rank of tribe. Our data support a monophyletic Ochneae, but Sauvagesieae in the traditional circumscription is paraphyletic because Testulea emerges as sister to the rest of Ochnoideae, and the next clade shows Luxemburgia+Philacra as sister group to the remaining Ochnoideae. To avoid paraphyly, we classify Luxemburgieae and Testuleeae as new tribes. The African genus Lophira, which has switched between subfamilies (here tribes) in past classifications, emerges as sister to all other Ochneae. Thus, endosperm-free seeds and ovules with partly to completely united integuments (resulting in an apparently single integument) are characters that unite all members of that tribe. The relationships within its largest clade, Ochnineae (former Ochneae), are poorly resolved, but former Ochninae (Brackenridgea, Ochna) are polyphyletic. Within Sauvagesieae, the genus Sauvagesia in its broad circumscription is polyphyletic as Sauvagesia serrata is sister to a clade of Adenarake, Sauvagesia spp., and three other genera. Within Quiinoideae, in contrast to former phylogenetic hypotheses, Lacunaria and Touroulia form a clade that is sister to Quiina. 
Bayesian ancestral state reconstructions showed that zygomorphic flowers with adaptations to buzz

  4. Phylogenetic classification of Escherichia coli O26 strains from human, animals, and environmental origins using nucleotide polymorphisms

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Shiga toxin-producing Escherichia coli (STEC) O26 strains are food-borne pathogens that were recently classified as adulterants in certain beef products. Little is known about their genetic diversity, including whether or not phylogenetic subtypes within the serogroup vary in their assoc...

  5. Revisiting the phylogeny of Bombacoideae (Malvaceae): Novel relationships, morphologically cohesive clades, and a new tribal classification based on multilocus phylogenetic analyses.

    PubMed

    Carvalho-Sobrinho, Jefferson G; Alverson, William S; Alcantara, Suzana; Queiroz, Luciano P; Mota, Aline C; Baum, David A

    2016-08-01

    Bombacoideae (Malvaceae) is a clade of deciduous trees with a marked dominance in many forests, especially in the Neotropics. The historical lack of a well-resolved phylogenetic framework for Bombacoideae hinders studies in this ecologically important group. We reexamined phylogenetic relationships in this clade based on a matrix of 6465 nuclear (ETS, ITS) and plastid (matK, trnL-trnF, trnS-trnG) DNA characters. We used maximum parsimony, maximum likelihood, and Bayesian inference to infer relationships among 108 species (∼70% of the total number of known species). We analyzed the evolution of selected morphological traits: trunk or branch prickles, calyx shape, endocarp type, seed shape, and seed number per fruit, using ML reconstructions of their ancestral states to identify possible synapomorphies for major clades. Novel phylogenetic relationships emerged from our analyses, including three major lineages marked by fruit or seed traits: the winged-seed clade (Bernoullia, Gyranthera, and Huberodendron), the spongy endocarp clade (Adansonia, Aguiaria, Catostemma, Cavanillesia, and Scleronema), and the Kapok clade (Bombax, Ceiba, Eriotheca, Neobuchia, Pachira, Pseudobombax, Rhodognaphalon, and Spirotheca). The Kapok clade, the most diverse lineage of the subfamily, includes sister relationships (i) between Pseudobombax and "Pochota fendleri" a historically incertae sedis taxon, and (ii) between the Paleotropical genera Bombax and Rhodognaphalon, implying just two bombacoid dispersals to the Old World, the other one involving Adansonia. This new phylogenetic framework offers new insights and a promising avenue for further evolutionary studies. In view of this information, we present a new tribal classification of the subfamily, accompanied by an identification key. PMID:27154210

  6. Classification

    NASA Astrophysics Data System (ADS)

    Oza, Nikunj

    2012-03-01

    would represent one sunspot’s classification (y_i) and the corresponding set of measurements (x_i). The output of a supervised learning algorithm is a model h that approximates the unknown mapping from the inputs to the outputs. In our example, h would map from the sunspot measurements to the type of sunspot. We may have a test set S, a set of examples not used in training that we use to test how well the model h predicts the outputs on new examples. Just as with the examples in T, the examples in S are assumed to be independent and identically distributed (i.i.d.) draws from the distribution D. We measure the error of h on the test set as the proportion of test cases that h misclassifies: $\frac{1}{|S|} \sum_{(x,y) \in S} I(h(x) \neq y)$, where I(v) is the indicator function: it returns 1 if v is true and 0 otherwise. In our sunspot classification example, we would identify additional examples of sunspots that were not used in generating the model, and use these to determine how accurate the model is, that is, the fraction of the test samples that the model classifies correctly. An example of a classification model is the decision tree shown in Figure 23.1. We will discuss the decision tree learning algorithm in more detail later; for now, we assume that, given a training set with examples of sunspots, this decision tree is derived. This can be used to classify previously unseen examples of sunspots. For example, if a new sunspot’s inputs indicate that its "Group Length" is in the range 10-15, then the decision tree would classify the sunspot as being of type "E," whereas if the "Group Length" is "NULL," the "Magnetic Type" is "bipolar," and the "Penumbra" is "rudimentary," then it would be classified as type "C." In this chapter, we will add to the above description of classification problems. We will discuss decision trees and several other classification models. In particular, we will discuss the learning algorithms that generate these classification models, how to use them to
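
    The test-set error defined in this abstract, the fraction of examples in S that h misclassifies, can be computed directly. The sketch below is illustrative only: `misclassification_rate` and `toy_h` are hypothetical names, and `toy_h` is a one-rule stand-in based on the "Group Length" example, not the decision tree of Figure 23.1.

```python
from typing import Callable, Dict, Iterable, Tuple

Example = Tuple[Dict[str, float], str]  # (measurements x, true class y)


def misclassification_rate(h: Callable[[Dict[str, float]], str],
                           test_set: Iterable[Example]) -> float:
    """Return (1/|S|) * sum over (x, y) in S of I(h(x) != y)."""
    examples = list(test_set)
    if not examples:
        raise ValueError("the test set S must not be empty")
    errors = sum(1 for x, y in examples if h(x) != y)
    return errors / len(examples)


def toy_h(x: Dict[str, float]) -> str:
    # Hypothetical one-rule model: group length in 10-15 means type "E",
    # everything else is labeled "C" purely for illustration.
    return "E" if 10 <= x["group_length"] <= 15 else "C"


S = [
    ({"group_length": 12}, "E"),
    ({"group_length": 3}, "C"),
    ({"group_length": 14}, "C"),  # the toy model gets this one wrong
    ({"group_length": 20}, "C"),
]
error = misclassification_rate(toy_h, S)
accuracy = 1 - error
```

    Here one of four test examples is misclassified, so the error is 0.25 and the accuracy, the fraction classified correctly, is 0.75.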

  7. When proglottids and scoleces conflict: phylogenetic relationships and a family-level classification of the Lecanicephalidea (Platyhelminthes: Cestoda).

    PubMed

    Jensen, Kirsten; Caira, Janine N; Cielocha, Joanna J; Littlewood, D Timothy J; Waeschenbach, Andrea

    2016-05-01

    This study presents the first comprehensive phylogenetic analysis of the interrelationships of the morphologically diverse elasmobranch-hosted tapeworm order Lecanicephalidea, based on molecular sequence data. With almost half of current generic diversity having been erected or resurrected within the last decade, an apparent conflict between scolex morphology and proglottid anatomy has hampered the assignment of many of these genera to families. Maximum likelihood and Bayesian analyses of two nuclear markers (D1-D3 of lsrDNA and complete ssrDNA) and two mitochondrial markers (partial rrnL and partial cox1) for 61 lecanicephalidean species representing 22 of the 25 valid genera were conducted; new sequence data were generated for 43 species and 11 genera, including three undescribed genera. The monophyly of the order was confirmed in all but the analyses based on cox1 data alone. Sesquipedalapex placed among species of Anteropora and was thus synonymized with the latter genus. Based on analyses of the concatenated dataset, eight major groups emerged which are herein formally recognised at the familial level. Existing family names (i.e., Lecanicephalidae, Polypocephalidae, Tetragonocephalidae, and Cephalobothriidae) are maintained for four of the eight clades, and new families are proposed for the remaining four groups (Aberrapecidae n. fam., Eniochobothriidae n. fam., Paraberrapecidae n. fam., and Zanobatocestidae n. fam.). The four new families and the Tetragonocephalidae are monogeneric, while the Cephalobothriidae, Lecanicephalidae and Polypocephalidae comprise seven, eight and four genera, respectively. As a result of their unusual morphologies, the three genera not included here (i.e., Corrugatocephalum, Healyum and Quadcuspibothrium) are considered incertae sedis within the order until their familial affinities can be examined in more detail. All eight families are newly circumscribed based on morphological features and a key to the families is provided

  8. Classification

    ERIC Educational Resources Information Center

    Clary, Renee; Wandersee, James

    2013-01-01

    In this article, Renee Clary and James Wandersee describe the beginnings of "Classification," which lies at the very heart of science and depends upon pattern recognition. Clary and Wandersee approach patterns by first telling the story of the "Linnaean classification system," introduced by Carl Linnaeus (1707-1778), who is…

  9. A species independent universal bio-detection microarray for pathogen forensics and phylogenetic classification of unknown microorganisms

    PubMed Central

    2011-01-01

    Background The ability to differentiate a bioterrorist attack or an accidental release of a research pathogen from a naturally occurring pandemic or disease event is crucial to the safety and security of this nation by enabling an appropriate and rapid response. It is critical in samples from an infected patient, the environment, or a laboratory to quickly and accurately identify the precise pathogen including natural or engineered variants and to classify new pathogens in relation to those that are known. Current approaches for pathogen detection rely on prior genomic sequence information. Given the enormous spectrum of genetic possibilities, a field deployable, robust technology, such as a universal (any species) microarray has near-term potential to address these needs. Results A new and comprehensive sequence-independent array (Universal Bio-Signature Detection Array) was designed with approximately 373,000 probes. The main feature of this array is that the probes are computationally derived and sequence independent. There is one probe for each possible 9-mer sequence, thus 4^9 (262,144) probes. Each genome hybridized on this array has a unique pattern of signal intensities corresponding to each of these probes. These signal intensities were used to generate an un-biased cluster analysis of signal intensity hybridization patterns that can easily distinguish species into accepted and known phylogenomic relationships. Within limits, the array is highly sensitive and is able to detect synthetically mixed pathogens. Examples of unique hybridization signal intensity patterns are presented for different Brucella species as well as relevant host species and other pathogens. These results demonstrate the utility of the UBDA array as a diagnostic tool in pathogen forensics. Conclusions This pathogen detection system is fast, accurate and can be applied to any species.
Hybridization patterns are unique to a specific genome and these can be used to decipher the identity of
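
    The probe count quoted in this abstract follows from enumerating every 9-mer over the four-letter DNA alphabet. The sketch below checks that arithmetic and, as a loose in-silico analogy only, counts k-mers in a sequence; the function names are hypothetical, and counting k-mers in a string is an assumption made for illustration, since the array itself measures hybridization signal intensities rather than counts.

```python
from itertools import product
from typing import Dict, Iterator

ALPHABET = "ACGT"


def kmer_count(k: int) -> int:
    """Number of distinct k-mers over the DNA alphabet: 4**k."""
    return len(ALPHABET) ** k


def all_kmers(k: int) -> Iterator[str]:
    """Yield every possible k-mer, one per hypothetical probe."""
    for letters in product(ALPHABET, repeat=k):
        yield "".join(letters)


def kmer_profile(sequence: str, k: int) -> Dict[str, int]:
    """Count each k-mer occurring in the sequence (overlapping windows).
    Windows containing letters outside ACGT are skipped."""
    counts: Dict[str, int] = {}
    for start in range(len(sequence) - k + 1):
        window = sequence[start:start + k]
        if set(window) <= set(ALPHABET):
            counts[window] = counts.get(window, 0) + 1
    return counts
```

    For k = 9, `kmer_count` gives 262,144, which agrees with the figure in the abstract; a profile over all 9-mers is therefore a vector of that length, one entry per probe.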

  10. A non-contact method based on multiple signal classification algorithm to reduce the measurement time for accurately heart rate detection.

    PubMed

    Bechet, P; Mitran, R; Munteanu, M

    2013-08-01

    Non-contact methods for the assessment of vital signs are of great interest for specialists due to the benefits obtained in both medical and special applications, such as those for surveillance, monitoring, and search and rescue. This paper investigates the possibility of implementing a digital processing algorithm based on the MUSIC (Multiple Signal Classification) parametric spectral estimation in order to reduce the observation time needed to accurately measure the heart rate. It demonstrates that, by proper dimensioning the signal subspace, the MUSIC algorithm can be optimized in order to accurately assess the heart rate during an 8-28 s time interval. The validation of the processing algorithm performance was achieved by minimizing the mean error of the heart rate after performing simultaneous comparative measurements on several subjects. In order to calculate the error the reference value of heart rate was measured using a classic measurement system through direct contact. PMID:24007088

  11. A non-contact method based on multiple signal classification algorithm to reduce the measurement time for accurately heart rate detection

    NASA Astrophysics Data System (ADS)

    Bechet, P.; Mitran, R.; Munteanu, M.

    2013-08-01

    Non-contact methods for the assessment of vital signs are of great interest for specialists due to the benefits obtained in both medical and special applications, such as those for surveillance, monitoring, and search and rescue. This paper investigates the possibility of implementing a digital processing algorithm based on the MUSIC (Multiple Signal Classification) parametric spectral estimation in order to reduce the observation time needed to accurately measure the heart rate. It demonstrates that, by proper dimensioning the signal subspace, the MUSIC algorithm can be optimized in order to accurately assess the heart rate during an 8-28 s time interval. The validation of the processing algorithm performance was achieved by minimizing the mean error of the heart rate after performing simultaneous comparative measurements on several subjects. In order to calculate the error the reference value of heart rate was measured using a classic measurement system through direct contact.

  12. Molecular phylogenetics of the spider infraorder Mygalomorphae using nuclear rRNA genes (18S and 28S): conflict and agreement with the current system of classification.

    PubMed

    Hedin, Marshal; Bond, Jason E

    2006-11-01

    Mygalomorph spiders, which include the tarantulas, trapdoor spiders, and their kin, represent one of three main spider lineages. Mygalomorphs are currently classified into 15 families, comprising roughly 2500 species and 300 genera. The few published phylogenies of mygalomorph relationships are based exclusively on morphological data and reveal areas of both conflict and congruence, suggesting the need for additional phylogenetic research utilizing new character systems. As part of a larger combined evidence study of global mygalomorph relationships, we have gathered approximately 3.7 kb of rRNA data (18S and 28S) for a sample of 80 genera, representing all 15 mygalomorph families. Taxon sampling was particularly intensive across families that are questionable in composition: Cyrtaucheniidae and Nemesiidae. The following primary results are supported by both Bayesian and parsimony analyses of combined matrices representing multiple 28S alignments: (1) the Atypoidea, a clade that includes the families Atypidae, Antrodiaetidae, and Mecicobothriidae, is recovered as a basal lineage sister to all other mygalomorphs, (2) diplurids and hexathelids form a paraphyletic grade at the base of the non-atypoid clade, but neither family is monophyletic in any of our analyses, (3) a clade consisting of all sampled nemesiids, Microstigmata and the cyrtaucheniid genera Kiama, Acontius, and Fufius is consistently recovered, (4) other sampled cyrtaucheniids are fragmented across three separate clades, including a monophyletic North American Euctenizinae and a South African clade, (5) of the Domiothelina, only idiopids are consistently recovered as monophyletic; ctenizids are polyphyletic and migids are only weakly supported. The Domiothelina is not monophyletic. The molecular results we present are consistent with more recent hypotheses of mygalomorph relationship; however, additional work remains before mygalomorph classification can be formally reassessed with confidence

  13. Is the SIOP-2001 Classification of Renal Tumors of Childhood accurate with regard to prognosis? A problem revisited

    PubMed Central

    Taran, Katarzyna; Młynarski, Wojciech; Sitkiewicz, Anna

    2012-01-01

    Introduction The goal of this study was to analyze morbidity and mortality of Wilms’ tumor based on the revised SIOP-2001 classification. Material and methods Sixty-four patients with unilateral Wilms’ tumor, 33 girls (51.5%) and 31 boys (48.5%), aged 1 to 144 months (mean: 42.8 months) were treated between 1993 and 2009. All patients underwent multimodal therapy according to the SIOP protocols. The follow-up period ranged from 2 to 18 years (mean: 11.6 years). Results Thirty-three patients (51.6%) had intermediate-risk, 6 (9.4%) low-risk and 25 (39%) high-risk tumors. Stage I disease was diagnosed in 28 (43.7%), stage II in 19 (29.7%), stage III in 8 (12.5%) and stage IV in 9 patients (14.1%). Event-free survival (EFS) in the entire group was 78.1% and OS was 92.2%. The EFS in stage IV (44.4%) was significantly lower than in stage I (82.1%, p = 0.04), stage II (89.5%, p = 0.02) and in the entire group (78.1%, p = 0.04). Sixteen complications were observed in 14 children (21.9%); metastases in 7 cases (10.9%), 8 relapses (12.5%) and 5 deaths (7.8%). Blastemal (20/24 – 83.3%) and anaplastic (3/24 – 12.5%) subtypes were responsible for mortality in high-risk tumors (OS – 87.5%), while poorly differentiated epithelial (7/34 – 20.6%) and regressive (8/34 – 23.5%) subtypes decreased OS (94.1%) in the intermediate-risk tumors. Conclusions The results of our study show that epithelial and regressive subtypes were responsible for mortality in the intermediate-risk Wilms’ tumors. PMID:23056081

  14. Classification

    NASA Technical Reports Server (NTRS)

    Oza, Nikunj C.

    2011-01-01

    A supervised learning task involves constructing a mapping from input data (normally described by several features) to the appropriate outputs. Within supervised learning, one type of task is a classification learning task, in which each output is one or more classes to which the input belongs. In supervised learning, a set of training examples---examples with known output values---is used by a learning algorithm to generate a model. This model is intended to approximate the mapping between the inputs and outputs. This model can be used to generate predicted outputs for inputs that have not been seen before. For example, we may have data consisting of observations of sunspots. In a classification learning task, our goal may be to learn to classify sunspots into one of several types. Each example may correspond to one candidate sunspot with various measurements or just an image. A learning algorithm would use the supplied examples to generate a model that approximates the mapping between each supplied set of measurements and the type of sunspot. This model can then be used to classify previously unseen sunspots based on the candidate's measurements. This chapter discusses methods to perform machine learning, with examples involving astronomy.

  15. 16S classifier: a tool for fast and accurate taxonomic classification of 16S rRNA hypervariable regions in metagenomic datasets.

    PubMed

    Chaudhary, Nikhil; Sharma, Ashok K; Agarwal, Piyush; Gupta, Ankit; Sharma, Vineet K

    2015-01-01

    The diversity of microbial species in a metagenomic study is commonly assessed using 16S rRNA gene sequencing. With the rapid developments in genome sequencing technologies, the focus has shifted towards the sequencing of hypervariable regions of 16S rRNA gene instead of full length gene sequencing. Therefore, 16S Classifier is developed using a machine learning method, Random Forest, for faster and accurate taxonomic classification of short hypervariable regions of 16S rRNA sequence. It displayed precision values of up to 0.91 on training datasets and the precision values of up to 0.98 on the test dataset. On real metagenomic datasets, it showed up to 99.7% accuracy at the phylum level and up to 99.0% accuracy at the genus level. 16S Classifier is available freely at http://metagenomics.iiserb.ac.in/16Sclassifier and http://metabiosys.iiserb.ac.in/16Sclassifier. PMID:25646627

  16. Molecular-genetic analysis is essential for accurate classification of renal carcinoma resembling Xp11.2 translocation carcinoma.

    PubMed

    Hayes, Malcolm; Peckova, Kvetoslava; Martinek, Petr; Hora, Milan; Kalusova, Kristyna; Straka, Lubomir; Daum, Ondrej; Kokoskova, Bohuslava; Rotterova, Pavla; Pivovarčikova, Kristyna; Branzovsky, Jindrich; Dubova, Magdalena; Vesela, Pavla; Michal, Michal; Hes, Ondrej

    2015-03-01

    tumours can only be sub-classified accurately by multi-parameter molecular-genetic analysis. PMID:25544614

  17. Phylogenetic Status of an Unrecorded Species of Curvularia, C. spicifera, Based on Current Classification System of Curvularia and Bipolaris Group Using Multi Loci

    PubMed Central

    Jeon, Sun Jeong; Nguyen, Thi Thuong Thuong

    2015-01-01

    A seed-borne fungus, Curvularia sp. EML-KWD01, was isolated from an indigenous wheat seed by standard blotter method. This fungus was characterized based on the morphological characteristics and molecular phylogenetic analysis. Phylogenetic status of the fungus was determined using sequences of three loci: rDNA internal transcribed spacer, large ribosomal subunit, and glyceraldehyde 3-phosphate dehydrogenase gene. Multi loci sequencing analysis revealed that this fungus was Curvularia spicifera within Curvularia group 2 of family Pleosporaceae. PMID:26539036

  18. Say goodbye to tribes in the new house fly classification: A new molecular phylogenetic analysis and an updated biogeographical narrative for the Muscidae (Diptera).

    PubMed

    Haseyama, Kirstern L F; Wiegmann, Brian M; Almeida, Eduardo A B; de Carvalho, Claudio J B

    2015-08-01

    House flies are one of the best known groups of flies and comprise about 5000 species worldwide. Despite over a century of intensive taxonomic research on these flies, classification of the Muscidae is still poorly resolved. Here we brought together the most diverse molecular dataset ever examined for the Muscidae, with 142 species in 67 genera representing all tribes and all biogeographic regions. Four protein coding genes were analyzed: mitochondrial CO1 and nuclear AATS, CAD (region 4) and EF1-α. Maximum likelihood and Bayesian approaches were used to analyze five different partitioning schemes for the alignment. We also used Bayes factors to test monophyly of the traditionally accepted tribes and subfamilies. Most subfamilial taxa were not recovered in our analyses, and accordingly monophyly was rejected by Bayes factor tests. Our analysis consistently found three main clades of Muscidae and so we propose a new classification with only three subfamilies without tribes. Additionally, we provide the first timeframe for the diversification of all major lineages of house flies and examine contemporary biogeographic hypotheses in light of this timeframe. We conclude that the muscid radiation began in the Paleocene to Eocene and is congruent with the final stages of the breakup of Gondwana, which resulted in the complete separation of Antarctica, Australia, and South America. With this newly proposed classification and better understanding of the timing of evolutionary events, we provide new perspectives for integrating morphological and ecological evolutionary understanding of house flies, their taxonomy, phylogeny, and biogeography. PMID:25869937

  19. High-resolution phylogenetic microbial community profiling.

    PubMed

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; Bowman, Brett; Bowers, Robert M; Levy, Asaf; Gies, Esther A; Cheng, Jan-Fang; Copeland, Alex; Klenk, Hans-Peter; Hallam, Steven J; Hugenholtz, Philip; Tringe, Susannah G; Woyke, Tanja

    2016-08-01

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential. PMID:26859772

  20. Large-Scale Phylogenetic Classification of Fungal Chitin Synthases and Identification of a Putative Cell-Wall Metabolism Gene Cluster in Aspergillus Genomes

    PubMed Central

    Pacheco-Arjona, Jose Ramon; Ramirez-Prado, Jorge Humberto

    2014-01-01

    The cell wall is a protective and versatile structure distributed in all fungi. The component responsible for its rigidity is chitin, a product of chitin synthase (Chsp) enzymes. There are seven classes of chitin synthase genes (CHS) and the amount and type encoded in fungal genomes varies considerably from one species to another. Previous Chsp sequence analyses focused on their study as individual units, regardless of genomic context. The identification of blocks of conserved genes between genomes can provide important clues about the interactions and localization of chitin synthases. In the present study, we carried out an in silico search of all putative Chsp encoded in 54 full fungal genomes, encompassing 21 orders from five phyla. Phylogenetic studies of these Chsp were able to confidently classify 347 out of the 369 Chsp identified (94%). Patterns in the distribution of Chsp related to taxonomy were identified, the most prominent being related to the type of fungal growth. More importantly, a synteny analysis for genomic blocks centered on class IV Chsp (the most abundant and widely distributed Chsp class) identified a putative cell wall metabolism gene cluster in members of the genus Aspergillus, the first such association reported for any fungal genome. PMID:25148134

  1. Phylogenetic classification of Escherichia coli O157:H7 strains of human and bovine origin using a novel set of nucleotide polymorphisms

    PubMed Central

    Clawson, Michael L; Keen, James E; Smith, Timothy PL; Durso, Lisa M; McDaneld, Tara G; Mandrell, Robert E; Davis, Margaret A; Bono, James L

    2009-01-01

    Background Cattle are a reservoir of Shiga toxin-producing Escherichia coli O157:H7 (STEC O157), and are known to harbor subtypes not typically found in clinically ill humans. Consequently, nucleotide polymorphisms previously discovered via strains originating from human outbreaks may be restricted in their ability to distinguish STEC O157 genetic subtypes present in cattle. The objectives of this study were firstly to identify nucleotide polymorphisms in a diverse sampling of human and bovine STEC O157 strains, secondly to classify strains of either bovine or human origin by polymorphism-derived genotypes, and finally to compare the genotype diversity with pulsed-field gel electrophoresis (PFGE), a method currently used for assessing STEC O157 diversity. Results High-throughput 454 sequencing of pooled STEC O157 strain DNAs from human clinical cases (n = 91) and cattle (n = 102) identified 16,218 putative polymorphisms. From those, 178 were selected primarily within genomic regions conserved across E. coli serotypes and genotyped in 261 STEC O157 strains. Forty-two unique genotypes were observed that are tagged by a minimal set of 32 polymorphisms. Phylogenetic trees of the genotypes are divided into clades that represent strains of cattle origin, or cattle and human origin. Although PFGE diversity surpassed genotype diversity overall, ten PFGE patterns each occurred with multiple strains having different genotypes. Conclusions Deep sequencing of pooled STEC O157 DNAs proved highly effective in polymorphism discovery. A polymorphism set has been identified that characterizes genetic diversity within STEC O157 strains of bovine origin, and a subset observed in human strains. The set may complement current techniques used to classify strains implicated in disease outbreaks. PMID:19463166

  2. A preliminary phylogenetic analysis of the New World Helopini (Coleoptera, Tenebrionidae, Tenebrioninae) indicates the need for profound rearrangements of the classification

    PubMed Central

    Cifuentes-Ruiz, Paulina; Zaragoza-Caballero, Santiago; Ochoterena-Booth, Helga; Morón, Miguel Ángel

    2014-01-01

    Abstract Helopini is a diverse tribe in the subfamily Tenebrioninae with a worldwide distribution. The New World helopine species have not been reviewed recently and several doubts emerge regarding their generic assignment as well as the naturalness of the tribe and subordinate taxa. To assess these questions, a preliminary cladistic analysis was conducted with emphasis on sampling the genera distributed in the New World, but including representatives from other regions. The parsimony analysis includes 30 ingroup species from America, Europe and Asia of the subtribes Helopina and Cylindrinotina, plus three outgroups, and 67 morphological characters. Construction of the matrix resulted in the discovery of morphological character states not previously reported for the tribe, particularly from the genitalia of New World species. A consensus of the 12 most parsimonious trees supports the monophyly of the tribe based on a unique combination of characters, including one synapomorphy. None of the subtribes or the genera of the New World represented by more than one species (Helops Fabricius, Nautes Pascoe and Tarpela Bates) were recovered as monophyletic. Helopina was recovered as paraphyletic in relation to Cylindrinotina. One Nearctic species of Helops and one Palearctic species of Tarpela (subtribe Helopina) were more closely related to species of Cylindrinotina. A relatively derived clade, mainly composed of Neotropical species, was found; it includes seven species of Tarpela, seven species of Nautes, and three species of Helops, two Nearctic and one Neotropical. Our results reveal the need to deeply re-evaluate the current classification of the tribe and subordinate taxa, but a broader taxon sampling and further character exploration are needed in order to fully recognize monophyletic groups at different taxonomic levels (from subtribes to genera). PMID:25009428

  3. Accurate age classification of 6 and 12 month-old infants based on resting-state functional connectivity magnetic resonance imaging data

    PubMed Central

    Pruett, John R.; Kandala, Sridhar; Hoertel, Sarah; Snyder, Abraham Z.; Elison, Jed T.; Nishino, Tomoyuki; Feczko, Eric; Dosenbach, Nico U.F.; Nardos, Binyam; Power, Jonathan D.; Adeyemo, Babatunde; Botteron, Kelly N.; McKinstry, Robert C.; Evans, Alan C.; Hazlett, Heather C.; Dager, Stephen R.; Paterson, Sarah; Schultz, Robert T.; Collins, D. Louis; Fonov, Vladimir S.; Styner, Martin; Gerig, Guido; Das, Samir; Kostopoulos, Penelope; Constantino, John N.; Estes, Annette M.; Petersen, Steven E.; Schlaggar, Bradley L.; Piven, Joseph

    2015-01-01

    Human large-scale functional brain networks are hypothesized to undergo significant changes over development. Little is known about these functional architectural changes, particularly during the second half of the first year of life. We used multivariate pattern classification of resting-state functional connectivity magnetic resonance imaging (fcMRI) data obtained in an on-going, multi-site, longitudinal study of brain and behavioral development to explore whether fcMRI data contained information sufficient to classify infant age. Analyses carefully account for the effects of fcMRI motion artifact. Support vector machines (SVMs) classified 6 versus 12 month-old infants (128 datasets) above chance based on fcMRI data alone. Results demonstrate significant changes in measures of brain functional organization that coincide with a special period of dramatic change in infant motor, cognitive, and social development. Explorations of the most different correlations used for SVM lead to two different interpretations about functional connections that support 6 versus 12-month age categorization. PMID:25704288

  4. Rapid and accurate taxonomic classification of insect (class Insecta) cytochrome c oxidase subunit 1 (COI) DNA barcode sequences using a naïve Bayesian classifier

    PubMed Central

    Porter, Teresita M; Gibson, Joel F; Shokralla, Shadi; Baird, Donald J; Golding, G Brian; Hajibabaei, Mehrdad

    2014-01-01

    Current methods to identify unknown insect (class Insecta) cytochrome c oxidase (COI barcode) sequences often rely on thresholds of distances that can be difficult to define, sequence similarity cut-offs, or monophyly. Some of the most commonly used metagenomic classification methods do not provide a measure of confidence for the taxonomic assignments they provide. The aim of this study was to use a naïve Bayesian classifier (Wang et al. Applied and Environmental Microbiology, 2007; 73: 5261) to automate taxonomic assignments for large batches of insect COI sequences such as data obtained from high-throughput environmental sequencing. This method provides rank-flexible taxonomic assignments with an associated bootstrap support value, and it is faster than the BLAST-based methods commonly used in environmental sequence surveys. We have developed and rigorously tested the performance of three different training sets using leave-one-out cross-validation, two field data sets, and targeted testing of Lepidoptera, Diptera and Mantodea sequences obtained from the Barcode of Life Data system. We found that type I error rates (incorrect taxonomic assignments with a high bootstrap support) were already relatively low but could be lowered further by ensuring that all query taxa are actually present in the reference database. Choosing bootstrap support cut-offs according to query length and summarizing taxonomic assignments to more inclusive ranks can also help to reduce error while retaining the maximum number of assignments. Additionally, we highlight gaps in the taxonomic and geographic representation of insects in public sequence databases that will require further work by taxonomists to improve the quality of assignments generated using any method.
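
    The word-based naïve Bayesian assignment with bootstrap support described in this abstract can be sketched roughly as follows. This is a minimal illustration in the spirit of the Wang et al. classifier, not the authors' software: the word size, the 1/8 bootstrap subsample, the function names and the toy reference sequences are all assumptions made for the example.

```python
import math
import random
from collections import defaultdict

K = 8  # word (k-mer) size; an assumed value for this sketch

# Toy reference set (made-up sequences, NOT real COI barcodes).
REFERENCES = [
    ("Lepidoptera", "ACGTACGTTGCAAGCTTAGGCTAACG"),
    ("Diptera", "TTGACCGGTAACCGTTAGCAGTCCAT"),
]

def words(seq, k=K):
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def train(references):
    """Count, per taxon and overall, how many references contain each word."""
    word_count = defaultdict(int)                       # refs containing word
    taxon_size = defaultdict(int)                       # refs per taxon
    taxon_word = defaultdict(lambda: defaultdict(int))  # per-taxon word counts
    for taxon, seq in references:
        taxon_size[taxon] += 1
        for w in words(seq):
            word_count[w] += 1
            taxon_word[taxon][w] += 1
    return len(references), word_count, taxon_size, taxon_word

def best_taxon(model, query_words):
    """Return the taxon with the highest summed log word probability."""
    n_total, word_count, taxon_size, taxon_word = model
    best, best_score = None, -math.inf
    for taxon in sorted(taxon_size):
        score = 0.0
        for w in query_words:
            # Smoothed overall word frequency, used as a pseudo-count.
            prior = (word_count.get(w, 0) + 0.5) / (n_total + 1)
            p = (taxon_word[taxon].get(w, 0) + prior) / (taxon_size[taxon] + 1)
            score += math.log(p)
        if score > best_score:
            best, best_score = taxon, score
    return best

def classify(model, query, n_boot=100, seed=0):
    """Assign a taxon and report the fraction of bootstrap replicates agreeing."""
    all_words = sorted(words(query))
    if not all_words:
        raise ValueError("query is shorter than the word size")
    rng = random.Random(seed)
    assignment = best_taxon(model, all_words)
    subset_size = max(1, len(all_words) // K)
    hits = 0
    for _ in range(n_boot):
        # Resample a small subset of the query words with replacement.
        sample = [rng.choice(all_words) for _ in range(subset_size)]
        if best_taxon(model, sample) == assignment:
            hits += 1
    return assignment, hits / n_boot
```

    Calling `classify(train(REFERENCES), query)` returns a `(taxon, support)` pair; a support cut-off chosen according to query length, as the abstract suggests, would then be applied to the second value.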

  5. Fast, Simple and Accurate Handwritten Digit Classification by Training Shallow Neural Network Classifiers with the ‘Extreme Learning Machine’ Algorithm

    PubMed Central

    McDonnell, Mark D.; Tissera, Migel D.; Vladusich, Tony; van Schaik, André; Tapson, Jonathan

    2015-01-01

    Recent advances in training deep (multi-layer) architectures have inspired a renaissance in neural network use. For example, deep convolutional networks are becoming the default option for difficult tasks on large datasets, such as image and speech recognition. However, here we show that error rates below 1% on the MNIST handwritten digit benchmark can be replicated with shallow non-convolutional neural networks. This is achieved by training such networks using the ‘Extreme Learning Machine’ (ELM) approach, which also enables a very rapid training time (∼ 10 minutes). Adding distortions, as is common practise for MNIST, reduces error rates even further. Our methods are also shown to be capable of achieving less than 5.5% error rates on the NORB image database. To achieve these results, we introduce several enhancements to the standard ELM algorithm, which individually and in combination can significantly improve performance. The main innovation is to ensure each hidden-unit operates only on a randomly sized and positioned patch of each image. This form of random ‘receptive field’ sampling of the input ensures the input weight matrix is sparse, with about 90% of weights equal to zero. Furthermore, combining our methods with a small number of iterations of a single-batch backpropagation method can significantly reduce the number of hidden-units required to achieve a particular performance. Our close to state-of-the-art results for MNIST and NORB suggest that the ease of use and accuracy of the ELM algorithm for designing a single-hidden-layer neural network classifier should cause it to be given greater consideration either as a standalone method for simpler problems, or as the final classification stage in deep neural networks applied to more difficult problems. PMID:26262687
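
    The random "receptive field" sampling of input weights mentioned in this abstract can be illustrated with a short sketch. It shows only how each hidden unit might be restricted to a randomly sized and positioned image patch; it does not train an ELM. The patch-size distribution, weight range and function names are assumptions for illustration, so the resulting sparsity will not necessarily match the roughly 90% reported by the authors.

```python
import random

def receptive_field_weights(n_hidden, height, width, min_size=3, seed=0):
    """Build an input weight matrix (one row per hidden unit) in which each
    row is non-zero only inside a random rectangular patch of the image."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n_hidden):
        patch_h = rng.randint(min_size, height)   # random patch height
        patch_w = rng.randint(min_size, width)    # random patch width
        top = rng.randint(0, height - patch_h)    # random patch position
        left = rng.randint(0, width - patch_w)
        row = [0.0] * (height * width)
        for r in range(top, top + patch_h):
            for c in range(left, left + patch_w):
                row[r * width + c] = rng.uniform(-1.0, 1.0)
        rows.append(row)
    return rows

def sparsity(weights):
    """Fraction of weights that are exactly zero."""
    total = sum(len(row) for row in weights)
    zeros = sum(1 for row in weights for v in row if v == 0.0)
    return zeros / total
```

    For 28 x 28 MNIST-sized inputs, `receptive_field_weights(50, 28, 28)` gives a 50 x 784 matrix; drawing smaller patches (an assumption one could tune) pushes `sparsity` toward the higher values the abstract describes.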

  6. Complete chloroplast genome of the genus Cymbidium: lights into the species identification, phylogenetic implications and population genetic analyses

    PubMed Central

    2013-01-01

    Background Cymbidium orchids, comprising some 50 species, are well-known flowers with high commercial value in the floricultural industry. Furthermore, the value of different orchids varies greatly. However, species identification is very difficult. To a certain degree, chloroplast DNA sequence data are a versatile tool for species identification and phylogenetic inference in plants. Different chloroplast loci have been utilized for evaluating phylogenetic relationships at each classification level among plant species, including at the interspecies and intraspecies levels. However, there is no evidence that a short sequence can distinguish all plant species from each other in order to infer phylogenetic relationships. Molecular markers derived from the complete chloroplast genome can provide effective tools for species identification and phylogenetic resolution. Results The complete chloroplast (cp) genome sequences of eight individuals from a total of five Cymbidium species were determined by Illumina sequencing of total DNA, using a combination of de novo and reference-guided assembly. The length of the Cymbidium cp genome is about 155 kb. The cp genomes contain 123 unique genes, and the IR regions contain 24 duplicates. Although the genomes, including genome structure, gene order and orientation, are similar to those of other orchids, they are not evolutionarily conserved. The cp genome of Cymbidium evolved moderately with more than 3% sequence divergence, which could provide enough information for phylogeny. Rapidly evolving chloroplast genome regions were identified and 11 new divergence hotspot regions were disclosed for further phylogenetic study and species identification in Orchidaceae. Conclusions Phylogenomic analyses were conducted using 10 complete chloroplast genomes from seven orchid species. These data accurately identified the individuals and established the phylogenetic relationships between

  7. A genus-level classification of the family Thraupidae (Class Aves: Order Passeriformes).

    PubMed

    Burns, Kevin J; Unitt, Philip; Mason, Nicholas A

    2016-01-01

    The tanagers (Thraupidae) are a major component of the Neotropical avifauna, and vary in plumage colors, behaviors, morphologies, and ecologies. Globally, they represent nearly 4% of all avian species and are the largest family of songbirds. However, many currently used tanager genera are not monophyletic, based on analyses of molecular data that have accumulated over the past 25 years. Current genus-level classifications of tanagers have not been revised according to newly documented relationships of tanagers for various reasons: 1) the lack of a comprehensive phylogeny, 2) reluctance to lump existing genera into larger groups, and 3) the lack of available names for newly defined smaller groups. Here, we present two alternative classifications based on a newly published comprehensive phylogeny of tanagers. One of these classifications uses existing generic names, but defines them broadly. The other, which we advocate and follow here, provides new generic names for more narrowly defined groups. Under the latter, we propose eleven new genera (Asemospiza, Islerothraupis, Maschalethraupis, Chrysocorypha, Kleinothraupis, Castanozoster, Ephippiospingus, Chionodacryon, Pseudosaltator, Poecilostreptus, Stilpnia), and resurrect several generic names to form monophyletic taxa. Either of these classifications would allow taxonomic authorities to reconcile classification with current understanding of tanager phylogenetic relationships. Having a more phylogenetically accurate classification for tanagers will facilitate the study and conservation of this important Neotropical radiation of songbirds. PMID:27394344

  8. Phylogenetic Inference From Conserved sites Alignments

    SciTech Connect

    Grundy, W.N.; Naylor, G.J.P.

    1999-08-15

    Molecular sequences provide a rich source of data for inferring the phylogenetic relationships among species. However, recent work indicates that even an accurate multiple alignment of a large sequence set may yield an incorrect phylogeny and that the quality of the phylogenetic tree improves when the input consists only of the highly conserved, motif regions of the alignment. This work introduces two methods of producing multiple alignments that include only the conserved regions of the initial alignment. The first method retains conserved motifs, whereas the second retains individual conserved sites in the initial alignment. Using parsimony analysis on a mitochondrial data set containing 19 species among which the phylogenetic relationships are widely accepted, both conserved alignment methods produce better phylogenetic trees than the complete alignment. Unlike any of the 19 inference methods used before to analyze this data, both methods produce trees that are completely consistent with the known phylogeny. The motif-based method employs far fewer alignment sites for comparable error rates. For a larger data set containing mitochondrial sequences from 39 species, the site-based method produces a phylogenetic tree that is largely consistent with known phylogenetic relationships and suggests several novel placements.
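
    The site-based idea in this abstract, keeping only conserved columns of an initial multiple alignment, can be sketched as below. The abstract does not state how conservation was scored, so the majority-fraction criterion, the default threshold and the function names here are assumptions for illustration only.

```python
from collections import Counter

def conserved_columns(alignment, min_fraction=0.75, gap="-"):
    """Return indices of columns whose most common non-gap residue occurs
    in at least min_fraction of the sequences."""
    n_seqs = len(alignment)
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("aligned sequences must have equal length")
    keep = []
    for col in range(length):
        counts = Counter(seq[col] for seq in alignment)
        counts.pop(gap, None)  # gaps never count as the conserved residue
        if counts and max(counts.values()) / n_seqs >= min_fraction:
            keep.append(col)
    return keep

def reduce_alignment(alignment, columns):
    """Project every sequence onto the selected columns."""
    return ["".join(seq[c] for c in columns) for seq in alignment]
```

    A threshold below 1.0 is used deliberately: columns that are identical in every sequence carry no information for parsimony, so a reduced alignment intended for tree building needs sites that are conserved but still vary.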

  9. The evolution of HPV by means of a phylogenetic study.

    PubMed

    Isea, Raúl; Chaves, Juan L; Montes, Esther; Rubio-Montero, Antonio J; Mayo, Rafael

    2009-01-01

    In this work we demonstrate, for the Human papillomavirus (HPV) case, the adequacy of revising classification systems on the basis of molecular phylogenetic calculations that allow an arbitrary number of taxa and take advantage of high-performance computing platforms. To do so, we have analysed several phylogenetic trees calculated with the PhyloGrid tool, a workflow developed in the framework of the EELA-2 Project. PMID:19593062

  10. Phylogenetic lineages in Entomophthoromycota

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Entomophthoromycota Humber is one of five major phylogenetic lineages among the former phylum Zygomycota. These early terrestrial fungi share evolutionarily ancestral characters such as coenocytic mycelium and gametangiogamy as a sexual process resulting in zygospore formation. Previous molecular st...

  11. Phylogenetic Analysis of Salmonella, Shigella, and Escherichia coli Strains on the Basis of the gyrB Gene Sequence

    PubMed Central

    Fukushima, Masao; Kakinuma, Kenichi; Kawaguchi, Ryuji

    2002-01-01

    Phylogenetic analysis of about 200 strains of Salmonella, Shigella, and Escherichia coli was carried out using the nucleotide sequence of the gene for DNA gyrase B (gyrB), which was determined by directly sequencing PCR fragments. The results establish a new phylogenetic tree for the classification of Salmonella, Shigella, and Escherichia coli in which Salmonella forms a cluster separate from but closely related to Shigella and E. coli. In comparison with 16S rRNA analysis, the gyrB sequences indicated a greater evolutionary divergence for the bacteria. Thus, in screening for the presence of bacteria, the gyrB gene might be a useful tool for differentiating between closely related species of bacteria such as Shigella spp. and E. coli. At present, 16S rRNA sequence analysis is an accurate and rapid method for identifying most unknown bacteria to the genus level because the highly conserved 16S rRNA region is easy to amplify; however, analysis of the more variable gyrB sequence region can identify unknown bacteria to the species level. In summary, we have shown that gyrB sequence analysis is a useful alternative to 16S rRNA analysis for constructing the phylogenetic relationships of bacteria, in particular for the classification of closely related bacterial species. PMID:12149329

  12. Phylogenetic relationships among arecoid palms (Arecaceae: Arecoideae)

    PubMed Central

    Baker, William J.; Norup, Maria V.; Clarkson, James J.; Couvreur, Thomas L. P.; Dowe, John L.; Lewis, Carl E.; Pintaud, Jean-Christophe; Savolainen, Vincent; Wilmot, Tomas; Chase, Mark W.

    2011-01-01

    Background and Aims The Arecoideae is the largest and most diverse of the five subfamilies of palms (Arecaceae/Palmae), containing >50 % of the species in the family. Despite its importance, phylogenetic relationships among Arecoideae are poorly understood. Here the most densely sampled phylogenetic analysis of Arecoideae available to date is presented. The results are used to test the current classification of the subfamily and to identify priority areas for future research. Methods DNA sequence data for the low-copy nuclear genes PRK and RPB2 were collected from 190 palm species, covering 103 (96 %) genera of Arecoideae. The data were analysed using the parsimony ratchet, maximum likelihood, and both likelihood and parsimony bootstrapping. Key Results and Conclusions Despite the recovery of paralogues and pseudogenes in a small number of taxa, PRK and RPB2 were both highly informative, producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. Simultaneous analyses of the combined data sets provided additional resolution and support. Two areas of incongruence between PRK and RPB2 were strongly supported by the bootstrap relating to the placement of tribes Chamaedoreeae, Iriarteeae and Reinhardtieae; the causes of this incongruence remain uncertain. The current classification within Arecoideae was strongly supported by the present data. Of the 14 tribes and 14 sub-tribes in the classification, only five sub-tribes from tribe Areceae (Basseliniinae, Linospadicinae, Oncospermatinae, Rhopalostylidinae and Verschaffeltiinae) failed to receive support. Three major higher level clades were strongly supported: (1) the RRC clade (Roystoneeae, Reinhardtieae and Cocoseae), (2) the POS clade (Podococceae, Oranieae and Sclerospermeae) and (3) the core arecoid clade (Areceae, Euterpeae, Geonomateae, Leopoldinieae, Manicarieae and Pelagodoxeae). However, new data sources are required to elucidate ambiguities that remain in phylogenetic

  13. Phyloproteomics: What Phylogenetic Analysis Reveals about Serum Proteomics

    PubMed Central

    Abu-Asab, Mones; Chaouchi, Mohamed; Amri, Hakima

    2008-01-01

    Phyloproteomics is a novel analytical tool that solves the issue of comparability between proteomic analyses, utilizes a total spectrum-parsing algorithm, and produces biologically meaningful classification of specimens. Phyloproteomics employs two algorithms: a new parsing algorithm (UNIPAL) and a phylogenetic algorithm (MIX). By outgroup comparison, the parsing algorithm identifies novel or vanished MS peaks and peaks signifying up or down regulated proteins and scores them as derived or ancestral. The phylogenetic algorithm uses the latter scores to produce a biologically meaningful classification of the specimens. PMID:16944935

  14. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing

    PubMed Central

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J.; O'Donnell, Kerry; Geiser, David M.; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  15. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing.

    PubMed

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  16. Phylogenetic reconstruction of the wolf spiders (Araneae: Lycosidae) using sequences from the 12S rRNA, 28S rRNA, and NADH1 genes: implications for classification, biogeography, and the evolution of web building behavior.

    PubMed

    Murphy, Nicholas P; Framenau, Volker W; Donnellan, Stephen C; Harvey, Mark S; Park, Yung-Chul; Austin, Andrew D

    2006-03-01

    Current knowledge of the evolutionary relationships amongst the wolf spiders (Araneae: Lycosidae) is based on assessment of morphological similarity or phylogenetic analysis of a small number of taxa. In order to enhance the current understanding of lycosid relationships, phylogenies of 70 lycosid species were reconstructed by parsimony and Bayesian methods using three molecular markers; the mitochondrial genes 12S rRNA, NADH1, and the nuclear gene 28S rRNA. The resultant trees from the mitochondrial markers were used to assess the current taxonomic status of the Lycosidae and to assess the evolutionary history of sheet-web construction in the group. The results suggest that a number of genera are not monophyletic, including Lycosa, Arctosa, Alopecosa, and Artoria. At the subfamilial level, the status of Pardosinae needs to be re-assessed, and the position of a number of genera within their respective subfamilies is in doubt (e.g., Hippasa and Arctosa in Lycosinae and Xerolycosa, Aulonia and Hygrolycosa in Venoniinae). In addition, a major clade of strictly Australasian taxa may require the creation of a new subfamily. The analysis of sheet-web building in Lycosidae revealed that the interpretation of this trait as an ancestral state relies on two factors: (1) an asymmetrical model favoring the loss of sheet-webs and (2) that the suspended silken tube of Pirata is directly descended from sheet-web building. Paralogous copies of the nuclear 28S rRNA gene were sequenced, confounding the interpretation of the phylogenetic analysis and suggesting that a cautionary approach should be taken to the further use of this gene for lycosid phylogenetic analysis. PMID:16503280

  17. The Phylogenetic Likelihood Library

    PubMed Central

    Flouri, T.; Izquierdo-Carrasco, F.; Darriba, D.; Aberer, A.J.; Nguyen, L.-T.; Minh, B.Q.; Von Haeseler, A.; Stamatakis, A.

    2015-01-01

    We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2–10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL). PMID:25358969
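
    The numerical scaling mentioned in this abstract addresses a general problem: products of many small per-site or per-node probabilities underflow double precision. The sketch below shows the generic rescale-and-count idea only. It is not PLL or BEAGLE code, and the threshold of 2^-256 and the function name are illustrative assumptions.

```python
import math

SCALE_THRESHOLD = 2.0 ** -256  # rescale once the running product drops below this
SCALE_FACTOR = 2.0 ** 256      # power of two, so rescaling is exact in binary

def scaled_log_product(values):
    """Return log(prod(values)) for positive values without underflow,
    by rescaling the running product and counting the rescalings."""
    product = 1.0
    n_scalings = 0
    for v in values:
        product *= v
        # Multiply back up whenever the product gets dangerously small.
        while 0.0 < product < SCALE_THRESHOLD:
            product *= SCALE_FACTOR
            n_scalings += 1
    # Undo the scalings in log space, where no underflow can occur.
    return math.log(product) - n_scalings * math.log(SCALE_FACTOR)
```

    Multiplying 400 values of 1e-3 directly yields 0.0 in double precision, whereas `scaled_log_product` recovers the log of the product. The sketch assumes every input is positive and no single factor is small enough to underflow on its own.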

  18. Phylogenetically resolving epidemiologic linkage

    PubMed Central

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-01-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  19. Phylogenetically resolving epidemiologic linkage

    DOE PAGES Beta

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-02-22

    The use of phylogenetic trees in epidemiological investigations has become commonplace, but their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. Moreover, we confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results.

  20. The phylogenetic likelihood library.

    PubMed

    Flouri, T; Izquierdo-Carrasco, F; Darriba, D; Aberer, A J; Nguyen, L-T; Minh, B Q; Von Haeseler, A; Stamatakis, A

    2015-03-01

    We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2-10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL). PMID:25358969

  1. Phylogenetically resolving epidemiologic linkage.

    PubMed

    Romero-Severson, Ethan O; Bulla, Ingo; Leitner, Thomas

    2016-03-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals' HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  2. Relaxed Phylogenetics and Dating with Confidence

    PubMed Central

    Ho, Simon Y. W; Phillips, Matthew J

    2006-01-01

    In phylogenetics, the unrooted model of phylogeny and the strict molecular clock model are two extremes of a continuum. Despite their dominance in phylogenetic inference, it is evident that both are biologically unrealistic and that the real evolutionary process lies between these two extremes. Fortunately, intermediate models employing relaxed molecular clocks have been described. These models open the gate to a new field of “relaxed phylogenetics.” Here we introduce a new approach to performing relaxed phylogenetic analysis. We describe how it can be used to estimate phylogenies and divergence times in the face of uncertainty in evolutionary rates and calibration times. Our approach also provides a means for measuring the clocklikeness of datasets and comparing this measure between different genes and phylogenies. We find no significant rate autocorrelation among branches in three large datasets, suggesting that autocorrelated models are not necessarily suitable for these data. In addition, we place these datasets on the continuum of clocklikeness between a strict molecular clock and the alternative unrooted extreme. Finally, we present analyses of 102 bacterial, 106 yeast, 61 plant, 99 metazoan, and 500 primate alignments. From these we conclude that our method is phylogenetically more accurate and precise than the traditional unrooted model while adding the ability to infer a timescale to evolution. PMID:16683862

  3. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes

    PubMed Central

    Segata, Nicola; Börnigen, Daniela; Morgan, Xochitl C.; Huttenhower, Curtis

    2013-01-01

    New microbial genomes are constantly being sequenced, and it is crucial to accurately determine their taxonomic identities and evolutionary relationships. Here we report PhyloPhlAn, a new method to assign microbial phylogeny and putative taxonomy using >400 proteins optimized from among 3,737 genomes. This method measures the sequence diversity of all clades, classifies genomes from deep-branching candidate divisions through closely-related subspecies, and improves consistency between phylogenetic and taxonomic groupings. PhyloPhlAn improved taxonomic accuracy for existing and newly-sequenced genomes, detecting 157 erroneous labels, correcting 46, and placing or refining 130 new genomes. We provide examples of accurate classifications from subspecies (Sulfolobus spp.) to phyla, and of preliminary rooting of deep-branching candidate divisions, including consistent statistical support for Caldiserica (formerly candidate division OP5). PhyloPhlAn will thus be useful for both phylogenetic assessment and taxonomic quality control of newly-sequenced genomes. The final phylogenies, conserved protein sequences, and open-source implementation are available online. PMID:23942190

  4. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes.

    PubMed

    Segata, Nicola; Börnigen, Daniela; Morgan, Xochitl C; Huttenhower, Curtis

    2013-01-01

    New microbial genomes are constantly being sequenced, and it is crucial to accurately determine their taxonomic identities and evolutionary relationships. Here we report PhyloPhlAn, a new method to assign microbial phylogeny and putative taxonomy using >400 proteins optimized from among 3,737 genomes. This method measures the sequence diversity of all clades, classifies genomes from deep-branching candidate divisions through closely related subspecies and improves consistency between phylogenetic and taxonomic groupings. PhyloPhlAn improved taxonomic accuracy for existing and newly sequenced genomes, detecting 157 erroneous labels, correcting 46 and placing or refining 130 new genomes. We provide examples of accurate classifications from subspecies (Sulfolobus spp.) to phyla, and of preliminary rooting of deep-branching candidate divisions, including consistent statistical support for Caldiserica (formerly candidate division OP5). PhyloPhlAn will thus be useful for both phylogenetic assessment and taxonomic quality control of newly sequenced genomes. The final phylogenies, conserved protein sequences and open-source implementation are available online. PMID:23942190

  5. A Universal Phylogenetic Tree.

    ERIC Educational Resources Information Center

    Offner, Susan

    2001-01-01

    Presents a universal phylogenetic tree suitable for use in high school and college-level biology classrooms. Illustrates the antiquity of life and that all life is related, even if it dates back 3.5 billion years. Reflects important evolutionary relationships and provides an exciting way to learn about the history of life. (SAH)

  6. Host specificity and phylogenetic relationships of chicken and turkey parvoviruses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Previous reports indicate that the newly discovered chicken parvoviruses (ChPV) and turkey parvoviruses (TuPV) are very similar to each other, yet they represent different species within a new genus of Parvoviridae. Currently, strain classification is based on the phylogenetic analysis of a 561 bas...

  7. Charles Darwin, beetles and phylogenetics.

    PubMed

    Beutel, Rolf G; Friedrich, Frank; Leschen, Richard A B

    2009-11-01

    Here, we review Charles Darwin's relation to beetles and developments in coleopteran systematics in the last two centuries. Darwin was an enthusiastic beetle collector. He used beetles to illustrate different evolutionary phenomena in his major works, and astonishingly, an entire sub-chapter is dedicated to beetles in "The Descent of Man". During his voyage on the Beagle, Darwin was impressed by the high diversity of beetles in the tropics, and he remarked that, to his surprise, the majority of species were small and inconspicuous. However, despite his obvious interest in the group, he did not get involved in beetle taxonomy, and his theoretical work had little immediate impact on beetle classification. The development of taxonomy and classification in the late nineteenth and earlier twentieth century was mainly characterised by the exploration of new character systems (e.g. larval features and wing venation). In the mid-twentieth century, Hennig's new methodology to group lineages by derived characters revolutionised systematics of Coleoptera and other organisms. As envisioned by Darwin and Ernst Haeckel, the new Hennigian approach enabled systematists to establish classifications truly reflecting evolution. Roy A. Crowson and Howard E. Hinton, who both made tremendous contributions to coleopterology, had an ambivalent attitude towards the Hennigian ideas. The Mickoleit school combined detailed anatomical work with a classical Hennigian character evaluation, with stepwise tree building, comparatively few characters and a priori polarity assessment without explicit use of the outgroup comparison method. The rise of cladistic methods in the 1970s had a strong impact on beetle systematics. Cladistic computer programs facilitated parsimony analyses of large data matrices, mostly morphological characters not requiring detailed anatomical investigations. Molecular studies on beetle phylogeny started in the 1990s with modest taxon sampling and limited DNA data. 
This has

  8. Charles Darwin, beetles and phylogenetics

    NASA Astrophysics Data System (ADS)

    Beutel, Rolf G.; Friedrich, Frank; Leschen, Richard A. B.

    2009-11-01

    Here, we review Charles Darwin’s relation to beetles and developments in coleopteran systematics in the last two centuries. Darwin was an enthusiastic beetle collector. He used beetles to illustrate different evolutionary phenomena in his major works, and astonishingly, an entire sub-chapter is dedicated to beetles in “The Descent of Man”. During his voyage on the Beagle, Darwin was impressed by the high diversity of beetles in the tropics, and he remarked that, to his surprise, the majority of species were small and inconspicuous. However, despite his obvious interest in the group, he did not get involved in beetle taxonomy, and his theoretical work had little immediate impact on beetle classification. The development of taxonomy and classification in the late nineteenth and earlier twentieth century was mainly characterised by the exploration of new character systems (e.g. larval features and wing venation). In the mid-twentieth century, Hennig’s new methodology to group lineages by derived characters revolutionised systematics of Coleoptera and other organisms. As envisioned by Darwin and Ernst Haeckel, the new Hennigian approach enabled systematists to establish classifications truly reflecting evolution. Roy A. Crowson and Howard E. Hinton, who both made tremendous contributions to coleopterology, had an ambivalent attitude towards the Hennigian ideas. The Mickoleit school combined detailed anatomical work with a classical Hennigian character evaluation, with stepwise tree building, comparatively few characters and a priori polarity assessment without explicit use of the outgroup comparison method. The rise of cladistic methods in the 1970s had a strong impact on beetle systematics. Cladistic computer programs facilitated parsimony analyses of large data matrices, mostly morphological characters not requiring detailed anatomical investigations. Molecular studies on beetle phylogeny started in the 1990s with modest taxon sampling and limited DNA data

  9. Canonical phylogenetic ordination.

    PubMed

    Giannini, Norberto P

    2003-10-01

    A phylogenetic comparative method is proposed for estimating historical effects on comparative data using the partitions that compose a cladogram, i.e., its monophyletic groups. Two basic matrices, Y and X, are defined in the context of an ordinary linear model. Y contains the comparative data measured over t taxa. X consists of an initial tree matrix that contains all the xj monophyletic groups (each coded separately as a binary indicator variable) of the phylogenetic tree available for those taxa. The method seeks to define the subset of groups, i.e., a reduced tree matrix, that best explains the patterns in Y. This definition is accomplished via regression or canonical ordination (depending on the dimensionality of Y) coupled with Monte Carlo permutations. It is argued here that unrestricted permutations (i.e., under an equiprobable model) are valid for testing this specific kind of groupwise hypothesis. Phylogeny is either partialled out or, more properly, incorporated into the analysis in the form of component variation. Direct extensions allow for testing ecomorphological data controlled by phylogeny in a variation partitioning approach. Currently available statistical techniques make this method applicable under most univariate/multivariate models and metrics; two-way phylogenetic effects can be estimated as well. The simplest case (univariate Y), tested with simulations, yielded acceptable type I error rates. Applications presented include examples from evolutionary ethology, ecology, and ecomorphology. Results showed that the new technique detected previously overlooked variation clearly associated with phylogeny and that many phylogenetic effects on comparative data may occur at particular groups rather than across the entire tree. PMID:14530135
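
    The tree matrix X described in this abstract can be sketched as follows: each monophyletic group of the cladogram becomes one binary indicator column, with a 1 for every taxon that belongs to the group. The nested-tuple tree encoding, the function names and the decision to drop the all-inclusive root group (which would be a constant column) are assumptions made for illustration, not details taken from the paper.

```python
def clades(tree):
    """Return (leaf names under `tree`, list of clades of its internal nodes).

    A leaf is a taxon name (str); an internal node is a tuple of subtrees.
    """
    if isinstance(tree, str):
        return [tree], []
    leaves, groups = [], []
    for child in tree:
        child_leaves, child_groups = clades(child)
        leaves.extend(child_leaves)
        groups.extend(child_groups)
    groups.append(frozenset(leaves))
    return leaves, groups

def tree_matrix(tree):
    """Build a taxa-by-groups 0/1 matrix, one column per monophyletic group."""
    taxa, groups = clades(tree)
    # Assumption: the root group contains every taxon, so it is left out.
    groups = [g for g in groups if len(g) < len(taxa)]
    matrix = [[1 if taxon in g else 0 for g in groups] for taxon in taxa]
    return taxa, groups, matrix

# Hypothetical five-taxon cladogram (((a, b), c), (d, e))
TREE = ((("a", "b"), "c"), ("d", "e"))
taxa, groups, X = tree_matrix(TREE)
for taxon, row in zip(taxa, X):
    print(taxon, row)
```

    In this toy case the columns correspond to the groups {a, b}, {a, b, c} and {d, e}. Selecting the subset of columns that best explains Y, and the Monte Carlo permutation testing, are not shown here.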

  10. Refuting phylogenetic relationships

    PubMed Central

    Bucknam, James; Boucher, Yan; Bapteste, Eric

    2006-01-01

    Background Phylogenetic methods are philosophically grounded, and so can be philosophically biased in ways that limit explanatory power. This constitutes an important methodologic dimension not often taken into account. Here we address this dimension in the context of concatenation approaches to phylogeny. Results We discuss some of the limits of a methodology restricted to verificationism, the philosophy on which gene concatenation practices generally rely. As an alternative, we describe a software which identifies and focuses on impossible or refuted relationships, through a simple analysis of bootstrap bipartitions, followed by multivariate statistical analyses. We show how refuting phylogenetic relationships could in principle facilitate systematics. We also apply our method to the study of two complex phylogenies: the phylogeny of the archaea and the phylogeny of the core of genes shared by all life forms. While many groups are rejected, our results left open a possible proximity of N. equitans and the Methanopyrales, of the Archaea and the Cyanobacteria, and as well the possible grouping of the Methanobacteriales/Methanococcales and Thermoplasmatales, of the Spirochaetes and the Actinobacteria and of the Proteobacteria and Firmicutes. Conclusion It is sometimes easier (and preferable) to decide which species do not group together than which ones do. When possible topologies are limited, identifying local relationships that are rejected may be a useful alternative to classical concatenation approaches aiming to find a globally resolved tree on the basis of weak phylogenetic markers. Reviewers This article was reviewed by Mark Ragan, Eugene V Koonin and J Peter Gogarten. PMID:16956399

  11. The revised classification of eukaryotes

    PubMed Central

    Adl, Sina M.; Simpson, Alastair. G.; Lane, Christopher E.; Lukeš, Julius; Bass, David; Bowser, Samuel S.; Brown, Matt; Burki, Fabien; Dunthorn, Micah; Hampl, Vladimir; Heiss, Aaron; Hoppenrath, Mona; Lara, Enrique; leGall, Line; Lynn, Denis H.; McManus, Hilary; Mitchell, Edward A. D.; Mozley-Stanridge, Sharon E.; Parfrey, Laura Wegener; Pawlowski, Jan; Rueckert, Sonja; Shadwick, Lora; Schoch, Conrad; Smirnov, Alexey; Spiegel, Frederick W.

    2012-01-01

    This revision of the classification of eukaryotes, which updates that of Adl et al. (2005), retains an emphasis on the protists and incorporates changes since 2005 that have resolved nodes and branches in phylogenetic trees. Whereas the previous revision was successful in re-introducing name stability to the classification, this revision provides a classification for lineages that were then still unresolved. The supergroups have withstood phylogenetic hypothesis testing with some modifications, but despite some progress, problematic nodes at the base of the eukaryotic tree still remain to be statistically resolved. Looking forward, subsequent transformations to our understanding of the diversity of life will be from the discovery of novel lineages in previously under-sampled areas and from environmental genomic information. PMID:23020233

  12. Associations of Leaf Spectra with Genetic and Phylogenetic Variation in Oaks: Prospects for Remote Detection of Biodiversity

    DOE PAGESBeta

    Cavender-Bares, Jeannine; Meireles, Jose; Couture, John; Kaproth, Matthew; Kingdon, Clayton; Singh, Aditya; Serbin, Shawn; Center, Alyson; Zuniga, Esau; Pilz, George; et al

    2016-03-09

    Species and phylogenetic lineages have evolved to differ in the way that they acquire and deploy resources, with consequences for their physiological, chemical and structural attributes, many of which can be detected using spectral reflectance from leaves. Recent technological advances for assessing optical properties of plants offer opportunities to detect functional traits of organisms and differentiate levels of biological organization across the tree of life. We connect leaf-level full range spectral data (400–2400 nm) of leaves to the hierarchical organization of plant diversity within the oak genus (Quercus) using field and greenhouse experiments in which environmental factors and plant age are controlled. We show that spectral data significantly differentiate populations within a species and that spectral similarity is significantly associated with phylogenetic similarity among species. Furthermore, we show that hyperspectral information allows more accurate classification of taxa than spectrally-derived traits, which by definition are of lower dimensionality. Finally, model accuracy increases at higher levels in the hierarchical organization of plant diversity, such that we are able to better distinguish clades than species or populations. This pattern supports an evolutionary explanation for the degree of optical differentiation among plants and demonstrates potential for remote detection of genetic and phylogenetic diversity.

  13. Phylogenetic Comparative Assembly

    NASA Astrophysics Data System (ADS)

    Husemann, Peter; Stoye, Jens

    Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several contigs remain which cannot be assembled any further. It is still costly and time consuming to close all the gaps in order to acquire the whole genomic sequence. Here we propose an algorithm that takes several related genomes and their phylogenetic relationships into account to create a contig adjacency graph. From this a layout graph can be computed which indicates putative adjacencies of the contigs in order to aid biologists in finishing the complete genomic sequence.

  14. ClassyFlu: classification of influenza A viruses with Discriminatively trained profile-HMMs.

    PubMed

    Van der Auwera, Sandra; Bulla, Ingo; Ziller, Mario; Pohlmann, Anne; Harder, Timm; Stanke, Mario

    2014-01-01

    Accurate and rapid characterization of influenza A virus (IAV) hemagglutinin (HA) and neuraminidase (NA) sequences with respect to subtype and clade is at the basis of extended diagnostic services and implicit to molecular epidemiologic studies. ClassyFlu is a new tool and web service for the classification of IAV sequences of the HA and NA gene into subtypes and phylogenetic clades using discriminatively trained profile hidden Markov models (HMMs), one for each subtype or clade. ClassyFlu merely requires as input unaligned, full-length or partial HA or NA DNA sequences. It enables rapid and highly accurate assignment of HA sequences to subtypes H1-H17 but particularly focusses on the finer grained assignment of sequences of highly pathogenic avian influenza viruses of subtype H5N1 according to the cladistics proposed by the H5N1 Evolution Working Group. NA sequences are classified into subtypes N1-N10. ClassyFlu was compared to semiautomatic classification approaches using BLAST and phylogenetics and additionally for H5 sequences to the new "Highly Pathogenic H5N1 Clade Classification Tool" (IRD-CT) proposed by the Influenza Research Database. Our results show that both web tools (ClassyFlu and IRD-CT), although based on different methods, are nearly equivalent in performance and both are more accurate and faster than semiautomatic classification. A retraining of ClassyFlu to altered cladistics as well as an extension of ClassyFlu to other IAV genome segments or fragments thereof is undemanding. This is exemplified by unambiguous assignment to a distinct cluster within subtype H7 of sequences of H7N9 viruses which emerged in China early in 2013 and caused more than 130 human infections. http://bioinf.uni-greifswald.de/ClassyFlu is a free web service. For local execution, the ClassyFlu source code in PERL is freely available. PMID:24404173

  15. Phenotypic and phylogenetic characterization of an abamectin-degrading bacterial strain isolated from a citrus orchard.

    PubMed

    Ali, Shinawar Waseem; Yu, Fang-Bo; Haider, Muhammad Saleem; Yan, Xin; Li, Shun-Peng

    2013-01-01

    Bacterial strain GB-01 was isolated from abamectin-contaminated soils by continuous enrichment culture. The preliminary identification of strain GB-01 as a Burkholderia species was based mainly on simple biochemical and substrate utilization tests; however, these tests alone cannot accurately differentiate all the species within the genus Burkholderia. The strain GB-01 was subjected to taxonomic analysis through a polyphasic approach, in which phenotypic, genotypic, and phylogenetic information was gathered to conclude the classification of this microbe. Phenotypic information comes from basic bacteriological tests and substrate utilization patterns using the Biolog GN2 MicroPlating system and automated miniature biochemical test kits, i.e. API 20 NE, ID 32 GN and API 50 CH, as well as analyzing the whole cell fatty acid profile. Genotypic information was gathered from whole genome DNA base composition (G+C mol%), and DNA-DNA hybridization with its closest species, while phylogenetic information was collected from the comparative analysis of 16S rRNA and recA gene sequences. The results of polyphasic analysis concluded that strain GB-01 is an atypical strain of the Burkholderia diffusa species. PMID:23863292

  16. Entanglement, Invariants, and Phylogenetics

    NASA Astrophysics Data System (ADS)

    Sumner, J. G.

    2007-10-01

    This thesis develops and expands upon known techniques of mathematical physics relevant to the analysis of the popular Markov model of phylogenetic trees required in biology to reconstruct the evolutionary relationships of taxonomic units from biomolecular sequence data. The techniques of mathematical physics are plethora and have been developed for some time. The Markov model of phylogenetics and its analysis is a relatively new technique where most progress to date has been achieved by using discrete mathematics. This thesis takes a group theoretical approach to the problem by beginning with a remarkable mathematical parallel to the process of scattering in particle physics. This is shown to equate to branching events in the evolutionary history of molecular units. The major technical result of this thesis is the derivation of existence proofs and computational techniques for calculating polynomial group invariant functions on a multi-linear space where the group action is that relevant to a Markovian time evolution. The practical results of this thesis are an extended analysis of the use of invariant functions in distance based methods and the presentation of a new reconstruction technique for quartet trees which is consistent with the most general Markov model of sequence evolution.

  17. Phylogenetic trees in bioinformatics

    SciTech Connect

    Burr, Tom L

    2008-01-01

    Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use of probabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length, and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that finding the best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.

  18. CREST--classification resources for environmental sequence tags.

    PubMed

    Lanzén, Anders; Jørgensen, Steffen L; Huson, Daniel H; Gorfer, Markus; Grindhaug, Svenn Helge; Jonassen, Inge; Øvreås, Lise; Urich, Tim

    2012-01-01

    Sequencing of taxonomic or phylogenetic markers is becoming a fast and efficient method for studying environmental microbial communities. This has resulted in a steadily growing collection of marker sequences, most notably of the small-subunit (SSU) ribosomal RNA gene, and an increased understanding of microbial phylogeny, diversity and community composition patterns. However, to utilize these large datasets together with new sequencing technologies, a reliable and flexible system for taxonomic classification is critical. We developed CREST (Classification Resources for Environmental Sequence Tags), a set of resources and tools for generating and utilizing custom taxonomies and reference datasets for classification of environmental sequences. CREST uses an alignment-based classification method with the lowest common ancestor algorithm. It also uses explicit rank similarity criteria to reduce false positives and identify novel taxa. We implemented this method in a web server, a command line tool and the graphical user interfaced program MEGAN. Further, we provide the SSU rRNA reference database and taxonomy SilvaMod, derived from the publicly available SILVA SSURef, for classification of sequences from bacteria, archaea and eukaryotes. Using cross-validation and environmental datasets, we compared the performance of CREST and SilvaMod to the RDP Classifier. We also utilized Greengenes as a reference database, both with CREST and the RDP Classifier. These analyses indicate that CREST performs better than alignment-free methods with higher recall rate (sensitivity) as well as precision, and with the ability to accurately identify most sequences from novel taxa. Classification using SilvaMod performed better than with Greengenes, particularly when applied to environmental sequences. CREST is freely available under a GNU General Public License (v3) from http://apps.cbu.uib.no/crest and http://lcaclassifier.googlecode.com. PMID:23145153
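
    The lowest common ancestor step named in this abstract can be sketched in a few lines: a query sequence is assigned to the deepest taxon shared by all of its best-scoring reference hits. The toy taxonomy, the function names and the `top_fraction` cutoff are hypothetical and are not taken from CREST; the rank-specific similarity criteria the abstract mentions are not implemented here.

```python
# Hypothetical toy taxonomy stored as child -> parent.
TAXONOMY = {
    "Bacteria": "root",
    "Proteobacteria": "Bacteria",
    "Gammaproteobacteria": "Proteobacteria",
    "Escherichia": "Gammaproteobacteria",
    "Salmonella": "Gammaproteobacteria",
    "Firmicutes": "Bacteria",
    "Bacillus": "Firmicutes",
}

def lineage(taxon):
    """Return the path from the root down to `taxon`."""
    path = [taxon]
    while path[-1] != "root":
        path.append(TAXONOMY[path[-1]])
    return path[::-1]

def lowest_common_ancestor(taxa):
    """Return the deepest taxon shared by the lineages of all `taxa`."""
    lca = "root"
    for ranks in zip(*(lineage(t) for t in taxa)):
        if len(set(ranks)) != 1:
            break
        lca = ranks[0]
    return lca

def classify(hits, top_fraction=0.98):
    """Assign a query from (taxon, alignment score) hits.

    Only hits scoring within `top_fraction` of the best score are kept
    (an assumed cutoff), and the query is placed at their LCA.
    """
    best = max(score for _, score in hits)
    kept = [taxon for taxon, score in hits if score >= top_fraction * best]
    return lowest_common_ancestor(kept)

print(classify([("Escherichia", 100), ("Salmonella", 99), ("Bacillus", 60)]))
```

    With these toy hits the two near-equal best matches disagree at genus level, so the query is placed one step higher, at Gammaproteobacteria, while the much weaker Bacillus hit is ignored.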

  19. The Phylogenetic Diversity of Metagenomes

    PubMed Central

    Kembel, Steven W.; Eisen, Jonathan A.; Pollard, Katherine S.; Green, Jessica L.

    2011-01-01

    Phylogenetic diversity—patterns of phylogenetic relatedness among organisms in ecological communities—provides important insights into the mechanisms underlying community assembly. Studies that measure phylogenetic diversity in microbial communities have primarily been limited to a single marker gene approach, using the small subunit of the rRNA gene (SSU-rRNA) to quantify phylogenetic relationships among microbial taxa. In this study, we present an approach for inferring phylogenetic relationships among microorganisms based on the random metagenomic sequencing of DNA fragments. To overcome challenges caused by the fragmentary nature of metagenomic data, we leveraged fully sequenced bacterial genomes as a scaffold to enable inference of phylogenetic relationships among metagenomic sequences from multiple phylogenetic marker gene families. The resulting metagenomic phylogeny can be used to quantify the phylogenetic diversity of microbial communities based on metagenomic data sets. We applied this method to understand patterns of microbial phylogenetic diversity and community assembly along an oceanic depth gradient, and compared our findings to previous studies of this gradient using SSU-rRNA gene and metagenomic analyses. Bacterial phylogenetic diversity was highest at intermediate depths beneath the ocean surface, whereas taxonomic diversity (diversity measured by binning sequences into taxonomically similar groups) showed no relationship with depth. Phylogenetic diversity estimates based on the SSU-rRNA gene and the multi-gene metagenomic phylogeny were broadly concordant, suggesting that our approach will be applicable to other metagenomic data sets for which corresponding SSU-rRNA gene sequences are unavailable. Our approach opens up the possibility of using metagenomic data to study microbial diversity in a phylogenetic context. PMID:21912589

  20. Modeling body size evolution in Felidae under alternative phylogenetic hypotheses

    PubMed Central

    2009-01-01

    The use of phylogenetic comparative methods in ecological research has advanced during the last twenty years, mainly due to accurate phylogenetic reconstructions based on molecular data and computational and statistical advances. We used phylogenetic correlograms and phylogenetic eigenvector regression (PVR) to model body size evolution in 35 worldwide Felidae (Mammalia, Carnivora) species using two alternative phylogenies and published body size data. The purpose was not to contrast the phylogenetic hypotheses but to evaluate how analyses of body size evolution patterns can be affected by the phylogeny used for comparative analyses (CA). Both phylogenies produced a strong phylogenetic pattern, with closely related species having similar body sizes and the similarity decreasing with increasing distances in time. The PVR explained 65% to 67% of body size variation and all Moran's I values for the PVR residuals were non-significant, indicating that both these models explained phylogenetic structures in trait variation. Even though our results did not suggest that any phylogeny can be used for CA with the same power, or that “good” phylogenies are unnecessary for the correct interpretation of the evolutionary dynamics of ecological, biogeographical, physiological or behavioral patterns, it does suggest that developments in CA can, and indeed should, proceed without waiting for perfect and fully resolved phylogenies. PMID:21637664

  1. Insights into the evolution of sorbitol metabolism: phylogenetic analysis of SDR196C family

    PubMed Central

    2012-01-01

    Background Short chain dehydrogenases/reductases (SDR) are NAD(P)(H)-dependent oxidoreductases with a highly conserved 3D structure and of an early origin, which has allowed them to diverge into several families and enzymatic activities. The SDR196C family (http://www.sdr-enzymes.org) groups bacterial sorbitol dehydrogenases (SDH), which are of great industrial interest. In this study, we examine the phylogenetic relationship between the members of this family, and based on the findings and some sequence conserved blocks, a new and a more accurate classification is proposed. Results The distribution of the 66 bacterial SDH species analyzed was limited to Gram-negative bacteria. Six different bacterial families were found, encompassing α-, β- and γ-proteobacteria. This broad distribution in terms of bacteria and niches agrees with that of SDR, which are found in all forms of life. A cluster analysis of sorbitol dehydrogenase revealed different types of gene organization, although with a common pattern in which the SDH gene is surrounded by sugar ABC transporter proteins, another SDR, a kinase, and several gene regulators. According to the obtained trees, six different lineages and three sublineages can be discerned. The phylogenetic analysis also suggested two different origins for SDH in β-proteobacteria and four origins for γ-proteobacteria. Finally, this subdivision was further confirmed by the differences observed in the sequence of the conserved blocks described for SDR and some specific blocks of SDH, and by a functional divergence analysis, which made it possible to establish new consensus sequences and specific fingerprints for the lineages and sub lineages. Conclusion SDH distribution agrees with that observed for SDR, indicating the importance of the polyol metabolism, as an alternative source of carbon and energy. The phylogenetic analysis pointed to six clearly defined lineages and three sub lineages, and great variability in the origin of this gene

  2. Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

    PubMed

    Barr, W Andrew; Scott, Robert S

    2014-04-01

    In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, <5.6% of DFAs and <8.26% of PGLS analyses were significant. When characters contained phylogenetic signal and actual habitats were used, 33.27 to 45.07% of DFAs and <13.09% of PGLS analyses were significant. False Discovery Rate (FDR) corrections for multiple PGLS analyses reduced the rate of significance to <4.64%. In all cases using actual habitats and characters with phylogenetic signal, correct classification rates of DFAs exceeded random chance. In simulations involving phylogenetic signal in both predictor variables and predicted categories, PGLS with FDR was rarely significant, while DFA often was. In short, DFA offered no indication that differences between categories might be explained by phylogenetic signal, while PGLS did. As such, PGLS provides a valuable tool for testing the functional hypotheses at the heart of ecomorphology. PMID:24382658
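
    The False Discovery Rate correction mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not say which FDR procedure was used, so the Benjamini-Hochberg step-up procedure is assumed here; the function name, the example p-values, and the 0.05 level are all hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return one boolean per p-value: True where the test is declared
    significant under the Benjamini-Hochberg step-up procedure."""
    m = len(p_values)
    if m == 0:
        return []
    # Indices of the p-values, sorted from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_rank = rank
    # Reject every hypothesis whose rank is at most max_rank.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


# Hypothetical p-values from ten per-character tests.
example_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
uncorrected = sum(p <= 0.05 for p in example_p)
corrected = sum(benjamini_hochberg(example_p))
print(uncorrected, corrected)
```

    In this made-up example, five tests pass an uncorrected 0.05 threshold but only two survive the correction, which is the kind of reduction in the rate of significance the abstract reports.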

  3. Quantitative developmental data in a phylogenetic framework.

    PubMed

    Giannini, Norberto Pedro

    2014-12-01

    Following the embryonic period of organogenesis, most development is allometric growth, which is thought to produce most of the evolutionary morphological divergence between related species. Bivariate or multivariate coefficients of allometry are used to describe quantitative developmental data and are comparable across taxa; as such, these coefficients are amenable to direct treatment in a phylogenetic framework. Mapping of actual allometric coefficients onto phylogenetic trees is supported on the basis of the evolving nature of growth programs and the type of character (continuous) that they represent. This procedure depicts evolutionary allometry accurately and allows for the generation of reliable reconstructions of ancestral allometry, as shown here with a previously published case study on rodent cranial ontogeny. Results reconstructed the signature allometric patterns of rodents to the root of the phylogeny, which could be traced back into a (minimum) Paleocene age. Both character and statistical dependence need to be addressed, so this approach can be integrated with phylogenetic comparative methods that deal with those issues. It is shown that, in this particular sample of rodents, common ancestry explains little allometric variation given the level of divergence present within, and convergence between, major rodent lineages. Furthermore, all that variation is independent of body mass. Thus, from an evolutionary perspective, allometry appears to have a strong functional and likely adaptive basis. PMID:25130201

  4. The evolution of HIV: Inferences using phylogenetics

    PubMed Central

    Castro-Nallar, Eduardo; Pérez-Losada, Marcos; Burton, Gregory F.; Crandall, Keith A.

    2011-01-01

    Molecular phylogenetics has revolutionized the study of not only evolution but also disparate fields such as genomics, bioinformatics, epidemiology, ecology, microbiology, molecular biology and biochemistry. Particularly significant are its achievements in population genetics as a result of the development of coalescent theory, which have contributed to more accurate model-based parameter estimation and explicit hypothesis testing. The study of the evolution of many microorganisms, and HIV in particular, has benefited from these new methodologies. HIV is well suited for such sophisticated population analyses because of its large population sizes, short generation times, high substitution rates and relatively small genomes. All these factors make HIV an ideal and fascinating model to study molecular evolution in real time. Here we review the significant advances made in HIV evolution through the application of phylogenetic approaches. We first examine the relative roles of mutation and recombination on the molecular evolution of HIV and its adaptive response to drug therapy and tissue allocation. We then review some of the fundamental questions in HIV evolution in relation to its origin and diversification and describe some of the insights gained using phylogenies. Finally, we show how phylogenetic analysis has advanced our knowledge of HIV dynamics (i.e., phylodynamics). PMID:22138161

  5. Photometric brown-dwarf classification. II. A homogeneous sample of 1361 L and T dwarfs brighter than J = 17.5 with accurate spectral types

    NASA Astrophysics Data System (ADS)

    Skrzypek, N.; Warren, S. J.; Faherty, J. K.

    2016-04-01

    We present a homogeneous sample of 1361 L and T dwarfs brighter than J = 17.5 (of which 998 are new), from an effective area of 3070 deg², classified by the photo-type method to an accuracy of one spectral sub-type using izYJHKW1W2 photometry from SDSS+UKIDSS+WISE. Other than a small bias in the early L types, the sample is shown to be effectively complete to the magnitude limit, for all spectral types L0 to T8. The nature of the bias is an incompleteness estimated at 3% because peculiar blue L dwarfs of type L4 and earlier are classified late M. There is a corresponding overcompleteness because peculiar red (likely young) late M dwarfs are classified early L. Contamination of the sample is confirmed to be small: so far spectroscopy has been obtained for 19 sources in the catalogue and all are confirmed to be ultracool dwarfs. We provide coordinates and izYJHKW1W2 photometry of all sources. We identify an apparent discontinuity, Δm ~ 0.4 mag, in the Y - K colour between spectral types L7 and L8. We present near-infrared spectra of nine sources identified by photo-type as peculiar, including a new low-gravity source ULAS J005505.68+013436.0, with spectroscopic classification L2γ. We provide revised izYJHKW1W2 template colours for late M dwarfs, types M7 to M9. The catalogue is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (ftp://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/589/A49

  6. Quartets and unrooted phylogenetic networks.

    PubMed

    Gambette, Philippe; Berry, Vincent; Paul, Christophe

    2012-08-01

    Phylogenetic networks were introduced to describe evolution in the presence of exchanges of genetic material between coexisting species or individuals. Split networks in particular were introduced as a special kind of abstract network to visualize conflicts between phylogenetic trees which may correspond to such exchanges. More recently, methods were designed to reconstruct explicit phylogenetic networks (whose vertices can be interpreted as biological events) from triplet data. In this article, we link abstract and explicit networks through their combinatorial properties, by introducing the unrooted analog of level-k networks. In particular, we give an equivalence theorem between circular split systems and unrooted level-1 networks. We also show how to adapt to quartets some existing results on triplets, in order to reconstruct unrooted level-k phylogenetic networks. These results give an interesting perspective on the combinatorics of phylogenetic networks and also raise algorithmic and combinatorial questions. PMID:22809417

  7. Bayesian phylogenetic estimation of fossil ages.

    PubMed

    Drummond, Alexei J; Stadler, Tanja

    2016-07-19

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue 'Dating species divergences using

  8. Bayesian phylogenetic estimation of fossil ages

    PubMed Central

    Drummond, Alexei J.; Stadler, Tanja

    2016-01-01

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth–death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the ‘morphological clock’, and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue ‘Dating species divergences

  9. A revision of infrageneric classification in Astelia Banks & Sol. ex R.Br. (Asteliaceae)

    PubMed Central

    Birch, Joanne L.

    2015-01-01

    Systematic investigations and phylogenetic analyses have indicated that Astelia, as currently circumscribed, is paraphyletic, with Collospermum nested within it. Further, Astelia subgenus Astelia is polyphyletic, and Astelia subgenera Asteliopsis and Tricella are paraphyletic, as currently circumscribed. Revision of the subgeneric classification of Astelia is warranted to ensure classification accurately reflects the evolutionary history of these taxa. Collospermum is relegated to synonymy within Astelia. Astelia is dioecious or polygamodioecious, with a superior ovary, anthers dorsi- or basifixed, pistillodes or pistils that have a single short or poorly defined style, a 3-lobed stigma, and fleshy uni- or trilocular fruit with funicular hairs that are poorly to well developed. Astelia subgenus Collospermum (Skottsb.) Birch is described. A key to Astelia sections is provided. Astelia hastata Colenso, Astelia montana Seem., and Astelia microsperma Colenso pro parte are resurrected and the new combination Astelia samoense (Skottsb.) Birch, comb. nov. is made. PMID:26312037

  10. Classification Options

    ERIC Educational Resources Information Center

    Exceptional Children, 1978

    1978-01-01

    The interview presents opinions of Nicholas Hobbs on the classification of exceptional children, including topics such as ecologically oriented classification systems, the role of parents, and need for revision of teacher preparation programs. (IM)

  11. High-resolution phylogenetic microbial community profiling

    SciTech Connect

    Singer, Esther; Coleman-Derr, Devin; Bowman, Brett; Schwientek, Patrick; Clum, Alicia; Copeland, Alex; Ciobanu, Doina; Cheng, Jan-Fang; Gies, Esther; Hallam, Steve; Tringe, Susannah; Woyke, Tanja

    2014-03-17

    The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance our knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full-length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large percentage of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.

  12. Phylogenetics and the Human Microbiome

    PubMed Central

    Matsen, Frederick A.

    2015-01-01

    The human microbiome is the ensemble of genes in the microbes that live inside and on the surface of humans. Because microbial sequencing information is now much easier to come by than phenotypic information, there has been an explosion of sequencing and genetic analysis of microbiome samples. Much of the analytical work for these sequences involves phylogenetics, at least indirectly, but methodology has developed in a somewhat different direction than for other applications of phylogenetics. In this article, I review the field and its methods from the perspective of a phylogeneticist, as well as describing current challenges for phylogenetics coming from this type of work. PMID:25102857

  13. [Foundations of the new phylogenetics].

    PubMed

    Pavlinov, I Ia

    2004-01-01

    The idea of evolution is the core of modern biology. Because of this, phylogenetics, which deals with historical reconstructions in biology, holds a priority position among biological disciplines. The second half of the 20th century witnessed growing interest in phylogenetic reconstructions at the macrotaxonomic level, which replaced the microevolutionary studies that dominated during the 1930s-1960s. This meant a shift from population thinking to phylogenetic thinking, but it was not a revival of classical phylogenetics; rather, a new approach emerged that was named the New Phylogenetics. It arose from the merging of three disciplines that had developed independently during the 1960s-1970s, namely cladistics, numerical phyletics, and molecular phylogenetics (now basically genophyletics). Thus, the new phylogenetics can be defined as a branch of evolutionary biology aimed at elaborating "parsimonious" cladistic hypotheses by means of numerical methods on the basis of mostly molecular data. Classical phylogenetics, the historical predecessor of the new one, emerged from the natural-philosophical worldview, which included a superorganismal idea of biota. According to that view, historical development (phylogeny) was thought to be analogous to individual development (ontogeny), so its most basic features were progressive parallel developments of "parts" (taxa), supplemented with the Darwinian concept of monophyly. Two predominant traditions diverged within classical phylogenetics according to their interpretation of the relation between these concepts. One of them (Cope, Severtzow) belittled monophyly and paid most attention to progressive parallel developments of morphological traits. Such an attitude turned this kind of phylogenetics into semogenetics, dealing primarily with the evolution of structures and not of taxa. Another tradition (Haeckel) considered both monophyletic and parallel origins of taxa jointly: in the middle of the 20th century it was split into

  14. A phylogenetic analysis of the myxobacteria: basis for their classification

    NASA Technical Reports Server (NTRS)

    Shimkets, L.; Woese, C. R.

    1992-01-01

    The primary sequence and secondary structural features of the 16S rRNA were compared for 12 different myxobacteria representing all the known cultivated genera. Analysis of these data shows the myxobacteria to form a monophyletic grouping consisting of three distinct families, which lies within the delta subdivision of the purple bacterial phylum. The composition of the families is consistent with differences in cell and spore morphology, cell behavior, and pigment and secondary metabolite production but is not correlated with the morphological complexity of the fruiting bodies. The Nannocystis exedens lineage has evolved at an unusually rapid pace and its rRNA shows numerous primary and secondary structural idiosyncrasies.

  15. Phylogenetic Studies and Modern Classification of the Pyraloidea (Lepidoptera)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pyraloidea, the third largest superfamily of the Lepidoptera, is comprised of two families - Pyralidae and Crambidae. The history of families previously placed in the Pyraloidea is discussed. The group now includes about 16,000 species worldwide. Morphologically, the superfamily is defined by a b...

  16. Community Phylogenetics: Assessing Tree Reconstruction Methods and the Utility of DNA Barcodes

    PubMed Central

    Boyle, Elizabeth E.; Adamowicz, Sarah J.

    2015-01-01

    Studies examining phylogenetic community structure have become increasingly prevalent, yet little attention has been given to the influence of the input phylogeny on metrics that describe phylogenetic patterns of co-occurrence. Here, we examine the influence of branch length, tree reconstruction method, and amount of sequence data on measures of phylogenetic community structure, as well as the phylogenetic signal (Pagel’s λ) in morphological traits, using Trichoptera larval communities from Churchill, Manitoba, Canada. We find that model-based tree reconstruction methods and the use of a backbone family-level phylogeny improve estimations of phylogenetic community structure. In addition, trees built using the barcode region of cytochrome c oxidase subunit I (COI) alone accurately predict metrics of phylogenetic community structure obtained from a multi-gene phylogeny. Input tree did not alter overall conclusions drawn for phylogenetic signal, as significant phylogenetic structure was detected in two body size traits across input trees. As the discipline of community phylogenetics continues to expand, it is important to investigate the best approaches to accurately estimate patterns. Our results suggest that emerging large datasets of DNA barcode sequences provide a vast resource for studying the structure of biological communities. PMID:26110886

  17. Molecular systematics of Volvocales (Chlorophyceae, Chlorophyta) based on exhaustive 18S rRNA phylogenetic analyses.

    PubMed

    Nakada, Takashi; Misawa, Kazuharu; Nozaki, Hisayoshi

    2008-07-01

    The taxonomy of Volvocales (Chlorophyceae, Chlorophyta) was traditionally based solely on morphological characteristics. However, because recent molecular phylogeny largely contradicts the traditional subordinal and familial classifications, no classification system has yet been established that describes the subdivision of Volvocales in a manner consistent with the phylogenetic relationships. Towards development of a natural classification system at and above the generic level, identification and sorting of hundreds of sequences based on subjective phylogenetic definitions is a significant step. We constructed an 18S rRNA gene phylogeny based on 449 volvocalean sequences collected using exhaustive BLAST searches of the GenBank database. Many chimeric sequences, which can cause fallacious phylogenetic trees, were detected and excluded during data collection. The results revealed 21 strongly supported primary clades within phylogenetically redefined Volvocales. Phylogenetic classification following PhyloCode was proposed based on the presented 18S rRNA gene phylogeny along with the results of previous combined 18S and 26S rRNA and chloroplast multigene analyses. PMID:18430591

  18. Phylogenetic lineages in Pseudocercospora

    PubMed Central

    Crous, P.W.; Braun, U.; Hunter, G.C.; Wingfield, M.J.; Verkley, G.J.M.; Shin, H.-D.; Nakashima, C.; Groenewald, J.Z.

    2013-01-01

    Pseudocercospora is a large cosmopolitan genus of plant pathogenic fungi that are commonly associated with leaf and fruit spots as well as blights on a wide range of plant hosts. They occur in arid as well as wet environments and in a wide range of climates including cool temperate, sub-tropical and tropical regions. Pseudocercospora is now treated as a genus in its own right, although formerly recognised as either an anamorphic state of Mycosphaerella or having mycosphaerella-like teleomorphs. The aim of this study was to sequence the partial 28S nuclear ribosomal RNA gene of a selected set of isolates to resolve phylogenetic generic limits within the Pseudocercospora complex. From these data, 14 clades are recognised, six of which cluster in Mycosphaerellaceae. Pseudocercospora s. str. represents a distinct clade, sister to Passalora eucalypti, and a clade representing the genera Scolecostigmina, Trochophora and Pallidocercospora gen. nov., taxa formerly accommodated in the Mycosphaerella heimii complex and characterised by smooth, pale brown conidia, as well as the formation of red crystals in agar media. Other clades in Mycosphaerellaceae include Sonderhenia, Microcyclosporella, and Paracercospora. Pseudocercosporella resides in a large clade along with Phloeospora, Miuraea, Cercospora and Septoria. Additional clades represent Dissoconiaceae, Teratosphaeriaceae, Cladosporiaceae, and the genera Xenostigmina, Strelitziana, Cyphellophora and Thedgonia. The genus Phaeomycocentrospora is introduced to accommodate Mycocentrospora cantuariensis, primarily distinguished from Pseudocercospora based on its hyaline hyphae, broad conidiogenous loci and hila. Host specificity was considered for 146 species of Pseudocercospora occurring on 115 host genera from 33 countries. Partial nucleotide sequence data for three gene loci, ITS, EF-1α, and ACT suggest that the majority of these species are host specific. Species identified on the basis of host, symptomatology and general

  19. The phylogenetic utility of chloroplast and nuclear DNA markers and the phylogeny of the Rubiaceae tribe Spermacoceae.

    PubMed

    Kårehed, Jesper; Groeninckx, Inge; Dessein, Steven; Motley, Timothy J; Bremer, Birgitta

    2008-12-01

    The phylogenetic utility of chloroplast (atpB-rbcL, petD, rps16, trnL-F) and nuclear (ETS, ITS) DNA regions was investigated for the tribe Spermacoceae of the coffee family (Rubiaceae). ITS was, despite often raised cautions about its utility at higher taxonomic levels, shown to provide the highest number of parsimony informative characters; in partitioned Bayesian analyses it yielded the fewest trees in the 95% credible set, resolved the highest proportion of well resolved clades, and was the most accurate region as measured by the partition metric and the proportion of correctly resolved clades (well supported clades retrieved from a combined analysis regarded as "true"). For Hedyotis, the nuclear 5S-NTS was shown to be potentially as useful as ITS, despite its shorter sequence length. The most phylogenetically informative chloroplast region was the petD group II intron. We also present a phylogeny of Spermacoceae based on a Bayesian analysis of the four chloroplast regions, ITS, and ETS combined. Spermacoceae are shown to be monophyletic. Clades supported by high posterior probabilities are discussed, especially in respect to the current generic classification. Notably, Oldenlandia is polyphyletic, the two subgenera of Kohautia are not sister taxa, and Hedyotis should be treated in a narrow sense to include only Asian species. PMID:18950720
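
    The partition metric named in this abstract is commonly computed as the Robinson-Foulds distance: the number of bipartitions (splits) found in one unrooted tree but not the other. The abstract does not describe the authors' implementation, so what follows is only a minimal sketch under that assumption. Trees are written as nested tuples of leaf names, and the function names and five-taxon example trees are hypothetical.

```python
def leaf_set(tree):
    """All leaf names under a node; a leaf is a plain string."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def splits(tree):
    """Non-trivial bipartitions of the leaves induced by the internal edges."""
    all_leaves = leaf_set(tree)
    found = set()

    def visit(node):
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(visit(child) for child in node))
        # Edges that separate a single leaf (or nothing) are uninformative.
        if 1 < len(below) < len(all_leaves) - 1:
            other = all_leaves - below
            # Store one canonical side so that rooting does not matter.
            found.add(min(below, other, key=sorted))
        return below

    visit(tree)
    return found


def partition_metric(tree_a, tree_b):
    """Number of splits present in exactly one of the two trees."""
    if leaf_set(tree_a) != leaf_set(tree_b):
        raise ValueError("trees must share the same leaf set")
    return len(splits(tree_a) ^ splits(tree_b))


# Hypothetical five-taxon trees.
reference = ((("A", "B"), "C"), ("D", "E"))
estimate = ((("A", "C"), "B"), ("D", "E"))
rerooted = (("A", "B"), ("C", ("D", "E")))
print(partition_metric(reference, estimate), partition_metric(reference, rerooted))
```

    Here the first pair of trees differs in one split each way, giving a distance of 2, while the rerooted tree is the same unrooted topology as the reference and gives 0. Lower values indicate that a single-region tree is closer to the tree taken as "true".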

  20. Accurate Transposable Element Annotation Is Vital When Analyzing New Genome Assemblies

    PubMed Central

    Platt, Roy N.; Blanco-Berdugo, Laura; Ray, David A.

    2016-01-01

    Transposable elements (TEs) are mobile genetic elements with the ability to replicate themselves throughout the host genome. In some taxa TEs reach copy numbers in hundreds of thousands and can occupy more than half of the genome. The increasing number of reference genomes from nonmodel species has begun to outpace efforts to identify and annotate TE content and methods that are used vary significantly between projects. Here, we demonstrate variation that arises in TE annotations when less than optimal methods are used. We found that across a variety of taxa, the ability to accurately identify TEs based solely on homology decreased as the phylogenetic distance between the queried genome and a reference increased. Next we annotated repeats using homology alone, as is often the case in new genome analyses, and a combination of homology and de novo methods as well as an additional manual curation step. Reannotation using these methods identified a substantial number of new TE subfamilies in previously characterized genomes, recognized a higher proportion of the genome as repetitive, and decreased the average genetic distance within TE families, implying recent TE accumulation. Finally, these findings (increased recognition of younger TEs) were confirmed via an analysis of the postman butterfly (Heliconius melpomene). These observations imply that complete TE annotation relies on a combination of homology and de novo–based repeat identification, manual curation, and classification and that relying on simple, homology-based methods is insufficient to accurately describe the TE landscape of a newly sequenced genome. PMID:26802115

  1. Endodontic classification.

    PubMed

    Morse, D R; Seltzer, S; Sinai, I; Biron, G

    1977-04-01

    Clinical and histopathologic findings are mixed in current endodontic classifications. A new system, based on symptomatology, may be more useful in clinical practice. The classifications are vital asymptomatic, hypersensitive dentin, inflamed-reversible, inflamed/degenerating without area-irreversible, inflamed/degenerating with area-irreversible, necrotic without area, and necrotic with area. PMID:265327

  2. Phylogenetic Approaches Toward Crocodylian History

    NASA Astrophysics Data System (ADS)

    Brochu, Christopher A.

    A review of crocodylian phylogeny reveals a more complex history than might have been anticipated from a direct reading of the fossil record without consideration of phylogenetic relationships. The three main extant crocodylian lineages (Gavialoidea, Alligatoroidea, Crocodyloidea) are known from fossils in the Late Cretaceous, and the group is found nearly worldwide during the Cenozoic. Some groups have distributions that are best explained by the crossing of marine barriers during the Tertiary. Early Tertiary crocodylian faunas are phylogenetically composite, and clades tend to be morphologically uniform and geographically widespread. Later in the Tertiary, Old World crocodylian faunas are more endemic. Crocodylian phylogeneticists face numerous challenges, the most important being the phylogenetic relationships and time of divergence of the two living gharials (Gavialis gangeticus and Tomistoma schlegelii), the relationships among living true crocodiles (Crocodylus), and the relationships among caimans.

  3. [Phylogenetic analysis of Pleurotus species].

    PubMed

    Shnyreva, A A; Shnyreva, A V

    2015-02-01

    We performed phylogenetic analysis for ten Pleurotus species, based on internal transcribed spacer (ITS) sequences of rDNA. A phylogenetic tree was constructed on the basis of 31 oyster fungi strains of different origin and 10 reference sequences from GenBank. Our analysis demonstrates that the tested Pleurotus species are of monophyletic origin. We evaluated the evolutionary distances between these species. Classic genetic analysis of sexual compatibility based on monocaryon (mon)-mon crosses showed no reproductive barriers within the P. cornucopiae-P. euosmus species complex. Thus, despite the divergence (subclustering) between commercial strains and natural isolates of P. ostreatus revealed by phylogenetic analysis, there is no reproductive isolation between these groups. A common allele of the matB locus was identified for the commercial strains Sommer and L/4, supporting the common origin of these strains. PMID:25966583

  4. Grading More Accurately

    ERIC Educational Resources Information Center

    Rom, Mark Carl

    2011-01-01

    Grades matter. College grading systems, however, are often ad hoc and prone to mistakes. This essay focuses on one factor that contributes to high-quality grading systems: grading accuracy (or "efficiency"). I proceed in several steps. First, I discuss the elements of "efficient" (i.e., accurate) grading. Next, I present analytical results…

  5. Two issues in archaeological phylogenetics: taxon construction and outgroup selection.

    PubMed

    O'Brien, Michael J; Lyman, R Lee; Saab, Youssef; Saab, Elias; Darwent, John; Glover, Daniel S

    2002-03-21

    Cladistics is widely used in biology and paleobiology to construct phylogenetic hypotheses, but rarely has it been applied outside those disciplines. There is, however, no reason to suppose that cladistics is not applicable to anything that evolves by cladogenesis and produces a nested hierarchy of taxa. This includes cultural phenomena such as languages and tools recovered from archaeological contexts. Two methodological issues assume primacy in attempts to extend cladistics to archaeological materials: the construction of analytical taxa and the selection of appropriate outgroups. In biology the species is the primary taxonomic unit used, irrespective of the debates that have arisen in phylogenetic theory over the nature of species. Also in biology the phylogenetic history of a group of taxa usually is well enough known that an appropriate taxon can be selected as an outgroup. No analytical unit parallel to the species exists in archaeology, and thus taxa have to be constructed specifically for phylogenetic analysis. One method of constructing taxa is paradigmatic classification, which defines classes (taxa) on the basis of co-occurring, unweighted character states. Once classes have been created, a form of occurrence seriation, an archaeological method based on the theory of cultural transmission and heritability, offers an objective basis for selecting an outgroup. PMID:12051970

  6. Interpreting the universal phylogenetic tree

    PubMed Central

    Woese, Carl R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist. PMID:10900003

  7. Interpreting the universal phylogenetic tree

    NASA Technical Reports Server (NTRS)

    Woese, C. R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist.

  8. Molecular phylogenetics and evolutionary history of ariid catfishes revisited: a comprehensive sampling

    PubMed Central

    Betancur-R, Ricardo

    2009-01-01

    Background Ariids or sea catfishes are one of the two otophysan fish families (out of about 67 families in four orders) that inhabit mainly marine and brackish waters (although some species occur strictly in fresh waters). The group includes over 150 species placed in ~29 genera and two subfamilies (Galeichthyinae and Ariinae). Despite their global distribution, ariids are largely restricted to the continental shelves due in part to their specialized reproductive behavior (i.e., oral incubation). Thus, among marine fishes, ariids offer an excellent opportunity for inferring historical biogeographic scenarios. Phylogenetic hypotheses available for ariids have focused on restricted geographic areas and comprehensive phylogenies are still missing. This study inferred phylogenetic hypotheses for 123 ariid species in 28 genera from different biogeographic provinces using both mitochondrial and nuclear sequences (up to ~4 kb). Results While the topologies obtained support the monophyly of basal groups, up to ten genera validated in previous morphological studies were incongruent with the molecular topologies. New World ariines were recovered as paraphyletic and Old World ariines were grouped into a well-supported clade that was further divided into subclades mainly restricted to major Gondwanan landmasses. A general area cladogram derived from the area cladograms of ariines and three other fish groups was largely congruent with the geological area cladogram of Gondwana. Nonetheless, molecular clock estimations provided variable results on the timing of ariine diversification (~105-41 mya). Conclusion This study provides the most comprehensive phylogeny of sea catfishes to date and highlights the need for re-assessment of their classification. While from a topological standpoint the evolutionary history of ariines is mostly congruent with vicariance associated with the sequence of events during Gondwanan fragmentation, ambiguous divergence time estimations hinders

  9. Factors That Affect Large Subunit Ribosomal DNA Amplicon Sequencing Studies of Fungal Communities: Classification Method, Primer Choice, and Error

    PubMed Central

    Porter, Teresita M.; Golding, G. Brian

    2012-01-01

    Nuclear large subunit ribosomal DNA is widely used in fungal phylogenetics and, to an increasing extent, also in amplicon-based environmental sequencing. The relatively short reads produced by next-generation sequencing, however, make primer choice and sequence error important variables for obtaining accurate taxonomic classifications. In this simulation study we tested the performance of three classification methods: 1) a similarity-based method (BLAST + Metagenomic Analyzer, MEGAN); 2) a composition-based method (Ribosomal Database Project naïve Bayesian classifier, NBC); and 3) a phylogeny-based method (Statistical Assignment Package, SAP). We also tested the effects of sequence length, primer choice, and sequence error on classification accuracy and perceived community composition. Using a leave-one-out cross-validation approach, results for classifications to the genus rank were as follows: BLAST + MEGAN had the lowest error rate and was particularly robust to sequence error; SAP accuracy was highest when long LSU query sequences were classified; and NBC ran significantly faster than the other tested methods. All methods performed poorly with the shortest 50–100 bp sequences. Increasing simulated sequence error reduced classification accuracy. Community shifts were detected due to sequence error and primer selection even though there was no change in the underlying community composition. Short read datasets from individual primers, as well as pooled datasets, appear to only approximate the true community composition. We hope this work informs investigators of some of the factors that affect the quality and interpretation of their environmental gene surveys. PMID:22558215

  10. Absolute Pitch in Boreal Chickadees and Humans: Exceptions that Test a Phylogenetic Rule

    ERIC Educational Resources Information Center

    Weisman, Ronald G.; Balkwill, Laura-Lee; Hoeschele, Marisa; Moscicki, Michele K.; Bloomfield, Laurie L.; Sturdy, Christopher B.

    2010-01-01

    This research examined generality of the phylogenetic rule that birds discriminate frequency ranges more accurately than mammals. Human absolute pitch chroma possessors accurately tracked transitions between frequency ranges. Independent tests showed that they used note naming (pitch chroma) to remap the tones into ranges; neither possessors nor…

  11. Phylogenetic Analysis of a Spontaneous Cocoa Bean Fermentation Metagenome Reveals New Insights into Its Bacterial and Fungal Community Diversity

    PubMed Central

    Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan

    2012-01-01

    This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques. PMID:22666442

  12. Molecular identification of hepatitis B virus genotypes/subgenotypes: Revised classification hurdles and updated resolutions

    PubMed Central

    Pourkarim, Mahmoud Reza; Amini-Bavil-Olyaee, Samad; Kurbanov, Fuat; Van Ranst, Marc; Tacke, Frank

    2014-01-01

    The clinical course of infections with the hepatitis B virus (HBV) substantially varies between individuals, as a consequence of a complex interplay between viral, host, environmental and other factors. Due to the high genetic variability of HBV, the virus can be categorized into different HBV genotypes and subgenotypes, which considerably differ with respect to geographical distribution, transmission routes, disease progression, responses to antiviral therapy or vaccination, and clinical outcome measures such as cirrhosis or hepatocellular carcinoma. However, HBV (sub)genotyping has caused some controversies in the past due to misclassifications and incorrect interpretations of different genotyping methods. Thus, an accurate, holistic and dynamic classification system is essential. In this review article, we aimed at highlighting potential pitfalls in genetic and phylogenetic analyses of HBV and suggest novel terms for HBV classification. Analyzing full-length genome sequences when classifying genotypes and subgenotypes is the foremost prerequisite of this classification system. Careful attention must be paid to all aspects of phylogenetic analysis, such as bootstrapping values and meeting the necessary thresholds for (sub)genotyping. Quasi-subgenotype refers to subgenotypes that were incorrectly suggested to be novel. As many of these strains were misclassified due to genetic differences resulting from recombination, we propose the term “recombino-subgenotype”. Moreover, immigration is an important confounding facet of global HBV distribution and substantially changes the geographic pattern of HBV (sub)genotypes. We therefore suggest the term “immigro-subgenotype” to distinguish exotic (sub)genotypes from native ones. We are strongly convinced that applying these two proposed terms in HBV classification will help harmonize this rapidly progressing field and allow for improved prophylaxis, diagnosis and treatment. PMID:24966586

  13. Phylogenetic placement of the ectomycorrhizal genus Cenococcum in Gloniaceae (Dothideomycetes).

    PubMed

    Spatafora, Joseph W; Owensby, C Alisha; Douhan, Greg W; Boehm, Eric W A; Schoch, Conrad L

    2012-01-01

    Cenococcum is a genus of ectomycorrhizal Ascomycota that has a broad host range and geographic distribution. It is not known to produce either meiotic or mitotic spores and is known to exist only in the form of hyphae, sclerotia and host-colonized ectomycorrhizal root tips. Due to its lack of sexual and asexual spores and reproductive structures, it has proven difficult to incorporate into traditional classification within Ascomycota. Molecular phylogenetic studies of ribosomal RNA placed Cenococcum in Dothideomycetes, but the definitive identification of closely related taxa remained elusive. Here we report a phylogenetic analysis of five nuclear loci (SSU, LSU, TEF1, RPB1, RPB2) of Dothideomycetes that placed Cenococcum as a close relative of the genus Glonium of Gloniaceae (Pleosporomycetidae incertae sedis) with strong statistical support. Glonium is a genus of saprobic Dothideomycetes that produces darkly pigmented, carbonaceous, hysteriate apothecia and is not known to be biotrophic. Evolution of ectomycorrhizae, Cenococcum and Dothideomycetes is discussed. PMID:22453119

  14. PoInTree: a polar and interactive phylogenetic tree.

    PubMed

    Carreras, Marco; Gianti, Eleonora; Sartori, Luca; Plyte, Simon Edward; Isacchi, Antonella; Bosotti, Roberta

    2005-02-01

    PoInTree (Polar and Interactive Tree) is an application that allows users to build, visualize and customize phylogenetic trees in a polar, interactive and highly flexible view. It takes as input a FASTA file or multiple alignment formats. Phylogenetic tree calculation is based on a sequence distance method and utilizes the Neighbor Joining (NJ) algorithm. It also allows precalculated trees of the major protein families, based on Pfam classification, to be displayed. In PoInTree, nodes can be dynamically opened and closed, and distances between genes are graphically represented. The tree root can be centered on a selected leaf. A text search mechanism, color-coding and label display are integrated. The visualizer can be connected to an Oracle database containing information on sequences and other biological data, helping to guide their interpretation within a given protein family across multiple species. The application is written in Borland Delphi and based on the VCL TeeChart Pro 6 graphical component (Steema Software). PMID:16144524

  15. On the nature of global classification

    NASA Technical Reports Server (NTRS)

    Wheelis, M. L.; Kandler, O.; Woese, C. R.

    1992-01-01

    Molecular sequencing technology has brought biology into the era of global (universal) classification. Methodologically and philosophically, global classification differs significantly from traditional, local classification. The need for uniformity requires that higher level taxa be defined on the molecular level in terms of universally homologous functions. A global classification should reflect both principal dimensions of the evolutionary process: genealogical relationship and quality and extent of divergence within a group. The ultimate purpose of a global classification is not simply information storage and retrieval; such a system should also function as an heuristic representation of the evolutionary paradigm that exerts a directing influence on the course of biology. The global system envisioned allows paraphyletic taxa. To retain maximal phylogenetic information in these cases, minor notational amendments in existing taxonomic conventions should be adopted.

  16. Accurate Finite Difference Algorithms

    NASA Technical Reports Server (NTRS)

    Goodrich, John W.

    1996-01-01

    Two families of finite difference algorithms for computational aeroacoustics are presented and compared. All of the algorithms are single-step explicit methods, they have the same order of accuracy in both space and time, with examples up to eleventh order, and they have multidimensional extensions. One of the algorithm families has spectral-like high resolution. Propagation with high-order and high-resolution algorithms can produce accurate results after O(10^6) periods of propagation with eight grid points per wavelength.

  17. Accurate monotone cubic interpolation

    NASA Technical Reports Server (NTRS)

    Huynh, Hung T.

    1991-01-01

    Monotone piecewise cubic interpolants are simple and effective. They are generally third-order accurate, except near strict local extrema where accuracy degenerates to second-order due to the monotonicity constraint. Algorithms for piecewise cubic interpolants, which preserve monotonicity as well as uniform third and fourth-order accuracy are presented. The gain of accuracy is obtained by relaxing the monotonicity constraint in a geometric framework in which the median function plays a crucial role.
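
    The abstract does not spell out how the median function enters the construction. As a hedged illustration only, the median of three numbers can be written in terms of the minmod function, which is a convenient way to clip a candidate value to the interval spanned by two bounds. The function names below are illustrative and are not taken from the paper.

```python
def minmod(a: float, b: float) -> float:
    """Return the argument of smaller magnitude if a and b share a sign, else 0."""
    if a * b <= 0.0:
        return 0.0
    return a if abs(a) < abs(b) else b


def median(x: float, y: float, z: float) -> float:
    """Median of three numbers, expressed through minmod."""
    return x + minmod(y - x, z - x)


# Clipping a candidate value to the interval spanned by two bounds:
# the result is the candidate itself if it lies between the bounds,
# otherwise the nearer bound.
clipped = median(0.0, 2.5, 1.0)
```

    Used this way, a limiter of the form median(lower, candidate, upper) leaves an admissible candidate unchanged and replaces an inadmissible one with the nearest bound.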

  18. Phylogenetic Relationships Among Lepidium Papilliferum

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Previous phylogenetic analyses of Lepidium included only a few accessions of L. montanum, L. flavum, and L. fremontii to represent western North American species. Two additional species endemic to southwest Idaho have posed both taxonomic and conservation questions regarding their species status. Le...

  19. Functional Basis of Microorganism Classification

    PubMed Central

    Zhu, Chengsheng; Delmont, Tom O.; Vogel, Timothy M.; Bromberg, Yana

    2015-01-01

    Correctly identifying nearest “neighbors” of a given microorganism is important in industrial and clinical applications where close relationships imply similar treatment. Microbial classification based on similarity of physiological and genetic organism traits (polyphasic similarity) is experimentally difficult and, arguably, subjective. Evolutionary relatedness, inferred from phylogenetic markers, facilitates classification but does not guarantee functional identity between members of the same taxon or lack of similarity between different taxa. Using over thirteen hundred sequenced bacterial genomes, we built a novel function-based microorganism classification scheme, functional-repertoire similarity-based organism network (FuSiON; flattened to fusion). Our scheme is phenetic, based on a network of quantitatively defined organism relationships across the known prokaryotic space. It correlates significantly with the current taxonomy, but the observed discrepancies reveal both (1) the inconsistency of functional diversity levels among different taxa and (2) an (unsurprising) bias towards prioritizing, for classification purposes, relatively minor traits of particular interest to humans. Our dynamic network-based organism classification is independent of the arbitrary pairwise organism similarity cut-offs traditionally applied to establish taxonomic identity. Instead, it reveals natural, functionally defined organism groupings and is thus robust in handling organism diversity. Additionally, fusion can use organism meta-data to highlight the specific environmental factors that drive microbial diversification. Our approach provides a complementary view to cladistic assignments and holds important clues for further exploration of microbial lifestyles. Fusion is a more practical fit for biomedical, industrial, and ecological applications, as many of these rely on understanding the functional capabilities of the microbes in their environment and are less concerned

  20. Cyber-infrastructure for Fusarium (CiF): Three integrated platforms supporting strain identification, phylogenetics, comparative genomics, and knowledge sharing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on ...

  1. Molecular identification and phylogenetic study of Demodex caprae.

    PubMed

    Zhao, Ya-E; Cheng, Juan; Hu, Li; Ma, Jun-Xian

    2014-10-01

    The DNA barcode has been widely used in species identification and phylogenetic analysis since 2003, but there have been no reports in Demodex. In this study, to obtain an appropriate DNA barcode for Demodex, molecular identification of Demodex caprae based on mitochondrial cox1 was conducted. Firstly, individual adults and eggs of D. caprae were obtained for genomic DNA (gDNA) extraction; secondly, a mitochondrial cox1 fragment was amplified, cloned, and sequenced; thirdly, cox1 fragments of D. caprae were aligned with those of other Demodex species retrieved from GenBank; finally, the intra- and interspecific divergences were computed and phylogenetic trees were reconstructed to analyze phylogenetic relationships in Demodex. Results obtained from seven 429-bp fragments of D. caprae showed that sequence identities were above 99.1% among three adults and four eggs. The intraspecific divergences in D. caprae, Demodex folliculorum, Demodex brevis, and Demodex canis were 0.0-0.9, 0.5-0.9, 0.0-0.2, and 0.0-0.5%, respectively, while the interspecific divergences between D. caprae and D. folliculorum, D. canis, and D. brevis were 20.3-20.9, 21.8-23.0, and 25.0-25.3%, respectively. The interspecific divergences were more than 10 times higher than the intraspecific ones, indicating a considerable barcoding gap. Furthermore, the phylogenetic trees showed that the four Demodex species clustered separately, representing independent species, and that Demodex folliculorum clustered with canine Demodex, D. caprae, and D. brevis in sequence. In conclusion, the selected 429-bp mitochondrial cox1 gene fragment is an appropriate DNA barcode for molecular classification, identification, and phylogenetic analysis of Demodex. D. caprae is an independent species and D. folliculorum is closer to D. canis than to D. caprae or D. brevis. PMID:25132566
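
    The intra- versus interspecific comparison described above can be sketched as follows. This is a minimal illustration that assumes pre-aligned sequences and uses the uncorrected p-distance; the abstract does not say which distance measure the authors used, and the sequences below are made-up toy data, not Demodex cox1 fragments.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or ambiguous base are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def divergences(seqs_by_species: dict) -> tuple:
    """Return (intraspecific, interspecific) lists of pairwise p-distances."""
    labelled = [(sp, s) for sp, group in seqs_by_species.items() for s in group]
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(labelled, 2):
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    return intra, inter


def has_barcoding_gap(intra: list, inter: list) -> bool:
    """True if every interspecific distance exceeds every intraspecific one."""
    return bool(intra) and bool(inter) and max(intra) < min(inter)


# Hypothetical toy alignment (two species, ten sites each).
toy = {
    "species_a": ["ACGTACGTAC", "ACGTACGTAT"],
    "species_b": ["TTGTACCTAC"],
}
intra, inter = divergences(toy)
gap = has_barcoding_gap(intra, inter)
```

    A barcoding gap in this sense simply means that the largest within-species distance is smaller than the smallest between-species distance.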

  2. Phylogenetic analysis of otospiralin protein

    PubMed Central

    Torktaz, Ibrahim; Behjati, Mohaddeseh; Rostami, Amin

    2016-01-01

    Background: The fibrocyte-specific protein otospiralin is a small protein, widely expressed in the central nervous system in neuronal cell bodies and glia. The increased expression of otospiralin in reactive astrocytes implicates it in signaling pathways and reparative mechanisms subsequent to injury. Indeed, otospiralin is considered to be essential for the survival of fibrocytes of the mesenchymal nonsensory regions of the cochlea. Other functions of this protein are not yet completely understood. Materials and Methods: Amino acid sequences of otospiralin from 12 vertebrates were obtained from the National Center for Biotechnology Information database. Phylogenetic analysis and phylogeny estimation were performed using the MEGA 5.0.5 program, and a neighbor-joining tree was constructed with this software. Results: In this computational study, the phylogenetic tree of otospiralin was investigated, and dendrograms of otospiralin were drawn. Alignment was performed with the MUSCLE method using the UPGMB algorithm. An entropy plot was also computed to better illustrate amino acid variation in this protein. Conclusion: In the present study, we used otospiralin sequences from 12 different species and, by constructing a phylogenetic tree, suggested an outgroup for some related species. PMID:27099854
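
    An entropy plot of the kind mentioned above is commonly built from the Shannon entropy of each alignment column. The sketch below is an assumption about the general technique, not the authors' procedure: it uses base-2 logarithms and ignores gap characters, and the four short sequences are invented for illustration.

```python
import math
from collections import Counter


def column_entropy(column: str) -> float:
    """Shannon entropy (bits) of the residues in one alignment column, gaps ignored."""
    residues = [c for c in column if c != "-"]
    if not residues:
        return 0.0
    n = len(residues)
    return sum((k / n) * math.log2(n / k) for k in Counter(residues).values())


def entropy_profile(alignment: list) -> list:
    """Per-column entropy for a list of equal-length aligned sequences."""
    if len({len(s) for s in alignment}) != 1:
        raise ValueError("all sequences must have the same aligned length")
    return [column_entropy("".join(col)) for col in zip(*alignment)]


# Hypothetical toy alignment: column 1 is conserved, columns 2 and 3 vary.
profile = entropy_profile(["MKV", "MRV", "MKL", "MRL"])
```

    Fully conserved columns score 0, and columns split evenly between two residues score 1 bit, so peaks in the profile mark the most variable positions.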

  3. Phylogenetic analysis of adenovirus sequences.

    PubMed

    Harrach, Balázs; Benko, Mária

    2007-01-01

    Members of the family Adenoviridae have been isolated from a large variety of hosts, including representatives from every major vertebrate class from fish to mammals. The high prevalence, together with the fairly conserved organization of the central part of their genomes, make the adenoviruses one of (if not the) best models for studying viral evolution on a larger time scale. Phylogenetic calculation can infer the evolutionary distance among adenovirus strains on serotype, species, and genus levels, thus helping the establishment of a correct taxonomy on the one hand, and speeding up the process of typing new isolates on the other. Initially, four major lineages corresponding to four genera were recognized. Later, the demarcation criteria of lower taxon levels, such as species or types, could also be defined with phylogenetic calculations. A limited number of possible host switches have been hypothesized and convincingly supported. Application of the web-based BLAST and MultAlin programs and the freely available PHYLIP package, along with the TreeView program, enables everyone to make correct calculations. In addition to step-by-step instruction on how to perform phylogenetic analysis, critical points where typical mistakes or misinterpretation of the results might occur will be identified and hints for their avoidance will be provided. PMID:17656792

  4. Accurate measurement of time

    NASA Astrophysics Data System (ADS)

    Itano, Wayne M.; Ramsey, Norman F.

    1993-07-01

    The paper discusses current methods for accurate measurements of time by conventional atomic clocks, with particular attention given to the principles of operation of atomic-beam frequency standards, atomic hydrogen masers, and atomic fountain and to the potential use of strings of trapped mercury ions as a time device more stable than conventional atomic clocks. The areas of application of the ultraprecise and ultrastable time-measuring devices that tax the capacity of modern atomic clocks include radio astronomy and tests of relativity. The paper also discusses practical applications of ultraprecise clocks, such as navigation of space vehicles and pinpointing the exact position of ships and other objects on earth using the GPS.

  5. Classification of spatially unresolved objects

    NASA Technical Reports Server (NTRS)

    Nalepka, R. F.; Horwitz, H. M.; Hyde, P. D.; Morgenstern, J. P.

    1972-01-01

    A proportion estimation technique for classification of multispectral scanner images is reported that uses data point averaging to extract and compute estimated proportions for a single average data point to classify spatial unresolved areas. Example extraction calculations of spectral signatures for bare soil, weeds, alfalfa, and barley prove quite accurate.

  6. Accurate quantum chemical calculations

    NASA Technical Reports Server (NTRS)

    Bauschlicher, Charles W., Jr.; Langhoff, Stephen R.; Taylor, Peter R.

    1989-01-01

    An important goal of quantum chemical calculations is to provide an understanding of chemical bonding and molecular electronic structure. A second goal, the prediction of energy differences to chemical accuracy, has been much harder to attain. First, the computational resources required to achieve such accuracy are very large, and second, it is not straightforward to demonstrate that an apparently accurate result, in terms of agreement with experiment, does not result from a cancellation of errors. Recent advances in electronic structure methodology, coupled with the power of vector supercomputers, have made it possible to solve a number of electronic structure problems exactly using the full configuration interaction (FCI) method within a subspace of the complete Hilbert space. These exact results can be used to benchmark approximate techniques that are applicable to a wider range of chemical and physical problems. The methodology of many-electron quantum chemistry is reviewed. Methods are considered in detail for performing FCI calculations. The application of FCI methods to several three-electron problems in molecular physics is discussed. A number of benchmark applications of FCI wave functions are described. Atomic basis sets and the development of improved methods for handling very large basis sets are discussed: these are then applied to a number of chemical and spectroscopic problems; to transition metals; and to problems involving potential energy surfaces. Although the experiences described give considerable grounds for optimism about the general ability to perform accurate calculations, there are several problems that have proved less tractable, at least with current computer resources, and these and possible solutions are discussed.

  7. Form classification

    NASA Astrophysics Data System (ADS)

    Reddy, K. V. Umamaheswara; Govindaraju, Venu

    2008-01-01

    The problem of form classification is to assign a single-page form image to one of a set of predefined form types or classes. We classify the form images using low level pixel density information from the binary images of the documents. In this paper, we solve the form classification problem with a classifier based on the k-means algorithm, supported by adaptive boosting. Our classification method is tested on the NIST scanned tax forms data bases (special forms databases 2 and 6) which include machine-typed and handwritten documents. Our method improves the performance over published results on the same databases, while still using a simple set of image features.

  8. Transforming phylogenetic networks: Moving beyond tree space.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

    2016-09-01

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. PMID:27224010
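
    For readers unfamiliar with NNI, the classical tree version of the operation can be sketched as below. This illustrates only the well-known operation on trees that the abstract refers to, not the generalized network operations defined in the paper. The representation (a pair of pairs standing for the four subtrees around one internal edge) is a simplifying assumption.

```python
def nni_neighbours(quartet: tuple) -> list:
    """Return the two NNI rearrangements around one internal edge.

    `quartet` is ((a, b), (c, d)): subtrees a and b hang off one end of the
    edge, c and d off the other. Each subtree may be a leaf label or any
    nested structure; it is moved as a unit and never inspected.
    """
    (a, b), (c, d) = quartet
    return [((a, c), (b, d)), ((a, d), (b, c))]


# Swapping subtrees across the internal edge of the quartet tree AB|CD
# yields the other two possible quartet topologies, AC|BD and AD|BC.
neighbours = nni_neighbours((("A", "B"), ("C", "D")))
```

    Because every internal edge of an unrooted binary tree offers exactly two such rearrangements, repeated NNI moves give a simple way to walk through tree space.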

  9. A Phylogenetic Re-Analysis of Groupers with Applications for Ciguatera Fish Poisoning

    PubMed Central

    Schoelinck, Charlotte; Hinsinger, Damien D.; Dettaï, Agnès; Cruaud, Corinne; Justine, Jean-Lou

    2014-01-01

    Background Ciguatera fish poisoning (CFP) is a significant public health problem due to dinoflagellates. It is responsible for one of the highest reported incidences of seafood-borne illness, and groupers are commonly reported as a source of CFP due to their position in the food chain. With the role of recent climate change in harmful algal blooms, CFP cases might become more frequent and more geographically widespread. Since there is no appropriate treatment for CFP, the most efficient solution is to regulate fish consumption. Such a strategy can only work if the fish sold are correctly identified, and it has been repeatedly shown that misidentifications and species substitutions occur in fish markets. Methods We provide here both a DNA-barcoding reference for groupers and a new phylogenetic reconstruction based on five genes and a comprehensive taxonomic sampling. We analyse the correlation between the geographic range of species and their susceptibility to ciguatera accumulation, and the co-occurrence of ciguatoxins in closely related species, using both character mapping and statistical methods. Results Misidentifications were encountered in public databases, precluding accurate species identifications. Epinephelinae now includes only twelve genera (vs. 15 previously). Comparisons with the ciguatera incidences show that in some genera most species are ciguateric, but statistical tests display only a moderate correlation with the phylogeny. Atlantic species were rarely contaminated, with ciguatera occurrences being restricted to the South Pacific. Conclusions The recent changes in classification based on the reanalyses of the relationships within Epinephelidae have an impact on the interpretation of the ciguatera distribution in the genera. In this context and to improve the monitoring of fish trade and safety, we need to obtain extensive data on contamination at the species level. Accurate species identifications through DNA barcoding are thus an essential tool in

  10. Learning classification trees

    NASA Technical Reports Server (NTRS)

    Buntine, Wray

    1991-01-01

    Algorithms for learning classification trees have had successes in artificial intelligence and statistics over many years. How a tree learning algorithm can be derived from Bayesian decision theory is outlined. This introduces Bayesian techniques for splitting, smoothing, and tree averaging. The splitting rule turns out to be similar to Quinlan's information gain splitting rule, while smoothing and averaging replace pruning. Comparative experiments with reimplementations of a minimum encoding approach, Quinlan's C4, and Breiman et al.'s CART show that the full Bayesian algorithm is consistently as good as, or more accurate than, these other approaches, though at a computational price.
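
    The abstract compares its Bayesian splitting rule with Quinlan's information gain rule. As a point of reference, the following is a minimal sketch of the standard information gain calculation only, not of the Bayesian rule the report derives. The function names and the toy data in the usage note are illustrative assumptions.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(labels, feature_values):
    """Entropy reduction obtained by splitting `labels` on a discrete feature.

    `labels` and `feature_values` are parallel sequences: one class label and
    one feature value per training example.
    """
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average entropy of the child nodes after the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder
```

    A tree learner of this kind would evaluate `information_gain` for every candidate feature at a node and split on the one with the largest value; for example, a feature that separates two equally frequent classes perfectly has a gain of one bit.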

  11. HIV classification using coalescent theory

    SciTech Connect

    Zhang, Ming; Leitner, Thomas K; Korber, Bette T

    2008-01-01

    Algorithms for subtype classification and breakpoint detection of HIV-1 sequences are based on a classification system of HIV-1. Hence, their quality depends heavily on this system. Owing to the way the current HIV-1 nomenclature was created, it contains inconsistencies such as the following: the phylogenetic distance between subtypes B and D is remarkably small compared with other pairs of subtypes, and is in fact more like the distance between a pair of sub-subtypes (Robertson et al., 2000); subtypes E and I no longer exist, since they were discovered to be composed of recombinants (Robertson et al., 2000); it is currently debated whether, instead of CRF02 being a recombinant of subtypes A and G, subtype G should be designated as a circulating recombinant form (CRF) and CRF02 as a subtype (Abecasis et al., 2007); and there are 8 complete and over 400 partial HIV genomes in the LANL database which belong neither to a subtype nor to a CRF (denoted by U). Moreover, the current classification system is somewhat arbitrary, like all complex classification systems that were created manually. It is therefore desirable to deduce the classification system of HIV systematically by an algorithm. Of course, this problem is not restricted to HIV, but applies to all fast-mutating and recombining viruses. Our work addresses the simpler subproblem of scoring classifications of given input sequences of some virus species (a classification denotes a partition of the input sequences into several subtypes and CRFs). To this end, we reconstruct ancestral recombination graphs (ARGs) of the input sequences under restrictions determined by the given classification. These restrictions are imposed in order to ensure that the reconstructed ARGs do not contradict the classification under consideration. Then, we find the ARG with maximal probability by means of Markov Chain Monte Carlo methods. The probability of the most probable ARG is interpreted as a score for the classification. To our

  12. Classifying Classification

    ERIC Educational Resources Information Center

    Novakowski, Janice

    2009-01-01

    This article describes the experience of a group of first-grade teachers as they tackled the science process of classification, a targeted learning objective for the first grade. While the two-year process was not easy and required teachers to teach in a new, more investigation-oriented way, the benefits were great. The project helped teachers and…

  13. Phylogenetic structure and host abundance drive disease pressure in communities.

    PubMed

    Parker, Ingrid M; Saunders, Megan; Bontrager, Megan; Weitz, Andrew P; Hendricks, Rebecca; Magarey, Roger; Suiter, Karl; Gilbert, Gregory S

    2015-04-23

    Pathogens play an important part in shaping the structure and dynamics of natural communities, because species are not affected by them equally. A shared goal of ecology and epidemiology is to predict when a species is most vulnerable to disease. A leading hypothesis asserts that the impact of disease should increase with host abundance, producing a 'rare-species advantage'. However, the impact of a pathogen may be decoupled from host abundance, because most pathogens infect more than one species, leading to pathogen spillover onto closely related species. Here we show that the phylogenetic and ecological structure of the surrounding community can be important predictors of disease pressure. We found that the amount of tissue lost to disease increased with the relative abundance of a species across a grassland plant community, and that this rare-species advantage had an additional phylogenetic component: disease pressure was stronger on species with many close relatives. We used a global model of pathogen sharing as a function of relatedness between hosts, which provided a robust predictor of relative disease pressure at the local scale. In our grassland, the total amount of disease was most accurately explained not by the abundance of the focal host alone, but by the abundance of all species in the community weighted by their phylogenetic distance to the host. Furthermore, the model strongly predicted observed disease pressure for 44 novel host species we introduced experimentally to our study site, providing evidence for a mechanism to explain why phylogenetically rare species are more likely to become invasive when introduced. Our results demonstrate how the phylogenetic and ecological structure of communities can have a key role in disease dynamics, with implications for the maintenance of biodiversity, biotic resistance against introduced weeds, and the success of managed plants in agriculture and forestry. PMID:25903634

  14. Phylogenetic placement of the Spirosomaceae

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Maloy, S.; Mandelco, L.; Raj, H. D.

    1990-01-01

    Comparative analysis of 16S rRNA sequences shows that the family Spirosomaceae belongs within the eubacterial phylum defined by the flavobacteria and bacteroides. Its constituent genera, Spirosoma, Flectobacillus, and Runella, form a monophyletic grouping therein. The phylogenetic assignment is based not only upon evolutionary distance analysis, but also upon sequence signatures and higher order structural synapomorphies in 16S rRNA. Another genus peripherally associated with the Spirosomaceae, Ancylobacter ("Microcyclus"), does not cluster with the flavobacteria and their relatives, but rather belongs to the alpha subdivision of the purple bacteria.

  15. Phylogenetic Analysis of Poliovirus Sequences.

    PubMed

    Jorba, Jaume

    2016-01-01

    Comparative genomic sequencing is a major surveillance tool in the Polio Laboratory Network. Due to the rapid evolution of polioviruses (~1% per year), pathways of virus transmission can be reconstructed from the pathways of genomic evolution. Here, we describe three main phylogenetic methods: estimation of genetic distances, reconstruction of a maximum-likelihood (ML) tree, and estimation of substitution rates using Bayesian Markov chain Monte Carlo (MCMC). The data set used consists of complete capsid sequences from a survey of poliovirus sequences available in GenBank. PMID:26983737
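
    The first of the three methods, estimation of genetic distances, can be illustrated with a minimal sketch. The abstract does not say which substitution model the chapter uses; the Jukes-Cantor (JC69) correction below is an assumption chosen because it is the simplest standard model, and the function names are illustrative.

```python
import math


def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned nucleotide sequences.

    Sites where either sequence has a gap or an ambiguity code are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [
        (x, y)
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x in "ACGT" and y in "ACGT"
    ]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor_distance(p):
    """JC69-corrected distance (expected substitutions per site) from p."""
    if p >= 0.75:
        raise ValueError("p-distance too large for the JC69 correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)
```

    The correction accounts for multiple substitutions at the same site, so the corrected distance is always at least as large as the raw proportion of differences.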

  16. Making Mosquito Taxonomy Useful: A Stable Classification of Tribe Aedini that Balances Utility with Current Knowledge of Evolutionary Relationships

    PubMed Central

    Wilkerson, Richard C.; Linton, Yvonne-Marie; Fonseca, Dina M.; Schultz, Ted R.; Price, Dana C.; Strickman, Daniel A.

    2015-01-01

    The tribe Aedini (Family Culicidae) contains approximately one-quarter of the known species of mosquitoes, including vectors of deadly or debilitating disease agents. This tribe contains the genus Aedes, which is one of the three most familiar genera of mosquitoes. During the past decade, Aedini has been the focus of a series of extensive morphology-based phylogenetic studies published by Reinert, Harbach, and Kitching (RH&K). Those authors created 74 new, elevated or resurrected genera from what had been the single genus Aedes, almost tripling the number of genera in the entire family Culicidae. The proposed classification is based on subjective assessments of the “number and nature of the characters that support the branches” subtending particular monophyletic groups in the results of cladistic analyses of a large set of morphological characters of representative species. To gauge the stability of RH&K’s generic groupings we reanalyzed their data with unweighted parsimony jackknife and maximum-parsimony analyses, with and without ordering 14 of the characters as in RH&K. We found that their phylogeny was largely weakly supported and their taxonomic rankings failed priority and other useful taxon-naming criteria. Consequently, we propose simplified aedine generic designations that 1) restore a classification system that is useful for the operational community; 2) enhance the ability of taxonomists to accurately place new species into genera; 3) maintain the progress toward a natural classification based on monophyletic groups of species; and 4) correct the current classification system that is subject to instability as new species are described and existing species more thoroughly defined. We do not challenge the phylogenetic hypotheses generated by the above-mentioned series of morphological studies. However, we reduce the ranks of the genera and subgenera of RH&K to subgenera or informal species groups, respectively, to preserve stability as new data

  17. Making Mosquito Taxonomy Useful: A Stable Classification of Tribe Aedini that Balances Utility with Current Knowledge of Evolutionary Relationships.

    PubMed

    Wilkerson, Richard C; Linton, Yvonne-Marie; Fonseca, Dina M; Schultz, Ted R; Price, Dana C; Strickman, Daniel A

    2015-01-01

    The tribe Aedini (Family Culicidae) contains approximately one-quarter of the known species of mosquitoes, including vectors of deadly or debilitating disease agents. This tribe contains the genus Aedes, which is one of the three most familiar genera of mosquitoes. During the past decade, Aedini has been the focus of a series of extensive morphology-based phylogenetic studies published by Reinert, Harbach, and Kitching (RH&K). Those authors created 74 new, elevated or resurrected genera from what had been the single genus Aedes, almost tripling the number of genera in the entire family Culicidae. The proposed classification is based on subjective assessments of the "number and nature of the characters that support the branches" subtending particular monophyletic groups in the results of cladistic analyses of a large set of morphological characters of representative species. To gauge the stability of RH&K's generic groupings we reanalyzed their data with unweighted parsimony jackknife and maximum-parsimony analyses, with and without ordering 14 of the characters as in RH&K. We found that their phylogeny was largely weakly supported and their taxonomic rankings failed priority and other useful taxon-naming criteria. Consequently, we propose simplified aedine generic designations that 1) restore a classification system that is useful for the operational community; 2) enhance the ability of taxonomists to accurately place new species into genera; 3) maintain the progress toward a natural classification based on monophyletic groups of species; and 4) correct the current classification system that is subject to instability as new species are described and existing species more thoroughly defined. We do not challenge the phylogenetic hypotheses generated by the above-mentioned series of morphological studies. However, we reduce the ranks of the genera and subgenera of RH&K to subgenera or informal species groups, respectively, to preserve stability as new data become

  18. Phylogenetic inference of Indian malaria vectors from multilocus DNA sequences.

    PubMed

    Dixit, Jyotsana; Srivastava, Hemlata; Sharma, Meenu; Das, Manoj K; Singh, O P; Raghavendra, K; Nanda, Nutan; Dash, Aditya P; Saksena, D N; Das, Aparup

    2010-08-01

    Inferences on the taxonomic positions, phylogenetic interrelationships and divergence times among closely related species of medical importance are essential to understand evolutionary patterns among species, on the basis of which disease control measures could be devised. In this respect, malaria is one of the important mosquito-borne diseases of tropical and sub-tropical parts of the globe. The taxonomic status of malaria vectors has so far been documented based on morphological, cytological and a few molecular genetic features. However, multilocus DNA sequences are still rarely used in phylogenetic inference. India contains one of the richest resources of mosquito species diversity, but little molecular taxonomic information is available on Indian malaria vectors. We utilized the whole genome sequence information of An. gambiae to amplify and sequence three orthologous nuclear genetic regions in six Indian malaria vector species (An. culicifacies, An. minimus, An. sundaicus, An. fluviatilis, An. annularis and An. stephensi). Further, we utilized the previously published DNA sequence information on the COII and ITS2 genes in all six species, bringing the total number of loci to five. A multilocus molecular phylogenetic study of Indian anophelines and An. gambiae was conducted at each individual genetic region using Neighbour Joining (NJ), Maximum Likelihood (ML), Maximum Parsimony (MP) and Bayesian approaches. Although tree topologies with the COII and ITS2 genes were similar, the other three genetic regions did not yield similar tree topologies. In general, the reconstructed phylogenetic status of Indian malaria vectors follows the pattern based on morphological and cytological classifications, which was reconfirmed with the COII and ITS2 genetic regions. Further, divergence times based on COII gene sequences were estimated among the seven Anopheles species, which corroborate the earlier hypothesis on the radiation of different species of the Anopheles

  19. Phycas: software for Bayesian phylogenetic analysis.

    PubMed

    Lewis, Paul O; Holder, Mark T; Swofford, David L

    2015-05-01

    Phycas is open source, freely available Bayesian phylogenetics software written primarily in C++ but with a Python interface. Phycas specializes in Bayesian model selection for nucleotide sequence data, particularly the estimation of marginal likelihoods, central to computing Bayes Factors. Marginal likelihoods can be estimated using newer methods (Thermodynamic Integration and Generalized Steppingstone) that are more accurate than the widely used Harmonic Mean estimator. In addition, Phycas supports two posterior predictive approaches to model selection: Gelfand-Ghosh and Conditional Predictive Ordinates. The General Time Reversible family of substitution models, as well as a codon model, are available, and data can be partitioned with all parameters unlinked except tree topology and edge lengths. Phycas provides for analyses in which the prior on tree topologies allows polytomous trees as well as fully resolved trees, and provides for several choices for edge length priors, including a hierarchical model as well as the recently described compound Dirichlet prior, which helps avoid overly informative induced priors on tree length. PMID:25577605

  20. Multipolar consensus for phylogenetic trees.

    PubMed

    Bonnard, Cécile; Berry, Vincent; Lartillot, Nicolas

    2006-10-01

    Collections of phylogenetic trees are usually summarized using consensus methods. These methods build a single tree, supposed to be representative of the collection. However, in the case of heterogeneous collections of trees, the resulting consensus may be poorly resolved (strict consensus, majority-rule consensus, ...), or may perform arbitrary choices among mutually incompatible clades, or splits (greedy consensus). Here, we propose an alternative method, which we call the multipolar consensus (MPC). Its aim is to display all the splits having a support above a predefined threshold, in a minimum number of consensus trees, or poles. We show that the problem is equivalent to a graph-coloring problem, and propose an implementation of the method. Finally, we apply the MPC to real data sets. Our results indicate that, typically, all the splits down to a weight of 10% can be displayed in no more than 4 trees. In addition, in some cases, biologically relevant secondary signals, which would not have been present in any of the classical consensus trees, are indeed captured by our method, indicating that the MPC provides a convenient exploratory method for phylogenetic analysis. The method was implemented in a package freely available at http://www.lirmm.fr/~cbonnard/MPC.html PMID:17060203
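
    The reduction to graph coloring described above can be sketched as follows: splits are vertices, two splits are joined by an edge when they are incompatible, and each color class is a set of pairwise compatible splits that can be displayed in one tree (a pole). This is a rough illustration only. It uses a greedy first-fit coloring, which does not guarantee the minimum number of poles and is not claimed to be the authors' algorithm; the function names and data layout are assumptions.

```python
def compatible(side_a, side_b, taxa):
    """Two splits are compatible if at least one of the four intersections
    of their sides is empty. Each split is given by one of its two sides."""
    a, a_rest = side_a, taxa - side_a
    b, b_rest = side_b, taxa - side_b
    return (
        not (a & b) or not (a & b_rest) or not (a_rest & b) or not (a_rest & b_rest)
    )


def greedy_poles(weighted_splits, taxa, threshold):
    """Assign every split with weight >= threshold to a pole (consensus tree).

    `weighted_splits` is a list of (side, weight) pairs, where `side` is a
    frozenset of taxa. Splits are placed in order of decreasing weight into
    the first pole whose splits they are all compatible with.
    """
    kept = sorted(
        (s for s in weighted_splits if s[1] >= threshold), key=lambda s: -s[1]
    )
    poles = []
    for side, weight in kept:
        for pole in poles:
            if all(compatible(side, other, taxa) for other, _ in pole):
                pole.append((side, weight))
                break
        else:
            # No existing pole can hold this split: open a new one.
            poles.append([(side, weight)])
    return poles
```

    With a threshold of 0.1, this corresponds to the setting in which the abstract reports that all splits down to a weight of 10% typically fit in no more than 4 trees.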

  1. Phylogenetic Origins of Brain Organisers

    PubMed Central

    Robertshaw, Ellen; Kiecker, Clemens

    2012-01-01

    The regionalisation of the nervous system begins early in embryogenesis, concomitant with the establishment of the anteroposterior (AP) and dorsoventral (DV) body axes. The molecular mechanisms that drive axis induction appear to be conserved throughout the animal kingdom and may be phylogenetically older than the emergence of bilateral symmetry. As a result of this process, groups of patterning genes that are equally well conserved are expressed at specific AP and DV coordinates of the embryo. In the emerging nervous system of vertebrate embryos, this initial pattern is refined by local signalling centres, secondary organisers, that regulate patterning, proliferation, and axonal pathfinding in adjacent neuroepithelium. The main secondary organisers for the AP neuraxis are the midbrain-hindbrain boundary, zona limitans intrathalamica, and anterior neural ridge and for the DV neuraxis the notochord, floor plate, and roof plate. A search for homologous secondary organisers in nonvertebrate lineages has led to controversy over their phylogenetic origins. Based on a recent study in hemichordates, it has been suggested that the AP secondary organisers evolved at the base of the deuterostome superphylum, earlier than previously thought. According to this view, the lack of signalling centres in some deuterostome lineages is likely to reflect a secondary loss due to adaptive processes. We propose that the relative evolutionary flexibility of secondary organisers has contributed to a broader morphological complexity of nervous systems in different clades. PMID:24278699

  2. Phylogenetic Conservatism in Plant Phenology

    NASA Technical Reports Server (NTRS)

    Davies, T. Jonathan; Wolkovich, Elizabeth M.; Kraft, Nathan J. B.; Salamin, Nicolas; Allen, Jenica M.; Ault, Toby R.; Betancourt, Julio L.; Bolmgren, Kjell; Cleland, Elsa E.; Cook, Benjamin I.; Crimmins, Theresa M.; Mazer, Susan J.; McCabe, Gregory J.; Pau, Stephanie; Regetz, Jim; Schwartz, Mark D.; Travers, Steven E.

    2013-01-01

    Phenological events (defined points in the life cycle of a plant or animal) have been regarded as highly plastic traits, reflecting flexible responses to various environmental cues. The ability of a species to track, via shifts in phenological events, the abiotic environment through time might dictate its vulnerability to future climate change. Understanding the predictors and drivers of phenological change is therefore critical. Here, we evaluated evidence for phylogenetic conservatism (the tendency for closely related species to share similar ecological and biological attributes) in phenological traits across flowering plants. We aggregated published and unpublished data on timing of first flower and first leaf, encompassing 4000 species at 23 sites across the Northern Hemisphere. We reconstructed the phylogeny for the set of included species, first, using the software program Phylomatic, and second, from DNA data. We then quantified phylogenetic conservatism in plant phenology within and across sites. We show that more closely related species tend to flower and leaf at similar times. By contrasting mean flowering times within and across sites, however, we illustrate that it is not the time of year that is conserved, but rather the phenological responses to a common set of abiotic cues. Our findings suggest that species cannot be treated as statistically independent when modelling phenological responses. Closely related species tend to resemble each other in the timing of their life-history events, a likely product of evolutionarily conserved responses to environmental cues. The search for the underlying drivers of phenology must therefore account for species' shared evolutionary histories.

  3. Neuromuscular disease classification system

    NASA Astrophysics Data System (ADS)

    Sáez, Aurora; Acha, Begoña; Montero-Sánchez, Adoración; Rivas, Eloy; Escudero, Luis M.; Serrano, Carmen

    2013-06-01

    Diagnosis of neuromuscular diseases is based on subjective visual assessment of patient biopsies by a specialist pathologist. A system for objective analysis and classification of muscular dystrophies and neurogenic atrophies through muscle biopsy images of fluorescence microscopy is presented. The procedure starts with an accurate segmentation of the muscle fibers using mathematical morphology and a watershed transform. A feature extraction step is carried out in two parts: 24 features that pathologists take into account to diagnose the diseases and 58 structural features that the human eye cannot see, based on the assumption that the biopsy is considered as a graph, where the nodes are represented by each fiber, and two nodes are connected if two fibers are adjacent. A feature selection using sequential forward selection and sequential backward selection methods, a classification using a Fuzzy ARTMAP neural network, and a study of grading the severity are performed on these two sets of features. A database consisting of 91 images was used: 71 images for the training step and 20 for testing. A classification error of 0% was obtained. It is concluded that the addition of features undetectable by human visual inspection improves the categorization of atrophic patterns.

  4. Reanalysis and Simulation Suggest a Phylogenetic Microarray Does Not Accurately Profile Microbial Communities

    PubMed Central

    Midgley, David J.; Greenfield, Paul; Shaw, Janet M.; Oytam, Yalchin; Li, Dongmei; Kerr, Caroline A.; Hendry, Philip

    2012-01-01

    The second generation (G2) PhyloChip is designed to detect over 8700 bacteria and archaea and has been used in over 50 publications and conference presentations. Many of those publications reveal that the PhyloChip measures of species richness greatly exceed statistical estimates of richness based on other methods. An examination of probes downloaded from Greengenes suggested that the system may have the potential to distort the observed community structure. This may be due to the sharing of probes by taxa; more than 21% of the taxa in that downloaded data have no unique probes. In-silico simulations using these data showed that a population of 64 taxa representing a typical anaerobic subterranean community returned 96 different taxa, including 15 families incorrectly called present and 19 families incorrectly called absent. A study of nasal and oropharyngeal microbial communities by Lemon et al (2010) found some 1325 taxa using the G2 PhyloChip; however, about 950 of these taxa have, in the downloaded data, no unique probes and cannot be definitively called present. Finally, data from Brodie et al (2007), when re-examined, indicate that the abundances of the majority of detected taxa are highly correlated with one another, suggesting that many probe sets do not act independently. Based on our analyses of downloaded data, we conclude that outputs from the G2 PhyloChip should be treated with some caution, and that the presence of taxa represented solely by non-unique probes be independently verified. PMID:22457798
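
    The probe-sharing check at the centre of this reanalysis can be sketched in a few lines: given a mapping from each taxon to its probe set, list the taxa that have no probe of their own. The data layout, the function name, and the probe identifiers in the usage note are hypothetical; the abstract does not describe the format of the downloaded Greengenes data.

```python
from collections import Counter


def taxa_without_unique_probes(probes_by_taxon):
    """Return the sorted taxa whose every probe is shared with another taxon.

    `probes_by_taxon` maps a taxon name to an iterable of probe identifiers.
    A taxon with no probes at all is also reported.
    """
    # Count, for each probe, how many taxa it is assigned to.
    usage = Counter(
        probe for probes in probes_by_taxon.values() for probe in set(probes)
    )
    return sorted(
        taxon
        for taxon, probes in probes_by_taxon.items()
        if not any(usage[probe] == 1 for probe in probes)
    )
```

    For a toy mapping such as `{"T1": {"p1", "p2"}, "T2": {"p2", "p3"}, "T3": {"p2"}}`, only `T3` lacks a unique probe, so a positive signal on its probes could equally come from `T1` or `T2`.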

  5. Virus classification in 60-dimensional protein space.

    PubMed

    Li, Yongkun; Tian, Kun; Yin, Changchuan; He, Rong Lucy; Yau, Stephen S-T

    2016-06-01

    Due to vast sequence divergence among different viral groups, sequence alignment is not directly applicable to genome-wide comparative analysis of viruses. More and more attention has been paid to alignment-free methods for whole genome comparison and phylogenetic tree reconstruction. Among alignment-free methods, the recently proposed "Natural Vector (NV) representation" has successfully been used to study the phylogeny of multi-segmented viruses based on a 12-dimensional genome space derived from the nucleotide sequence structure. But the preference of proteomes over genomes for the determination of viral phylogeny was not deeply investigated. As the translated products of genes, proteins directly form the shape of viral structure and are vital for all metabolic pathways. In this study, using the NV representation of a protein sequence along with the Hausdorff distance suitable to compare point sets, we construct a 60-dimensional protein space to analyze the evolutionary relationships of 4021 viruses by whole-proteomes in the current NCBI Reference Sequence Database (RefSeq). We also take advantage of the previously developed natural graphical representation to recover viral phylogeny. Our results demonstrate that the proposed method is efficient and accurate for classifying viruses. The accuracy rates of our predictions such as for Baltimore II viruses are as high as 95.9% for family labels, 95.7% for subfamily labels and 96.5% for genus labels. Finally, we discover that proteomes lead to better viral classification when reliable protein sequences are abundant. In other cases, the accuracy rates using proteomes are still comparable to that of genomes. PMID:26988414
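
    A minimal sketch of the two ingredients named above, a 60-dimensional vector per protein and the Hausdorff distance between point sets, may help. The abstract does not spell out the vector's components; the layout below (for each of the 20 amino acids: its count, its mean position, and a normalized second central moment of its positions) is an assumption based on the general natural vector idea, and the function names are illustrative.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def natural_vector(protein):
    """60-dimensional vector: 20 counts, 20 mean positions, 20 second moments.

    Positions are 1-based. The second central moment is divided by
    (count * sequence length), an assumed normalization.
    """
    n = len(protein)
    counts, means, moments = [], [], []
    for aa in AMINO_ACIDS:
        positions = [i + 1 for i, ch in enumerate(protein) if ch == aa]
        n_k = len(positions)
        if n_k == 0:
            counts.append(0.0)
            means.append(0.0)
            moments.append(0.0)
            continue
        mu = sum(positions) / n_k
        counts.append(float(n_k))
        means.append(mu)
        moments.append(sum((p - mu) ** 2 for p in positions) / (n_k * n))
    return counts + means + moments


def hausdorff_distance(points_a, points_b):
    """Hausdorff distance between two non-empty sets of equal-length vectors."""

    def directed(xs, ys):
        # Largest distance from a point in xs to its nearest point in ys.
        return max(min(math.dist(x, y) for y in ys) for x in xs)

    return max(directed(points_a, points_b), directed(points_b, points_a))
```

    Under this sketch, each virus would be represented by the set of natural vectors of its proteins, and two viruses would be compared by calling `hausdorff_distance` on their two sets.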

  6. New Phylogenetic Groups of Torque Teno Virus Identified in Eastern Taiwan Indigenes.

    PubMed

    Hsiao, Kuang-Liang; Wang, Li-Yu; Lin, Chiung-Ling; Liu, Hsin-Fu

    2016-01-01

    Torque teno virus (TTV) is a single-stranded DNA virus that is highly prevalent worldwide. It has been detected in eastern Taiwan indigenes at a low prevalence of 11% using the N22 region, which is known to greatly underestimate TTV prevalence. To clarify its actual epidemiology, we re-analyzed TTV prevalence using the UTR region. One hundred and forty serum samples from the eastern Taiwanese indigenous population were collected, and TTV DNA was detected in 133 (95%) samples. Direct sequencing revealed extensive mixed infection with different TTV strains within the infected individual. The entire TTV open reading frame 1 was amplified and cloned from a TTV-positive individual to distinguish the mix-infected strains. Phylogenetic analysis showed that eleven isolates clustered into a monophyletic group distinct from all known groups. In addition, another of our isolates clustered with the recently described Hebei-1 strain and formed an independent clade. Based on the distribution pattern of pairwise distances, both new clusters were placed at the phylogenetic group level and designated as the 6th and 7th phylogenetic groups. In the present study, we showed a very high prevalence of TTV infection in eastern Taiwan indigenes and identified new phylogenetic groups from the infected individual. Both intra- and inter-phylogenetic group mixed infections can be found in one healthy person. Our study has further broadened the field of human TTVs and proposed a robust criterion for classification of the major TTV phylogenetic groups. PMID:26901643

  7. New Phylogenetic Groups of Torque Teno Virus Identified in Eastern Taiwan Indigenes

    PubMed Central

    Hsiao, Kuang-Liang; Wang, Li-Yu; Lin, Chiung-Ling; Liu, Hsin-Fu

    2016-01-01

    Torque teno virus (TTV) is a single-stranded DNA virus that is highly prevalent worldwide. It has been detected in eastern Taiwan indigenes at a low prevalence of 11% using the N22 region, which is known to greatly underestimate TTV prevalence. To clarify its actual epidemiology, we re-analyzed TTV prevalence using the UTR region. One hundred and forty serum samples from the eastern Taiwanese indigenous population were collected, and TTV DNA was detected in 133 (95%) samples. Direct sequencing revealed extensive mixed infection with different TTV strains within the infected individual. The entire TTV open reading frame 1 was amplified and cloned from a TTV-positive individual to distinguish the mix-infected strains. Phylogenetic analysis showed that eleven isolates clustered into a monophyletic group distinct from all known groups. In addition, another of our isolates clustered with the recently described Hebei-1 strain and formed an independent clade. Based on the distribution pattern of pairwise distances, both new clusters were placed at the phylogenetic group level and designated as the 6th and 7th phylogenetic groups. In the present study, we showed a very high prevalence of TTV infection in eastern Taiwan indigenes and identified new phylogenetic groups from the infected individual. Both intra- and inter-phylogenetic group mixed infections can be found in one healthy person. Our study has further broadened the field of human TTVs and proposed a robust criterion for classification of the major TTV phylogenetic groups. PMID:26901643

  8. Phylogenetic analysis of the spirochetes.

    PubMed Central

    Paster, B J; Dewhirst, F E; Weisburg, W G; Tordoff, L A; Fraser, G J; Hespell, R B; Stanton, T B; Zablen, L; Mandelco, L; Woese, C R

    1991-01-01

    The 16S rRNA sequences were determined for species of Spirochaeta, Treponema, Borrelia, Leptospira, Leptonema, and Serpula, using a modified Sanger method of direct RNA sequencing. Analysis of aligned 16S rRNA sequences indicated that the spirochetes form a coherent taxon composed of six major clusters or groups. The first group, termed the treponemes, was divided into two subgroups. The first treponeme subgroup consisted of Treponema pallidum, Treponema phagedenis, Treponema denticola, a thermophilic spirochete strain, and two species of Spirochaeta, Spirochaeta zuelzerae and Spirochaeta stenostrepta, with an average interspecies similarity of 89.9%. The second treponeme subgroup contained Treponema bryantii, Treponema pectinovorum, Treponema saccharophilum, Treponema succinifaciens, and rumen strain CA, with an average interspecies similarity of 86.2%. The average interspecies similarity between the two treponeme subgroups was 84.2%. The division of the treponemes into two subgroups was verified by single-base signature analysis. The second spirochete group contained Spirochaeta aurantia, Spirochaeta halophila, Spirochaeta bajacaliforniensis, Spirochaeta litoralis, and Spirochaeta isovalerica, with an average similarity of 87.4%. The Spirochaeta group was related to the treponeme group, with an average similarity of 81.9%. The third spirochete group contained borrelias, including Borrelia burgdorferi, Borrelia anserina, Borrelia hermsii, and a rabbit tick strain. The borrelias formed a tight phylogenetic cluster, with an average similarity of 97%. The borrelia group shared a common branch with the Spirochaeta group and was closer to this group than to the treponemes. A single spirochete strain isolated from the shrew constituted the fourth group. The fifth group was composed of strains of Serpula (Treponema) hyodysenteriae and Serpula (Treponema) innocens. The two species of this group were closely related, with a similarity of greater than 99%. Leptonema illini

  9. Multisensor classification of sedimentary rocks

    NASA Technical Reports Server (NTRS)

    Evans, Diane

    1988-01-01

    A comparison is made between linear discriminant analysis and supervised classification results based on signatures from the Landsat TM, the Thermal Infrared Multispectral Scanner (TIMS), and airborne SAR, alone and combined into extended spectral signatures for seven sedimentary rock units exposed on the margin of the Wind River Basin, Wyoming. Results from a linear discriminant analysis showed that training-area classification accuracies based on the multisensor data were improved an average of 15 percent over TM alone, 24 percent over TIMS alone, and 46 percent over SAR alone, with similar improvement resulting when supervised multisensor classification maps were compared to supervised, individual sensor classification maps. When training area signatures were used to map spectrally similar materials in an adjacent area, the average classification accuracy improved 19 percent using the multisensor data over TM alone, 2 percent over TIMS alone, and 11 percent over SAR alone. It is concluded that certain sedimentary lithologies may be accurately mapped using a single sensor, but classification of a variety of rock types can be improved using multisensor data sets that are sensitive to different characteristics such as mineralogy and surface roughness.

  10. Phylogenetic organization of bacterial activity

    PubMed Central

    Morrissey, Ember M; Mau, Rebecca L; Schwartz, Egbert; Caporaso, J Gregory; Dijkstra, Paul; van Gestel, Natasja; Koch, Benjamin J; Liu, Cindy M; Hayer, Michaela; McHugh, Theresa A; Marks, Jane C; Price, Lance B; Hungate, Bruce A

    2016-01-01

    Phylogeny is an ecologically meaningful way to classify plants and animals, as closely related taxa frequently have similar ecological characteristics, functional traits and effects on ecosystem processes. For bacteria, however, phylogeny has been argued to be an unreliable indicator of an organism's ecology owing to evolutionary processes more common to microbes such as gene loss and lateral gene transfer, as well as convergent evolution. Here we use advanced stable isotope probing with 13C and 18O to show that evolutionary history has ecological significance for in situ bacterial activity. Phylogenetic organization in the activity of bacteria sets the stage for characterizing the functional attributes of bacterial taxonomic groups. Connecting identity with function in this way will allow scientists to begin building a mechanistic understanding of how bacterial community composition regulates critical ecosystem functions. PMID:26943624

  11. Phylogenetic organization of bacterial activity.

    PubMed

    Morrissey, Ember M; Mau, Rebecca L; Schwartz, Egbert; Caporaso, J Gregory; Dijkstra, Paul; van Gestel, Natasja; Koch, Benjamin J; Liu, Cindy M; Hayer, Michaela; McHugh, Theresa A; Marks, Jane C; Price, Lance B; Hungate, Bruce A

    2016-09-01

    Phylogeny is an ecologically meaningful way to classify plants and animals, as closely related taxa frequently have similar ecological characteristics, functional traits and effects on ecosystem processes. For bacteria, however, phylogeny has been argued to be an unreliable indicator of an organism's ecology owing to evolutionary processes more common to microbes such as gene loss and lateral gene transfer, as well as convergent evolution. Here we use advanced stable isotope probing with (13)C and (18)O to show that evolutionary history has ecological significance for in situ bacterial activity. Phylogenetic organization in the activity of bacteria sets the stage for characterizing the functional attributes of bacterial taxonomic groups. Connecting identity with function in this way will allow scientists to begin building a mechanistic understanding of how bacterial community composition regulates critical ecosystem functions. PMID:26943624

  12. Accurate and efficient reconstruction of deep phylogenies from structured RNAs

    PubMed Central

    Stocsits, Roman R.; Letsch, Harald; Hertel, Jana; Misof, Bernhard; Stadler, Peter F.

    2009-01-01

    Ribosomal RNA (rRNA) genes are probably the most frequently used data source in phylogenetic reconstruction. Individual columns of rRNA alignments are not independent as a consequence of their highly conserved secondary structures. Unless explicitly taken into account, these correlations can distort the phylogenetic signal and/or lead to gross overestimates of tree stability. Maximum likelihood and Bayesian approaches are of course amenable to using RNA-specific substitution models that treat conserved base pairs appropriately, but require accurate secondary structure models as input. So far, however, no accurate and easy-to-use tool has been available for computing structure-aware alignments and consensus structures that can deal with the large rRNAs. The RNAsalsa approach is designed to fill this gap. Capitalizing on the improved accuracy of pairwise consensus structures and informed by a priori knowledge of group-specific structural constraints, the tool provides both alignments and consensus structures that are of sufficient accuracy for routine phylogenetic analysis based on RNA-specific substitution models. The power of the approach is demonstrated using two rRNA data sets: a mitochondrial rRNA set of 26 Mammalia, and a collection of 28S nuclear rRNAs representative of the five major echinoderm groups. PMID:19723687

  13. Accurate and efficient reconstruction of deep phylogenies from structured RNAs.

    PubMed

    Stocsits, Roman R; Letsch, Harald; Hertel, Jana; Misof, Bernhard; Stadler, Peter F

    2009-10-01

    Ribosomal RNA (rRNA) genes are probably the most frequently used data source in phylogenetic reconstruction. Individual columns of rRNA alignments are not independent as a consequence of their highly conserved secondary structures. Unless explicitly taken into account, these correlations can distort the phylogenetic signal and/or lead to gross overestimates of tree stability. Maximum likelihood and Bayesian approaches are of course amenable to using RNA-specific substitution models that treat conserved base pairs appropriately, but require accurate secondary structure models as input. So far, however, no accurate and easy-to-use tool has been available for computing structure-aware alignments and consensus structures that can deal with the large rRNAs. The RNAsalsa approach is designed to fill this gap. Capitalizing on the improved accuracy of pairwise consensus structures and informed by a priori knowledge of group-specific structural constraints, the tool provides both alignments and consensus structures that are of sufficient accuracy for routine phylogenetic analysis based on RNA-specific substitution models. The power of the approach is demonstrated using two rRNA data sets: a mitochondrial rRNA set of 26 Mammalia, and a collection of 28S nuclear rRNAs representative of the five major echinoderm groups. PMID:19723687

  14. Phylogenetic signal dissection identifies the root of starfishes.

    PubMed

    Feuda, Roberto; Smith, Andrew B

    2015-01-01

    Relationships within the class Asteroidea have remained controversial for almost 100 years and, despite many attempts to resolve this problem using molecular data, no consensus has yet emerged. Using two nuclear genes and a taxon sampling covering the major asteroid clades we show that non-phylogenetic signal created by three factors--Long Branch Attraction, compositional heterogeneity and the use of poorly fitting models of evolution--have confounded accurate estimation of phylogenetic relationships. To overcome the effect of this non-phylogenetic signal we analyse the data using non-homogeneous models, site stripping and the creation of subpartitions aimed to reduce or amplify the systematic error, and calculate Bayes Factor support for a selection of previously suggested topological arrangements of asteroid orders. We show that most of the previous alternative hypotheses are not supported in the most reliable data partitions, including the previously suggested placement of either Forcipulatida or Paxillosida as sister group to the other major branches. The best-supported solution places Velatida as the sister group to other asteroids, and the implications of this finding for the morphological evolution of asteroids are presented. PMID:25955729

  15. Progress, pitfalls and parallel universes: a history of insect phylogenetics.

    PubMed

    Kjer, Karl M; Simon, Chris; Yavorskaya, Margarita; Beutel, Rolf G

    2016-08-01

    The phylogeny of insects has been both extensively studied and vigorously debated for over a century. A relatively accurate deep phylogeny had been produced by 1904. It was not substantially improved in topology until recently when phylogenomics settled many long-standing controversies. Intervening advances came instead through methodological improvement. Early molecular phylogenetic studies (1985-2005), dominated by a few genes, provided datasets that were too small to resolve controversial phylogenetic problems. Adding to the lack of consensus, this period was characterized by a polarization of philosophies, with individuals belonging to either parsimony or maximum-likelihood camps; each largely ignoring the insights of the other. The result was an unfortunate detour in which the few perceived phylogenetic revolutions published by both sides of the philosophical divide were probably erroneous. The size of datasets has been growing exponentially since the mid-1980s accompanied by a wave of confidence that all relationships will soon be known. However, large datasets create new challenges, and a large number of genes does not guarantee reliable results. If history is a guide, then the quality of conclusions will be determined by an improved understanding of both molecular and morphological evolution, and not simply the number of genes analysed. PMID:27558853

  16. Progress, pitfalls and parallel universes: a history of insect phylogenetics

    PubMed Central

    Kjer, Karl M.; Simon, Chris; Yavorskaya, Margarita; Beutel, Rolf G.

    2016-01-01

    The phylogeny of insects has been both extensively studied and vigorously debated for over a century. A relatively accurate deep phylogeny had been produced by 1904. It was not substantially improved in topology until recently when phylogenomics settled many long-standing controversies. Intervening advances came instead through methodological improvement. Early molecular phylogenetic studies (1985–2005), dominated by a few genes, provided datasets that were too small to resolve controversial phylogenetic problems. Adding to the lack of consensus, this period was characterized by a polarization of philosophies, with individuals belonging to either parsimony or maximum-likelihood camps; each largely ignoring the insights of the other. The result was an unfortunate detour in which the few perceived phylogenetic revolutions published by both sides of the philosophical divide were probably erroneous. The size of datasets has been growing exponentially since the mid-1980s accompanied by a wave of confidence that all relationships will soon be known. However, large datasets create new challenges, and a large number of genes does not guarantee reliable results. If history is a guide, then the quality of conclusions will be determined by an improved understanding of both molecular and morphological evolution, and not simply the number of genes analysed. PMID:27558853

  17. Phylogenetic Signal Dissection Identifies the Root of Starfishes

    PubMed Central

    Feuda, Roberto; Smith, Andrew B.

    2015-01-01

    Relationships within the class Asteroidea have remained controversial for almost 100 years and, despite many attempts to resolve this problem using molecular data, no consensus has yet emerged. Using two nuclear genes and a taxon sampling covering the major asteroid clades we show that non-phylogenetic signal created by three factors – Long Branch Attraction, compositional heterogeneity and the use of poorly fitting models of evolution – have confounded accurate estimation of phylogenetic relationships. To overcome the effect of this non-phylogenetic signal we analyse the data using non-homogeneous models, site stripping and the creation of subpartitions aimed to reduce or amplify the systematic error, and calculate Bayes Factor support for a selection of previously suggested topological arrangements of asteroid orders. We show that most of the previous alternative hypotheses are not supported in the most reliable data partitions, including the previously suggested placement of either Forcipulatida or Paxillosida as sister group to the other major branches. The best-supported solution places Velatida as the sister group to other asteroids, and the implications of this finding for the morphological evolution of asteroids are presented. PMID:25955729

  18. LABEL: Fast and Accurate Lineage Assignment with Assessment of H5N1 and H9N2 Influenza A Hemagglutinins

    PubMed Central

    Shepard, Samuel S.; Davis, C. Todd; Bahl, Justin; Rivailler, Pierre; York, Ian A.; Donis, Ruben O.

    2014-01-01

    The evolutionary classification of influenza genes into lineages is a first step in understanding their molecular epidemiology and can inform the subsequent implementation of control measures. We introduce a novel approach called Lineage Assignment By Extended Learning (LABEL) to rapidly determine cladistic information for any number of genes without the need for time-consuming sequence alignment, phylogenetic tree construction, or manual annotation. Instead, LABEL relies on hidden Markov model profiles and support vector machine training to hierarchically classify gene sequences by their similarity to pre-defined lineages. We assessed LABEL by analyzing the annotated hemagglutinin genes of highly pathogenic (H5N1) and low pathogenicity (H9N2) avian influenza A viruses. Using the WHO/FAO/OIE H5N1 evolution working group nomenclature, the LABEL pipeline quickly and accurately identified the H5 lineages of uncharacterized sequences. Moreover, we developed an updated clade nomenclature for the H9 hemagglutinin gene and show a similarly fast and reliable phylogenetic assessment with LABEL. While this study was focused on hemagglutinin sequences, LABEL could be applied to the analysis of any gene and shows great potential to guide molecular epidemiology activities, accelerate database annotation, and provide a data sorting tool for other large-scale bioinformatic studies. PMID:24466291

  19. Demonstrating Biological Classification Using a Simulation of Natural Taxa.

    ERIC Educational Resources Information Center

    Vogt, Kenneth D.

    1995-01-01

    A review of introductory college level and high school biology texts reveals that concepts and theories behind classification are usually poorly discussed. Suggests ways in which card games can be used to teach differences between the phenetic and phylogenetic approaches. (LZ)

  20. Propionibacterium acnes Types I and II Represent Phylogenetically Distinct Groups

    PubMed Central

    McDowell, Andrew; Valanne, Susanna; Ramage, Gordon; Tunney, Michael M.; Glenn, Josephine V.; McLorinan, Gregory C.; Bhatia, Ajay; Maisonneuve, Jean-Francois; Lodes, Michael; Persing, David H.; Patrick, Sheila

    2005-01-01

    Although two phenotypes of the opportunistic pathogen Propionibacterium acnes (types I and II) have been described, epidemiological investigations of their roles in different infections have not been widely reported. Using immunofluorescence microscopy with monoclonal antibodies (MAbs) QUBPa1 and QUBPa2, specific for types I and II, respectively, we investigated the prevalences of the two types among 132 P. acnes isolates. Analysis of isolates from failed prosthetic hip implants (n = 40) revealed approximately equal numbers of type I and II organisms. Isolates from failed prosthetic hip-associated bone (n = 6) and tissue (n = 38) samples, as well as isolates from acne (n = 22), dental infections (n = 8), and skin removed during surgical incision (n = 18) were predominately of type I. A total of 11 (8%) isolates showed atypical MAb labeling and could not be conclusively identified. Phylogenetic analysis of P. acnes by nucleotide sequencing revealed the 16S rRNA gene to be highly conserved between types I and II. In contrast, sequence analysis of recA and a putative hemolysin gene (tly) revealed significantly greater type-specific polymorphisms that corresponded to phylogenetically distinct cluster groups. All 11 isolates with atypical MAb labeling were identified as type I by sequencing. Within the recA and tly phylogenetic trees, nine of these isolates formed a cluster distinct from other type I organisms, suggesting a further phylogenetic subdivision within type I. Our study therefore demonstrates that the phenotypic differences between P. acnes types I and II reflect deeper differences in their phylogeny. Furthermore, nucleotide sequencing provides an accurate method for identifying the type status of P. acnes isolates. PMID:15634990

  1. Classification in Australia.

    ERIC Educational Resources Information Center

    McKinlay, John

    Despite some inroads by the Library of Congress Classification and short-lived experimentation with Universal Decimal Classification and Bliss Classification, Dewey Decimal Classification, with its ability in recent editions to be hospitable to local needs, remains the most widely used classification system in Australia. Although supplemented at…

  2. Use of whole genome sequences to develop a molecular phylogenetic framework for Rhodococcus fascians and the Rhodococcus genus

    PubMed Central

    Creason, Allison L.; Davis, Edward W.; Putnam, Melodie L.; Vandeputte, Olivier M.; Chang, Jeff H.

    2014-01-01

    The accurate diagnosis of diseases caused by pathogenic bacteria requires a stable species classification. Rhodococcus fascians is the only documented member of its ill-defined genus that is capable of causing disease on a wide range of agriculturally important plants. Comparisons of genome sequences generated from isolates of Rhodococcus associated with diseased plants revealed a level of genetic diversity consistent with them representing multiple species. To test this, we generated a tree based on more than 1700 homologous sequences from plant-associated isolates of Rhodococcus, and obtained support from additional approaches that measure and cluster based on genome similarities. Results were consistent in supporting the definition of new Rhodococcus species within clades containing phytopathogenic members. We also used the genome sequences, along with other rhodococcal genome sequences to construct a molecular phylogenetic tree as a framework for resolving the Rhodococcus genus. Results indicated that Rhodococcus has the potential for having 20 species and also confirmed a need to revisit the taxonomic groupings within Rhodococcus. PMID:25237311

  3. Molecular and Morphological Analyses Reveal Phylogenetic Relationships of Stingrays Focusing on the Family Dasyatidae (Myliobatiformes)

    PubMed Central

    Lim, Kean Chong; Lim, Phaik-Eem; Chong, Ving Ching; Loh, Kar-Hoe

    2015-01-01

    Elucidating the phylogenetic relationships of the current but problematic Dasyatidae (Order Myliobatiformes) was the first priority of the current study. Here, we studied three molecular gene markers of 43 species (COI gene), 33 species (ND2 gene) and 34 species (RAG1 gene) of stingrays to draft out the phylogenetic tree of the order. Nine character states were identified and used to confirm the molecularly constructed phylogenetic trees. Eight or more clades (at different hierarchical levels) were identified for COI, ND2 and RAG1 genes in the Myliobatiformes including four clades containing members of the present Dasyatidae, thus rendering the latter non-monophyletic. The uncorrected p-distance between these four ‘Dasyatidae’ clades when compared to the distance between formally known families confirmed that these four clades should be elevated to four separate families. We suggest a revision of the present classification, retaining the Dasyatidae (Dasyatis and Taeniurops species) but adding three new families namely, Neotrygonidae (Neotrygon and Taeniura species), Himanturidae (Himantura species) and Pastinachidae (Pastinachus species). Our results indicated the need to further review the classification of Dasyatis microps. By resolving the non-monophyletic problem, the suite of nine character states enables the natural classification of the Myliobatiformes into at least thirteen families based on morphology. PMID:25867639
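
    The uncorrected p-distance mentioned in this abstract is the proportion of compared alignment sites at which two sequences differ. The following is a minimal sketch of that standard definition, not the authors' pipeline; the function name `p_distance`, the set of characters treated as missing data, and the toy sequences are assumptions made for illustration.

```python
def p_distance(seq_a: str, seq_b: str, missing: str = "-?N") -> float:
    """Return the uncorrected p-distance between two aligned sequences.

    Sites where either sequence has a gap or missing-data character
    (an assumed set: '-', '?', 'N') are skipped, i.e. pairwise deletion.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0   # sites where both sequences have a usable base
    differing = 0  # compared sites where the bases disagree
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in missing or b in missing:
            continue
        compared += 1
        if a != b:
            differing += 1

    if compared == 0:
        raise ValueError("no comparable sites between the two sequences")
    return differing / compared


# Hypothetical toy alignment: one mismatch over eight compared sites.
print(p_distance("ACGTACGT", "ACGTTCGT"))
```

    No substitution model is applied here, which is what "uncorrected" refers to; multiple substitutions at the same site are therefore not accounted for.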

  4. Molecular phylogenetics and character evolution of morphologically diverse groups, Dendrobium section Dendrobium and allies

    PubMed Central

    Takamiya, Tomoko; Wongsawad, Pheravut; Sathapattayanon, Apirada; Tajima, Natsuko; Suzuki, Shunichiro; Kitamura, Saki; Shioda, Nao; Handa, Takashi; Kitanaka, Susumu; Iijima, Hiroshi; Yukawa, Tomohisa

    2014-01-01

    It is always difficult to construct coherent classification systems for plant lineages having diverse morphological characters. The genus Dendrobium, one of the largest genera in the Orchidaceae, includes ∼1100 species, and enormous morphological diversification has hindered the establishment of consistent classification systems covering all major groups of this genus. Given the particular importance of species in Dendrobium section Dendrobium and allied groups as floriculture and crude drug genetic resources, there is an urgent need to establish a stable classification system. To clarify phylogenetic relationships in Dendrobium section Dendrobium and allied groups, we analysed the macromolecular characters of the group. Phylogenetic analyses of 210 taxa of Dendrobium were conducted on DNA sequences of internal transcribed spacer (ITS) regions of 18S–26S nuclear ribosomal DNA and the maturase-coding gene (matK) located in an intron of the plastid gene trnK using maximum parsimony and Bayesian methods. The parsimony and Bayesian analyses revealed 13 distinct clades in the group comprising section Dendrobium and its allied groups. Results also showed paraphyly or polyphyly of sections Amblyanthus, Aporum, Breviflores, Calcarifera, Crumenata, Dendrobium, Densiflora, Distichophyllae, Dolichocentrum, Holochrysa, Oxyglossum and Pedilonum. On the other hand, the monophyly of section Stachyobium was well supported. It was found that many of the morphological characters that have been believed to reflect phylogenetic relationships are, in fact, the result of convergence. As such, many of the sections that have been recognized up to this point were found to not be monophyletic, so recircumscription of sections is required. PMID:25107672

  5. Synopsis of Trichosanthes (Cucurbitaceae) based on recent molecular phylogenetic data

    PubMed Central

    de Boer, Hugo J.; Thulin, Mats

    2012-01-01

    The snake gourd genus, Trichosanthes, is the largest genus in the Cucurbitaceae family, with over 90 species. Recent molecular phylogenetic data have indicated that the genus Gymnopetalum is to be merged with Trichosanthes to maintain monophyly. A revised infrageneric classification of Trichosanthes including Gymnopetalum is proposed with two subgenera, (I) subg. Scotanthus comb. nov. and (II) subg. Trichosanthes, eleven sections, (i) sect. Asterospermae, (ii) sect. Cucumeroides, (iii) sect. Edulis, (iv) sect. Foliobracteola, (v) sect. Gymnopetalum, (vi) sect. Involucraria, (vii) sect. Pseudovariifera sect. nov., (viii) sect. Villosae stat. nov., (ix) sect. Trichosanthes, (x) sect. Tripodanthera, and (xi) sect. Truncata. A synopsis of Trichosanthes with the 91 species recognized here is presented, including four new combinations, Trichosanthes orientalis, Trichosanthes tubiflora, Trichosanthes scabra var. pectinata, Trichosanthes scabra var. penicaudii, and a clarified nomenclature of Trichosanthes costata and Trichosanthes scabra. PMID:22645411

  6. Phylogenetic mapping of bacterial morphology

    NASA Technical Reports Server (NTRS)

    Siefert, J. L.; Fox, G. E.

    1998-01-01

    The availability of a meaningful molecular phylogeny for bacteria provides a context for examining the historical significance of various developments in bacterial evolution. Herein, the classical morphological descriptions of selected members of the domain Bacteria are mapped upon the genealogical ancestry deduced from comparison of small-subunit rRNA sequences. For the species examined in this study, a distinct pattern emerges which indicates that the coccus shape has arisen and accumulated independently multiple times in separate lineages and typically survived as a persistent end-state morphology. At least two other morphologies persist but have evolved only once. This study demonstrates that although bacterial morphology is not useful in defining bacterial phylogeny, it is remarkably consistent with that phylogeny once it is known. An examination of the experimental evidence available for morphogenesis as well as microbial fossil evidence corroborates these findings. It is proposed that the accumulation of persistent morphologies is a result of the biophysical properties of peptidoglycan and their genetic control, and that an evolved body-plan strategy based on peptidoglycan may have been a fate-sealing step in the evolution of Bacteria. More generally, this study illustrates that significant evolutionary insights can be obtained by examining biological and biochemical data in the context of a reliable phylogenetic structure.

  7. Phylogenetic development of myelin glycosphingolipids.

    PubMed

    Kishimoto, Y

    1986-12-15

    Myelin is a highly specialized membrane, which enwraps axons and facilitates saltatory nerve conduction in vertebrates. Galactocerebroside and its sulfate ester, sulfatide, are highly localized in myelin. To understand the role played by these galactosphingolipids we investigated the changes of these myelin-specific compounds during the course of the evolution of myelin. We found that urodele nerve myelin lacks alpha-hydroxy fatty acid-containing galactosphingolipids. Our morphological and physiological studies of urodele nerves indicated that these hydroxy fatty acid-containing galactosphingolipids probably contribute to fast nerve conduction. Also it is suspected that they are involved in the regulation of the thickness of myelin in relation to the size of the axon. In another study, we discovered that glucocerebroside, which has glucose instead of galactose as its carbohydrate component, is abundantly present in the myelin-like sheath membrane of crustacean nerves. Subsequently, the phylogenetic study indicated that galactocerebrosides were limited to the nervous system of deuterostomes, while all protostome nerves contain glucocerebrosides. The role of glucocerebrosides in multilayered membranes and in the conduction velocity of the protostome nervous system is discussed. PMID:3549016

  8. Classifying the bacterial gut microbiota of termites and cockroaches: A curated phylogenetic reference database (DictDb).

    PubMed

    Mikaelyan, Aram; Köhler, Tim; Lampert, Niclas; Rohland, Jeffrey; Boga, Hamadi; Meuser, Katja; Brune, Andreas

    2015-10-01

    Recent developments in sequencing technology have given rise to a large number of studies that assess bacterial diversity and community structure in termite and cockroach guts based on large amplicon libraries of 16S rRNA genes. Although these studies have revealed important ecological and evolutionary patterns in the gut microbiota, classification of the short sequence reads is limited by the taxonomic depth and resolution of the reference databases used in the respective studies. Here, we present a curated reference database for accurate taxonomic analysis of the bacterial gut microbiota of dictyopteran insects. The Dictyopteran gut microbiota reference Database (DictDb) is based on the Silva database but was significantly expanded by the addition of clones from 11 mostly unexplored termite and cockroach groups, which increased the inventory of bacterial sequences from dictyopteran guts by 26%. The taxonomic depth and resolution of DictDb was significantly improved by a general revision of the taxonomic guide tree for all important lineages, including a detailed phylogenetic analysis of the Treponema and Alistipes complexes, the Fibrobacteres, and the TG3 phylum. The performance of this first documented version of DictDb (v. 3.0) using the revised taxonomic guide tree in the classification of short-read libraries obtained from termites and cockroaches was highly superior to that of the current Silva and RDP databases. DictDb uses an informative nomenclature that is consistent with the literature also for clades of uncultured bacteria and provides an invaluable tool for anyone exploring the gut community structure of termites and cockroaches. PMID:26283320

  9. Arhynchobdellida (Annelida: Oligochaeta: Hirudinida): phylogenetic relationships and evolution.

    PubMed

    Borda, Elizabeth; Siddall, Mark E

    2004-01-01

    A remarkable diversity of life history strategies, geographic distributions, and morphological characters provide a rich substrate for investigating the evolutionary relationships of arhynchobdellid leeches. The phylogenetic relationships, using parsimony analysis, of the order Arhynchobdellida were investigated using nuclear 18S and 28S rDNA, mitochondrial 12S rDNA, and cytochrome c oxidase subunit I sequence data, as well as 24 morphological characters. Thirty-nine arhynchobdellid species were selected to represent the seven currently recognized families. Sixteen rhynchobdellid leeches from the families Glossiphoniidae and Piscicolidae were included as outgroup taxa. Analysis of all available data resolved a single most-parsimonious tree. The cladogram conflicted with most of the traditional classification schemes of the Arhynchobdellida. Monophyly of the Erpobdelliformes and Hirudiniformes was supported, whereas the families Haemadipsidae, Haemopidae, and Hirudinidae, as well as the genera Hirudo or Aliolimnatis, were found not to be monophyletic. The results provide insight on the phylogenetic positions for the taxonomically problematic families Americobdellidae and Cylicobdellidae, the genera Semiscolex, Patagoniobdella, and Mesobdella, as well as genera traditionally classified under Hirudinidae. The evolution of dietary and habitat preferences is examined. PMID:15022771

  10. Phylogenetic Relationships of American Willows (Salix L., Salicaceae)

    PubMed Central

    Lauron-Moreau, Aurélien; Pitre, Frédéric E.; Argus, George W.; Labrecque, Michel; Brouillet, Luc

    2015-01-01

    Salix L. is the largest genus in the family Salicaceae (450 species). Several classifications have been published, but taxonomic subdivision has been under continuous revision. Our goal is to establish the phylogenetic structure of the genus using molecular data on all American willows, using three DNA markers. This complete phylogeny of American willows allows us to propose a biogeographic framework for the evolution of the genus. Material was obtained for the 122 native and introduced willow species of America. Sequences were obtained from the ITS (ribosomal nuclear DNA) and two plastid regions, matK and rbcL. Phylogenetic analyses (parsimony, maximum likelihood, Bayesian inference) were performed on the data. Geographic distribution was mapped onto the tree. The species tree provides strong support for a division of the genus into two subgenera, Salix and Vetrix. Subgenus Salix comprises temperate species from the Americas and Asia, and their disjunction may result from Tertiary events. Subgenus Vetrix is composed of boreo-arctic species of the Northern Hemisphere and their radiation may coincide with the Quaternary glaciations. Sixteen species have ambiguous positions; genetic diversity is lower in subg. Vetrix. A molecular phylogeny of all species of American willows has been inferred. It needs to be tested and further resolved using other molecular data. Nonetheless, the genus clearly has two clades that have distinct biogeographic patterns. PMID:25880993

  11. Phylogenetic placement of Hydra and relationships within Aplanulata (Cnidaria: Hydrozoa).

    PubMed

    Nawrocki, Annalise M; Collins, Allen G; Hirano, Yayoi M; Schuchert, Peter; Cartwright, Paulyn

    2013-04-01

    The model organism Hydra belongs to the hydrozoan clade Aplanulata. Despite being a popular model system for development, little is known about the phylogenetic placement of this taxon or the relationships of its closest relatives. Previous studies have been conflicting regarding sister group relationships and have been unable to resolve deep nodes within the clade. In addition, there are several putative Aplanulata taxa that have never been sampled for molecular data or analyzed using multiple markers. Here, we combine the fast-evolving cytochrome oxidase 1 (CO1) mitochondrial marker with mitochondrial 16S, nuclear small ribosomal subunit (18S, SSU) and large ribosomal subunit (28S, LSU) sequences to examine relationships within the clade Aplanulata. We further discuss the relative contribution of four different molecular markers to resolving phylogenetic relationships within Aplanulata. Lastly, we report morphological synapomorphies for some of the major Aplanulata genera and families, and suggest new taxonomic classifications for two species of Aplanulata, Fukaurahydra anthoformis and Corymorpha intermedia, based on a preponderance of molecular and morphological data that justify the designation of these species to different genera. PMID:23280366

  12. The space of ultrametric phylogenetic trees.

    PubMed

    Gavryushkin, Alex; Drummond, Alexei J

    2016-08-21

    The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. In particular, the summary tree minimising the squared distance to the trees in the sample might differ between parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. PMID:27188249

  13. Remote Sensing Information Classification

    NASA Technical Reports Server (NTRS)

    Rickman, Douglas L.

    2008-01-01

    This viewgraph presentation reviews the classification of Remote Sensing data in relation to epidemiology. Classification is a way to reduce the dimensionality and precision to something a human can understand. Classification changes SCALAR data into NOMINAL data.

  14. Classification and knowledge

    NASA Technical Reports Server (NTRS)

    Kurtz, Michael J.

    1989-01-01

    Automated procedures to classify objects are discussed. The classification problem is reviewed, and the relation of epistemology and classification is considered. The classification of stellar spectra and of resolved images of galaxies is addressed.

  15. Molecular systematics of the Amazonian genus Aldina, a phylogenetically enigmatic ectomycorrhizal lineage of papilionoid legumes.

    PubMed

    Ramos, Gustavo; de Lima, Haroldo Cavalcante; Prenner, Gerhard; de Queiroz, Luciano Paganucci; Zartman, Charles E; Cardoso, Domingos

    2016-04-01

    Aldina (Leguminosae) is among the very few ecologically successful ectomycorrhizal lineages in a family largely marked by the evolution of nodulating symbiosis. The genus comprises 20 species predominantly distributed in Amazonia and has been traditionally classified in the tribe Swartzieae because of its radial flowers with an entire calyx and numerous free stamens. The taxonomy of Aldina is complicated due to its poor representation in herbaria and the lack of a robust phylogenetic hypothesis of relationship. Recent phylogenetic analyses of matK and trnL sequences confirmed the placement of Aldina in the 50-kb inversion clade, although the genus remained phylogenetically isolated or unresolved in the context of the evolutionary history of the main early-branching papilionoid lineages. We performed maximum likelihood and Bayesian analyses of combined chloroplast datasets (matK, rbcL, and trnL) and explored the effect of incomplete taxa or missing data in order to shed light on the enigmatic phylogenetic position of Aldina. Unexpectedly, a sister relationship of Aldina with the Andira clade (Andira and Hymenolobium) is revealed. We suggest that a new tribal phylogenetic classification of the papilionoid legumes should place Aldina along with Andira and Hymenolobium. These results highlight yet another example of the independent evolution of radial floral symmetry within the early-branching Papilionoideae, a large collection of florally heterogeneous lineages dominated by papilionate or bilaterally symmetric flower morphology. PMID:26748266

  16. Soil classifications systems review. Final report

    SciTech Connect

    1997-11-01

    Systems used to classify soils are discussed and compared. Major types of classification systems that are reviewed include natural systems, technical systems, the FAO/UNESCO world soil map, soil survey map units, and numerical taxonomy. Natural Classification systems discussed in detail are the United States system, Soil Taxonomy, and the Russian and Canadian systems. Included in the section on technical classification systems are reviews on the AASHO and Unified (ASTM) classification systems. The review of soil classification systems was conducted to establish improved availability of accurate ground thermal conductivity and other heat transfer related properties information. These data are intended to help in the design of closed-loop ground heat exchange systems.

  17. On Tree-Based Phylogenetic Networks.

    PubMed

    Zhang, Louxin

    2016-07-01

    A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model. PMID:27228397

  18. Molecular Phylogenetics: Mathematical Framework and Unsolved Problems

    NASA Astrophysics Data System (ADS)

    Xia, Xuhua

    Phylogenetic relationship is essential in dating evolutionary events, reconstructing ancestral genes, predicting sites that are important to natural selection, and, ultimately, understanding genomic evolution. Three categories of phylogenetic methods are currently used: the distance-based, the maximum parsimony, and the maximum likelihood method. Here, I present the mathematical framework of these methods and their rationales, provide computational details for each of them, illustrate analytically and numerically the potential biases inherent in these methods, and outline computational challenges and unresolved problems. This is followed by a brief discussion of the Bayesian approach that has been recently used in molecular phylogenetics.

  19. Ebolavirus classification based on natural vectors.

    PubMed

    Zheng, Hui; Yin, Changchuan; Hoang, Tung; He, Rong Lucy; Yang, Jie; Yau, Stephen S-T

    2015-06-01

    According to the WHO, ebolaviruses have resulted in 8818 human deaths in West Africa as of January 2015. To better understand the evolutionary relationship of the ebolaviruses and infer virulence from the relationship, we applied the alignment-free natural vector method to classify the newest ebolaviruses. The dataset includes three new Guinea viruses as well as 99 viruses from Sierra Leone. For the viruses of the family of Filoviridae, both genus label classification and species label classification achieve an accuracy rate of 100%. We represented the relationships among Filoviridae viruses by Unweighted Pair Group Method with Arithmetic Mean (UPGMA) phylogenetic trees and found that the filoviruses can be separated well by three genera. We performed the phylogenetic analysis on the relationship among different species of Ebolavirus by their coding-complete genomes and seven viral protein genes (glycoprotein [GP], nucleoprotein [NP], VP24, VP30, VP35, VP40, and RNA polymerase [L]). The topology of the phylogenetic tree by the viral protein VP24 shows consistency with the variations of virulence of ebolaviruses. The result suggests that VP24 be a pharmaceutical target for treating or preventing ebolaviruses. PMID:25803489
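
    As a rough illustration of the alignment-free idea in this abstract, the sketch below computes a 12-dimensional natural vector (for each nucleotide: its count, its mean position, and a normalized second central moment of its positions) and compares two sequences by Euclidean distance. It assumes the commonly cited form of the natural vector; the function names, the 1-based positions, and the convention of using zeros for an absent nucleotide are choices made here for illustration, not details taken from the paper.

```python
import math

NUCLEOTIDES = "ACGT"


def natural_vector(seq):
    """Return [n_A..n_T, mu_A..mu_T, D2_A..D2_T] for a DNA sequence.

    Assumed definition: positions are 1-based, mu_k is the mean position of
    nucleotide k, and D2_k = sum((pos - mu_k)^2) / (n_k * n).
    """
    seq = seq.upper()
    n = len(seq)
    counts, means, moments = [], [], []
    for base in NUCLEOTIDES:
        positions = [i + 1 for i, ch in enumerate(seq) if ch == base]
        n_k = len(positions)
        if n_k == 0:
            # Illustrative convention: an absent nucleotide contributes zeros.
            counts.append(0.0)
            means.append(0.0)
            moments.append(0.0)
            continue
        mu_k = sum(positions) / n_k
        d2_k = sum((p - mu_k) ** 2 for p in positions) / (n_k * n)
        counts.append(float(n_k))
        means.append(mu_k)
        moments.append(d2_k)
    return counts + means + moments


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
```

    A matrix of such pairwise distances is the kind of input a UPGMA clustering step would take; that step is not shown here.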

  20. Simple and fast classification of non-LTR retrotransposons based on phylogeny of their RT domain protein sequences

    PubMed Central

    Kapitonov, Vladimir V.; Tempel, Sébastien; Jurka, Jerzy

    2009-01-01

    The rapidly growing number of sequenced genomes requires fast and accurate computational tools for the analysis of different transposable elements (TEs). In this paper we focus on a rapid and reliable procedure for the classification of autonomous non-LTR retrotransposons based on alignment and clustering of their reverse transcriptase (RT) domains. Typically, the RT domain protein sequences encoded by different non-LTR retrotransposons are similar to each other in terms of significant BLASTP E-values. Therefore, they can be easily detected by routine BLASTP searches of genomic DNA sequences coding for proteins similar to the RT domains of known non-LTR retrotransposons. However, detailed classification of non-LTR retrotransposons, i.e. their assignment to specific clades, is a slow and complex procedure that is not formalized or integrated as a standard set of computational methods and data. Here we describe a tool (RTclass1) designed for the fast and accurate automated assignment of novel non-LTR retrotransposons to known or novel clades using phylogenetic analysis of the RT domain protein sequences. RTclass1 classifies a particular non-LTR retrotransposon based on its RT domain in less than 10 minutes on a standard desktop computer and achieves 99.5% accuracy. RTclass1 works either as a standalone program installed locally or as a web server that can be accessed remotely by uploading sequence data through the internet (http://www.girinst.org/RTphylogeny/RTclass1). PMID:19651192

  1. Discriminating the effects of phylogenetic hypothesis, tree resolution and clade age estimates on phylogenetic signal measurements.

    PubMed

    Seger, G D S; Duarte, L D S; Debastiani, V J; Kindel, A; Jarenkow, J A

    2013-09-01

    Knowing how species traits evolved over time is central to understanding the assembly rules that govern the phylogenetic structure of communities. The measurement of phylogenetic signal (PS) in ecologically relevant traits is a first step toward understanding phylogenetically structured community patterns. The different methods available to estimate PS make it difficult to choose which is most appropriate. Furthermore, alternative phylogenetic tree hypotheses, node resolution and clade age estimates might influence PS measurements. In this study, we evaluated to what extent these parameters affect different methods of PS analysis, and discuss advantages and disadvantages when selecting which method to use. We measured fruit/seed traits and flowering/fruiting phenology of endozoochoric species occurring in Southern Brazilian Araucaria forests and evaluated their PS using Mantel regressions, phylogenetic eigenvector regressions (PVR) and K statistic. Mantel regressions always gave less significant results compared to PVR and K statistic in all combinations of phylogenetic trees constructed. Moreover, a better phylogenetic resolution affected PS, independently of the method used to estimate it. Morphological seed traits tended to show higher PS than diaspore traits, while PS in flowering/fruiting phenology depended mostly on the method used to estimate it. This study demonstrates that different PS estimates are obtained depending on the chosen method and the phylogenetic tree resolution. This finding has implications for inferences on phylogenetic niche conservatism or ecological processes determining phylogenetic community structure. PMID:23368095

  2. Phylogenetic relationships among subsurface microorganisms

    SciTech Connect

    Nierzwicki-Bauer, S.A.

    1991-01-01

    This report summarizes the progress made from 6/90 to 3/91 toward completion of our project, Phylogenetic Relationships among subsurface microorganisms. 16S rRNA was sequenced, and based on the sequence the SMCC isolates were assigned to preliminary groups. Microorganisms were obtained at various depths at the Savannah River Site, including the Surface (0 m), Congaree (91 m), and Middendorf (244 m, 259 m, 265 m). Sequence data from four isolates from the Congaree formation indicate these microorganisms can be divided into Pseudomonas spp. or Acinetobacter spp. Three 16S rRNA probes were synthesized based on sequence data. The synthesized probes were tested through in situ hybridization. Optimal conditions for in situ hybridization were determined. Because stability of RNA-DNA hybrids is dependent on hybridization stringency, related organisms can be differentiated using a single probe under different stringencies. The results of these hybridizations agree with results obtained by Balkwill and Reeves using restriction fragment length polymorphism analysis. The RNA content of a cell reflects its metabolic state. Cells which are starved for four days are not detectable with the homologous 16S rRNA probe. However, within 15 minutes of refeeding, detectable rRNA appeared. This suggests that organisms which are undetectable in environmental samples due to starvation may be detectable after addition of nutrients. Stepwise addition of specific nutrients could indicate which nutrients are rate limiting for growth. Preliminary experiments with soil samples from the Hanford Site indicate indigenous microorganisms can be detected by oligonucleotide probes. Further, using multiple probes based on universal sequences increases the number of organisms detected. Double label experiments, using a rhodamine-labelled oligonucleotide probe with free coumarin succinimidyl ester will allow simultaneous detection of total bacteria and specific 16S rRNA containing bacteria. 4 tabs. (MHB)

  3. NNLOPS accurate associated HW production

    NASA Astrophysics Data System (ADS)

    Astill, William; Bizon, Wojciech; Re, Emanuele; Zanderighi, Giulia

    2016-06-01

    We present a next-to-next-to-leading order accurate description of associated HW production consistently matched to a parton shower. The method is based on reweighting events obtained with the HW plus one jet NLO accurate calculation implemented in POWHEG, extended with the MiNLO procedure, to reproduce NNLO accurate Born distributions. Since the Born kinematics is more complex than the cases treated before, we use a parametrization of the Collins-Soper angles to reduce the number of variables required for the reweighting. We present phenomenological results at 13 TeV, with cuts suggested by the Higgs Cross section Working Group.

  4. How to accurately bypass damage

    PubMed Central

    Broyde, Suse; Patel, Dinshaw J.

    2016-01-01

    Ultraviolet radiation can cause cancer through DNA damage — specifically, by linking adjacent thymine bases. Crystal structures show how the enzyme DNA polymerase η accurately bypasses such lesions, offering protection. PMID:20577203

  5. Phylogenetics and the origin of species

    PubMed Central

    Avise, John C.; Wollenberg, Kurt

    1997-01-01

    A recent criticism that the biological species concept (BSC) unduly neglects phylogeny is examined under a novel modification of coalescent theory that considers multiple, sex-defined genealogical pathways through sexual organismal pedigrees. A competing phylogenetic species concept (PSC) also is evaluated from this vantage. Two analytical approaches are employed to capture the composite phylogenetic information contained within the braided assemblages of hereditary pathways of a pedigree: (i) consensus phylogenetic trees across allelic transmission routes and (ii) composite phenograms from quantitative values of organismal coancestry. Outcomes from both approaches demonstrate that the supposed sharp distinction between biological and phylogenetic species concepts is illusory. Historical descent and reproductive ties are related aspects of phylogeny and jointly illuminate biotic discontinuity. PMID:9223259

  6. Accurate Evaluation of Quantum Integrals

    NASA Technical Reports Server (NTRS)

    Galant, David C.; Goorvitch, D.

    1994-01-01

    Combining an appropriate finite difference method with Richardson's extrapolation results in a simple, highly accurate numerical method for solving Schrödinger's equation. Important results are that error estimates are provided, and that one can extrapolate expectation values rather than the wavefunctions to obtain highly accurate expectation values. We discuss the eigenvalues, the error growth in repeated Richardson's extrapolation, and show that the expectation values calculated on a crude mesh can be extrapolated to obtain expectation values of high accuracy.
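
    The idea of extrapolating from crude meshes can be sketched on a toy case that is not taken from the report: a particle in a unit box with hbar = m = 1, discretized with the standard second-order central difference. For that discretization the lowest eigenvalue of the finite-difference matrix has a closed form, its error shrinks as h^2, and one Richardson step on meshes h and h/2 cancels the h^2 term. The function names and mesh sizes below are illustrative assumptions.

```python
import math


def fd_ground_energy(n_intervals):
    """Lowest eigenvalue of -(1/2) d^2/dx^2 on [0, 1] with zero boundary
    values, discretized by central differences on n_intervals equal steps.

    Uses the known eigenvalues 2 - 2*cos(k*pi/N) of the tridiagonal
    (-1, 2, -1) matrix, so no linear-algebra library is needed.
    """
    h = 1.0 / n_intervals
    return (1.0 - math.cos(math.pi * h)) / h ** 2


def richardson(coarse, fine):
    """One Richardson step for an O(h^2) method when the mesh is halved."""
    return (4.0 * fine - coarse) / 3.0


exact = math.pi ** 2 / 2           # analytic ground-state energy
coarse = fd_ground_energy(10)      # crude mesh, h = 0.1
fine = fd_ground_energy(20)        # halved mesh, h = 0.05
extrapolated = richardson(coarse, fine)

print(abs(coarse - exact), abs(fine - exact), abs(extrapolated - exact))
```

    The difference between the fine-mesh value and the extrapolated value also serves as a cheap error estimate for the fine-mesh value.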

  7. Phylogenetic Approaches to Natural Product Structure Prediction

    PubMed Central

    Ziemert, Nadine; Jensen, Paul R.

    2015-01-01

    Phylogenetics is the study of the evolutionary relatedness among groups of organisms. Molecular phylogenetics uses sequence data to infer these relationships for both organisms and the genes they maintain. With the large amount of publicly available sequence data, phylogenetic inference has become increasingly important in all fields of biology. In the case of natural product research, phylogenetic relationships are proving to be highly informative in terms of delineating the architecture and function of the genes involved in secondary metabolite biosynthesis. Polyketide synthases and nonribosomal peptide synthetases provide model examples in which individual domain phylogenies display different predictive capacities, resolving features ranging from substrate specificity to structural motifs associated with the final metabolic product. This chapter provides examples in which phylogeny has proven effective in terms of predicting functional or structural aspects of secondary metabolism. The basics of how to build a reliable phylogenetic tree are explained along with information about programs and tools that can be used for this purpose. Furthermore, it introduces the Natural Product Domain Seeker, a recently developed Web tool that employs phylogenetic logic to classify ketosynthase and condensation domains based on established enzyme architecture and biochemical function. PMID:23084938

  8. How does cognition evolve? Phylogenetic comparative psychology

    PubMed Central

    Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria

    2014-01-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850

  9. Maximizing the phylogenetic diversity of seed banks.

    PubMed

    Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard

    2015-04-01

    Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. PMID:25196170
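
    One common way to quantify the phylogenetic diversity of a collection is Faith's PD, the total branch length of the tree connecting the sampled taxa to the root. The sketch below assumes that measure and a made-up four-branch tree; it is not the authors' pipeline, and the names `TREE`, `phylogenetic_diversity` and `best_addition` are hypothetical. It shows why a few well-chosen additions can raise PD sharply: the taxon on the longest unsampled branch adds the most.

```python
# Hypothetical rooted tree: node -> (parent, branch length to parent).
# "R" is the root and has no entry of its own.
TREE = {
    "X": ("R", 1.0),
    "A": ("X", 2.0),
    "B": ("X", 3.0),
    "C": ("R", 5.0),
}


def phylogenetic_diversity(tree, taxa):
    """Faith's PD: sum of branch lengths on the paths from taxa to the root."""
    covered = set()  # nodes whose branch to the parent is already counted
    for taxon in taxa:
        node = taxon
        # Walk toward the root; stop at the root or at an already counted
        # branch, since everything above it is counted as well.
        while node in tree and node not in covered:
            covered.add(node)
            node = tree[node][0]
    return sum(tree[node][1] for node in covered)


def best_addition(tree, banked, candidates):
    """Candidate taxon that adds the most PD to an existing collection."""
    base = phylogenetic_diversity(tree, banked)
    return max(
        candidates,
        key=lambda t: phylogenetic_diversity(tree, set(banked) | {t}) - base,
    )
```

    With only taxon A banked, adding C gains 5.0 units of branch length while adding B gains 3.0, so a greedy planner would bank C first.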

  10. Phylogenetic niche conservatism in C4 grasses.

    PubMed

    Liu, Hui; Edwards, Erika J; Freckleton, Robert P; Osborne, Colin P

    2012-11-01

    Photosynthetic pathway is used widely to discriminate plant functional types in studies of global change. However, independent evolutionary lineages of C4 grasses with different variants of C4 photosynthesis show different biogeographical relationships with mean annual precipitation, suggesting phylogenetic niche conservatism (PNC). To investigate how phylogeny and photosynthetic type differentiate C4 grasses, we compiled a dataset of morphological and habitat information of 185 genera belonging to two monophyletic subfamilies, Chloridoideae and Panicoideae, which together account for 90% of the world's C4 grass species. We evaluated evolutionary variance and covariance of morphological and habitat traits. Strong phylogenetic signals were found in both morphological and habitat traits, arising mainly from the divergence of the two subfamilies. Genera in Chloridoideae had significantly smaller culm heights, leaf widths, 1,000-seed weights and stomata; they also appeared more in dry, open or saline habitats than those of Panicoideae. Controlling for phylogenetic structure showed significant covariation among morphological traits, supporting the hypothesis of phylogenetically independent scaling effects. However, associations between morphological and habitat traits showed limited phylogenetic covariance. Subfamily was a better explanation than photosynthetic type for the variance in most morphological traits. Morphology, habitat water availability, shading, and productivity are therefore all involved in the PNC of C4 grass lineages. This study emphasized the importance of phylogenetic history in the ecology and biogeography of C4 grasses, suggesting that divergent lineages need to be considered to fully understand the impacts of global change on plant distributions. PMID:22569558

  11. Phylogenetic structure in tropical hummingbird communities

    PubMed Central

    Graham, Catherine H.; Parra, Juan L.; Rahbek, Carsten; McGuire, Jimmy A.

    2009-01-01

    How biotic interactions, current and historical environment, and biogeographic barriers determine community structure is a fundamental question in ecology and evolution, especially in diverse tropical regions. To evaluate patterns of local and regional diversity, we quantified the phylogenetic composition of 189 hummingbird communities in Ecuador. We assessed how species and phylogenetic composition changed along environmental gradients and across biogeographic barriers. We show that humid, low-elevation communities are phylogenetically overdispersed (coexistence of distant relatives), a pattern that is consistent with the idea that competition influences the local composition of hummingbirds. At higher elevations communities are phylogenetically clustered (coexistence of close relatives), consistent with the expectation of environmental filtering, which may result from the challenge of sustaining an expensive means of locomotion at high elevations. We found that communities in the lowlands on opposite sides of the Andes tend to be phylogenetically similar despite their large differences in species composition, a pattern implicating the Andes as an important dispersal barrier. In contrast, along the steep environmental gradient between the lowlands and the Andes we found evidence that species turnover is comprised of relatively distantly related species. The integration of local and regional patterns of diversity across environmental gradients and biogeographic barriers provides insight into the potential underlying mechanisms that have shaped community composition and phylogenetic diversity in one of the most species-rich, complex regions of the world. PMID:19805042

  12. Phylogenetic structure in tropical hummingbird communities.

    PubMed

    Graham, Catherine H; Parra, Juan L; Rahbek, Carsten; McGuire, Jimmy A

    2009-11-17

    How biotic interactions, current and historical environment, and biogeographic barriers determine community structure is a fundamental question in ecology and evolution, especially in diverse tropical regions. To evaluate patterns of local and regional diversity, we quantified the phylogenetic composition of 189 hummingbird communities in Ecuador. We assessed how species and phylogenetic composition changed along environmental gradients and across biogeographic barriers. We show that humid, low-elevation communities are phylogenetically overdispersed (coexistence of distant relatives), a pattern that is consistent with the idea that competition influences the local composition of hummingbirds. At higher elevations communities are phylogenetically clustered (coexistence of close relatives), consistent with the expectation of environmental filtering, which may result from the challenge of sustaining an expensive means of locomotion at high elevations. We found that communities in the lowlands on opposite sides of the Andes tend to be phylogenetically similar despite their large differences in species composition, a pattern implicating the Andes as an important dispersal barrier. In contrast, along the steep environmental gradient between the lowlands and the Andes we found evidence that species turnover is comprised of relatively distantly related species. The integration of local and regional patterns of diversity across environmental gradients and biogeographic barriers provides insight into the potential underlying mechanisms that have shaped community composition and phylogenetic diversity in one of the most species-rich, complex regions of the world. PMID:19805042

  13. Classification of the acanthocephala.

    PubMed

    Amin, Omar M

    2013-09-01

    In 1985, Amin presented a new system for the classification of the Acanthocephala in Crompton and Nickol's (1985) book 'Biology of the Acanthocephala' and recognized the concepts of Meyer (1931, 1932, 1933) and Van Cleave (1936, 1941, 1947, 1948, 1949, 1951, 1952). This system became the standard for the taxonomy of this group and remains so to date. Many changes have taken place and many new genera and species, as well as higher taxa, have been described since. An updated version of the 1985 scheme incorporating new concepts in molecular taxonomy, gene sequencing and phylogenetic studies is presented. The hierarchy has undergone a total face lift with Amin's (1987) addition of a new class, Polyacanthocephala (and a new order and family) to remove inconsistencies in the class Palaeacanthocephala. Amin and Ha (2008) added a third order (and a new family) to the Palaeacanthocephala, Heteramorphida, which combines features from the palaeacanthocephalan families Polymorphidae and Heteracanthocephalidae. Other families and subfamilies have been added but some have been eliminated, e.g. the three subfamilies of Arhythmacanthidae: Arhythmacanthinae Yamaguti, 1935; Neoacanthocephaloidinae Golvan, 1960; and Paracanthocephaloidinae Golvan, 1969. Amin (1985) listed 22 families, 122 genera and 903 species (4, 4 and 14 families; 13, 28 and 81 genera; 167, 167 and 569 species in Archiacanthocephala, Eoacanthocephala and Palaeacanthocephala, respectively). The number of taxa listed in the present treatment is 26 families (18% increase), 157 genera (29%), and 1298 species (44%) (4, 4 and 16; 18, 29 and 106; 189, 255 and 845, in the same order), which also includes 1 family, 1 genus and 4 species in the class Polyacanthocephala Amin, 1987, and 3 genera and 5 species in the fossil family Zhijinitidae. PMID:24261131

  14. Learning accurate very fast decision trees from uncertain data streams

    NASA Astrophysics Data System (ADS)

    Liang, Chunquan; Zhang, Yang; Shi, Peng; Hu, Zhengguo

    2015-12-01

    Most existing works on data stream classification assume the streaming data is precise and definite. Such assumption, however, does not always hold in practice, since data uncertainty is ubiquitous in data stream applications due to imprecise measurement, missing values, privacy protection, etc. The goal of this paper is to learn accurate decision tree models from uncertain data streams for classification analysis. On the basis of very fast decision tree (VFDT) algorithms, we proposed an algorithm for constructing an uncertain VFDT tree with classifiers at tree leaves (uVFDTc). The uVFDTc algorithm can exploit uncertain information effectively and efficiently in both the learning and the classification phases. In the learning phase, it uses Hoeffding bound theory to learn from uncertain data streams and yield fast and reasonable decision trees. In the classification phase, at tree leaves it uses uncertain naive Bayes (UNB) classifiers to improve the classification performance. Experimental results on both synthetic and real-life datasets demonstrate the strong ability of uVFDTc to classify uncertain data streams. The use of UNB at tree leaves has improved the performance of uVFDTc, especially the any-time property, the benefit of exploiting uncertain information, and the robustness against uncertainty.
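
    The Hoeffding bound mentioned in this abstract is what lets a VFDT-style learner decide, after seeing only n examples, that the best split attribute is very likely the true best. A minimal sketch of the standard bound and the resulting split test follows; it shows the classical precise-data case only, not the uncertain-data extension of uVFDTc, and the function and parameter names are assumptions made for illustration.

```python
import math


def hoeffding_bound(value_range, delta, n):
    """Epsilon such that, with probability 1 - delta, the true mean of a
    variable with the given range lies within epsilon of the mean observed
    over n independent samples."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain, second_gain, value_range, delta, n):
    """Split a leaf once the observed gap between the two best attributes
    exceeds the Hoeffding bound for the examples seen so far."""
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)
```

    For example, with a gain range of 1.0, delta = 1e-7 and 1000 examples, the bound is about 0.09, so a gap of 0.15 between the two best attributes would trigger a split while a gap of 0.05 would not.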

  15. Hyperspectral image classification for mapping agricultural tillage practices

    Technology Transfer Automated Retrieval System (TEKTRAN)

    An efficient classification framework for mapping agricultural tillage practice using hyperspectral remote sensing imagery is proposed, which has the potential to be implemented practically to provide rapid, accurate, and objective surveying data for precision agricultural management and appraisal f...

  16. Measuring community similarity with phylogenetic networks.

    PubMed

    Parks, Donovan H; Beiko, Robert G

    2012-12-01

    Environmental drivers of biodiversity can be identified by relating patterns of community similarity to ecological factors. Community variation has traditionally been assessed by considering changes in species composition and more recently by incorporating phylogenetic information to account for the relative similarity of taxa. Here, we describe how an important class of measures including Bray-Curtis, Canberra, and UniFrac can be extended to allow community variation to be computed on a phylogenetic network. We focus on phylogenetic split systems, networks that are produced by the widely used median network and neighbor-net methods, which can represent incongruence in the evolutionary history of a set of taxa. Calculating β diversity over a split system provides a measure of community similarity averaged over uncertainty or conflict in the available phylogenetic signal. Our freely available software, Network Diversity, provides 11 qualitative (presence-absence, unweighted) and 14 quantitative (weighted) network-based measures of community similarity that model different aspects of community richness and evenness. We demonstrate the broad applicability of network-based diversity approaches by applying them to three distinct data sets: pneumococcal isolates from distinct geographic regions, human mitochondrial DNA data from the Indonesian island of Nias, and proteorhodopsin sequences from the Sargasso and Mediterranean Seas. Our results show that major expected patterns of variation for these data sets are recovered using network-based measures, which indicates that these patterns are robust to phylogenetic uncertainty and conflict. Nonetheless, network-based measures of community similarity can differ substantially from measures ignoring phylogenetic relationships or from tree-based measures when incongruent signals are present in the underlying data. Network-based measures provide a methodology for assessing the robustness of β-diversity results in light of
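
    For orientation, the classic taxon-based Bray-Curtis dissimilarity, one of the measures the abstract says is extended to networks, can be sketched as follows. This is the standard non-phylogenetic form, not the split-system extension implemented in the authors' software. The function name and abundance vectors are illustrative assumptions.

```python
def bray_curtis(x, y):
    """Classic Bray-Curtis dissimilarity between two abundance vectors:
    sum(|x_i - y_i|) / sum(x_i + y_i). Returns 0 for identical communities
    and 1 for communities that share no taxa."""
    if len(x) != len(y):
        raise ValueError("abundance vectors must have the same length")
    numerator = sum(abs(a - b) for a, b in zip(x, y))
    denominator = sum(a + b for a, b in zip(x, y))
    if denominator == 0:
        return 0.0  # two empty communities are treated as identical
    return numerator / denominator


if __name__ == "__main__":
    # Hypothetical counts of three taxa in two samples.
    print(bray_curtis([6, 7, 4], [10, 0, 6]))
```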

  17. Probabilistic Graphical Model Representation in Phylogenetics

    PubMed Central

    Höhna, Sebastian; Heath, Tracy A.; Boussau, Bastien; Landis, Michael J.; Ronquist, Fredrik; Huelsenbeck, John P.

    2014-01-01

    Recent years have seen a rapid expansion of the model space explored in statistical phylogenetics, emphasizing the need for new approaches to statistical model representation and software development. Clear communication and representation of the chosen model is crucial for: (i) reproducibility of an analysis, (ii) model development, and (iii) software design. Moreover, a unified, clear and understandable framework for model representation lowers the barrier for beginners and nonspecialists to grasp complex phylogenetic models, including their assumptions and parameter/variable dependencies. Graphical modeling is a unifying framework that has gained in popularity in the statistical literature in recent years. The core idea is to break complex models into conditionally independent distributions. The strength lies in the comprehensibility, flexibility, and adaptability of this formalism, and the large body of computational work based on it. Graphical models are well-suited to teach statistical models, to facilitate communication among phylogeneticists and in the development of generic software for simulation and statistical inference. Here, we provide an introduction to graphical models for phylogeneticists and extend the standard graphical model representation to the realm of phylogenetics. We introduce a new graphical model component, tree plates, to capture the changing structure of the subgraph corresponding to a phylogenetic tree. We describe a range of phylogenetic models using the graphical model framework and introduce modules to simplify the representation of standard components in large and complex models. Phylogenetic model graphs can be readily used in simulation, maximum likelihood inference, and Bayesian inference using, for example, Metropolis–Hastings or Gibbs sampling of the posterior distribution. [Computation; graphical models; inference; modularization; statistical phylogenetics; tree plate.] PMID:24951559
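
    The abstract notes that phylogenetic model graphs can be used for Bayesian inference with Metropolis-Hastings sampling. As a minimal illustration of the sampler itself (not of any phylogenetic model or of the authors' software), the sketch below draws from a one-dimensional target with a symmetric random-walk proposal. The function name, step size, and target density are assumptions chosen for the example.

```python
import math
import random


def metropolis_hastings(log_density, start, n_samples, step=1.0, seed=0):
    """Random-walk Metropolis-Hastings for a 1-D target given by its
    (unnormalised) log density. The Gaussian proposal is symmetric, so the
    acceptance probability reduces to min(1, p(new) / p(old))."""
    rng = random.Random(seed)
    x = start
    log_p = log_density(x)
    samples = []
    for _ in range(n_samples):
        proposal = x + rng.gauss(0.0, step)
        log_p_new = log_density(proposal)
        accept_prob = math.exp(min(0.0, log_p_new - log_p))
        if rng.random() < accept_prob:
            x, log_p = proposal, log_p_new
        samples.append(x)  # a rejected proposal repeats the current state
    return samples


if __name__ == "__main__":
    # Target: standard normal, log density up to an additive constant.
    draws = metropolis_hastings(lambda v: -0.5 * v * v, 0.0, 20000)
    print(sum(draws) / len(draws))
```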

  18. Hierarchical classification of glycoside hydrolases.

    PubMed

    Naumoff, D G

    2011-06-01

    This review deals with structural and functional features of glycoside hydrolases, a widespread group of enzymes present in almost all living organisms. Their catalytic domains are grouped into 120 amino acid sequence-based families in the international classification of the carbohydrate-active enzymes (CAZy database). At a higher hierarchical level some of these families are combined in 14 clans. Enzymes of the same clan have common evolutionary origin of their genes and share the most important functional characteristics such as composition of the active center, anomeric configuration of cleaved glycosidic bonds, and molecular mechanism of the catalyzed reaction (either inverting, or retaining). There are now extensive data in the literature concerning the relationship between glycoside hydrolase families belonging to different clans and/or included in none of them, as well as information on phylogenetic protein relationship within particular families. Summarizing these data allows us to propose a multilevel hierarchical classification of glycoside hydrolases and their homologs. It is shown that almost the whole variety of the enzyme catalytic domains can be brought into six main folds, large groups of proteins having the same three-dimensional structure and the supposed common evolutionary origin. PMID:21639842

  19. Terrain classification for a UGV

    NASA Astrophysics Data System (ADS)

    Sarwal, Alok; Baker, Chris; Rosenblum, Mark

    2005-05-01

    This work addresses the issue of Terrain Classification that can be applied for path planning for an Unmanned Ground Vehicle (UGV) platform. We are interested in classification of features such as rocks, bushes, trees and dirt roads. Currently, the data is acquired from a color camera mounted on the UGV; range data from a second sensor can be added in the future. The classification is accomplished by first coarsely segmenting a frame and then refining the initial segmentation through a convenient user interface. After the first frame, temporal information is exploited to improve the quality of the image segmentation and help classification adapt to changes due to ambient lighting, shadows, and scene changes as the platform moves. The Mean Shift Classifier algorithm provides segmentation of the current frame data. We have tested the above algorithms with four sequences of frames acquired in an environment with terrain representative of the type we expect to see in the field. The results from this algorithm were compared with accurate manually segmented (ground-truth) data for each frame in the sequence.
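
    The abstract does not give details of its Mean Shift Classifier, so the following is only a generic sketch of mean-shift mode seeking with a flat kernel, applied to scalar values such as pixel intensities. The function name, bandwidth, and sample values are assumptions. A real image segmenter would work on colour and spatial features together.

```python
def mean_shift_modes(points, bandwidth, max_iter=100, tol=1e-6):
    """Flat-kernel mean shift on 1-D data: each point is moved repeatedly to
    the mean of all data points within `bandwidth` of its current position
    until it stops moving. Points that end at the same mode form one segment."""
    modes = []
    for p in points:
        x = float(p)
        for _ in range(max_iter):
            neighbours = [q for q in points if abs(q - x) <= bandwidth]
            new_x = sum(neighbours) / len(neighbours)
            converged = abs(new_x - x) < tol
            x = new_x
            if converged:
                break
        modes.append(x)
    return modes


if __name__ == "__main__":
    # Hypothetical intensities from two visually distinct terrain patches.
    values = [1.0, 1.2, 0.8, 5.0, 5.1, 4.9]
    print(sorted(set(round(m, 3) for m in mean_shift_modes(values, 1.0))))
```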

  20. Trends and concepts in fern classification

    PubMed Central

    Christenhusz, Maarten J. M.; Chase, Mark W.

    2014-01-01

    Background and Aims Throughout the history of fern classification, familial and generic concepts have been highly labile. Many classifications and evolutionary schemes have been proposed during the last two centuries, reflecting different interpretations of the available evidence. Knowledge of fern structure and life histories has increased through time, providing more evidence on which to base ideas of possible relationships, and classification has changed accordingly. This paper reviews previous classifications of ferns and presents ideas on how to achieve a more stable consensus. Scope An historical overview is provided from the first to the most recent fern classifications, from which conclusions are drawn on past changes and future trends. The problematic concept of family in ferns is discussed, with a particular focus on how this has changed over time. The history of molecular studies and the most recent findings are also presented. Key Results Fern classification generally shows a trend from highly artificial, based on an interpretation of a few extrinsic characters, via natural classifications derived from a multitude of intrinsic characters, towards more evolutionary circumscriptions of groups that do not in general align well with the distribution of these previously used characters. It also shows a progression from a few broad family concepts to systems that recognized many more narrowly and highly controversially circumscribed families; currently, the number of families recognized is stabilizing somewhere between these extremes. Placement of many genera was uncertain until the arrival of molecular phylogenetics, which has rapidly been improving our understanding of fern relationships. As a collective category, the so-called ‘fern allies’ (e.g. Lycopodiales, Psilotaceae, Equisetaceae) were unsurprisingly found to be polyphyletic, and the term should be abandoned. Lycopodiaceae, Selaginellaceae and Isoëtaceae form a clade (the lycopods) that is

  1. The Evolutionary Ecology of Plant Disease: A Phylogenetic Perspective.

    PubMed

    Gilbert, Gregory S; Parker, Ingrid M

    2016-08-01

    An explicit phylogenetic perspective provides useful tools for phytopathology and plant disease ecology because the traits of both plants and microbes are shaped by their evolutionary histories. We present brief primers on phylogenetic signal and the analytical tools of phylogenetic ecology. We review the literature and find abundant evidence of phylogenetic signal in pathogens and plants for most traits involved in disease interactions. Plant nonhost resistance mechanisms and pathogen housekeeping functions are conserved at deeper phylogenetic levels, whereas molecular traits associated with rapid coevolutionary dynamics are more labile at branch tips. Horizontal gene transfer disrupts the phylogenetic signal for some microbial traits. Emergent traits, such as host range and disease severity, show clear phylogenetic signals. Therefore pathogen spread and disease impact are influenced by the phylogenetic structure of host assemblages. Phylogenetically rare species escape disease pressure. Phylogenetic tools could be used to develop predictive tools for phytosanitary risk analysis and reduce disease pressure in multispecies cropping systems. PMID:27359365

  2. Teaching Molecular Phylogenetics through Investigating a Real-World Phylogenetic Problem

    ERIC Educational Resources Information Center

    Zhang, Xiaorong

    2012-01-01

    A phylogenetics exercise is incorporated into the "Introduction to biocomputing" course, a junior-level course at Savannah State University. This exercise is designed to help students learn important concepts and practical skills in molecular phylogenetics through solving a real-world problem. In this application, students are required to identify…

  3. A statistical approach to root system classification

    PubMed Central

    Bodner, Gernot; Leitner, Daniel; Nakhforoosh, Alireza; Sobotik, Monika; Moder, Karl; Kaul, Hans-Peter

    2013-01-01

    Plant root systems have a key role in ecology and agronomy. Despite the rapid increase in root studies, there is still no classification that distinguishes among distinctive characteristics within the diversity of rooting strategies. Our hypothesis is that a multivariate approach for “plant functional type” identification in ecology can be applied to the classification of root systems. The classification method presented is based on a data-defined statistical procedure without a priori decision on the classifiers. The study demonstrates that principal component based rooting types provide efficient and meaningful multi-trait classifiers. The classification method is exemplified with simulated root architectures and morphological field data. Simulated root architectures showed that morphological attributes with spatial distribution parameters capture most distinctive features within root system diversity. While developmental type (tap vs. shoot-borne systems) is a strong, but coarse classifier, topological traits provide the most detailed differentiation among distinctive groups. Adequacy of commonly available morphologic traits for classification is supported by field data. Rooting types emerging from measured data were mainly distinguished as diameter/weight-dominated and density-dominated types. Similarity of root systems within distinctive groups was the joint result of phylogenetic relation and environmental as well as human selection pressure. We concluded that the data-defined classification is appropriate for integration of knowledge obtained with different root measurement methods and at various scales. Currently root morphology is the most promising basis for classification due to widely used common measurement protocols. To capture details of root diversity, efforts in architectural measurement techniques are essential. PMID:23914200
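
    To make the idea of principal component based types concrete, the sketch below extracts the first principal component of a small trait table by power iteration on the covariance matrix. It is not the authors' procedure. The function name and trait values are hypothetical, and the starting vector of ones is assumed not to be orthogonal to the leading component.

```python
import math


def first_principal_component(rows, iterations=200):
    """Return (direction, scores) of the first principal component of a data
    table (one row per sample, one column per trait), using power iteration
    on the sample covariance matrix."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    v = [1.0] * d  # assumed starting vector
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # no variance along the current direction
        v = [x / norm for x in w]
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores


if __name__ == "__main__":
    # Hypothetical trait table: two perfectly correlated root traits.
    direction, scores = first_principal_component(
        [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]])
    print(direction, scores)
```

    Samples with similar scores on the leading components could then be grouped into types. How many components to keep and how to group them is left open here.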

  4. A statistical approach to root system classification.

    PubMed

    Bodner, Gernot; Leitner, Daniel; Nakhforoosh, Alireza; Sobotik, Monika; Moder, Karl; Kaul, Hans-Peter

    2013-01-01

    Plant root systems have a key role in ecology and agronomy. Despite the rapid increase in root studies, there is still no classification that distinguishes among distinctive characteristics within the diversity of rooting strategies. Our hypothesis is that a multivariate approach for "plant functional type" identification in ecology can be applied to the classification of root systems. The classification method presented is based on a data-defined statistical procedure without a priori decision on the classifiers. The study demonstrates that principal component based rooting types provide efficient and meaningful multi-trait classifiers. The classification method is exemplified with simulated root architectures and morphological field data. Simulated root architectures showed that morphological attributes with spatial distribution parameters capture most distinctive features within root system diversity. While developmental type (tap vs. shoot-borne systems) is a strong, but coarse classifier, topological traits provide the most detailed differentiation among distinctive groups. Adequacy of commonly available morphologic traits for classification is supported by field data. Rooting types emerging from measured data were mainly distinguished as diameter/weight-dominated and density-dominated types. Similarity of root systems within distinctive groups was the joint result of phylogenetic relation and environmental as well as human selection pressure. We concluded that the data-defined classification is appropriate for integration of knowledge obtained with different root measurement methods and at various scales. Currently root morphology is the most promising basis for classification due to widely used common measurement protocols. To capture details of root diversity, efforts in architectural measurement techniques are essential. PMID:23914200

  5. Worldwide phylogenetic relationship of avian poxviruses

    USGS Publications Warehouse

    Gyuranecz, Miklós; Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of the Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we have expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g. starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy.

  6. Prioritizing Populations for Conservation Using Phylogenetic Networks

    PubMed Central

    Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.

    2014-01-01

    In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451

  7. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  8. Worldwide Phylogenetic Relationship of Avian Poxviruses

    PubMed Central

    Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of the Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups, and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g., starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy. PMID:23408635

  9. Fungal phylogenetic diversity drives plant facilitation.

    PubMed

    Montesinos-Navarro, Alicia; Segarra-Moragues, J G; Valiente-Banuet, A; Verdú, M

    2016-06-01

    Plant-plant facilitation is a crucial ecological process, as many plant species (facilitated) require the presence of an established individual (nurse) to recruit. Some plant facilitative interactions disappear during the ontogenetic development of the facilitated plant but others persist, even when the two plants are adults. We test whether the persistence of plant facilitative interactions is explained by the phylogenetic diversity of mutualistic and non-mutualistic fungi that the nurse and the facilitated species add to the shared rhizosphere. We classify plant facilitative interactions as persistent and non-persistent interactions and quantify the phylogenetic diversity of mutualistic and non-mutualistic fungi added by the plant species to the shared rhizosphere. Our results show that the facilitated species add less phylogenetic diversity of non-mutualistic fungi when plant facilitative interactions persist than when they do not persist. However, persistent and non-persistent facilitative interactions did not differ in the phylogenetic diversity of mutualistic fungi added by the facilitated species to the shared rhizosphere. Finally, the fungal phylogenetic diversity added by the nurse to the shared rhizosphere did not differ between persistent and non-persistent interactions. This study suggests that considering the fungal associates of the plant species involved in facilitative interactions can shed light on the mechanisms of persistence for plant-plant interactions. PMID:26915080

  10. Phylogenetic analysis of the Trypanosoma genus based on the heat-shock protein 70 gene.

    PubMed

    Fraga, Jorge; Fernández-Calienes, Aymé; Montalvo, Ana Margarita; Maes, Ilse; Deborggraeve, Stijn; Büscher, Philippe; Dujardin, Jean-Claude; Van der Auwera, Gert

    2016-09-01

    Trypanosome evolution has so far been studied essentially on the basis of phylogenetic analyses of small subunit ribosomal RNA (SSU-rRNA) and glycosomal glyceraldehyde-3-phosphate dehydrogenase (gGAPDH) genes. We used the 70 kDa heat-shock protein gene (hsp70) for the first time to investigate the phylogenetic relationships among 11 Trypanosoma species on the basis of 1380 nucleotides from 76 sequences corresponding to 65 strains. We also constructed a phylogeny based on combined datasets of SSU-rDNA, gGAPDH and hsp70 sequences. The obtained clusters can be correlated with the sections and subgenus classifications of mammal-infecting trypanosomes except for Trypanosoma theileri and Trypanosoma rangeli. Our analysis supports the classification of Trypanosoma species into clades rather than into sections and subgenera, some of which are polyphyletic. Nine clades were recognized: Trypanosoma carassi, Trypanosoma congolense, Trypanosoma cruzi, Trypanosoma grayi, Trypanosoma lewisi, T. rangeli, T. theileri, Trypanosoma vivax and Trypanozoon. These results are consistent with existing knowledge of the genus' phylogeny. Within the T. cruzi clade, three groups of T. cruzi discrete typing units could be clearly distinguished, corresponding to TcI, TcIII, and TcII+V+VI, while support for TcIV was lacking. Phylogenetic analyses based on hsp70 demonstrated that this molecular marker can be used to discriminate most of the Trypanosoma species and clades. PMID:27180897